Encyclopedia of Communication Disorders

The MIT Encyclopedia of
Communication Disorders
The MIT Encyclopedia of
Communication Disorders
Edited by Raymond D. Kent
A Bradford Book
The MIT Press
Cambridge, Massachusetts
London, England
( 2004 Massachusetts Institute of Technology
All rights reserved. No part of this book may be reproduced in any form by any electronic or mechanical
means (including photocopying, recording, or information storage and retrieval) without permission in
writing from the publisher.
This book was set in Times New Roman on 3B2 by Asco Typesetters, Hong Kong, and was printed and
bound in the United States of America.
Library of Congress Cataloging-in-Publication Data
The MIT encyclopedia of communication disorders / edited by Raymond D. Kent.

p. cm.
Includes bibliographical references and index.
ISBN 0-262-11278-7 (cloth)
1. Communicative disorders—Encyclopedias. I. Kent, Raymond D. II. Massachusetts Institute of
Technology.
RC423.M56 2004
616.850 50 003—dc21
2003059941
Contents
Introduction ix Dysarthrias: Characteristics and Classification 126
Acknowledgments xi Dysarthrias: Management 129
Dysphagia, Oral and Pharyngeal 132
Early Recurrent Otitis Media and Speech
Part I: Voice 1 Development 135
Acoustic Assessment of Voice 3 Laryngectomy 137
Aerodynamic Assessment of Vocal Function 7 Mental Retardation and Speech in Children 140
Alaryngeal Voice and Speech Rehabilitation 10 Motor Speech Involvement in Children 142
Anatomy of the Human Larynx 13 Mutism, Neurogenic 145
Assessment of Functional Impact of Voice Orofacial Myofunctional Disorders in Children 147
Disorders 20 Phonetic Transcription of Children’s Speech 150
Electroglottographic Assessment of Voice 23 Phonological Awareness Intervention for Children with
Functional Voice Disorders 27 Expressive Phonological Impairments 153
Hypokinetic Laryngeal Movement Disorders 30 Phonological Errors, Residual 156
Infectious Diseases and Inflammatory Conditions of Phonology: Clinical Issues in Serving Speakers of
the Larynx 32 African-American Vernacular English 158
Instrumental Assessment of Children’s Voice 35 Psychosocial Problems Associated with Communicative
Laryngeal Movement Disorders: Treatment with Disorders 161
Botulinum Toxin 38 Speech and Language Disorders in Children: Computer-
Laryngeal Reinnervation Procedures 41 Based Approaches 164
Laryngeal Trauma and Peripheral Structural Speech and Language Issues in Children from Asian-
Ablations 45 Pacific Backgrounds 167
Psychogenic Voice Disorders: Direct Therapy 49 Speech Assessment, Instrumental 169
The Singing Voice 51 Speech Assessment in Children: Descriptive Linguistic
Vocal Hygiene 54 Methods 174
Vocal Production System: Evolution 56 Speech Development in Infants and Young Children
Vocalization, Neural Mechanisms of 59 with a Tracheostomy 176
Voice Acoustics 63 Speech Disfluency and Stuttering in Children 180
Voice Disorders in Children 67 Speech Disorders: Genetic Transmission 183
Voice Disorders of Aging 72 Speech Disorders in Adults, Psychogenic 186
Voice Production: Physics and Physiology 75 Speech Disorders in Children: A Psycholinguistic
Voice Quality, Perceptual Evaluation of 78 Perspective 189
Voice Rehabilitation After Conservation Speech Disorders in Children: Behavioral Approaches to
Laryngectomy 80 Remediation 192
Voice Therapy: Breathing Exercises 82 Speech Disorders in Children: Birth-Related Risk
Voice Therapy: Holistic Techniques 85 Factors 194
Voice Therapy for Adults 88 Speech Disorders in Children: Cross-Linguistic
Voice Therapy for Neurological Aging-Related Voice Data 196
Disorders 91 Speech Disorders in Children: Descriptive Linguistic
Voice Therapy for Professional Voice Users 95 Approaches 198
Speech Disorders in Children: Motor Speech Disorders
of Known Origin 200
Part II: Speech 99 Speech Disorders in Children: Speech-Language
Apraxia of Speech: Nature and Phenomenology 101 Approaches 204
Apraxia of Speech: Treatment 104 Speech Disorders Secondary to Hearing Impairment
Aprosodia 107 Acquired in Adulthood 207
Augmentative and Alternative Communication Speech Issues in Children from Latino
Approaches in Adults 110 Backgrounds 210
Augmentative and Alternative Communication Speech Sampling, Articulation Tests, and Intelligibility
Approaches in Children 112 in Children with Phonological Errors 213
Autism 115 Speech Sampling, Articulation Tests, and Intelligibility
Bilingualism, Speech Issues in 119 in Children with Residual Errors 215
Developmental Apraxia of Speech 121 Speech Sound Disorders in Children: Description and
Dialect, Regional 124 Classification 218
vi Contents
Stuttering 220 Lingustic Aspects of Child Language Impairment—

Transsexualism and Sex Reassignment: Speech Prosody 344
Di¤erences 223 Melodic Intonation Therapy 347
Ventilator-Supported Speech Production 226 Memory and Processing Capacity 349
Mental Retardation 352
Morphosyntax and Syntax 354
Part III: Language 229 Otitis Media: E¤ects on Children’s Language 358
Agrammatism 231 Perseveration 361
Agraphia 233 Phonological Analysis of Language Disorders in
Alexia 236 Aphasia 363
Alzheimer’s Disease 240 Phonology and Adult Aphasia 366
Aphasia, Global 243 Poverty: E¤ects on Language 369
Aphasia, Primary Progressive 245 Pragmatics 372
Aphasia: The Classical Syndromes 249 Prelinguistic Communication Intervention for Children
Aphasia, Wernicke’s 252 with Developmental Disabilities 375
Aphasia Treatment: Computer-Aided Preschool Language Intervention 378
Rehabilitation 254 Prosodic Deficits 381
Aphasia Treatment: Pharmacological Approaches 257 Reversibility/Mapping Disorders 383
Aphasia Treatment: Psychosocial Issues 260 Right Hemisphere Language and Communication
Aphasic Syndromes: Connectionist Models 262 Functions in Adults 386
Aphasiology, Comparative 265 Right Hemisphere Language Disorders 388
Argument Structure: Representation and Segmentation of Spoken Language by Normal Adult
Processing 269 Listeners 392
Attention and Language 272 Semantics 395
Auditory-Motor Interaction in Speech and Social Development and Language Impairment 398
Language 275 Specific Language Impairment in Children 402
Augmentative and Alternative Communication: General Syntactic Tree Pruning 405
Issues 277 Trace Deletion Hypothesis 407
Bilingualism and Language Impairment 279
Communication Disorders in Adults: Functional
Approaches to Aphasia 283 Part IV: Hearing 411
Communication Disorders in Infants and Toddlers Amplitude Compression in Hearing Aids 413
285 Assessment of and Intervention with Children Who Are
Communication Skills of People with Down Deaf or Hard of Hearing 421
Syndrome 288 Audition in Children, Development of 424
Dementia 291 Auditory Brainstem Implant 427
Dialect Speakers 294 Auditory Brainstem Response in Adults 429
Dialect Versus Disorder 297 Auditory Neuropathy in Children 433
Discourse 300 Auditory Scene Analysis 437
Discourse Impairments 302 Auditory Training 439
Functional Brain Imaging 305 Classroom Acoustics 442
Inclusion Models for Children with Developmental Clinical Decision Analysis 444
Disabilities 307 Cochlear Implants 447
Language Development in Children with Focal Cochlear Implants in Adults: Candidacy 450
Lesions 311 Cochlear Implants in Children 454
Language Disorders in Adults: Subcortical Dichotic Listening 458
Involvement 314 Electrocochleography 461
Language Disorders in African-American Electronystagmography 467
Children 318 Frequency Compression 471
Language Disorders in Latino Children 321 Functional Hearing Loss in Children 475
Language Disorders in School-Age Children: Aspects of Genetics and Craniofacial Anomalies 477
Assessment 324 Hearing Aid Fitting: Evaluation of Outcomes 480
Language Disorders in School-Age Children: Hearing Aids: Prescriptive Fitting 482
Overview 326 Hearing Aids: Sound Quality 487
Language Impairment and Reading Disability 329 Hearing Loss and the Masking-Level Di¤erence 489
Language Impairment in Children: Cross-Linguistic Hearing Loss and Teratogenic Drugs or Chemicals 493
Studies 331 Hearing Loss Screening: The School-Age Child 495
Language in Children Who Stutter 333 Hearing Protection Devices 497
Language of the Deaf: Acquisition of English 336 Masking 500
Language of the Deaf: Sign Language 339 Middle Ear Assessment in the Child 504
Contents vii
Noise-Induced Hearing Loss 508 Speechreading Training and Visual Tracking

Otoacoustic Emissions 511 543
Otoacoustic Emissions in Children 515 Suprathreshold Speech Recognition 548
Ototoxic Medications 518 Temporal Integration 550
Pediatric Audiology: The Test Battery Approach 520 Temporal Resolution 553
Physiological Bases of Hearing 522 Tinnitus 556
Pitch Perception 525 Tympanometry 558
Presbyacusis 527 Vestibular Rehabilitation 563
Pseudohypacusis 531
Pure-Tone Threshold Assessment 534 Contributors 569
Speech Perception Indices 538 Name Index 577
Speech Tracking 541 Subject Index 603
Introduction
The MIT Encyclopedia of Communication Disorders (MITECD) is a comprehensive
volume that presents essential information on communication sciences and disorders.
The pertinent disorders are those that a¤ect the production and comprehension of
spoken language and include especially disorders of speech production and percep-
tion, language expression, language comprehension, voice, and hearing. Potential
readers include clinical practitioners, students, and research specialists. Relatively
few comprehensive books of similar design and purpose exist, so MITECD stands
nearly alone as a resource for anyone interested in the broad field of communication
disorders.
MITECD is organized into the four broad categories of Voice, Speech, Language,
and Hearing. These categories represent the spectrum of topics that usually fall under
the rubric of communication disorders (also known as speech-language pathology
and audiology, among other names). For example, roughly these same categories
were used by the National Institute on Deafness and Other Communication Dis-
orders (NIDCD) in preparing its national strategic research plans over the past de-
cade. The Journal of Speech, Language, and Hearing Research, one of the most
comprehensive and influential periodicals in the field, uses the editorial categories of
speech, language, and hearing. Although voice could be subsumed under speech, the
two fields are large enough individually and su‰ciently distinct that a separation is
warranted. Voice is internationally recognized as a clinical and research specialty,
and it is represented by journals dedicated to its domain (e.g., the Journal of Voice).
The use of these four categories achieves a major categorization of knowledge but
avoids a narrow fragmentation of the field at large. It is to be expected that the
Encyclopedia would include cross-referencing within and across these four major
categories. After all, they are integrated in the definitively human behavior of lan-
guage, and disorders of communication frequently have wide-ranging e¤ects on
communication in its essential social, educational, and vocational roles.
In designing the content and structure of MITECD, it was decided that each of
these major categories should be further subdivided into Basic Science, Disorders
(nature and assessment), and Clinical Management (intervention issues). Although
these categories are not always transparent in the entire collection of entries, they
guided the delineation of chapters and the selection of contributors. These categories
are defined as follows:
Basic Science entries pertain to matters such as normal anatomy and physiology,
physics, psychology and psychophysics, and linguistics. These topics are the
foundation for clinical description and interpretation, covering basic principles
and terminology pertaining to the communication sciences. Care was taken to
avoid substantive overlap with previous MIT publications, especially the MIT
Encyclopedia of the Cognitive Sciences (MITECS).
The Disorders entries o¤er information on issues such as syndrome delineation,
definition and characterization of specific disorders, and methods for the iden-
tification and assessment of disorders. As such, these chapters reflect contempo-
rary nosology and nomenclature, as well as guidelines for clinical assessment and
diagnosis.
The Clinical Management entries discuss various interventions including behavioral,
pharmacological, surgical, and prosthetic (mechanical and electronic). There is a
general, but not necessarily one-to-one, correspondence between chapters in the
Disorders and Clinical Management categories. For example, it is possible that
several types of disorder are related to one general chapter on clinical manage-
ment. It is certainly the case that di¤erent management strategies are preferred by
di¤erent clinicians. The chapters avoid dogmatic statements regarding interven-
tions of choice.
Because the approach to communicative disorders can be quite di¤erent for chil-
dren and adults, a further cross-cutting division was made such that for many topics
x Introduction
separate chapters for children and adults are included. Although some disorders that
are first diagnosed in childhood may persist in some form throughout adulthood (e.g,
stuttering, specific language impairment, and hearing loss may be lifelong conditions
for some individuals), many disorders can have an onset either in childhood or in
adulthood and the timing of onset can have implications for both assessment and
intervention. For instance, when a child experiences a significant loss of hearing, the
sensory deficit may greatly impair the learning of speech and language. But when a
loss of the same degree has an onset in adulthood, the problem is not in acquiring
speech and language, but rather in maintaining communication skills. Certainly, it is
often true that an understanding of a given disorder has common features in both the
developmental and acquired forms, but commonality cannot be assumed as a general
condition.
Many decisions were made during the preparation of this volume. Some were
easy, but others were not. In the main, entries are uniform in length and number of
references. However, in a few instances, two or more entries were combined into a
single longer entry. Perhaps inevitably in a project with so many contributors, a small
number of entries were dropped because of personal issues, such as illness, that
interfered with timely preparation of an entry. Happily, contributors showed great
enthusiasm for this project, and their entries reflect an assembled expertise that is
high tribute to the science and clinical practice in communication disorders.
Raymond D. Kent
Acknowledgments
MITECD began as a promising idea in a conversation with Amy Brand, a previous
editor with MIT Press. The idea was further developed, refined, elaborated, and re-
fined again in many ensuing e-mail communications, and I thank Amy for her con-
stant support and assistance through the early phases of the project. When she left
MIT Press, Tom Stone, Senior Editor of Cognitive Sciences, Linguistics, and Brad-
ford Books, stepped in to provide timely advice and attention. I also thank Mary
Avery, Acquisitions Assistant, for her help in keeping this project on track. I am
indebted to all of them.
Speech, voice, language, and hearing are vast domains individually, and several
associated editors helped to select topics for inclusion in MITECD and to identify
contributors with the necessary expertise. The associate editors and their fields of re-
sponsibility are as follows:
Fred H. Bess, Ph.D., Hearing Disorders in Children
Joseph R. Du¤y, Ph.D., Speech Disorders in Adults
Steven D. Gray, M.D. (deceased), Voice Disorders in Children
Robert E. Hillman, Ph.D., Voice Disorders in Adults
Sandra Gordon-Salant, Ph.D., Hearing Disorders in Adults
Mabel L. Rice, Ph.D., Language Disorders in Children
Lawrence D. Shriberg, Ph.D., Speech Disorders in Children
David A. Swinney, Ph.D., and Lewis P. Shapiro, Ph.D., Language Disorders in
Adults
The advice and cooperation of these individuals is gratefully acknowledged. Sadly,
Dr. Steven D. Gray died within the past year. He was an extraordinary man, and
although I knew him only briefly, I was deeply impressed by his passion for knowl-
edge and life. He will be remembered as an excellent physician, creative scientist, and
valued friend and colleague to many.
Dr. Houri Vorperian greatly facilitated this project through her inspired planning
of a computer-based system for contributor communications and record manage-
ment. Sara Stuntebeck and Sara Brost worked skillfully and accurately on a variety
of tasks that went into di¤erent phases of MITECD. They o¤ered vital help with
communications, file management, proofreading, and the various and sundry tasks
that stood between the initial conception of MITECD and the submission of a full
manuscript.
P. M. Gordon and Associates took on the formidable task of assembling 200
entries into a volume that looks and reads like an encyclopedia. I thank Denise
Bracken for exacting attention to the editing craft, creative solutions to unexpected
problems, and forbearance through it all.
MITECD came to reality through the e¤orts of a large number of contributors—
too many for me to acknowledge personally here. However, I draw the reader’s at-
tention to the list of contributors included in this volume. I feel a sense of community
with all of them, because they believed in the project and worked toward its com-
pletion by preparing entries of high quality. I salute them not only for their con-
tributions to MITECD but also for their many career contributions that define them
as experts in the field. I am honored by their participation and their patient cooper-
ation with the editorial process.
Raymond D. Kent
Part I: Voice
Acoustic Assessment of Voice Table 1. Outline of Traditional Acoustic Algorithm Types
f0 statistics
Short-term perturbations
Acoustic assessment of voice in clinical applications is Long-term perturbations
dominated by measures of fundamental frequency ( f0 ), Amplitude statistics
cycle-to-cycle perturbations of period ( jitter) and inten- Short-term perturbations
sity (shimmer), and other measures of irregularity, such Long-term perturbations
as noise-to-harmonics ratio (NHR). These measures are f0 /amplitude covariations
widely used, in part because of the availability of elec- Waveform perturbations
Spectral measures
tronic and microcomputer-based instruments (e.g., Kay
Spectrographic measures
Elemetrics Computerized Speech Laboratory [CSL] or Fourier and LPC spectra
Multispeech, Real-Time Pitch, Multi-Dimensional Voice Long-term average spectra
Program [MDVP], and other software/hardware sys- Cepstra
tems), and in part because of long-term precedent for Inverse filter measures
perturbation (Lieberman, 1961) and spectral noise Radiated signal
measurements (Yanagihara, 1967). Absolute measures Flow-mask signals
of vocal intensity are equally basic but require calibra- Dynamic measures
tions and associated instrumentation (Winholtz and
Titze, 1997).
Independently, these basic acoustic descriptors—f0 , This dependence is not often assessed rigorously, per-
intensity, jitter, shimmer, and NHR—can provide haps because of the time-consuming and strenuous na-
some very basic characterizations of vocal health. ture of a full voice profile. However, an abbreviated or
The first two, f0 and intensity, have very clear percep- focused profiling in which samples related to habitual f0
tual correlates—pitch and loudness, respectively—and by a set number of semitones, or related to habitual
should be assessed for both stability and variability and intensity by a set number of decibels, could be stan-
compared to age and sex norms (Kent, 1994; Baken dardized to control for this dependence e‰ciently. Fi-
and Orliko¤, 2000). Ideally, these tasks are recorded nally, it should be understood that perturbations and
over headset microphones with direct digital acquisition NHR-type measures will usually covary for many rea-
at very high sampling rates (at least 48 kHz). The mate- sons, the simplest ones being methodological (Hillen-
rials to be assessed should be obtained following stan- brand, 1987): an increase in any one of the underlying
dardized elicitation protocols that include sustained phenomena detected by a single measure will also a¤ect
vowel phonations at habitual levels, levels spanning a the other measures.
client’s vocal range in both f0 and intensity, running
speech, and speech tasks designed to elicit variation Periodicity as a Reference. The chief problem with
(Titze, 1995; Awan, 2001). Note, however, that not all nearly all acoustic assessments of voice is the determi-
measures will be appropriate for all tasks; perturbation nation of f0 . Most voice quality algorithms are based on
statistics, for example, are usually valid only when the prior identification of the periodic component in the
extracted from sustained vowel phonations. signal (based on glottal pulses in the time domain or
These basic descriptors are not in any way com- harmonic structure in the frequency domain). Because
prehensive of the range of available measures or the phonation is ideally a nearly periodic process, it is
available signal properties and dimensions. Table 1 cate- logical to conceive of voice measures in terms of the de-
gorizes measures (Buder, 2000) based on primary basic gree to which a given sample deviates from pure period-
signal representations from which measures are derived. icity. There are many conceptual problems with this
Although these categories are intended to be exhaustive simplification, however. At the physiological level, glot-
and mutually exclusive, some more modern algorithms tal morphology is multidimensional—superior-inferior
process components through several types. (For more asymmetry is a basic feature of the two-mass model
detail on the measurement types, see Buder, 2000, and (Ishizaka and Flanagan, 1972), and some anterior-
Baken and Orliko¤, 2000.) Modern algorithmic ap- posterior asymmetry is also inevitable—rendering it un-
proaches should be selected for (1) interpretability with likely that a glottal pulse will be marked by a discrete or
respect to aerodynamic and physiological models of even a single instant of glottal closure. At the level of the
phonation and (2) the incorporation of multivariate signal, the deviations from periodicity may be either
measures to characterize vocal function. random or correlated, and in many cases they are so ex-
treme as to preclude identification of a regular period.
Interdependence of Basic Measures. The interdepen- Finally, at the perceptual level, many factors related to
dence between f0 and intensity is mapped in a voice deviations from a pure f0 can contribute to pitch per-
range profile, or phonetogram, which is an especially ception (Zwicker and Fastl, 1990).
valuable assessment for the professional voice user At any or all of these levels, it becomes questionable
(Coleman, 1993). Furthermore, the dependence of per- to characterize deviations with pure periodicity as a ref-
turbations and signal-to-noise ratios on both f0 and in- erence. In acoustic assessment, the primary level of con-
tensity is well known (Klingholz, 1990; Pabon, 1991). cern is the signal. The National Center for Voice and
4 Part I: Voice
Figure 1. Approximately 900 ms of a sustained vowel phona- SNR results for selected segments were from the ‘‘newjit’’ rou-
tion waveform (top panel) with two fundamental frequency tine of TF32 program (Milenkovic, 2001).
analyses (bottom panel). Average f0 , %jitter, %shimmer, and
Speech issued a summary statement (Titze, 1995) rec- extractions are presented for this segment, one at the
ommending a typology for categorizing deviations from targeted level of approximately 250 Hz and another
periodicity in voices (see also Baken and Orliko¤, 2000, which the tracker finds one octave below this; inspec-
for further subtypes). This typology capitalizes on the tion of the waveform and a perceived biphonia both
categorical nature of dynamic states in nonlinear sys- justify this 125-Hz analysis as a new fundamental fre-
tems; all the major categories, including stable points, quency, although it can also be understood in this
limit cycles, period-doubling/tripling/. . . , and chaos can context as a subharmonic to the original fundamen-
be observed in voice signals (Herzel et al., 1994; Satalo¤ tal. There is therefore some ambiguity as to which
and Hawkshaw, 2001). As in most highly nonlinear fundamental is valid during this episode, and an au-
dynamic systems, deviations from periodicity can be tomatic analysis could plausibly identify either frequency.
categorized on the basis of bifurcations, or sudden qual- (Here the waveform-matching algorithm implemented in
itative changes in vibratory pattern from one of these CSpeechSP [Milenkovic, 1997] does identify either fre-
states to another. quency, depending on where in the waveform the algo-
Figure 1 displays a common form for one such bifur- rithm is applied; initiating the algorithm within the
cation and illustrates the importance of accounting for subharmonic segment predisposes it to identify the lower
its presence in the application of perturbation measures. fundamental.)
In this sustained vowel phonation by a middle-aged The acoustic measures of the segments displayed in
woman with spasmodic dysphonia, a transition to sub- Figure 1 reveal the nontrivial di¤erences that result,
harmonics is clearly visible in segment b (similar pat- depending on the basic glottal pulse form under consid-
terns occur in individuals without dysphonias). Two f0 eration. When the pulses of segment a are considered,
Acoustic Assessment of Voice 5
the perturbations around the base period associated with ings, Gerratt and Kreiman (2000) have critiqued tradi-
the high f0 are low and normative; in segment b, per- tional assessments on several important methodological
turbations around the longer periods of the lower f0 are and theoretical points. However, these points may not
still low ( jitter is improved, while shimmer and the apply to acoustic analysis if (1) acoustic analysis is vali-
signal-to-noise ratio show some degradation). However, dated on its own success and not exclusively in relation
when all segments are considered together to include the to the problematic perceptual classifications, and (2)
perturbations around the high f0 tracked through seg- acoustic analysis is thoroughly grounded for interpreta-
ment b and into c, the perturbation statistics are all tion in some clear aerodynamic or physiological model
increased by an order of magnitude. Many important of phonation. Gerratt and Kreiman also argue that
methodological and theoretical questions should be clinical classification may not be derived along a contin-
raised by such common scenarios in which we must uum that is defined with reference to normal qualities,
consider not just voice typing, but the segment-by- but again, this argument may need to be reversed for the
segment validity of applying perturbation measures with acoustic domain. It is only by reference to a specific
a particular f0 as reference. If, as is often assumed, jitter model that any assessment on acoustic grounds can be
and shimmer are ascribed to ‘‘random’’ variations, then interpreted (though this does not preclude development
the correlated modulations of a strong subharmonic ep- of an independent model for a pathological phonatory
isode should be excluded. Alternatively, the perturba- mechanism). In clinical settings, acoustic voice assess-
tions might be analyzed with respect to the subharmonic ment often serves to corroborate perceptual assessment.
f0 . In any case, assessment by means of perturbation However, as guided by auditory experience and in con-
statistics with no consideration of their underlying junction with the ear and other instrumental assess-
sources is unwise. ments, careful acoustic analysis can be oriented to the
identification of physiological status.
Perceptual, Aerodynamic, and Physiological Correlates In attempting to draw safe and reasonably direct
of Acoustic Measures. Regarding perceptual voice rat- inferences from acoustic signal, aerodynamic models
Figure 2. Spectral features associated with models of phonation, correlated with high-frequency harmonic energy), and (c) pulse
including the Liljencrants-Fant (LF) model of glottal flow and skewing (which is negatively correlated with low-frequency
aperiodicity source models developed by Stevens. The LF harmonic energy; this low-frequency region is also positively
model of glottal flow is shown at top left. At bottom left is the correlated with open quotient and peak volume velocity mea-
LF model of glottal flow derivative, showing the rate of change sures of the glottal flow waveform). The e¤ect of turbulence
in flow. At right is a spectrum schematic showing four e¤ects. due to high airflow through the glottis is schematized by (d),
These e¤ects include three derived parameters of the LF model: indicating the associated appearance of high-frequency aperi-
(a) excitation strength (the maximum negative amplitude of the odic energy in the spectrum. See voice acoustics for other
flow derivative, which is positively correlated with overall har- graphical and quantitative associations between glottal status
monic energy), (b) dynamic leakage or non-zero return phase and spectral characteristics.
following the point of maximum excitation (which is negatively
6 Part I: Voice
of glottal behavior present important links to the quality measurement (pp. 119–244). San Diego, CA: Singu-
physiological domain. Attempts to recover the glottal lar Publishing Group.
flow waveform, either from a face mask-transduced Callen, D. E., Kent, R. D., Roy, N., and Tasko, S. M. (2000).
flow recording (Rothenberg, 1973) or a microphone- The use of self-organizing maps for the classification of voice
disorders. In M. J. Ball (Ed.), Voice quality measurement
transduced acoustic recording (Davis, 1975), have
(pp. 103–116). San Diego, CA: Singular Publishing Group.
proved to be labor-intensive and prone to error (Nı́ Coleman, R. F. (1993). Sources of variation in phonetograms.
Chasaide and Gobl, 1997). Rather than attempting to Journal of Voice, 7, 1–14.
eliminate the e¤ects of the vocal tract, it may be more Davis, S. B. (1975). Preliminary results using inverse filtering of
fruitful to understand its in situ relationship with pho- speech for automatic evaluation of laryngeal pathology.
nation, and infer, via the types of features displayed in Journal of the Acoustical Society of America, 58, SIII.
Figure 2, the status of the glottis as a sound source. In- Fant, G., Liljencrants, J., and Lin, Q. (1985). A four-parameter
terpretation of spectral features, such as the amplitudes model of glottal flow. Speech Transmission Laboratory
of the first harmonics and at the formant frequencies, Quarterly Progress and Status Report, 4, 1–13.
may be an e¤ective alternative when guided by knowl- Fröhlich, M., Michaelis, D., and Strube, H. (2001). SIM-
simultaneous inverse filtering and matching of a glottal flow
edge of glottal aerodynamics and acoustics (Hanson,
model for acoustic speech signals. Journal of the Acoustical
1997; Nı́ Chasaide and Gobl, 1997; Hanson and Society of America, 110, 479–488.
Chuang, 1999). Deep familiarity with acoustic mecha- Fröhlich, M., Michaelis, D., Strube, H., and Kruse, E. (2000).
nisms is essential for such interpretations (Titze, 1994; Acoustic voice analysis by means of the hoarseness dia-
Stevens, 1998), as is a model with clear and meaningful gram. Journal of Speech, Language, and Hearing Research,
parameters, such as the Liljencrants-Fant (LF) model 43, 706–720.
(Fant, Liljencrants, and Lin, 1985). The parameters of Gau‰n, J., and Sundberg, J. (1989). Spectral correlates of
the LF model have proved to be meaningful in acoustic glottal voice source waveform characteristics. Journal of
studies (Gau‰n & Sundberg, 1989) and useful in refined Speech and Hearing Research, 32, 556–565.
e¤orts at inverse filtering (Fröhlich, Michaelis, and Gerratt, B., and Kreiman, J. (2000). Theoretical and method-
ological development in the study of pathological voice
Strube, 2001). Figure 2 summarizes selected parameters
quality. Journal of Phonetics, 28, 335–342.
of the LF source model following Nı́ Chasaide and Gobl Hanson, H. M. (1997). Glottal characteristics of female
(1997) and the glottal turbulence source following speakers: Acoustic correlates. Journal of the Acoustical So-
Stevens (1998); see also voice acoustics for other ap- ciety of America, 101, 466–481.
proaches relating glottal status to spectral measures. Hanson, H. M., and Chuang, E. S. (1999). Glottal character-
Other spectral-based measures implement similar istics of male speakers: Acoustic correlates and comparison
model-based strategies by selecting spectral component with female data. Journal of the Acoustical Society of
ratios (e.g., the VTI and SPI parameters of MDVP). America, 106, 1064–1077.
Sophisticated spectral noise characterizations control for Herzel, H., Berry, D., Titze, I. R., and Saleh, M. (1994).
perturbations and modulations (Murphy, 1999; Qi, Analysis of vocal disorders with methods from nonlinear
dynamics. Journal of Speech and Hearing Research, 37,
Hillman, and Milstein, 1999), or employ curve-fitting
1008–1019.
and statistical models to produce more robust measures Hillenbrand, J. (1987). A methodological study of perturbation
(Alku, Strik, and Vilkman, 1997; Michaelis, Fröhlich, and additive noise in synthetically generated voice signals.
and Strube, 1998; Schoentgen, Bensaid, and Bucella, Journal of Speech and Hearing Research, 30, 448–461.
2000). A particularly valuable modern technique for Ishizaka, K., and Flanagan, J. L. (1972). Synthesis of voiced
detecting turbulence at the glottis, the glottal-to-noise- sounds from a two-mass model of the vocal cords. Bell
excitation ratio (Michaelis, Gramss, and Strube, 1997), System Technical Journal, 51, 1233–1268.
has been especially successful in combination with other Kent, R. D. (1994). Reference manual for communicative
measures (Fröhlich et al., 2000). The use of acoustic sciences and disorders: Speech and language. Austin, TX:
techniques for voice will only improve with the inclusion Pro-Ed.
Klingholz, F. (1990). Acoustic representation of speaking-voice
of more knowledge-based measures in multivariate rep-
quality. Journal of Voice, 4, 213–219.
resentations (Wolfe, Cornell, and Palmer, 1991; Callen Lieberman, P. (1961). Perturbations in vocal pitch. Journal of
et al., 2000; Wuyts et al., 2000). the Acoustical Society of America, 33, 597–603.
—Eugene H. Buder Michaelis, D., Fröhlich, M., and Strube, H. W. (1998). Selec-
tion and combination of acoustic features for the descrip-
References tion of pathologic voices. Journal of the Acoustical Society
of America, 103, 1628–1638.
Alku, P., Strik, H., and Vilkman, E. (1997). Parabolic spectral Michaelis, D., Gramss, T., and Strube, H. W. (1997). Glottal
parameter: A new method for quantification of the glottal to noise excitation ratio: A new measure for describing
flow. Speech Communication, 22, 67–79. patholocial voices. Acustica, 83, 700–706.
Awan, S. N. (2001). The voice diagnostic profile: A practical Milenkovic, P. (1997). CSpeechSP [Computer software]. Mad-
guide to the diagnosis of voice disorders. Gaithersburg, MD: ison, WI: University of Wisconsin–Madison.
Aspen. Milenkovic, P. (2001). TF32 [Computer software]. Madison,
Baken, R. J., and Orliko¤, R. F. (2000). Clinical measurement WI: University of Wisconsin–Madison.
of speech and voice. San Diego, CA: Singular Publishing Murphy, P. J. (1999). Perturbation-free measurement of the
Group. harmonics-to-noise ratio in voice signals using pitch syn-
Buder, E. H. (2000). Acoustic analysis of voice quality: A tab- chronous harmonic analysis. Journal of the Acoustical Soci-
ulation of algorithms 1902–1990. In M. J. Ball (Ed.), Voice ety of America, 105, 2866–2881.
Aerodynamic Assessment of Vocal Function 7
Nı́ Chasaide, A., and Gobl, C. (1997). Voice source variation. Goldman, and Mead, 1973; Watson and Hixon, 1985;
In J. Laver (Ed.), The handbook of phonetic sciences (pp. Hoit and Hixon, 1987; Hoit et al., 1990). Air volumes
427–461). Oxford, UK: Blackwell. are measured in standard metric units (liters, cubic cen-
Pabon, J. P. H. (1991). Objective acoustic voice-quality timeters, milliliters) and lung inflation levels are usually
parameters in the computer phonetogram. Journal of Voice,
specified in terms of a percentage of the vital capacity or
5, 203–216.
Qi, Y., Hillman, R., and Milstein, C. (1999). The estimation of total lung volume.
signal-to-noise ratio in continuous speech for disordered Both direct and indirect methods have been used to
voices. Journal of the Acoustical Society of America, 105, measure air volumes expended during phonation. Direct
2532–2535. measurement of orally displaced air volumes during
Rothenberg, M. (1973). A new inverse-filtering technique for phonatory tasks can be accomplished, to a limited ex-
deriving the glottal air flow waveform during voicing. Jour- tent, by means of a mouthpiece or face mask connected
nal of the Acoustical Society of America, 53, 1632–1645. to a measurement device such as a spirometer (Beckett,
Satalo¤, R. T., and Hawkshaw, M. (Eds.). (2001). Chaos in 1971) or pneumotachograph (Isshiki, 1964). The use of a
medicine: Source readings. San Diego, CA: Singular Pub- mouthpiece essentially limits speech production to sus-
lishing Group.
tained vowels, which are su‰cient for assessing selected
Schoentgen, J., Bensaid, M., and Bucella, F. (2000). Multi-
variate statistical analysis of flat vowel spectra with a view volumetric-based phonatory parameters. There are also
to characterizing dysphonic voices. Journal of Speech, Lan- concerns that face masks interfere with normal jaw
guage, and Hearing Research, 43, 1493–1508. movements and that the oral acoustic signal is degraded,
Stevens, K. N. (1998). Acoustic phonetics. Cambridge, MA: so that auditory feedback is reduced or distorted and
MIT Press. simultaneous acoustic analysis is limited. These limi-
Titze, I. R. (1994). Principles of voice production. Englewood tations, which are inherent to the use of devices placed
Cli¤s, NJ: Prentice Hall. in or around the mouth to directly collect oral airflow,
Titze, I. R. (1995). Workshop on acoustic voice analysis: Sum- plus additional measurement-related restrictions (Hill-
mary statement. Iowa City, IA: National Center for Voice man and Kobler, 2000) have helped motivate the de-
and Speech.
velopment and application of indirect measurement
Winholtz, W. S., and Titze, I. R. (1997). Conversion of a head-
mounted microphone signal into calibrated SPL units. approaches.
Journal of Voice, 11, 417–421. Most speech breathing research has been carried out
Wolfe, V., Cornell, R., and Palmer, C. (1991). Acoustic corre- using indirect approaches for estimating lung volumes
lates of pathologic voice types. Journal of Speech and by means of monitoring changes in body dimensions.
Hearing Research, 34, 509–516. The basic assumption underlying the indirect approaches
Wuyts, F. L., De Bodt, M. S., Molenberghs, G., Remacle, M., is that changes in lung volume are reflected in propor-
Heylen, L., Millet, B., et al. (2000). The Dysphonia Severity tional changes in body torso size. One relatively cum-
Index: An objective measure of vocal quality based on a bersome but time-honored approach has been to place
multiparameter approach. Journal of Speech, Language, and subjects in a sealed chamber called a body plethysmo-
Hearing Research, 43, 796–809.
graph to allow estimation of the air volume displaced by
Yanagihara, N. (1967). Significance of harmonic changes and
noise components in hoarseness. Journal of Speech and the body during respiration (Draper, Ladefoged, and
Hearing Research, 10, 531–541. Whitteridge, 1959). More often used for speech breath-
Zwicker, E., and Fastl, H. (1990). Psychoacoustics: Facts and ing research are transducers (magnetometers: Hixon,
models. Heidelberg, Germany: Springer-Verlag. Goldman, and Mead, 1973; inductance plethysmo-
graphs: Sperry, Hillman, and Perkell, 1994) that unob-
trusively monitor changes in the dimensions of the rib
Aerodynamic Assessment of Vocal cage and abdomen (referred to collectively as the chest
Function wall) that account for the majority of respiratory-related
changes in torso dimension (Mead et al., 1967). These
approaches have been primarily employed to study re-
A number of methods have been used to quantitatively spiratory function during continuous speech and singing
assess the air volumes, airflows, and air pressures in- tasks that include both voiced and voiceless sound pro-
volved in voice production. The methods have been duction, as opposed to assessing air volume usage during
mostly used in research to investigate mechanisms that phonatory tasks that involve only laryngeal production
underlie normal and disordered voice and speech proof voice (e.g., sustained vowels). There are also ongoing
duction. The clinical use of aerodynamic measures to e¤orts to develop more accurate methods for non-
assess patients with voice disorders has been increasing invasively monitoring chest wall activity to capture finer
(Colton and Casper, 1996; Hillman, Montgomery, and details of how the three-dimensional geometry of the
Zeitels, 1997; Hillman and Kobler, 2000). body is altered during respiration (see Cala et al., 1996).
Measurement of Air Volumes. Respiratory research in Measurement of Airflow. Airflow associated with pho-
human communication has focused primarily on the nation is usually specified in terms of volume velocity
measurement of the air volumes that are typically (i.e., volume of air displaced per unit of time). Volume
expended during selected speech and singing tasks, and velocity airflow rates for voice production are typically
on specifying the ranges of lung inflation levels across reported in metric units of volume displaced (liters or
which such tasks are normally performed (cf. Hixon, cubic centimeters) per second.
8 Part I: Voice
Estimates of average airflow rates can be obtained by rapidly opens and closes during flow-induced vibration
simply dividing air volume estimates by the duration of of the vocal folds (the glottal volume velocity wave-
the phonatory task. Average glottal airflow rates have form). The glottal volume velocity waveform cannot be
usually been estimated during vowel phonation by using directly observed by measuring the oral airflow signal
a mouthpiece or face mask to channel the oral air stream because the waveform is highly convoluted by the reso-
through a pneumotachograph (Isshiki, 1964). There has nance activity (formants) of the vocal tract. Thus, re-
also been somewhat limited use of hot wire anemometer covery of the glottal volume velocity waveform requires
devices (mounted in a mouthpiece) to estimate average methods that eliminate or correct for the influences of
glottal airflow during sustained vowel phonation (Woo, the vocal tract. This has typically been accomplished
Colton, and Shangold, 1987). Estimates of average glot- aerodynamically by processing the output of a fast-
tal airflow rates can be obtained from the oral airflow responding pneumotachograph (high-frequency re-
during vowel production because the vocal tract is rela- sponse) using a technique called inverse filtering, in
tively nonconstricted, with no major sources of turbulent which the major resonances of the vocal tract are esti-
airflow between the glottis and the lips. mated and the oral airflow signal is processed (inverse
There have also been e¤orts to obtain estimates of the filtered) to eliminate them (Rothenberg, 1977; Holm-
actual airflow waveform that is generated as the glottis berg, Hillman, and Perkell, 1988).
Figure 1. Instrumentation and resulting signals for

simultaneous collection of oral airflow, intraoral air
pressure, the acoustic signal, and chest wall (rib
cage and abdomen) dimensions during production of
the syllable string /pi-pi-pi/. Signals shown in the
bottom panel are processed and measured to provide
estimates of average glottal airflow rate, average
subglottal air pressure, lung volume, and glottal
waveform parameters.
Aerodynamic Assessment of Vocal Function 9
Measurement of Air Pressure. Measurements of air usually take the form of ratios that relate aerodynamic
pressures below (subglottal) and above (supraglottal) the parameters to each other, or that relate aerodynamic
vocal folds are of primary interest for characterizing the parameters to simultaneously obtained acoustic mea-
pressure di¤erential that must be achieved to initiate and sures. Common examples include (1) airway (glottal)
maintain vocal fold vibration during normal exhala- resistance (see Smitheran and Hixon, 1981), (2) vocal
tory phonation. In practice, air pressure measurements e‰ciency (Schutte, 1980; Holmberg, Hillman, and Per-
related specifically to voice production are typically kell, 1988), and (3) measures that interrelate glottal
acquired during vowel phonation when there are no volume velocity waveform parameters (Holmberg, Hill-
vocal tract constrictions of su‰cient magnitude to build man, and Perkell, 1988).
up positive supraglottal pressures. Under these condi-
tions, it is usually assumed that supraglottal pressure is Normative Data. As is the case for most measures of
essentially equal to atmospheric pressure and only sub- vocal function, there is not currently a set of normative
glottal pressure measurements are obtained. Air pres- data for aerodynamic measures that is universally
sures associated with voice and speech production are accepted and applied in research and clinical work.
usually specified in centimeters of water (cm H2 O). Methods for collecting such data have not been stan-
Both direct and indirect methods have been used to dardized, and study samples have generally not been of
measure subglottal air pressures during phonation. Di- su‰cient size or appropriately stratified in terms of age
rect measures of subglottal air pressure can be obtained and sex to ensure unbiased estimates of underlying aero-
by inserting a hypodermic needle into the subglottal air- dynamic phonatory parameters in the normal popula-
way through a puncture in the anterior neck at the cri- tion. However, there are several sources in the literature
cothyroid space (Isshiki, 1964). The needle is connected that provide estimates of normative values for selected
to a pressure transducer by tubing. This method is very aerodynamic measures (Kent, 1994; Baken, 1996; Col-
accurate but also very invasive. It is also possible to in- ton and Casper, 1996).
sert a very thin catheter through the posterior cartilagi- See also voice production: physics and physiology.
nous glottis (between the arytenoids) to sense subglottal
air pressure during phonation, or to use an array of —Robert E. Hillman
miniature transducers positioned directly above and be-
low the glottis (Cranen and Boves, 1985). These methods
References
cannot be tolerated by all subjects, and the heavy topical Baken, R. J. (1996). Clinical measurement of voice and speech.
anesthetization of the larynx that is required can a¤ect San Diego, CA: Singular Publishing Group.
normal function. Beckett, R. L. (1971). The respirometer as a diagnostic and
Indirect estimates of tracheal (subglottal) air pressure clinical tool in the speech clinic. Journal of Speech and
can be obtained via the placement of an elongated Hearing Disorders, 36, 235–241.
balloon-like device into the esophagus (Liberman, 1968). Cala, S. J., Kenyon, C. M., Ferrigno, G., Carnevali, P.,
Aliverti, A., Pedotti, A., et al. (1996). Chest wall and lung
The deflated esophageal balloon is attached to a catheter
volume estimation by optical reflectance motion analysis.
that is typically inserted transnasally and then swallowed Journal of Applied Physiology, 81, 2680–2689.
into the esophagus to be positioned at the midthoracic Colton, R. H., and Casper, J. K. (1996). Understanding voice
level. The catheter is connected to a pressure transducer problems: A physiological perspective for diagnosis and
and the balloon is slightly inflated. Accurate use of this treatment. Baltimore: Williams and Wilkins.
invasive method also requires simultaneous monitoring Cranen, B., and Boves, L. (1985). Pressure measurements dur-
of lung volume. ing speech production using semiconductor miniature pres-
Noninvasive, indirect estimates of subglottal air pressure transducers: Impact on models for speech production.
sure can be obtained by measuring intraoral air pres- Journal of the Acoustical Society of America, 77, 1543–
sure during specially constrained utterances (Smitheran 1551.
Draper, M., Ladefoged, P., and Whitteridge, P. (1959). Respi-
and Hixon, 1981). This is usually done by sensing air ratory muscles in speech. Journal of Speech and Hearing
pressure just behind the lips with a translabially placed Research, 2, 16–27.
catheter connected to a pressure transducer. These Hillman, R. E., and Kobler, J. B. (2000). Aerodynamic mea-
intraoral pressure measures are obtained as subjects sures of voice production. In R. Kent and M. Ball (Eds.),
produce strings of bilabial /p/ þ vowel syllables (e.g., The handbook of voice quality measurement, San Diego, CA:
/pi-pi-pi-pi-pi/) at constant pitch and loudness. This Singular Publishing Group.
method works because the vocal folds are abducted Hillman, R. E., Montgomery, W. M., and Zeitels, S. M.
during /p/ production, thus allowing pressure to equili- (1997). Current diagnostics and o‰ce practice: Use of ob-
brate throughout the airway, making intraoral pressure jective measures of vocal function in the multidisciplin-
equal to subglottal pressure (Fig. 1). ary management of voice disorders. Current Opinion in
Otolaryngology–Head and Neck Surgery, 5, 172–175.
Hixon, T. J., Goldman, M. D., and Mead, J. (1973). Kine-
Additional Derived Measures. There have been numer- matics of the chest wall during speech production: Volume
ous attempts to extend the utility of aerodynamic mea- displacements of the rib cage, abdomen, and lung. Journal
sures by using them in the derivation of additional of Speech and Hearing Research, 16, 78–115.
parameters aimed at better elucidating underlying Hoit, J. D., and Hixon, T. J. (1987). Age and speech breathing.
mechanisms of vocal function. Such derived measures Journal of Speech and Hearing Research, 30, 351–366.
10 Part I: Voice
Hoit, J. D., Hixon, T. J., Watson, P. J., and Morgan, W. J. the larynx, total laryngectomy is often indicated for rea-
(1990). Speech breathing in children and adolescents. Jour- sons of oncological safety (Doyle, 1994).
nal of Speech and Hearing Research, 33, 51–69.
Holmberg, E. B., Hillman, R. E., and Perkell, J. S. (1988).
Glottal airflow and transglottal air pressure measurements E¤ects of Total Laryngectomy
for male and female speakers in soft, normal, and loud The two most prominent e¤ects of total laryngectomy as
voice [published erratum appears in Journal of the Acousti- a surgical procedure are change of the normal airway
cal Society of America, 1989, 85(4), 1787]. Journal of the and loss of the normal voicing mechanism for verbal
Acoustical Society of America, 84, 511–529.
Isshiki, N. (1964). Regulatory mechanisms of vocal intensity communication. Once the larynx is surgically removed
variation. Journal of Speech and Hearing Research, 7, from the top of the trachea, the trachea is brought for-
17–29. ward to the anterior midline neck and sutured into place
Kent, R. D. (1994). Reference manual for communicative near the sternal notch. Thus, total laryngectomy neces-
sciences and disorders. San Diego, CA: Singular Publishing sitates that the airway be permanently separated from
Group. the upper aerodynamic (oral and pharyngeal) pathway.
Lieberman, P. (1968). Direct comparison of subglottal and When the laryngectomy is completed, the tracheal air-
esophageal pressure during speech. Journal of the Acoustical way will remain separate from the oral cavity, pharynx,
Society of America, 43, 1157–1164. and esophagus. Under these circumstances, not only is
Mead, J., Peterson, N., Grimgy, N., and Mead, J. (1967). Pul-
the primary structure for voice generation lost, but the
monary ventilation measured from body surface move-
ments. Science, 156, 1383–1384. intimate relationship between the pulmonary system and
Rothenberg, M. (1977). Measurement of airflow in speech. that of the structures of the upper airway, and con-
Journal of Speech and Hearing Research, 20, 155–176. sequently the vocal tract, is disrupted. Therefore, if
Schutte, H. (1980). The e‰ciency of voice production. Gronin- verbal communication is to be acquired and used post-
gen, The Netherlands: Kemper. laryngectomy, an alternative method of creating an
Smitheran, J. R., and Hixon, T. J. (1981). A clinical method alaryngeal voice source must be achieved.
for estimating laryngeal airway resistance during vowel
production. Journal of Speech and Hearing Disorders, 46,
138–146.
Methods of Postlaryngectomy Communication
Sperry, E., Hillman, R. E., and Perkell, J. S. (1994). The use of Following laryngectomy, the most significant communi-
an inductance plethysmograph to assess respiratory func- cative component to be addressed via voice and speech
tion in a patient with nodules. Journal of Medical Speech- rehabilitation is the lost voice source. Once the larynx is
Language Pathology, 2, 137–145.
removed, some alternative method of providing a new,
Watson, P. J., and Hixon, T. J. (1985). Respiratory kinematics
in classical (opera) singers. Journal of Speech and Hearing ‘‘alaryngeal’’ sound source is required. There are two
Research, 28, 104–122. general categories in which an alternative, alaryngeal
Woo, P., Colton, R. H., and Shangold, L. (1987). Phonatory voice source may be achieved. These categories are best
airflow analysis in patients with laryngeal disease. Annals of described as intrinsic and extrinsic methods. The dis-
Otology, Rhinology, and Laryngology, 96, 549–555. tinction between these two methods is contingent on the
manner in which the alaryngeal voice source is achieved.
Intrinsic alaryngeal methods imply that the alaryngeal
Alaryngeal Voice and Speech voice source is found within the system; that is, alterna-
Rehabilitation tive physical-anatomical structures are used to generate
sound. In contrast, extrinsic methods of alaryngeal
speech rely on the use of an external sound source, typi-
Loss of the larynx due to disease or injury will result in
cally an electronic source, or what is termed the artificial
numerous and significant changes that cross anatomical,
larynx, or the electrolarynx. The fundamental di¤erences
physiological, psychological, social, psychosocial, and
between intrinsic and extrinsic methods of alaryngeal
communication domains. Surgical removal of the lar-
speech are discussed below.
ynx, or total laryngectomy, involves resectioning the
entire framework of the larynx. Although total laryn-
Intrinsic Methods of Alaryngeal Speech
gectomy may occur in some instances due to traumatic
injury, the majority of cases worldwide are the result of The two most prominent methods of intrinsic alaryngeal
cancer. Approximately 75% of all laryngeal tumors arise speech are esophageal speech (Diedrich, 1966; Doyle,
from squamous epithelial tissue of the true vocal fold 1994) and tracheoesophageal (TE) speech (Singer and
(Bailey, 1985). In some instances, and because of the Blom, 1980). While these two intrinsic methods of
location of many of these lesions, less aggressive ap- alaryngeal speech are dissimilar in some respects, both
proaches to medical intervention may be pursued. This rely on generation of an alaryngeal voice source by cre-
may include radiation therapy or partial surgical resec- ating oscillation of tissues in the area of the lower phar-
tion, which seeks to conserve portions of the larynx, or ynx and upper esophagus. This vibratory structure is
the use of combined chemoradiation protocols (Hillman somewhat variable in regard to width, height, and loca-
et al., 1998; Orliko¤ et al., 1999). However, when mation (Diedrich and Youngstrom, 1966; Damste, 1986);
lignant lesions are su‰ciently large or when the location hence, the preferred term for this alaryngeal voicing
of the tumor threatens the lymphatic compartment of source is the pharyngoesophageal (PE) segment. One
Alaryngeal Voice and Speech Rehabilitation 11
muscle that comprises the PE segment is the cricophar- Regardless of which method of insu¿ation is used,
yngeal muscle. Beyond the commonality in the use of the esophageal speakers will exhibit limitations in the phy-
PE segment as a vicarious voicing source for both sical dimensions of speech. Specifically, fundamental
esophageal and TE methods of alaryngeal speech, the frequency is reduced by about one octave (Curry and
manner in which these methods are achieved does di¤er. Snidecor, 1961), intensity is reduced by about 10 dB SPL
from that of the normal speaker (Weinberg, Horii, and
Esophageal Speech. For esophageal speech, the Smith, 1980), and the durational characteristics of
speaker must move air from the oral cavity across the speech are also reduced. Speech intelligibility is also
tonically closed PE segment in order to insu¿ate decreased due to limits in the aerodynamic and voicing
the esophageal reservoir (located inferior to the PE seg- characteristics of esophageal speech. As it is not an
ment). Two methods of insu¿ation may be utilized. abductory-adductory system, voiced-for-voiceless per-
These methods might be best described as being either ceptual errors (e.g., perceptual identification of b for p)
direct or indirect approaches to insu¿ation. Direct are common. This is a direct consequence of the esoph-
methods require the individual speaker to actively ma- ageal speaker’s inability to insu¿ate large or continuous
nipulate air in the oral cavity to e¤ect a change in pres- volumes of air into the reservoir. Esophageal speakers
sure. When pressure build-up is achieved in the oral must frequently reinsu¿ate the esophageal reservoir to
cavity via compression maneuvers, and when the pres- maintain voicing. Because of this, it is not uncommon to
sure becomes of su‰cient magnitude to overcome the see esophageal speakers exhibit pauses at unusual points
muscular resistance of the PE segment, air will move in an utterance, which ultimately alters the normal
across the segment (inferiorly) into the esophagus. This rhythm of speech. Similarly, the prosodic contour of
may be accomplished with nonspeech tasks (tongue esophageal speech and associated features is often per-
maneuvers) or as a result of producing specific sounds ceived to be abnormal. In contrast to esophageal speech,
(e.g., stop consonants). the TE method capitalizes on the individual’s access to
In contrast, for the indirect (inhalation) method of air pulmonary air for esophageal insu¿ation, which o¤ers
insu¿ation, the speaker indirectly creates a negative several distinct advantages relative to esophageal speech.
pressure in the esophageal reservoir via rapid inhalation
through the tracheostoma. This results in a negative Tracheoesophageal Speech. TE speech uses the same
pressure in the esophagus relative to the normal atmo- voicing source as traditional esophageal speech, the PE
spheric pressure within the oral cavity/vocal tract (Die- segment. However, in TE speech the speaker is able to
drich and Youngstrom, 1966; Diedrich, 1968; Doyle, access and use pulmonary air as a driving source. This is
1994). Air then moves passively across the PE segment achieved by the surgical creation of a controlled midline
in order to equalize pressures between the pharynx and puncture in the trachea, followed by insertion of a one-
esophagus. Once insu¿ation occurs, this air can be used way TE puncture voice prosthesis (Singer and Blom,
to generate PE segment vibration in the same manner 1980), either at the time of laryngectomy or as a second
following other methods of air insu¿ation. While a dis- procedure at some point following laryngectomy. Thus,
tinction between direct and indirect methods permits TE speech is best described as a surgical-prosthetic
increased understanding of the physical requirements method of voice restoration. Though widely used, TE
for esophageal voice production, many esophageal voice restoration is not problem-free. Limitations in
speakers who exhibit high levels of proficiency will often application must be considered, and complications may
utilize both methods for insu¿ation. Regardless of occur.
which method of air insu¿ation is used, this air can then The design of the TE puncture voice prosthesis is such
be forced back up across the PE segment, and as a result, that when the tracheostoma is occluded, either by hand
the tissue of this sphincter will oscillate. This esophageal or via use of a complementary tracheostoma breathing
sound source can then be manipulated in the upper valve, air is directed from the trachea through the pros-
regions of the vocal tract into the sounds of speech. thesis and into the esophageal reservoir. This access
The acquisition of esophageal speech is a complex permits a variety of frequency, intensity, and durational
process of skill building that must be achieved under the variables to be altered in a fashion di¤erent from that of
direction of an experienced instructor. Clinical emphasis the traditional esophageal speaker (Robbins et al., 1984;
typically involves tasks that address four skills believed Pauloski, 1998). Because the TE speaker has direct ac-
to be fundamental to functional esophageal speech cess to a pulmonary air source, his or her ability to
(Berlin, 1963): (1) the ability to phonate reliably on de- modify the physical (frequency, intensity, and dura-
mand, (2) the ability to maintain a short latency between tional) characteristics of the signal in response to
air insu¿ation and esophageal phonation, (3) the ability changes in the aerodynamic driving source, along with
to maintain adequate duration of voicing, and (4) the associated changes in prosodic elements of the speech
ability to sustain voicing while articulating. These foun- signal (i.e., stress, intonation, juncture), is enhanced
dation skills have been shown to reflect those progressive considerably. Such changes have a positive impact on
abilities that have historically defined speech skills of auditory-perceptual judgments of this method of alaryn-
‘‘superior’’ esophageal speakers (Wepman et al., 1953; geal speech.
Snidecor, 1968). However, the successful acquisition of While the frequency of TE speech is still reduced from
esophageal speech may be limited, for many reasons. that of normal speech, the intensity is greater, and the
12 Part I: Voice
durational capabilities meet or exceed those of normal one method can be used with a functional communica-
speakers (Robbins et al., 1984). Finally, research into tive outcome in most instances. Professionals who work
the influence of increased aerodynamic support in TE with individuals who have undergone total laryngectomy
speakers relative to traditional esophageal speech on must focus on identifying a method that meets each
speech intelligibility has suggested that positive e¤ects speaker’s particular needs. Although clinical interven-
may be observed (Doyle, Danhauer, and Reed, 1988) tion must focus on making any given alaryngeal method
despite continued voiced-for-voiceless perceptual errors. as proficient as possible, the individual speaker’s needs,
Clearly, the rapidity of speech reacquisition in addition as well as the relative strengths and weaknesses of each
to the relative increases in speech intelligibility and the method, must be considered. In this way, use of a given
changes in the overall physical character of TE speech method may be enhanced so that the individual may
o¤ers considerable advantages from the perspective of achieve the best level of social reentry following lar-
communication rehabilitation. yngectomy. Further, nothing prevents an individual
from using multiple methods of alaryngeal speech, al-
Artificial Laryngeal Speech. Extrinsic methods of though one or another may be preferred in a given
alaryngeal voice production are common. Although communication context or environment. But an im-
some pneumatic devices have been introduced, they are portant caveat is necessary: Just because a method of
not widely used today. The most frequently used extrin- alaryngeal speech has been acquired and it has been
sic method of producing alaryngeal speech uses an elec- deemed ‘‘proficient’’ at the clinical level (e.g., results in
tronic artificial larynx, or electrolarynx. These devices good speech intelligibility) and is ‘‘functional’’ for basic
provide an external energy (voice) source that is intro- communication purposes, this does not imply that ‘‘re-
duced either directly into the oral cavity (intraoral) or by habilitation’’ has been successfully achieved.
placing a device directly on the tissues of the neck The reacquisition of verbal communication is without
(transcervical). Whether the electrolaryngeal tone is question a critical component of recovery and rehabili-
introduced into the oral cavity directly or through tation postlaryngectomy; however, it is only one dimen-
transmission via tissues of the neck, the speaker is able to sion of the complex picture of a successful return to as
modulate the electrolaryngeal source into speech. normal a life as possible. All individuals who have un-
The electrolayrnx is generally easy to use. Speech dergone a laryngectomy will confront myriad restrictions
can be acquired relatively quickly, and the device o¤ers in multiple domains, including anatomical, physio-
a reasonable method of functional communication to logical, psychological, communicative, and social. As a
those who have undergone total laryngectomy (Doyle, result, postlaryngectomy rehabilitation e¤orts that ad-
1994). Its major limitations have traditionally related to dress these areas may increase the likelihood of a suc-
negative judgments of electrolaryngeal speech relative to cessful postlaryngectomy outcome.
the mechanical nature of many devices. Current research See also laryngectomy.
is seeking to modify the nature of the electronic sound
source produced. The intelligibility of electrolaryngeal —Philip C. Doyle and Tanya L. Eadie
speech is relatively good, given the external nature of the
alaryngeal voice source and the electronic character of
References
sound production. A reduction in speech intelligibility is
primarily observed for voiceless consonants (i.e., voiced- Bailey, B. J. (1985). Glottic carcinoma. In B. J. Bailey and
for-voiceless errors) due to the fact that the electrolarynx H. F. Biller (Eds.), Surgery of the larynx (pp. 257–278).
is a continuous sound source (Weiss and Basili, 1985). Philadelphia: Saunders.
Berlin, C. I. (1963). Clinical measurement of esophageal
speech: I. Methodology and curves of skill acquisition.
Rehabilitative Considerations Journal of Speech and Hearing Disorders, 28, 42–51.
All methods of alaryngeal speech, whether esophageal, Curry, E. T., and Snidecor, J. C. (1961). Physical measurement
and pitch perception in esophageal speech. Laryngoscope,
TE, or electrolaryngeal, have distinct advantages and
71, 415–424.
disadvantages. Advantages for esophageal speech include Damste, P. H. (1986). Some obstacles to learning esophageal
a nonmechanical and hands-free method of communi- speech. In R. L. Keith and F. L. Darley (Eds.), Laryn-
cation. For TE speech, pitch is near normal, loudness gectomee rehabilitation (2nd ed., pp. 85–92). San Diego:
exceeds normal, and speech rate and prosody is near College-Hill Press.
normal; for artificial larynx speech, it may be acquired Diedrich, W. M. (1968). The mechanism of esophageal speech.
quickly by most people and may be used in conditions of Annals of the New York Academy of the Sciences, 155,
background noise. In contrast, disadvantages for esoph- 303–317.
ageal speech include lowered pitch, loudness, and speech Diedrich, W. M., and Youngstrom, K. A. (1966). Alaryngeal
rate. For TE speech, it involves use and maintenance of speech. Springfield, IL: Charles C. Thomas.
Doyle, P. C. (1994). Foundations of voice and speech rehabili-
a prosthetic device with associated costs; for artificial
tation following laryngeal cancer. San Diego, CA: Singular
larynx speech, a mechanical quality is common and it Publishing Group.
requires the use of one hand. While ‘‘normal’’ speech Doyle, P. C., Danhauer, J. L., and Reed, C. G. (1988). Lis-
cannot be restored with these methods, no matter how teners’ perceptions of consonants produced by esophageal
proficient the speaker’s skills, all methods are viable and tracheoesophageal talkers. Journal of Speech and
postlaryngectomy communication options, and at least Hearing Disorders, 53, 400–407.
Anatomy of the Human Larynx 13
Hillman, R. E., Walsh, M. J., Wolf, G. T., Fisher, S. G., and Hamaker, R. C., Singer, M. I., and Blom, E. D. (1985). Pri-
Hong, W. K. (1998). Functional outcomes following treat- mary voice restoration at laryngectomy. Archives of Oto-
ment for advanced laryngeal cancer. Annals of Otology, laryngology, 111, 182–186.
Rhinology and Laryngology, 107, 2–27. Iverson-Thoburn, S. K., and Hayden, P. A. (2000). Alaryngeal
Orliko¤, R. F., Kraus, D. S., Budnick, A. S., Pfister, D. G., speech utilization: A survey. Journal of Medical Speech-
and Zelefsky, M. J. (1999). Vocal function following suc- Language Pathology, 8, 85–99.
cessful chemoradiation treatment for advanced laryngeal Pfister, D. G., Strong, E. W., Harrison, L. B., Haines, I. E.,
cancer: Preliminary results. Phonoscope, 2, 67–77. Pfister, D. A., Sessions, R., et al. (1991). Larynx preserva-
Pauloski, B. R. (1998). Acoustic and aerodynamic character- tion with combined chemotherapy and radiation therapy in
istics of tracheoesophageal voice. In E. D. Blom, M. I. advanced but respectable head and neck cancer. Journal of
Singer, and R. C. Hamaker (Eds.), Tracheoesophageal voice Clinical Oncology, 9, 850–859.
restoration following total laryngectomy (pp. 123–141). San Reed, C. G. (1983). Surgical-prosthetic techniques for alaryn-
Diego, CA: Singular Publishing Group. geal speech. Communicative Disorders, 8, 109–124.
Robbins, J., Fisher, H. B., Blom, E. D., and Singer, M. I. Salmon, S. J. (1996/1997). Using an artificial larynx. In E.
(1984). A comparative acoustic study of normal, esopha- Lauder (Ed.), Self-help for the laryngectomee (pp. 31–33).
geal, and tracheoesophageal speech production. Journal of San Antonio, TX: Lauder Enterprises.
Speech and Hearing Disorders, 49, 202–210. Scarpino, J., and Weinberg, B. (1981). Junctural contrasts in
Singer, M. I., and Blom, E. D. (1980). An endoscopic tech- esophageal and normal speech. Journal of Speech and
nique for restoration of voice after laryngectomy. Annals of Hearing Research, 46, 120–126.
Otology, Rhinology, and Laryngology, 89, 529–533. Shanks, J. C. (1986). Essentials for alaryngeal speech: Psy-
Snidecor, J. C. (1968). Speech rehabilitation of the laryngec- chology and physiology. In R. L. Keith and F. L. Darley
tomized. Springfield, IL: Charles C. Thomas. (Eds.), Laryngectomee rehabiliation (pp. 337–349). San
Weiss, M. S., and Basili, A. M. (1985). Electrolaryngeal speech Diego, CA: College-Hill Press.
produced by laryngectomized subjects: Perceptual char- Shipp, T. (1967). Frequency, duration, and perceptual mea-
acteristics. Journal of Speech and Hearing Research, 28, sures in relation to judgment of alaryngeal speech accept-
294–300. ability. Journal of Speech and Hearing Research, 10, 417–
Wepman, J. M., MacGahan, J. A., Rickard, J. C., and Shel- 427.
ton, N. W. (1953). The objective measurement of progres- Singer, M. I. (1988). The upper esophageal sphincter: Role in
sive esophageal speech development. Journal of Speech and alaryngeal speech acquisition. Head and Neck Surgery,
Hearing Disorders, 18, 247–251. Supplement II, S118–S123.
Singer, M. I., Hamaker, R. C., Blom, E. D., and Yoshida,
G. Y. (1989). Applications of the voice prosthesis during
Further Readings laryngectomy. Annals of Otology, Rhinology, and Laryngol-
ogy, 98, 921–925.
Andrews, J. C., Mickel, R. D., Monahan, G. P., Hanson, Weinberg, B. (1982). Speech after laryngectomy: An overview
D. G., and Ward, P. H. (1987). Major complications fol- and review of acoustic and temporal characteristics of
lowing tracheoesophageal puncture for voice restoration. esophageal speech. In A. Sekey (Ed.), Electroacoustic anal-
Laryngoscope, 97, 562–567. ysis and enhancement of alaryngeal speech (pp. 5–48).
Batsakis, J. G. (1979). Tumors of the head and neck: Clinical Springfield, IL: Charles C. Thomas.
and pathological considerations (2nd ed.). Baltimore: Wil- Weinberg, B., Horii, Y., and Smith, B. E. (1980). Long-time
liams and Wilkins. spectral and intensity characteristics of esophageal speech.
Blom, E. D., Singer, M. I., and Hamaker, R. C. (1982). Tra- Journal of the Acoustical Society of America, 67, 1781–
cheostoma valve for postlaryngectomy voice rehabilita- 1784.
tion. Annals of Otology, Rhinology, and Laryngology, 91, Williams, S., and Watson, J. B. (1985). Di¤erences in speaking
576–578. proficiencies in three laryngectomy groups. Archives of
Doyle, P. C. (1997). Speech and voice rehabilitation of patients Otolaryngology, 111, 216–219.
treated for head and neck cancer. Current Opinion in Oto- Woodson, G. E., Rosen, C. A., Murry, T., Madasu, R., Wong,
laryngology and Head and Neck Surgery, 5, 161–168. F., Hengested, A., et al. (1996). Assessing vocal function
Doyle, P. C., and Keith, R. L. (Eds.). (2003). Contemporary after chemoradiation for advanced laryngeal carcinoma.
considerations in the treatment and rehabilitation of head and Archives of Otolaryngology–Head and Neck Surgery, 122,
neck cancer: Voice, speech, and swallowing. Austin, TX: 858–864.
Pro-Ed.
Gandour, J., and Weinberg, B. (1984). Production of intona-
tion and contrastive contrasts in electrolaryngeal speech. Anatomy of the Human Larynx
Journal of Speech and Hearing Research, 27, 605–612.
Gates, G., Ryan, W. J., Cantu, E., and Hearne, E.
(1982). Current status of laryngectomy rehabilitation: II. The larynx is an organ that sits in the hypopharynx,
Causes of failure. American Journal of Otolaryngology, 3, at the crossroads of the upper respiratory and upper di-
8–14. gestive tracts. The larynx is intimately involved in respi-
Gates, G., Ryan, W. J., Cooper, J. C., Lawlis, G. F., Cantu, ration, deglution, and phonation. Although it is the
E., Hayashi, T., et al. (1982). Current status of laryn-
primary sound generator of the peripheral speech mech-
gectomee rehabilitation: I. Results of therapy. American
Journal of Otolaryngology, 3, 1–7. anism, it must be viewed primarily as a respiratory
Gates, G., Ryan, W. J., and Lauder, E. (1982). Current status organ. In this capacity it controls the flow of air into and
of laryngectomee rehabilitation: IV. Attitudes about laryn- out of the lower respiratory tract, prevents food from
gectomee rehabilitation should change. American Journal of becoming lodged in the trachea or bronchi (which would
Otolaryngology, 3, 97–103. threaten life and interfere with breathing), and, through
14 Part I: Voice
the cough reflex, assists in dislodging material from the

lower airway. The larynx also plays a central role in the
development of the intrathoracic and intra-abdominal
pressures needed for lifting, elimination of bodily wastes,
and sound production.
Throughout life, the larynx undergoes maturational
and involutional (aging) changes (Kahane, 1996), which
influence its capacity as a sound source. Despite these
naturally and slowly occurring structural changes, the
larynx continues to function relatively flawlessly. This is
a tribute to the elegance of its structure.
Regional Anatomical Relationships. The larynx is

located in the midline of the neck. It lies in front of the
vertebral column and between the hyoid bone above and
the trachea below. In adults, it lies between the third and
sixth cervical vertebrae. The root, or pharyngeal portion,
of the tongue is interconnected with the epiglottis of the
larynx by three fibroelastic bands, the glossoepiglottic
folds. The lowermost portion of the pharynx, the hypo-
pharynx, surrounds the posterior aspect of the larynx.
Muscle fibers of the inferior pharyngeal constrictor at-
tach to the posterolateral aspect of the thyroid and cri-
coid cartilages. The esophagus lies inferior and posterior
to the larynx. It is a muscular tube that interconnects the
pharynx and the stomach. Muscle fibers originating
from the cricoid cartilage form part of the muscular
valve, which opens to allow food to pass from the phar-
ynx into the esophagus.
Cartilaginous Skeleton. The larynx is composed of five

major cartilages: thyroid, cricoid, one pair of arytenoids,
and the epiglottis (Fig. 1). The hyoid bone, though inti-
mately associated with the larynx, is not part of it. The
cartilaginous components of the larynx are joined by Figure 1. Laryngeal cartilages shown separately (top) and
ligaments and membranes. The thyroid and cricoid carti- articulated (bottom) at the laryngeal joints. The hyoid bone is
not part of the larynx but is attached to it by the thyrohyoid
lages are composed of hyaline cartilage, which provides membrane. (From Orliko¤, R. F., and Kahane, J. C. [1996].
them with form and rigidity. They are interconnected by Structure and function of the larynx. In N. J. Lass [Ed.], Prin-
the cricothyroid joints and surround the laryngeal cavity. ciples of experimental phonetics. St. Louis: Mosby. Reproduced
These cartilages support the soft tissues of the laryngeal with permission.)
cavity, thereby protecting this vital passageway for
unencumbered movement of air into and out of the
lower airway. The thyroid cartilage is composed of two attachment for the vocal folds, all but one pair of the
quadrangular plates that are united at midline in an intrinsic laryngeal muscles, and the vestibular folds.
angle called the thyroid angle or laryngeal prominence. The thyroid, cricoid and arytenoid cartilages are
In the male, the junction of the laminae forms an acute interconnected to each other by two movable joints, the
angle, while in the female it is obtuse. This sexual cricothyroid and cricoarytenoid joints. The cricothyroid
dimorphism emerges after puberty. The cricoid cartilage joint joins the thyroid and cricoid cartilages and allows
is signet ring shaped and sits on top of the first ring of the cricoid cartilage to rotate upward toward the cricoid
the trachea, ensuring continuity of the airway from the (Stone and Nuttal, 1974). Since the vocal folds are
larynx into the trachea (the origin of the lower respira- attached anteriorly to the inside face of the thyroid car-
tory tract). The epiglottis is a flexible leaf-shaped carti- tilage and posteriorly to the arytenoid cartilages, which
lage whose deformability results from its elastic cartilage in turn are attached to the upper rim of the cricoid,
composition. During swallowing, the epiglottis closes this rotation e¤ects lengthening and shortening of the
over the entrance into the laryngeal cavity, thus pre- vocal folds, with concomitant changes in tension. Such
venting food and liquids from passing into the laryngeal changes in tension are the principal method of changing
cavity, which could obstruct the airway and interfere the rate of vibration of the vocal folds. The cricoary-
with breathing. The arytenoid cartilages are intercon- tenoid joint joins the arytenoid cartilages to the supero-
nected to the cricoid cartilage via the cricoarytenoid lateral rim of the cricoid. Rocking motions of the
joint. These pyramid-shaped cartilages serve as points of arytenoids on the upper rim of the cricoid cartilage allow
The laryngeal cavity is conventionally divided into

three regions. The upper portion is a somewhat ex-
panded supraglottal cavity or vestibule whose walls
are reinforced by the quadrangular membrane. The
middle region, called the glottal region, is bounded by
the vocal folds; it is the narrowest portion. The lowest
region, the infraglottal or subglottal region, is bounded
by the conus elasticus. The area of primary laryngeal
valving is the glottal region, where the shape and size of
the rima glottidis or glottis (space between the vocal
folds) is modified during respiration, vocalization, and
sphincteric closure. The rima glottidis consists of an
intramembranous portion, which is bordered by the soft
tissues of the vocal folds, and an intracartilaginous por-
tion, the posterior two-fifths of the rima glottidis, which
is located between the vocal processes and the bases of
the arytenoid cartilages. The anterior two-thirds of the
glottis is an area of dynamic change occasioned by the
positioning and aerodynamic displacement of the vocal
folds. The overall dimensions of the intracartilaginous
glottis remain relatively stable except during strenuous
sphincteric valving.
The epithelium that lines the laryngeal cavity exhibits
regional specializations. Stratified squamous epithelium
covers surfaces subjected to contact, compressive, and
vibratory forces. Typical respiratory epithelium (pseudo-
stratified ciliated columnar epithelium with goblet cells)
is plentiful in the laryngeal cavity and lines the supra-
Figure 2. The laryngeal cavity, as viewed posteriorly. (From
glottis, ventricles, and nonvibrating portions of the vocal
Kahane, J. C. [1988]. Anatomy and physiology of the organs
of the peripheral speech mechanism. In N. J. Lass, L. L. folds; it also provides filtration and moisturization
McReynolds, J. L. Northern, and D. E. Yoder [Eds.], Hand- of flowing air. The epithelium and immediately underly-
book of speech-language pathology and audiology. Toronto: ing connective tissue form the muscosa, which is sup-
B. C. Decker. Reproduced with permission.) plied by an array of sensory receptors sensitive to
pressure, chemical, and tactile stimuli, pain, and direc-
tion and velocity of airflow (Wyke and Kirchner, 1976).
the arytenoids and the attached vocal folds to be drawn These receptors are innervated by sensory branches
away (abducted) from midline and brought toward from the superior and recurrent laryngeal nerves. They
(adducted) midline. The importance of these actions has are essential components of the exquisitely sensitive
been emphasized by von Leden and Moore (1961), as protective reflex mechanism within the larynx that in-
they are necessary for developing the transglottal impe- cludes initiating coughing, throat clearing, and sphinc-
dances to airflow that are needed to initiate vocal fold teric closure.
vibration. The e¤ect of such movements is to change the
size and shape of the glottis, the space between the vocal Laryngeal Muscles. The larynx is acted upon by ex-
folds, which is of importance in laryngeal articulation, trinsic and intrinsic laryngeal muscles (Tables 1 and 2).
producing devoicing and pauses, and facilitating modes The extrinsic laryngeal muscles are attached at one end
of vocal atttack. to the larynx and have one or more sites of attachment
to a distant site (e.g., the sternum or hyoid bone). The
Laryngeal Cavity. The laryngeal cartilages surround an suprahyoid and infrahyoid muscles attach to the hyoid
irregularly shaped tube called the laryngeal cavity, which bone and are generally considered extrinsic laryngeal
forms the interior of the larynx (Fig. 2). It extends from muscles (Fig. 3). Although these muscles do not attach
the laryngeal inlet (laryngeal aditus), through which it to the larynx, they influence laryngeal position in the
communicates with the hypopharynx, to the level of the neck through their action on the hyoid bone. The
inferior border of the cricoid cartilage. Here the laryn- thyroid cartilage is connected to the hyoid bone by the
geal cavity is continuous with the lumen of the trachea. hyothyroid membrane and ligaments. The larynx is
The walls of the laryngeal cavity are formed by fibro- moved through displacement of the hyoid bone. The
elastic tissues lined with epithelium. These fibroelastic suprahyoid and infrahyoid muscles also stabilize the
tissues (quadrangular membrane and conus elasticus) hyoid bone, allowing other muscles in the neck to act
restore the dimensions of the laryngeal cavity, which directly on the laryngeal cartilages. The suprahyoid and
become altered through muscle activity, passive stretch infrahyoid muscles are innervated by a combination
from adjacent structures, and aeromechanical forces. of cranial and spinal nerves. Cranial nerves V and VII
16 Part I: Voice
Table 1. Morphological Characteristics of the Suprahyoid and Infrahyoid Muscles

Muscles Origin Insertion Function Innervation
Suprahyoid Muscles
Anterior digastric Digastric fossa of mandible Body of hyoid bone Raises hyoid bone Cranial nerve V
Posterior digastric Mastoid notch of temporal To hyoid bone via an Raises and retracts Cranial nerve VII
bone intermediate tendon hyoid bone
Stylohyoid Posterior border of styloid Body of hyoid Raises hyoid bone Cranial nerve VII
process
Mylohyoid Mylohyoid line of mandible Median raphe, Raises hyoid bone Cranial nerve V
extending from deep
surface of mandible at
midline to hyoid bone
Geniohyoid Inferior pair of genial Anterior surface of body Raises hyoid bone Cervical nerve I carried
tubercles of mandible of hyoid bone and draws it via descendens
forward hypoglossi
Infrahyoid Muscles
Sternohyoid Deep surface of manubrium; Medial portion of Depresses hyoid Ansa cervicalis
medial end of clavical inferior surface of bone
body of hyoid bone
Omohyoid From upper border of Inferior aspect of body Depresses hyoid Cervical nerves I–III
scapula (inferior belly) of hyoid bone bone carried by the ansa
into tendon issuing cervicalis
superior belly
Sternothyroid Posterior surface of Oblique line of thyroid Lowers hyoid bone; Ansa cervicalis
manubrium; edge of first cartilage stabilizes hyoid
costal cartilage bone
Thyrohyoid Oblique line of thyroid Lower border of body When larynx is Cervical nerve I, through
cartilage and greater wing of stabilized, lowers descendens hypoglossi
hyoid bone hyoid bone; when
hyoid is fixed,
larynx is raised
Table 2. Morphological Characteristics of the Intrinsic Laryngeal Muscles

Muscle Origin Insertion Function Innervation
Cricothyroid Lateral surface of cricoid Pars recta fibers attach to Rotational approximation External branch
cartilage arch; fibers anterior lateral half of of the cricoid and of superior
divide into upper inferior border of thyroid thyroid cartilages; laryngeal nerve
portion (pars recta) cartilage; pars obliqua lengthens and tenses (cranial nerve X)
and lower portion fibers attach to anterior vocal folds
(pars obliqua) margin of inferior corner of
thyroid cartilage
Lateral Upper border of arch of Anterior aspect of muscular Adducts vocal folds; Recurrent laryngeal
cricoarytenoid cricoid cartilage process of arytenoid closes rima glottidis nerve (cranial
cartilage nerve X)
Posterior Cricoid lamina Muscular process of arytenoid Abducts vocal folds; Recurrent laryngeal
cricoarytenoid cartilage opens rima glottidis nerve (cranial
nerve X)
Interarytenoid
Transverse Horizontally coursing Dorsolateral ridge of opposite Approximates bases of Recurrent laryngeal
fibers fibers extending arytenoid cartilage arytenoid cartilages, nerve (cranial
between the dorso- assists vocal fold nerve X)
lateral ridges of each adduction
arytenoid cartilage
Oblique fibers Obliquely coursing fibers Inserts onto apex of opposite Same as transverse fibers Recurrent laryngeal
from base of one arytenoid cartilage nerve (cranial
arytenoid cartilage nerve X)
Thyroarytenoid Deep surface of thyroid Fovea oblonga of arytenoid Adduction, tensor, Recurrent laryngeal
cartilage at midline cartilage; vocalis fibers relaxer of vocal folds nerve (cranial
attach close to vocal (depending on what nerve X)
process; muscularis fibers parts of muscles are
attach more laterally active)
biomechanical outcomes of the actions of the intrinsic

laryngeal muscles are (1) abduction and adduction of
the vocal folds, (2) changing the position of the laryngeal
cartilages relative to each other, (3) transiently changing
the dimensions and physical properties of the vocal folds
(i.e., length, tension, mass per unit area, compliance, and
elasticity), and (4) modifying laryngeal airway resistance
by changing the size or shape of the glottis.
The intrinsic laryngeal muscles are innervated by
nerve fibers carried in the trunk of the vagus nerve.
These branches are usually referred to as the superior
and inferior laryngeal nerves. The cricothyroid muscle
is innervated by the superior laryngeal nerve, while all
other intrinsic laryngeal muscles are innervated by the
inferior (recurrent) laryngeal nerve. Sensory fibers from
these nerves supply the entire laryngeal cavity.
Histochemical studies of intrinsic laryngeal muscles
(Matzelt and Vosteen, 1963; Rosenfield et al., 1982)
have enabled us to appreciate the unique properties of
the intrinsic muscles. The intrinsic laryngeal muscles
contain, in varying proportions, fibers that control fine
movements for prolonged periods (type 1 fibers) and
fibers that develop tension rapidly within a muscle (type
Figure 3. The extrinsic laryngeal muscles. (From Bateman,
2 fibers). In particular, laryngeal muscles di¤er from the
H. E., and Mason, R. M. [1984]. Applied anatomy and physiol-
ogy of the speech and hearing mechanism. Springfield, IL:
standard morphological reference for striated muscles,
Charles C Thomas. Reproduced with permission.) the limb muscles, in several ways: (1) they typically have
a smaller mean diameter of muscle fibers; (2) they are
less regular in shape; (3) the muscle fibers are generally
supply all of the suprahyoid muscles except the genio- uniform in diameter across the various intrinsic muscles;
hyoid. All of the infrahyoid muscles are innervated by (4) individual muscle fibers tend not to be uniform in
spinal nerves from the upper (cervical) portion of the their directionality within a fascicle but exhibit greater
spinal cord. variability in the course of muscle fibers, owing to the
The suprahyoid and infrahyoid muscles have been tendency for fibers to intermingle in their longitudinal
implicated in fundamental frequency control under a and transverse planes; and (5) laryngeal muscles have a
construct proposed by Sonninen (1956), called the ex- greater investment of connective tissues.
ternal frame function. Sonninen suggested that the
extrinsic laryngeal muscles are involved in producing Vocal Folds. The vocal folds are multilayered vibra-
fundamental frequency changes by exerting forces on the tors, not a single homogeneous band. Hirano (1974)
laryngeal skeleton that e¤ect length and tension changes showed that the vocal folds are composed of several
in the vocal folds. layers of tissues, each with di¤erent physical properties
The designation of extrinsic laryngeal muscles and only 1.2 mm thick. The vocal fold consists of one
adopted here is based on strict anatomical definition as layer of epithelium, three layers of connective tissue
well as on research data on the action of the extrinsic (lamina propria), and the vocalis fibers of the thyroary-
laryngeal muscles during speech and singing. One of the tenoid muscle (Fig. 5). Based on examination of ultra-
most convincing studies in this area was done by Shipp high-speed films and biomechanical testing of the vocal
(1975), who showed that the sternothyroid and thyro- folds, Hirano (1974) found that functionally, the epithe-
hyoid muscles systematically change the vertical position lium and superficial layer of the lamina propria form the
of the larynx in the neck, particularly with changes in cover, which is the most mobile portion of the vocal fold.
fundamental frequency. Shipp demonstrated that the Wavelike mucosal disturbances travel along the surface
sternothyroid lowers the larynx with decreasing pitch, during sound production. These movements are essential
while the thyrohyoid raises it. for developing the agitation and patterning of air mole-
The intrinsic muscles of the larynx (Fig. 4) are a col- cules in transglottal airflow during voice production.
lection of small muscles whose points of attachment are The superficial layer of the lamina propria is com-
all in the larynx (to the laryngeal cartilages). The ana- posed of sparse amounts of loosely interwoven collage-
tomical properties of the intrinsic laryngeal muscles are nous and elastic fibers. This area, also known as
summarized in Table 2. The muscles can be categorized Reinke’s space, is important clinically because it is the
according to their e¤ects on the shape of the rima glot- principal site of swelling or edema formation in the
tidis, the positioning of the folds relative to midline, vocal folds following vocal abuse or in laryngitis. The
and the vibratory behavior of the vocal folds. Hirano intermediate and deep layers of the lamina propria are
and Kakita (1985) nicely summarized these behaviors called the transition. The vocal ligament is formed from
(Table 3). Among the most important functional or elastic and collagenous fibers in these layers. It provides
18 Part I: Voice
Figure 4. The intrinsic laryngeal muscles as shown in lateral (A), Northern, and D. E. Yoder [Eds.], Handbook of speech-
posterior (B), and superior (C) views. (From Kahane, J. C. language pathology and audiology. Toronto: B. C. Decker.
[1988]. Anatomy and physiology of the organs of the periph- Reproduced with permission.)
eral speech mechanism. In N. J. Lass, L. L. McReynolds, J. L.
Table 3. Actions of Intrinsic Laryngeal Muscles on Vocal Fold Position and Shape
Vocal Fold
Parameter CT VOC LCA IA PCA
Position Paramedian Adduct Adduct Adduct Adduct
Level Lower Lower Lower 0 Elevate
Length Elongate Shorten Elongate (Shorten) Elongate
Thickness Thin Thicken Thin (Thicken) Thin
Edge Sharpen Round Sharpen 0 Round
Muscle (body) Sti¤en Sti¤en Sti¤en (Slacken) Sti¤en
Mucosa (cover Sti¤en Slacken Sti¤en (Slacken) Sti¤en
and transition)
Note: 0 indicates no e¤ect; parentheses indicate slight e¤ect; italics indicate marked e¤ect;
normal type indicates consistent, strong e¤ect.
Abbreviations: CT, cricothyroid muscle; VOC, vocalis muscle; LCA, lateral cricoarytenoid
muscle; IA, interarytenoid muscle; PCA, posterior cricoarytenoid muscle.
From Hirano, M., and Kakita, Y. (1985). Cover-body theory of vocal fold vibration. In
R. G. Danilo¤ (Ed.), Speech science: Recent advances. San Diego, CA: College-Hill Press.
Reproduced with permission.
Figure 5. Schematic of the layered

structure of the vocal folds. The lead-
ing edge of the vocal fold with its epi-
thelium is at left. Co, collaginous
fibers; Elf, elastic fibers; M, vocalis
muscle fibers. (From Hirano, M.
[1975]. O‰cial report: Phonosurgery.
Basic and clinical investigations. Oto-
logia [Fukuoka], 21, 239–440. Repro-
duced with permission.)
resiliency and longitudinal stability to the vocal folds M. A. Crary (Eds.), Organic voice disorders (pp. 89–111).
during voice production. The transition is sti¤er than the San Diego, CA: Singular Publishing Group.
cover but more pliant than the vocalis muscle fibers, Matzelt, D., and Vosteen, K. H. (1963). Electroenoptische und
which form the body of the vocal folds. These muscle enzymatische Untersuchungen an menschlicher Kehlkopf-
muskulatur. Archiv für Ohren-Nasen- und Kehlkopfheil-
fibers are active in regulating fundamental frequency by
kunde, 181, 447–457.
influencing the tension in the vocal fold and the compli- Rosenfield, D. B., Miller, R. H., Sessions, R. B., and Patten,
ance and elasticity of the vibrating surface (cover). B. M. (1982). Morphologic and histochemical characteristics
See also voice production: physics and physiology. of laryngeal muscle. Archives of Otolaryngology, 108, 662–
666.
—Joel C. Kahane
Shipp, T. (1975). Vertical laryngeal positioning during contin-
References uous and discrete vocal frequency change. Journal of Speech
and Hearing Research, 18, 707–718.
Faaborg-Andersen, K., and Sonninen, A. (1960). The function Sonninen, A. (1956). The role of the external laryngeal muscles
of the extrinsic laryngeal muscles at di¤erent pitch. Acta in length-adjustment of the vocal cords in singing. Archives
Otolaryngologica, 51, 89–93. of Otolaryngology (Stockholm), Supplement, 118, 218–231.
Hirano, M. (1974). Morphological structure of the vocal Stone, R. E., and Nuttal, A. L. (1974). Relative movements of
cord as a vibrator and its vartions. Folia Phoniatrica, 26, the thyroid and cricoid cartilages assisted by neural stimu-
89–94. lation in dogs. Acta Otolaryngologica, 78, 135–140.
Hirano, M., and Kakita, Y. (1985). Cover-body theory of vo- von Leden, H., and Moore, P. (1961). The mechanics of the crico-
cal fold vibration. In R. G. Danolo¤ (Ed.), Speech science: arytenoid joint. Archives of Otolaryngology, 73, 541–550.
Recent advances (pp. 1–46). San Diego, CA: College-Hill Wyke, B. D., and Kirchner, J. A. (1976). Neurology of the
Press. larynx. In R. Hinchcli¤e and D. Harrison (Eds.), Scientific
Kahane, J. C. (1996). Life span changes in the larynx: An foundations of otolaryngology (pp. 546–574). London:
anatomical perspective. In W. S. Brown, B. P. Vinson, and Heinemann.
20 Part I: Voice
Further Readings relationship of communication ability to global quality-

of-life measurement. Hassan and Weymuller (1993), List
Fink, B. (1975). The human larynx. New York: Raven Press. et al. (1998), Picarillo (1994), and Murry et al. (1998)
Hast, M. H. (1970). The developmental anatomy of the larynx.
Otolaryngology Clinics of North America, 3, 413–438.
have all demonstrated that voice communication is an
Hirano, M. (1981). Structure of the vocal fold in normal and essential element in patients’ perception of their quality
disease states anatomical and physical studies. In C. L. of life following treatment for head and neck cancer.
Ludlow and M. O. Hart (Eds.), Proceedings of the Confer- Patient-based assessment of voice handicap has been
ence on the Assessment of Vocal Pathology (ASHA Reports lacking in the area of noncancerous voice disorders. The
11). Rockville, MD: American Speech-Language-Hearing developments and improvements of software for assess-
Assoc. ing acoustic objective measures of voice and relating
Hirano, M., and Sato, K. (1993). Histological color atlas of the measures of abnormal voices to normal voices have gone
human larynx. San Diego, CA: Singular Publishing Group. on for a number of years. However, objective measures
Kahane, J. C. (1998). Functional histology of the larynx primarily assess specific treatments and do not encom-
and vocal folds. In C. W. Cummings, J. M. Frederickson,
L. A. Harker, C. J. Krause, and D. E. Schuller (Eds.),
pass functional outcomes from the patient’s perspective.
Otolaryngology Head and Neck Surgery (pp. 1853–1868). These measures do not necessarily discriminate the se-
St. Louis: Mosby. verity of handicap as it relates to specific professions.
Konig, W. F., and von Leden, H. (1961). The peripheral ner- Objective test batteries are useful to quantify disease se-
vous system of the human larynx: I. The mucous mem- verity (Rosen, Lombard, and Murry, 2000), categorize
brane. Archives of Otolaryngology, 74, 1–14. acoustic/physiological profiles of the disease (Hartl et al.,
Negus, V. (1928). The mechanism of the larynx. St. Louis: 2001), and measure changes that occur as a result of
Mosby. treatment (Dejonckere, 2000). A few objective and sub-
Orliko¤, R. F., and Kahane, J. C. (1996). Structure and func- jective measures are correlated with the diagnosis of
tion of the larynx. In N. J. Lass (Ed.), Principles of experi- the voice disorder (Wolfe, Fitch, and Martin, 1997), but
mental phonetics. St. Louis: Mosby.
Rossi, G., and Cortesina, G. (1965). Morphological study of
until recently, none have been related to the patient’s
the laryngeal muscles in man: Insertions and courses of perception of the severity of his or her problem. This
muscle fibers, motor end-plates and proprioceptors. Acta latter issue is important in all diseases and disorders
Otolaryngologica, 59, 575–592. when life is not threatened since it is ultimately the
patient’s perception of disease severity and his or her
motivation to seek treatment that dictates the degree of
treatment success.
Functional impact relates to the degree of handicap
Assessment of Functional Impact of or disability. Accordingly, there are three levels of a
disorder: impairment, disability, and handicap (World
Voice Disorders Health Organization, 1980). Handicap is the impact of
the impairment of the disability on the social, environ-
Introduction mental, or economic functioning of the individual.
Treatment usually relates to the physical well-being of a
Voice disorders occur in approximately 6% of all adults patient, and it is this physical well-being that generally
and in as many as 12% of children. Within the adult takes priority when attempting to assess the severity of
group, specific professions report the presence of a voice the handicap. A more comprehensive approach might
problem that interferes with their employment. As many seek to address the patient’s own impression of the se-
as 50% of teachers and 33% of secretaries complain of verity of the disorder and how the disorder interferes
voice problems that restrict their ability to work or to with the individual’s professional and personal lifestyle.
function in a normal social environment (Smith et al., Measurement of functional impact is somewhat
1998). The restriction of work, or lifestyle, due to a voice di¤erent from assessment of disease status in that it
disorder has gone virtually undocumented until recently. does not directly address treatment e‰cacy, but rather
While voice scientists and clinicians have focused most addresses the value of a particular treatment for a par-
of their energy, talent, and time on diagnosing and ticular individual. This may be considered treatment ef-
measuring the severity of voice disorders with various fectiveness. E‰cacy, on the other hand, looks at whether
perceptual, acoustic, or physiological instruments, little or not a treatment can produce an expected result based
attention has been given to the e¤ects of a voice disorder on previous studies. Functional impact relates to the de-
on the daily needs of the patient. Over the past few gree of impact a disorder has on an individual patient,
years, interest has increased in determining the func- not necessarily to the severity of the disease.
tional impact of the voice disorder due to the Internet in
using patient-based outcome measures to establish e‰-
cacy of treatments and the desire to match treatment
Voice Disorders and Outcomes Research
needs with patient’s needs. This article reviews the evo- Assessment of functional impact on the voice is barely
lution of the assessment of functional impact of voice beyond the infancy stage. Interest in the issues relating to
disorders and selected applications of those assessments. functional use of the voice stems from the development
Assessment of the physiological consequences of of instruments to measure all aspects of vocal function
voice disorders has evolved from a strong interest in the related to the patient, the disease, and the treatment.
Assessment of Functional Impact of Voice Disorders 21
Moreover, there are certain parameters of voice dis- functioning, bodily pain, general health, vitality, social
orders that cannot be easily measured in the voice labo- functioning, mental health, and health transition. The
ratory, such as endurance, acceptance of a new voice, SF-36 has been used for a wide range of disease-specific
and vocal e¤ectiveness. topics once it was shown to be a valid measure of
The measurement of voice handicap must take into the degree of general health. The SF-36 is a pencil-and-
account issues such as ‘‘can the person teach in the paper test that has been used in numerous studies for
classroom all day?’’ or ‘‘can a shop foreman talk loud assessing outcomes of treatment. In addition, because
enough to be heard over the noise of factory machines?’’ each scale has been determined to be a reliable and valid
An outcome measure that takes into account the measure of health in and of itself, this assessment has
patient’s ability to speak in the classroom or a factory been used to validate other assessments of quality of life
will undoubtedly provide a more accurate assessment and handicap that are disease specific. However, one of
of voice handicap (although not necessarily an accurate the di‰culties with using such a test for a specific disease
assessment of the disease, recovery from disease, or is that one or more of the subscales may not be impor-
quality of voice) than the acoustic measures obtained in tant or appropriate. For example, when considering cer-
the voice laboratory. Thus, patient-based measures of tain voice disorders, the subscale of the SF-36 known as
voice handicap provide significant information that can- bodily pain may not be quite appropriate. Thus, the SF-
not be obtained from biological and physiologic vari- 36 is not a direct assessment of voice handicap but rather
ables traditionally used in voice assessment models. a general measure of well-being.
Voice handicap measures may measure an individu- The challenge to develop a specific scale related to a
al’s perceived level of general health, an individual’s specific organ function such as a scale for voice disorders
quality of life, her ability to continue with her current presents problems unlike the development of the SF-36
employment versus opting for a change in employment, or other general quality-of-life scales.
her satisfaction with treatment regardless of the disease
state, or the cost of the treatment. Outcome of treatment
Assessing Voice Handicap
for laryngeal cancer is typically measured using Kaplan-
Myer curves (Adelstein et al., 1990). While this tool Currently there are no federal regulations defining voice
measures the disease-related status of the patient, it does handicap, unlike the handicap measures associated with
not presume to assess overall patient satisfaction with hearing loss, which is regulated by the Department of
treatment. Rather, the degree to which swallowing status Labor. The task of measuring the severity of a voice
improves and voice communication returns to normal disorder may be somewhat di‰cult because of the areas
are measured by instruments that generally focus on that are a¤ected, namely emotional, physical, functional,
quality of life (McHorney et al., 1993). economic, etc. Moreover, as already indicated, while
Voice disorders are somewhat di¤erent than the measures such a perceptual judgments of voice charac-
treatment of a life threatening disease such as laryngeal teristics, videostroboscopic visual perceptual findings,
cancer. Treatment that involves surgery, pharmacology, acoustic perceptual judgments, as well as physiological
or voice therapy requires the patient’s full cooperation measures objectively obtained provide some input as
throughout the course of treatment. The quality and ac- to the severity of the voice compared to normal, these
curacy of surgery or the level of voice therapy may not measures do not provide insight as to the degree of
necessarily reflect the long-term outcome if the patient handicap and disability that a specific patient is experi-
does not cooperate with the treatment procedure. As- encing. It should be noted, however, that there are
sessment of voice handicap involves the patient’s ability handicap/disability measures developed for other aspects
to use his or her voice under normal circumstances of of communication, namely hearing loss and dizziness
social and work-related speaking situations. The voice (Newman et al., 1990; Jacobson et al., 1994). These
handicap will be reflected to the extent that the voice is measures have been used to quantify functional outcome
usable in those situations. following various interventions in auditory function.
Outcome Measures: General Health Versus Development of the Voice Handicap Index
Specific Disease In 1997, Jacobson and her colleagues proposed a mea-
There are two primary ways to assess the handicap of sure of voice handicap known as the Voice Handicap
a voice disorder. One is to look at the patient’s overall Index (VHI) (Jacobson et al., 1998). This patient self-
well-being. The other is to compare his or her voice to assessment tool consists of ten items in each of three
normal voice measures. The first usually encompasses domains: emotional, physical, and functional aspects of
social factors as well as physical factors that are related voice disorders. The functional subscale includes state-
to the specific disorder. One measure that has been used ments that describe the impact of a person’s voice on
to look at the e¤ect of disease on life is the Medical his daily activities. The emotional subscale indicates the
Outcomes Study (MOS), a 36-item short-form general patient’s a¤ective responses to the voice disorder. The
health survey (McHorney et al., 1993). The 36-item items in the physical subscale are statements that relate
short form, otherwise known as SF-36, measures eight to either the patient’s perception of laryngeal discomfort
areas of health that are commonly a¤ected or changed or the voice output characteristics such as too low or too
by diseases and treatments: physical functioning, role high a pitch. From an original 85-item list, a 30-item
22 Part I: Voice
questionnaire using a five-point response scale from 0, either because of surgery, voice therapy, or a combina-
indicating he ‘‘never’’ felt this about his voice problem to tion of both.
4, where he ‘‘always’’ felt this to be the case, was finally The same investigators examined the application of
obtained. This 30-item questionnaire was then assessed the VHI to a specific group of patients with voice dis-
for test-retest stability in total as well as the three sub- orders, singers (Murry and Rosen, 2000). Singers are
scales, and was validated against the SF-36. A shift in unique in that they often complain of problems related
the total score of 18 points or greater is required in order only to their singing voice. Murry and Rosen examined
to be certain that a change is due to intervention and not 73 professional and 33 nonprofessional singers and
to unexplained variability. The Voice Handicap Index compared them with a control group of 369 nonsingers.
was designed to assess all types of voice disorders, even The mean VHI score for the 106 singers was 34.7,
those encountered by tracheoesophageal speakers. A compared with a mean of 53.2 for the 336 nonsingers.
detailed analysis of patient data using this test has The VHI significantly separated singers from nonsingers
recently been published (Benninger et al., 1998). in terms of severity. Moreover, the mean VHI score for
Since the VHI has been published, others have pro- the professional singers was significantly lower (31.0 vs.
posed similar tests of handicap. Hogikian (1999) and 43.2) than for the recreational singers. Although lower
Glicklich (1999) have both demonstrated their assess- VHI scores were found in singers than in nonsingers, this
ment tools to have validity and reliability in assessing a does not imply that the VHI is not a useful instrument
patient’s perception of the severity of a voice problem. for assessing voice problems in singers. On the contrary,
One of the additional uses of the VHI as suggested by several questions were singled out as specifically sensi-
Benninger and others is to assess measures after treat- tive to singers. The findings of this study should alert
ment (1998). Murry and Rosen (2001) evaluated the clinicians that the use of the VHI points to the specific
VHI in three groups of speakers to determine the rela- needs as well as the seriousness of a singer’s handicap.
tive severity of voice disorders in patients with muscular Although the quality of voice may be mildly disordered,
tension dysphonia (MTD), benign vocal fold lesions the voice handicap may be significant.
(polyps/cysts), and vocal fold paralysis prior to and fol- Recently, Rosen and Murry (in press) presented re-
lowing treatments. Figure 1 shows that subjects with liability data on a revised 10-question VHI. The results
vocal fold paralysis displayed the highest self-perception suggest that a 10-question VHI produces is highly cor-
of handicap both before and after treatment. Subjects related with the original VHI. The 10-item questionnaire
with benign vocal fold lesions demonstrated the lowest provides a quick, reliable assessment of the patient’s
perception of handicap severity before and after treat- perception of voice handicap.
ment. It can be seen that in general, there was a 50% Other measures of voice outcome have been proposed
or greater improvement in the mean VHI for the com- and studied. Recently, Gliklich, Glovsky, and Mont-
bined groups. However, the patients with vocal fold gomery examined outcomes in patients with vocal fold
paralysis initially began with the highest pretreatment paralysis (Hogikyan and Sethuraman, 1999). The in-
VHI and remained with the highest VHI after treatment. strument, which contains five questions, is known as the
Although the VHI scores following treatment were sig- Voice Outcome Survey (VOS). Overall reliability of the
nificantly lower, there still remained a measure of hand- VOS was related to the subscales of the SF-36 for a
icap in all subjects. Overall, in 81% of the patients, there group of patients with unilateral vocal fold paralysis.
was a perception of significantly reduced voice handicap, Additional work has been done by Hogikyan (1999).
These authors presented a measure of voice-related
quality of life (VR-QOL). They also found that this
self-administered 10-question patient assessment of se-
verity was related to changes in treatment. Their sub-
jects consisted primarily of unilateral vocal fold paralysis
patients and showed a significant change from pre- to
post-treatment.
A recent addition to functional assessment is the
Voice Activity and Participation Profile (VAPP). This
tool assesses the e¤ects voice disorders have on limiting
and participating in activities which require use of the
voice (Ma and Yiu, 2001). Activity limitation refers to
constraints imposed on voice activities and participation
restriction refers to a reduction or avoidance of voice
activities. This 28-item tool examines five areas: self-
perceived severity of the voice problem; e¤ect on the job;
e¤ect on daily communication; e¤ect on social commi-
nication; and e¤ect on emotion. The VAPP has been
found to be a reliable and valid assessment tool for
Figure 1. Pre- and post-treatment voice handicap scores for assessing self-perceived voice severity as it relates to
selected populations. activity and participation in vocal activities.
Electroglottographic Assessment of Voice 23
Summary daily activities. Journal of Speech, Language, and Hearing

Research, 44, 511–524.
The study of functional voice assessment to identify the McHorney, C. A., Ware, J. E., Jr., Lu, J. F., and Sherbourne,
degree of handicap is novel for benign voice disorders. C. D. (1993). The MOS 36-item short form health survey
For many years, investigators have focused on acoustic (SF-36): II. Psychometric and clinical tests of validity in
and aerodynamic measures of voice production to assess measuring physical and medical health constructs. Medical
change in voice following treatment. These measures, Care, 31, 247–263.
although extremely useful in understanding treatment Murry, T., Madassu, R., Martin, A., and Robbins, K. T.
(1998). Acute and chronic changes in swallowing and qual-
e‰cacy, have not shed significant light on patients’ per-
ity of life following intraarterial chemoradiation for organ
ception of their disorder. Measures such as the VHI, preservation in patients with advanced head and neck can-
VOS, and VR-QOL have demonstrated that regardless cer. Head and Neck Surgery, 20, 31–37.
of age, sex, or disease type, the degree of handicap can Murry, T., and Rosen, C. A. (2001). Occupational voice dis-
be identified. Furthermore, treatment for these handi- orders and the voice handicap index. In P. Dejonckere
caps can also be assessed in terms of e¤ectiveness for the (Ed.), Occupational voice disorders: Care and cure (pp. 113–
patient. Patients’ self-assessment of perceived severity 128). The Hague, the Netherlands: Kugler Publications.
also allows investigators to make valid comparisons of Murry, T., and Rosen, C. A. (2000). Voice Handicap Index
the impact of an intervention for patients who use their results in singers. Journal of Voice, 14, 370–377.
voices in di¤erent environments and the patients’ per- Newman, C., Weinstein, B., Jacobson, G., and Hug, G. (1990).
The hearing handicap inventory for adults: Psychometric
ception of the treatment from a functional perspective.
adequacy and audiometric correlates. Ear and Hearing, 11,
Assessment of voice based on a patient’s perceived se- 430–433.
verity and the need to recover vocal function may be Picarillo, J. F. (1994). Outcome research and otolaryngology.
the most appropriate manner to assess severity of voice Otolaryngology–Head and Neck Surgery, 111, 764–769.
handicap. Rosen, C. A., and Murry, T. (in press). The VHI 10: An out-
come measure following voice disorder treatment. Journal
—Thomas Murry and Clark A. Rosen of Voice.
Rosen, C., Lombard, L. E., and Murry, T. (2000). Acoustic,
References aerodynamic and videostroboscopic features of bilateral
vocal fold lesions. Annals of Otology Rhinology and Lar-
Adelstein, D. J., Sharon, V. M., Earle, A. S., et al. (1990). ynology, 109, 823–828.
Long-term results after chemoradiotherapy of locally con- Smith, E., Lemke, J., Taylor, M., Kirchner, L., and Ho¤man,
fined squamous cell head and neck cancer. American Jour- H. (1998). Frequency of voice problems among teachers
nal of Clinical Oncology, 13, 440–447. and other occupations. Journal of Voice, 12, 480–488.
Benninger, M. S., Atiuja, A. S., Gardner, G., and Grywalski, Wolfe, V., Fitch, J., and Martin, D. (1997). Acoustic measures
C. (1998). Assessing outcomes for dysphonic patients. of dysphonic severity across and within voice types. Folia
Journal of Voice, 12, 540–550. Phoniatrica, 49, 292–299.
Dejonckere, P. H. (2000). Perceptual and laboratory assess- World Health Organization. (1980). International Classification
ment of dysphonia. Otolaryngology Clinics of North Amer- of Impairments, Disabilities and Handicaps: A manual of
ica, 33, 33–34. classification relating to the consequences of disease (pp. 25–
Glicklich, R. E., Glovsky, R. M., Montgomery, W. W. (1999). 43). Geneva: World Health Organization.
Validation of a voice outcome survey for unilateral vocal
fold paralysis. Otolaryngology–Head and Neck Surgery,
120, 152–158. Electroglottographic Assessment of
Hartl, D. M., Hans, S., Vaissiere, J., Riquet, M., et al. (2001).
Objective voice analysis after autologous fat injection for Voice
unilateral vocal fold paralysis. Annals of Otology, Rhinol-
ogy, and Laryngology, 110, 229–235. A number of instruments can be used to help character-
Hassan, S. J., and Weymuller, E. A. (1993). Assessment of
ize the behavior of the glottis and vocal folds during
quality of life in head and neck cancer patients. Head and
Neck Surgery, 15, 485–494. phonation. The signals derived from these instruments
Hogikyan, N. D., and Sethuraman, G. (1999). Validation of are called glottographic waveforms or glottograms (Titze
an instrument to measure voice-related quality of life (V- and Talkin, 1981). Among the more common glotto-
RQOL). Journal of Voice, 13, 557–559. grams are those that track change in glottal flow, via
Jacobson, B. H., Johnson, A., Grywalski, C., et al. (1998). The inverse filtering; glottal width, via kymography; glottal
Voice Handicap Index (VHI): Development and validation. area, via photoglottography; and vocal fold movement,
Journal of Voice, 12, 540–550. via ultrasonography (Baken and Orliko¤, 2000). Such
Jacobson, G. P., Ramadan, N. M., Aggarwal, S., and New- signals can be used to obtain several di¤erent physio-
man, C. W. (1994). The development of the Henry Ford logical measures, including the glottal open quotient
Hospital Headache Disability Inventory (HDI). Neurology,
44, 837–842.
and the maximum flow declination rate, both of which
List, M. A., Ritter-Sterr, C., and Lansky, S. B. (1998). A per- are highly valuable in the assessment of vocal function.
formance status scale for head and neck patients. Cancer, Unfortunately, the routine application of these tech-
66, 564–569. niques has been hampered by the cumbersome and time-
Ma, E. P., and Yiu, E. M. (2001). Voice activity and partici- consuming way in which these signals must be acquired,
pation profile: Assessing the impact of voice disorders on conditioned, and analyzed. One glottographic method,
24 Part I: Voice
electroglottography (EGG), has emerged as the most

commonly used technique, for several reasons: (1) it is
noninvasive, requiring no probe placement within the
vocal tract; (2) it is easy to acquire, alone or in conjunc-
tion with other speech signals; and (3) it o¤ers unique
information about the mucoundulatory behavior of the
vocal folds, which contemporary theory suggests is a
critical element in the assessment of voice production.
Electroglottography (known as electrolaryngography
in the United Kingdom) is a plethysmographic technique
that entails fixing a pair of surface electrodes to each side
of the neck at the thyroid lamina, approximating the
level of the vocal folds. An imperceptible low-amplitude,
high-frequency current is then passed between these
electrodes. Because of their electrolyte content, tissue
and body fluids are relatively good conductors of elec-
tricity, whereas air is a particularly poor conductor.
When the vocal folds separate, the current path is forced
to circumvent the glottal air space, decreasing e¤ective
voltage. Contact between the vocal folds a¤ords a con-
duit through which current can take a more direct route
across the neck. Electrical impedance is thus highest
when the current path must completely bypass an open
glottis and progressively decreases as greater contact be-
tween the vocal folds is achieved. In this way, the voltage
across the neck is modulated by the contact of the vocal
folds, forming the basis of the EGG signal. The glottal
region, however, is quite small compared with the total
region through which the current is flowing. In fact,
most of the changes in transcervical impedance are due to
strap muscle activity, laryngeal height variation induced
by respiration and articulation, and pulsatile blood vol-
ume changes. Because increasing and decreasing vocal
fold contact has a relatively small e¤ect on the overall Figure 1. At the top is shown a schematic representation of a
impedance, the electroglottogram is both high-pass fil- single cycle of vocal fold vibration viewed coronally (left) and
tered to remove the far slower nonphonatory impedance superiorly (right) (after Hirano, 1981). Below it is a normal
changes and amplified to boost the laryngeal contribu- electroglottogram depicting relative vocal fold contact area.
tion to the signal. The result is a waveform—sometimes The numbered points on the trace correspond approximately to
designated Lx—that varies chiefly as a function of vocal the points of the cycle depicted above. The contact phases of
the vibratory cycle are shown beneath the electroglottogram.
fold contact area (Gilbert, Potter, and Hoodin, 1984).
First proposed by Fabre in 1957 as a means to assess
laryngeal physiology, the clinical potential of EGG was the pattern of vocal fold vibration can be characterized
recognized by the mid-1960s. Interest in EGG increased quite well (Fig. 1). The contact pattern will vary as a
in the 1970s as the importance of mucosal wave dynam- consequence of several factors, including bilateral vocal
ics for vocal fold vibration was confirmed, and accel- fold mass and tension, medial compression, and the
erated greatly in the 1980s with the advent of personal anatomy and orientation of the medial surfaces. Con-
computers and commercially available EGGs that were siderable research has been devoted to establishing the
technologically superior to previous instruments. Today, important features of the EGG and how they relate
EGG has a worldwide reputation as a useful tool to to specific aspects of vocal fold status and behavior.
supplement the evaluation and treatment of vocal pa- Despite these e¤orts, however, the contact area function
thology. The clinical challenge, however, is that a valid is far from perfectly understood, especially in the face of
and reliable EGG assessment demands a firm under- pathology. Given the complexity of the ‘‘rolling and
standing of normal vocal fold vibratory behavior along peeling’’ motion of the glottal margins and the myriad
with recognition of the specific capabilities and limita- possibilities for abnormality of tissue structure or bio-
tions of the technique. mechanics, it is not surprising that e¤orts to formulate
Instead of a simple mediolateral oscillation, the vocal simple rules relating abnormal details to specific pathol-
folds engage in a quite complex undulatory movement ogies have not met with notable success. In short, the
during phonation, such that their inferior margins ap- clinical value of EGG rests in documenting the vibratory
proximate before the more superior margins make con- consequence of pathology rather than in diagnosing the
tact. Because EGG tracks e¤ective medial contact area, pathology itself.
Electroglottographic Assessment of Voice 25
Using multiple glottographic techniques, Baer,

Löfqvist, and McGarr (1983) demonstrated that, for
normal modal-register phonation, the ‘‘depth of closure’’
was very shallow just before glottal opening and quite
deep soon after closure was initiated. Most important,
they showed that the instant at which the glottis first
appears occurs sometime before all contact is lost, and
that the instant of glottal closure occurs sometime after
the vocal folds first make contact. Thus, although the
EGG is sensitive to the depth of contact, it cannot be
used to determine the width, area, or shape of the glottis.
For this reason, EGG is not a valid technique for the
measurement of glottal open time or, therefore, the open
quotient. Likewise, since EGG does not specify which
parts of the vocal folds are in contact, it cannot be used
to measure glottal closed time, nor can it, without addi-
tional evidence, be used to determine whether maximal
vocal fold contact indeed represents complete oblitera-
tion of the glottal space. Identifying the exact moment
when (and if ) all medial contact is lost has also proved
particularly problematic. Once the vocal folds do lose
contact, however, it can no longer be assumed that the
EGG signal conveys any information whatsoever about
laryngeal behavior. During such intervals, the signal
may vary solely as a function of the instrument’s auto-
matic gain control and filtering (Rothenberg, 1981).
Although the EGG provides useful information only
about those parts of the vibratory cycle during which
there is some vocal fold contact, these characteristics
may provide important clinical insight, especially when
paired with videostroboscopy and other data traces.
EGG, with its ability to demonstrate contact change in
both the horizontal and vertical planes, can quite e¤ec-
Figure 2. Typical electroglottograms obtained from a normal
tively document the normal voice registers (Fig. 2) as
man prolonging phonation in the low-frequency pulse,
well as abnormal and unstable modes of vibration (Fig. moderate-frequency modal, and high-frequency falsetto voice
3). However, to qualitatively assess EGG wave charac- registers.
teristics and to derive useful indices of vocal fold contact
behavior, it may be best to view the EGG in terms of
a vibratory cycle composed of a contact phase and a tween 1 for a contact phase maximally skewed to the
minimal-contact phase (see Fig. 1). The contact phase left and þ1 for a contact phase maximally skewed to the
includes intervals of increasing and decreasing contact, right. For normal modal-register phonation, CI varies
whereas the peak represents maximal vocal fold contact between 0.6 and 0.4 for both men and women, but,
and, presumably, maximal glottal closure. The minimal- as can be seen in Figure 2, it is markedly di¤erent for
contact phase is that portion of the EGG wave during other voice registers. Pulse-register EGGs typically have
which the vocal folds are probably not in contact. Much CIs in the vicinity of 0.8, whereas in falsetto it would
clinical misinterpretation can be avoided if no attempt not be uncommon to have a CI that approximates zero,
is made to equate the vibratory contact phase with the indicating a symmetrical or nearly symmetrical contact
glottal closed phase or the minimal-contact phase with phase.
the glottal open phase. Another EGG measure that is gaining some currency
For the typical modal-register EGG, the contact in the clinical literature is the contact quotient (CQ).
phase is asymmetrical; that is, the increase in contact Defined as the duration of the contact phase relative to
takes less time than the interval of decreasing contact. the period of the entire vibratory cycle, there is evidence
The degree of contact asymmetry is thought to vary not from both in vivo testing and mathematical modeling to
only as a consequence of vocal fold tension but also as a suggest that CQ varies with the degree of medial com-
function of vertical mucosal convergence and dynamics pression of the vocal folds (see Fig. 3) along a hypo-
(i.e., phasing; Titze, 1990). A dimensionless ratio, the adducted ‘‘loose’’ (or ‘‘breathy’’) to a hyperadducted
contact index (CI), can be used to assess contact sym- ‘‘tight’’ (or ‘‘pressed’’) phonatory continuum (Rothen-
metry (Orliko¤, 1991). Defined as the di¤erence between berg and Mahshie, 1988; Titze, 1990). Under typical
the increasing and decreasing contact durations divided vocal circumstances, CQ is within the range of 40%–
by the duration of the contact phase, CI will vary be- 60%, and despite the propensity for a posterior glottal
26 Part I: Voice
intonation, voicing, and fluency characteristics. In fact,

EGG has, for many, become the preferred means by
which to measure vocal fundamental frequency and jitter.
In summary, EGG provides an innocuous, straight-
forward, and convenient way to assess vocal fold vibra-
tion through its ability to track the relative area of
contact. Although it does not supply valid information
about the opening and closing of the glottis, the tech-
nique a¤ords a unique perspective on vocal fold be-
havior. When conservatively interpreted, and when
combined with other tools of laryngeal evaluation, EGG
can substantially further the clinician’s understanding of
the malfunctioning larynx and play an e¤ective role in
therapeutics as well.
See also acoustic assessment of voice.
—Robert F. Orliko¤
References
Baer, T., Löfqvist, A., and McGarr, N. S. (1983). Laryngeal
vibrations: A comparison between high-speed filming and
glottographic techniques. Journal of the Acoustical Society
of America, 73, 1304–1308.
Baken, R. J., and Orliko¤, R. F. (2000). Laryngeal function. In
Clinical measurement of speech and voice (2nd ed., pp. 394–
451). San Diego, CA: Singular Publishing Group.
Fabre, P. (1957). Un procédé électrique percutané d’inscription
de l’accolement glottique au cours de la phonation: Glot-
tographie de haute fréquence. Premiers resultats. Bulletin de
l’Académie Nationale de Médecine, 141, 66–69.
Gilbert, H. R., Potter, C. R., and Hoodin, R. (1984). Laryn-
gograph as a measure of vocal fold contact area. Journal of
Speech and Hearing Research, 27, 178–182.
Hirano, M. (1981). Clinical examination of voice. New York:
Figure 3. Electroglottograms representing di¤erent abnormal Springer-Verlag.
modes of vocal fold vibration. Kakita, Y. (1988). Simultaneous observation of the vibratory
pattern, sound pressure, and airflow signals using a physical
model of the vocal folds. In O. Fujimura (Ed.), Vocal
chink in women, there does not seem to be a significant physiology: Voice production, mechanisms, and functions
sex e¤ect. This is probably due to the fact that EGG (pp. 207–218). New York: Raven Press.
(and thus the CQ) is insensitive to glottal gaps that are Orliko¤, R. F. (1991). Assessment of the dynamics of vocal
not time varying. Unlike men, however, women tend fold contact from the electroglottogram: Data from normal
to show an increase in CQ with vocal F0. It has been male subjects. Journal of Speech and Hearing Research, 34,
conjectured that this may be the result of greater medial 1066–1072.
compression employed by women at higher F0s that Orliko¤, R. F. (1995). Vocal stability and vocal tract configu-
serves to diminish the posterior glottal gap. Nonetheless, ration: An acoustic and electroglottographic investigation.
a strong relationship between CQ and vocal intensity has Journal of Voice, 9, 173–181.
Rothenberg, M. (1981). Some relations between glottal air flow
been documented in both men and women, consistent
and vocal fold contact area. ASHA Reports, 11, 88–96.
with the known relationship between vocal power and Rothenberg, M., and Mahshie, J. J. (1988). Monitoring vocal
the adductory presetting of the vocal folds. Because fold abduction through vocal fold contact area. Journal of
vocal intensity is also related to the rate of vocal fold Speech and Hearing Research, 31, 338–351.
contact (Kakita, 1988), there have been some prelimi- Titze, I. R. (1990). Interpretation of the electroglottographic
nary attempts to derive useful EGG measures of the signal. Journal of Voice, 4, 1–9.
contact rise time. Titze, I. R., and Talkin, D. (1981). Simulation and interpreta-
Because EGG is relatively una¤ected by vocal tract tion of glottographic waveforms. ASHA Reports, 11, 48–55.
resonance and turbulence noise (Orliko¤, 1995), it al-
lows evaluation of vocal fold behavior under conditions Further Readings
not well-suited to other voice assessment techniques. For Abberton, E., and Fourcin, A. J. (1972). Laryngographic
this reason, and because the EGG waveshape is a rela- analysis and intonation. British Journal of Disorders of
tively simple one, the EGG has found some success both Communication, 7, 24–29.
as a trigger signal for laryngeal videostroboscopy and as Baken, R. J. (1992). Electroglottography. Journal of Voice, 6,
a means to define and describe phonatory onset, o¤set, 98–110.
Functional Voice Disorders 27
Carlson, E. (1993). Accent method plus direct visual feedback Functional Voice Disorders
of electroglottographic signals. In J. C. Stemple (Ed.), Voice
therapy: Clinical studies (pp. 57–71). St. Louis: Mosby–
Year Book. The human voice is acutely responsive to changes in
Carlson, E. (1995). Electrolaryngography in the assessment
emotional state, and the larynx plays a prominent role as
and treatment of incomplete mutation (puberphonia) in
adults. European Journal of Disorders of Communication, an instrument for the expression of intense emotions
30, 140–148. such as fear, anger, grief, and joy. Consequently, many
Childers, D. G., Hicks, D. M., Moore, G. P., and Alsaka, regard the voice as a sensitive barometer of emotions
Y. A. (1986). A model of vocal fold vibratory motion, con- and the larynx as the control valve that regulates the re-
tact area, and the electroglottogram. Journal of the Acousti- lease of these emotions (Aronson, 1990). Furthermore,
cal Society of America, 80, 1309–1320. the voice is one of the most individual and characteristic
Childers, D. G., Hicks, D. M., Moore, G. P., Eskenazi, L., and expressions of a person—a ‘‘mirror of personality.’’
Lalwani, A. L. (1990). Electroglottography and vocal fold Thus, when the voice becomes disordered, it is not un-
physiology. Journal of Speech and Hearing Research, 33, common for clinicians to suggest personality traits,
245–254.
psychological factors, or emotional or inhibitory pro-
Childers, D. G., and Krishnamurthy, A. K. (1985). A critical
review of electroglottography. CRC Critical Review of Bio- cesses as primary causal mechanisms. This is especially
medical Engineering, 12, 131–161. true in the case of functional dysphonia or aphonia,
Colton, R. H., and Conture, E. G. (1990). Problems and pit- in which no visible structural or neurological laryngeal
falls of electroglottography. Journal of Voice, 4, 10–24. pathology exists to explain the partial or complete loss of
Cranen, B. (1991). Simultaneous modelling of EGG, PGG, voice.
and glottal flow. In J. Gau‰n and B. Hammarberg (Eds.), Functional dysphonia, which may account for more
Vocal fold physiology: Acoustic, perceptual, and physiologi- than 10% of cases referred to multidisciplinary voice
cal aspects of voice mechanisms (pp. 57–64). San Diego, clinics, occurs predominantly in women, commonly fol-
CA: Singular Publishing Group. lows upper respiratory infection symptoms, and varies
Croatto, L., and Ferrero, F. E. (1979). L’esame elettro-
glottografico appliato ad alcuni casi di disodia. Acta Pho-
in its response to treatment (Bridger and Epstein, 1983;
niatrica Latina, 2, 213–224. Schalen and Andersson, 1992). The term functional
Fourcin, A. J. (1981). Laryngographic assessment of phona- implies a voice disturbance of physiological function
tory function. ASHA Reports, 11, 116–127. rather than anatomical structure. In clinical circles,
Gleeson, M. J., and Fourcin, A. J. (1983). Clinical analysis of functional is usually contrasted with organic and often
laryngeal trauma secondary to intubation. Journal of the carries the added meaning of psychogenic. Stress, emo-
Royal Society of Medicine, 76, 928–932. tion, and psychological conflict are frequently presumed
Gómez Gonzáles, J. L., and del Cañizo Alvarez, C. (1988). to cause or exacerbate functional symptoms.
Nuevas tecnicas de exploración funcional ları́ngea: La Some confusion surrounds the diagnostic category
electroglotografı́a. Anales Oto-Rino-Otoları́ngologica Ibero- of functional dysphonia because it includes an array of
Americana, 15, 239–362.
Hacki, T. (1989). Klassifizierung von Glottisdysfunktionen mit
medically unexplained voice disorders: psychogenic,
Hilfe der Elecktroglottographie. Folia Phoniatrica, 41, 43–48. conversion, hysterical, tension-fatigue syndrome, hyper-
Hertegård, S., and Gau‰n, J. (1995). Glottal area and vibra- kinetic, muscle misuse, and muscle tension dysphonia.
tory patterns studied with simultaneous stroboscopy, flow Although each diagnostic label implies some degree of
glottography, and electroglottography. Journal of Speech etiologic heterogeneity, whether these disorders are
and Hearing Research, 38, 85–100. qualitatively di¤erent and etiologically distinct remains
Kitzing, P. (1990). Clinical applications of electroglottography. unclear. When applied clinically, these various labels
Journal of Voice, 4, 238–249. frequently reflect clinician supposition, bias, or pref-
Kitzing, P. (2000). Electroglottography. In A. Ferlito (Ed.), erence. Voice disorder taxonomies have yet to be
Diseases of the larynx (pp. 127–138). New York: Oxford adequately operationalized; consequently, diagnostic
University Press.
Motta, G., Cesari, U., Iengo, M., and Motta, G., Jr. (1990).
categories often lack clear thresholds or discrete boun-
Clinical application of electroglottography. Folia Phonia- daries to determine patient inclusion or exclusion. To
trica, 42, 111–117. improve precision, some clinicians prefer the term psy-
Neil, W. F., Wechsler, E., and Robinson, J. M. (1977). Elec- chogenic voice disorder, to put the emphasis on the psy-
trolaryngography in laryngeal disorders. Clinical Otolaryn- chological origins of the disorder. According to Aronson
gology, 2, 33–40. (1990), a psychogenic voice disorder is synonymous with
Nieto Altazarra, A., and Echarri San Martin, R. (1996). Elec- a functional one but o¤ers the clinician the advantage
troglotografı́a. In R. Garcı́a-Tapia Urrutia and I. Cobeta of stating confidently, after an exploration of its causes,
Marco (Eds.), Diagnóstico y tratamiento de los transtornos that the voice disorder is a manifestation of one or more
de la voz (pp. 163–169). Madrid, Spain: Editorial Garsi. forms of psychological disequilibrium. At the purely
Orliko¤, R. F. (1998). Scrambled EGG: The uses and abuses
of electroglottography. Phonoscope, 1, 37–53.
phenomenological level there may be little di¤erence
Roubeau, C., Chevrie-Muller, C., and Arabia-Guidet, C. between functional and psychogenic voice disorders.
(1987). Electroglottographic study of the changes of voice Therefore, in this discussion, the terms functional and
registers. Folia Phoniatrica, 39, 280–289. psychogenic will be used synonymously, which reflects
Wechsler, E. (1977). A laryngographic study of voice disorders. current trends in the clinical literature (nosological im-
British Journal of Disorders of Communication, 12, 9–22. precision notwithstanding).
28 Part I: Voice
In clinical practice, ‘‘psychogenic voice disorder’’ without strain), breathiness, and high-pitched falsetto, as
should not be a default diagnosis for a voice problem well as voice and pitch breaks that vary in consistency
of undetermined cause. Rather, at least three criteria and severity.
should be met before such a diagnosis is o¤ered: symp- In conversion voice disorders, psychological factors
tom psychogenicity, symptom incongruity, and symp- are judged to be associated with the voice symptoms
tom reversibility (Sapir, 1995). Symptom psychogenicity because conflicts or other stressors precede the onset or
refers to the finding that the voice disorder is logically exacerbation of the dysphonia. In short, patients convert
linked in time of onset, course, and severity to an iden- intrapsychic distress into a voice symptom. The voice
tifiable psychological antecedent, such as a stressful life loss, whether partial or complete, is also often inter-
event or interpersonal conflict. Such information is preted to have symbolic meaning. Primary or secondary
acquired through a complete case history and psycho- gains are thought to play an important role in main-
social interview. Symptom incongruity refers to the obtaining and reinforcing the conversion disorder. Primary
servation that the vocal symptoms are physiologically gain refers to anxiety alleviation accomplished by pre-
incompatible with existing or suspected disease, are venting the psychological conflict from entering con-
internally inconsistent, and are incongruent with other scious awareness. Secondary gain refers to the avoidance
speech and language characteristics. An often cited ex- of an undesirable activity or responsibility and the extra
ample of symptom incongruity is complete aphonia attention or support conferred on the patient.
(whispered speech) in a patient who has a normal throat Butcher and colleagues (Butcher et al., 1987; Butcher,
clear, cough, laugh, or hum, whereby the presence of Elias, and Raven, 1993; Butcher, 1995) have argued that
such normal nonspeech vocalization is at odds with there is little research evidence that conversion disorder
assumptions regarding neural integrity and function of is the most common cause of functional voice loss.
the laryngeal system. Finally, symptom reversibility Butcher advised that the conversion label should be re-
refers to complete, sustained amelioration of the voice served for cases of aphonia in which lack of concern and
disorder with short-term voice therapy (usually one or motivation to improve the voice coexists with clear evi-
two sessions) or through psychological abreaction. Fur- dence of a temporally linked psychosocial stressor. In the
thermore, maintaining the voice improvement requires place of conversion, Butcher (1995) o¤ered two alterna-
no compensatory e¤ort on the part of the patient. In tive models to account for psychogenic voice loss. Both
general, psychogenic dysphonia may be suspected when models minimized the role of primary and secondary
strong evidence exists for symptom incongruity and gain in maintaining the voice disorder. The first was
symptom psychogenicity, but it is confirmed only when a slightly reformulated psychoanalytic model that stated,
there is unmistakable evidence of symptom reversibility. ‘‘if predisposed by social and cultural bias as well as
A wide array of psychopathological processes con- early learning experiences, and then exposed to inter-
tributing to voice symptom formation in functional dys- personal di‰culties that stimulate internal conflict,
phonia have been proposed. These mechanisms include, particularly in situations involving conflict over self-
but are not limited to, conversion reaction, hysteria, expression or voicing feelings, intrapsychic conflict or
hypochondriasis, anxiety, depression and various per- stress becomes channeled into musculoskeletal tension,
sonality dispositions or emotional stresses or conflicts which physically inhibits voice production’’ (p. 472). The
that induce laryngeal musculoskeletal tension. Roy and second model, based on cognitive-behavioral principles,
Bless (2000) provide a more complete exploration of the stated that ‘‘life stresses and interpersonal problems in
putative psychological and personality processes involved an individual predisposed to having di‰culties express-
in functional dysphonia, as well as related research. ing feelings or views would produce involuntary anxiety
The dominant psychological explanation for dyspho- symptoms and musculoskeletal tension, which would
nia unaccounted for by pathological findings is the con- center on and inhibit voice production’’ (p. 473). Both
cept of conversion disorder. According to the DSM-IV, models clearly emphasized the inhibitory e¤ects of excess
conversion disorder involves unexplained symptoms or laryngeal muscle tension on voice production, although
deficits a¤ecting voluntary motor or sensory function through slightly di¤erent causal mechanisms.
that suggest a neurological or other general medical Recently, Roy and Bless (2000) proposed a theory
condition (American Psychiatric Association, 1994). The that links personality to the development of functional
conversion symptom represents an unconscious simula- dysphonia. The ‘‘trait theory of functional dysphonia’’
tion of illness that ostensibly prevents conscious aware- shares Butcher’s (1995) theme of inhibitory laryngeal
ness of emotional conflict or stress, thereby displacing behavior but attributes this muscularly inhibited voice
the mental conflict and reducing anxiety. When the la- production to specific personality types. In brief, the
ryngeal system is involved, the condition is referred to as authors speculate that the combination of personality
conversion dysphonia or aphonia. In aphonia, patients traits such as introversion and neuroticism (trait anxiety)
lose their voice suddenly and completely and articulate and constraint leads to predictable and conditioned la-
in a whisper. The whisper may be pure, harsh, or sharp, ryngeal inhibitory responses to certain environmental
with occasional high-pitched squeaklike traces of pho- signals or cues. For instance, when undesirable punish-
nation. In dysphonia, phonation is preserved but dis- ing or frustrating outcomes have been paired with pre-
turbed in quality, pitch, or loudness. Myriad dysphonia vious attempts to speak out, this can lead to muscularly
types are encountered, including hoarseness (with or inhibited voice. The authors contend that this conflict
Functional Voice Disorders 29
between laryngeal inhibition and activation (with origins complaints, and (4) introversion in the functional dys-
in personality and nervous system functioning) results in phonia population. Patients have been described as
elevated laryngeal tension states and can give rise to in- inhibited, stress reactive, socially anxious, nonassertive,
complete or disordered vocalization in a structurally and and with a tendency toward restraint (Friedl, Friedrich,
neurologically intact larynx. and Egger, 1990; Gerritsma, 1991; Roy, Bless, and Hei-
As is apparent from the foregoing discussion, the ex- sey, 2000a, 2000b).
quisite sensitivity and prolonged hypercontraction of the In conclusion, the larynx can be a site of neuromus-
intrinsic and extrinsic laryngeal muscles in response to cular tension arising from stress, emotional inhibition,
stress, anxiety, depression, and inhibited emotional ex- fear or threat, communication breakdown, and certain
pression is frequently cited as the common denominator personality types. This tension can produce severely dis-
underlying the majority of functional voice problems. ordered voice in the context of a structurally normal
Nichol, Morrison, and Rammage (1993) proposed that larynx. Although the precise mechanisms underlying and
excess muscle tension arises from overactivity of auto- maintaining psychogenic voice problems remain unclear,
nomic and voluntary nervous systems in individuals the voice disorder is a powerful reminder of the intimate
who are unduly aroused and anxious. They added that relationship between mind and body.
such overactivity leads to hypertonicity of the intrinsic See also psychogenic voice disorders: direct
and extrinsic laryngeal muscles, resulting in muscle ten- therapy.
sion dysphonias sometimes associated with adjustment
or anxiety disorders, or with certain personality trait —Nelson Roy
disturbances.
Finally, some researchers have noted that their ‘‘psy- References
chogenic dysphonia and aphonia’’ patients had an
abnormally high number of reported allergy, asthma, American Psychiatric Association. (1994). Diagnostic and sta-
or upper respiratory infection symptoms, suggesting a tistical manual of mental disorders—Fourth edition. Wash-
ington, DC: American Psychiatric Press.
link between psychological factors and respiratory and
Aronson, A. E. (1990). Clinical voice disorders: An interdisci-
phonatory disorders (Milutinovic, 1991; Schalen and plinary approach (3rd ed.). New York: Thieme.
Andersson, 1992). They have speculated that organic Aronson, A. E., Peterson, H. W., and Litin, E. M. (1966).
changes in the larynx, pharynx, and nose facilitate the Psychiatric symptomatology in functional dysphonia and
appearance of a functional voice problem; that is, these aphonia. Journal of Speech and Hearing Disorders, 31, 115–
changes direct the somatization of psychodynamic con- 127.
flict. Likewise, Rammage, Nichol, and Morrison (1987) Bridger, M. M., and Epstein, R. (1983). Functional voice dis-
proposed that a relatively minor organic change such as orders: A review of 109 patients. Journal of Laryngology
edema, infection, or reflux laryngitis may trigger func- and Otology, 97, 1145–1148.
tional misuse, particularly if the individual is exceedingly Butcher, P. (1995). Psychological processes in psychogenic
voice disorder. European Journal of Disorders of Communi-
anxious about his or her voice or health. In a similar cation, 30, 467–474.
vein, the same authors felt that anticipation of poor Butcher, P., Elias, A., Raven, R. (1993). Psychogenic voice
voice production in hypochondriacal, dependent, or disorders and cognitive behaviour therapy. San Diego, CA:
obsessive-compulsive individuals leads to excessive vigi- Singular Publishing Group.
lance over sensations arising from the throat (larynx) Butcher, P., Elias, A., Raven, R., Yeatman, J., and Littlejohns,
and respiratory system that may lead to altered voice D. (1987). Psychogenic voice disorder unresponsive to
production. speech therapy: Psychological characteristics and cognitive-
Research evidence to support the various psycho- behaviour therapy. British Journal of Disorders of Commu-
logical mechanisms o¤ered to explain functional voice nication, 22, 81–92.
problems has seldom been provided. A complete review Friedl, W., Friedrich, G., and Egger, J. (1990). Personality and
coping with stress in patients su¤ering from functional dys-
of the relevant findings and interpretations is provided in phonia. Folia Phoniatrica, 42, 144–149.
Roy et al. (1997). The empirical literature evaluating the Gerritsma, E. J. (1991). An investigation into some personality
functional dysphonia–psychology relationship is charac- characteristics of patients with psychogenic aphonia and
terized by divergent results regarding the frequency and dysphonia. Folia Phoniatrica, 43, 13–20.
degree of specific personality traits (Aronson, Peterson, House, A. O., and Andrews, H. B. (1987). The psychiatric and
and Litin, 1966; Kinzl, Biebl, and Rauchegger, 1988; social characteristics of patients with functional dysphonia.
Gerritsma, 1991; Roy, Bless, and Heisey, 2000a, 2000b), Journal of Psychosomatic Research, 3, 483–490.
conversion reaction (House and Andrews, 1987; Roy Kinzl, J., Biebl, W., and Rauchegger, H. (1988). Functional
et al., 1997), and psychopathological symptoms such aphonia: Psychosomatic aspects of diagnosis and therapy.
as depression and anxiety (Aronson, Peterson, and Litin, Folia Phoniatrica, 40, 131–137.
Milutinovic, Z. (1991). Inflammatory changes as a risk factor
1966; Pfau, 1975; House and Andrews, 1987; Gerritsma, in the development of phononeurosis. Folia Phoniatrica, 43,
1991; Roy et al., 1997; White, Deary, and Wilson, 1997; 177–180.
Roy, Bless, and Heisey, 2000a, 2000b). Despite method- Nichol, H., Morrison, M. D., and Rammage, L. A. (1993).
ological di¤erences, these studies have identified a gen- Interdisciplinary approach to functional voice disorders:
eral trend toward elevated levels of (1) state and trait The psychiatrist’s role. Otolaryngology–Head and Neck
anxiety, (2) depression, (3) somatic preoccupation or Surgury, 108, 643–647.
30 Part I: Voice
Pfau, E. M. (1975). Psychologische Untersuchungsergegnisse Hypokinetic Laryngeal Movement

für Ätiologie der psychogenen Dysphonien. Folia Phonia-
trica, 25, 298–306. Disorders
Rammage, L. A., Nichol, H., and Morrison, M. D. (1987).
The psychopathology of voice disorders. Human Communi-
cations Canada, 11, 21–25. Hypokinetic laryngeal movement disorders are observed
Roy, N., and Bless, D. M. (2000). Toward a theory of the dis- most often in individuals diagnosed with the neuro-
positional bases of functional dysphonia and vocal nodules: logical disorder, parkinsonism. Parkinsonism has the
Exploring the role of personality and emotional adjustment. following features: bradykinesia, postural instability,
In R. D. Kent and M. J. Ball (Eds.), Voice quality mea- rigidity, resting tremor, and freezing (motor blocks)
surement (pp. 461–480). San Diego, CA: Singular Publish- (Fahn, 1986). For the diagnosis to be made, at least two
ing Group. of these five features should be present, and one of the
Roy, N., Bless, D. M., and Heisey, D. (2000a). Personality and
two features should be either tremor or rigidity. Parkin-
voice disorders: A superfactor trait analysis. Journal of
Speech, Language and Hearing Research, 43, 749–768. sonism as a syndrome can be classified as idiopathic
Roy, N., Bless, D. M., and Heisey, D. (2000b). Personality and Parkinson’s disease (PD) (i.e., symptoms of unknown
voice disorders: A multitrait-multidisorder analysis. Journal cause); secondary (or symptomatic) PD, caused by a
of Voice, 14, 521–548. known and identifiable cause; or parkinsonism-plus syn-
Roy, N., McGrory, J. J., Tasko, S. M., Bless, D. M., Heisey, D., dromes, in which symptoms of parkinsonism are caused
and Ford, C. N. (1997). Psychological correlates of func- by a known gene defect or have a distinctive pathology.
tional dysphonia: An evaluation using the Minnesota Multi- The specific diagnosis depends on findings in the clinical
phasic Personality Inventory. Journal of Voice, 11, 443–451. history, the neurological examination, and laboratory
Sapir, S. (1995). Psychogenic spasmodic dysphonia: A case tests. No single feature is completely reliable for di¤er-
study with expert opinions. Journal of Voice, 9, 270–281.
Schalen, L., and Andersson, K. (1992). Di¤erential diagnosis
entiating among the di¤erent causes of parkinsonism.
and treatment of psychogenic voice disorder. Clinical Oto- Idiopathic PD is the most common type of parkin-
laryngology, 17, 225–230. sonism encountered by the neurologist. Pathologically,
White, A., Deary, I. J., and Wilson, J. A. (1997). Psychiatric idiopathic PD a¤ects many structures in the central
disturbance and personality traits in dysphonic patients. Euro- nervous system (CNS), with preferential involvement
pean Journal of Disorders of Communication, 32, 121–128. of dopaminergic neurons in the substantia nigra pars
compacta (SNpc). Lewy bodies, eosinophilic intra-
Further Readings cytoplasmatic inclusions, can be found in these neurons
(Galvin, Lee, and Trojanowski, 2001). Alpha-synuclein
Deary, I. J., Scott, S., Wilson, I. M., White, A., MacKenzie, is the primary component of Lewy body fibrils (Galvin,
K., and Wilson, J. A. (1998). Personality and psychological Lee, and Trojanowski, 2001). However, only about 75%
distress in dysphonia. British Journal of Health Psychology, of patients with the clinical diagnosis of idiopathic PD
2, 333–341. are found at autopsy to have the pathological CNS
Friedl, W., Friedrich, G., Egger, J., and Fitzek, I. (1993). Psy-
changes characteristic of PD (Hughes et al., 1992).
chogenic aspects of functional dysphonia. Folia Phoniatrica,
45, 10–13. Many patients and their families consider the reduced
Green, G. (1988). The inter-relationship between vocal and ability to communicate one of the most di‰cult aspects
psychological characteristics: A literature review. Australian of PD. Hypokinetic dysarthria, characterized by a soft
Journal of Human Communication Disorders, 16, 31–43. voice, monotone, a breathy, hoarse voice quality, and
Gunther, V., Mayr-Graft, A., Miller, C., and Kinzl, H. (1996). imprecise articulation (Darley, Aronson, and Brown,
A comparative study of psychological aspects of recurring 1975; Logemann et al., 1978), and reduced facial ex-
and non-recurring functional aphonias. European Archives pression (masked facies) contribute to limitations in
of Otorhinolaryngology, 253, 240–244. communication in the vast majority of individuals with
House, A. O., and Andrews, H. B. (1988). Life events and dif- idiopathic PD (Pitcairn et al., 1990). During the course
ficulties preceding the onset of functional dysphonia. Jour-
of the disease, approximately 45%–89% of patients will
nal of Psychosomatic Research, 32, 311–319.
Koufman, J. A., and Blalock, P. D. (1982). Classification and report speech problems (Logemann and Fisher, 1981;
approach to patients with functional voice disorders. Annals Sapir et al., 2002). Repetitive speech phenomena (Benke
of Otology, Rhinology, and Laryngology, 91, 372–377. et al., 2000), voice tremor, and hyperkinetic dysarthria
Morrison, M. D., and Rammage, L. (1993). Muscle misuse may also be encountered in individuals with idiopathic
voice disorders: Description and classification. Acta Oto- PD. When hyperkinetic dysarthria is reported in idio-
laryngologica (Stockholm), 113, 428–434. pathic PD, it is most frequently seen together with other
Moses, P. J. (1954). The voice of neurosis. New York: Grune motor complications (e.g., dyskinesia) of prolonged
and Stratton. levodopa therapy (Critchley, 1981).
Pennebaker, J. W., and Watson, D. (1991). The psychology of Logemann et al. (1978) suggested that the clusters of
somatic symptoms. In L. J. Kirmayer and J. M. Robbins
speech symptoms they observed in 200 individuals with
(Eds.), Current concepts of somatization. Washington, DC:
American Psychiatric Press. PD represented a progression in dysfunction, beginning
Roy, N., and Bless, D. M. (2000). Personality traits and psy- with disordered phonation in recently diagnosed patients
chological factors in voice pathology: A foundation for and extending to include disordered articulation and
future research. Journal of Speech, Language, and Hearing other aspects of speech in more advanced cases. Recent
Research, 43, 737–748. findings by Sapir et al. (2002) are consistent with this
Hypokinetic Laryngeal Movement Disorders 31
suggestion. Sapir et al. (2002) observed voice disorders in reduced and variable single motor unit activity in the
individuals with recent onset of PD and low Unified thyroarytenoid muscle of individuals with idiopathic PD
Parkinson Disease Rating Scale (UPDRS) scores; in (Luschei et al., 1999) are consistent with a number of
individuals with longer duration of disease and higher hypotheses, the most plausible of which is reduced cen-
UPDRS scores, they observed a significantly higher in- tral drive to laryngeal motor neuron pools.
cidence of abnormal articulation and fluency, in addition Although the origin of the hypophonia in PD is cur-
to the disordered voice. Hypokinetic dysarthria of par- rently undefined, Ramig and colleagues (e.g., Fox et al.,
kinsonism is considered to be a part of basal ganglia 2002) have hypothesized that there are at least three
damage (Darley, Aronson, and Brown, 1975). However, features underlying the voice disorder in individuals with
there are no studies on pathological changes in the PD: (1) an overall neural amplitude scaledown (Penny
hypokinetic dysarthria of idiopathic PD. A significant and Young, 1983) to the laryngeal mechanism (reduced
correlation between neuronal loss and gliosis in SNpc amplitude of neural drive to the muscles of the larynx);
and substantia nigra pars reticulata (SNpr) and severity (2) problems in sensory perception of e¤ort (Berardelli et
of hypokinetic dysarthria was found in patients with al., 1986), which prevents the individual with idiopathic
Parkinson-plus syndromes (Kluin et al., 2001). Speech PD from accurately monitoring his or her vocal output;
and voice characteristics may di¤er between idiopathic which results in (3) the individual’s di‰culty in inde-
PD and Parkinson-plus syndromes (e.g., Shy-Drager pendently generating (through internal cueing or scaling)
syndrome, progressive supranuclear palsy, multisystem adequate vocal e¤ort (Hallet and Khoshbin, 1980) to
atrophy). In addition to the classic hypokinetic symp- produce normal loudness. Reduced neural drive, prob-
toms, these patients may have more slurring, a strained, lems in sensory perception of e¤ort, and problems scal-
strangled voice, pallilalia, and hypernasality (Country- ing adequate vocal output e¤ort may be significant
man, Ramig, and Pawlas, 1994) and their symptoms factors underlying the voice problems in individuals with
may progress more rapidly. PD.
Certain aspects of hypokinetic dysarthria in idio-
pathic PD have been studied extensively. Hypophonia —Lorraine Olson Ramig, Mitchell F. Brin, Miodrag
(reduced loudness, monotone, a breathy, hoarse quality) Velickovic, and Cynthia Fox
may be observed in as many as 89% of individuals with
idiopathic PD (Logemann et al., 1978). Fox and Ramig
References
(1997) reported that sound pressure levels in individuals Aronson, A. E. (1990). Clinical voice disorders. New York:
with idiopathic PD were significantly lower (2–4 dB Thieme-Stratton.
[30 cm]) across a variety of speech tasks than in an age- Baker, K. K., Ramig, L. O., Luschei, E. S., and Smith, M. E.
and sex-matched control group. Lack of vocal fold clo- (1998). Thyroarytenoid muscle activity associated with
sure, including bowing of the vocal cords and anterior hypophonia in Parkinson disease and aging. Neurology, 51,
and posterior chinks (Hanson, Gerratt, and Ward, 1984; 1592–1598.
Benke, T., Hohenstein, C., Poewe, W., and Butterworth, B.
Smith et al., 1995), has been implicated as a cause of this (2000). Repetitive speech phenomena in Parkinson’s dis-
hypophonia. Perez et al. (1996) used videostroboscopic ease. Journal of Neurology, Neurosurgery, and Psychiatry,
observations to study vocal fold vibration in individuals 69, 319–324.
with idiopathic PD. They reported abnormal phase clo- Berardelli, A., Dick, J. P., Rothwell, J. C., Day, B. L., and
sure and symmetry and tremor (both at rest and during Marsden, C. D. (1986). Scaling of the size of the first ago-
phonation) in nearly 50% of patients. Whereas reduced nist EMG burst during rapid wrist movements in patients
loudness and disordered voice quality in idiopathic PD with Parkinson’s disease. Journal of Neurology, Neuro-
have been associated with glottal incompetence (lack of surgery, and Psychiatry, 49, 1273–1279.
vocal fold closure—e.g., bowing; Hanson, Gerratt, and Countryman, S., Ramig, L. O., and Pawlas, A. A. (1994).
Ward, 1984; Smith et al., 1995; Perez et al., 1996), the Speech and voice deficits in Parkinsonian Plus syndromes:
Can they be treated? Journal of Medical Speech-Language
specific origin of this glottal incompetence has not been Pathology, 2, 211–225.
clearly defined. Rigidity or fatigue secondary to rigidity, Critchley, E. M. (1981). Speech disorders of Parkinsonism: A
paralysis, reduced thyroarytenoid longitudinal tension review. Journal of Neurology, Neurosurgery, and Psychiatry,
secondary to cricothyroid rigidity (Aronson, 1990), and 44, 751–758.
misperception of voice loudness (Ho, Bradshaw, and Darley, F. L., Aronson, A. E., and Brown, J. B. (1975). Motor
Iansek 2000; Sapir et al., 2002) are among the explana- speech disorders. Philadelphia: Saunders.
tions. It has been suggested that glottal incompetence Fahn, S. (1986). Parkinson’s disease and other basal ganglion
(e.g., vocal fold bowing) might be due to loss of muscle disorders. In A. K. Asbury, G. M. McKhann, and W. I.
or connective tissue volume, either throughout the entire McDonald (Eds.), Diseases of the nervous system: Clinical
vocal fold or localized near the free margin of the vocal neurobiology. Philadelphia: Ardmore.
Fox, C., Morrison, C. E., Ramig, L. O., and Sapir, S. (2002).
fold. Recent physiological studies of laryngeal function Current perspectives on the Lee Silverman voice treatment
in idiopathic PD have shown a reduced amplitude of (LSVT) for individuals with idiopathic Parkinson’s disease.
electromyographic activity in the thyroarytenoid mus- American Journal of Speech-Language Pathology, 11, 111–
cle accompanying glottal incompetence when compared 123.
with both aged-matched and younger controls (Baker Fox, C., and Ramig, L. (1997). Vocal sound pressure level and
et al., 1998). These findings and the observation of self-perception of speech and voice in men and women with
32 Part I: Voice
idiopathic Parkinson disease. American Journal of Speech- Conner, N. P., Abbs, J. H., Cole, K. J., and Gracco, V. L.
Language Pathology, 6, 85–94. (1989). Parkinsonian deficits in serial multiarticulate move-
Galvin, J. E., Lee, V. M., and Trojanowski, J. Q. (2001). Syn- ments for speech. Brain, 112, 997–1009.
ucleinopathies: Clinical and pathological implications. Contreras-Vidal, J., and Stelmach, G. (1995). A neural model
Archives of Neurology, 58, 186–190. of basal ganglia-thalamocortical relations in normal and
Hallet, M., and Khoshbin, S. (1980). A physiological mecha- parkinsonian movement. Biological Cybernetics, 73, 467–
nism of bradykinesia. Brain, 103, 301–314. 476.
Hanson, D., Gerratt, B., and Ward, P. (1984). Cinegraphic DeLong, M. R. (1990). Primate models of movement disorders
observations of laryngeal function in Parkinson’s disease. of basal ganglia origin. Trends in Neuroscience, 13, 281–
Laryngoscope, 94, 348–353. 285.
Ho, A. K., Bradshaw, J. L., and Iansek, T. (2000). Volume Forrest, K., Weismer, G., and Turner, G. (1989). Kinematic,
perception in parkinsonian speech. Movement Disorders, 15, acoustic and perceptual analysis of connected speech pro-
1125–1131. duced by Parkinsonian and normal geriatric adults. Journal
Hughes, A. J., Daniel, S. E., Kilford, L., and Lees, A. J. of the Acoustical Society of America, 85, 2608–2622.
(1992). Accuracy of clinical diagnosis of idiopathic Parkin- Jobst, E. E., Melnick, M. E., Byl, N. N., Dowling, G. A., and
son’s disease: A clinico-pathological study of 100 cases. Amino¤, M. J. (1997). Sensory perception in Parkinson
Journal of Neurology, Neurosurgery, and Psychiatry, 55, disease. Archives of Neurology, 54, 450–454.
181–184. Jurgens, U., and von Cramon, D. (1982). On the role of
Kluin, K. J., Gilman, S., Foster, N. L., Sima, A., D’Amato, the anterior cingulated cortex in phonation: A case report.
C., Bruch, L., et al. (2001). Neuropathological correlates Brain and Language, 15, 234–248.
of dysarthria in progressive supranuclear palsy. Archives of Larson, C. (1985). The midbrain periaqueductal fray: A brain-
Neurology, 58, 265–269. stem structure involved in vocalization. Journal of Speech
Logemann, J. A., and Fisher, H. B. (1981). Vocal tract control and Hearing Research, 28, 241–249.
in Parkinson’s disease: Phonetic feature analysis of mis- Ludlow, C. L., and Bassich, C. J. (1984). Relationships be-
articulations. Journal of Speech and Hearing Disorders, 46, tween perceptual ratings and acoustic measures of hypo-
348–352. kinetic speech. In M. R. McNeil, J. C. Rosenbek, and A. E.
Logemann, J. A., Fisher, H. B., Boshes, B., and Blonsky, E. Aronson (Eds.), Dysarthria of speech: Physiology-acoustics-
(1978). Frequency and concurrence of vocal tract dysfunc- linguistics-management. San Diego, CA: College-Hill Press.
tions in the speech of a large sample of Parkinson’s patients. Netsell, R., Daniel, B., and Celesia, G. G. (1975). Acceleration
Journal of Speech and Hearing Disorders, 43, 47–57. and weakness in parkinsonian dysarthria. Journal of Speech
Luschei, E. S., Ramig, L. O., Baker, K. L., and Smith, M. E. and Hearing Disorders, 40, 170–178.
(1999). Discharge characteristics of laryngeal single motor Sarno, M. T. (1968). Speech impairment in Parkinson’s dis-
units during phonation in young and older adults and in ease. Journal of Speech and Hearing Disorders, 49, 269–275.
persons with Parkinson disease. Journal of Neurophysiology, Schneider, J. S., Diamond, S. G., and Markham, C. H. (1986).
81, 2131–2139. Deficits in orofacial sensorimotor function in Parkinson’s
Penny, J. B., and Young, A. B. (1983). Speculations on the disease. Annals of Neurology, 19, 275–282.
functional anatomy of basal ganglia disorders. Annual Re- Solomon, N. P., Hixon, T. J. (1993). Speech breathing in Par-
view of Neurosciences, 6, 73–94. kinson’s disease. Journal of Speech and Hearing Research,
Perez, K., Ramig, L. O., Smith, M., and Dromey, C. (1996). 36, 294–310.
The Parkinson larynx: Tremor and videostroboscopic find- Stewart, C., Winfield, L., Hunt, A., et al. (1995). Speech dys-
ings. Journal of Voice, 10, 354–361. function in early Parkinson’s disease. Movement Disorders,
Pitcairn, T., Clemie, S., Gray, J., and Pentland, B. (1990). 10, 562–565.
Non-verbal cues in the self-presentation of parkinsonian
patients. British Journal of Clinical Psychology, 29, 177–
184.
Sapir, S., Pawlas, A. A., Ramig, L. O., Countryman, S., et al. Infectious Diseases and Inflammatory
(2002). Speech and voice abnormalities in Parkinson dis-
ease: Relation to severity of motor impairment, duration of Conditions of the Larynx
disease, medication, depression, gender, and age. Journal of
Medical Speech-Language Pathology, 9, 213–226.
Smith, M. E., Ramig, L. O., Dromey, C., et al. (1995). Inten- Infectious and inflammatory conditions of the larynx
sive voice treatment in Parkinson disease: Laryngostrobo- can a¤ect the voice, swallowing, and breathing to
scopic findings. Journal of Voice, 9, 453–459. varying extents. Changes can be acute or chronic and
can occur in isolation or as part of systemic processes.
Further Readings The conditions described in this article are grouped by
etiology.
Ackerman, H., and Ziegler, W. (1991). Articulatory deficits
in Parkinsonian dysarthria. Journal of Neurology, Neuro- Infectious Diseases
surgery, and Psychiatry, 54, 1093–1098.
Brown, R. G., and Marsden, C. D. (1988). An investigation of
Viral Laryngotracheitis. Viral laryngotracheitis is the
the phenomenon of ‘‘set’’ in Parkinson’s disease. Movement
Disorders, 3, 152–161. most common infectious laryngeal disease. It is typically
Caliguri, M. P. (1989). Short-term fluctuations in orofacial associated with upper respiratory infection, for example,
motor control in Parkinson’s disease. In K. M. Yorkson by rhinoviruses and adenoviruses. Dysphonia is usually
and D. R. Beukelman (Eds.), Recent advances in clinical self-limiting but may create major problems for a pro-
dysarthria. Boston: College-Hill Press. fessional voice user. The larger diameter upper airway in
Infectious Diseases and Inflammatory Conditions of the Larynx 33
adults makes airway obstruction much less likely than in ness and odynophagia, typically out of proportion to the
children. size of the lesion. However, these symptoms are not
In a typical clinical scenario, a performer with mild universally present. The vocal folds are most commonly
upper respiratory symptoms has to carry on performing a¤ected, although all areas of the larynx can be
but complains of reduced vocal pitch and increased ef- involved. Laryngeal tuberculosis is often di‰cult to dis-
fort on singing high notes. Mild vocal fold edema and tinguish from carcinoma on laryngoscopy. Chest radi-
erythema may occur but can be normal for this patient ography and the purified protein derivative (PPD) test
group. Thickened, erythematous tracheal mucosa visible help establish the diagnosis, although biopsy and histo-
between the vocal folds supports the diagnosis. logical confirmation may be required. Patients are treated
Hydration and rest may be su‰cient treatment. with antituberculous chemotherapy. The laryngeal symp-
However, if the performer decides to proceed with the toms usually respond within 2 weeks.
show, high-dose steroids can reduce inflammation, and Leprosy is rare in developing countries. Laryngeal
antibiotics may prevent opportunistic bacterial infection. infection by Mycobacterium leprae can cause nodules,
Cough suppressants, expectorants, and steam inhala- ulceration, and fibrosis. Lesions are often painless but
tions may also be useful. Careful vocal warmup should may progress over the years to laryngeal stenosis.
be undertaken before performing, and ‘‘rescue’’ must be Treatment is with antileprosy chemotherapy (Soni, 1992).
balanced against the risk of vocal injury.
Other Bacterial Infections. Laryngeal actinomycosis
Other Viral Infections. Herpes simplex and herpes zos- can occur in immunocompromised patients and follow-
ter infection have been reported in association with vocal ing laryngeal radiotherapy (Nelson and Tybor, 1992).
fold paralysis (Flowers and Kernodle, 1990; Nishizaki Biopsy may be required to distinguish it from radio-
et al., 1997). Laryngeal vesicles, ulceration, or plaques necrosis or tumor. Treatment requires prolonged anti-
may lead to suspicion of the diagnosis, and antiviral biotic therapy.
therapy should be instituted early. New laryngeal muscle Scleroma is a chronic granulomatous disease due to
weakness may also occur in post-polio syndrome (Rob- Klebsiella scleromatis. Primary involvement is in the
inson, Hillel, and Waugh, 1998). Viral infection has also nose, but the larynx can also be a¤ected. Subglottic
been implicated in the pathogenesis of certain laryn- stenosis is the main concern (Amoils and Shindo, 1996).
geal tumors. The most established association is between
human papillomavirus (HPV) and laryngeal papil- Fungal Laryngitis. Fungal laryngitis is rare and typi-
lomatosis (Levi et al., 1989). HPV, Epstein-Barr virus, cally occurs in immunocompromised individuals. Fungi
and even herpes simplex virus have been implicated in include yeasts and molds. Yeast infections are more fre-
the development of laryngeal malignancy (Ferlito et al., quent in the larynx, with Candida albicans most com-
1997; Garcia-Milian et al., 1998; Pou et al., 2000). monly identified (Vrabec, 1993). Predisposing factors in
nonimmunocompromised patients include antibiotic and
Bacterial Laryngitis. Bacterial laryngitis is most com- inhaled steroid use, and foreign bodies such as silicone
monly due to Hemophilus influenzae, Staphylococcus voice prostheses.
aureus, Streptococcus pneumoniae, and beta-hemolytic The degree of hoarseness in laryngeal candidiasis may
streptococcus. Pain and fever may be severe, with air- not reflect the extent of infection. Pain and associated
way and swallowing di‰culties generally overshadowing swallowing di‰culty may be present. Typically, thick
voice loss. Typically the supraglottis is involved, with the white exudates are seen, and oropharyngeal involvement
aryepiglottic folds appearing boggy and edematous, can coexist. Biopsy may show epithelial hyperplasia with
often more so than the epiglottis. Unlike in children, a pseudocarcinomatous appearance. Potential complica-
laryngoscopy is usually safe in adults and is the best tions include scarring, airway obstruction, and systemic
means of diagnosis. Possible underlying causes such as a dissemination.
laryngeal foreign body should be considered. Treatment In mild localized disease, topical nystatin or clo-
includes intravenous antibiotics, hydration, humidifica- trimazole are usually e¤ective. Discontinuing antibiotics
tion, and corticosteroids. Close observation is essential or inhaled steroids should be considered. More severe
in case airway support is needed. Rarely, infected mu- cases may require oral antifungal azoles such as keto-
cous retention cysts and epiglottic abscesses occur (Stack conazole, fluconazole, or itraconazole. Intravenous
and Ridley, 1995). Tracheostomy and drainage may be amphotericin is e‰cacious but has potentially severe side
required. e¤ects. It is usually used for invasive or systemic disease.
Less common fungal diseases include blastomycosis,
Mycobacterial Infections. Laryngeal tuberculosis is histoplasmosis, and coccidiomycosis. Infection may be
rare in industrialized countries but must be considered in confused with laryngeal carcinoma, and special histo-
the di¤erential diagnosis of laryngeal disease, especially logical stains are usually required for diagnosis. Long-
in patients with AIDS or other immune deficiencies term treatment with amphotericin B may be necessary.
(Singh et al., 1996). Tuberculosis can infect the larynx
primarily, by direct spread from the lungs, or by hema- Syphilis. Syphilis is caused by the spirochete Trepo-
togenous or lymphatic dissemination (Ramandan, nema pallidum. Laryngeal involvement is rare but may
Tarayi, and Baroudy, 1993). Most patients have hoarse- occur in later stages of the disease. Secondary syphilis
34 Part I: Voice
may present with laryngeal papules, ulcers and edema 50% of cases. Dapsone, corticosteroids, and immuno-
that mimic carcinoma, or tuberculous laryngitis. Ter- suppressive drugs have been used to control the disease.
tiary syphilis may cause gummas, leading to scarring Repeated attacks of laryngeal chondritis can cause sub-
and stenosis (Lacy, Alderson, and Parker, 1994). Sero- glottic scarring, necessitating permanent tracheostomy
logic tests are diagnostic. Active disease is treated with (Spraggs, Tostevin, and Howard, 1997).
penicillin.
Cicatricial Pemphigoid. This chronic subepithelial
Inflammatory Processes bullous disease predominantly involves the mucous
membranes. Acute laryngeal lesions are painful, and
Chronic Laryngitis. Chronic laryngeal inflammation examination shows mucosal erosion and ulceration.
can result from smoking, gastroesophageal reflux Later, scarring and stenosis may occur, with supraglottic
(GER), voice abuse, or allergy. Patients often complain involvement (Hanson, Olsen, and Rogers, 1988). Treat-
of hoarseness, sore throat, a globus sensation, and throat ment includes dapsone, systemic or intralesional ste-
clearing. The vocal folds are usually thickened, dull, and roids, and cyclophosphamide. Scarring may require laser
erythematous. Posterior laryngeal involvement usually excision and sometimes tracheostomy.
suggests GER. Besides direct chemical irritation, GER
can promote laryngeal muscle misuse, which contributes Amyloidosis. Amyloidosis is characterized by deposi-
to wear-and-tear injury (Gill and Morrison, 1998). tion of acellular proteinaceous material (amyloid) in
Although seasonal allergies may cause vocal fold tissues (Lewis et al., 1992). It can occur primarily or
edema and hoarseness (Jackson-Menaldi, Dzul, and secondary to other diseases such as multiple myeloma or
Holland, 1999), it is surprising that allergy-induced tuberculosis. Deposits may be localized or generalized.
chronic laryngitis is not more common. Even patients Laryngeal involvement is usually due to primary local-
with significant nasal allergies or asthma have a low ized disease. Submucosal deposits may a¤ect any part of
incidence of voice problems. The severity of other aller- the larynx but most commonly occur in the ventricular
gic accompaniments helps the clinician identify patients folds. Treatment is by conservative laser excision.
with dysphonia of allergic cause. Recurrences are frequent.
Treatment of chronic laryngitis includes voice rest Sarcoidosis. Sarcoidosis is a multiorgan granulomatous
and elimination of irritants. Dietary modifications and disease of unknown etiology. About 6% of cases involve
postural measures such as elevating the head of the bed the larynx, producing dysphonia and airway obstruction.
can reduce GER. Proton pump inhibitors can be e¤ec- Pale, di¤use swelling of the epiglottis and aryepiglottic
tive for persistent laryngeal symptoms (Hanson, Kamel, folds is characteristic (Benjamin, Dalton, and Crox-
and Kahrilas, 1995). son, 1995). Systemic or intralesional steroids, anti-
lepromatous therapy, and laser debulking are all possible
Traumatic and Iatrogenic Causes. Inflammatory pol- treatments.
yps, polypoid degeneration, and contact granuloma can
arise from vocal trauma. Smoking contributes to poly- Wegener’s Granulomatosis. Wegener’s granulomatosis
poid degeneration, and intubation injury can cause con- is an idiopathic syndrome characterized by vasculitis
tact granulomas. GER may promote inflammation in all and necrotizing granulomas of the respiratory tract and
these conditions. Granulomas can also form many years kidneys. The larynx is involved in 8% of cases (Wax-
after Teflon injection for glottic insu‰ciency. man and Bose, 1986). Ulcerative lesions and subglottic
stenosis may occur, causing hoarseness and dyspnea.
Rheumatoid Arthritis and Systemic Lupus Erythe- Treatment includes corticosteroids and cyclophospha-
matosus. Laryngeal involvement occurs in almost a mide. Laser resection or open surgery is sometimes
third of patients with rheumatoid arthritis (Lofgren and necessary for airway maintenance.
Montgomery, 1962). Patients present with a variety of
symptoms. In the acute phase the larynx may be tender —David P. Lau and Murray D. Morrison
and inflamed. In the chronic phase the laryngeal mucosa
may appear normal, but cricoarytenoid joint ankylosis References
may be present. Submucosal rheumatoid nodules or Amoils, C., and Shindo, M. (1996). Laryngotracheal manifes-
‘‘bamboo nodes’’ can form in the membranous vocal tations of rhinoscleroma. Annals of Otology, Rhinology, and
folds. If the mucosal wave is severely damped, micro- Laryngology, 105, 336–340.
laryngeal excision can improve the voice. Corticosteroids Benjamin, B., Dalton, C., and Croxson, G. (1995). Laryngo-
can be injected intracordally following excision. Other scopic diagnosis of laryngeal sarcoid. Annals of Otology,
autoimmune diseases such as systemic lupus erythe- Rhinology, and Laryngology, 104, 529–531.
matosus can cause similar laryngeal pathology (Woo, Ferlito, A., Weiss, L., Rinaldo, A., et al. (1997). Clinicopatho-
logic consultation: Lymphoepithelial carcinoma of the
Mendelsohn, and Humphrey, 1995). larynx, hypopharynx and trachea. Annals of Otology, Rhi-
nology, and Laryngology, 106, 437–444.
Relapsing Polychondritis. Relapsing polychondritis is Flowers, R., and Kernodle, D. (1990). Vagal mononeuritis
an autoimmune disease causing inflammation of carti- caused by herpes simplex virus: Association with unilateral
laginous structures. The pinna is most commonly af- vocal cord paralysis. American Journal of Medicine, 88,
fected, although laryngeal involvement occurs in around 686–688.
Instrumental Assessment of Children’s Voice 35
Garcia-Milian, R., Hernandez, H., Panade, L., et al. (1998). Further Readings
Detection and typing of human papillomavirus DNA in
benign and malignant tumours of laryngeal epithelium. Badaracco, G., Venuti, A., Morello, R., Muller, A., and Mar-
Acta Oto-Laryngologica, 118, 754–758. cante, M. (2000). Human papillomavirus in head and neck
Gill, C., and Morrison, M. (1998). Esophagolaryngeal reflux in carcinomas: Prevalence, physical status and relationship
a porcine animal model. Journal of Otolaryngology, 27, 76– with clinical/pathological parameters. Anticancer Research,
80. 20, 1305.
Hanson, D., Kamel, P., and Kahrilas, P. (1995). Outcomes of Cleary, K., and Batsakis, J. (1995). Mycobacterial disease of
antireflux therapy for the treatment of chronic laryngitis. the head and neck: Current perspective. Annals of Otology,
Annals of Otology, Rhinology, and Laryngology, 104, 550– Rhinology, and Laryngology, 104, 830–833.
555. Herridge, M., Pearson, F., and Downey, G. (1996). Subglottic
Hanson, R., Olsen, K., and Rogers, R. (1988). Upper aero- stenosis complicating Wegener’s granulomatosis: Surgical
digestive tract manifestations of cicatricial pemphigoid. repair as a viable treatment option. Journal of Thoracic and
Annals of Otology, Rhinology, and Laryngology, 97, 493–499. Cardiovascular Surgery, 111, 961–966.
Jackson-Menaldi, C., Dzul, A., and Holland, R. (1999). Aller- Jones, K. (1998). Infections and manifestations of systemic
gies and vocal fold edema: A preliminary report. Journal of disease in the larynx. In C. W. Cummings, J. M. Fre-
Voice, 13, 113–122. drickson, L. A. Harker, C. J. Krause, D. E. Schuller, and
Lacy, P., Alderson, D., and Parker, A. (1994). Late congenital M. A. Richardson (Eds.), Otolaryngology—head and neck
syphilis of the larynx and pharynx presenting at endo- surgery (3rd ed., pp. 1979–1988). St Louis: Mosby.
tracheal intubation. Journal of Laryngology and Otology, Langford, C., and Van Waes, C. (1997). Upper airway ob-
108, 688–689. struction in the rheumatic diseases. Rheumatic Diseases
Levi, J., Delcelo, R., Alberti, V., Torloni, H., and Villa, L. Clinics of North America, 23, 345–363.
(1989). Human papillomavirus DNA in respiratory papil- Morrison, M., Rammage, L., Nichol, H., Pullan, B., May, P.,
lomatosis detected by in situ hybridization and polymerase and Salkeld, L. (2001). Management of the voice and its dis-
chain reaction. American Journal of Pathology, 135, 1179– orders (2nd ed.). San Diego, CA: Singular Publishing Group.
1184. Raymond, A., Sneige, N., and Batsakis, J. (1992). Amyloidosis
Lewis, J., Olsen, K., Kurtin, P., and Kyle, R. (1992). Laryngeal in the upper aerodigestive tracts. Annals of Otology, Rhi-
amyloidosis: A clinicopathologic and immunohistochemical nology, and Laryngology, 101, 794–796.
review. Otolaryngology–Head and Neck Surgery, 106, 372– Richter, B., Fradis, M., Kohler, G., and Ridder, G. (2001).
377. Epiglottic tuberculosis: Di¤erential diagnosis and treat-
Lofgren, R., and Montgomery, W. (1962). Incidence of laryn- ment. Case report and review of the literature. Annals of
geal involvement in rheumatoid arthritis. New England Otology, Rhinology, and Laryngology, 110, 197–201.
Journal of Medicine, 267, 193. Ridder, G., Strohhacker, H., Lohle, E., Golz, A., and Fradis,
Nelson, E., and Tybor, A. (1992). Actinomycosis of the larynx. M. (2000). Laryngeal sarcoidosis: Treatment with the anti-
Ear, Nose, and Throat Journal, 71, 356–358. leprosy drug clofazimine. Annals of Otology, Rhinology, and
Nishizaki, K., Onada, K., Akagi, H., Yuen, K., Ogawa, T., Laryngology, 109, 1146–1149.
and Masuda, Y. (1997). Laryngeal zoster with unilateral Satalo¤, R. (1997). Common infections and inflammations and
laryngeal paralysis. ORL: Journal for Oto-rhino-laryngology other conditions. In R. T. Satalo¤ (Ed.), Professional voice:
and Its Related Specialties, 59, 235–237. The science and art of clinical care (2nd ed., pp. 429–436).
Pou, A., Vrabec, J., Jordan, J., Wilson, D., Wang, S., and San Diego, CA: Singular Publishing Group.
Payne, D. (2000). Prevalence of herpes simplex virus in Tami, T., Ferlito, A., Rinaldo, A., Lee, K., and Singh, B.
malignant laryngeal lesions. Laryngoscope, 110, 194–197. (1999). Laryngeal pathology in the acquired immunodefi-
Ramandan, H., Tarayi, A., and Baroudy, F. (1993). Laryngeal ciency syndrome: Diagnostic and therapeutic dilemmas.
tuberculosis: Presentation of 16 cases and review of the lit- Annals of Otology, Rhinology, and Laryngology, 108, 214–
erature. Journal of Otolaryngology, 22, 39–41. 220.
Robinson, L., Hillel, A., and Waugh, P. (1998). New laryngeal Thompson, L. (2001). Pathology of the larynx, hypopharynx
muscle weakness in post-polio syndrome. Laryngoscope, and tarachea. In Y. Fu, B. M. Wenig, E. Abemayor, and
108, 732–734. B. L. Wenig (Eds.), Head and neck pathology with clini-
Singh, B., Balwally, A., Nash, M., Har-El, G., and Lucente, F. cal correlations (pp. 369–454). New York: Churchill
(1996). Laryngeal tuberculosis in HIV-infected patients: A Livingstone.
di‰cult diagnosis. Laryngoscope, 106, 1238–1240. Vrabec, J., Molina, C., and West, B. (2000). Herpes simplex
Soni, N. (1992). Leprosy of the larynx. Journal of Laryngology viral laryngitis. Annals of Otology, Rhinology, and Laryn-
and Otology, 106, 518–520. gology, 109, 611–614.
Spraggs, P., Tostevin, P., and Howard, D. (1997). Manage-
ment of laryngotracheobronchial sequelae and complica-
tions of relapsing polychondritis. Laryngoscope, 107, 936–
941. Instrumental Assessment of
Stack, B., and Ridley, M. (1995). Epiglottic abscess. Head and
Neck, 17, 263–265. Children’s Voice
Vrabec, D. (1993). Fungal infections of the larynx. Otolaryn-
gologic Clinics of North America, 26, 1091–1114.
Waxman, J., and Bose, W. (1986). Laryngeal manifestations of
Disorders of voice may a¤ect up to 5% of children, and
Wegener’s granulomatosis: Case reports and review of the instrumental procedures such as acoustics, aerody-
literature. Journal of Rheumatology, 13, 408–411. namics, or electroglottography (EGG) may complement
Woo, P., Mendelsohn, J., and Humphrey, D. (1995). Rheu- auditory-perceptual and imaging procedures by provid-
matoid nodules of the larynx. Otolaryngology–Head and ing objective measures that help in determining the
Neck Surgery, 113, 147–150. nature and severity of laryngeal pathology. The use of
36 Part I: Voice
these procedures should take into account the devel- Sex di¤erences in f0 emerge especially strongly during
opmental features of the larynx and special problems adolescence. The overall f0 decline from infancy to
associated with a pediatric population. adulthood is about one octave for girls and two octaves
An important starting point is the developmental for boys. There is some question as to when the sex
anatomy and physiology of the larynx. This background di¤erence emerges. Lee et al. (1999) observed that f0
is essential in understanding children’s vocal function as di¤erences between male and female children were sta-
determined by instrumental assessments. The larynx of tistically significant beginning at about age 12 years, but
the infant and young child di¤ers considerably in its Glaze et al. (1988) observed di¤erences between boys
anatomy and physiology from the adult larynx (see and girls for the age period 5–11 years. Further, Hacki
anatomy of the human larynx). The vocal folds in an and Heitmuller (1999) reported a lowering of both the
infant are about 3–5 mm long, and the composition of habitual pitch and the entire speaking pitch range be-
the folds is uniform. That is, the infant’s vocal folds are tween the ages of 7 and 8 years for girls and between the
not only very short compared with those of the adult, ages of 8 and 9 years for boys. Sex di¤erences emerge
but they lack the lamination seen in the adult folds. The strongly with the onset of mutation. Hacki and Heit-
lamination has been central to modern theories of pho- muller (1999) concluded that the beginning of the muta-
nation, and its absence in infants and marginal develop- tion occurs at age 10–11 years. Mean f0 change is
ment in young children presents interesting challenges to pronounced in males between the ages of about 12 and
theories of phonation applied to a pediatric population. 15 years. For example, Lee et al. (1999) reported a 78%
An early stage of development of the lamina propria decrease in f0 for males between these ages. No signifi-
begins between 1 and 4 years, with the appearance of the cant change was observed after the age of 15 years,
vocal ligament (intermediate and deep layers of the which indicates that the voice change is e¤ectively com-
lamina propria). During this same interval, the length of plete by that age (Hollien, Green, and Massey, 1994;
the vocal fold increases (reaching about 7.5 mm by age Kent and Vorperian, 1995).
5) and the entire laryngeal framework increases in size. Other acoustic aspects of children’s voices have not
The di¤erentiation of the superficial layer of the lamina been extensively studied. In apparently the only large-
propria apparently is not complete until at least the age scale study of its kind, Campisi et al. (2002) provided
of 12 years. normative data for children for the parameters of
Studies on the time of first appearance of sexual the Multi-Dimensional Voice Program (MDVP). On the
dimorphism in laryngeal size are conflicting, ranging majority of parameters (excluding, of course, f0 ), the
from 3 years to no sex di¤erences in laryngeal size ob- mean values for children were fairly consistent with
servable during early childhood. Sexual dimorphism of those for adults, which simplifies the clinical application
vocal fold length has been reported to appear at about of MDVP. However, this conclusion does not apply to
age 6–7 years. These reported anatomical di¤erences the pubescent period, during which variability in ampli-
do not appear to contribute to significant di¤erences in tude and fundamental frequency increases in both girls
vocal fundamental frequency ( f0 ) between males and and boys, but markedly so in the latter (Boltezar, Bur-
females until puberty, at which time laryngeal growth ger, and Zargi, 1997). It should also be noted voice
is remarkable, especially in boys. For example, in boys, training can a¤ect the degree of aperiodicity in children’s
the anteroposterior dimension of the thyroid cartilage voices (Dejonckere et al., 1996) (see acoustic assess-
increases threefold, along with increases in vocal fold ment of voice).
length.
Aerodynamic Studies of Children’s Voice. There are
Acoustic Studies of Children’s Voice. Mean f0 has been only limited data describing developmental patterns
one of the most thoroughly studied aspects of the pedi- in voice aerodynamics. Table 1 shows normative data
atric voice. For infants’ nondistress utterances, such as for flow, pressure, and laryngeal airway resistance from
cooing and babbling, mean f0 falls in the range of 300– three sources (Netsell et al., 1994; Keilman and Bader,
600 Hz and appears to be stable until about 9 months, 1995; Zajac, 1995, 1998). All of the data were collected
when it begins to decline until adulthood (Kent and during the production of /pi/ syllable trains, following
Read, 2002). A relatively sharp decline occurs between the procedure first described by Smitheran and Hixon
the ages of 12 months and 3 years, so that by the age of 3 (1981). Flow appears to increase with age, ranging from
years, the mean f0 in both males and females is about 75–79 mL/s in children aged 3–5 years to 127–188 mL/s
250 Hz. Mean f0 is stable or gradually falling between 6 in adults. Pressure decreases slightly with age, ranging
and 11 years, and the value of 250 Hz may be taken as from 8.4 cm H2 O in children ages 3–5 years to 5.3–6.0
a reasonable estimate of f0 in both boys and girls. Some cm H2 O in adults. Laryngeal airway pressure decreases
studies report no significant change in f0 during this with age, ranging from 111–119 cm H2 O/L/s in children
developmental period, but Glaze et al. (1988) reported aged 3–6 years to 34–43 cm H2 O/L/s in adults. This
that f0 decreased with increasing age, height, and weight decrease in laryngeal airway pressure occurs as a func-
for boys and girls ages 5–11 years, and Ferrand and tion of the rate of flow increase exceeding the rate of
Bloom (1996) observed a decrease in the mean, maxi- pressure decrease across the age range.
mum, and range of f0 in boys, but not in girls, at about Netsell et al. (1994) explained the developmental
7–8 years of age. changes in flow, pressure, and laryngeal airway pressure
Instrumental Assessment of Children’s Voice 37
Table 1. Aerodynamic normative data from three sources: N (Netsell et al., 1994), K (Keilman & Bader, 1995), and Z (Zajac,
1995, 1998). All data were collected using the methodology described by Smitheran and Hixon (1981). Values shown are means,
with standard deviations in parentheses
Age Flow Pressure LAR
Reference (yr) Sex N (mL/s) (cm H2 O) (cm H2 O/L/s)
N 3–5 F 10 79 (16) 8.4 (1.3) 111 (26)
N 3–5 M 10 75 (20) 8.4 (1.4) 119 (20)
K 4–7 F&M 7.46 (2.26)
N 6–9 F 10 86 (19) 7.4 (1.5) 89 (25)
N 6–9 M 9 101 (42) 8.3 (2.0) 97 (39)
Z 7–11 F&M 10 123 (30) 11.4 (2.3) 95.3 (24.4)
K 8–12 F&M 6.81 (2.29)
N 9–12 F 10 121 (21) 7.1 (1.2) 59 (7)
N 9–12 M 10 115 (42) 7.9 (1.3) 77 (23)
K 13–15 F&M 5.97 (2.07)
K 4–15 F&M 100 50–150 87.82 (62.95)
N Adult F 10 127 (29) 5.3 (1.2) 43 (10)
N Adult M 10 188 (51) 6.0 (1.4) 34 (9)
F ¼ female, M ¼ male, N ¼ number of participants, LAR ¼ laryngeal airway resistance.
as secondary to an increasing airway size and decreasing that instrumental procedures can be used successfully
dependence on expiratory muscle forces alone for speech with children as young as 3 years of age. Therefore, these
breathing with age. No consistent di¤erences in aero- procedures may play a valuable role in the objective as-
dynamic parameters were observed between female and sessment of voice in children.
male children. High standard deviations reflect consid- See also voice disorders in children.
erable variation between children of similar ages (see
—Ray D. Kent and Nathan V. Welham
aerodynamic assessment of voice).
Electroglottographic Studies of Children’s Voice. Al- References

though EGG data on children’s voice are not abun-
dant, one study provides normative data on a sample Boltezar, I. H., Burger, Z. R., and Zargi, M. (1997). Instability
of 164 children, 79 girls and 85 boys, ages 3–16 years of voice in adolescence: Pathologic condition or normal
(Cheyne, Nuss, and Hillman, 1999). Cheyne et al. developmental variation? Journal of Pediatrics, 130, 185–
190.
reported no significant e¤ect of age on the EGG mea-
Campisi, P., Tewfik, T. L., Manoukian, J. J., Schloss, M. D.,
sures of jitter, open quotient, closing quotient, and Pelland-Blais, E., and Sadeghi, N. (2002). Computer-
opening quotient. The means and standard deviations assisted voice analysis: Establishing a pediatric database.
(in parentheses) for these measures were as follows: jit- Archives of Otolaryngology–Head and Neck Surgery, 128,
ter—0.76% (0.61), open quotient—54.8% (3.3), closing 156–160.
quotient—14.1% (3.8), and opening quotient—31.1% Cheyne, H. A., Nuss, R. C., and Hillman, R. E. (1999). Elec-
(4.1). These values are reasonably similar to values troglottography in the pediatric population. Archives of
reported for adults, although caution should be observed Otolaryngology–Head and Neck Surgery, 125, 1105–1108.
because of di¤erences in procedures across studies Dejonckere, P. H. (1999). Voice problems in children: Patho-
(Takahashi and Koike, 1975) (see electroglotto- genesis and diagnosis. International Journal of Pediatric
Otorhinolaryngology, 49(Suppl. 1), S311–S314.
graphic assessment of voice). Dejonckere, P. H., Wieneke, G. H., Bloemenkamp, D., and
One of the most striking features of the instrumental Lebacq, J. (1996). F0 -perturbation and f0 -loudness dynam-
studies of children’s voice is that, except for f0 and the ics in voices of normal children, with and without education
aerodynamic measures, the values obtained from instru- in singing. International Journal of Pediatric Otorhino-
mental procedures change relatively little from child- laryngology, 35, 107–115.
hood to adulthood. This stability is remarkable in view Ferrand, C. T., and Bloom, R. L. (1996). Gender di¤erences in
of the major changes that are observed in laryngeal children’s intonational patterns. Journal of Voice, 10, 284–
anatomy and physiology. Apparently, children are able 291.
to maintain normal voice quality in the face of consid- Glaze, L. E., Bless, D. M., Milenkovic, P., and Susser, R.
erable alteration in the apparatus of voice production. (1988). Acoustic characteristics of children’s voice. Journal
of Voice, 2, 312–319.
With the mutation, however, stability is challenged, and
Hacki, T., and Heitmuller, S. (1999). Development of the
the suitability of published normative data is open to child’s voice: Premutation, mutation. International Journal
question. The maintenance of rather stable values across of Pediatric Otorhinolaryngology, 49(Suppl. 1), S141–S144.
a substantial period of childhood (from about 5 to 12 Hollien, H., Green, R., and Massey, K. (1994). Longitudinal
years) for many acoustic and EGG parameters holds a research on adolescent voice change in males. Journal of the
distinct advantage for clinical application. It is also clear Acoustical Society of America, 96, 2646–2654.
38 Part I: Voice
Keilman, A., and Bader, C. (1995). Development of aerody- isolation. A 5-Hz tremor can be heard on prolonged
namic aspects in children’s voice. International Journal of vowels, owing to intensity and frequency modulation.
Pediatric Otorhinolaryngology, 31, 183–190. Tremor can a¤ect either or both the adductor or abduc-
Kent, R. D., and Read, C. (2002). The acoustic analysis tor muscles, producing voice breaks in vowels or breathy
of speech (2nd ed.). San Diego, CA: Singular/Thompson
intervals in the abductor type. Voice tremor occurs more
Learning.
Kent, R. D., and Vorperian, H. K. (1995). Anatomic develop- often in women, sometimes with an associated head
ment of the craniofacial-oral-laryngeal systems: A review. tremor. A variety of muscles may be involved in voice
Journal of Medical Speech-Language Pathology, 3, 145–190. tremor (Koda and Ludlow, 1992).
Lee, S., Potamianos, A., and Narayanan, S. (1999). Acoustics Intermittent voice breaks are specific to the spasmodic
of children’s speech: Developmental changes of temporal dysphonias, either prolonged glottal stops and intermit-
and spectral parameters. Journal of the Acoustical Society of tent intervals of a strained or strangled voice quality
America, 105, 1455–1468. during vowels in ADSD or prolonged voiceless con-
Netsell, R., Lotz, W. K., Peters, J. E., and Schulte, L. (1994). sonants (p, t, k, f, s, h), which are perceived as breathy
Developmental patterns of laryngeal and respiratory func- breaks, in ABSD. Other idiopathic voice disorders, such
tion for speech production. Journal of Voice, 8, 123–131.
as muscular tension dysphonia, do not involve intermit-
Smitheran, J., and Hixon, T. (1981). A clinical method for es-
timating laryngeal airway resistance during vowel produc- tent spasmodic changes in the voice. Rather, consistent
tion. Journal of Speech and Hearing Disorders, 46, 138–146. abnormal hypertense laryngeal postures are maintained
Takahashi, H., and Koike, Y. (1975). Some perceptual dimen- during voice production. Such persons may respond to
sions and acoustical correlates of pathological voices. Acta manual laryngeal manipulation (Roy, Ford, and Bless,
Otolaryngolica Supplement (Stockholm), 338, 1–24. 1996). Muscular tension dysphonia may be confused
Zajac, D. J. (1995). Laryngeal airway resistance in children with spasmodic dysphonia when ADSD patients develop
with cleft palate and adequate velopharyngeal function. increased muscle tension in an e¤ort to overcome vocal
Cleft Palate–Craniofacial Journal, 32, 138–144. instability, resulting in symptoms of both disorders.
Zajac, D. J. (1998). E¤ects of a pressure target on laryngeal Some patients with voice tremor may also develop mus-
airway resistance in children. Journal of Communication
cular tension dysphonia in an e¤ort to overcome vocal
Disorders, 31, 212–213.
instability.
Paradoxical breathing dystonia is rare, with adduc-
tory movements of the vocal folds during inspiration
Laryngeal Movement Disorders: that remit during sleep (Marion et al., 1992). It di¤ers
Treatment with Botulinum Toxin from vocal fold dysfunction, which is usually intermit-
tent and often coincides with irritants a¤ecting the upper
airway (Christopher et al., 1983; Morrison, Rammage,
The laryngeal dystonias include spasmodic dysphonia, and Emami, 1999).
tremor, and paradoxical breathing dystonia. All of these Botulinum toxin type A (BTX-A) is e¤ective in
conditions are idiopathic and all have distinctive symp- treating a myriad of hyperkinetic disorders by partially
toms, which form the basis for diagnosis. In adductor denervating the muscle. The toxin is injected into mus-
spasmodic dysphonia (ADSD), voice breaks during cle, di¤uses, and is endocytosed into nerve endings. The
vowels are associated with involuntary spasmodic mus- toxin cleaves SNAP 25, a vesicle-docking protein essen-
cle bursts in the thyroarytenoid and other adductor la- tial for acetylcholine release into the neuromuscular
ryngeal muscles, although bursts can also occur in the junction (Aoki, 2001). When acetylcholine release is
cricothyroid muscle in some persons (Nash and Ludlow, blocked, the muscle fibers become temporarily dener-
1996). When voice breaks are absent, however, muscle vated. The e¤ect is reversible: within a few weeks
activation is normal in both adductor and abductor la- new nerve endings sprout, which may provide synaptic
ryngeal muscles (Van Pelt, Ludlow, and Smith, 1994). In transmission and some reduction in muscle weakness.
the abductor type of spasmodic dysphonia (ABSD), These nerve endings are later replaced by restitution of
breathy breaks are due to prolonged vocal fold open- the original end-plates (de Paiva et al., 1999). In ADSD,
ing during voiceless consonants. The posterior cricoary- BTX-A injection, either small bilateral injections or a
tenoid muscle is often involved in ABSD, although not unilateral injection produces a partial chemodenerva-
in all patients (Cyrus et al., 2001). In the 1980s, ‘‘spastic’’ tion of the thyroarytenoid muscle for up to 4 months.
dysphonia was renamed ‘‘spasmodic’’ dysphonia to de- Reductions in spasmodic muscle bursts relate to voice
note the intermittent aspect of the voice breaks and was improvement (Bielamowicz and Ludlow, 2000). This
classified as a task-specific focal laryngeal dystonia therapy is e¤ective in least 90% of ADSD patients, as
(Blitzer and Brin, 1991). Abnormalities in laryngeal has been demonstrated in a small randomized con-
adductor responses to sensory stimulation are found in trolled trial (Truong et al., 1991) and in multiple case
both ADSD and ABSD (Deleyiannis et al., 1999), indi- series (Ludlow et al., 1988; Blitzer and Brin, 1991; Blit-
cating a reduction in the normal central suppression zer, Brin, and Stewart, 1998). BTX-A is less e¤ective in
of laryngeal sensorimotor responses in these disorders. ABSD. When only ABSD patients with cricothyroid
ADSD a¤ects 85% of patients with spasmodic dyspho- muscle spasms are injected in that muscle, significant
nia; the other 15% have ABSD. improvements occur in 60% of cases (Ludlow et al.,
Vocal tremor is present in at least one-third of 1991). Similarly, two-thirds of ABSD patients obtain
patients with ADSD or ABSD and can also occur in some degree of benefit from posterior cricoarytenoid
Laryngeal Movement Disorders: Treatment with Botulinum Toxin 39
injections (Blitzer et al., 1992). When speech symptoms than in others. In all cases BTX-A causes partial dener-
were measured in blinded fashion before and after teat- vation, which reduces the force that can be exerted by
ment, BTX-A was less e¤ective in ABSD (Bielamowicz a muscle following injection. In ADSD, then, the force-
et al., 2001) than in ADSD (Ludlow et al., 1988). fulness of vocal fold hyperadduction is reduced and
Patients with adductor tremor confined to the vocal patients are less able to produce voice breaks even
folds often receive some benefit from thyroarytenoid if muscle spasms continue to occur. Central control
muscle BTX-A injections. When objective measures changes also appear to occur, however, in persons with
were used (Warrick et al., 2000), BTX-A injection was ADSD following BTX-A injection. When thyroary-
beneficial in 50% of patients with voice tremors. Either tenoid muscle injections were unilateral, spasmodic
unilateral or bilateral thyroarytenoid injections can be bursts were significantly reduced on both sides of the
used, although larger doses are sometimes more e¤ec- larynx, and there were also reductions in overall levels of
tive. BTX-A is much less e¤ective, however, for treating muscle tone (measured in microvolts) and maximum
tremor than it is in ADSD (Warrick et al., 2000), and it activity levels (Bielamowicz and Ludlow, 2000). Such
is rarely helpful in patients with abductor tremor. reductions in muscle activity and spasms may be the
BTX-A administered as either unilateral or bilateral result of reductions in muscle spindle and mechano-
injections into the thyroarytenoid muscle has been receptor feedback, resulting in lower motor neuron
used successfully to treat paradoxical breathing dystonia pool activity for all the laryngeal muscles. The physio-
(Marion et al., 1992; Grillone et al., 1994). logical e¤ects of BTX-A may be greater on the fusimo-
Changes in laryngeal function following BTX-A in- tor system than on muscle fiber innervation (On et al.,
jection in persons with ADSD are similar, whether the 1999). Although only one portion of the human thyro-
injection was unilateral or bilateral. A few persons re- arytenoid muscle may contain muscle spindles (Sanders
port a sense of reduction in laryngeal tension within 8 et al., 1998), mucosal mechanoreceptor feedback will
hours following injection, although voice loudness is also change with reductions in adductory force between
not yet reduced. Voice loudness and breaks gradually the vocal folds following BTX-A injection. Perhaps
diminish as BTX-A di¤uses through the muscle, causing changes in sensory feedback account for the longer pe-
progressive denervation. Most people report that bene- riod of benefit in ADSD than in other laryngeal move-
fits become apparent the second day, while the side ment disorders, although the duration of side e¤ects is
e¤ects of progressive breathiness and swallowing di‰- similar in all disorders. Future approaches to altering
culties increase over the 3–5 days after injection. Di‰- sensory feedback may also have a role in the treatment
culty swallowing liquids may occur and occasionally of laryngeal dystonia, in addition to e¤erent denervation
results in aspiration. Patients are advised to ingest liq- by BTX-A.
uids slowly and in small volumes, by sipping through
a straw. The di‰culties with swallowing gradually sub- —Christy L. Ludlow
side between the first and second weeks after an injection
(Ludlow, Rhew, and Nash, 1994), possibly as patients References
learn to compensate. The breathiness resolves some- Aoki, K. R. (2001). Pharmacology and immunology of botu-
what later, reaching normal loudness levels as late as 3–4 linum toxin serotypes. Journal of Neurology, 248(Suppl. 1),
weeks after injection, during the period when axonal 3–10.
sprouting may occur (de Paiva et al., 1999). Because Bielamowicz, S., and Ludlow, C. L. (2000). E¤ects of botu-
improvements in voice volume seem independent of re- linum toxin on pathophysiology in spasmodic dysphonia.
covery of swallowing, di¤erent mechanisms may under- Annals of Otology, Rhinology, and Laryngology, 109, 194–
lie recovery of these functions. 203.
The benefit is greatest between 1 and 3 months after Bielamowicz, S., Squire, S., Bidus, K., and Ludlow, C. L.
injection, when the patient’s voice is close to normal (2001). Assessment of posterior cricoarytenoid botulinum
volume, voice breaks are significantly reduced, and toxin injections in patients with abductor spasmodic dys-
phonia. Annals of Otology, Rhinology, and Laryngology,
speech is more fluent, with reduced hoarseness (Ludlow 110, 406–412.
et al., 1988). This benefit period di¤ers among the dis- Blitzer, A., and Brin, M. F. (1991). Laryngeal dystonia: A
orders, lasting from 3 to 5 months in ADSD but from series with botulinum toxin therapy. Annals of Otology,
1 to 3 months in other disorders such as ABSD and Rhinology, and Laryngology, 100, 85–89.
tremor. Blitzer, A., Brin, M. F., and Stewart, C. F. (1998). Botulinum
The return of symptoms in ADSD is gradual, usually toxin management of spasmodic dysphonia (laryngeal dys-
occurring over a period of about 2 months during end- tonia): A 12-year experience in more than 900 patients.
plate reinnervation. To maintain symptom control, most Laryngoscope, 108, 1435–1441.
patients return for injection about 3 months before the Blitzer, A., Brin, M., Stewart, C., Aviv, J. E., and Fahn, S.
full return of symptoms. Some individuals, however, (1992). Abductor laryngeal dystonia: A series treated with
botulinum toxin. Laryngoscope, 102, 163–167.
maintain benefit for more than a year following injec- Christopher, K. L., Wood, R., Eckert, R. C., Blager, F. B.,
tion, returning 2 or more years later for reinjection. Raney, R. A., and Souhrada, J. F. (1983). Vocal-cord dys-
The mechanisms responsible for benefit from BTX- function presenting as asthma. New England Journal of
A in laryngeal dystonia likely di¤er with the di¤erent Medicine, 306, 1566–1570.
pathophysiologies: although BTX-A is beneficial in Cyrus, C. B., Bielamowicz, S., Evans, F. J., and Ludlow,
many hyperkinetic disorders, it is more e¤ective in some C. L. (2001). Adductor muscle activity abnormalities in
40 Part I: Voice
abductor spasmodic dysphonia. Otolaryngology–Head and Further Readings

Neck Surgery, 124, 23–30.
de Paiva, A., Meunier, F. A., Molgo, J., Aoki, K. R., and Barkmeier, J. M., Case, J. L., and Ludlow, C. L. (2001). Iden-
Dolly, J. O. (1999). Functional repair of motor endplates tification of symptoms for spasmodic dysphonia and vocal
after botulinum neurotoxin type A poisoning: Biphasic tremor: A comparison of expert and nonexpert judges.
switch of synaptic activity between nerve sprouts and their Journal of Communication Disorders, 34, 21–37.
parent terminals. Proceedings of the National Academy of Blitzer, A., Lovelace, R. E., Brin, M. F., Fahn, S., and Fink,
Sciences of the United States of America, 96, 3200–3205. M. E. (1985). Electromyographic findings in focal laryngeal
Deleyiannis, F. W., Gillespie, M., Bielamowicz, S., Yamashita, dystonia (spastic dysphonia). Annals of Otology, Rhinology,
T., and Ludlow, C. L. (1999). Laryngeal long latency re- and Laryngology, 94, 591–594.
sponse conditioning in abductor spasmodic dysphonia. Braun, N., Abd, A., Baer, J., Blitzer, A., Stewart, C., and Brin,
Annals of Otology, Rhinology, and Laryngology, 108, 612– M. (1995). Dyspnea in dystonia: A functional evaluation.
619. Chest, 107, 1309–1316.
Grillone, G. A., Blitzer, A., Brin, M. F., Annino, D. J., and Brin, M. F., Blitzer, A., and Stewart, C. (1998). Laryngeal
Saint-Hilaire, M. H. (1994). Treatment of adductor laryn- dystonia (spasmodic dysphonia): Observations of 901
geal breathing dystonia with botulinum toxin type A. La- patients and treatment with botulinum toxin. Advances in
ryngoscope, 24, 30. Neurology, 78, 237–252.
Koda, J., and Ludlow, C. L. (1992). An evaluation of laryngeal Brin, M. F., Stewart, C., Blitzer, A., and Diamond, B. (1994).
muscle activation in patients with voice tremor. Otolaryn- Laryngeal botulinum toxin injections for disabling stutter-
gology–Head and Neck Surgery, 107, 684–696. ing in adults. Neurology, 44, 2262–2266.
Ludlow, C. L., Naunton, R. F., Sedory, S. E., Schulz, G. M., Cohen, L. G., Ludlow, C. L., Warden, M., Estegui, M. D.,
and Hallett, M. (1988). E¤ects of botulinum toxin injections Agostino, R., Sedory, S. E., et al. (1989). Blink reflex curves
on speech in adductor spasmodic dysphonia. Neurology, 38, in patients with spasmodic dysphonia. Neurology, 39, 572–
1220–1225. 577.
Ludlow, C. L., Naunton, R. F., Terada, S., and Anderson, Davidson, B., and Ludlow, C. L. (1996). Long term e¤ects of
B. J. (1991). Successful treatment of selected cases of abduc- botulinum toxin injections in spasmodic dysphonia. Oto-
tor spasmodic dysphonia using botulinum toxin injection. laryngology–Head and Neck Surgery, 105, 33–42.
Otolaryngology–Head and Neck Surgery, 104, 849–855. Jankovic, J. (1987). Botulinum A toxin for cranial-cervical
Ludlow, C. L., Rhew, K., and Nash, E. A. (1994). Botulinum dystonia: A double-blind, placebo-controlled study. Neu-
toxin injection for adductor spasmodic dysphonia. In J. rology, 37, 616–623.
Jankovic and M. Hallett (Eds.), Therapy with botulinum Jankovic, J. (1988). Cranial-cervical dyskinesias: An overview.
toxin (pp. 437–450). New York: Marcel Dekker. Advances in Neurology, 49, 1–13.
Marion, M. H., Klap, P., Perrin, A., and Cohen, M. (1992). Jankovic, J., and Linden, C. V. (1988). Dystonia and tremor
Stridor and focal laryngeal dystonia. Lancet, 339, 457– induced by peripheral trauma: Predisposing factors. Journal
458. of Neurology, Neurosurgery, and Psychiatry, 51, 1512–
Morrison, M., Rammage, L., and Emami, A. J. (1999). The 1519.
irritable larynx syndrome. Journal of Voice, 13, 447–455. Lange, D. J., Rubin, M., Greene, P. E., Kang, U. J., Mosko-
Nash, E. A., and Ludlow, C. L. (1996). Laryngeal muscle witz, C. B., Brin, M. F., et al. (1991). Distant e¤ects of
activity during speech breaks in adductor spasmodic dys- locally injected botulinum toxin: A double-blind study of
phonia. Laryngoscope, 106, 484–489. single fiber EMG changes. Muscle and Nerve, 14, 672–675.
On, A. Y., Kirazli, Y., Kismali, B., and Aksit, R. (1999). Ludlow, C. L. (1990). Treatment of speech and voice disorders
Mechanisms of action of phenol block and botulinus toxin with botulinum toxin. Journal of the American Medical As-
type A in relieving spasticity: Electrophysiologic investiga- sociation, 264, 2671–2675.
tion and follow-up. American Journal of Physical Medicine Ludlow, C. L. (1995). Pathophysiology of the spasmodic dys-
and Rehabilitation, 78, 344–349. phonias. In F. Bell-Berti and L. J. Rapheal (Eds.), Pro-
Roy, N., Ford, C. N., and Bless, D. M. (1996). Muscle tension ducing speech: Contemporary issues for Katherine Sa¤ord
dysphonia and spasmodic dysphonia: The role of manual Harris (pp. 291–308). Woodbury, NY: American Institute
laryngeal tension reduction in diagnosis and management. of Physics.
Annals of Otology, Rhinology, and Laryngology, 105, 851– Ludlow, C. L., VanPelt, F., and Koda, J. (1992). Character-
856. istics of late responses to superior laryngeal nerve stimu-
Sanders, I., Han, Y., Wang, J., and Biller, H. (1998). Muscle lation in humans. Annals of Otology, Rhinology, and
spindles are concentrated in the superior vocalis subcom- Laryngology, 101, 127–134.
partment of the human thyroarytenoid muscle. Journal of Miller, R. H., Woodson, G. E., and Jankovic, J. (1987). Botu-
Voice, 12, 7–16. linum toxin injection of the vocal fold for spasmodic dys-
Truong, D. D., Rontal, M., Rolnick, M., Aronson, A. E., and phonia. Archives of Otolaryngology–Head and Neck
Mistura, K. (1991). Double-blind controlled study of botu- Surgery, 113, 603–605.
linum toxin in adductor spasmodic dysphonia. Laryngo- Murry, T., Cannito, M. P., and Woodson, G. E. (1994). Spas-
scope, 101, 630–634. modic dysphonia: Emotional status and botulinum toxin
Van Pelt, F., Ludlow, C. L., and Smith, P. J. (1994). Compar- treatment. Archives of Otolaryngology–Head and Neck
ison of muscle activation patterns in adductor and abductor Surgery, 120, 310–316.
spasmodic dysphonia. Annals of Otology, Rhinology, and Rhew, K., Fiedler, D., and Ludlow, C. L. (1994). Endoscopic
Laryngology, 103, 192–200. technique for injection of botulinum toxin through the
Warrick, P., Dromey, C., Irish, J. C., Durkin, L., Pakiam, A., flexible nasolaryngoscope. Otolaryngology–Head and Neck
and Lang, A. (2000). Botulinum toxin for essential tremor Surgery, 111, 787–794.
of the voice with multiple anatomical sites of tremor: A Rosenbaum, F., and Jankovic, J. (1988). Task-specific focal
crossover design study of unilateral versus bilateral injec- tremor and dystonia: Categorization of occupational
tion. Laryngoscope, 110, 1366–1374. movement disorders. Neurology, 38, 522–527.
Laryngeal Reinnervation Procedures 41
Sapienza, C. M., Walton, S., and Murry, T. (2000). Adductor

spasmodic dysphonia and muscular tension dysphonia:
Acoustic analysis of sustained phonation and reading.
Journal of Voice, 14, 502–520.
Sedory-Holzer, S. E., and Ludlow, C. L. (1996). The swallow-
ing side e¤ects of botulinum toxin type A injection in spas-
modic dysphonia. Laryngoscope, 106, 86–92.
Stager, S. V., and Ludlow, C. L. (1994). Responses of stutterers
and vocal tremor patients to treatment with botulinum
toxin. In J. Jankovic and M. Hallett (Eds.), Therapy with
botulinum toxin (pp. 481–490). New York: Marcel Dekker.
Stewart, C. F., Allen, E. L., Tureen, P., Diamond, B. E., Blit-
zer, A., and Brin, M. F. (1997). Adductor spasmodic dys-
phonia: Standard evaluation of symptoms and severity.
Witsell, D. L., Weissler, M. C., Donovan, K., Howard, J. F.,
and Martinkosky, S. J. (1994). Measurement of laryngeal
resistance in the evaluation of botulinum toxin injection for
treatment of focal laryngeal dystonia. Laryngoscope, 104,
8–11.
Zwirner, P., Murry, T., and Woodson, G. E. (1993). A
comparison of bilateral and unilateral botulinum toxin
treatments for spasmodic dysphonia. European Archives of
Otorhinolaryngology, 250, 271–276.
Laryngeal Reinnervation Procedures
The human larynx is a neuromuscularly complex organ

responsible for three primary and often opposing func-
tions: respiration, swallowing, and speech. The most
primitive responsibilities include functioning as a con-
duit to bring air to the lungs and protecting the
respiratory tract during swallowing. These duties are
physiologically opposite; the larynx must form a wide
caliber during respiration but also be capable of forming Figure 1. Stylized left lateral view of the left SLN. The internal
(sensory) branch pierces the thyrohyoid membrane and termi-
a tight sphincter during swallowing. Speech functions are
nates in the supraglottic submucosal receptors. The external
fine-tuned permutations of laryngeal opening and clo- (motor) branch terminates in the cricothyroid muscle. Branch-
sure against pulmonary airflow. Innervation of this or- ing of the SLN occurs proximally as it exits the carotid sheath.
gan is complex, and its design is still being elucidated.
E¤erent fibers to the larynx from the brainstem motor
nuclei travel by way of the vagus nerve to the superior
laryngeal nerve (SLN) and the recurrent laryngeal Anatomy and Technique of Reinnervation. The tech-
nerves (RLNs). A¤erent fibers emanate from intra- nique of reinnervation is similar for both sensory and
mucosal and intramuscular receptors and travel along motor systems. The nerve in question is identified
pathways that include the SLN (supraglottic larynx) and through a transcervical approach. Usually a horizontal
the RLN (subglottic larynx). Autonomic fibers also skin incision is placed into a neck skin crease at about
innervate the larynx, but these are poorly understood the level of the cricoid cartilage. Subplatysmal flaps are
(see also anatomy of the human larynx). elevated and retracted. The larynx is further exposed
Reinnervation of the larynx was first reported 1909 after splitting the strap muscles in the midline. Sensation
by Horsley, who described reanastomosis of a severed to the supraglottic mucosal is supplied via the internal
RLN. Reports of laryngeal reinnervation spotted the branch of the SLN. This structure is easily identified as it
literature over the next several decades, but it was not pierces the thyrohyoid membrane on either side (Fig. 1).
until the last 30 years that reinnervation techniques were The motor innervation of the larynx is somewhat more
refined and became performed with relative frequency. complicated. All of the intralaryngeal muscles are inner-
Surely this is associated with advances in surgical optics vated by the RLN. This nerve approaches the larynx
as well as microsurgical instrumentation and technique. from below in the tracheoesophageal groove. The nerve
Indications for laryngeal reinnervation include func- enters the larynx from deep to the cricothyroid joint and
tional reanimation of the paralyzed larynx, prevention of immediately splits into an anterior and a posterior divi-
denervation atrophy, restoration of laryngeal sensation, sion (Fig. 2). The anterior division supplies all of the
and modification of pathological innervation (Crumley, laryngeal adductors except the cricothyroid muscle; the
1991; Aviv et al., 1997; Berke et al., 1999). posterior division supplies the only abductor of the
42 Part I: Voice
the thyroid cartilage. A large inferiorly based window is

made in the thyroid lamina and centered over the infe-
rior tubercle. Once the cartilage is opened the anterior
branch can be seen coursing obliquely toward the termi-
nus in the midportion of the thyroarytenoid muscle.
The posterior division is approached by rotation of the
larynx, similar to an external approach to the arytenoid.
Identification and dissection of the fine distal nerve
branches is usually carried out under louposcopic or mi-
croscopic magnification using precision instruments.
Once the damaged nerve is identified, it is severed
sharply with a single cut of a sharp instrument. This is
important to avoid crushing trauma to the nerve stump.
The new motor or sensory nerve is brought into the field
under zero tension and then anastomosed with several
epineural sutures of fine microsurgical material (9-0 or
10-0 nylon or silk). The anastomosis must be tension-
free. The donor nerve is still connected to its proximal
(motor) or distal (sensory) cell bodies. Selection of the
appropriate donor nerve is discussed subsequently.
Over the next 3–9 months healing occurs, with
neurontization of the motor end-plates or the sensory
receptors. Although physiological reinnervation is the
goal, relatively few reports have demonstrated true voli-
tional movement. Typically, one can hope to prevent
muscle atrophy and help restore muscle bulk. Sensory
reinnervation is less clear.
Prior to the microsurgical age, most reinnervation
procedures of the larynx were carried out with nerve-
muscle pedicle implantation into the a¤ected muscle.
With modern techniques, end-to-end nerve-nerve anas-
tomosis with epineural suture fixation is a superior and
Figure 2. Stylized left lateral view of the left RLN. The far more reliable technique of reinnervation.
branching pattern is quite consistent from patient to patient.
The RLN divides into an anterior and a posterior division just Reinnervation for Laryngeal Paralysis. Paralysis of the
deep to the cricothyroid joint. The posterior division travels to
the posterior cricoarytenoid (PCA) muscle. The anterior divi-
larynx is described as unilateral or bilateral. The diag-
sion gives o¤ branches at the interarytenoid (IA) and lateral nosis is made on the basis of history and physical exam-
cricoarytenoid (LCA) muscles and then terminates in the mid- ination including laryngoscopic findings, and sometimes
portion of the thyroarytenoid (TA) muscle. The branches to on the basis of laryngeal electromyography or radio-
the TA and LCA are easily seen through a large inferiorly graphic imaging. The unilaterally paralyzed larynx is
based cartilage window reminiscent of those done for thyro- more common and is characterized by a lateralized vocal
plasty. If preserved during an anterior approach, the inferior cord that prevents complete glottic closure during laryn-
cornu of the thyroid cartilage protects the RLN’s posterior di- geal tasks. The patient seeks care for dysphonia and
vision and the IA branch during further dissection. aspiration or cough during swallowing. The etiology
is commonly idiopathic, but the condition may be due
to inflammatory neuropathy, iatrogenic trauma, or neo-
plastic invasion of the recurrent nerve. Although many
patients are successfully treated with various static pro-
larynx, the posterior cricoarytenoid muscle. The ante- cedures, one can argue that, theoretically, the best results
rior division further arborizes to innervate each of the would restore the organ to its preexisting physiological
intrinsic adductors in a well-defined order: the interary- state. Over the past 20 years there has been increased
tenoid followed by the lateral cricoarytenoid, and lastly interest in physiological restoration, a concept champ-
the thyroarytenoid muscles. The interarytenoid muscle is ioned by Crumley (Crumley, 1983, 1984, 1991; Crumley,
thought to receive bilateral innervation, while the other Izdebski, and McMicken, 1988). A series of patients
muscles all receive unilateral innervation. The external with unilateral vocal cord paralysis were treated with
branch of the SLN innervates the only external adductor anastomosis of the distal RLN trunk to the ansa cervi-
of the larynx—the cricothyroid muscle. Although the calis. The ansa cervicalis is a good example of an ac-
SLN and the main trunk of the RLN can be approached ceptable motor donor nerve (Crumley, 1991; Berke et
without opening the larynx, the adductor branches of the al., 1999). This nerve normally supplies motor neurons
RLN can only be successfully approached after opening to the strap muscles (extrinsic accessory muscles of
Laryngeal Reinnervation Procedures 43
the larynx), whose function is to elevate and lower the use of the SLN as a motor source for the posterior cri-
larynx during swallowing. Fortunately, when the ansa coarytenoid muscle (Maniglia et al., 1989). Most tech-
cervicalis is sacrificed, the patient does not have notice- niques of reinnervation for bilateral vocal cord paralysis
able disability. The size match to the RLN is excellent still have not enjoyed the success of unilateral reinner-
when using either the whole nerve bundle for main trunk vation and are only performed by a few practitioners.
anastomosis or an easily identifiable fascicle for connec- The majority of patients with bilateral vocal cord paral-
tion to the anterior branch. ysis undergo a static procedure such as cordotomy, ary-
Hemilaryngeal reinnervation with the ansa cervicalis tenoidectomy, or tracheotomy to improve their airway.
has been shown to improve voicing in those patients
undergoing the procedure (Crumley and Izdebski, 1986; Sensory Palsy. Recent work has highlighted the im-
Chhetri et al., 1999). Evaluation of these patients, how- portance of laryngeal sensation. After development of
ever, does not demonstrate restoration of normal mus- an air-pulse quantification system to measure sensation,
cular physiology. The a¤ected vocal cord has good bulk, it was shown that patients with stroke and dysphagia
but volitional movement typically is not restored. Pro- have a high incidence of laryngosensory deficit. Studies
ponents of other techniques have argued that the ansa performed in the laboratory demonstrated that reanas-
cervicalis may not have enough axons to properly tomosis of the internal branch of the SLN restored
regenerate the RLN, or that synkinesis has occurred protective laryngeal reflexes (Blumin, Berke, and Black-
(Paniello, Lee, and Dahm, 1999). Synkinesis refers to well, 1999). Clinically, anastomosis of the greater auric-
mass firing of a motor nerve that can occur after rein- ular nerve to the internal branch of the SLN has been
nervation. In the facial nerve, for example, one may see successfully used to restore sensation to the larynx (Aviv
mass movement of the face with volitional movement et al., 1997).
because all the braches are essentially acting as one. The
RLN contains both abductor and adductor (as well as a Modification of Dystonia. Spasmodic dysphonia is an
small amount of sensory and autonomic) fibers. With idiopathic focal dystonia of the larynx. The majority
reinnervation, one may hypothesize that mass firing of of patients have the adductor variety, characterized by
all fibers cancels the firings of individual fibers out and intermittent and paroxysmal spasms of the vocal cords
thus produces a static vocal cord. With this concept in during connected speech. The mainstay of treatment for
mind, some have recommended combining reinnerva- this disorder is botulinum toxin (Botox) injections into
tion with another static procedure to augment results the a¤ected laryngeal adductor muscles. Unfortunately,
(combination with arytenoid adduction) or to avoid the the e¤ect of Botox is temporary, and repeated injections
potential for synkinesis by performing the anastomosis are needed indefinitely. A laryngeal denervation and
of the donor nerve to the anterior branch of the recur- reinnervation procedure has been designed to provide a
rent nerve (Green et al., 1992; Nasri et al., 1994; Chhetri permanent alternative to Botox treatment (Berke et al.,
et al., 1999). 1999). In this procedure, the distalmost branches of the
Paniello has proposed that the ansa cervicalis is not laryngeal adductors are severed from their muscle inser-
the best donor nerve to the larynx (Paniello, Lee, and tion. These ‘‘bad’’ nerve stumps are then sutured outside
Dahm, 1999). He suggests that the hypoglossal nerve the larynx to avoid spontaneous reinnervation. A fasci-
would be more appropriate because of increased axon cle of the ansa cervicalis is then suture-anastomosed
bulk and little donor morbidity. Animal experiments to the distal thyroarytenoid branch for reinnervation.
with this technique have demonstrated volitional and Reinnervation maintains tone of the thyroarytenoid,
reflexive movement of the reinnervated vocal cord. thus preventing atrophy and theoretically protecting that
Neurological bilateral vocal cord paralysis is often muscle by occupying the motor end-plates with neurons
post-traumatic or iatrogenic. These patients are troubled una¤ected by the dystonia. This approach has had great
by a fixed small airway and often find themselves tra- success, with about 95% of patients achieving freedom
cheotomy dependent. Therapy is directed at restoring from further therapy.
airway caliber while avoiding aspiration. Voicing issues
are usually considered secondary to the airway concerns.
Laryngeal Transplantation. Transplantation of a phys-
Although most practitioners currently treat with static
iologically functional larynx is the sought-after grail of
techniques, physiological restoration would be preferred.
reinnervation. For a fully functional larynx, eight nerves
In the 1970s, Tucker advocated reinnervation of the
would be anastomosed—bilateral anterior and posterior
posterior cricoarytenoid with a nerve muscle pedicle of branches of the RLN and bilateral external and internal
the sternohyoid muscle and ansa cervicalis (Tucker,
branches of the SLN. To date, success has been reliably
1978). Although his results were supportive of his hy-
achieved in the canine model (Berke et al., 1993) and
pothesis, many have had trouble repeating them. More partially achieved in one human (Strome et al., 2001).
recently, European groups have studied phrenic nerve
Current research has been aimed at preventing trans-
to posterior branch of the RLN transfers (van Lith-Bijl
plant rejection. The technique of microneural, micro-
et al., 1997, 1998). The phrenic nerve innervates the di-
vascular, and mucosal anastomosis has been well
aphragm and normally fires with inspiration; it has been
worked out in the animal model.
shown that one of its nerve roots can be sacrificed with-
out paralysis of the diaphragm. Others have suggested —Joel H. Blumin and Gerald S. Berke
44 Part I: Voice
References van Lith-Bijl, J. T., Stolk, R. J., Tonnaer, J. A. D. M.,

Groenhout, C., Konings, P. N. M., and Mahieu, H. F.
Aviv, J. E., Mohr, J. P., Blitzer, A., Thomson, J. E., and Close, (1998). Laryngeal abductor reinnervation with a phrenic
L. G. (1997). Restoration of laryngopharyngeal sensation nerve transfer after a 9-month delay. Archives of Otolaryn-
by neural anastomosis. Archives of Otolaryngology–Head gology–Head and Neck Surgery, 124, 393–398.
and Neck Surgery, 123, 154–160.
Berke, G. S., Blackwell, K. E., Gerratt, B. R., Verneil, A., Further Readings
Jackson, K. S., and Sercarz, J. A. (1999). Selective laryngeal
adductor denervation-reinnervation: A new surgical treat- Benninger, M. S., Crumley, R. L., Ford, C. N., Gould, W. J.,
ment for adductor spasmodic dysphonia. Annals of Otology, Hanson, D. G., Osso¤, R. H., et al. (1994). Evaluation
Rhinology, and Laryngology, 108, 227–231. and treatment of the unilateral paralyzed vocal fold.
Berke, G. S., Ye, M., Block, R. M., Sloan, S., and Sercarz, J. Otolaryngology–Head and Neck Surgery, 111, 497–508.
(1993). Orthotopic laryngeal transplantation: Is it time? Crumley, R. L. (1989). Laryngeal synkinesis: Its significance to
Laryngoscope, 103, 857–864. the laryngologist. Annals of Otology, Rhinology, and Laryn-
Blumin, J. H., Ye, M., Berke, G. S., and Blackwell, K. E. gology, 98, 87–92.
(1999). Recovery of laryngeal sensation after superior laryn- Crumley, R. L. (1991). Muscle transfer for laryngeal paralysis:
geal nerve anastomosis. Laryngoscope, 109, 1637–1641. Restoration of inspiratory vocal cord abduction by phrenic-
Chhetri, D. K., Gerratt, B. R., Kreiman, J., and Berke, G. S. omohyoid transfer. Archives of Otolaryngology–Head and
(1999). Combined arytenoid adduction and laryngeal rein- Neck Surgery, 117, 1113–1117.
nervation in the treatment of vocal fold paralysis. Laryngo- Crumley, R. L. (2000). Laryngeal synkinesis revisited. Annals
scope, 109, 1928–1936. of Otology, Rhinology, and Laryngology, 109, 365–371.
Crumley, R. L. (1983). Phrenic nerve graft for bilateral vocal Flint, P. W., Downs, D. H., and Coltrera, M. D. (1991).
cord paralysis. Laryngoscope, 93, 425–428. Laryngeal synkinesis following reinnervation in the rat:
Crumley, R. L. (1984). Selective reinnervation of vocal cord Neuroanatomic and physiologic study using retrograde flu-
adductors in unilateral vocal cord paralysis. Annals of orescent tracers and electromyography. Annals of Otology,
Otology, Rhinology, and Laryngology, 93, 351–356. Rhinology, and Laryngology, 100, 797–806.
Crumley, R. L. (1991). Update: Ansa cervicalis to recurrent Jacobs, I. N., Sanders, I., Wu, B. L., and Biller, H. F. (1990).
laryngeal nerve anastomosis for unilateral laryngeal paraly- Reinnervation of the canine posterior cricoarytenoid muscle
sis. Laryngoscope, 101, 384–387. with sympathetic preganglionic neurons. Annals of Otology,
Crumley, R. L., Izdebski, K. (1986). Voice quality following Rhinology, and Laryngology, 99, 167–174.
laryngeal reinnervation by ansa hypoglossi transfer. Laryn- Kreyer, R., and Pomaroli, A. (2000). Anastomosis between
goscope, 96, 611–616. the external branch of the superior laryngeal nerve and
Crumley, R. L., Izdebski, K., and McMicken, B. (1988). Nerve the recurrent laryngeal nerve. Clinical Anatomy, 13, 79–
transfer versus Teflon injection for vocal cord paralysis: 82.
A comparison. Laryngoscope, 98, 1200–1204. Marie, J. P., Lerosey, Y., Dehesdin, D., Tadie, M., and
Green, D. C., Berke, G. S., Graves, M. C., Natividad, M. Andrieu-Guitrancourt, J. (1999). Cervical anatomy of
(1992). Physiologic motion after vocal cord reinnervation: phrenic nerve roots in the rabbit. Annals of Otology, Rhi-
A preliminary study. Laryngoscope, 102, 14–22. nology, and Laryngology, 108, 516–521.
Horsley, J. S. (1909). Suture of the recurrent laryngeal nerve Paniello, R. C., West, S. E., and Lee, P. (2001). Laryngeal
with report of a case. Transactions of the Southern Surgical reinnervation with the hypoglossal nerve: I. Physiology,
Gynecology Association, 22, 161–167. histochemistry, electromyography, and retrograde labeling
Maniglia, A. J., Dodds, B., Sorensen, K., Katirji, M. B., and in a canine model. Annals of Otology, Rhinology, and
Rosenbaum, M. L. (1989). Newer technique of laryngeal Laryngology, 110, 532–542.
reinnervation: Superior laryngeal nerve (motor branch) as Peterson, K. L., Andrews, R., Manek, A., Ye, M., and Sercarz,
a driver of the posterior cricoarytenoid muscle. Annals of J. A. (1998). Objective measures of laryngeal function after
Otology, Rhinology, and Laryngology, 98, 907–909. reinnervation of the anterior and posterior recurrent laryn-
Nasri, S., Sercarz, J. A., Ye, M., Kreiman, J., Gerratt, B. R., geal nerve branches. Laryngoscope, 108, 889–898.
and Berke, G. S. (1994). E¤ects of arytenoid adduction on Sercarz, J. A., Nguyen, L., Nasri, S., Graves, M. C., Wenokur,
laryngeal function following ansa cervicalis nerve transfer R., and Berke, G. S. (1997). Physiologic motion after la-
for vocal fold paralysis in an in vivo canine model. Laryn- ryngeal nerve reinnervation: A new method. Otolaryngol-
goscope, 104, 1187–1193. ogy–Head and Neck Surgery, 116, 466–474.
Paniello, R. C., Lee, P., and Dahm, J. D. (1999). Hypoglossal Tucker, H. M. (1999). Long-term preservation of voice im-
nerve transfer for laryngeal reinnervation: A preliminary provement following surgical medialization and reinnerva-
study. Annals of Otology, Rhinology, and Laryngology, 108, tion for unilateral vocal fold paralysis. Journal of Voice, 13,
239–244. 251–256.
Strome, M., Stein, J., Esclamado, R., Hicks, D., Lorenz, R. R., van Lith-Bijl, J. T., and Mahieu, H. F. (1998). Reinnervation
Braun, W., et al. (2001). Laryngeal transplantation and 40- aspects of laryngeal transplantation. European Archives of
month follow-up. New England Journal of Medicine, 3443, Otorhinolaryngology, 255, 515–520.
1676–1679. Weed, D. T., Chongkolwatana, C., Kawamura, Y., Burkey,
Tucker, H. M. (1978). Human laryngeal reinnervation: Long B. B., Netterville, J. L., Osso¤, R. H., et al. (1995). Rein-
term experience with the nerve muscle pedicle technique. nervation of the allograft larynx in the rat laryngeal trans-
Laryngoscope, 88, 598–604. plant model. Otolaryngology–Head and Neck Surgery, 113,
van Lith-Bijl, J. T., Stolk, R. J., Tonnaer, J. A. D. M., 517–529.
Groenhout, C., Konings, P. N. M., and Mahieu, H. F. Zheng, H., Li, Z., Zhou, S., Cuan, Y., and Wen, W. (1996).
(1997). Selective laryngeal reinnervation with separate Update: laryngeal reinnervation for unilateral vocal cord
phrenic and ansa cervicalis nerve transfers. Archives of paralysis with the ansa cervicalis. Laryngoscope, 106, 1522–
Otolaryngology–Head and Neck Surgery, 123, 406–411. 1527.
Laryngeal Trauma and Peripheral Structural Ablations 45
Zheng, H., Zhou, S., Chen, S., Li, Z., and Cuan, Y. (1998). An Injuries to the intrinsic structures of the oral cavity
experimental comparison of di¤erent kinds of laryngeal are also rare, although when they do occur, changes in
muscle reinnervation. Otolaryngology–Head and Neck Sur- speech, deglutition, and swallowing may exist. Although
gery, 119, 540–547. the clinical literature is meager in relation to injuries of
the lip, alveolus, floor of the mouth, tongue, hard palate,
and velum, such injuries or their medical treatment
may have a significant impact on speech. Trauma to the
Laryngeal Trauma and Peripheral mandible also can directly impact verbal communica-
Structural Ablations tion. Unfortunately, the literature in this area is sparse,
and information on speech outcomes following injuries
of this type is frequently anecdotal. However, in
Alterations of the vibratory, articulatory, or resonance addressing any type of traumatic injury to the peripheral
system consequent to traumatic injury or the treatment structures of the speech mechanism, assessment methods
of disease can significantly alter the functions of both typically employed with the dysarthrias may be most
voice and speech, and potentially deglutition and swal- appropriate (e.g., Dworkin, 1991). In this regard, the
lowing. In some instances, subsequent changes to the point-place system may provide essential information
larynx and oral peripheral system may be relatively on the extent and degree of impairment of speech sub-
minor and without substantial consequence to the indi- systems (Netsell and Daniel, 1979).
vidual. In other instances these changes may result in
dramatic alteration of one or more anatomical structures Speech Considerations. Speech management initially
necessary for normal voice and speech production, in focuses on identifying which subsystems are impaired,
addition to other oral functions. Traumatic injury and the severity of impairment, and the consequent reduction
surgical treatment for disease also may a¤ect isolated in speech intelligibility and communicative proficiency.
structures of the peripheral speech mechanism, or may A comprehensive evaluation that involves aerodynamic,
have more widespread influences on entire speech acoustic, and auditory-perceptual components is essen-
subsystems (e.g., articulatory, velopharyngeal) and the tial. Information from each of these areas is valuable in
related structures necessary for competent and e¤ective identifying the problem, developing management strat-
verbal communication. egies, and monitoring patient progress. Because of vari-
ability in the extent of traumatic injuries, the literature
Laryngeal Trauma on the dysarthrias often provides a useful framework
for establishing clinical goals and evaluating treatment
Trauma to the larynx is a relatively rare clinical entity. e¤ectiveness.
Estimates in the literature indicate that acute laryngeal
trauma accounts for 1 in 15,000 to 1 in 42,000 emer- Peripheral Structural Changes Resulting from
gency room visits. A large number of such traumatic Surgical Ablation
injuries are the result of accidental blunt trauma to the
neck; causes include motor vehicle collisions, falls, ath- In contrast to peripheral structural changes due to trau-
letic injuries, and the like. Another portion of these matic injury, structural changes due to surgical ablation
injuries are the result of violence, such as shooting or for oral cancer also result in alterations in functions
stabbing, which may result in penetrating injuries not necessary for speech or swallowing. The treatment itself
only of the larynx but also of other critical structures has clear potential to a¤ect speech production, but
in the neck. Blunt laryngeal trauma is most commonly changes in speech may be variable and ultimately de-
reported in persons less than 30 years old. pend on the structures treated.
In cases of blunt trauma to the larynx, the primary
presenting symptoms are hoarseness, pain, dyspnea Malignant Conditions A¤ecting Peripheral Structures of
(shortness of breath), dysphagia, and swelling of the the Speech System. Head and neck squamous cell car-
tissues of the neck (cervical emphysema). Injuries may cinoma may occur in the epithelial tissue of the mouth,
involve fractures of laryngeal cartilages, partial or com- nasal cavity, pharynx, or larynx. Tumors of the head
plete dislocations, lacerations of soft tissues, or com- and neck account for approximately 5% of all malig-
bined types of injury. Because laryngeal trauma, no nancies in men and 2% of all malignancies in women
matter how minor, holds real potential to a¤ect breath- (Franceschi et al., 2000). The majority of head and neck
ing, medical intervention is first directed at determining cancers involve squamous epithelial tissue. These tumors
airway patency and, when necessary, maintaining an have the potential to be invasive; therefore, the potential
adequate airway through emergency airway manage- for spread of disease is substantial, and radical dissection
ment (Schaefer, 1992). When airway compromise is of regional lymphatics is often required. Because of the
observed, emergency tracheotomy is common. Laryn- possibility of disease spread, radiation therapy is com-
geal trauma is truly an emergency medical condition. monly used to eliminate occult disease, and this treat-
When injuries are severe, additional surgical treatment ment may have negative consequences on vocal and
may be warranted. Thus, whereas vocal disturbances are speech functions.
possible, the airway is of primary concern; changes to As tumors increase in size, they may invade adja-
the voice are of secondary importance. cent tissue, which frequently requires more extensive
46 Part I: Voice
treatment because of the threat of distant spread of dis- tures for speech may exist post-treatment. Surgical re-
ease. Some individuals are asymptomatic at the time of duction of the gum ridge (alveolus) as well as the hard
diagnosis; others report pain, di‰culty swallowing (dys- and soft palates may occur. Although defects of the
phagia), or di‰culty breathing. The destructive nature of alveolus may be augmented quite well prosthetically,
a malignant tumor may cause adjacent muscular struc- more significant surgical resections of the hard and soft
tures of the tongue or floor of mouth to become fixed. palate frequently have a dramatic influence on speech
In other situations, the tumor may encroach on the production. The primary deficit observed is velopharyn-
nerve supply, which is likely to result in loss of sensa- geal incompetence due to structural defects that elim-
tion (paresthesia), paralysis, or pain. In both circum- inate the ability to e¤ectively seal the oral cavity from
stances, the potential for metastatic spread of disease is the nasal cavity (Brown et al., 2000).
substantial. When considerable portions of both the hard and the
Current treatment protocols for tumors involving soft palate are removed, the integrity of the oral valve
structures of the peripheral speech mechanism include for articulatory shaping and the subsequent demands of
surgery, radiotherapy, or chemotherapy, alone or in the resonance system are substantially a¤ected, often
combination. The choice of modality usually depends with profound influences on oral communication. The
on tumor location, disease stage, cell type, and other goals of rehabilitation include reducing the surgical de-
factors. Each treatment modality is associated with some fect through prosthetic management so that the individ-
additional morbidity that can significantly a¤ect struc- ual can eat, drink, and attain some level of functional
ture, function, cosmesis, and quality of life. Surgery car- speech. Rehabilitation involves a multidisciplinary team
ries a clear potential for anatomical and physiological and occurs over several months, during which the post-
changes that may directly alter speech and swallowing, surgical anatomy changes and the prosthesis is changed
while at the same time creating significant cosmetic in tandem. Healing and other e¤ects of treatment also
deformities. Similarly, radiotherapy is commonly asso- need to be addressed during this period.
ciated with a range of side e¤ects. Radiation delivered to
the head and neck a¤ects both abnormal and normal Defects of the Tongue, Floor of Mouth, Alveolus, Lip, and
tissues. Salivary glands may be damaged, with a result- Mandible. Surgery for traumatic injury or tumors that
ing decrease in salivary flow leading to a dry mouth invade the mandible, tongue, and floor of the mouth
(xerostomia). This decrease in saliva may then challenge is frequently more radical than surgery for injuries or
normal oral hygiene and health, which may result in tumors of the maxilla. The mandible and tongue are
dental caries following radiation treatment. Osteoradio- movable structures that are intricately involved in swal-
necrosis may result in the exposure of bone, tissue lowing and speech production. Focal excisions may
changes, or infection, which usually cause significant not result in any significant level of noticeable change
discomfort to the individual. Osteoradionecrosis may be in deglutition, swallowing, or speech. However, when
decreased if the radiation exposure is limited to less larger resections are performed on the tongue (partial or
than 60 cGy (Thorn et al., 2000). In some cases, surgery total glossectomy) or floor of the mouth, reconstruction
may be necessary to debride infected tissue or to remove with one of a variety of flap procedures is necessary.
damaged bone, with a subsequent need for grafting. Large resections and reconstructions also increase the
Hyperbaric oxygen therapy is sometimes used to reduce potential for speech disruption because of limited mo-
the degree of osteoradionecrosis (Marx, Johnson, and bility of the reconstructed areas (e.g., portions of the
Kline, 1985). This therapy facilitates oxygen uptake tongue). Defects in the alveolar ridge or lip may have
in blood and related tissues, which in turn improves variable e¤ects on speech and general oral function. The
vascularity, thus reducing tissue damage and related literature in this area is scant to nonexistent; however,
osteoradionecrotic changes following extractions (Marx, changes to the lip and alveolus may certainly limit the
1983a, 1983b). production of specific speech sounds. For example, an-
Once primary medical treatment (surgery, radiother- terior sounds in which the lip is an active articulator
apy) has been completed, the goals of rehabilitation are (e.g., labiodental sounds), sounds that rely on pressure
to maintain or restore anatomical structure and physio- build-up (e.g., stop-plosive sounds), or sounds that re-
logical function. However, if the lesion is extensive and quire a fixed position for continuous generation or oral
destructive, with distant spread, and subsequent treat- airway turbulence (e.g., lingual-alveolar sounds), among
ment is contraindicated, palliative care may be initiated others, may be altered by surgical ablation of these
instead. Palliative care focuses on pain control and structures. In many instances, these types of problems
maintaining some level of nutrition, respiration, and hu- can be addressed using methods commonly employed
man dignity. for articulation therapy. Slowing the speech rate may
help to augment articulatory precision in the presence of
Defects of the Maxilla and Velum. Surgery is the pre- such defects following surgical treatment.
ferred method for treating cancer of the maxilla. Such Isolated mandibular tumors are rare, but regional
procedures vary in the extent of resection and may be spread of cancer to the mandible from other oral struc-
performed transorally or transfacially. Because surgical tures is not uncommon. Extended lesions involving
extirpation of maxillary tumors may involve extensive the mandible often require surgical resection. Because
resection, a significant reduction in the essential struc- cancer has invaded bone, resections are often substan-
Laryngeal Trauma and Peripheral Structural Ablations 47
tial, subsequently requiring reconstruction with donor speech concerns. The surgical obturator allows careful
bone or plate reconstruction methods. However, changes packing of the wound site, keeps food and other debris
in the mobility of the mandible pose a risk for changes in from entering the wound, permits eating, drinking, and
the overall acoustic structure of speech due to changes in swallowing without a nasogastric tube, and allows im-
oral resonance. Prosthetic rehabilitation for jaw defects mediate oral communication (Doyle, 1999; Leeper and
resulting from injury or malignancy is essential (Adis- Gratton, 1999). However, certain factors may alter the
man, 1990). However, mandibular reconstructions may course of rehabilitation. Fabrication of a prosthesis is
permit the individual to manipulate the mandible with more di‰cult when teeth are absent, as this creates
relative e‰ciency for speech and swallowing movements. problems in fitting and retention of the prosthesis, which
Again, this area is not well documented in the clinical may result in secondary problems. When the jaw is dis-
literature. rupted, changes in symmetry may emerge that may in-
fluence chewing and swallowing. For soft palate defects,
Oral Articulatory Evaluation. Total or partial removal early prostheses can be modified throughout the course
of the tongue (glossectomy) or resection of the anterior of treatment to facilitate swallowing and speech. Because
maxilla or mandible results in significant speech impair- respiration is paramount, the devices must be created so
ment. The results of peripheral structural ablations of that they do not impede normal breathing.
the oral speech system are best addressed using the In general, intraoral prostheses for those with surgical
methods recommended for treatment of the dysarthrias defects of peripheral oral structures seek to maintain
(e.g., Kent et al., 1989). Acoustic analysis, including the oral and nasal cavities as separate entities and to
evaluation of formant frequency, amplitude, and spec- reduce velopharyngeal orifice area. Reduction of surgi-
tral moments, is of clinical value (Gillis and Leonard, cally induced velopharyngeal incompetence is essential
1982; Leonard and Gillis, 1982; Tobey and Lincks, to enhancing residual speech capabilities, particularly
1989). Acoustic considerations relative to speech intelli- following larger resections of the maxilla. In addition to
gibility should also be addressed in this population using the obvious defects in the structural integrity of the pe-
tools developed for the dysarthrias (Kent et al., 1989). ripheral oral mechanism, treatment may disrupt neural
processes, including both a¤erent and e¤erent compo-
Prosthetic Management. To optimize the care of each nents of the system. As such, speech assessment must
patient, the expertise of multiple professionals, including evaluate the motor capacity as well as the capacity of the
a dentist, prosthodontist, head and neck surgeon, pros- sensory system.
thetist, speech-language pathologist, and radiologist, is When reductions in the ability of the tongue to ap-
necessary. Surgical treatment of maxillary tumors can proximate superior structures of the oral cavity exist
result in a variety of problems postsurgically. For ex- after treatment, attempts to facilitate this ability may be
ample, leakage of food or drink into the nasal cavity achieved by constructing a ‘‘palatal drop’’ prosthesis,
significantly disrupts eating and drinking. This may be which supplements lingual inability by bringing the new
compounded by di‰culties in chewing food or by dys- hard palate inferior in the oral cavity. The individual
phagia, so that nutritional problems must always be may then have greater ability to make the necessary oral
considered. Additionally, such surgical defects reduce contacts to improve articulatory precision and speech
articulatory precision and change the acoustic struc- intelligibility (e.g., Aramany et al., 1982; Gillis and
ture of speech because of changes in the volume of the Leonard, 1983). Such a prosthesis is useful in patients
vocal tract as well as resultant hypernasality and who have healed after undergoing a maxillectomy or
nasal emission (abnormal and perhaps excessive flow of maxillary-mandibular resection. Prosthetic adaptations
air through the nasal cavity). Together, these types of of this type may also benefit eating and swallowing.
changes to the peripheral oral structures will almost cer- These devices serve both speech and swallowing by
tainly result in decreased speech intelligibility and com- helping to reduce velopharyngeal defects.
munication. In order for speech improvement to occur,
the surgical defect must be augmented with a prosthe- Speech Assessment
sis. This is typically a multistage process, with the first
obturator being placed in situ at the time of surgery. The Systematic assessment of the peripheral speech mecha-
initial obturator is maintained for 1–2 weeks, at which nism includes formal evaluation of all subsystems—
time an interim device is fabricated. Most individuals use respiratory, laryngeal, velopharyngeal, and articulatory.
the interim device until complete healing has occurred As each component of the system is assessed, its relative
and related treatments are completed, usually anywhere contributions to the overall speech deficits observed can
from 3 to 6 months after surgery. At this time a final or be better defined for purposes of treatment monitoring.
definitive obturator is fabricated. Although some minor Subsystem evaluation may entail instrumentally based
adjustments to the definitive prosthesis are not uncom- assessments (speech aerodynamics and acoustics), which
mon, this obturator will be maintained permanently elicit information on the aeromechanical relationship
(Desjardins, 1977, 1978; Anderson et al., 1992). to oral port size and tongue–hard palate valving (Warren
and DuBois, 1964), auditory-perceptual evaluations of
Prosthetic Management of Surgical Defects. Early oral speech or voice parameters, and measures of speech in-
rehabilitation is essential, for reasons that go beyond telligibility (Leeper, Sills, and Charles, 1993; Leeper and
48 Part I: Voice
Gratton, 1999). Use of Netsell and Daniel’s (1979) Leonard, R. J., and Gillis, R. E. (1982). E¤ects of a prosthetic
physiological model allows the clinician to identify the tongue on vowel intelligibility and food management in
relative contribution of each speech subsystem and to a patient with total glossectomy. Journal of Speech and
target appropriate therapy techniques in an e¤ort to Hearing Disorders, 47, 25–29.
Marx, R. E. (1983a). Osteoradionecrosis: A new concept of its
optimize the system. The physiologic approach also
pathophysiology. Journal of Oral and Maxillofacial Sur-
permits continuous evaluation of each component in a gery, 41, 283–288.
comparative manner for prosthesis-in and prosthesis-out Marx, R. E. (1983b). A new concept in the treatment of
conditions. Such evaluations aid in fine-tuning prosthetic osteoradionecrosis. Journal of Oral and Maxillofacial Sur-
management. gery, 41, 351–357.
Once obturation occurs, direct speech treatment goals Marx, R. E., Johnson, R. P., and Kline, S. N. (1985). Preven-
may include improving the control of the respiratory tion of osteoradionecrosis: A randomized prospective clini-
support system for speech, determining and maintain- cal trial of hyperbaric oxygen vs. penicillin. Journal of the
ing the optimal speech rate, increasing the accuracy of American Dental Association, 111, 49–54.
specific sound production (or potentially the directed Netsell, R., and Daniel, B. (1979). Dysarthria in adults: Physi-
ologic approach to rehabilitation. Archives of Physical
compensation of sound substitutions), improving intelli-
Medicine and Rehabilitation, 60, 502–508.
gibility through overarticulation, modulating vocal in- Schaefer, S. D. (1992). The acute management of external
tensity to improve oral resonance, and related tasks that laryngeal trauma. Archives of Otolaryngology–Head and
seek to improve intelligibility. Treatment goals may best Neck Surgery, 118, 598–604.
be achieved by using the methods previously described Thorn, J. J., Hansen, H. S., Specht, L., and Bastholt, L. (2000).
for the dysarthrias (Dworkin, 1991). Osteoradionecrosis of the jaws: Clinical characteristics
and relation to the field of irradiation. Journal of Oral and
—Philip C. Doyle Maxillofacial Surgery, 58, 1088–1093.
Tobey, E. A., and Lincks, J. (1989). Acoustic analyses of
References speech changes after maxillectomy and prosthodontic man-
agement. Journal of Prosthetic Dentistry, 62, 449–455.
Adisman, I. (1990). Prosthesis serviceability for acquired jaw Warren, D. W., and DuBois, A. B. (1964). A pressure-flow
defects. Dental Clinics of North America, 34, 265–284. technique for measuring velopharyngeal orifice area during
Anderson, J. D., Awde, J. D., Leeper, H. A., and Sills, P. S. spontaneous speech. Cleft Palate Journal, 1, 52–71.
(1992). Prosthetic management of the head and neck cancer
patient. Canadian Journal of Oncology, 2, 110–118. Further Readings
Aramany, M. A., Downs, J. A., Beery, Q. C., and Ayslan, F.
(1982). Prosthodontic rehabilitation for glossectomy Balogh, J., and Sutherland, S. (1983). Osteoradionecrosis of the
patients. Journal of Prosthetic Dentistry, 48, 78–81. mandible: A review. American Journal of Otolaryngology,
Brown, J. S., Rogers, S. N., McNally, D. N., and Boyle, M. 13, 82.
(2000). A modified classification for the maxillectomy de- Batsakis, J. G. (1979). Squamous cell carcinomas of the oral
fect. Head and Neck, 22, 17–26. cavity and the oropharynx. In J. G. Batsakis (Ed.), Tumours
Desjardins, R. P. (1977). Early rehabilitative management of of the head and neck: Clinical and pathological consid-
the maxillectomy patient. Journal of Prosthetic Dentistry, erations (2nd ed.). Baltimore: Williams and Wilkins.
38, 311–318. Beumer, J., Curtis, T., and Marunick, M. (1996). Maxillofacial
Desjardins, R. P. (1978). Obturator design for acquired maxil- rehabilitation: Prosthodontic and surgical considerations. St.
lary defects. Journal of Prosthetic Dentistry, 39, 424–435. Louis: Ishiyaki EuroAmerica.
Doyle, P. C. (1999). Postlaryngectomy speech rehabilitation: Cherian, T. A., and Raman, V. R. R. (1993). External laryn-
Contemporary concerns in clinical care. Journal of Speech- geal trauma: Analysis of 30 cases. Journal of Laryngology
Language Pathology and Audiology, 23, 109–116. and Otology, 107, 920–923.
Dworkin, J. P. (1991). Motor speech disorders: A treatment Doyle, P. C. (1994). Foundations of voice and speech rehabili-
guide. St. Louis: Mosby. tation following laryngeal cancer (pp. 97–122). San Diego,
Franceschi, S., Bidoli, E., Herrero, R., and Munoz, N. (2000). CA: Singular Publishing Group.
Comparison of cancers of the oral cavity and pharynx Gussack, G. S., Jurkovich, G. J., and Luterman, A. (1986).
worldwide: Etiological clues. Oral Oncology, 36, 106–115. Laryngotracheal trauma. Laryngoscope, 96, 660–665.
Gillis, R. E., and Leonard, R. J. (1983). Prosthetic treatment Goldenberg, D., Golz, A., Flax-Goldenberg, R., and Joa-
for speech and swallowing in patients with total glossec- chims, H. Z. (1997). Severe laryngeal injury cased by blunt
tomy. Journal of Prosthetic Dentistry, 50, 808–814. trauma to the neck: A case report. Journal of Laryngology
Kent, R. D., Weismer, G., Kent, J. F., and Rosenbek, J. C. and Otology, 111, 1174–1176.
(1989). Toward phonetic intelligibility testing in dysarthria. Haribhakti, V., Kavarana, N., and Tibrewala, A. (1993). Oral
Journal of Speech and Hearing Disorders, 54, 482–499. cavity reconstruction: An objective assessment of function.
Leeper, H. A., and Gratton, D. G. (1999). Oral prosthetic Head and Neck, 15, 119–124.
rehabilitation of individuals with head and neck cancer: A Hassan, S. J., and Weymuller, E. A. (1993). Assessment of
review of current practice. Journal of Speech-Language Pa- quality of life in head and neck cancer patients. Head and
thology and Audiology, 23, 117–133. Neck, 15, 485–496.
Leeper, H. A., Sills, P. S., and Charles, D. (1993). Prostho- Jewett, B. S., Shockley, W. W., and Rutledge, R. (1999). Ex-
dontic management of maxillofacial and palatal defects. ternal laryngeal trauma analysis of 392 patients. Archives of
In K. T. Moller and C. D. Starr (Eds.), Cleft palate: Inter- Otolaryngology–Head and Neck Surgery, 125, 877–880.
disciplinary issues and treatment (p. 145). Austin, TX: Kornblith, A. B., Zlotolow, I. M., Gooen, J., Huryn, J. M.,
Pro-Ed. Lerner, T., Strong, E. W., Shah, J. P., Spiro, R. H., and
Psychogenic Voice Disorders: Direct Therapy 49
Holland, J. C. (1996). Quality of life of maxillectomy psychogenic voice loss, the precise mechanisms underly-
patients using an obturator prosthesis. Head and Neck, 18, ing such psychologically based disorders have not been
323–334. fully elucidated (see functional voice disorders for a
Lapointe, H. J., Lampe, H. B., and Taylor, S. M. (1996). review). Despite considerable controversy surrounding
Comparison of maxillectomy patients with immediate ver-
causal mechanisms, the clinical voice literature is replete
sus delayed obturator prosthesis placement. Journal of Oto-
laryngology, 25, 308–312. with evidence that symptomatic voice therapy for
Light, J. (1997). Functional assessment testing for maxillofacial psychogenic disorders can often result in rapid and dra-
prosthetics. Journal of Prosthetic Dentistry, 77, 388–393. matic voice improvement (Koufman and Blalock, 1982;
List, M. A., Ritter-Sterr, C. A., Baker, T. M., Colangelo, L. A., Aronson, 1990; Milutinovic, 1990; Carding and Horsley,
Matz, G., Pauloksi, B. R., and Logemann, J. A. (1996). 1992; Roy and Leeper, 1993; Gunther et al., 1996; Roy
Longitudinal assessments of quality of life in laryngeal can- et al., 1997; Andersson and Schalen, 1998; Carding,
cer patients. Head and Neck, 18, 1–10. Horsley, and Docherty, 1999; Stemple, 2000). The fol-
Logemann, J. A., Kahrilas, P. J., Hurst, P., Davis, J., and lowing discussion considers voice therapy techniques
Krugler, C. (1989). E¤ects of intraoral prosthetics on swal- aimed at directly alleviating vocal symptoms without
lowing in patients with oral cancer. Dysphagia, 4, 118–120.
specific attention to the putative psychological dysfunc-
Mahanna, G. K., Beukelman, D. R., Marshall, J. A., Gaebler,
C. A., and Sullivan, M. (1998). Obturator prostheses after tion underlying the disorder.
cancer surgery: An approach to speech outcome assessment. Before symptomatic therapy is begun, the laryngo-
Journal of Prosthetic Dentistry, 79, 310–316. logical findings are reviewed with the patient, and he or
Myers, E. M., and Iko, B. O. (1987). The management of acute she is reassured regarding the absence of any structural
laryngeal trauma. Journal of Trauma, 4, 448–452. laryngeal pathology. An explanation of the problem
Pauloski, B., Logemann, J., Rademaker, A., McConnel, F., is then provided by the clinician. While the specific
Heiser, M., Cardinale, S., et al. (1993). Speech and swal- approach and emphasis vary among clinicians, the dis-
lowing function after anterior tongue and floor of mouth cussion typically includes some explanation of the un-
resection with distal flap reconstruction. Journal of Speech toward e¤ects of excess or dysregulated muscle tension
on voice production and its probable link to stress,
Robbins, K., Bowman, J., and Jacob, R. (1987). Post-
glossectomy deglutitory and articulatory rehabilitation with situational conflicts, or other psychological precursors
palatal augmentation prosthesis. Archives of Otolaryngol- that were identified during the interview. The confident
ogy–Head and Neck Surgery, 113, 1214–1218. clinician provides brief information regarding the ther-
Sandor, G. K. B., Leeper, H. A., and Carmichael, R. P. (1997). apy plan and the likelihood of a positive outcome.
Speech and velopharyngeal function. Oral and Maxillofacial Because excess or dysregulated laryngeal muscle
Surgery Clinics of North America, 9, 147–165. tension is frequently o¤ered as the proximal cause of
Schaefer, S. D., and Close, L. C. (1989). Acute management of the psychogenic voice disorder, many voice therapies
laryngeal trauma. Annals of Otology, Rhinology, and Laryn- including yawn-sigh resonant voice therapy, visual and
gology, 98, 98–104. electromyographic biofeedback, chewing therapy, pro-
Warren, D. W. (1979). PERCI: A method for rating palatal
gressive relaxation, and circumlaryngeal massage aimed
e‰ciency. Cleft Palate Journal, 16, 279–285.
Wedel, A., Yontchev, E., Carlsson, G., and Ow, R. (1994). at reducing or rebalancing such tension (Boone and
Masticatory function in patients with congenital and McFarlane, 2000). Prolonged hypercontraction of la-
acquired maxillofacial defects. Journal of Prosthetic Den- ryngeal muscles is often associated with elevation of the
tistry, 72, 303–308. larynx and hyoid bone, with associated pain and dis-
comfort when the circumlaryngeal region is palpated.
Aronson (1990) and Roy and Bless (1998) have de-
scribed manual techniques to determine the presence and
Psychogenic Voice Disorders: Direct degree of laryngeal musculoskeletal tension, as well as
Therapy methods to relieve such tension during the diagnostic
assessment and management session. Circumlaryngeal
massage is one such treatment approach. Skillfully ap-
Psychogenic voice disorder refers to a voice disorder plied, systematic kneading of the extralaryngeal region
that is a manifestation of some confirmed psychological is believed to stretch muscle tissue and fascia, promote
disequilibrium (Aronson, 1990). In its purest form, the local circulation with removal of metabolic wastes, relax
psychogenic voice disorder is not associated with struc- tense muscles, and relieve pain and discomfort asso-
tural laryngeal changes or frank central or peripheral ciated with muscle spasms. The hypothesized physical
nervous system pathology. It is asserted that the larynx, e¤ect of such massage is reduced laryngeal height and
by virtue of its neural connections to emotional centers sti¤ness and increased mobility. Once the larynx is
within the brain, is vulnerable to excess or poorly regu- ‘‘released’’ and range of motion is normalized, propor-
lated musculoskeletal tension arising from stress, con- tional improvement in voice is said to follow. Improve-
flict, fear, and emotional inhibition (Case, 1996). Such ment in voice and reductions in pain and laryngeal
dysregulated laryngeal muscle tension can interfere with height suggest a relief of tension (Roy and Ferguson,
normal voice and give rise to complete aphonia (i.e., 2001). In a similar vein, Roy and Bless (1998) also
whispered speech) or partial voice loss (dysphonia). Al- recently described a number of manual laryngeal repos-
though numerous theories have been o¤ered to explain turing techniques that can stimulate improved voice
50 Part I: Voice
and briefly interrupt patterns of muscle misuse. These Because there are few studies directly comparing the
brief moments of voice improvement associated with e¤ectiveness of specific therapy techniques, not much
laryngeal reposturing maneuvers are immediately iden- is known about whether one therapy approach for psy-
tified for the patient and reinforced. Digital cues can chogenic voice disorder is superior to another. Although
then be faded and the patient taught to rely on sensory signs of improvement should typically be observed
feedback (auditory, kinesthetic, and proprioceptive) to within the first session, some patients may need an
maintain improved laryngeal positioning and muscle extended, intensive treatment session or several sessions,
balance. Any partial relapses or return of abnormal depending on several variables, including the therapy
voice during the therapy process can be dealt with by techniques selected, clinician experience and confidence
reassurance, verbal reinstruction, or manual cueing. in administering the approach, and patient motivation
Once the larynx is correctly positioned, recovery of nor- and tolerance, to mention only a few.
mal voice can occur rapidly. The anecdotal clinical literature suggests that the
Certain patients with psychogenic dysphonia and prognosis for sustained removal of abnormal symptoms
aphonia appear to have lost kinesthetic awareness in psychogenic aphonia or dysphonia may depend on
and volitional control over voice production for speech several factors. First, the time between the onset of voice
and communication purposes, yet display normal voic- problem and the initiation of therapy may be important.
ing for vegetative or nonspeech acts. For instance, some The sooner voice therapy is initiated following the onset
aphonic and severely dysphonic patients may be able to of the voice problem, the better the prognosis. If months
clear the throat, grunt, cough, sigh, gargle, laugh, hum, or years have elapsed, it may be more di‰cult to elimi-
or produce a high-pitched squeak with normal or near- nate the abnormal symptoms. Second, the more extreme
normal voice quality. Such preserved voicing for non- the voice symptoms, the better the prognosis for im-
speech purposes represents a clue to the capacity for provement. Aphonia and extreme tension, according to
normal phonation. In symptomatic therapy, then, the some authorities, may be easier to modify than inter-
patient is asked to produce such vocal behaviors. The mittent or mild dysphonia. Third, if significant second-
goal of these voice maneuvers is to elicit even a brief ary gain is present, this may interfere with progress and
trace of clearer voice so that it may be shaped toward contribute to a poorer treatment outcome. Finally, if
normal quality or extended to longer utterances. These the underlying psychological triggers are no longer
e¤orts follow a trial-and-error pattern and require the active, then normal voice should be established quickly
seasoned clinician to be constantly vigilant, listening for and improvement should be sustained (Aronson, 1990;
any brief moments of clearer voice. When improved Du¤y, 1995; Case, 1996; Colton and Casper, 1996). As a
voice is elicited, it is instantly reinforced, and the clini- caveat, however, the foregoing observations have rarely
cian provides immediate feedback regarding the positive been studied in any objective manner; therefore they are
change. During this process, the patient needs to be an best regarded as clinical impressions rather than factual
active participant and is encouraged to continually self- statements.
monitor the type and manner of voice produced. Once The long-term e¤ectiveness of direct voice therapy for
this brief but relatively normal voice is reliably achieved, psychogenic voice disorders also has not been rigorously
it is shaped and extended into sustained vowels, words, evaluated (Pannbacker, 1998; Ramig and Verdolini,
simple phrases, and oral reading. When this phase of 1998). Most clinicians report that relapse is infrequent
intervention is successful, the patient is then engaged in and isolated, yet others report more frequent post-
casual conversation that begins with basic biographical treatment recurrences. Of the few investigations that
information and proceeds to brief narratives, and then exist, the results regarding the durability of voice im-
conversation about any topic and with anyone in the provement following direct therapy are mixed (Gunther
clinical setting. If established, the restored voice is usu- et al., 1996; Roy et al., 1997; Andersson and Schalen,
ally maintained without compensatory e¤ort and may 1998; Carding, Horsley, and Docherty, 1999). It should
improve further during conversation. Finally, the clini- be acknowledged that following direct voice therapy,
cian should debrief the patient regarding the cause of the only the symptom of psychological disturbance has been
voice improvement, discuss the patient’s feelings about removed, not the disturbance itself (Brodnitz, 1962;
the improved voice, and review possible causes of the Kinzl, Biebl, and Rauchegger, 1988). Therefore, the
problem and the prognosis for maintaining normal nature of psychological dysfunction needs to be better
voice. understood. If the situational, emotional, or personality
Certainly, direct symptomatic therapy for psycho- features that contributed to the development of the psy-
genic voice disorders can produce rapid changes; how- chogenic voice disorder remain unchanged following
ever, in some cases voice therapy can be a frustrating behavioral treatment, it would be logical to expect that
and protracted experience for both clinician and patient such persistent factors would increase the probability of
(Bridger and Epstein, 1983; Fex et al., 1994). The rate of future recurrences (Nichol, Morrison, and Rammage,
improvement during therapy for psychogenic voice dis- 1993; Andersson and Schalen, 1998). Therefore, in some
orders varies. Patients may progress gradually through cases, post-treatment referral to a psychiatrist or psy-
various stages of dysphonia on their way to normal voice chologist may be necessary to achieve more enduring
recovery. Other patients will appear to experience sud- improvements in the patient’s emotional adjustment and
den return of voice without necessarily transitioning voice function (Butcher, Elias, and Raven, 1993; Roy
through phases of decreasing severity (Aronson, 1990). et al., 1997).
The Singing Voice 51
In summary, psychogenic voice disorders are power- Milutinovic, Z. (1990). Results of vocal therapy for phono-
ful examples of the intimate connection between mind neurosis: Behavior approach. Folia Phoniatrica, 42, 173–177.
and body. These voice disorders, which occur in the ab- Nichol, H., Morrison, M. D., and Rammage, L. A. (1993).
sence of structural laryngeal pathology, often represent Interdisciplinary approach to functional voice disorders:
The psychiatrist’s role. Otolaryngology–Head and Neck
some of the most severely disturbed voices encountered
Surgery, 108, 643–647.
by voice pathologists. In an experienced clinician’s Pannbacker, M. (1998). Voice treatment techniques: A review
hands, direct symptomatic therapy for psychogenic voice and recommendations for outcome studies. American Jour-
disorders can produce rapid and remarkable restoration nal of Speech-Language Pathology, 7, 49–64.
of normal voice. Much remains to be learned regarding Ramig, L. O., and Verdolini, K. (1998). Treatment e‰cacy:
the underlying bases of these disorders and the long-term Voice disorders. Journal of Speech, Language, and Hearing
e¤ect of direct therapeutic interventions. Research, 41(Suppl.), S101–S116.
See also functional voice disorders. Roy, N., and Bless, D. M. (1998). Manual circumlaryngeal
techniques in the assessment and treatment of voice dis-
—Nelson Roy orders. Current Opinion in Otolaryngology and Head and
References Roy, N., Bless, D. M., Heisey, D., and Ford, C. F. (1997).
Manual circumlaryngeal therapy for functional dysphonia:
Andersson, K., and Schalen, L. (1998). Etiology and treatment An evaluation of short- and long-term treatment outcomes.
of psychogenic voice disorder: Results of a follow-up study Journal of Voice, 11, 321–331.
of thirty patients. Journal of Voice, 12, 96–106. Roy, N., and Ferguson, N. A. (2001). Formant frequency
Aronson, A. E. (1990). Clinical voice disorders: An interdisci- changes following manual circumlaryngeal therapy for func-
plinary approach (3rd ed.). New York: Thieme. tional dysphonia: Evidence of laryngeal lowering? Journal
Boone, D. R., and McFarlane, S. C. (2000). The voice and of Medical Speech-Language Pathology, 9, 169–175.
voice therapy (6th ed.). Boston: Allyn and Bacon. Roy, N., and Leeper, H. A. (1993). E¤ects of the manual
Bridger, M. M., and Epstein, R. (1983). Functional voice dis- laryngeal musculoskeletal tension reduction technique as
orders: A review of 109 patients. Journal of Laryngology a treatment for functional voice disorders: Perceptual and
and Otology, 97, 1145–1148. acoustic measures. Journal of Voice, 7, 242–249.
Brodnitz, F. S. (1962). Functional disorders of the voice. In Roy, N., McGrory, J. J., Tasko, S. M., Bless, D. M., Heisey,
N. M. Levin (Ed.), Voice and speech disorders: Medical D., and Ford, C. N. (1997). Psychological correlates of
aspects (pp. 453–481). Springfield, IL: Charles C Thomas. functional dysphonia: An evaluation using the Minnesota
Butcher, P., Elias, A., and Raven, R. (1993). Psychogenic voice Multiphasic Personality Inventory. Journal of Voice, 11,
disorders and cognitive behaviour therapy. San Diego, CA: 443–451.
Singular Publishing Group. Schalen, L., and Andersson, K. (1992). Di¤erential diagnosis
Butcher, P., Elias, A., Raven, R., Yeatman, J., and Littlejohns, and treatment of psychogenic voice disorder. Clinical Oto-
D. (1987). Psychogenic voice disorder unresponsive to laryngology, 17, 225–230.
speech therapy: Psychological characteristics and cognitive- Stemple, J. (2000). Voice therapy: Clinical studies (2nd ed.).
behaviour therapy. British Journal of Disorders of Commu- San Diego, CA: Singular Publishing Group.
nication, 22, 81–92.
Carding, P., and Horsley, I. (1992). An evaluation study of
voice therapy in non-organic dysphonia. European Journal
of Disorders of Communication, 27, 137–158. The Singing Voice
Carding, P., Horsley, I., and Docherty, G. (1999). A study of
the e¤ectiveness of voice therapy in the treatment of 45
patients with nonorganic dysphonia. Journal of Voice, 13, The functioning of the voice organ in singing is similar
72–104. to that in speech. Thus, the origin of the sound is the
Case, J. L. (1996). Clinical management of voice disorders (3rd voice source—the pulsating airflow through the glottis.
ed.). Austin, TX: Pro-Ed.
Colton, R., and Casper, J. K. (1996). Understanding voice
The voice source is mainly controlled by three physio-
problems: A physiological perspective for diagnosis and logical factors, subglottal pressure, length and sti¤ness
treatment. Baltimore: Williams and Wilkins. of the vocal folds, and the degree of glottal adduction.
Du¤y, J. R. (1995). Motor speech disorders: Substrates, di¤er- These control parameters determine vocal loudness, F0,
ential diagnosis, and management. St. Louis: Mosby. and mode of phonation, respectively. The voice source is
Fex, F., Fex, S., Shiromoto, O., and Hirano, M. (1994). a complex tone composed of a series of harmonic par-
Acoustic analysis of functional dysphonia: Before and after tials of amplitudes decreasing by about 12 dB per octave
voice therapy (Accent Method). Journal of Voice, 8, 163– as measured in flow units. It propagates through the vo-
167. cal tract and is thereby filtered in a manner determined
Gunther, V., Mayr-Graft, A., Miller, C., and Kinzl, H. (1996). by its resonance or formant frequencies. These frequen-
A comparative study of psychological aspects of recurring
and non-recurring functional aphonias. European Archives
cies are determined by the vocal tract shape. For most
of Otorhinolaryngology, 253, 240–244. vowel sounds, the two lowest formant frequencies deter-
Kinzl, J., Biebl, W., and Rauchegger, H. (1988). Functional mine vowel quality, while the higher formant frequencies
aphonia: Psychosomatic aspects of diagnosis and therapy. belong to the personal voice characteristics.
Folia Phoniatrica, 40, 131–137.
Koufman, J. A., and Blalock, P. D. (1982). Classification and Breathing. Subglottal pressure determines vocal loud-
approach to patients with functional voice disorders. Annals ness and is therefore used for expressive purposes in
of Otology, Rhinology, and Laryngology, 91, 372–377. singing. It is also varied with F0, such that higher pitches
52 Part I: Voice
are sung with higher subglottal pressures than lower The acoustic significance of the waveform is straight-
pitches. As a consequence, singers need to vary sub- forward. The slope of the source spectrum is determined
glottal pressure constantly, adapting it to both loudness mainly by the negative peak of the di¤erentiated flow
and pitch. This is in sharp contrast to speech, where waveform, frequently referred to as the maximum flow
subglottal pressure is much more constant. Singers declination rate. It represents the main excitation of the
therefore need to develop a virtuosic control of the vocal tract. This steepness is linearly related to the sub-
breathing apparatus. In addition, subglottal pressures in glottal pressure in such a way that a doubling of sub-
singing are varied over a larger range than in speech. glottal pressure causes an SPL increase of about 10 dB.
Thus, while in loud speech subglottal pressure may be The amplitude gain of higher partials is greater than that
raised to 1.5 or 2 kPa, singers may use pressures as high of lower partials. Thus, if the sound level of a vowel
as 4 or 6 kPa for loud tones sung at high pitches. sound is increased by 10 dB, the partials near 3 kHz
Subglottal pressure is determined by active forces typically increase by about 17 dB.
produced by the breathing muscles and passive forces The air volume contained in a flow pulse is decisive
produced by gravity and the elasticity of the breathing to the amplitude of the source spectrum fundamental
apparatus. Elasticity, generated by the lungs and the rib and is strongly influenced by the overall glottal adduc-
cage, varies with lung volume. At high lung volumes, tion force. Thus, for a given subglottal pressure, a firmer
elasticity produces an exhalatory force that may amount adduction produces a smaller air volume in a pulse,
to 3 kPa or more. At low lung volumes, elasticity con- which reduces the amplitude of the fundamental. An
tributes an inhalatory force. Whereas in conversational exaggerated glottal adduction thus attenuates the fun-
speech, no more than about 15%–20% of the total lung damental. This phonation mode is generally referred to
capacity is used, classically trained singers use an aver- as hyperfunctional or pressed. The opposite extreme—
age lung volume range that is more than twice as large that is, the habitual use of too faint adduction—is called
and occasionally may vary from 100% to 0% of the total hypofunctional and prevents the vocal folds from closing
vital capacity in long phrases. the glottis also during the vibratory cycle. As a result,
As the elasticity forces change from exhalatory at airflow escapes the glottis during the quasi-closed phase.
high lung volumes to inhalatory at low lung volumes, This generates noise and produces a strong fundamental.
they reach an equilibrium at a certain lung volume. This This phonation mode is often referred to as breathy.
lung volume is called the functional residual capacity In classical singing, pressed phonation is typically
(FRC). In tidal breathing, inhalations are started from avoided. Instead, singers seem to strive to reduce glottal
FRC. In both speech and singing, lung volumes above adduction to the minimum that will still result in glottal
FRC are preferred. Because much higher lung volumes closure during the closed phase. This generates a source
are used in singing than in speech, singers need to deal spectrum with strong high partials and a strong funda-
with much greater exhalatory elasticity forces. mental. This type of phonation has been called flow
phonation or resonant voice. In nonclassical singing, on
Voice Source. The airflow waveform of the voice the other hand, pressed phonation is occasionally used
source is characterized by quasi-triangular pulses, pro- for high, loud tones, apparently for expressive purposes.
duced when the vocal folds open the glottis, and fol- A main characteristic of classical singing is the vi-
lowed by horizontal portions near or at zero airflow, brato. It corresponds to a quasi-periodic modulation of
produced when the folds close the glottis more or less F0 (Fig. 2). The pitch perceived from a vibrato tone
completely (Fig. 1). corresponds to its average F0. The modulation fre-
quency, mostly between 5 and 7 Hz, is generally referred
to as the vibrato rate and is rather constant for a singer.
Curiously enough, however, it tends to increase some-
what toward the end of tones. The peak-to-peak modu-
lation range is varied between nil and less than two
semitones, or F0 2 1=6 . With increasing age, singers’ vi-
brato rates tend to decrease by about one-half hertz per
decade of years, and vibrato extent tends to increase by
about 15 cent per decade.
The vibrato is generated by a rhythmical pulsation of
the cricothyroid muscles. When contracting, these mus-
cles cause a stretching of the vocal folds, and so raise F0.
The neural origin of this pulsation is not understood.
One possibility is that it emanates from a cocontraction
of the cricothyroid and vocalis muscles.
In speech, pitch is perceived in a continuous fashion,
such that a continuous variation in F0 is heard as a
continuous variation of pitch. In music, on the other
Figure 1. Typical flow glottogram showing transglottal airflow hand, pitch is perceived categorically, where the catego-
versus time. ries are scale tones or the intervals between them. Thus,
The Singing Voice 53
Figure 3. Spectra of the vowel [u] as spoken and sung by a

classically trained baritone singer.
Figure 2. Example of vibrato. Its resonance frequency can be somewhere between 2.5
and 3 kHz. If this resonance is appropriately tuned, it
will provide a formant cluster.
the F0 continuum is divided logarithmically into a series A common method to achieve a wide pharynx seems
of bins, each of which corresponds to a scale tone. The to be to lower the larynx, which is typically observed in
width of each scale tone is approximately 6% wide, and classically trained singers. Lowering the larynx lengthens
the center frequency of a scale tone is 2 1=12 higher than the pharynx cavity. As F2 of the vowel [i] is mainly
its lower neighbor. dependent on the pharynx length, it will be lowered by
The demands for pitch accuracy are quite high in a lowering of the larynx. In nonclassical singing, more
singing. Experts generally find that a tone is out of tune speechlike formant frequencies are used, and no singer’s
when it deviates from the target F0 by more than about formant is produced.
7 cent, or 0.07 of a semitone interval. This corresponds The center frequency of the singer’s formant varies
to less than one-tenth of a typical vibrato extent. The between voice classifications. On average, it tends to be
target F0 generally agrees with equal-tempered tuning, about 2.4, 2.6, and 2.8 kHz for basses, baritones, and
where the interval between adjacent scale tones corre- tenors, respectively. These di¤erences, which contribute
sponds to the F0 ratio of 1 : 2 1=12 . However, apparently significantly to the characteristic voice qualities of these
depending on the musical context, the target F0 for a classifications, probably reflect di¤erences in vocal tract
scale tone may deviate from its value in equal-tempered length.
tuning by about a tenth of a semitone interval. The singer’s formant spectrum peak is particularly
prominent in bass and baritone singers. In tenors and
Resonance. The formant frequencies in classical sing- altos it is less prominent and in sopranos it is generally
ing di¤er between voice classifications. Thus, basses have not observable. Thus, sopranos do not seem to produce
lower formant frequencies than baritones, who have a singer’s formant.
lower formant frequencies than tenors. These di¤erences The singer’s formant seems to serve the purpose of
probably reflect di¤erences in vocal tract length. The enhancing the voice when accompanied by a loud or-
formant frequencies also deviate from those typically chestra. The long-term-average spectrum of a symphonic
found in speech. For example, the second formant of orchestra typically shows a peak near 0.5 kHz followed
the vowel [i] is generally considerably lower in classical by a descent of about 9 dB per octave toward higher
singing than in speech, such that the vowel quality frequencies (Fig. 4). Therefore, the sound level of an or-
approaches that of the vowel [y]. chestra is comparatively low in the frequency region of
These formant frequency deviations are related to the singer’s formant, so that the singer’s formant makes
the singer’s formant, a marked spectrum envelope peak the singer’s voice easier to perceive. As the singer’s for-
between approximately 2.5 and 3 kHz, that appears in mant is produced mainly by vocal tract resonance, it can
all voiced sounds produced by classically trained male be regarded as a manifestation of vocal economy. It does
singers and altos (Fig. 3). It is produced by a clustering not appear in nonclassical singing, where the soloist is
of F3, F4, and F5. This clustering seems to be achieved provided with a sound amplification system that takes
by combining a narrow opening of the larynx tube with care of audibility problems. Also, it is absent or much
a wide pharynx. If the area ratio between the larynx tube less prominent among choral singers, whose voices are
opening and the pharynx approximates 1 : 6 or less, the supposed to blend, such that individual singers’ voices
larynx tube acts as a separate resonator in the sense that are di‰cult to discern.
its resonance frequency is rather insensitive to the cross- The approximate pitch range of a singer is about two
sectional area in the remaining parts of the vocal tract. octaves. Typical ranges for basses, baritones, tenors,
54 Part I: Voice
References
Sundberg, J. (1987). The science of the singing voice. DeKalb,
IL: Northern Illinois University Press.
Titze, I. (1994). Principles of voice production. Englewood
Cli¤s, NJ: Prentice Hall.
Vocal Hygiene
Vocal hygiene has been part of the voice treatment liter-

ature continuously since the publication of Mackenzie’s
The Hygiene of Vocal Organs, in 1886. In it the author, a
noted otolaryngologist, described many magical pre-
scriptions used by famous singers to care for their voices.
Figure 4. Long-term-average spectrum of an orchestra with In 1911, a German work by Barth included a chapter
and without a tenor soloist (heavy solid and dashed curves). with detailed discussion of vocal hygiene. The ideas
The thin solid curve shows a rough approximation of a corre- about vocal hygiene expressed in this book were similar
sponding analysis of neutral speech at conversational loudness. to those expressed in the current literature. Concern was
raised about the e¤ects of tobacco, alcohol, loud and
excessive talking, hormones, faulty habits, and diet on
altos, and sopranos are E2–E4 (82–330 Hz), G2–G4 the voice. Another classic text was Diseases and Injuries
(98–392 Hz), C3–C5 (131–523 Hz), F3–F5 (175– of the Larynx, published in 1942. The authors, Jackson
698 Hz), and C4–C6 (262–1047 Hz), respectively. This and Jackson, implicated various vocal abuses as the
implies that F0 is often higher than the typical value of primary causes of voice disorders, and cited rest and
F1 in some vowels. Singers, however, seem to avoid the refraint from the behavior as the appropriate treatment.
situation in which F0 is higher than F1. Instead, they Luchsinger and Arnold (1965) stressed the need for at-
increase F1 so that it is always higher than F0. For the tention to the physiological norm as the primary postu-
vowel [a], this is achieved by widening the jaw opening; late of vocal hygiene and preventive laryngeal medicine.
the higher the pitch, the wider the jaw opening. For Remarkably, these authors discussed the importance of
other vowels, singers seem first to reduce the tongue this type of attention not only for teachers and voice
constriction of the vocal tract, and resort to a widening professionals, but also for children in the classroom.
of the jaw opening when the e¤ect of this neutralization Subsequently, virtually all voice texts have addressed the
of the articulation fails to produce further increase of F1. issue of vocal hygiene.
Because F1 and F2 are decisive to the perception of Both the general public and professionals in numer-
vowels, the substantial departures from the typical for- ous disciplines commonly use the term hygiene. The 29th
mant frequency values in speech a¤ect vowel identifica- edition of Dorland’s Medical Dictionary defines it as ‘‘the
tion. Yet vowel identification is surprisingly successful science of health and its preservation.’’ Thus, we can
also at high F0. Most isolated vowel sounds can be take vocal hygiene to mean the science of vocal health
correctly identified up to an F0 of about 500 Hz. Above and the proper care of the vocal mechanism. Despite
this frequency, identification deteriorates quickly and long-held beliefs about the value of certain activities
remains low for most vowels at F0 higher than 700 Hz. most frequently discussed as constituting vocal hygiene,
In reality, however, text intelligibility can be greatly the science on which these ideas are based was, until
improved by consonants. quite recently, more implied and deduced than specific.
Patient education and vocal hygiene are both integral
Health Risks. Because singers are extremely dependent to voice therapy. Persons who are educated about the
on the perfect functioning of their voices, they often need structure and function of the phonatory mechanism are
medical attention. A frequent origin of their voice dis- better able to grasp the need for care to restore it to
orders is a cold, which typically causes dryness of the health and to maintain its health. Thus, the goal of pa-
glottal mucosa. This disturbs the normal function of the tient education is understanding. Vocal hygiene, on the
vocal folds. Also relevant would be their use of high other hand, focuses on changing an individual’s vocal
subglottal pressures. An inappropriate vocal technique, behavior. In some instances, a therapy program may be
sometimes associated with a habitually exaggerated based completely on vocal hygiene. More frequently,
glottal adduction or with singing in a too high pitch however, vocal hygiene is but one spoke in a total ther-
range, also tends to cause voice disorders, which in apy program that also includes directed instruction in
some cases may lead to developing vocal nodules. Such voice production techniques.
nodules generally disappear after voice rest, and surgical Although there are commonalities among vocal hy-
treatment is mostly considered inappropriate. giene programs regardless of the pathophysiology of
See also voice acoustics. the voice disorder, that pathophysiology should dictate
some specific di¤erences in the vocal hygiene approach.
—Johan Sundberg In addition to the nature of the voice disorder, factors
Vocal Hygiene 55
such as timing of the program relative to surgery (i.e., nating. Many questions remain in this area, such as the
pre or post), and whether the vocal hygiene training most e¤ective method of hydration.
stands alone or is but one aspect of a more extensive Other major components of vocal hygiene programs
therapy process, must also inform specific aspects of the are reducing vocal intensity by eliminating shouting or
vocal hygiene program. speaking above high ambient noise levels, avoiding fre-
Hydration and environmental humidification are quent throat clearing and other phonotraumatic behav-
particularly important to the health of the voice, and, as iors. The force of collision (or impact) of the vocal folds
such, should be a focus of all vocal hygiene programs. A has been described by Titze (1994) as proportional to
number of authors have studied the e¤ects of hydration vibrational amplitude and vibrational frequency. This
and dehydration of vocal fold mucosa and viscosity of was explored further in phonation by Jiang and Titze
the folds on phonation threshold pressure (PTP). (PTP (1994), who showed that intraglottal contact increases
is the minimum subglottal pressure required to initiate with increased vocal fold adduction. Titze (1994) theor-
and maintain vocal fold vibration.) For example, Ver- ized that if a vibrational dose reaches and exceeds a
dolini et al. (1990, 1994) studied PTP in normal speakers threshold level in a predisposed individual, tissue injury
subjected to hydrating and dehydrating conditions. Both will probably ensue. This lends support to the wide-
PTP and self-perceived vocal e¤ort were lower after spread belief that loud and excessive voice use, and in-
hydration. Jiang, Ng, and Hanson (1999) showed that deed other forms of harsh vocal productions, can cause
vocal fold oscillations cease in a matter of minutes in vocal fold pathology. It also supports the view that
fresh excised canine larynges deprived of humidified air. teachers and others in vocally demanding professions are
Rehydration by dripping saline onto the folds restored prone to vibration overdose, with inadequate recovery
the oscillations, demonstrating the need for hydration time. Thus, the stage is set for cyclic tissue injury, repair,
and surface moisture for lower PTP. and eventual voice or tissue change.
In one light, viscosity is a measure of the resistance The complexity of vocal physiology suggests a direct
to deformation of the vocal fold tissue. Viscosity is connection between viscosity and hydration, phona-
increased by hydration and decreased by drying—hence tion threshold pressure, and the e¤ects of collision and
the importance of vocal fold hydration to ease of pho- shearing forces. The greater the viscosity of vocal fold
nation. Moreover, it appears that the body has robust tissue, the higher the PTP that is required and the greater
cellular and neurophysiological mechanims to conserve is the internal friction or shearing force in the vocal fold.
the necessary hydration of airway tissues. In a study of These e¤ects may explain vocal fold injuries, particularly
patients undergoing dialysis, rapid removal of significant with long-term vocal use that involves increased im-
amounts of body water increased PTP and was asso- pact stress on the tissues during collision and shearing
ciated with symptoms of mild vocal dysfunction in some stresses (Jiang and Titze, 1994). Thus, issues of collision
patients. Restoration of the body fluid reversed this and the impact forces associated with increased loud-
trend (Fisher et al., 2001). Jiang, Lin, and Hanson ness and phonotraumatic vocalization are appropriately
(2000) noted the presence of mucous glands in the tissue addressed in vocal hygiene programs and in directed
of the vestibular folds and observed that these glands therapy approaches.
distribute a very important layer of lubricating mucus to Reflux, both gastroesophageal and laryngopharyn-
the surface of the vocal folds. Environmental hydration geal, a¤ects the health of the larynx and pharynx.
facilitates the vocal fold vibratory behavior, mainly be- Gastric acid and gastric pepsin, the latter implicated
cause of the increased water content in this mucous layer in the delayed healing of submucosal laryngeal injury
and in the superficial epithelium. The viscosity of secre- (Koufman, 1991), have been found in refluxed material.
tions is thickened with ingestion of foods or medications Laryngopharyngeal reflux has been implicated in a long
with a drying or diuretic e¤ect, radiation therapy, in- list of laryngeal conditions, including chronic or inter-
adequate fluid intake, and the reduction in mucus pro- mittent dysphonia, vocal fatigue, chronic throat clearing,
duction in aging. reflux laryngitis, vocal nodules, and malignant tissue
Thus, there appear to be a number of mechanisms, changes. Treatment may include dietary changes, life-
not yet fully understood, by which the hydration of vocal style modifications, and medication. Surgery is usually a
fold mucosa and the viscosity of the vocal folds are treatment of last resort. Ca¤eine, tobacco, alcohol, fried
directly involved in the e¤ort required to initiate and foods, and excessive food intake have all been implicated
maintain phonation. Both environmental humidity and in exacerbating the symptoms of laryngopharyngeal
surface hydration are important physiological factors reflux. Thus, vocal hygiene programs that address
in determining the energy needed to sustain phona- healthy diet and lifestyle and that include reflux pre-
tion. External or superficial hydration may occur as a cautions appear to be well-founded. It is now common
by-product of drinking large amounts of water, which practice for patients scheduled for any laryngeal surgery
increases the secretions in and around the larynx and to be placed on a preoperative course of antireflux med-
lowers the viscosity of those secretions. Steam inhala- ication that will be continued through the postoperative
tion and environmental humidification further hydrate healing stage. Although this is clearly a medical treat-
the surface of the vocal folds, and mucolytic agents may ment, the speech-language pathologist should provide
decrease the viscosity of the vocal folds. Clearly, the information and supportive guidance through vocal
lower the phonation threshold pressure, the less air hygiene instruction to ensure that patients follow the
pressure is required and the greater is the ease of pho- prescribed protocol.
56 Part I: Voice
An unanswered question is whether a vocal hygiene pepsin in the development of laryngeal injury. Laryngo-
therapy program alone is adequate treatment for vocal scope, 101(Suppl. 53), 1–78.
problems. Roy et al. (2001) found no significant im- Luchsinger, R., and Arnold, G. E. (1965). Voice-speech-
provement in a group of teachers with voice disorders language. Belmont, CA: Wadsworth.
Mackenzie, M. (1886). The hygiene of vocal organs. London:
after a course of didactic training in vocal hygiene.
Macmillan.
Teachers who received a directed voice therapy program Roy, N., Gray, S. D., Simon, M., Dove, H., Corbin-Lewis, K.,
(Vocal Function Exercises), however, experienced sig- and Stemple, J. C. (2001). An evaluation of the e¤ects of
nificant improvement. It should be noted that the vocal two treatment approaches for teachers with voice disorders:
hygiene program used in this study, being purely didactic A prospective randomized clinical trial. Journal of Speech,
and requiring no activity on the part of the participants, Language, and Hearing Research, 44, 286–296.
might more appropriately be described as a patient edu- Roy, N., Weinrich, V., Gray, S. D., Tanner, K., Toledo, S. W.,
cation program. Chan (1994) reported that a group Dove, H., et al. (2001). Voice amplification versus vocal
of non-voice-disordered kindergarten teachers did show hygiene instruction for teachers with voice disorders: A
positive behavioral changes following a program of treatment outcome study. Journal of Speech, Language, and
vocal education and hygiene. In another study, Roy
Titze, I. R. (1994). Principles of voice production. Englewood
et al. (2002) examined the outcome of voice amplifica- Cli¤s, NJ: Prentice Hall.
tion versus vocal hygiene instruction in a group of voice- Verdolini, K., Titze, I. R., and Fennell, A. (1994). Dependence
disordered teachers. Most pairwise contrasts directly of phonatory e¤ort on hydration level. Journal of Speech,
comparing the e¤ects of the two approaches failed to Language, and Hearing Research, 37, 1001–1007.
reach significance. Although the vocal hygiene group Verdolini-Marsten, K., Titze, I., and Druker, D. (1990).
showed changes in the desired direction on all dependent Changes in phonation threshold pressure with induced con-
measures, the study results suggest that the benefits of ditions of hydration. Journal of Voice, 4, 142–151.
amplification may have exceeded those of vocal hygiene
instruction. Of note, the amplification group reported
higher levels of extraclinical compliance with the pro-
gram than the vocal hygiene group. This bears out the Vocal Production System: Evolution
received wisdom that it is easier to take a pill—or wear
an amplification device—than to change habits.
Although study results are mixed, there is insu‰cient The human vocal production system is similar in broad
evidence to suggest that vocal hygiene instruction be outline to that of other terrestrial vertebrates. All tetra-
abandoned. The underlying rationale for vocal hygiene pods (nonfish vertebrates: amphibians, reptiles, birds,
is su‰ciently compelling that a vocal hygiene program and mammals) inherit from a common ancestor three
should continue to be a component of a broad-based key components: (1) a respiratory system with lungs;
voice therapy intervention. (2) a larynx that acts primarily as a quick-closing gate
to protect the lungs, and often secondarily to produce
—Janina K. Casper
sound; and (3) a supralaryngeal vocal tract which filters
this sound before emitting it into the environment. De-
References spite this shared plan, a wide variety of interesting mod-
Barth, E. (1911). Einfuhrung in die Physiologie, Pathologie und ifications of the vocal production system are known. The
Hygiene der Menschlichen Stimme und Sprache. Leipzig: functioning of the basic tetrapod vocal production sys-
Thieme. tem can be understood within the theoretical framework
Chan, R. W. (1994). Does the voice improve with vocal hy- of the myoelastic-aerodynamic and source/filter theories
giene education? A study of some instrumental voice mea- familiar to speech scientists.
sures in a group of kindergarten teachers. Journal of Voice, The lungs and attendant respiratory musculature
8, 279–291.
provide the air stream powering phonation. In primi-
Fisher, K. V., Ligon, J., Sobecks, J. L., and Rose, D. M.
(2001). Phonatory e¤ects of body fluid removal. Journal of tive air-breathing vertebrates, the lungs were inflated
Speech, Language, and Hearing Research, 44, 354–367. by rhythmic compression of the oral cavity, or ‘‘buccal
Jackson, C., and Jackson, C. L. (1942). Diseases and injuries of pumping,’’ and this system is still used by lungfish and
the larynx. New York: Macmillan. amphibians (Brainerd and Ditelberg, 1993). Inspiration
Jiang, J., Lin, E., and Hanson, D. (2000). Vocal fold physiol- by active expansion of the thorax evolved later, in the
ogy. Otolaryngologic Clinics of North America, 1, 699–718. ancestor of reptiles, birds, and mammals. This was
Jiang, J., Ng, J., and Hanson, D. (1999). The e¤ects of rehy- powered originally by the intercostal muscles (as in liz-
dration on phonation in excised canine larynges. Journal of ards or crocodilians) and later (in mammals only) by a
Voice, 13, 51–59. muscular diaphragm (Liem, 1985). Phonation is typi-
Jiang, J., and Titze, I. R. (1994). Measurement of vocal fold
intraglottal pressure and impact stress. Journal of Voice, 8,
cally powered by passive deflation of the elastic lungs,
132–144. or in some cases by active compression of the hypaxial
Koufman, J. A. (1991). The otolaryngologic manifestations musculature. In many frogs, air expired from the lungs
of gastroesophageal reflux disease: A clinical investigation during phonation is captured in an elastic air sac, which
of 225 patients using ambulatory 24 hr pH monitoring then deflates, returning the air to the lungs. This allows
and an experimental investigation of the role of acid and frogs to produce multiple calls from the same volume
Vocal Production System: Evolution 57
of air. The inflated sac also increases the e‰ciency with larynx of males expands to fill the entire thoracic cavity,
which sound is radiated into the environment (Gans, pushing the heart, lungs, and trachea down into the ab-
1973). domen (Schneider, Kuhn, and Keleman, 1967). A simi-
The lungs are protected by a larynx in all tetrapods. lar though less impressive increase in larynx dimensions
This structure primitively includes a pair of barlike car- is observed in human males and is partially responsible
tilages that can be separated (for breathing) or pushed for the voice change at puberty (Titze, 1989).
together (to seal the airway) (Negus, 1949). Expiration Sounds created by the larynx must pass through the
through the partially closed larynx creates a turbulent air contained in the pharyngeal, oral, and nasal cavities,
hiss—perhaps the most primitive vocalization, which collectively termed the supralaryngeal vocal tract or
virtually all tetrapods can produce. However, more so- simply vocal tract. Like any column of air, this air has
phisticated vocalizations became possible after the inno- mass and elasticity and vibrates preferentially at certain
vation of elastic membranes within the larynx, the vocal resonant frequencies. Vocal tract resonances are termed
cords, which are found in most frogs, vocal reptiles formants (from the Latin formare, to shape): they act as
(geckos, crocodilians), and mammals. Although the filters to shape the spectrum of the vocal output. Because
larynx in these species can support a wide variety of all tetrapods have a vocal tract, all have formants. For-
vocalizations, its primary function as a protective gate- mant frequencies are determined by the length and shape
way appears to have constrained laryngeal anatomy. In of the vocal tract. Because the vocal tract in mammals
birds, a novel phonatory structure called the syrinx rests within the confines of the head, and skull size and
evolved at the base of the trachea. Dedicated to vocal body size are tightly linked (Fitch, 2000b), formant fre-
production, and freed from the necessity of tracheal quencies provide a possible indicator of body size not
protection, the avian syrinx is a remarkably diverse as easily ‘‘faked’’ as the laryngeal cue of fundamental
structure underlying the great variety of bird sounds frequency. Large animals have long vocal tracts and
(King, 1989). low formants. Together with demonstrations of formant
Although our knowledge of animal phonation is still perception by nonhuman animals (Sommers et al., 1992;
limited, phonation in nonhumans appears to follow the Fitch and Kelley, 2000), this suggests that formants
principles of the myoelastic-aerodynamic theory of hu- may have provided a cue to size in primitive vertebrates
man phonation. The airflow from the lungs sets the vo- (Fitch, 1997). However, it is possible to break the ana-
cal folds (or syringeal membranes) into vibration, and tomical link between vocal tract length and body size,
the rate of vibration is passively determined by the size and some intriguing morphological adaptations have
and tension of these tissues. Vibration at a particular arisen to elongate the vocal tract (presumably resulting
frequency does not typically require neural activity at from selection to sound larger; Fig. 1C–E ). Elongations
that frequency. Thus, relatively normal phonation can of the nasal vocal tract are seen in the long nose of
be obtained by blowing moist air through an excised male proboscis monkeys or the impressive nasal crests
larynx, and rodents and bats can produce ultrasonic of hadrosaur dinosaurs (Weishampel, 1981). Vocal tract
vocalizations at 40 kHz and higher (Suthers and Fattu, elongation can also be achieved by lowering the larynx;
1973). However, cat purring relies on an active tensing this is seen in extreme form in the red deer Cervus ela-
of the vocal fold musculature at the 20–30 Hz funda- phus, which retract the larynx to the sternum during ter-
mental frequency of the purr (Frazer Sissom, Rice, and ritorial roaring (Fitch and Reby, 2001). Again, a similar
Peters, 1991). During phonation, the movements of the change occurs in human males at puberty: the larynx
vocal folds can be periodic and stable (leading to tonal descends slightly to give men a longer vocal tract and
sounds) or highly aperiodic or even chaotic (e.g., in lower formants than same-sized women (Fitch and
screams); while such aperiodic vocalizations are rare Giedd, 1999).
in nonpathological human voices, they can be important Human speech is thus produced by the same conser-
in animal vocal repertoires (Fitch, Neubauer, and Her- vative vocal production system of lungs, larynx, and vo-
zel, 2002). cal tract shared by all tetrapods. However, the evolution
Because the length of the vocal folds determines the of the human speech apparatus involved several impor-
lowest frequency at which they could vibrate (Titze, tant changes. One was the loss of laryngeal air sacs. All
1994), with long folds producing lower frequencies, one great apes posses large balloon-like sacs that open into
might expect that a low fundamental would provide a the larynx directly above the glottis (Negus, 1949; Schön
reliable indication of large body size. However, the size Ybarra, 1995). Parsimony suggests that the common
of the larynx is not tightly constrained by body size. ancestor of apes and humans also had such air sacs,
Thus, a huge larynx has independently evolved in many which were subsequently lost in human evolution. How-
mammal species, probably in response to selection for ever, air sacs are occasionally observed in humans in
low-pitched voices (Fig. 1A, B). For example, in howler pathological situations, a laryngocele is a congenital or
monkeys (genus Alouatta) the larynx and hyoid have acquired air sac that is attached to the larynx through
grown to fill the space between mandible and sternum, the laryngeal ventricle at precisely the same location as
giving these small monkeys remarkably impressive and in the great apes (Stell and Maran, 1975). Because the
low-pitched voices (Kelemen and Sade, 1960). The most function of air sacs in ape vocalizations is not under-
extreme example of laryngeal hypertrophy is seen in the stood, the significance of their loss in human evolution is
hammerhead bat Hypsignathus monstrosus, in which the unknown.
58 Part I: Voice
Figure 1. Examples of unusual vocal

adaptations among vertebrates (not
to scale). A, Hammerheaded bat,
Hypsignathus monstrosus, has a huge
larynx (gray) enlarged to fill the
thoracic cavity. B, Howler monkeys
Alouatta spp. have the largest rela-
tive larynx size among primates,
which together with the enlarged
hyoid fills the space beneath the
mandible (larynx and hyoid shown
in gray). C, Male red deer Cervus
elaphus have a permanently de-
scended larynx, which they lower to
the sternum when roaring, resulting
in an extremely elongated vocal
tract (shown in gray). D, Humans—
Homo sapiens—have a descended
larynx, resulting in an elongated
‘‘two-tube’’ vocal tract (shown in
gray). E, The now extinct duck-
billed dinosaur Parasaurolophus
had a hugely elongated nasal cavity
(shown in gray) that filled the bony
crest adorning the skull.
A second change in the vocal production system dur- may be associated with an increase in breathing control
ing human evolution was the descent of the larynx from necessary for singing and speech in our own species.
its normal mammalian position high in the throat to See also vocalization, neural mechanisms of.
a lower position in the neck (Negus, 1949). In the 1960s,
—W. Tecumseh Fitch
speech scientists realized that this ‘‘descended larynx’’
allows humans to produce a wider variety of formant
patterns than would be possible with a high larynx
References
(Lieberman, Klatt, and Wilson, 1969). In particular, the Brainerd, E. L., and Ditelberg, J. S. (1993). Lung ventilation in
‘‘point vowels’’ /i, a, u/ seem to be impossible to attain salamanders and the evolution of vertebrate air-breathing
unless the tongue body is bent and able to move freely mechanisms. Biological Journal of the Linnean Society, 49,
within the oropharyngeal cavity. Given the existence 163–183.
of these vowels in virtually all languages (Maddieson, Fitch, W. T. (1997). Vocal tract length and formant fre-
quency dispersion correlate with body size in rhesus ma-
1984), speech typical of modern humans appears to re-
caques. Journal of the Acoustical Society of America, 102,
quire a descended larynx. Of course, all mammals can 1213–1222.
produce a diversity of sounds, which could have served Fitch, W. T. (2000a). The phonetic potential of nonhuman
a simpler speech system. Also, most mammals appear vocal tracts: Comparative cineradiographic observations of
to lower the larynx during vocalization (Fitch, 2000a), vocalizing animals. Phonetica, 57, 205–218.
lessening the gap between humans and other animals. Fitch, W. T. (2000b). Skull dimensions in relation to body size
Despite these caveats, the descended larynx is clearly an in nonhuman mammals: The causal bases for acoustic al-
important component of human spoken language (Lie- lometry. Zoology, 103, 40–58.
berman, 1984). The existence of nonhuman mammals Fitch, W. T., and Giedd, J. (1999). Morphology and develop-
with a descended larynx raises the possibility that this ment of the human vocal tract: A study using magnetic
resonance imaging. Journal of the Acoustical Society of
trait initially arose to exaggerate size in early hominids America, 106, 1511–1522.
and was later coopted for use in speech (Fitch and Reby, Fitch, W. T., and Kelley, J. P. (2000). Perception of vocal tract
2001). Finally, recent fossils suggest that an expansion resonances by whooping cranes, Grus americana. Ethology,
of the thoracic intervertebral canal occurred during the 106, 559–574.
evolution of Homo some time after the earliest Homo Fitch, W. T., Neubauer, J., and Herzel, H. (2002). Calls out of
erectus (MacLarnon and Hewitt, 1999). This change chaos: The adaptive significance of nonlinear phenomena in
Vocalization, Neural Mechanisms of 59
mammalian vocal production. Animal Behaviour, 63, 407– Vocalization, Neural Mechanisms of
418.
Fitch, W. T., and Reby, D. (2001). The descended larynx is not
uniquely human. Proceedings of the Royal Society, Biologi- The capacity for speech and language separates humans
cal Sciences, 268, 1669–1675.
from other animals and is the cornerstone of our intel-
Frazer Sissom, D. E., Rice, D. A., and Peters, G. (1991). How
cats purr. Journal of Zoology (London), 223, 67–78. lectual and creative abilities. This capacity evolved from
Gans, C. (1973). Sound production in the Salientia: Mecha- rudimentary forms of communication in the ancestors
nism and evolution of the emitter. American Zoologist, 13, of humans. By studying these mechanisms in animals
1179–1194. that represent stages of phylogenetic development, we
Kelemen, G., and Sade, J. (1960). The vocal organ of the can gain insight into the neural control of human speech
howling monkey (Alouatta palliata). Journal of Morphology, that is necessary for understanding many disorders of
107, 123–140. human communication.
King, A. S. (1989). Functional anatomy of the syrinx. In A. S. Vocalization is an integral part of speech and is
King and J. McLelland (Eds.), Form and function in birds widespread in mammalian aural communication sys-
(pp. 105–192). New York: Academic Press.
tems. The limbic system, a group of neural structures
Lieberman, P. (1984). The biology and evolution of language.
Cambridge, MA: Harvard University Press. controlling motivation and emotion, also controls
Lieberman, P. H., Klatt, D. H., and Wilson, W. H. (1969). Vocal most mammalian vocalizations. Although there are little
tract limitations on the vowel repertoires of rhesus monkey supporting empirical data, many human emotional
and other nonhuman primates. Science, 164, 1185–1187. vocalizations probably involve the limbic system. This
Liem, K. F. (1985). Ventilation. In M. Hildebrand (Ed.), discussion considers the limbic system and those neural
Functional vertebrate morphology (pp. 185–209). Cam- mechanisms thought to be necessary for normal speech
bridge, MA: Belknap Press/Harvard University Press. and language to occur.
MacLarnon, A., and Hewitt, G. (1999). The evolution of The anterior cingulate gyrus (ACG), which lies on the
human speech: The role of enhanced breathing control. mesial surface of the frontal cortex just above and ante-
American Journal of Physical Anthropology, 109, 341–363.
rior to the genu of the corpus callosum, is considered
Maddieson, I. (1984). Patterns of sounds. Cambridge, UK:
Cambridge University Press. part of the limbic system (Fig. 1). Electrical stimulation
Negus, V. E. (1949). The Comparative anatomy and physiology of the ACG in monkeys elicits vocalization and auto-
of the larynx. New York: Hafner. nomic responses (Jürgens, 1994). Monkeys become mute
Schneider, R., Kuhn, H.-J., and Kelemen, G. (1967). Der Lar- when the ACG is lesioned, and single neurons in the
ynx des männlichen Hypsignathus monstrosus Allen, 1861 ACG become active with vocalization or in response to
(Pteropodidae, Megachiroptera, Mammalia). Zeitschrift für vocalizations from conspecifics (Sutton, Larson, and
wissenschaftliche Zoologie, 175, 1–53. Lindeman, 1974; Müller-Preuss, 1988; West and Larson,
Schön Ybarra, M. (1995). A comparative approach to the 1995). Electrical stimulation of the ACG in humans may
nonhuman primate vocal tract: Implications for sound pro- also result in oral movements or postural distortions
duction. In E. Zimmerman and J. D. Newman (Eds.), Cur-
representative of an ‘‘archaic’’ level of behavior (Brown,
rent topics in primate vocal communication (pp. 185–198).
New York: Plenum Press. 1988). Damage to the ACG in humans results in akinetic
Sommers, M. S., Moody, D. B., Prosen, C. A., and Stebbins, mutism that is accompanied by open eyes, a fixed gaze,
W. C. (1992). Formant frequency discrimination by Japa- lack of limb movement, lack of apparent a¤ect, and
nese macaques (Macaca fuscata). Journal of the Acoustical nonreactance to painful stimuli (Jürgens and von
Society of America, 91, 3499–3510. Cramon, 1982). These symptoms reflect a lack of drive
Stell, P. M., and Maran, A. G. D. (1975). Laryngocoele. Jour- to initiate vocalization and many other behaviors
nal of Laryngology and Otology, 89, 915–924. (Brown, 1988). During recovery, a patient’s ability to
Suthers, R. A., and Fattu, J. M. (1973). Mechanisms of sound communicate gradually returns, first as a whisper, then
production in echolocating bats. American Zoologist, 13, with vocalization. However, the vocalizations lack pro-
1215–1226.
sodic features and are characterized as expressionless
Titze, I. R. (1989). Physiologic and acoustic di¤erences be-
tween male and female voices. Journal of the Acoustical (Jürgens and von Cramon, 1982). These observations
Society of America, 85, 1699–1707. support the view that the ACG controls motivation for
Titze, I. R. (1994). Principles of voice production. Englewood primitive forms of behavior, including prelinguistic
Cli¤s, NJ: Prentice Hall. vocalization.
Weishampel, D. B. (1981). Acoustic analysis of potential The ACG has reciprocal connections with several
vocalization in lambeosaurine dinosaurs (Reptilia: Ornith- cortical and subcortical sites, including a premotor area
ischia). Paleobiology, 7, 252–261. homologous with Broca’s area, the superior temporal
gyrus, the posterior cingulate gyrus, and the supplemen-
tary motor area (SMA) (Müller-Preuss, Newman, and
Jürgens, 1980). Electrical stimulation of the SMA elicits
vocalization, speech arrests, hesitation, distortions, and
palilalic iterations (Brown, 1988). Damage to the SMA
may result in mutism, poor initiation of speech, or
repetitive utterances, and during recovery, patients
often go through a period in which the production of
60 Part I: Voice
Figure 1. The lateral and mesial surface of the human brain.
propositional speech remains severely impaired but portant for vocalization. In specific cases, it is necessary
nonpropositional speech (e.g., counting) remains rela- to know whether mutism results from psychogenic or
tively una¤ected (Brown, 1988). Studies of other motor physiological mechanisms.
systems in primates suggest that the SMA is involved in The perisylvian cortex may control vocalization by
selection and initiation of a remembered motor act or one or more pathways to the medulla. The perisylvian
the correct sequencing of motor acts (Picard and Strick, cortex is reciprocally connected to the ACG, which
1997). Speech and vocalization fall into these categories, projects to midbrain mechanisms involved in vocaliza-
as both are remembered and require proper sequencing. tion. The perisylvian cortex also projects directly to the
Output from the SMA to other vocalization motor areas medulla, where motor neurons controlling laryngeal
is a subsequent stage in the execution of vocalization. muscles are located (Kuypers, 1958). These neuro-
The ACG is also connected with the perisylvian anatomical projections are supported by observations of
cortex of the left hemisphere (Müller-Preuss, Newman, a short time delay (13 ms) between stimulation of the
and Jürgens, 1980), an area important for speech and cortex and excitation of laryngeal muscles (Ludlow and
language. Damage to Broca’s area may cause total or Lou, 1996). The perisylvian cortex also includes the right
partial mutism, along with expressive aphasia or apraxia superior temporal gyrus and Heschl’s gyrus, which are
(Du¤y, 1995). However, vocalization is less frequently preferentially active for perception of complex tones,
a¤ected than speech articulation or language (Du¤y, singing, perception of one’s own voice, and perhaps
1995), and the e¤ect is usually temporary. In some cases, control of the voice by self-monitoring auditory feed-
aphonia may arise from widespread damage, and recov- back (Perry et al., 1999; Belin et al., 2000).
ery following therapy may suggest a di¤use, motiva- The other widely studied limbic system structure
tional, or psychogenic etiology (Sapir and Aronson, known for its role in vocalization, is the midbrain peri-
1987). Mutism seems to occur more frequently when the aqueductal gray (PAG) (Jürgens, 1994). Lesions of the
opercular region of the pre- and postcentral gyri is PAG in humans and animals lead to mutism (Jürgens,
damaged bilaterally or when the damage extends deep 1994). Electrical and chemical stimulation of the PAG in
into the cortex, a¤ecting the insula and possibly the many animal species elicits species-specific vocalizations
basal ganglia (Jürgens, Kirzinger, and von Cramon, (Jürgens, 1994). A variety of techniques have shown that
1982; Starkstein, Berthier, and Leiguarda, 1988; Du¤y, PAG neurons, utilizing excitatory amino acid transmit-
1995). Additional evidence linking the insula to vocal- ters (glutamate), activate or suppress coordinated groups
ization comes from recent studies of apraxia in humans of oral, facial, respiratory, and laryngeal muscles for
(Dronkers, 1996) and findings of increased blood flow in species-specific vocalization (Larson, 1991; Jürgens,
the insula during singing (Perry et al., 1999). Further 1994). The specific pattern of activation or suppression
research is necessary to determine whether the opercular is determined by descending inputs from the ACG
cortex alone or deeper structures (e.g., insula) are im- and limbic system, along with sensory feedback from the
Vocalization, Neural Mechanisms of 61
auditory, laryngeal and respiratory systems (Davis, the NRA (Holstege, 1989), or directly from the cerebral
Zhang, and Bandler, 1993; Ambalavanar et al., 1999). cortex (Kuypers, 1958). Sensory feedback for the reflex-
The resultant vocalizations convey the a¤ective state of ive control of laryngeal muscles flows through the supe-
the organism. Although this system probably is respon- rior and recurrent laryngeal nerves to the nucleus of the
sible for emotional vocalizations in humans, it is un- solitary tract and spinal nucleus of the trigeminal nerve
known whether this pathway is involved in normal, (Tan and Lim, 1992).
nonemotive speech and language. In summary, vocalization is controlled by two path-
Neurons of the PAG project to several sites in the ways, one that is primitive and found in most animals,
pons and medulla, one of which is the nucleus retro- and one that is found only in humans and perhaps an-
ambiguus (NRA) (Holstege, 1989). The NRA in turn thropoid apes (Fig. 2). The pathway found in all mam-
projects to the nucleus ambiguus (NA) and spinal cord mals extends from the ACG through the limbic system
motor neurons of the respiratory muscles. Lesions of the and midbrain PAG to medullary and spinal motor neu-
NRA eliminate vocalizations evoked by PAG stimula- rons, and seems to control most emotional vocalizations.
tion (Shiba et al., 1997), and stimulation of the NRA Voluntary vocal control, found primarily in humans,
elicits vocalization (Zhang, Bandler, and Davis, 1995). aided by sensory feedback, may be exerted through a
Thus, the NRA lies functionally between the PAG and direct pathway from the motor cortex to the medulla.
motor neurons of laryngeal and respiratory muscles The tendency for vocalization and human speech to
controlling vocalization and may play a role in coordi- be strongly a¤ected by emotions may suggest that all
nating these neuronal groups (Shiba et al., 1997; Luthe, vocalizations rely at least in part on the ACG-PAG
Hausler, and Jürgens, 2000). pathway. Details of how these two parallel pathways are
The PAG also projects to the parvocellular reticular integrated are unknown.
formation, where neurons modulate their activity with See also vocal production system: evolution.
temporal and acoustical variations in monkey calls, and
lesions alter the acoustical structure of vocalizations —Charles R. Larson
(Luthe, Hausler, and Jürgens, 2000). These data suggest
that the parvocellular reticular formation is important References
for the regulation of vocal quality and pitch. Ambalavanar, R., Tanaka, Y., Damirjian, M., and Ludlow,
Finally, the NA contains laryngeal motor neurons C. L. (1999). Laryngeal a¤erent stimulation enhances fos
and is crucial to vocalization. Motor neurons in the NA immunoreactivity in the periaqueductal gray in the cat.
control laryngeal muscles during vocalization, swallow- Journal of Comparative Neurology, 409, 411–423.
ing, and respiration (Yajima and Larson, 1993), and Belin, P., Zatorre, R. J., Ladaille, P., Ahad, P., and Pike, B.
lesions of the NA abolish vocalizations elicited by PAG (2000). Voice-selective areas in human auditory cortex. Na-
stimulation (Jürgens and Pratt, 1979). The NA receives ture, 403, 309–312.
projections either indirectly from the PAG, by way of Brown, J. (1988). Cingulate gyrus and supplementary motor
correlates of vocalization in man. In J. D. Newman (Ed.),
The physiological control of mammalian vocalization (pp.
227–243). New York: Plenum Press.
Davis, P. J., Zhang, S. P., and Bandler, R. (1993). Pulmonary
and upper airway a¤erent influences on the motor pattern
of vocalization evoked by excitation of the midbrain peri-
aqueductal gray of the cat. Brain Research, 607, 61–80.
Dronkers, N. F. (1996). A new brain region for coordinating
speech articulation. Nature, 384, 159–161.
Du¤y, J. R. (1995). Motor speech disorders. St. Louis: Mosby.
Holstege, G. (1989). An anatomical study on the final common
pathway for vocalization in the cat. Journal of Comparative
Neurology, 284, 242–252.
Jürgens, U. (1994). The role of the periaqueductal grey in vocal
behaviour. Behavioural Brain Research, 62, 107–117.
Jürgens, U., Kirzinger, A., and von Cramon, D. (1982). The
e¤ects of deep-reaching lesions in the cortical face area on
phonation: A combined case report and experimental mon-
key study. Cortex, 18, 125–140.
Jürgens, U., and Pratt, R. (1979). Role of the periaqueductal
grey in vocal expression of emotion. Brain Research, 167,
367–378.
Jürgens, U., and von Cramon, D. (1982). On the role of the
Figure 2. Block diagram and arrows indicating known connec- anterior cingulate cortex in phonation: A case report. Brain
tions between principal structures involved in vocalization. and Language, 15, 234–248.
Structures inside the dashed box are involved in vocalization Kuypers, H. G. H. M. (1958). Corticobulbar connexions to the
in other mammals as well as in humans. Structures outside pons and lower brain-stem in man. Brain, 81, 364–388.
the dashed box may be found only in humans and perhaps an- Larson, C. R. (1991). On the relation of PAG neurons to la-
thropoid apes. NA, nucleus ambiguous; NRA, nucleus retro- ryngeal and respiratory muscles during vocalization in the
ambiguus; PAG, periaqueductal gray; RF, reticular formation. monkey. Brain Research, 552, 77–86.
62 Part I: Voice
Ludlow, C. L., and Lou, G. (1996). Observations on human Brown, J. W., and Perecman, E. (1985). Neurological basis of
laryngeal muscle control. In P. J. Davis and N. H. Fletcher language processing. In J. K. Darby (Ed.), Speech and lan-
(Eds.), Vocal fold physiology: Controlling complexity and guage evaluation in neurology: Adult disorders (pp. 45–81).
chaos (pp. 201–218). San Diego: Singular Publishing Group. New York: Grune and Stratton.
Luthe, L., Hausler, U., and Jürgens, U. (2000). Neuronal Davis, P. J., and Nail, B. S. (1984). On the location and size of
activity in the medulla oblongata during vocalization: A laryngeal motoneurons in the cat and rabbit. Journal of
single-unit recording study in the squirrel monkey. Behav- Comparative Neurology, 230, 13–32.
ioural Brain Research, 116, 197–210. Gemba, H., Miki, N., and Sasaki, K. (1995). Cortical field
Müller-Preuss, P. (1988). Neural correlates of audio-vocal be- potentials preceding vocalization and influences of cere-
havior: Properties of anterior limbic cortex and related bellar hemispherectomy upon them in monkeys. Brain Re-
areas. In J. D. Newman (Ed.), The physiological control of search, 69, 143–151.
mammalian vocalization (pp. 245–262). New York: Plenum Jonas, S. (1987). The supplementary motor region and speech.
Press. In E. Perecman (Ed.), The frontal lobes revisited. New
Müller-Preuss, P., Newman, J. D., and Jürgens, U. (1980). York: IRBN Press.
Anatomical and physiological evidence for a relationship Jürgens, U. (1982). Amygdalar vocalization pathways in the
between the ‘‘cingular’’ vocalization area and the auditory squirrel monkey. Brain Research, 241, 189–196.
cortex in the squirrel monkey. Brain Research, 202, 307–315. Jürgens, U. (2000). Localization of a pontine vocalization-
Perry, D. W., Zatorre, R. J., Petrides, M., Alivisatos, B., controlling area. Journal of the Acoustical Society of Amer-
Meyer, E., and Evans, A. C. (1999). Localization of ica, 108, 1393–1396.
cerebral activity during simple singing. NeuroReport, 10, Jürgens, U., and Zwirner, P. (2000). Individual hemispheric
3453–3458. asymmetry in vocal fold control of the squirrel monkey.
Picard, N., and Strick, P. L. (1997). Activation on the Behavioural Brain Research, 109, 213–217.
medial wall during remembered sequences of reaching Kalia, M., and Mesulam, M. (1980). Brainstem projections of
movements in monkeys. Journal of Neurophysiology, 77, sensory and motor components of the vagus complex in the
2197–2201. cat: II. Laryngeal, tracheobronchial, pulmonary, cardiac,
Sapir, S., and Aronson, A. E. (1987). Coexisting psychogenic and gastrointestinal branches. Journal of Comparative Neu-
and neurogenic dysphonia: A source of diagnostic confusion. rology, 193, 467–508.
British Journal of Disorders of Communication, 22, 73–80. Kirzinger, A., and Jürgens, U. (1982). Cortical lesion e¤ects
Shiba, K., Umezaki, T., Zheng, Y., and Miller, A. D. (1997). and vocalization in the squirrel monkey. Brain Research,
The nucleus retroambigualis controls laryngeal muscle 233, 299–315.
activity during vocalization in the cat. Experimental Brain Kirzinger, A., and Jürgens, U. (1991). Vocalization-correlated
Research, 115, 513–519. single-unit activity in the brain stem of the squirrel monkey.
Starkstein, S. E., Berthier, M., and Leiguarda, R. (1988). Bi- Experimental Brain Research, 84, 545–560.
lateral opercular syndrome and crossed aphemia due to a Lamandella, J. T. (1977). The limbic system in human com-
right insular lesion: A clinicopathological study. Brain and munication. In J. Whitaker and H. A. Whitaker (Eds.),
Language, 34, 253–261. Studies in neurolinguistics (vol. 3, pp. 157–222). New York:
Sutton, D., Larson, C., and Lindeman, R. C. (1974). Neo- Academic Press.
cortical and limbic lesion e¤ects on primate phonation. Larson, C. R. (1975). E¤ects of cerebellar lesions on conditioned
Brain Research, 71, 61–75. monkey phonation. Unpublished doctoral dissertation, Uni-
Tan, C. K., and Lim, H. H. (1992). Central projection of the versity of Washington, Seattle.
sensory fibres of the recurrent laryngeal nerve of the cat. Larson, C. R., Wilson, K. E., and Luschei, E. S. (1983). Pre-
Acta Anatomica, 143, 306–308. liminary observations on cortical and brainstem mecha-
West, R. A., and Larson, C. R. (1995). Neurons of the anterior nisms of laryngeal control. In D. M. Bless and J. H. Abbs
mesial cortex related to faciovocal behavior in the awake (Eds.), Vocal fold physiology: Contemporary research and
monkey. Journal of Neurophysiology, 74, 1156–1169. clinical issues. San Diego, CA: College-Hill Press.
Yajima, Y., and Larson, C. R. (1993). Multifunctional prop- Ludlow, C. L., Schulz, G. M., Yamashita, T., and Deleyiannis,
erties of ambiguous neurons identified electrophysiologi- F. W.-B. (1995). Abnormalities in long latency responses to
cally during vocalization in the awake monkey. Journal of superior laryngeal nerve stimulation in adductor spasmodic
Neurophysiology, 70, 529–540. dysphonia. Annals of Otology, Rhinology, and Laryngology,
Zhang, S. P., Bandler, R., and Davis, P. J. (1995). Brain stem 104, 928–935.
integration of vocalization: Role of the nucleus retro- Marshall, R. C., Gandour, J., and Windsor, J. (1988). Selective
ambigualis. Journal of Neurophysiology, 74, 2500–2512. impairment of phonation: A case study. Brain and Lan-
guage, 35, 313–339.
Further Readings Müller-Preuss, P., and Jürgens, U. (1976). Projections from the
‘‘cingular’’ vocalization area in the squirrel monkey. Brain
Adametz, J., and O’Leary, J. L. (1959). Experimental mutism Research, 103, 29–43.
resulting from periaqueductal lesions in cats. Neurology, 9, Ortega, J. D., DeRosier, E., Park, S., and Larson, C. R.
636–642. (1988). Brainstem mechanisms of laryngeal control as
Aronson, A. E. (1985). Clinical voice disorders. New York: revealed by microstimulation studies. In O. Fujimura (Ed.),
Thieme-Stratton. Vocal physiology: Voice production, mechanisms and func-
Barris, R. W., and Schuman, H. R. (1953). Bilateral anterior tions (vol. 2, pp. 19–28). New York: Raven Press.
cingulate gyrus lesions. Neurology, 3, 44–52. Paus, T., Petrides, M., Evans, A. C., and Meyer, E. (1993).
Botez, M. I., and Barbeau, A. (1971). Role of subcortical Role of the human anterior cingulate cortex in the control
structures, and particularly of the thalamus, in the mecha- of oculomotor, manual and speech responses: A positron
nisms of speech and language. International Journal of emission tomography study. Journal of Neurophysiology,
Neurology, 8, 300–320. 70, 453–469.
Voice Acoustics 63
Penfield, W., and Welch, K. (1951). The supplementary motor is usually large compared with the impedance looking
area of the cerebral cortex. Archives of Neurology and Psy- into the vocal tract from the glottis. Thus, in most cases it
chiatry, 66, 289–317. is reasonable to represent the glottal source as a volume-
von Cramon, D., and Jürgens, U. (1983). The anterior cingu- velocity source that produces similar glottal pulses for
late cortex and the phonatory control in monkey and man.
di¤erent vocal tract configurations.
Neurosciences and Biobehavior Review, 7, 423–425.
Ward, A. A. (1948). The cingular gyrus: Area 24. Journal of This source Us (t) is filtered by the vocal tract, as
Neurophysiology, 11, 13–23. depicted in Figure 2. The volume velocity at the lips is
Zatorre, R. J., and Samson, S. (1991). Role of the right tem- Um (t), and the output sound pressure at a distance r
poral neocortex in retention of pitch in auditory short-term from the lips is pr (t). The magnitudes of the spectral
memory. Brain, 114, 2403–2417. components of Us ( f ) and pr ( f ) are shown below the
corresponding waveforms in Figure 2. When a non-
nasal vowel is produced, the vocal tract transfer function
Voice Acoustics T( f ), defined as the ratio Um ( f )=Us ( f ), is an all-pole
transfer function. The sound pressure pr is related to Um
by a radiation characteristic R( f ). The magnitude of this
The basic acoustic source during normal phonation is radiation characteristic is approximately
a waveform consisting of a quasi-periodic sequence of r 2pf
pulses of volume velocity Us (t) that pass between the jR( f )j ¼ ; (1)
vibrating vocal folds (Fig. 1A). For modal vocal fold 4pr
vibration, the volume velocity is zero in the time interval where r ¼ density of air. Thus we have
between the pulses, and there is a relatively abrupt dis-
continuity in slope at the time the volume velocity pr ( f ) ¼ Us ( f ) T( f ) R( f ): (2)
decreases to zero. The periodic nature of this waveform The magnitude of pr ( f ) can be written as
is reflected in the harmonic structure of the spectrum
r
(Fig. 1B). The amplitudes of the harmonics at high fre- jpr ( f )j ¼ jUs ( f ) 2pf j jT( f )j
: (3)
quencies decrease as 1= f 2 , where f ¼ frequency, i.e., at 4pr
about 12 dB per octave. The frequency of this source The expression jUs ( f ) 2pf j is the magnitude of the
waveform varies from one individual to another and Fourier transform of the derivative Us0 (t). Thus the out-
within an utterance. In the time domain (Fig. 1A), a put sound pressure can be considered to be the result of
change in frequency is represented in the number of filtering Us0 (t) by the vocal tract transfer function T( f ),
pulses per second; in the frequency domain (Fig. 1B), the multiplied by a constant. That is, the derivative Us0 (t)
frequency is represented by the spacing between the can be viewed as the e¤ective excitation of the vocal
harmonics. The shape of the individual pulses can also tract.
vary with the speaker, and during an utterance the shape For the ideal or modal volume-velocity waveform
can be modified depending on the position within the (Fig. 1), this derivative has the form shown in Figure 3A
utterance and the prominence of the syllable. (Fant, Liljencrants, and Lin, 1985). Each pulse has a
When the position and tension of the vocal folds are sequence of two components: (1) an initial smooth por-
properly adjusted, a positive pressure below the glottis tion where the waveform is first positive, then passes
will cause the vocal folds to vibrate. As the cross- through zero (corresponding to the peak of the pulse
sectional area of the glottis changes during a cycle of in Fig. 1), and then reaches a maximum negative value;
vibration, the airflow is modulated. During the open and (2) a second portion where the waveform returns
phase of the cycle, the impedance of the glottal opening abruptly to zero, corresponding to the discontinuity in
Figure 1. A, Idealized waveform of

glottal volume velocity Us (t) for
modal vocal fold vibration for an
adult male speaker. B, Spectrum
of waveform in A.
64 Part I: Voice
Figure 2. Schema showing how the

acoustic source at the glottis is fil-
tered by the vocal tract to yield a
volume velocity Um (t) at the lips,
which is radiated to obtain the sound
pressure pr (t) at some distance from
the lips. At the left of the figure both
the source waveform Us (t) and its
spectrum Us ( f ) are shown. At the
right is the waveform pr (t) and
spectrum pr ( f ) of the sound pres-
sure. (Adapted with permission from
Stevens, 1994.)
slope of the original waveform Us (t) at the time the vocal increases roughly as Ps3/2 (Ladefoged and McKinney,
folds come together. The principal acoustic excitation of 1963; Isshiki, 1964; Tanaka and Gould, 1983).
the vocal tract occurs at the time of this discontinuity. Changes in the configuration of the membranous and
For this ideal or modal derivative waveform, the spec- cartilaginous portions of the vocal folds relative to the
trum (Fig. 3B) at high frequencies decreases as 1= f , i.e., modal configuration can lead to changes in the wave-
at 6 dB/octave, reflecting the discontinuity at closure. form and spectrum of the glottal source. For some
For normal speech production, there are several ways speakers and for some styles of speaking, the vocal folds
in which the glottal waveform can di¤er from the modal and arytenoid cartilages are configured such that the
waveform (or its derivative). One obvious attribute is glottis is never completely closed during a cycle of vi-
the frequency f 0 of the glottal pulses, which is controlled bration, introducing several acoustic consequences.
primarily by changing the tension of the vocal folds, First, the speed with which the vocal folds approach the
although the subglottal pressure also influences the fre- midline is reduced; the e¤ect on the derivative waveform
quency, particularly when the folds are relatively slack Us0 (t) is that the maximum negative value is reduced
(Titze, 1989). Increasing or decreasing the subglottal (that is, it is less negative). Thus, the excitation of the
pressure Ps causes increases or decreases in the ampli- vocal tract and the overall amplitude of the output are
tude of the glottal pulses, or, more specifically, in the decreased. Second, there is continuing airflow through-
magnitude of the discontinuity in slope at the time of out the cycle. The inertia of the air in the glottis and
glottal closure. The magnitude of the glottal excitation supraglottal airways prevents the occurrence of the
Figure 3. A, Derivative Us0 (t) of the

modal volume-velocity waveform
in Figure 1. B, Spectrum of wave-
form in A.
Voice Acoustics 65
spectrum (Hanson, 1997). The three consequences just

described lead to a vowel for which the spectrum ampli-
tude A1 in the F1 range is reduced and the amplitudes of
the spectral prominences due to higher formants are
reduced relative to A1.
Still another consequence of glottal vibration with a
partially open glottis is that there is increased average
airflow through the glottis, as shown in the Us (t) wave-
form in Figure 4. This increased flow causes an increased
amplitude of noise generated by turbulence in the vicin-
ity of the glottis. Thus, in addition to the quasi-periodic
source, there is an aspiration noise source with a contin-
uous spectrum (Klatt and Klatt, 1990). Since the flow is
modulated by the periodic fluctuation in glottal area, the
noise source is also modulated. This type of phonation
Figure 4. Schematized representation of volume velocity wave- has been called ‘‘breathy-voiced.’’
form Us (t) and its derivative Us0 (t) when the glottis is never The aspiration noise source can be represented as an
completely closed within a cycle of vibration. equivalent acoustic volume-velocity source that is added
to the periodic source. In contrast to the periodic source,
the noise source has a spectrum that tilts upward with
abrupt discontinuity in Us0 (t) that occurs at the time of increasing frequency. It appears to have a broad peak at
vocal fold closure in modal phonation (Rothenberg, high frequencies, around 2–4 kHz (Stevens, 1998). Fig-
1981). Rather, there is a non-zero return phase following ure 5A shows estimated spectra of the periodic and noise
the maximum negative peak, during which Us0 (t) gradu- components that would occur during modal phonation.
ally returns to zero (Fant, Liljencrants, and Lin, 1985). The noise component is relatively weak, and is generated
The derivative waveform Us0 (t) then has a shape that is only during the open phase of glottal vibration. Phona-
schematized in Figure 4. The corresponding waveform tion with a more abducted glottis of the type represented
Us (t) is shown below the waveform Us0 (t). The spectral in Figure 4 leads to greater noise energy and reduced
consequence of this non-zero return phase is a reduction high-frequency amplitude of the periodic component,
in the high-frequency spectrum amplitude of Us0 (t) rela- and the noise component may dominate the periodic
tive to the low-frequency spectrum amplitude. A third component at high frequencies (Fig. 5B). With breathy-
consequence of a somewhat abducted glottal configura- voiced phonation, the individual harmonics correspond-
tion is an increased loss of acoustic energy from the ing to the periodic component may be obscured by the
vocal tract through the partially open glottis and into noise component at high frequencies. At low frequencies,
the subglottal airways. This energy loss a¤ects the vocal however, the harmonics are well defined, since the noise
tract filter rather than the source waveform. It is most component is weak in this frequency region.
apparent in the first formant range and results in an Figure 6 shows spectra of a vowel produced by a
increased bandwidth of F1, causing a reduction in A1, speaker with modal glottal vibration (A) and the same
the amplitude of the first-formant prominence in the vowel produced by a speaker with a somewhat abducted
Figure 5. Schematized representation of spectra of the e¤ec- the periodic component is represented by the amplitudes of
tive periodic and noise components of the glottal source for the harmonics. The spectrum of the noise is calculated with a
modal vibration (A) and breathy voicing (B). The spectrum of bandwidth of about 300 Hz.
66 Part I: Voice
Figure 6. A, Spectrum of the vowel /e/ produced by a male male speaker who apparently phonated with a glottal chink.
speaker with approximately modal phonation. Below the spec- The waveforms below are as described in A. The noise in the
trum are waveforms of this vowel before and after being fil- waveform in the F3 region (and above) obscures the individual
tered with a bandpass filter centered on F3, with a bandwidth glottal pulses. The spectra are from Hanson and Chuang
of 600 Hz. The individual glottal pulses as filtered by F3 of the (1999). See text.
vowel are evident. B, Spectrum of the vowel /e/ produced by a
glottis (B). Below the spectra are waveforms of the vowel Adduction of the vocal folds relative to their modal
before and after being filtered by a broad bandpass filter configuration can also lead to changes in the source
(bandwidth of 600 Hz) centered on the third-formant waveform. As the vocal folds are adducted, pressed
frequency F3. Filtered waveforms of this type have been voicing occurs, in which the glottal pulses are nar-
used to highlight the presence of noise at high fre- rower and of lower amplitude than in modal phonation,
quencies during phonation by a speaker with a breathy and may occur aperiodically (glottalization). In addi-
voice (Klatt and Klatt, 1990). The noise is also evident tion, phonation-threshold pressure increases, eventually
in the spectrum at high frequencies for the speaker of reaching a point where the folds no longer vibrate.
Figure 6B. Comparison of the two spectra in Figure 6 The above description of the glottal vibration pattern
also shows the greater spectrum tilt and the reduced for various degrees of glottal abduction and adduction
prominence of the first formant peak associated with suggests that there is an optimum glottal width that
an abducted glottis, as already noted. gives rise to a maximum in sound energy (Hanson and
As the average glottal area increases, the transglottal Stevens, 2002). This optimum configuration has been
pressure required to maintain vibration (phonation examined experimentally by Verdolini et al. (1998).
threshold pressure) increases. Therefore, for a given There are substantial individual and sex di¤erences in
subglottal pressure an increase in the glottal area can the degree to which the folds are abducted or adducted
lead to cessation of vocalfold vibration. during phonation. These di¤erences lead to significant
Voice Disorders in Children 67
Hanson, H. M., and Stevens, K. N. (2002). A quasiarticulatory

approach to controlling acoustic source parameters in a
Klatt-type formant synthesizer using HLsyn. Journal of the
Holmberg, E. B., Hillman, R. E., and Perkell, J. S. (1988).
Glottal airflow and transglottal air pressure measurements
for male and female speakers in soft, normal, and loud
voice. Journal of the Acoustical Society of America, 84, 511–
529.
Isshiki, N. (1964). Regulatory mechanism of voice intensity
variation. Journal of Speech and Hearing Research, 7, 17–
29.
Klatt, D. H., and Klatt, L. C. (1990). Analysis, synthesis, and
perception of voice quality variations among female and
male talkers. Journal of the Acoustical Society of America,
87, 820–857.
Figure 7. Distributions of H1*-A3*, a measure that reflects the Ladefoged, P., and McKinney, N. P. (1963). Loudness, sound
reduction of the high-frequency spectrum relative to the low- pressure, and subglottal pressure in speech. Journal of the
frequency spectrum, for male (black bars) and female (gray Acoustical Society of America, 35, 454–460.
bars) speakers. H1 is the amplitude of the first harmonic and Rothenberg, M. (1981). Acoustic interaction between the glot-
A3 is the amplitude of the strongest harmonic in the F3 peak. tal source and the vocal tract. In K. N. Stevens and M.
The asterisks indicate that corrections have been applied to H1 Hirano (Eds.), Vocal fold physiology (pp. 305–323). Tokyo:
and A3, as described in the text. (Adapted with permission University of Tokyo Press.
from Hanson and Chuang, 1999.) Stevens, K. N. (1994). Scientific substrates of speech produc-
tion. In F. D. Minifie (Ed.), Introduction to communica-
tion sciences and disorders (pp. 399–437). San Diego, CA:
di¤erences in the waveform and spectrum of the glottal Singular Publishing Group.
source, and consequently in the spectral characteristics Stevens, K. N. (1998). Acoustic phonetics. Cambridge, MA:
of vowels generated by these sources (Hanson, 1997; MIT Press.
Tanaka, S., and Gould, W. J. (1983). Relationships between
Hanson and Chuang, 1999). Similar observations have vocal intensity and noninvasively obtained aerodynamic
also been made by Holmberg, Hillman, and Perkell parameters in normal subjects. Journal of the Acoustical
(1988) using di¤erent measurement techniques. One Society of America, 73, 1316–1321.
acoustic measure that reflects the reduction of the Titze, I. R. (1989). On the relation between subglottal pressure
high-frequency spectrum amplitude relative to the low- and fundamental frequency in phonation. Journal of the
frequency spectrum amplitude is the di¤erence H1*-A3* Acoustical Society of America, 85, 901–912.
(in dB) between the amplitude of the first harmonic and Verdolini, K., Drucker, D. G., Palmer, P. M., and Samawi, H.
the amplitude of the third-formant spectrum promi- (1998). Laryngeal adduction in resonant voice. Journal of
nence. (The asterisks indicate that corrections are made Voice, 12, 315–327.
in H1 due to the possible influence of the first formant,
and in A3 due to the influence of the frequencies of the Voice Disorders in Children
first and second formants.) Distributions of values of
H1*-A3* are given in Figure 7 for a population of 22
female and 21 male speakers. The female speakers ap- Investigations of voice using aerodynamic techniques
pear to have a greater spectrum tilt on average, suggest- have been reported for more than 30 years. Investigators
ing a somewhat less abrupt glottal closure during a cycle realized early on that voice production is an aero-
and a greater tendency for lack of complete closure mechanical event and that vocal tract aerodynamics
throughout the cycle. Note the substantial ranges of reflect the interactions between laryngeal anatomy and
20 dB or more within each sex. complex physiological events. Aerodynamic events do
—Kenneth N. Stevens and Helen M. Hanson not always have a one-to-one correspondence with vocal
tract physiology in a dynamic biological system, but
References careful control of stimuli and a good knowledge of
laryngeal physiology make airflow and air pressure
Fant, G., Liljencrants, J., and Lin, Q. G. (1985). A four- measurements invaluable tools.
parameter model of glottal flow. Speech Transmission Lab- Airflow (rate of air movement or velocity) and air
oratory Quarterly Progress and Status Report, 4, 1–13. pressure (force per unit area of air molecules) in the vo-
Stockholm: Royal Institute of Technology. cal tract are good reflectors of vocal physiology. For
Hanson, H. M. (1997). Glottal characteristics of female
speakers: Acoustic correlates. Journal of the Acoustical So-
example, at a simple level, airflow through the glottis
ciety of America, 101, 466–481. (Vg) is an excellent indicator of whether the vocal
Hanson, H. M., and Chuang, E. S. (1999). Glottal character- folds are open or closed. When the vocal folds are open,
istics of male speakers: Acoustic correlates and comparison there is airflow through the glottis, and when the vocal
with female data. Journal of the Acoustical Society of folds are completely closed, there is zero airflow. With
America, 106, 1094–1077. other physiological events held constant, the amount of
68 Part I: Voice
airflow can be an excellent indicator of the degree of dynamic vocal function. The first technique was inverse
opening between the vocal folds. filtering of the easily accessible oral airflow signal
Subglottal air pressure directly reflects changes in the (Rothenberg, 1977). Rothenberg’s procedure allowed
size of the subglottal air cavity. A simplified version of derivation of the glottal airflow waveform. The derived
Boyle’s law predicts the relationship in that a particular volume velocity waveform provides airflow values, per-
pressure (P) in a closed volume (V) of air must equal a mitting detailed, quantifiable analysis of vocal fold
constant (K), that is, K ¼ PV. Subglottal air pressure physiology. The measures made from the derived vol-
will increase when the size of the lungs is decreased; ume velocity waveform can be related to the speed of
conversely, subglottal pressure will decrease when the opening and closing of the vocal folds, the closed time of
size of the lungs is made larger. Changes in subglottal air the vocal folds, the amplitude of vibration, the overall
pressure are mainly regulated through muscular forces shape of the vibratory waveform, and the degree of
controlling the size of the rib cage, with glottal resistance glottal opening during the closed part of the cycle.
or glottal flow used to help increase or decrease the The second aerodynamic technique developed was for
pressures (the glottis can be viewed as a valve that helps the estimation of subglottal pressure and laryngeal air-
regulate pulmonary flows and pressures). way resistance (Rlaw) through noninvasive procedures
A small number of classic studies used average air- (Lofqvist, Carlborg, and Kitzing, 1982; Smitheran and
flow and intraoral air pressure to investigate voice pro- Hixon, 1981). Subglottal air pressure is of primary im-
duction in children (Subtelny, Worth, and Sakuda, 1966; portance, because it is responsible for generating the
Arkebauer, Hixon, and Hardy, 1967; Van Hattum and pressure di¤erential causing vocal fold vibration (the
Worth, 1967; Beckett, Theolke and Cowan, 1971; Diggs, pressure that drives the vocal folds). Subglottal pressure
1972; Bernthal and Beukelman, 1978; Stathopoulos and is also important for controlling sound pressure level and
Weismer, 1986). Measures of flow and pressure were for contributing to changes in fundamental frequency—
used to reflect laryngeal and respiratory function. Dur- all factors essential for normal voice production. The
ing voice production, children produce lower average estimation of Rlaw o¤ers a more general interpretation
airflow than adults, and boys tend to produce higher of laryngeal dynamics and can be used as a screening
average airflow than girls of the same age. Supraglottal measure to quantify values outside normal ranges of
and glottal airway opening most likely account for the vocal function.
di¤erent average airflow values as a function of age Measures made using the Smitheran and Hixon
and sex. Assuming that pressure is the same across all (1981) technique include the following:
speakers, a smaller supraglottal or glottal opening yields
a higher resistance at the constriction and therefore a 1. Average oral air flow: Measured during the open
restricted or lower flow of air. The findings related to vowel /A/ at midpoint to obtain an estimate of laryn-
intraoral air pressure have indicated that children pro- geal airflow.
duce higher intraoral air pressures than adults, especially 2. Intraoral air pressure: Measured peak pressure dur-
because they tend to speak at higher sound pressure ing the voiceless [p] to obtain an estimate of sub-
levels (SPLs). The higher pressures produced by children glottal pressure.
versus adults reflect two physiological events. First, 3. Estimated laryngeal airway resistance: Calculated
children tend to speak at a higher SPL than adults, and by dividing the estimated subglottal pressure by esti-
second, children’s airways are smaller and less compliant mated laryngeal airflow. This calculation is based
than adults’ (Stathopoulos and Weismer, 1986). Intu- on analogy with Olm’s law, R ¼ V=I, where R ¼
itively, it would appear that the greater peak intraoral air resistance, V ¼ voltage, and I ¼ current. In the
pressure in children should lead to a greater magnitude speech system, R ¼ laryngeal airway resistance, V ¼
of oral airflow. It is likely that children’s smaller glottal subglottal pressure (P), and I ¼ laryngeal airflow (V).
and supraglottal areas substantially counteract the po- Thus, R ¼ P=V.
tentially large flows resulting from their high intraoral Measures made using the derived glottal airflow
air pressures. waveform important to vocal fold physiology include the
Children were found to be capable of maintaining the following (Holmberg, Hillman, and Perkell, 1988):
same linguistic contrasts as adults through manipula-
tion of physiological events such as lung cavity size and 1. Airflow open quotient: This measure is comparable to
driving pressure, and laryngeal and articulatory config- the original open quotient defined by Timcke, von
uration. Other intraoral air pressure distinctions in chil- Leden, and Moore (1958). The open time of the vocal
dren are similar to the overall trends described for adult folds (defined as the interval of time between the
pressures. Like adults, children produce higher pressures instant of opening and the instant of closing of the
during (1) voiceless compared to voiced consonants, vocal cords) is divided by the period of the glottal
(2) prevocalic compared to postvocalic consonants, (3) cycle. Opening and closing instants on the airflow
stressed compared to unstressed syllables, and (4) stops waveform are taken at a point equal to 20% of alter-
compared to fricatives. nating airflow (OQ-20%).
In the 1970s and 1980s, two important aerodynamic 2. Speed quotient: The speed quotient is determined as
techniques relative to voice production were developed the time it takes for the vocal folds to open divided by
that stimulated new ways of analyzing children’s aero- the time it takes for the vocal folds to close. Opening
and closing instants on the waveform are taken at a

point equal to 20% of alternating air flow. The mea-
sure reflects how fast the vocal folds are opening and
closing and the asymmetry of the opening and closing
phases.
3. Maximum flow declination rate: The measure is
obtained during the closing portion of the vocal fold
cycle and reflects the fastest rate of airflow shut-o¤.
Di¤erentiating the airflow waveform and then identi-
fying the greatest negative peak on di¤erentiated
waveform locates the fastest declination. The flow
measure corresponds to how fast the vocal folds are
closing.
4. Alternating glottal airflow: This measure is calculated
by taking the glottal airflow maximum minus mini-
mum. This measure reflects the amplitude of vibra-
tion and can reflect the glottal area during vibratory Figure 1. Estimated subglottal pressure as a function of age and
cycle. sound pressure level.
5. Minimum flow: This measure is calculated by sub-
tracting minimum flow from zero. It is indicative of
airflow leak due to glottal opening during the closed SPLs (Fig. 1). Anatomical di¤erences in the upper and
part of the cycle. lower airway will a¤ect the aerodynamic output of the
vocal tract. The increased airway resistance in children
Additional measures important to vocal fold physiol- could substantially increase tracheal pressures (Muller
ogy include the following: and Brown, 1980).
6. Fundamental frequency: This measure is obtained Airflow open quotient (OQ-20%): Open quotient has
from the inverse-filtered waveform by means of a traditionally been very closely correlated with SPL. In
peak-picking program. It is the lowest vibrating fre- adults, it is widely believed that as SPL increases, the
quency of the vocal folds and corresponds perceptu- open quotient decreases. That is, the vocal folds remain
ally to pitch. closed for a longer proportion of the vibratory cycle as
7. Sound pressure level: This measure is obtained at the vocal intensity increases. As seen in Figures 2A and 2B,
midpoint of the vowel from a microphone signal and which show data from a wide age span and both sexes,
corresponds physically to vocal intensity and percep- only adults and older teenagers produce lower open
tually to loudness. quotients for higher SPLs. It is notable that the younger
children and women produce higher OQ-20%, indicating
Voice production arises from a multidimensional that the vocal folds are open for a longer proportion of
system of anatomical, physiological, and neurological the cycle than in men and older boys, regardless of vocal
components and from the complex coordination of these intensity.
biological systems. Many of the measures listed above Maximum flow declination rate (MFDR): Children
have been used to derive vocal physiology. Stathopoulos and adults regulate their airflow shut-o¤ through a
and Sapienza (1997) empirically explored applying ob- combination of laryngeal and respiratory strategies.
jective voice measures to children’s productions and dis- Their MFDRs range from about 250 cc/s/s for com-
cussed the data relative to developmental anatomical fortable levels of SPL to about 1200 cc/s/s for quite high
data (Stathopoulos, 2000). From these cross-sectional SPLs. In children and adults, MFDR increases as SPL
data as a function of children’s ages, a clearer picture of increases (Fig. 3). Increasing MFDR as SPL increases
child vocal physiology has emerged. Because the ana- a¤ects the acoustic waveform by emphasizing the high-
tomical structure in children is constantly growing and frequency components of the acoustic source spectra
changing, children continually alter their movements to (Titze, 1988).
make their voices sound ‘‘normal.’’ Figures 1 through 7 Alternating glottal airflow: Fourteen-year-old boys
show cross-sectional vocal aerodynamic data obtained in and men produce higher alternating glottal airflows than
children ages 4–14 years. One of the striking features younger children and women during vowel production
that emerge from the aerodynamic data is the change in for the high SPLs (Fig. 4). We can interpret the flow
function at 14 years of age for boys. After that age, boys data to indicate that older boys and men produce higher
and men functionally group together, while women and alternating glottal airflows because of their larger laryn-
children seem to have more in common aerodynamically geal structures and greater glottal areas. Additionally,
and physiologically. The data are discussed in relation to men and boys increase their amplitude of vibration dur-
their physiological implications. ing the high SPLs more than women and children do.
Estimated subglottal pressure: Children produce Greater SPLs result in greater lateral excursion of the
higher subglottal pressures than adults, and all speakers vibrating vocal folds; hence the higher alternating glottal
produce higher pressures when they produce higher airflows for adults. Younger children also increase their
70 Part I: Voice
Figure 2. A, Airflow open quotient as a function of age and sex.

B, Airflow open quotient as a function of age and sound pres-
sure level. Figure 4. A, Alternating glottal airflow as a function of age and
sex. B, Alternating glottal airflow as a function of age and
sound pressure level.
amplitude of vibration when they increase their SPL,

and we would assume an increase in the alternating flow
values. The interpretation is somewhat complicated by
the fact that younger children and women have a shorter
vocal fold length and smaller area (Flanagan, 1958),
thereby limiting airflow through the glottis.
Fundamental frequency: As expected, older boys and
men produce lower fundamental frequencies than
women and younger children. An interesting result pre-
dicted by Titze’s (1988) modeling data is that the 4- and
6-year-olds produce unusually high f0 values when they
increase their SPL to high levels (Fig. 5). Changes in
fundamental frequency are more easily e¤ected by
Figure 3. Maximum flow declination rate (MFDR) as a func- increasing tracheal pressure when the vocal fold is char-
tion of age, sex, and sound pressure level. acterized by a smaller e¤ective vibrating mass, as in
young children ages 4–6 years.
Laryngeal airway resistance: Children produce voice
with higher Rlaw than 14-year-olds and adults, and all
speakers increase their Rlaw when increasing their SPL
(Fig. 6). Since Rlaw is calculated by dividing subglottal
Figure 5. Fundamental frequency as a function of age, sex, and

sound pressure level.
Figure 7. Length of vocal fold as a function of age and sex.
vocal function at age 14 in boys. It is not merely coinci-

dental that at 14 years, male larynges continue to in-
crease in size to approximate the size of adult male
larynges, whereas larynges in teenage girls plateau and
approximate the size of adult female larynges (Fig. 7).
Regardless of whether it is size or other anatomical fac-
tors a¤ecting vocal function, it is clear that use of an
adult male model for depicting normal vocal function
is inappropriate for children. Age- and sex-appropriate
aerodynamic, acoustic, and physiological models of
normal voice need to be referred to for the diagnosis and
remediation of voice disorders.
See also instrumental assessment of children’s
voice.
—Elaine T. Stathopoulos
Figure 6. Laryngeal airway resistance as a function of age and
sound pressure level. References
Arkebauer, H. J., Hixon, T. J., and Hardy, J. C. (1967). Peak
pressure by laryngeal airflow, the high Rlaw for high intraoral air pressure during speech. Journal of Speech and
SPL is largely due to higher values of subglottal pres- Hearing Research, 10, 196–208.
sure, since the average glottal airflow is the same across Beckett, R. L., Theolke, W. M., and Cowan, L. A. (1971).
age groups. A basic assumption needs to be discussed Normative study of airflow in children. British Journal of
Disorders in Communication, 6, 13–17.
here, and that is, that glottal airflow will increase when Bernthal, J. E., and Beukelman, D. R. (1978). Intraoral air
subglottal air pressure increases if laryngeal configura- pressure during the production of /p/ and /b/ by children,
tion/resistance is held constant. The fact that subglottal youths, and adults. Journal of Speech and Hearing Research,
pressure increases for high SPLs but flow does not in- 21, 361–371.
crease clearly indicates that Rlaw must be increasing. Diggs, C. C. (1972). Intraoral air pressure for selected English
Physiologically, the shape and configuration of the consonants: A normative study of children. Unpublished
laryngeal airway must be decreasing in size to maintain master’s thesis, Purdue University, Lafayette, Indiana.
the constant airflow in the setting of increasing sub- Flanagan, J. L. (1958). Some properties of the glottal sound
glottal pressures. In sum, children and adults alike source. Journal of Speech and Hearing Disorders, 1, 99–
continually modify their glottal airway to control the 116.
Holmberg, E. B., Hillman, R. B., and Perkell, J. (1988). Glot-
important variables of subglottal pressure and SPL. tal airflow and transglottal air pressure measurements for
The cross-sectional aerodynamic data, and in partic- male and female speakers in soft, normal, and loud voice.
ular the flow data, make a compelling argument that the Journal of the Acoustical Society of America, 84, 511–
primary factor a¤ecting children’s vocal physiology is 529.
the size of their laryngeal structure. A general scan of Lofqvist, A., Carlborg, B., and Kitzing, P. (1982). Initial vali-
the cross-sectional data discussed here shows a change in dation of an indirect measure of subglottal pressure
72 Part I: Voice
during vowels. Journal of Acoustical Society of America, 72, advanced age. Such conditions include neurological dis-
633–635. orders, benign lesions, trauma, inflammatory processes,
Muller, E. M., and Brown, W. S. (1980). Variations in the and endocrine disorders (Morrison and Gore-Hickman,
supraglottal air pressure waveform and their articulatory 1986; Woo et al., 1992). Carcinoma of the head and
interpretation. Speech and Language Advance in Basic Re-
neck occasionally occurs late in life, although more
search and Practice, 4, 318–389.
Rothenberg, M. (1977). Measurement of airflow in speech. commonly it is diagnosed between the ages of 50 and 70
Journal of Speech and Hearing Research, 20, 155–176. (Leon et al., 1998). Interestingly, multiple etiologic fac-
Smitheran, J. R., and Hixon, T. J. (1981). A clinical method tors related to a voice disorder are more common in
for estimating laryngeal airway resistance during vowel elderly patients than in younger adults. In addition,
production. Journal of Speech, Language, and Hearing Re- elderly persons are at increased risk for laryngeal side
search, 46, 138–146. e¤ects from pharmacological agents, since prescription
Stathopoulos, E. T. (2000). A review of the development of the and nonprescription drugs are used disproportionately
child voice: An anatomical and functional perspective. In P. by the elderly (Linville, 2001).
White (Ed.), Child voice. International Symposium. Stock- Elderly patients often exhibit neurological voice dis-
holm, Sweden: KTH Voice Research Centre.
orders, particularly in later stages of old age. Estimates
Stathopoulos, E. T., and Sapienza, C. M. (1997). Devel-
opmental changes in laryngeal and respiratory function of the incidence of peripheral laryngeal nerve damage in
with variations in sound pressure level. Journal of Speech, elderly dysphonic patients range from 7% to 21%. Gen-
Language, and Hearing Research, 40, 595–614. erally, peripheral paralysis in the elderly tends to be
Stathopoulos, E. T., and Weismer, G. (1986). Oral airflow associated with disease processes associated with aging
and air pressure during speech production: A comparative (such as lung neoplasm), as opposed to idiopathic
study of children, youths, and adults. Folia Phoniatrica, 37, peripheral paralysis, which occurs infrequently (Morri-
152–159. son and Gore-Hickman, 1986; Woo et al., 1992). Symp-
Subtelny, J. D., Worth, J. H., and Sakuda, M. (1966). Intraoral toms of peripheral paralysis include glottic insu‰ciency,
air pressure and rate of flow during speech. Journal of reduced loudness, breathiness, and diplophonia. Voice
therapy for peripheral paralysis frequently involves
Timcke, R., von Leden, H., and Moore, P. (1958). Laryn-
geal vibrations: Measurements of the glottic wave. AMA increasing vocal fold adductory force to facilitate closure
Archives of Otolaryngology, 68, 1–19. of the glottis and improving breath support to minimize
Titze, I. R. (1988). Regulation of vocal power and e‰ciency by fatigue and improve speech phrasing. After age 60, cen-
subglottal pressure and glottal width. In O. Fujimura (Ed.), tral neurological disorders such as stroke, focal dystonia,
Vocal fold physiology: Voice production, mechanisms, and Parkinson’s disease, Alzheimer’s disease, and essential
functions (pp. 227–238). New York: Raven Press. tremor also occur frequently. Treatment for central dis-
Van Hattum, R. J., and Worth, J. H. (1967). Airflow rates in orders involves attention to specific deficits in vocal fold
normal speakers. Cleft Palate Journal, 4, 137–147. function such as positioning deficits, instability of vibra-
tion, and incoordination of movements. Functioning of
the velopharynx, tongue, jaw, lips, diaphragm, abdo-
men, and rib cage may also be compromised and may
Voice Disorders of Aging require treatment. Treatment may focus on vocal fold
movement patterns, postural changes, coordination of
respiratory and phonatory systems, respiratory support,
Voice disorders a¿ict up to 12% of the elderly pop- speech prosody, velopharyngeal closure, or speech intel-
ulation (Shindo and Hanson, 1990). Voice disorders in ligibility. In some cases, augmentative communication
elderly persons can result from normal age-related strategies might be used, or medical treatment may be
changes in the voice production mechanism or from combined with speech or voice therapy to improve out-
pathological conditions separate from normal aging comes (Ramig and Scherer, 1992).
(Linville, 2001). However, distinguishing between pa- A variety of benign vocal lesions are particularly
thology and normal age-related changes can be di‰cult. prevalent in the elderly, including Reineke’s edema,
Indeed, a number of investigators have concluded that polypoid degeneration, unilateral sessile polyp, and be-
the vast majority of elderly patients with voice disorders nign epithelial lesions with variable dysplastic changes
su¤er from a disease process associated with aging rather (Morrison and Gore-Hickman, 1986; Woo et al., 1992).
than from a disorder involving physiological aging alone Reineke’s edema and polypoid degeneration occur more
(Morrison and Gore-Hickman, 1986; Woo, Casper, commonly in women and are characterized by chronic,
Colton, and Brewer, 1992). Therefore, a thorough med- di¤use edema extending along the entire length of the
ical examination and history are required to rule out vocal fold. The specific site of the edema is the superficial
pathological processes a¤ecting voice in elderly patients layer of the lamina propria. Although the etiology of
(Hagen, Lyons, and Nuss, 1996). In addition, strobo- Reineke’s edema and polypoid degeneration is uncer-
scopic examination of the vocal folds is recommended to tain, reflux, cigarette smoking, and vocal abuse/misuse
detect abnormalities of mucosal wave and amplitude of have been mentioned as possible causal factors (Kouf-
vocal fold vibration that a¤ect voice production (Woo man, 1995; Zeitels et al., 1997). Some degree of edema
et al., 1992). and epithelial thickening is a normal accompaniment of
A number of pathological conditions that a¤ect voice aging in some individuals. The reason why women are at
are prevalent in the elderly population simply because of greater risk than men for developing pathological epi-
Voice Disorders of Aging 73
thelial changes as they age is unknown, although di¤er- Debruyne et al., 1997; Francis and Wartofsky, 1992;
ences in vocal use patterns could be a factor. That is, el- Satalo¤, Emerich, and Hoover, 1997). Elderly persons
derly women may be more likely to develop hypertensive also may experience vocal symptoms as a consequence
phonatory patterns in an e¤ort to compensate for the of hypoparathyroidism or hyperparathyroidism, or
age-related pitch lowering that accompanies vocal fold from neuropathic disturbances resulting from diabetes
thickening and edema (Linville, 2001). (Maceri, 1986).
The incidence of functional hypertensive dysphonia Lifestyle factors and variability among elderly
among elderly speakers is disputed. Some investigators speakers often blur the distinction between normal and
report significant evidence of phonatory behaviors con- disordered voice. Elderly persons di¤er in the rate and
sistent with hypertension, such as hyperactivity of the extent to which they exhibit normal age-related ana-
ventricular vocal folds in the elderly population (Hagen, tomical, physiological, and neurological changes. They
Lyons, and Nuss, 1996; Morrison and Gore-Hickman, also di¤er in lifestyle. These factors result in consider-
1986). Others report a low incidence of vocal fold lesions able variation in phonatory characteristics, articulatory
commonly associated with hyperfunction (vocal nodules, precision habits, and respiratory function capabilities
pedunculated polyps), as well as relatively few cases of among elderly speakers (Linville, 2001).
functional dysphonia without tissue changes (Woo et al., Lifestyle factors can either postpone or exacerbate the
1992). Clinicians are in agreement, however, that elderly e¤ects of aging on the voice. Although a potentially
patients need to be evaluated for evidence of hyper- limitless combination of environmental factors combine
tensive phonation and provided with therapy to promote to a¤ect aging, perhaps the most controllable and po-
more relaxed phonatory adjustments when evidence of tentially significant lifestyle factors are physical fitness
hypertension is found, such as visible tension in the cer- and cigarette smoking. The elderly population is ex-
vical muscles, a report of increased phonatory e¤ort, a tremely variable in fitness levels. The rate and extent
pattern of glottal attack, high laryngeal position, and/or of decline in motor and sensory performance with aging
anteroposterior laryngeal compression. varies both within and across elderly individuals (Finch
Inflammatory conditions such as pachydermia, laryn- and Schneider, 1985). Declines in motor performance
gitis sicca, and nonspecific laryngitis also are diagnosed are directly related to muscle use and can be minimized
with some regularity in the elderly (see infectious dis- by a lifestyle that includes exercise. The benefits of daily
eases and inflammatory conditions of the larynx). exercise include facilitated muscle contraction, enhanced
These conditions might arise as a consequence of smok- nerve conduction velocity, and increased blood flow
ing, reflux, medications, or poor hydration and often (Spirduso, 1982; Finch and Schneider, 1985; De Vito et
coexist with vocal fold lesions that may be either be- al., 1997). A healthy lifestyle that includes regular exer-
nign or malignant. Age-related laryngeal changes such cise may also positively influence laryngeal performance,
as mucous gland degeneration might be a factor in although a direct link has yet to be established (Ringel
development of laryngitis sicca (Morrison and Gore- and Chodzko-Zajko, 1987). However, there is evidence
Hickman, 1986; Woo et al., 1992). Gastroesophageal that variability on measures of phonatory function in
reflux disease (GERD) is another inflammatory condi- elderly speakers is reduced by controlling for a speaker’s
tion that is reported to occur with greater frequency in physiological condition (Ramig and Ringel, 1983).
the elderly (Richter, 2000). Since GERD has been pres- Physical conditioning programs that include aerobic ex-
ent for a longer time in the elderly in comparison with ercise often are recommended for aging professional
younger adults, it is a more complicated disease in this singers to improve respiratory and abdominal con-
group. Often elderly patients report less severe heartburn ditioning and to avoid tremolo, as well as to improve
but have more severe erosive damage to the esophagus endurance, accuracy, and agility. Physical conditioning
(Katz, 1998; Richter, 2000). is also important in nonsingers to prevent dysphonia in
Because of advanced age, elderly patients may be at later life (Satalo¤, Spiegel, and Rosen, 1997; Linville,
increased risk for traumatic injury to the vocal folds. 2001).
Trauma might manifest as granuloma or scar tissue from The e¤ects of smoking coexist with changes related to
previous surgical procedures requiring general anesthe- normal aging in elderly smokers. Smoking amplifies the
sia, or from other traumatic vocal fold injuries. Vocal impact of normal age-related changes in both the pul-
fold scarring may be present as a consequence of previ- monary and laryngeal systems. Elderly smokers demon-
ous vocal fold surgery, burns, intubation, inflammatory strate accelerated declines in pulmonary function, even
processes, or radiation therapy for glottic carcinoma if no pulmonary disease is detected (Hill and Fisher,
(Morrison and Gore-Hickman, 1986; Kahane and 1993; Lee et al., 1999). Smoking also has a definite e¤ect
Beckford, 1991). on the larynx and alters laryngeal function. Clinicians
Age-related changes in the endocrine system also af- must consider smoking history in assessing an elderly
fect the voice. Secretion disorders of the thyroid (both speaker’s voice (Linville, 2001).
hyperthyroidism and hypothyroidism) occur commonly Clinicians also must be mindful of the overall health
in the elderly and often produce voice symptoms, either status of older patients presenting with voice disorders.
as a consequence of altered hormone levels or as a re- Elderly dysphonic patients often are in poor general
sult of increased pressure on the recurrent laryngeal health and have a high incidence of systemic illness.
nerve. In addition, voice changes are possible with thy- Pulmonary disease and hypertensive cardiac disease
roidectomy, even if the procedure is uncomplicated (e.g., have been cited as particularly prevalent in elderly voice
74 Part I: Voice
patients (Woo et al., 1992). If multiple health problems Richter, J. (2000). Gastroesophageal disease in the older pa-
are present, elderly dysphonic patients may be less com- tient: Presentation, treatment, and complications. American
pliant in following therapeutic regimens, or treatment Journal of Gastroenterology, 95, 368–373.
for voice problems may need to be postponed. In gen- Ringel, R., and Chodzko-Zajko, W. (1987). Vocal indices of
biological age. Journal of Voice, 1, 31–37.
eral, the diagnosis and treatment of voice disorders are
Satalo¤, R., Emerich, K., and Hoover, C. (1997). Endocrine
more complicated if multiple medical conditions are dysfunction. In R. Satalo¤ (Ed.), Professional voice: The
present (Linville, 2001). science and art of clinical care (pp. 291–297). San Diego,
—Sue Ellen Linville CA: Singular Publishing Group.
Satalo¤, R., Spiegel, J., and Rosen, D. (1997). The e¤ects of
age on the voice. In R. Satalo¤ (Ed.), Professional voice:
References The science and art of clinical care (pp. 259–267). San
Diego, CA: Singular Publishing Group.
Debruyne, F., Ostyn, F., Delaere, P., Wellens, W., and Shindo, M., and Hanson, S. (1990). Geriatric voice and laryn-
Decoster, W. (1997). Temporary voice changes after un- geal dysfunction. Otolaryngologic Clinics of North America,
complicated thyroidectomy. Acta Oto-Rhino-Laryngologica 23, 1035–1044.
(Belgium), 51, 137–140. Spirduso, W. (1982). Physical fitness in relation to motor aging.
De Vito, G., Hernandez, R., Gonzalez, V., Felici, F., and Fig- In J. Mortimer, F. Pirozzolo, and G. Maletta (Eds.), The
ura, F. (1997). Low intensity physical training in older sub- aging motor system (pp. 120–151). New York: Praeger.
jects. Journal of Sports Medicine and Physical Fitness, 37, Woo, P., Casper, J., Colton, R., and Brewer, D. (1992). Dys-
72–77. phonia in the aging: Physiology versus disease. Laryngo-
Finch, C., and Schneider, E. (1985). Handbook of the biology of scope, 102, 139–144.
aging. New York: Van Nostrand Reinhold. Zeitels, S., Hillman, R., Bunting, G., and Vaughn, T. (1997).
Francis, T., and Wartofsky, L. (1992). Common thyroid dis- Reineke’s edema: Phonatory mechanisms and management
orders in the elderly. Postgraduate Medicine, 92, 225. strategies. Annals of Otology, Rhinology, and Laryngology,
Hagen, P., Lyons, F., and Nuss, D. (1996). Dysphonia in the 106, 533–543.
elderly: Diagnosis and management of age-related voice
changes. Southern Medical Journal, 89, 204–207.
Hill, R., and Fisher, E. (1993). Smoking cessation in the older Further Readings
chronic smoker: A review. In D. Mahler (Ed.), Lung biol-
ogy in health and disease (vol. 63, pp. 189–218). New York: Abitbol, J., Abitbol, P., and Abitbol, B. (1999). Sex hormones
Marcel Dekker. and the female voice. Journal of Voice, 13, 424–446.
Kahane, J., and Beckford, N. (1991). The aging larynx and Benjamin, B. (1997). Speech production of normally
voice. In D. Ripich (Ed.), Handbook of geriatric communi- aging adults. Seminars in Speech and Language, 18, 135–
cation disorders. Austin, TX: Pro-Ed. 141.
Katz, P. (1998). Gastroesophageal reflux disease. Journal of the Benninger, M., Alessi, D., Archer, S., Bastian, R., Ford, C.,
American Geriatrics Society, 46, 1558–1565. Koufman, J., et al. (1996). Vocal fold scarring: Current
Koufman, J. (1995). Gastroesophageal reflux and voice dis- concepts and management. Otolaryngology–Head and Neck
orders. In C. Korovin and W. Gould (Eds.), Diagnosis Surgery, 115, 474–482.
and treatment of voice disorders (pp. 161–175). New York: Finucane, P., and Anderson, C. (1995). Thyroid disease in
Igaku-Shoin. older patients: Diagnosis and treatment. Drugs and Aging,
Lee, H., Lim, M., Lu, C., Liu, V., Fahn, H., Zhang, C., et al. 6, 268–277.
(1999). Concurrent increase of oxidative DNA damage and Hirano, M., Kurita, S., and Sakaguchi, S. (1989). Ageing of
lipid peroxidation together with mitochondrial DNA muta- the vibratory tissue of human vocal folds. Acta Otolaryn-
tion in human lung tissues during aging: Smoking enhances gologica, 107, 428–433.
oxidative stress on the aged tissues. Archives of Biochemistry Hoit, J., and Hixon, T. (1987). Age and speech breathing.
and Biophysics, 362, 309–316. Journal of Speech and Hearing Research, 30, 351–366.
Leon, X., Quer, M., Agudelo, D., Lopez-Pousa, A., De Juan, Hoit, J., Hixon, T., Altman, M., and Morgan, W. (1989).
M., Diez, S., and Burgues, J. (1998). Influence of age on Speech breathing in women. Journal of Speech and Hearing
laryngeal carcinoma. Annals of Otology, Rhinology, and Research, 32, 353–365.
Laryngology, 107, 164–169. Kahane, J. (1990). Age-related changes in the peripheral speech
Linville, S. E. (2001). Vocal aging. San Diego, CA: Singular mechanism: Structural and physiological changes. In Pro-
Publishing Group. ceedings of the Research Symposium on Communicative
Maceri, D. (1986). Head and neck manifestations of endo- Sciences and Disorders and Aging (ASHA Reports No. 19).
crine disease. Otolaryngologic Clinics of North America, 19, Rockville, MD: American Speech-Language-Hearing As-
171–180. sociation, pp. 75–87.
Morrison, M., and Gore-Hickman, P. (1986). Voice disorders Koch, W., Patel, H., Brennan, J., Boyle, J., and Sidransky, D.
in the elderly. Journal of Otolaryngology, 15, 231–234. (1995). Squamous cell carcinoma of the head and neck in
Ramig, L., and Ringel, R. (1983). E¤ects of physiological the elderly. Archives of Otolaryngology–Head and Neck
aging on selected acoustic characteristics of voice. Journal Surgery, 121, 262–265.
of Speech and Hearing Research, 26, 22–30. Linville, S. E. (1992). Glottal gap configurations in two age
Ramig, L., and Scherer, R. (1992). Speech therapy for neuro- groups of women. Journal of Speech and Hearing Research,
logic disorders of the larynx. In A. Blitzer, M. Brin, C. 35, 1209–1215.
Sasaki, S. Fahn, and K. Harris (Eds.), Neurologic disorders Linville, S. E. (2000). The aging voice. In R. D. Kent and M. J.
of the larynx (pp. 163–181). New York: Thieme Medical Ball (Eds.), Voice quality measurement (pp. 359–376). San
Publishers. Diego, CA: Singular Publishing Group.
Voice Production: Physics and Physiology 75
Liss, J., Weismer, G., and Rosenbek, J. (1990). Selected The most general expression of forces in the larynx
acoustic characteristics of speech production in very old dealing with motion of the vocal folds during phonation
males. Journal of Gerontology Psychological Sciences, 45, is
P35–P45.
Lu, F., Casiano, R., Lundy, D., and Xue, J. (1998). Vocal F (x) ¼ mx 00 þ bx 0 þ kx; (1)
evaluation of thyroplasty type I in the treatment of non- where F (x) is the air pressure forces on the vocal fold
paralytic glottic incompetence. Annals of Otology, Rhinol-
ogy, and Laryngology, 103, 547–553. tissues, m is the mass of the tissue in motion, b is a vis-
Luborsky, M., and McMullen, D. (1999). Culture and aging. cous coe‰cient, k is a spring constant coe‰cient, x is the
In J. Cavanaugh and S. Whitbourne (Eds.), Gerontology: position of the tissue from rest, x 0 is the velocity of the
An interdisciplinary perspective (pp. 65–90). New York: tissue, and x 00 is the acceleration of the tissue. In multi-
Oxford University Press. mass models of phonation (e.g., Ishizaka and Flanagan,
Lundy, D., Silva, C., Casiano, R., Lu, F. L., and Xue, 1972), this equation is used for each mass proposed.
J. (1998). Cause of hoarseness in elderly patients. Each term on the right-hand side characterizes forces in
Otolaryngology–Head and Neck Surgery, 118, 481–485. the tissue, and the left-hand side represents the external
Morrison, M., Rammage, L., and Nichol, H. (1989). Evalua- forces. This equation emphasizes the understanding that
tion and management of voice disorders in the elderly. In J. mass, viscosity, and sti¤ness each play a role in the mo-
Goldstein, H. Kashima, and C. Koopermann (Eds.), Geri-
atric otorhinolaryngology. Philadelphia: B.C. Decker. tion (normal or abnormal) of the vocal folds, that these
Murry, T., Xu, J., and Woodson, G. (1998). Glottal configu- are associated with the acceleration, velocity, and dis-
ration associated with fundamental frequency and vocal placement of the tissue, respectively, and they are bal-
register. Journal of Voice, 12, 44–49. anced by the external air pressure forces acting on the
Omori, K., Slavit, D., Kacker, A., and Blaugrund, S. (1998). vocal folds.
Influence of size and etiology of glottal gap in glottic in- Glottal adduction has three parts. (1) How close the
competence dysphonia. Laryngoscope, 108, 514–518. vocal processes are to each other determines the poste-
Omori, K., Slavit, D., Matos, C., Kojima, H., Kacker, A., rior prephonatory closeness of the membranous vocal
and Blaugrund, S. (1997). Vocal fold atrophy: Quantitative folds. (2) The space created by the intercartilaginous
glottic measurement and vocal function. Annals of Otology,
glottis determines the ‘‘constant’’ opening there through
Rhinology, and Laryngology, 106, 544–551.
Owens, N., Fretwell, M. D., Willey, C., and Murphy, S. S. which some or all of the dc (baseline) air will flow. (3)
(1994). Distinguishing between the fit and frail elderly, The closeness of the membranous vocal folds partly
and optimizing pharmacotherapy. Drugs and Aging, 4, 47– determines whether vocal fold oscillation can take place.
55. To permit oscillation, the vocal folds must be within
Ramig, L., Gray, S., Baker, K., Corbin-Lewis, K., Buder, E., the phonatory adductory range (not too far apart, and
Luschei, E., et al. (2001). The aging voice: A review, treat- yet not too overly compressed; Scherer, 1995), and the
ment data and familial and genetic perspectives. Folia Pho- transglottal pressure must be at or greater than the pho-
niatrica et Logopedica, 53, 252–265. natory threshold pressure (Titze, 1992) for the prevailing
Sinard, R., and Hall, D. (1998). The aging voice: How to conditions of the vocal fold tissues and adduction.
di¤erentiate disease from normal changes. Geriatrics, 53,
The fundamental frequency F0, related to the pitch of
76–79.
Slavit, D. (1999). Phonosurgery in the elderly: A review. Ear, the voice, will tend to rise if the tension of the tissue in
Nose, and Throat Journal, 78, 505–512. motion increases, and will tend to fall if the length, mass,
Tanaka, S., Hirano, M., and Chijiwa, K. (1994). Some aspects or density increases. The most general expression to date
of vocal fold bowing. Annals of Otology, Rhinology, and for pitch control has been o¤ered by Titze (1994), viz.,
Laryngology, 103, 357–362. pffiffiffiffiffiffiffiffiffiffiffiffi
Tolep, K., Higgins, N., Muza, S., Criner, G., and Kelsen, S. F0 ¼ (0:5=L) (sp =r) (1 þ (da =d ) (sam =sp ) aTA ) 0:5 ;
(1995). Comparison of diaphragm strength between healthy (2)
adult elderly and young men. American Journal of Respira- where L is the vibrating length of the vocal folds, sp is
tory and Critical Care Medicine, 152, 677–682.
the passive tension of the tissue in motion, da =d is the
Weismer, G., and Liss, J. (1991). Speech motor control and
aging. In D. Ripich (Ed.), Handbook of geriatric communi- ratio of the depth of the thyroarytenoid (TA) muscle in
cation disorders. Austin, TX: Pro-Ed. vibration to the total depth in vibration (the other tissue
in motion is the more medial mucosal tissue), sam is the
maximum active stress that the TA muscle can produce,
Voice Production: Physics and aTA is the activity level of the TA muscle, and r is the
Physiology density of the tissue in motion. When the vocal folds
are lengthened by rotation of the thyroid and cricoid
cartilages through the contraction of the cricothyroid
When the vocal folds are near each other, a su‰cient (CT) muscles, the passive stretching of the vocal folds
transglottal pressure will set them into oscillation. This increases their passive tension, and thus L and sp tend to
oscillation produces cycles of airflow that create the counter each other, with sp being more dominant (F0
acoustic signal known as phonation, the voicing sound generally rises with vocal fold elongation). Increasing
source, the voice signal, or more generally, voice. This subglottal pressure increases the lateral amplitude of
article discusses some of the mechanistic aspects of pho- motion of the vocal folds, thus increasing sp (via greater
nation. passive stretch; Titze, 1994) and da , thereby increasing
76 Part I: Voice
sity values in the valleys of the resonant structure (Titze,

1994). The radiation away from the lips will increase the
spectrum slope (by about 6 dB per octave).
Maintenance of vocal fold oscillation during phona-
tion depends on the tissue characteristics mentioned
above, as well as the changing shape of the glottis and
the changing intraglottal air pressures during each cycle.
Figure 1. One cycle of glottal airflow. Uac is the varying portion
During glottal opening, the shape of the glottis corre-
of the waveform, and Udc is the o¤set or bias flow. The flow sponding to the vibrating vocal folds is convergent
peak is the maximum flow in the cycle, MFDR is the maxi- (wider in the lower glottis, narrower in the upper glottis),
mum flow declination rate (derivative of the flow), typically and the pressures on the walls of the glottis are positive
located on the right-hand side of the flow pulse, and the corner due to this shape and to the (always) positive subglottal
curvature at the end of the flow pulse describes how sharp the pressure (for normal egressive phonation) (Fig. 2). This
corner ‘‘shut-o¤ ’’ is. The flow peak, MFDR, and corner positive pressure separates the folds during glottal open-
sharpness are all important for the spectral aspects of the flow ing. During glottal closing, the shape of the glottis is di-
pulse (see text). vergent (narrower in the lower glottis, wider in the upper
glottis), and the pressures on the walls of the lower glot-
tis are negative because of this shape (Fig. 2), and nega-
F0. Increasing the contraction of the TA muscle (aTA ) tive throughout the glottis when there also is rarefaction
would tend to sti¤en the muscle and shorten the vocal (negative pressure) of the supraglottal region. This
fold length (L), both of which would raise F0 but at the alternation in glottal shape and intraglottal pressures,
same time decrease the passive tension (sp ) and the depth along with the alternation of the internal forces of the
of vibration (da ), which would decrease F0. Typically, vocal folds, maintains the oscillation of the vocal folds.
large changes in F0 are associated with increased con- The exact glottal shape and intraglottal pressure
traction of both the CT and TA muscles (Hirano, Ohala, changes, however, need to be established in the human
and Vennard, 1969; Titze, 1994). Thus, the primary larynx for the wide range of possible phonatory and vo-
control for F0 is through the coordinative contraction of cal tract acoustic conditions.
the TA and CT muscles, and subglottal pressure. F0
control, including the di¤erentiated contraction of the
complex TA muscle, anterior pull by the hyoid bone
(Honda, 1983), cricoid tilt via tracheal pull (Sundberg,
Leanderson, and von Euler, 1989), and the associations
with adduction and vocal quality all need much study.
The intensity of voiced sounds, related to the loudness
of the voice, is a combination and coordination of res-
piratory, laryngeal, and vocal tract aspects. Intensity
increases with an increase in subglottal pressure, which
itself depends on both lung volume reduction (an in-
crease in air pressure in the lungs) and adduction of the
vocal folds (which o¤ers resistance to the flow of air
from the lungs). An increase in the subglottal pressure
during phonation can a¤ect the cyclic glottal flow wave-
form (Fig. 1) by increasing its flow peak, increasing the
maximum flow declination rate (MFDR, the maximum
rate that the flow shuts o¤ as the glottis is closing), and
the sharpness of the baseline corner when the flow is near
zero (or near its minimum value in the cycle). Greater
peak flow, MFDR, and corner sharpness respectively Figure 2. Pressure profiles within the glottis. The upper trace
increase the intensity of F0, the intensity of the first for- corresponds to the data for a glottis with a 10 convergence
mant region (at least), and the intensity of the higher and the lower trace to data for a glottis with a 10 divergence,
partials (Fant, Liljencrants, and Lin, 1985; Gau‰n and both having a minimal glottal diameter of 0.04 cm (using a
Sundberg, 1989). Glottal adduction level greatly a¤ects Plexiglas model of the larynx; Scherer and Shinwari, 2000).
the source spectrum or quality of the voice, increasing The transglottal pressure was 10 cm H2 O in this illustration.
the negative slope of the spectrum as one changes voice Glottal entrance is at the minimum diameter position for the
divergent glottis. The length of the glottal duct was 0.3 cm.
production from highly compressed voice (a relatively Supraglottal pressure was taken to be atmospheric (zero). The
flat spectrum) to normal adduction to highly breathy convergent glottis shows positive pressures and the divergent
voice (a relatively steep spectrum) (Scherer, 1995). The glottis shows negative pressures throughout most of the glottis.
vocal tract filter function will augment the spectral in- The curvature at the glottal exit of the convergent glottis pre-
tensity values of the glottal flow source in the region of vents the pressures from being positive throughout (Scherer,
the formants (resonances), and will decrease their inten- DeWitt, and Kucinschi, 2001).
Voice Production: Physics and Physiology 77
(Zhang et al., 2001). The turbulence and vorticities of

the glottal flow may also contribute sound sources
(Zhang et al., 2001). The false vocal folds themselves
may contribute significant control of the flow resistance
through the larynx, from more resistance (decreasing the
flow if the false folds are quite close) to less resistance
(increasing the glottal flow when the false folds are in an
intermediate position) (Agarwal and Scherer, in press).
Computer modeling needs to be practical, as in two-
mass modeling (Ishizaka and Flanagan, 1972), but also
closer to physiological reality, as in finite element mod-
eling (Alipour, Berry, and Titze, 2000; Alipour and
Scherer, 2000). The most complete approach so far is
to combine finite element modeling of the tissue with
computational fluid dynamics of the flow (to solve the
Navier-Stokes equations; Alipour and Titze, 1996).
However, we still need models of phonation that are
Figure 3. Factors leading to pitch, quality, and loudness pro- helpful in describing and predicting subtle aspects of
duction. See text.
laryngeal function necessary for di¤erentiating vocal
pathologies, phonation styles and types, and approaches
for phonosurgery, as well as for providing rehabilitation
and training feedback for clients.
When the two medial vocal fold surfaces are not mir- See also voice acoustics.
ror images of each other across the midline, the geomet-
ric asymmetry creates di¤erent pressures on the two sides —Ron Scherer
(i.e., pressure asymmetries; Scherer et al., 2001, 2002)
and therefore di¤erent driving forces on the two sides.
Also, if there is tissue asymmetry, that is, if the two vocal References
folds themselves do not have equal values of tension Agarwal, M., Scherer, R. C., and Hollien, H. (in press). The
(sti¤ness) and mass, one vocal fold may not vibrate like false vocal folds: Shape and size in coronal view during
the other one, creating roughness, subharmonics, and phonation. Journal of Voice.
cyclic groupings (Isshiki and Ishizaka, 1976; Gerratt Alipour, F., Berry, D. A., and Titze, I. R. (2000). A finite-
et al., 1988; Wong et al., 1991; Titze, Baken, and Herzel, element model of vocal-fold vibration. Journal of the
1993; Steinecke and Herzel, 1995). Acoustical Society of America, 108, 3003–3012.
Alipour, F., and Scherer, R. C. (2000). Vocal fold bulging:
Figure 3 summarizes some basic aspects of phona-
E¤ects on phonation using a biophysical computer model.
tion. The upper left suggests muscle contraction e¤ects Journal of Voice, 14, 470–483.
of vocal fold length (via CT and TA action), adduction Alipour, F., and Titze, I. R. (1996). Combined simulation of
(via TA, lateral cricoarytenoid, posterior cricoarytenoid, airflow and vocal fold vibrations. In P. Davis and N.
and interarytenoid muscle contraction), tension (via CT, Fletcher (Eds.), Vocal fold physiology: Controlling com-
TA, and adduction), and glottal shape (via vocal fold plexity and chaos (pp. 17–29). San Diego, CA: Singular
length, adduction, and TA rounding e¤ect). When lung Publishing Group.
volume reduction is then employed, glottal airflow and Fant, G., Liljencrants, J., and Lin, Q. (1985). A four-parameter
subglottal pressure are created, resulting in motion of the model of glottal flow. Speech Transmission Laboratory
vocal folds (if the adduction and pressure are su‰cient), Quarterly Progress and Status Report, 4, 1–13. Stockholm:
Royal Institute of Technology.
glottal flow resistance (transglottal pressure divided by
Gau‰n, J., and Sundberg, J. (1989). Spectral correlates of
the airflow), and the fundamental frequency (and pitch) glottal voice source waveform characteristics. Journal of
of the voice. With the vocal tract included, the glottal Speech and Hearing Research, 32, 556–565.
flow is a¤ected by the resonances of the vocal tract Gerratt, B. R., Precoda, K., Hanson, D., and Berke, G. S.
(pressures acting at the glottis level) and the inertance of (1988). Source characteristics of diplophonia. Journal of the
the air of the vocal tract (to skew the glottal flow wave- Acoustical Society of America, 83, S66.
form to the right; Rothenberg, 1983), and the output Hirano, M., Ohala, J., and Vennard, W. (1969). The function
spectra (quality) and intensity (loudness) result from the of the laryngeal muscles in regulating fundamental fre-
combination of the glottal flow, resonance, and radia- quency and intensity of phonation. Journal of Speech and
tion from the lips. Hearing Research, 12, 616–628.
Honda, K. (1983). Relationship between pitch control and
Many basic issues of glottal aerodynamics, aero-
vowel articulation. In D. M. Bless and J. H. Abbs (Eds.),
acoustics, and modeling remain unclear for both normal Vocal fold physiology: Contemporary research and clinical
and abnormal phonation. The glottal flow (the volume issues. San Diego, CA: College-Hill Press.
velocity flow) is considered a primary sound source, and Ishizaka, K., and Flanagan, J. L. (1972). Synthesis of voiced
the presence of the false vocal folds may interfere with sounds from a two-mass model of the vocal cords. Bell
the glottal jet and create a secondary sound source System Technology Journal, 51, 1233–1268.
78 Part I: Voice
Isshiki, N., and Ishizaka, K. (1976). Computer simulation of ciated certain kinds of voices with specific character
pathological vocal cord vibration. Journal of the Acoustical traits; for example, a nasal voice indicated a spiteful and
Society of America, 60, 1193–1198. immoral character. Ancient writers on oratory empha-
Rothenberg, M. (1983). An interactive model for the voice sized voice quality as an essential component of polished
source. In D. M. Bless and J. H. Abbs (Eds.), Vocal fold
speech and described methods for conveying a range of
physiology: Contemporary research and clinical issues (pp.
155–165). San Diego, CA: College-Hill Press. emotions appropriately, for cultivating power, brilliance,
Scherer, R. C. (1995). Laryngeal function during phonation. In and sweetness, and for avoiding undesirable character-
J. S. Rubin, R. T. Satalo¤, G. S. Korovin, and W. J. Gould istics like roughness, brassiness, or shrillness (see Laver,
(Eds.), Diagnosis and treatment of voice disorders (pp. 86– 1981, for review).
104). New York: Igaku-Shoin. Evaluation of vocal quality is an important part of
Scherer, R. C., DeWitt, K., and Kucinschi, B. R. (2001). The the diagnosis and treatment of voice disorders. Patients
e¤ect of exit radii on intraglottal pressure distributions in usually seek clinical care because of their own perception
the convergent glottis. Journal of the Acoustical Society of of a voice quality deviation, and most often they judge
America, 110, 2267–2269. the success of treatment for the voice problem by im-
Scherer, R. C., and Shinwari, D. (2000). Glottal pressure pro-
provement in their voice quality. A clinician may also
files for a diameter of 0.04 cm. Journal of the Acoustical
Society of America, 107, 2905 (A). judge success by documenting changes in laryngeal
Scherer, R. C., Shinwari, D., DeWitt, K., Zhang, C., Kucin- anatomy or physiology, but in general, patients are more
schi, B., and Afjeh, A. (2001). Intraglottal pressure profiles concerned with how their voices sound after treatment.
for a symmetric and oblique glottis with a divergence angle Researchers from other disciplines are also interested
of 10 degrees. Journal of the Acoustical Society of America, in measuring vocal quality. For example, linguists are
109, 1616–1630. interested in how changes in voice quality can signal
Scherer, R. C., Shinwari, D., De Witt, K. J., Zhang, C., changes in meaning; psychologists are concerned with
Kucinschi, B. R., and Afjeh, A. A. (2002). Intraglottal the perception of emotion and other personal informa-
pressure profiles for a symmetric and oblique glottis with a tion encoded in voice; engineers seek to develop algo-
uniform duct. Journal of the Acoustical Society of America,
rithms for signal compression and transmission that
112(4), 1253–1256.
Steinecke, I., and Herzel, H. (1995). Bifurcations in an asym- preserve voice quality; and law enforcement o‰cials
metric vocal-fold model. Journal of the Acoustical Society of need to assess the accuracy of speaker identifications.
America, 97, 1874–1884. Despite this long intellectual history and the substan-
Sundberg, J., Leanderson, R., and von Euler, C. (1989). tial cross-disciplinary importance of voice quality, mea-
Activity relationship between diaphragm and cricothyroid surement of voice quality is problematic, both clinically
muscles. Journal of Voice, 3, 225–232. and experimentally. Most techniques for assessing voice
Titze, I. R. (1992). Phonation threshold pressure: A missing quality fall into one of two general categories: perceptual
link in glottal aerodynamics. Journal of the Acoustical So- assessment protocols, or protocols employing an acous-
ciety of America, 91, 2926–2935. tic or physiologic measurement as an index of quality. In
perceptual assessments, a listener (or listeners) rates a
Titze, I. R., Baken, R. J., and Herzel, H. (1993). Evidence of voice on a numerical scale or a set of scales representing
chaos in vocal fold vibration. In I. R. Titze (Ed.), Vocal fold the extent to which the voice is characterized by critical
physiology: Frontiers in basic science (pp. 143–188). San aspects of voice quality. For example, Fairbanks (1960)
Diego, CA: Singular Publishing Group. recommended that voices be assessed on 5-point scales
Wong, D., Ito, M. R., Cox, N. B., and Titze, I. R. (1991). for the qualities harshness, hoarseness, and breathiness.
Observation of perturbations in a lumped-element model In the GRBAS protocol (Hirano, 1981), listeners evalu-
of the vocal folds with application to some pathological ate voices on the scales Grade (or extent of pathology),
cases. Journal of the Acoustical Society of America, 89, 383– Roughness, Breathiness, Asthenicity (weakness or lack
394. of power in the voice), and Strain, with each scale rang-
Zhang, C., Zhao, W., Frankel, S. H., and Mongeau, L. (2001).
ing from 0 (normal) to 4 (severely disordered). A recent
Computational aeroacoustics of phonation: E¤ect of sub-
glottal pressure, glottal oscillation frequency, and ven- revision to this protocol (Dejonckere et al., 1998) has
tricular folds. Journal of the Acoustical Society, 109, 2412 expanded it to GIRBAS by adding a scale for Instabil-
(A). ity. Many other similar protocols have been proposed.
For example, the Wilson Voice Profile System (Wilson,
1977) includes 7-point scales for laryngeal tone, laryn-
geal tension, vocal abuse, loudness, pitch, vocal inflec-
Voice Quality, Perceptual Evaluation of tions, pitch breaks, diplophonia (perception of two
pitches in the voice), resonance, nasal emission, rate, and
overall vocal e‰ciency. A 13-scale protocol proposed by
Voice quality is the auditory perception of acoustic ele- Hammarberg and Gau‰n (1995) includes scales for
ments of phonation that characterize an individual assessing aphonia (lack of voice), breathiness, tension,
speaker. Thus, it is an interaction between the acoustic laxness, creakiness, roughness, gratings, pitch instability,
speech signal and a listener’s perception of that signal. voice breaks, diplophonia, falsetto, pitch, and loudness.
Voice quality has been of interest to scholars for as long Even more elaborate protocols have been proposed by
as people have studied speech. The ancient Greeks asso- Gelfer (1988; 17 parameters) and Laver (approximately
Voice Quality, Perceptual Evaluation of 79
50 parameters; e.g., Greene and Mathieson, 1989). ticular acoustic stimulus. Thus, acoustic measures that
Methods like visual-analog scaling (making a mark on purport to quantify vocal quality can only derive their
an undi¤erentiated line to indicate the amount of a validity as measures of voice quality from their causal
quality present) or direct magnitude estimation (assign- association with auditory perception. Practically, con-
ing any number—as opposed to one of a finite number sistent correlations have never been found between per-
of scale values—to indicate the amount of a quality ceptual and instrumental measures of voice, suggesting
present) have also been applied in e¤orts to quantify that such instrumental measures are not stable indices of
voice quality. Ratings may be made with reference to perceived quality. Finally, correlation does not imply
‘‘anchor’’ stimuli that exemplify the di¤erent scale causality: simply knowing the relationship of an acoustic
values, or with reference to a listener’s own internal variable to a perceptual one does not necessarily illumi-
standards for the di¤erent levels of a quality. nate its contribution to perceived quality. Even if an
The usefulness of such protocols for perceptual as- acoustic variable were important to a listener’s judg-
sessment is limited by di‰culties in establishing the cor- ment of vocal quality, the nature of that contribution
rect and adequate set of scales needed to document the would not be revealed by a correlation coe‰cient. Fur-
sound of a voice. Researchers have never agreed on a ther, given the great variability in perceptual strategies
standardized set of scales for assessing voice quality, and and habits that individual listeners demonstrate in their
some evidence suggests that di¤erences between listeners use of traditional rating scales, the overall correlation
in perceptual strategies are so large that standardization between acoustic and perceptual variables, averaged
e¤orts are doomed to failure (Kreiman and Gerratt, across samples of listeners and voices, fails to provide
1996). In addition, listeners are apparently unable to useful insight into the perceptual process. (See Kreiman
agree in their ratings of voices. Evidence suggests that on and Gerratt, 2000, for an extended review of these
average, more than 60% of the variance in ratings of issues.)
voice quality is due to factors other than di¤erences Gerratt and Kreiman (2001) proposed an alternative
between voices in the quality being rated. For example, solution to this dilemma. They measured vocal quality
scale ratings may vary depending on variable listener by asking listeners to copy natural voice samples with a
attention, di‰culty isolating single perceptual dimen- speech synthesizer. In this method, listeners vary speech
sions within a complex acoustic stimulus, and di¤erences synthesis parameters to create an acceptable auditory
in listeners’ previous experience with a class of voices match to a natural voice stimulus. When a listener
(Kreiman and Gerratt, 1998). Evidence suggests that chooses the best match to a test stimulus, the synthesis
traditional perceptual scaling methods are e¤ectively settings parametrically represent the listener’s perception
matching tasks, where external stimuli (the voices) are of voice quality. Because listeners directly compare each
compared to stored mental representations that serve as synthetic token they create with the target natural voice,
internal standards for the various rating scales. These they need not refer to internal standards for particular
idiosyncratic, internal standards appear to vary with lis- voice qualities. Further, listeners can manipulate acous-
teners’ previous experience with voices (Verdonck de tic parameters and hear the result of their manipulations
Leeuw, 1998) and with the context in which a judgment immediately. This process helps listeners focus attention
is made, and may vary substantially across listeners on individual acoustic dimensions, reducing the percep-
as well as within a given listener. In addition, severity tual complexity of the assessment task and the associated
of vocal deviation, di‰culty isolating individual dimen- response variability. Preliminary evaluation of this
sions in complex perceptual contexts, and factors like method demonstrated near-perfect agreement among
lapses in attention can also influence perceptual mea- listeners in their assessments of voice quality, presum-
sures of voice (de Krom, 1994). These factors (and pos- ably because this analysis-synthesis method controls the
sibly others) presumably all add uncontrolled variability major sources of variance in quality judgments while
to scalar ratings of vocal quality, and contribute to lis- avoiding the use of dubiously valid scales for quality.
tener disagreement (see Gerratt and Kreiman, 2001, for These results indicate that listeners do in fact agree in
review). their perceptual assessments of pathological voice qual-
In response to these substantial di‰culties, some ity, and that tools can be devised to measure perception
researchers suggest substituting objective measures of reliably. However, how such protocols will function in
physiologic function, airflow, or the acoustic signal for clinical (rather than research) applications remains to be
these flawed perceptual measures, for example, using a demonstrated. Much more research is certainly needed
measure of acoustic frequency perturbation as a de facto to determine a meaningful, parsimonious set of acoustic
measure of perceived roughness (see acoustic assess- parameters that successfully characterizes all possible
ment of voice). This approach reflects the prevailing normal and pathological voice qualities. Such a set could
view that listeners are inherently unable to agree in their obviate the need for voice quality labels, allowing
perception of such complex auditory stimuli. Theoretical researchers and clinicians to replace quality labels with
and practical di‰culties also beset this approach. Theo- acoustic parameters that are causally linked to auditory
retically, we cannot know the perceptual importance perception, and whose levels objectively, completely, and
of particular aspects of the acoustic signal without valid validly specify the voice quality of interest.
measures of that perceptual response, because voice
quality is by definition the perceptual response to a par- —Bruce Gerratt and Jody Kreiman
80 Part I: Voice
References Voice Rehabilitation After

de Krom, G. (1994). Consistency and reliability of voice qual- Conservation Laryngectomy
ity ratings for di¤erent types of speech fragments. Journal of
Dejonckere, P. H., Remacle, M., Fresnel-Elbaz, E., Woisard,
V., Crevier, L., and Millet, B. (1998). Reliability and Partial or conservation laryngectomy procedures are
clinical relevance of perceptual evaluation of pathological performed not only to surgically remove a malignant
voices. Revue de Laryngologie Otologie Rhinologie, 119, lesion from the larynx, but also to preserve some func-
247–248. tional valving capacity of the laryngeal mechanism.
Fairbanks, G. (1960). Voice and articulation drillbook. New Retention of adequate valvular function allows conser-
York: Harper. vation of some degree of vocal function and safe
Gelfer, M. P. (1988). Perceptual attributes of voice: Develop- swallowing. As such, the primary goal of conservation
ment and use of rating scales. Journal of Voice, 2, 320– laryngectomy procedures is cancer control and oncologic
326.
safety, with a secondary goal of maintaining upper air-
Gerratt, B. R., and Kreiman, J. (2001). Measuring vocal qual-
ity with speech synthesis. Journal of the Acoustical Society way sphincteric function and phonatory capacity post-
of America, 110, 2560–2566. surgery. However, conservation laryngectomy will
Greene, M. C. L., and Mathieson, L. (1989). The voice and its always necessitate tissue ablation, with disruption of the
disorders. London: Whurr. vibratory integrity of at least one vocal fold (Bailey,
Hammarberg, B., and Gau‰n, J. (1995). Perceptual and 1981). From the standpoint of voice production, any
acoustic characteristics of quality di¤erences in pathological degree of laryngeal tissue ablation has direct and poten-
voices as related to physiological aspects. In O. Fujimura tially highly negative implications for the functional
and M. Hirano (Eds.), Vocal fold physiology: Voice quality capacity of the postoperative larynx. Changes in laryn-
control (pp. 283–303). San Diego, CA: Singular Publishing geal structure result in aerodynamic, vibratory, and ulti-
Group.
Hirano, M. (1981). Clinical examination of voice. New York:
mately acoustic changes in the voice signal (Berke,
Springer-Verlag. Gerratt, and Hanson, 1983; Rizer, Schecter, and Cole-
Kreiman, J., and Gerratt, B. R. (1996). The perceptual struc- man, 1984; Doyle, 1994).
ture of pathologic voice quality. Journal of the Acoustical Vocal characteristics following conservation laryn-
Society of America, 100, 1787–1795. gectomy are a consequence of anatomical influences
Kreiman, J., and Gerratt, B. R. (1998). Validity of rating scale and the resultant physiological function of the post-
measures of voice quality. Journal of the Acoustical Society surgical laryngeal sphincter, as well as secondary physi-
of America, 104, 1598–1608. ological compensation. In some instances, this level of
Kreiman, J., and Gerratt, B. R. (2000). Measuring vocal qual- compensation may facilitate the communicative process,
ity. In R. D. Kent and M. J. Ball (Eds.), Voice quality but in other instances such compensations may be detri-
measurement (pp. 73–102). San Diego, CA: Singular Pub-
lishing Group.
mental to the speaker’s communicative e¤ectiveness
Laver, J. (1981). The analysis of vocal quality: From the clas- (Doyle, 1997). Perceptual observations following a vari-
sical period to the 20th century. In R. Asher and E. Hen- ety of conservation laryngectomy procedures have been
derson (Eds.), Toward a history of phonetics (pp. 79–99). diverse, but data clearly indicate perceived changes
Edinburgh: Edinburgh University Press. in voice quality, the degree of air leakage through the
Verdonck de Leeuw, I. M. (1998). Perceptual analysis of voice reconstructed laryngeal sphincter, the appearance of
quality: Trained and naive raters, and self-ratings. In compensatory hypervalving, and other features (Blau-
G. de Krom (Ed.), Proceedings of Voicedata98 Symposium grund et al., 1984; Leeper, Heeneman, and Reynolds,
on Databases in Voice Quality Research and Education 1990; Hoasjoe et al., 1992; Doyle et al., 1995; Keith,
(pp. 12–15). Utrecht, the Netherlands: Utrecht Institute of Leeper, and Doyle, 1996). Two factors in particular,
Linguistics.
Wilson, F. B. (1977). Voice disorders. Austin; TX: Learning
glottic insu‰ciency and the relative degree of compli-
Concepts. ance and resistance to airflow o¤ered by the recon-
structed valve, appear to play a significant role in
compensatory behaviors influencing auditory-perceptual
Further Readings assessments of voice quality (Doyle, 1997). Excessive
closure of the laryngeal mechanism at either glottic or
Gerratt, B. R., and Kreiman, J. (2001). Measuring vocal qual- supraglottic (or both) levels might decrease air escape,
ity with speech synthesis. Journal of the Acoustical Society but may also create abnormalities in voice quality due
of America, 110, 2560–2566. to active (volitional) hyperclosure (Leeper, Heeneman,
Kent, R. D., and Ball, M. J. (2000). Voice quality measure- and Reynolds, 1990; Doyle et al., 1995; Keith, Leeper,
ment. San Diego, CA: Singular Publishing Group. and Doyle, 1996; Doyle, 1997). Similarly, volitional,
Kreiman, J., and Gerratt, B. R. (2000). Sources of listener
disagreement in voice quality assessment. Journal of the
compensatory adjustments in respiratory volume in an
Acoustical Society of America, 108, 1867–1879. e¤ort to drive a noncompliant voicing source charac-
Kreiman, J., Gerratt, B. R., Kempster, G., Erman, A., and terized by postsurgical increases in its resistance to
Berke, G. S. (1993). Perceptual evaluation of voice quality: airflow may negatively influence auditory-perceptual
Review, tutorial, and a framework for future research. judgments of the voice by listeners. This may then call
Journal of Speech and Hearing Research, 36, 21–40. attention to the voice, with varied degrees of social pen-
Voice Rehabilitation After Conservation Laryngectomy 81
alty. In this regard, the ultimate postsurgical e¤ects of e‰ciency and generate the best voice quality without
conservation laryngectomy on voice quality may result excessive physical e¤ort. Clinical goals that focus on
in unique limitations for men and women, and as such ‘‘easy’’ voice production without excessive speech rate
require clinical consideration. are appropriate targets. Common facilitation methods
The clinical evaluation of individuals who have may involve the use of visual or auditory feedback, ear
undergone conservation laryngectomy initially focuses training, and respiration training (Boone, 1977; Boone
on identifying behaviors that hold the greatest poten- and McFarland, 1994; Doyle, 1997).
tial to negatively alter voice quality. Excessive vocal Maladaptive compensations following conservation
e¤ort and a harsh, strained voice quality are commonly laryngectomy often tend to be hyperfunctional behav-
observed (Doyle et al., 1995). Standard evaluation iors. However, a subgroup of individuals may present
may include videoendoscopy (via both rigid and flexible with weak and inaudible voices because of pain or dis-
endoscopy) and acoustic, aerodynamic, and auditory- comfort in the early postsurgical period. Such compen-
perceptual assessment. Until recently, only limited sations may remain when the discomfort has resolved,
comprehensive data on vocal characteristics of those and may result in perceptible limitations in verbal com-
undergoing conservation laryngectomy have been avail- munication. In such cases of hypofunctional behavior,
able. Careful, systematic perceptual assessment has voice therapy is usually directed toward facilitating
direct clinical implications in that information from such increased approximation of the laryngeal valve by means
an assessment will lead to the definition of treatment of traditional voice therapy methods (Boone, 1977; Col-
goals and methods of monitoring potential progress. A ton and Casper, 1990). A weak voice requires the clini-
comprehensive framework for the evaluation and treat- cian to orient therapy tasks toward systematically
ment of voice alterations in those who have undergone increasing glottal resistance. Although a ‘‘rough’’ or
conservation laryngectomy is available (Doyle, 1977). ‘‘e¤ortful’’ voice may be judged as abnormal, it may
Depending on the auditory-perceptual character of be preferable for some speakers when compared to a
the voice, the clinician should be able to discern func- breathy voice quality. This is of particular importance
tional (physiological) changes to the sphincter that may when evaluating goals and potential voice outcomes rel-
have a direct influence on voice quality. Those auditory- ative to the speaker’s sex.
perceptual features that most negatively a¤ect overall The physical and psychological demands placed on
voice quality should form the initial targets for thera- the patient during initial attempts at voicing might
peutic intervention. For example, the speaker’s attempt increase levels of tension that ultimately may reduce
to increase vocal loudness may create a level of hyper- the individual’s phonatory capability. Those individuals
closure that is detrimental to judgments of voice quality. who exhibit increased fundamental frequency, exces-
Although the rationale for such ‘‘abnormal’’ behavior sively aperiodic voices, or intermittent voice stoppages
is easily understood, the speaker must understand the may be experiencing problems that result from post-
relative levels of penalty it creates in a communicative operative physiological overcompensation because they
context. Further treatment goals should focus on (1) are struggling to produce voice. Many individuals who
enhancing residual vocal functions and capacities, and have undergone conservative laryngectomy may demon-
(2) e¤orts to reduce or eliminate compensatory behav- strate considerable e¤ort during attempts at postsurgical
iors that negatively alter the voice signal (Doyle, 1997). voice production, particularly early during treatment.
Thus, primary treatment targets will frequently address Because active glottic hypofunction is infrequently
changes in voice quality and/or vocal e¤ort. Increased noted in those who have undergone conservative laryn-
e¤ort may be compensatory in an attempt to alter pitch gectomy, clinical tasks that focus on reducing over-
or loudness, or simply to initiate the generation of compensation (i.e., hyperfunctional closure) are more
voice. Clinical assessment should determine whether commonly used.
voice change is due to under- or overcompensation for See also laryngectomy.
the disrupted sphincter. Therefore, strategies for voice
therapy must address changes in anatomical and physi- —Philip C. Doyle
ological function, the contributions of volitional com-
pensation, and whether changes in voice quality may be References
the result of multiple factors.
Voice therapy strategics for those who have under- Bailey, B. J. (1981). Partial laryngectomy and laryngoplasty.
gone conservation laryngectomy have evolved from Laryngoscope, 81, 1742–1771.
strategics used in traditional voice therapy (e.g., Colton Berke, G. S., Gerratt, B. R., and Hanson, D. G. (1983).
and Casper, 1990; Boone and McFarland, 1994). Doyle An acoustic analysis of the e¤ects of surgical therapy on
(1997) has suggested that therapy following conservation voice quality. Otolaryngology–Head and Neck Surgery, 91,
502–508.
laryngectomy should focus on ‘‘(1) smooth and easy Blaugrund, S. M., Gould, W. J., Haji, T., Metzler, J., Bloch,
phonation; (2) a slow, productive transition to voice C., and Baer, T. (1984). Voice analysis of the partially
generation at the initiation of voice and speech produc- ablated larynx: A preliminary report. Annals of Otology,
tion, (3) increasing the length of utterance in conjunc- Rhinology, and Laryngology, 93, 311–317.
tion with consistently easy phonation, and (4) control of Boone, D. R. (1977). The voice and voice therapy (2nd ed.).
speech rate via phrasing.’’ The intent is to improve vocal Englewood Cli¤s, NJ: Prentice Hall.
82 Part I: Voice
Boone, D. R., and McFarlane, S. C. (1994). The voice and Kirchner, J. A. (1984). Pathways and pitfalls in partial laryn-
voice therapy (5th ed.). Englewood Cli¤s, NJ: Prentice Hall. gectomy. Annals of Otology, Rhinology, and Laryngology,
Colton, R. H., and Casper, J. K. (1990). Understanding voice 93, 301–305.
problems: A physiological perspective for diagnosis and Kirchner, J. A. (1989). What have whole organ sections con-
treatment. Baltimore: Williams and Wilkins. tributed to the treatment of laryngeal cancer? Annals of
Doyle, P. C. (1994). Foundations of voice and speech rehabili- Otology, Rhinology, and Laryngology, 98, 661–667.
tation following laryngeal cancer. San Diego, CA: Singular Lefebvre, J. (1998). Larynx preservation: The discussion is
Publishing Group. not closed. Otolaryngology–Head and Neck Surgery, 118,
Doyle, P. C. (1997). Voice refinement following conserva- 389–393.
tion surgery for cancer of the larynx: A conceptual model Pressman, J. J. (1956). Submucosal compartmentalization of
for therapeutic intervention. American Journal of Speech- the larynx. Annals of Otology, Rhinology, and Laryngology,
Language Pathology, 6, 27–35. 65, 766–773.
Doyle, P. C., Leeper, H. A., Houghton-Jones, C., Heeneman, Robbins, K. T., and Michaels, L. (1985). Feasibility of subtotal
H., and Martin, G. F. (1995). Perceptual characteristics of laryngectomy based on whole-organ examination. Archives
hemilaryngectomized and near-total laryngectomized male of Otolaryngology, 111, 356–360.
speakers. Journal of Medical Speech-Language Pathology, Tucker, G. (1964). Some clinical inferences from the study of
3, 131–143. serial laryngeal sections. Laryngoscope, 73, 728–748.
Hoasjoe, D. R., Martin, G. F., Doyle, P. C., and Wong, F. S. Tucker, G. F., and Smith, H. R. (1982). A histological dem-
(1992). A comparative acoustic analysis of voice production onstration of the development of laryngeal connective tissue
by near-total laryngectomy and normal laryngeal speakers. compartments. Transactions of the American Academy of
Journal of Otolaryngology, 21, 39–43. Ophthalmology and Otolaryngology, 66, 308–318.
Keith, R. L., Leeper, H. A., and Doyle, P. C. (1996). Micro-
acoustical measures of voice following near-total laryn-
gectomy. Otolaryngology–Head and Neck Surgery, 113,
698–694. Voice Therapy: Breathing Exercises
Leeper, H. A., Heeneman, H., and Reynolds, C. (1990). Vocal
function following vertical hemilaryngectomy: A pre-
liminary investigation. Journal of Otolaryngology, 19, 62–67. Breathing—the mechanical process of moving air in and
Rizer, F. M., Schecter, G. L., and Coleman, R. F. (1984). out of the lungs—plays an important role in both speech
Voice quality and intelligibility characteristics of the recon- and voice production; however, the emphasis placed on
structed larynx and pseudolarynx. Otolaryngology–Head breathing exercises relative to voice disorders in the
and Neck Surgery, 92, 635–638. published literature is mixed. A review of voice therapy
techniques by Casper and Murray (2000) did not suggest
Further Readings any breathing exercises for voice disorders. Some books
Biacabe, B., Crevier-Buchman, L., Hans, S., Laccourreye, O., on voice and voice disorders have no discussion of
and Brasnu, D. (1999). Vocal function after vertical partial changing breathing behavior relative to voice disorders
laryngectomy with glottic reconstruction by false vocal fold (Case, 1984; Colton and Casper, 1996), others do (e.g.,
flap: Durational and frequency measures. Laryngoscope, Aronson, 1980; Boone and McFarlane, 2000; Cooper,
109, 698–704. 1973; Stemple, Glaze, and Gerdeman, 1995). Although
Crevier-Buchman, L., Laccourreye, O., Wuyts, F. L., Mon- breathing exercises are advocated by some, little is
frais-Pfauwadel, M., Pillot, C., and Brasnu, D. (1998). known about the role played by breathing, either directly
Comparison and evolution of perceptual and acoustic or indirectly, in disorders of the voice. Reed (1980) noted
characteristics of voice after supracricoid partial laryn- the lack of empirical evidence that breathing exercises
gectomy with cricohyoidoepiglottopexy. Acta Otolaryn-
were useful in ameliorating voice disorders.
gologica, 118, 594–599.
DeSanto, L. W., Pearson, B. W., and Olsen, K. D. (1989). At present, then, there is a paucity of data on the re-
Utility of near-total laryngectomy for supraglottic, pharyn- lationship of breathing to voice disorders. The data that
geal, base-of-tongue, and other cancers. Annals of Otology, do exist generally describe the breathing patterns that
Rhinology, and Laryngology, 98, 2–7. accompany voice disorders, but there are no data on
De Vincentiis, M., Minni, A., Gallo, A., and DiNardo, A. what kind of breathing behavior might contribute to
(1998). Supracricoid partial laryngectomies: Oncologic and voice disorders. For example, Sapienza, Stathopoulos,
functional results. Head and Neck, 20, 504–509. and Brown (1997) studied breathing kinematics during
Fung, K., Yoo, J., Leeper, H. A., Hawkins, S., Heeneman, H., reading in ten women with vocal fold nodules. They
Doyle, P. C., et al. (2001). Vocal function following radia- found that the women used more air per syllable, more
tion for non-laryngeal versus laryngeal tumors of the head
and neck. Laryngoscope, 111, 1920–1924.
lung volume per phrase, and initiated breath groups at
Gavilan, J., Herranz, J., Prim, P., and Rabanal, I. (1996). higher lung volumes than women without vocal nodules.
Speech results and complications of near-total laryn- However, as the authors point out, the breathing behav-
gectomy. Annals of Otology, Rhinology, and Laryngology, ior observed in the women with nodules was most likely
105, 729–733. in response to ine‰cient valving at the larynx and did
Hanamitsu, M., Kataoka, H., Takeuchi, E., and Kitajima, K. not cause the nodules.
(1999). Comparative study of vocal function after near-total
laryngectomy. Laryngoscope, 109, 1320–1323. Normal Breathing
Kirchner, J. A. (1975). Growth and spread of laryngeal can-
cer as related to partial laryngectomy. Laryngoscope, 85, When assessing and planning therapy for disorders of
1516–1521. the voice, it is important to know what normal function
Voice Therapy: Breathing Exercises 83
is. Hixon (1975) provides a useful parameterization of E¤ects of Posture

breathing for speech that includes volume, pressure,
flow, and body configuration or shape. For conversa- Although much is known about speech breathing, little
tional speech in the upright body position, the following of this information has found its way into the clinical
apply. Lung volume is the amount of air available for literature and been applied to voice therapy. As a result,
speaking or vocalizing. The volumes used for speech are only one breathing technique to improve voice pro-
usually within the midvolume range of vital capacity duction is usually described: The client is placed supine
(VC), beginning at 60% VC and ending at around 40% and increased outward movement of the abdomen is
VC (Hixon, 1973; Hixon, Mead, and Goldman, 1976; observed as the client breathes at rest.
Hoit et al., 1989). This volume range is e‰cient and The changes that occur in speech breathing with a
economical, in that extra e¤ort is not required to over- switch from the upright position are numerous and re-
come recoil forces (Hixon, Mead, and Goldman, 1976). flect the di¤erent e¤ects of gravity (see Hoit, 1995, for a
Most lung volume exchange for speech is brought about comprehensive tutorial). In supine speech breathing
by rib cage displacement and not by displacement of the involves approximately 20% less of VC. The change of
abdomen (Hixon, 1973). The rib cage is e‰cient in dis- body configuration from the upright position to the su-
placing lung volume because it covers a greater surface pine position means that rib cage volume decreases and
area of the lungs, it consists of muscle fiber types that are abdominal volume increases. This modification changes
able to generate fast and accurate pressure changes, and the mechanics of the breathing muscles and requires a
it is well endowed with spindle organs for purposes of di¤erent motor control strategy for speech production.
sensory feedback. Pressure (translaryngeal pressure) is For example, in the supine position there is little or no
related to the intensity of the voice (Bouhuys, Proctor, muscular activity of the abdomen during speaking,
and Mead, 1966). Pressure for conversational speech is whereas in the upright body position the abdominal
typically around 4–7 cm H2 O, or 0.4–0.7 kPa (Isshiki, muscles are quite active (Hixon, Mead, and Goldman,
1964; Stathopoulos and Sapienza, 1993). Pressure is 1976; Hoit et al., 1988). In light of the mechanical and
generated by both muscular and inherent recoil forces, neural control issues discussed earlier, it seems unwar-
and the interworking of these forces depends on the level ranted to position an individual supine to teach ‘‘natu-
of lung volume (Hixon, Mead, and Goldman, 1976). ral’’ breathing for speech and voice. With regard to
Reference to flow is in the macro sense and denotes breathing at rest, it should be noted that this task is not
shorter inspiratory durations relative to longer expira- specific to speech. Kelso, Saltzman, and Tuller (1986)
tory durations. This di¤erence in timing reflects the hypothesize that the control of speech is task-specific.
speaker’s desire to maintain the flow of speech in his or Hixon (1982) showed kinematic data from a patient with
her favor. In the upright body position, body configura- Friedreich’s ataxia whose abdominal wall was assumed
tion refers to the size of the abdomen relative to the size to be paralyzed because it showed no inward displace-
of rib cage. For speech production in the upright body ment and no movement during speech. However, when
position, the abdomen is smaller and the rib cage is this patient laughed, his abdominal wall was displaced
larger relative to relaxation (Hixon, Goldman, and inward and displayed a great amount of movement. As
Mead, 1973). Hoit (1995) points out, it seems unlikely that techniques
The upright body configuration provides an econom- for changing breathing behavior learned in a resting,
ical and e‰cient mechanical advantage to the breathing supine position would generalize to an upright body po-
apparatus. The abdomen not only produces lung volume sition. It seems curious why this technique is advocated.
change in the expiratory direction, it also optimizes the It may be that this technique does not change breathing
function of the diaphragm and rib cage. It does so in two behavior but is e¤ective in relaxing individuals with
ways. First, inward abdominal placement lifts the dia- voice disorders. Relaxation techniques have been advo-
phragm and the rib cage (Goldman, 1974). This action cated to reduce systemic muscular tension in individ-
positions the expiratory muscle fibers of the rib cage and uals with voice disorders (Boone and McFarlane, 2000;
muscle fibers of the diaphragm on a more favorable Greene and Mathieson, 1989), and breathing exercises
portion of their length-tension curve. This allows quicker are known to be beneficial in reducing heart rate and
and more forceful contractions for both inspiration and blood pressure (Grossman et al., 2001; Han, Stegen,
expiration while using less neural energy. Second, this Valck, Clement, and Woestijne, 1996).
inward abdominal position as it is maintained provides a
Other Approaches
platform against which the diaphragm and rib cage can
push in order to produce the necessary pressures and If learning breathing techniques in the supine position is
flows for speech. If the abdomen did not o¤er resistance not useful, what else might be done with the breathing
to the rib cage during the expiratory phase of speech apparatus to ameliorate voice disorders? Perhaps mod-
breathing, it would be forced outward and would move ifying lung volume would be useful. Hixon and Putnam
in a paradoxical manner during expiration. Paradoxical (1983) described breathing behavior in a 30-year-old
motion results in reduced economy of movement, or woman (a local television broadcaster) with a functional
wasted motion. Thus, the pressure generated by the rib voice problem. Although she had a normal voice and no
cage would alter the shape of the breathing apparatus positive laryngeal signs, audible inspiratory turbulence
and would not assist in developing as rapid and as large was evident during her broadcasts. Using breathing ki-
an alveolar pressure change (Hixon and Weismer, 1995). nematic measurement techniques, Hixon and Putnam
84 Part I: Voice
found that this person spoke in the lower range of her Greene, M., and Mathieson, L. (1989). The voice and its dis-
VC, between 45% and 10%. They believed the noisy orders (5th ed.). London: Whurr.
inspirations were due to the turbulence created by in- Grossman, E., Grossman, A., Schein, M., Zimlichman, R., and
creased resistance in the lower airways that occurs at low Gavish, B. (2001). Breathing-control lowers blood pressure.
Journal of Human Hypertension, 14, 263–269.
lung volume. Therefore, it is possible the telecaster’s
Han, J., Stegen, K., Valck, C. D., Clement, J., and van de
noisy inspirations could have been eliminated or reduced Woestijne, K. (1996). Influence of breathing therapy on
if she were to produce speech at higher (more normal) complaints, anxiety and breathing pattern in patients with
lung volumes. However, the authors did not report any hyperventilation syndrome and anxiety disorders. Journal of
attempts to modify lung volume. Of note, the woman Psychosomatic Research, 41, 481–493.
said that when she spoke at lower lung volumes, her voice Hixon, T. (1975). Respiratory-laryngeal evaluation. Paper pre-
sounded more authoritative, and that her voice seemed sented at the Veterans Administration Workshop on Motor
to be much lighter when she spoke at higher lung volumes. Speech Disorders, Madison, WI.
When a person inspires to higher lung volumes, the Hixon, T., and Putnam, A. (1983). Voice abnormalities in re-
downward movement of the diaphragm pulls on the lation to respiratory kinematics. Seminars in Speech and
Language, 5, 217–231.
trachea, and this pulling is believed to generate passive
Hixon, T., and Weismer, G. (1995). Perspectives on the Edin-
adductory forces on the vocal folds (Zenker and Zenker, burgh study of speech breathing. Journal of Speech and
1960). Solomon, Garlitz, and Milbrath (2000) found that Hearing Research, 38, 42–60.
in men, there was a tendency for laryngeal airway resis- Hixon, T. J. (1973). Respiratory function in speech. In F. D.
tance to be reduced during syllable production at high Minifie, T. J. Hixon, and F. Williams (Eds.), Normal
lung volumes compared with low lung volumes. Mil- aspects of speech, hearing, and language. Englewood Cli¤s,
stein (1999), using video-endoscopic and breathing ki- NJ: Prentice-Hall.
nematic analysis, found that at high lung volumes, the Hixon, T. J. (1982). Speech breathing kinematics and mecha-
laryngeal area appeared more dilated and the larynx nism inferences therefrom. In S. Grillner, B. Lindblom,
was in a lower vertical position in the neck than dur- J. Lubker, and A. Persson (Eds.), Speech motor control
(pp. 75–93). Oxford, UK: Pergamon Press.
ing phonation at lower lung volumes. Plassman and
Hixon, T. J., Goldman, M. D., and Mead, J. (1973). Kine-
Lansing (1990) showed that with training, individuals matics of the chest wall during speech production: Volume
can produce consistently higher lung volumes during displacements of the rib cage, abdomen, and lung. Journal
inspiration. of Speech and Hearing Research, 16, 78–115.
Even after the call by Reed (1980) more than 20 years Hixon, T. J., Mead, J., and Goldman, M. D. (1976). Dynamics
ago for more empirical data on breathing exercises to of the chest wall during speech production: Function of the
treat voice disorders, little has been done. More research thorax, rib cage, diaphragm, and abdomen. Journal of
in this area is decisively needed. Research e¤orts should Speech and Hearing Research, 19, 297–356.
focus first on how and whether abnormal breathing be- Hoit, J. (1995). Influence of body position on breathing and its
havior contributes to voice disorders. Then researchers implications for the evaluation and treatment of voice dis-
orders. Journal of Voice, 9, 341–347.
should examine what techniques are viable for changing
Hoit, J. D., Hixon, T. J., Altman, M. E., and Morgan, W. J.
this abnormal breathing behavior—if it exists. E‰cacy (1989). Speech breathing in women. Journal of Speech and
research is of great importance because of the reluctance Hearing Research, 32, 353–365.
of third-party insurers to cover voice disorders. Hoit, J. D., Plassman, B. L., Lansing, R. W., and Hixon,
T. J. (1988). Abdominal muscle activity during speech
—Peter Watson
production. Journal of Applied Physiology, 65, 2656–
References 2664.
Isshiki, N. (1964). Regulatory mechanisms of voice intensity
Aronson, A. (1980). Clinical voice disorders: An interdisciplin- variation. Journal of Speech and Hearing Research, 7,
ary approach. New York: B.C. Decker. 17–29.
Boone, D., and McFarlane, S. (2000). The voice and voice Kelso, J. A. S., Saltzman, E. L., and Tuller, B. (1986). The
therapy (6th ed.). Boston: Allyn and Bacon. dynamic perspective on speech production: Data and
Bouhuys, A., Proctor, D. F., and Mead, J. (1966). Kinetic theory. Journal of Phonetics, 14, 29–59.
aspects of singing. Journal of Applied Physiology, 21, Milstein, C. (1999). Laryngeal function associated with changes
483–496. in lung volume during voice and speech production in normal
Case, J. (1984). Clinical management of voice disorders. Rock- speaking women. Unpublished dissertation, University of
ville, MD: Aspen. Arizona, Tucson.
Casper, J., and Murray, T. (2000). Voice therapy methods in Plassman, B., and Lansing, R. (1990). Perceptual cues used to
dysphonia. Otolaryngologic Clinics of North America, 33, reproduce an inspired lung volume. Journal of Applied
983–1002. Physiology, 69, 1123–1130.
Colton, R., and Casper, J. (1996). Understanding voice prob- Reed, C. (1980). Voice therapy: A need for research. Journal of
lems (2nd ed.). Baltimore: Williams and Wilkins. Speech and Hearing Disorders, 45, 157–169.
Cooper, M. (1973). Modern techniques of vocal rehabilitation. Sapienza, C. M., Stathopoulos, E. T., and W. S. Brown, J.
Springfield: Charles C. Thomas. (1997). Speech breathing during reading in women with
Goldman, M. (1974). Mechanical coupling of the diaphragm vocal nodules. Journal of Voice, 11, 195–201.
and rib cage. In L. Pengally, A. Rebuk, and C. Reed (Eds.), Solomon, N., Garlitz, S., and Milbrath, R. (2000). Respiratory
Loaded breathing (pp. 50–63). London: Churchill Living- and laryngeal contributions to maximum phonation dura-
stone. tion. Journal of Voice, 14, 331–340.
Voice Therapy: Holistic Techniques 85
Stathopoulos, E. T., and Sapienza, C. (1993). Respiratory Watson, P. J., Hixon, T. J., Stathopoulos, E. T., and Sullivan,
and laryngeal function of women and men during vocal in- D. R. (1990). Respiratory kinematics in female classical
tensity variation. Journal of Speech and Hearing Research, singers. Journal of Voice, 4, 120–128.
36, 64–75. Watson, P. J., Hoit, J. D., Lansing, R. W., and Hixon, T. J.
Stemple, J., Glaze, L., and Gerdeman, B. (1995). Clinical voice (1989). Abdominal muscle activity during classical singing.
pathology: Theory and management (2nd ed.). San Diego, Journal of Voice, 3, 24–31.
CA: Singular Publishing Group. Webb, R., Williams, F., and Minifie, F. (1967). E¤ects of
Zenker, W., and Zenker, A. (1960). Über die Regelung der verbal decision behavior upon respiration during speech
Stimmlippenspannung durch von aussen eingreifende production. Journal of Speech and Hearing Research, 10,
Mechansimen. Folia Phoniatricia, 12, 1–36. 49–56.
Winkworth, A. L., and Davis, P. J. (1997). Speech breathing
Further Readings and the lombard e¤ect. Journal of Speech, Language, and
Agostoni, E., and Mead, J. (1986). Statics of the respiratory Winkworth, A. L., Davis, P. J., Ellis, E., and Adams, R. D.
system. In P. T. Macklem and J. Mead (Eds.), The respira- (1994). Variability and consistency in speech breathing dur-
tory system (3rd ed., vol. 3, pp. 387–409). Bethesda, MD: ing reading: Lung volumes, speech intensity, and linguistic
American Physiological Society. factors. Journal of Speech and Hearing Research, 37,
Druz, W. S., and Sharp, J. T. (1981). Activity of respiratory 535–556.
muscles in upright and recumbent humans. Journal of
Applied Physiolgy, Respiration, and Environment, 51,
1552–1561. Voice Therapy: Holistic Techniques
Goldman-Eisler. (1968). The significance of breathing in
speech. In Psycholinguistics: Experiments in spontaneous
speech (pp. 100–106). New York: Academic Press. Whenever a voice disorder is present, a change in the
Konno, K., and Mead, J. (1967). Measurement of the separate normal functioning of the physiology responsible for
volume changes of rib cage and abdomen during breathing. voice production may be assumed. These physiological
Journal of Applied Physiology, 22, 407–422.
Lansing, R. W., and Banzett, R. B. (1996). Psychophysical events are measurable and may be modified by voice
methods in the study of respiratory sensation. In L. Adams therapy. Normal voice production depends on a relative
and A. Guz (Eds.), Respiratory sensation. New York: Mar- balance among airflow, supplied by the respiratory sys-
cel Dekker. tem; laryngeal muscle strength, balance, coordination,
Loring, S. H., and Bruce, E. N. (1986). Methods for study of and stamina; and coordination among these and the
the chest wall. In P. T. Macklem and J. Mead (Eds.), supraglottic resonators (pharynx, oral cavity, nasal
Handbook of physiology (vol. 3, pp. 415–428). Betheseda, cavity). Any disturbance in the relative physiological
MD: American Physiological Society. balance of these vocal subsystems may lead to or be
Loring, S., Mead, J., and Griscom, N. T. (1985). Dependence perceived as a voice disorder. Disturbances may occur in
of diaphragmatic length on lung volume and thoraco-
respiratory volume, power, pressure, and flow, and may
abdominal configuration. Journal of Applied Physiology,
59, 1961–1970. manifest in vocal fold tone, mass, sti¤ness, flexibility,
McFarland, D. H., and Smith, A. (1989). Surface recordings and approximation. Finally, the coupling of the supra-
of respiratory muscle activity during speech: Some prelimi- glottic resonators and the placement of the laryngeal
nary findings. Journal of Speech and Hearing Research, 32, tone may cause or be implicated in a voice disorder
657–667. (Titze, 1994).
Mead, J., and Agostoni, E. (1986). Dynamics of breathing. In The overall causes of vocal disturbances may be me-
P. T. Macklem and J. Mead (Eds.), The respiratory system chanical, neurological, or psychological. Whatever the
(3rd ed., vol. 3, pp. 411–427). Bethesda, MD: American cause, one management approach is direct modification
Physiological Society. of the inappropriate physiological activity through direct
Mitchell, H., Hoit, J., and Watson, P. (1996). Cognitive-
exercise and manipulation. When all three subsystems of
linguistic demands and speech breathing. Journal of Speech
and Hearing Research, 39, 93–104. voice are addressed in one exercise, this is considered
Porter, R. J., Hogue, D. M., and Tobey, E. A. (1994). Dy- holistic voice therapy. Examples of holistic voice therapy
namic analysis of speech and non-speech respiration. In include Vocal Function Exercises (Stemple, Glaze, and
F. Bell-Berti and L. J. Raphael (Eds.), Studies in speech Klaben, 2000), Resonant Voice Therapy (Verdolini,
production: A Festschrift for Katherine Sa¤ord Harris 2000), the Accent Method of voice therapy (Kotby,
(pp. 1–25). Washington, DC: American Institute of Physics. 1995; Harris, 2000), and the Lee Silverman Voice
Smith, J. C., and Loring, S. H. (1986). Passive mechanical Treatment (Ramig, 2000). The following discussion
properties of the chest wall. In P. T. Macklem and J. Mead considers the use of Vocal Function Exercises to
(Eds.), Handbook of physiology (vol. 3, pp. 429–442). strengthen and balance the vocal mechanism.
Betheseda, MD: American Physiological Society.
The Vocal Function Exercise program is based on an
Watson, P. J., and Hixon, T. J. (1985). Respiratory kinematics
in classical (opera) singers. Journal of Speech and Hearing assumption that has not been proved empirically.
Research, 28, 104–122. Nonetheless, this assumption and the clinical logic that
Watson, P., Hixon, T., and Maher, M. (1987). To breathe or follows have been supported through many years of
not to breathe—that is the question: An investigation of clinical experience and observation, as well as sev-
speech breathing kinematics in world-class Shakespearean eral e‰cacy studies (Stemple, 1994; Sabol, Lee, and
actors. Journal of Voice, 1, 269–272. Stemple, 1995; Roy et al., 2001). In a double-blind,
86 Part I: Voice
placebo-controlled study, Stemple et al. (1994) demon- muscles of the larynx must be in equilibrium. Briess’s
strated that Vocal Function Exercises were e¤ective in exercises concentrated on restoring the balance in the
enhancing voice production in young women without laryngeal musculature and decreasing tension of the
vocal pathology. The primary physiological e¤ects were hyperfunctioning muscles. Unfortunately, many as-
reflected in increased phonation volumes at all pitch sumptions Briess made regarding laryngeal muscle func-
levels, decreased airflow rates, and a subsequent increase tion were incorrect, and his therapy methods were not
in maximum phonation times. Frequency ranges were widely followed. The concept of direct exercise to
extended significantly in the downward direction. strengthen voice production persisted. Barnes (1977)
Sabol, Lee, and Stemple (1995), experimenting with described a modification of Briess’s work that she
the value of Vocal Function Exercises in the practice termed Briess Exercises. These exercises were modi-
regimen of singers, used graduate students of opera as fied and expanded by Stemple (1984) into Vocal Func-
subjects. Significant improvements in the physiologic tion Exercises. The exercise program strives to balance
measurements of voice production were achieved, in- and strengthen the subsystems of voice production,
cluding increased airflow volume, decreased airflow whether the disorder is one of vocal hyperfunction or
rates, and increased maximum phonation time, even in hypofunction.
this group of superior voice users. The exercises are simple to teach and, when presented
Roy et al. (2001) studied the e‰cacy of Vocal Func- appropriately, seem reasonable to patients. Indeed,
tion Exercises in a population with voice pathology. many patients are enthusiastic to have a concrete pro-
Teachers who reported experiencing voice disorders were gram, similar in concept to physical therapy, during
randomly assigned to three groups: Vocal Function which they may plot the progress of their return to vocal
Exercises, vocal hygiene, and control groups. For 6 e‰ciency. The program begins by describing the prob-
weeks the experimental groups followed their respec- lem to the patient, using illustrations as needed or the
tive therapy programs and were monitored by speech- patient’s own stroboscopic evaluation video. The patient
language pathologists trained by the experimenters in is then taught a series of four exercises to be practiced at
the two approaches. Pre- and post-testing of all three home, two times each, twice a day, preferably morning
groups using the Voice Handicap Index (VHI; Jacobson and evening. These exercises include the following:
et al., 1997) revealed significant improvement in the
Vocal Function Exercise group, and no improvement in 1. Sustain the /i/ vowel for as long as possible on a mu-
the vocal hygiene group. Subjects in the control group sical note: F above middle C for females and boys, F
rated themselves worse. below middle C for males. (Pitches may be modified
The laryngeal mechanism is similar to other muscle up or down to fit the needs of the patient. Seldom are
systems and may become strained and imbalanced for a they modified by more than two scale steps in either
variety of reasons (Saxon and Schneider, 1995). Indeed, direction.)
the analogy that we often draw with patients is a com- The goal of the exercise is based on airflow volume.
parison of knee rehabilitation with rehabilitation of the In our clinic, the goal is based on reaching 80–100 mL/s
voice. Both the knee and the larynx consist of muscle, of airflow. So, if the flow volume is 4000 mL, the goal is
cartilage, and connective tissue. When the knee is 40–45 s. When airflow measurements are not available,
injured, rehabilitation includes a short period of im- the goal is equal to the longest /s/ that the patient is able
mobilization to reduce the e¤ects of the acute injury. to sustain. Placement of the tone should be in an extreme
The immobilization is followed by assisted ambulation, forward focus, almost but not quite nasal. All exercises
and then the primary rehabilitation begins, in the form are produced as softly as possible, but not breathy. The
of systematic exercise. This exercise is designed to voice must be engaged. This is considered a warm-up
strengthen and balance all of the supportive knee mus- exercise.
cles for the purpose of returning the knee as close to its
2. Glide from your lowest note to your highest note on
normal functioning as possible.
the word knoll.
Rehabilitation of the voice may also involve a short
period of voice rest after acute injury or surgery to per- The goal is to achieve no voice breaks. The glide
mit healing of the mucosa to occur. The patient may requires the use of all laryngeal muscles. It stretches the
then begin conservative voice use and follow through vocal folds and encourages a systematic, slow engage-
with all of the management approaches that seem nec- ment of the cricothyroid muscles. The word knoll
essary. Full voice use is then resumed quickly, and the encourages a forward placement of the tone as well as an
therapy program often is successful in returning the expanded open pharynx. The patient’s lips are to be
patient to normal voice production. Often, however, rounded, and a sympathetic vibration should be felt on
patients are not fully rehabilitated because an important the lips. (A lip trill, tongue trill, or the word whoop may
step was neglected—the systematic exercise program to also be used.) Voice breaks will typically occur in the
regain the balance among airflow, laryngeal muscle transitions between low and high registers. When breaks
activity, and supraglottic placement of the tone. occur, the patient is encouraged to continue the glide
A series of laryngeal muscle exercises was first de- without hesitation. When the voice breaks at the top of
scribed by Bertram Briess (1957, 1959). Briess suggested the current range and the patient typically has more
that for the voice to be most e¤ective, the intrinsic range, the glide may be continued without voice as the
Voice Therapy: Holistic Techniques 87
folds will continue to stretch. Glides improve muscular approximate the correct notes well with practice and
control and flexibility. This is considered a stretching guidance from the voice pathologist.
exercise. Finally, patients are given a chart on which to mark
their sustained times, which is a means of plotting prog-
3. Glide from your highest note to your lowest note on ress. Progress is monitored over time and, because of
the word knoll. normal daily variability, patients are encouraged not
The goal is to achieve no voice breaks. The patient is to compare today with tomorrow, and so on. Rather,
instructed to feel a half-yawn in the throat throughout weekly comparisons are encouraged. The estimated time
this exercise. By keeping the pharynx open and focus- of completion for the program is 6–8 weeks. Some
ing the sympathetic vibration at the lips, the downward patients experience minor laryngeal aching for the first
glide encourages a slow, systematic engagement of the day or two of the program, similar to the muscle aching
thyroarytenoid muscles without the presence of a back- that might occur with any new muscular exercise. As this
focused growl. In fact, no growl is permitted. (A lip trill, discomfort will soon subside, they are encouraged to
tongue trill, or the word boom may also be used.) This is continue the program through the discomfort should it
considered a contracting exercise. occur.
When the patient has reached the predetermined
4. Sustain the musical notes C-D-E-F-G for as long as therapy goal, and the voice quality and other vocal
possible on the word knoll minus the kn. The range symptoms have improved, then a tapering maintenance
should be around middle C for females and boys, an program is recommended. Although some professional
octave below middle C for men. voice users choose to remain in peak vocal condition,
many of our patients desire to taper the exercise pro-
The goal is the same as for exercise 1. The -oll is pro-
gram. The following systematic taper is recommended:
duced with an open pharynx and constricted, sympa-
thetically vibrating lips. The shape of the pharynx in Full program, 2 times each, 2 times per day
respect to the lips is like an inverted megaphone. This Full program, 2 times each, 1 time per day (morning)
exercise may be tailored to the patient’s present vocal Full program, 1 time each, 1 time per day (morning)
ability. Although the basic range of middle C (an octave Exercise 4, 2 times each, 1 time per day (morning)
lower for men) is appropriate for most voices, the exer- Exercise 4, 1 time each, 1 time per day (morning)
cises may be customized up or down to fit the current Exercise 4, 1 time each, 3 times per week (morning)
vocal condition or a particular voice type. Seldom, Exercise 4, 1 time each, 1 time per week (morning)
however, is the exercise shifted more than two scale steps
in either direction. This is considered a low-impact Each taper should last 1 week. Patients should maintain
adductory power exercise. 85% of their peak time; otherwise they should move up
one step in the taper until the 85% criterion is met.
The quality of the tone is also monitored for voice
Vocal Function Exercises provide a holistic voice
breaks, wavering, and breathiness. Tone quality
treatment program that attends to the three major sub-
improves as times increase and pathologic conditions
systems of voice production. The program appears to
begin to resolve. All exercises are done as softly as pos-
benefit patients with a wide range of voice disorders be-
sible. It is much more di‰cult to produce soft tones; cause it is reasonable in regard to time and e¤ort. It is
therefore, the vocal subsystems will receive a better
similar to other recognizable exercise programs: the
workout than if louder tones are produced. Extreme care
concept of physical therapy for the vocal folds is under-
is taken to teach the production of a forward tone that standable; progress may be easily plotted, which is
lacks tension. In addition, attention is paid to the glot-
inherently motivating; and it appears to balance and
tal onset of the tone. The patient is asked to breathe in
strengthen the relationships among airflow, laryngeal
deeply, with attention paid to training abdominal
muscle activity, and supraglottic placement.
breathing, posturing the vowel momentarily, and then
initiating the exercise gesture without a forceful glottal —Joseph Stemple
attack or an aspirated breathy attack. It is explained to
the patient that maximum phonation times increase as References
the e‰ciency of the vocal fold vibration improves. Times
do not increase with improved lung capacity. (Even Barnes, J. (1977, October). Briess exercises. Workshop pre-
aerobic exercise does not improve lung capacity, but sented at the Southwestern Ohio Speech and Hearing As-
rather the e‰ciency of oxygen exchange with the circu- sociation, Cincinnati, OH.
latory system, thus giving the sense of more air.) Behrman, A., and Orliko¤, R. (1997). Instrumentation in voice
The musical notes are matched to the notes produced assessment and treatment: What’s the use? American Jour-
nal of Speech-Language Pathology, 6(4), 9–16.
by an inexpensive pitch pipe that the patient purchases Bless, D. (1991). Assessment of laryngeal function. In C. Ford
for use at home, or a tape recording of live voice doing and D. Bless (Eds.), Phonosurgery (pp. 91–122). New York:
the exercises may be given to the patient for home use. Raven Press.
Many patients find the tape-recorded voice easier to Briess, B. (1957). Voice therapy: Part 1. Identification of spe-
match than the pitch pipe. We have found that patients cific laryngeal muscle dysfunction by voice testing. Archives
who think they are ‘‘tone deaf’’ can often be taught to of Otolaryngology, 66, 61–69.
88 Part I: Voice
Briess, B. (1959). Voice therapy: Part II. Essential treatment tions, voice therapy is e¤ective in reducing such dis-
phases of laryngeal muscle dysfunction. Archives of Oto- ruptions. Health-related concerns are less common
laryngology, 69, 61–69. precipitants of voice therapy in adults. However, physi-
Harris, S. (2000). The accent method of voice therapy. In cal disease such as cancerous, precancerous, inflam-
J. Stemple (Ed.), Voice therapy: Clinical studies (2nd ed.,
matory, or neurogenic disease may exist and may be
pp. 62–75). San Diego, CA: Singular Publishing Group.
Hicks, D. (1991). Functional voice assessment: What to mea- exacerbated by behavioral factors such as smoking, diet,
sure and why. In Assessment of speech and voice production: hydration, or phonotrauma. Voice therapy may be a
Research and clinical applications (pp. 204–209). Bethesda, useful adjunct to medical or surgical treatment in these
MD: National Institute of Deafness and Other Communi- cases. Finally, voice therapy may be indicated in cases of
cative Disorders. diagnostic uncertainty. A classic situation is the need
Jacobson, B., Johnson, A., Grywalski, C., Silbergleit, A., to distinguish between functional and neurogenic con-
Jacobson, G., Benninger, M., and Newman, C. (1997). The ditions. The restoration of a normal or near-normal
Voice Handicap Index (VHI): Development and validation. voice with therapy may suggest a functional origin of
American Journal of Speech-Language Pathology, 6(3), the problem. Lack of voice restoration suggests the need
66–70.
for further clinical studies to rule out neurological causes.
Kotby, N. (1995). The accent method of voice therapy. San
Diego, CA: Singular Publishing Group. Voice therapy can be characterized with reference
Ramig, L. (2000). Lee Silverman voice treatment for indi- to several di¤erent classification schemes, which results
viduals with neurological disorders: Parkinson disease. In in a certain amount of nosological confusion. Many of
J. Stemple (Ed.), Voice therapy: Clinical studies (2nd ed., the conditions listed in the various classifications map
pp. 76–81). San Diego, CA: Singular Publishing Group. to several di¤erent voice therapy options, and by the
Roy, N., Gray, S., Simon, M., Dove, H., Corbin-Lewis, K., same token, each therapy option maps to multiple
and Stemple, J. (2001). An evaluation of the e¤ects of two classifications. Here we review voice therapy in relation
treatment approaches for teachers with voice disorders: A to (1) vocal biomechanics and (2) a specific therapy
prospective randomized clinical trial. Journal of Speech and approach—roughly the ‘‘what’’ and ‘‘how’’ of voice
therapy.
Sabol, J., Lee, L., and Stemple, J. (1995). The value of vocal
function exercises in the practice regimen of singers. Journal
of Voice, 9, 27–36. Vocal Biomechanics. The preponderance of voice
Saxon, K., and Schneider, C. (1995). Vocal exercise physiology. problems that are amenable to voice therapy involve
San Diego, CA: Singular Publishing Group. some form of abnormality in vocal fold adduction. Pho-
Stemple, J. (1984). Clinical voice pathology: Theory and man- notraumatic lesions such as nodules, polyps, and non-
agement (1st ed.). Columbus, OH: Merrill. specific inflammation consequent on voice use are
Stemple, J. (2000). Vocal function exercises. In J. Stemple traceable to hyperadduction resulting from vocal fold
(Ed.), Voice therapy: Clinical studies (2nd ed., pp. 34–46). impact stress. Adduction causes monotonic increases in
San Diego, CA: Singular Publishing Group. impact stress (Jiang and Titze, 1994). In turn, impact
Stemple, J., Glaze, L., and Klaben, B. (2000). Clinical voice
stress appears to be a primary cause of phonotrauma
pathology: Theory and management (3rd ed.). San Diego,
CA: Singular Publishing Group. (Titze, 1994). Thus, therapy targeting a reduction in
Stemple, J., Lee, L., D’Amico, B., and Pickup, B. (1994). E‰- adduction is indicated in cases of hyperadduction. An-
cacy of vocal function exercises as a method of improving other large group of diagnostic conditions involves
voice production. Journal of Voice, 8, 271–278. hypoadduction of the vocal folds. Examples include
Titze, I. (1991). Measurements for assessment of voice dis- vocal fold paralysis, paresis, atrophy, bowing, and non-
orders. In Assessment of speech and voice production: Re- adducted hyperfunction (muscle tension dysphonia; for a
search and clinical applications (pp. 42–49). Bethesda, MD: discussion, see Hillman et al., 1989). Treatment that
National Institute of Deafness and Other Communicative increases vocal fold closure is indicated in such cases.
Disorders. Voice therapy addresses adductory deviations using a
Titze, I. (1994). Principles of voice production. Englewood
variety of biomechanical solutions. The traditional ap-
Cli¤s, NJ: Prentice-Hall.
Verdolini, K. (2000). Resonant voice therapy. In J. Stemple proach to hyperadduction and its sequelae has targeted
(Ed.), Voice therapy: Clinical studies (2nd ed., pp. 46–62). the use of widely separated vocal folds and small-
San Diego, CA: Singular Publishing Group. amplitude oscillations during voice production; exam-
ples are use of a ‘‘quiet, breathy voice’’ (Casper et al.,
1989; Casper, 1993) or quiet ‘‘yawn-sigh’’ phonation
(Boone and McFarlane, 1993). This general approach is
sensible for the reduction of hyperadduction and thus
Voice Therapy for Adults phonotraumatic changes, in that vocal fold impact
stress, and phonotrauma, should be reduced by it. There
is evidence that the quiet, breathy voice approach is
Voice therapy for adults may be motivated by func- e¤ective in reducing signs and symptoms of phono-
tional, health-related, or diagnostic considerations. traumatic lesions for individuals who use it outside the
Functional issues are the usual indication. Adults with clinic (Verdolini-Marston et al., 1995). However, indi-
voice problems often experience significant functional viduals may also restrict their use of a quiet, breathy
disruptions in occupational, social, communicative, voice extraclinically because it is functionally limiting
physical, or emotional domains, and in selected popula- (Verdolini-Marston et al., 1995).
Voice Therapy for Adults 89
The traditional approach to hypoadduction has in- a wide range of laryngeal diseases, including inflamma-
volved ‘‘pushing’’ and ‘‘pulling’’ exercises, which should tory and even neurogenic and malignant disease. Voice
reduce the glottal gap (e.g., Boone and McFarlane, therapy can play a supportive role to the medical or
1994). Indeed, some data corroborate clinicians’ impres- surgical treatment of laryngopharyngeal reflux by edu-
sions that this approach can increase voice intensity in cating patients regarding behavioral issues such as diet
individuals with glottal incompetence (Yamaguchi et al., and sleeping position. Some data are consistent with the
1993). view that control of laryngopharyngeal reflux can im-
A more recent approach to treating adductory prove both laryngeal appearance and voice symptoms in
abnormalities has focused on the use of a single ‘‘ideal’’ individuals with a diagnosis of laryngopharyngeal reflux
vocal fold configuration as the target for both hyper- (Shaw et al., 1996; Hamdan et al., 2001). However, vo-
adduction and hypoadduction. The configuration in- cal hygiene programs alone in voice therapy apparently
volves barely separated vocal folds, which is ‘‘ideal’’ produce little benefit if they are not coupled with voice
because it optimizes the trade-o¤ between voice output production work.
strength (relatively strong) and vocal fold impact stress,
and thus reduces the potential for phonotraumatic injury Specific Therapy Approach. Recently, interest has
(Berry et al., 2001). Voice produced with this intermedi- emerged in cognitive mechanisms involved in skill ac-
ate laryngeal configuration has been called ‘‘resonant quisition and factors a¤ecting patient compliance as
voice,’’ perceptually corresponding to anterior oral related to voice training and therapy models. Speech-
vibratory sensations during ‘‘easy’’ voicing (Verdolini language pathologists may train individuals to acquire
et al., 1998). Programmatic approaches to resonant the basic biomechanical changes described in preceding
voice training have shown reductions in phonatory ef- paragraphs, and others. The traditional approach is
fort, voice quality, and laryngeal appearance (Verdolini- eclectic and entails implementing a series of facilitating
Marston et al., 1995), as well as reductions in functional techniques such as the ‘‘yawn-sigh’’ and ‘‘push-pull’’
disruptions due to voice problems in individuals with techniques, as well as other maneuvers, such as altering
conditions known or presumed to be related to hyper- the tongue position, changing the loudness of the voice,
adduction, such as nodules. Moreover, there is evidence using chant talk, and using digital manipulation. Facili-
that individuals use this type of voicing outside the clinic tating techniques are used by many clinicians and are
more than the traditional ‘‘quiet, breathy voice’’ because generally considered e¤ective. However, formal e‰cacy
it is functionally tractable (Verdolini-Marston et al., data are lacking for most of the techniques. An ex-
1995). Resonant voice training may also be useful in ception is digital manipulation, specifically manual
improving vocal and functional status in individuals circumlaryngeal therapy (laryngeal massage), used for
with hypoadducted dysphonia. Recent theoretical mod- idiopathic, presumably hyperfunctional dysphonia. Brief
eling has indicated that nonlinear source (vocal fold)– courses of aggressive laryngeal massage by skilled prac-
filter (vocal tract) interactions are critical in maximizing titioners have dramatically improved voice in individuals
voice output germane to resonant voice and other voice with this condition (Roy et al., 1997). Also, variants of
types (Titze, 2002). ‘‘yawn-sigh’’ phonation, such as falsetto and breathy
A relatively small number of clinical cases involve voicing, may temporarily improve symptoms of adduc-
vocal fold elongation abnormalities as the salient feature tory spasmodic dysphonia and increase the duration of
of the vocal condition. Often, the medical condition in- the e¤ectiveness of botulinum toxin injections (Murry
volves cricothyroid paresis, although thyroarytenoid pa- and Woodson, 1995).
resis may also be implicated. Voice therapy has been less Several programmatic approaches to voice therapy
successful in treating such conditions. Other elongation have been developed, some of which have been sub-
abnormalities are functional, as in mutational falsetto. mitted to formal clinical studies. An example is the Lee
The clinical consensus is that voice therapy generally is Silverman Voice Treatment (LSVT). This treatment uses
useful in treating mutational falsetto. ‘‘loud’’ voice to treat not only hypoadduction and
Finally, in addition to addressing laryngeal kinemat- hypophonia, but also prosodic and articulatory deficien-
ics, voice therapy usually also addresses nonphonatory cies in individuals with Parkinson’s disease. LSVT uti-
aspects of biomechanics that influence the vocal fold lizes a predetermined hierarchy of speech tasks in 16
mucosa. Such issues are addressed in voice hygiene pro- therapy sessions delivered over 4 weeks. In comparison
grams (see voice hygiene). Mucosal performance and with control and alternative treatment groups, LSVT
mucosal vulnerability to trauma are the key concerns. has increased vocal loudness and voice inflection for
The primary issues targeted are hydration and behav- as long as 2 years following therapy termination
ioral control of laryngopharyngeal reflux. Dehydration (Ramig, Sapir, Fox et al., 2001; Ramig, Sapir, Coun-
increases the pulmonary e¤ort required for phonation, tryman et al., 2001). Critical aspects of LSVT that may
whereas hydration decreases it and also decreases laryn- contribute to its success include a large number of repe-
geal phonotrauma (e.g., Titze, 1988; Verdolini, Titze, titions of the target ‘‘loud voice’’ in a variety of physical
and Fennell, 1994; Solomon and DiMattia, 2000). Thus, contexts.
hydration regimens are appropriate for individuals with Another programmatic approach to voice therapy,
voice problems and dehydration (Verdolini-Marston, the Lessac-Madsen Resonant Voice Therapy (LMRVT),
Sandage, and Titze, 1994). There is increasing support was developed for individuals with either hyper- or
for the view that laryngopharyngeal reflux plays a role in hypoadducted voice problems associated with nodules,
90 Part I: Voice
polyps, nonspecific phonotraumatic changes, paralysis, References

paresis, atrophy, bowing, and sulcus vocalis. LMRVT
targets the use of barely touching or barely separated Bastian, R. W. (1987). Laryngeal image biofeedback for voice
vocal folds for phonation, a configuration considered to disorder patients. Journal of Voice, 1, 279–282.
Berry, D. A., Verdolini, K., Montequin, D. W., Hess, M. M.,
be ideal because it maximizes the ratio of voice output Chan, R. W., and Titze, I. R. (2001). A quantitative output-
intensity to vocal fold impact intensity (Berry et al., cost ratio in voice production. Journal of Speech-Language-
2001). In LMRVT, eight structured therapy sessions Hearing Research, 44, 29–37.
typically are delivered over 8 weeks. Training empha- Boone, D. R., and McFarlane, S. C. (1993). A critical view of
sizes sensory processing and the extension of ‘‘resonant the yawn-sigh as a voice therapy technique. Journal of
voice’’ to a variety of communicative and emotional Voice, 7, 75–80.
environments. Data on preliminary versions of LMRVT Boone, D. R., and McFarlane, S. C. (1994). The voice and
indicate that it is as useful as quiet, breathy voice train- voice therapy (5th ed.). Englewood Cli¤s, NJ: Prentice Hall.
ing for sorority women with phonotrauma or the use Byl, N. N., Merzenich, M. M., and Jenkins, W. M. (1996). A
of amplification for teachers with voice problems in primate genesis model of focal dystonia and repetitive strain
injury: I. Learning-induced dedi¤erentiation of the repre-
reducing various combinations of phonatory e¤ort, sentation of the hand in the primary somatosensory cortex
voice quality, laryngeal appearance, and functional sta- in adult monkeys. Neurology, 47, 508–520.
tus (Verdolini-Marston et al., 1995). Casper, J. K. (1993). Objective methods for the evaluation of
Another programmatic approach to voice therapy for vocal function. In J. Stemple (Ed.), Voice therapy: Clinical
both hyper- and hypoadducted conditions is called Vocal methods (pp. 39–45). St. Louis: Mosby–Year Book.
Function Exercises (VFE; Stemple et al., 1994). This Casper, J. K., Colton, R. H., Brewer, D. W., and Woo, P.
approach targets similar vocal fold biomechanics as (1989). Investigation of selected voice therapy techniques.
LMRVT, that is, vocal folds that are barely touching or Paper presented at the 18th Symposium of the Voice Foun-
barely separated, for phonation. Training consists of dation, Care of the Professional Voice, New York.
repeating maximally sustained vowels and pitch glides Hamdan, A. L., Sharara, A. I., Younes, A., and Fuleihan, N.
(2001). E¤ect of aggressive therapy on laryngeal symptoms
twice daily over a period of 4–6 weeks. Carryover exer- and voice characteristics in patients with gastroesophageal
cises to conversational speech may also be used. A 6- reflux. Acta Otolaryngologica, 121, 868–872.
week program of VFE in teachers with voice problems Hillman, R. E., Holmberg, E. B., Perkell, J. S., Walsh, M.,
resulted in greater self-perceived voice improvement, and Vaughan, C. (1989). Objective assessment of vocal
greater phonatory ease, and better voice clarity than that hyperfunction: An experimental framework and initial
achieved with vocal hygiene treatment alone (Roy et al., results. Journal of Speech and Hearing Research, 32, 373–
2001). 392.
Another program, Accent Therapy, addresses the Jiang, J. J., and Titze, I. R. (1994). Measurement of vocal fold
ideal laryngeal configuration—barely touching or barely intraglottal stress and impact stress. Journal of Voice, 8,
separated vocal folds—in individuals with hyper- and 132–144.
Koufman, J. A., Amin, M. R., and Panetti, M. (2000). Rele-
hypoadducted conditions (Smith and Thyme, 1976). vance of reflux in 113 consecutive patients with laryngeal
Training entails the use of specified rhythmic, prosodi- and voice disorders. Otolaryngology–Head and Neck Sur-
cally stressed vocal repetitions, beginning with sustained gery, 123, 385–388.
consonants and progressing to phrases and extended Murry, T., and Woodson, G. (1995). Combined-modality
speech. The Accent Method is more widely used in Eu- treatment of adductor spasmodic dysphonia with botulinum
rope and Asia than in the United States. toxin and voice therapy. Journal of Voice, 6, 271–276.
Electromyographic biofeedback has been reported to Ramig, L. O., Sapir, S., Countryman, S., Pawlas, A. A.,
be e¤ective in reducing laryngeal hyperfunction and la- O’Brien, C., Hoehn, M., et al. (2001). Intensive voice treat-
ryngeal appearance in individuals with voice problems ment (LSVT) for patients with Parkinson’s disease: A 2
linked to hyperadduction (nodules). Also, visual feed- year follow up. Journal of Neurology, Neurosurgery, and
Psychiatry, 71, 493–498.
back using videoendoscopy may be useful in treating Ramig, L. O., Sapir, S., Fox, C., and Countryman, S. (2001).
numerous voice conditions; specific clinical observa- Changes in vocal loudness following intensive voice treat-
tions have been reported relative to ventricular phona- ment (LSVT) in individuals with Parkinson’s disease:
tion (Bastian, 1987). A comparison with untreated patients and normal age-
Finally, some clinicians have found that sensory matched controls. Movement Disorders, 16, 79–83.
di¤erentiation exercises may help in the treatment of Roy, N., Gray, S. D., Simon, M., Dove, H., Corbin-Lewis, K.,
repetitive strain injury—one of the fastest growing oc- and Stemple, J. C. (2001). An evaluation of the e¤ects of
cupational injuries. Repetitive strain injury involves two treatment approaches for teachers with voice disorders:
decreased use of manual digits or voice and pain subse- A prospective randomized clinical trial. Journal of Speech-
quent to overuse. Attention to sensory di¤erentiation in Language-Hearing Research, 44, 286–296.
Shaw, G. Y., Searl, J. P., Young, J. L., and Miner, P. B.
the treatment of repetitive strain injury is motivated by (1996). Subjective, laryngoscopic, and acoustic measure-
reports of fused representation for groups of movements ments of laryngeal reflux before and after treatment with
in sensory cortex following extensive digit use (e.g., Byl, omeprazole. Journal of Voice, 10, 410–418.
Merzenich, and Jenkins, 1996). Smith, S., and Thyme, K. (1976). Statistic research on changes
in speech due to pedagogic treatment (the accent method).
—Katherine Verdolini Folia Phoniatrica, 28, 98–103.
Voice Therapy for Neurological Aging-Related Voice Disorders 91
Solomon, N. P., and Di Mattia, M. S. (2000). E¤ects of a vo- Voice Therapy for Neurological Aging-
cally fatiguing task and systemic hydration on phonation
threshold pressure. Journal of Voice, 14, 341–362. Related Voice Disorders
Stemple, J. C., Lee, L., D’Amico, and Pickup, B. (1994). E‰-
cacy of vocal function exercises as a method of improving
voice production. Journal of Voice, 8, 271–278.
Titze, I. R. (1994). Mechanical stress in phonation. Journal of Introduction
Voice, 8, 99–105.
Titze, I. R. (2002). Regulating glottal airflow in phonation: The neurobiological changes that a person undergoes
Application of the maximum power transfer theorem to a with advancing age produce structural and functional
low dimensional phonation model. Journal of the Acoustical changes in all of the organs and organ systems in the
Society of America, 111, 367–376. body. The upper respiratory system, the larynx, vocal
Verdolini, K. (2000). Case study: Resonant voice therapy. In tract, and oral cavity all reflect both normal and abnor-
J. Stemple (Ed.), Voice therapy: Clinical studies (2nd ed., mal changes that result from aging. In 1983, Ramig and
pp. 46–62). San Diego, CA: Singular Publishing Group. Ringel suggested that age-related changes of the voice
Verdolini, K., Druker, D. G., Palmer, P. M., and Samawi, H.
must be viewed as part of the normal process of physio-
(1998). Laryngeal adduction in resonant voice. Journal of
Voice, 12, 315–327. logical aging of the entire body (Ramig and Ringel,
Verdolini, K., Titze, I. R., and Fennell, A. (1994). Dependence 1983). Neurological, musculoskeletal, and circulatory
of phonatory e¤ort on hydration level. Journal of Speech remodeling account for changes in laryngeal function
and Hearing Research, 37, 1001–1007. and vocal output in older adults. These changes, how-
Verdolini-Marston, K., Burke, M. K., Lessac, A., Glaze, L., ever, do not necessarily result in abnormal voice quality.
and Caldwell, E. (1995). Preliminary study of two methods A thorough laryngological examination coupled with a
of treatment for laryngeal nodules. Journal of Voice, 9, complete voice assessment will likely reveal obvious
74–85. voice disorders associated with aging. It still remains
Verdolini-Marston, K., Sandage, M., and Titze, I. R. (1994). for the clinicians along with the help of the patient
E¤ect of hydration treatments on laryngeal nodules and
to identify and distinguish normal age-related voice
polyps and related voice measures. Journal of Voice, 8,
30–47. changes from voice disorders. This entry describes
Yamaguchi, H., Yotsukura, Y., Hirosaku, S., Watanabe, Y., neurological aging-related voice disorders and their
Hirose, H., Kobayashi, N., and Bless, D. (1993). Pushing treatment options. Traumatic or idiopathic vocal fold
exercise program to correct glottal incompetence. Journal of paralysis is described in another entry, as is Parkinson’s
Voice, 7, 250–256. disease. This article focuses on neurologically based
voice disorders associated with general aging.
Further Readings Voice production in the elderly is associated with
other bodily changes that occur with advancing age
Boone, D. R., McFarlane, S. C. (1994). The voice and voice (Chodzko-Zajko and Ringel, 1987), although changes in
therapy (5th ed.). Englewood Cli¤s, NJ: Prentice Hall.
Colton, R., and Casper, J. K. (1996). Understand voice prob-
specific organs may derive from various causes and
lems: A physiological perspective for diagnosis and treatment mechanisms. The e¤ects of normal aging are somewhat
(2nd ed.). Baltimore: Williams and Wilkins. similar across organ systems. Aging of the vocal organs,
Kotby, M. N. (1995). The Accent Method of voice therapy. San like other organ systems, is associated with decreased
Diego, CA: Singular Publishing Group. strength, accuracy, endurance, speed, coordination, or-
Stemple, J. C. (2000). Voice therapy: Clinical studies (2nd ed.). gan system interaction (i.e., larynx and respiratory sys-
San Diego, CA: Singular/Thompson Learning. tems), nerve conduction velocity, circulatory function,
Stemple, J. C., Glaze, L. E., and Gerdeman, R. K. (1995). and chemical degradation at synaptic junctions.
Clinical voice pathology: Theory and management (2nd ed.). Anatomical (Hirano, Kurita, and Yukizane, 1989;
San Diego, CA: Singular Publishing Group. Kahane, 1987) and histological studies (Luchsinger and
Arnold, 1965) clearly demonstrate that di¤erences in
Verdolini, K., and Krebs, D. (1999). Some considerations structure and function do exist as a result of aging. The
on the science of special challenges in voice training. In vocal fold epithelium, the layers of the lamina propria,
G. Nair, Voice tradition and technology: A state-of-the-art and the muscles of the larynx change with aging. The
studio (pp. 227–239). San Diego, CA: Singular Publishing vocal folds lose collaginous fibers, leading to increased
Group. sti¤ness.
Verdolini, K., Ostrem, J., DeVore, K., and McCoy, S. (1998). The neurological impact to the aging larynx includes
National Center for Voice and Speech’s guide to vocology. central and peripheral motor nervous system changes.
Iowa City, IA: National Center for Voice and Speech. Central nervous system changes include nerve cell losses
Verdolini, K., and Ramig, L. (2001). Review: Occupational in the cortex of the frontal, parietal, and temporal lobes
risks for voice problems. Journal of Logopedics, Phoniatrics,
and Vocology, 26, 37–46.
of the brain. This results in the slowing of motor move-
ments (Scheibel and Scheibel, 1975). Nerve conduction
velocity also contributes to speed of voluntary move-
ments such as pitch changes, increased loudness, and
speed of articulation (Leonard et al., 1997). Nervous
system changes are also associated with tremor, a
92 Part I: Voice
Table 1. Diagnoses of Subjects Age 65 and Older Seen at the function which result in patient perceived and listener
University of Pittsburgh Voice Center perceived vocal dysfunction. Indeed, if the neuromotor
Diagnosis N % systems are intact and the elderly patient is healthy, the
speaking and singing voice is not likely to be perceived
Vocal fold atrophy 46 23
as ‘‘old’’ nor function as ‘‘old’’ (McGlone and Hollien,
Vocal fold paralysis 39 19
Laryngopharyngeal reflux 32 16 1963). Conversely, the voice may be perceived as ‘‘old’’
Parkinson’s disease 26 13 not solely due to neurological changes in the larynx and
Essential tremor 8 4 upper airway, but due to muscular weakness of the
Other neurological disorders 16 8 upper body (Ramig and Ringel, 1983), cardiovascular
Muscular tension dysphonia—primary 15 7 changes (Orliko¤, 1990), or decreased hearing acuity
Muscular tension dysphonia—secondary 38 19 resulting in excessive vocal force (Chodzko-Zajko and
Edema 14 6 Ringel, 1987).
Spasmodic dysphonia 7 3 There are, however, certain aspects of voice produc-
N ¼ 205. The total number of diagnoses is larger since some tion that are characteristically associated with age-
patients had more than one diagnosis. related neuropathy. The clinical examination of elderly
individuals who complain of voice disorders should spe-
cifically address and test for loss of vocal range and
volume, vocal fatigue, increased breathy quality during
condition seen more in the elderly than in young indi- extended conversations, presence of tremor, and pitch
viduals. Finally, dopaminergic changes which decline breaks (especially breaks into falsetto and hoarse voice
with aging may also a¤ect the speed of motor processing quality). Elderly singers should be evaluated for pitch
(Morgan and Finch, 1988). inaccuracies, increased breathiness, and changes in vi-
The peripheral changes that occur in the elderly are brato (Tanaka, Hirano, and Chijina, 1994).
thought to be broadly related to environmental e¤ects of A careful examination of the elderly patient with a
trauma (Woo et al., 1992), selective denervation of type complaint about his or her voice consists of an extensive
II fast twitch muscle fibers (Lexell, 1997) and decrease history including medications, previous surgeries, and
in distal and motor neurons, resulting in decreased current and previously diagnosed diseases. Acoustic,
contractile strength and an increase in muscle fatigue perceptual, and physiological assessment of vocal func-
(Doherty and Brown, 1993). tion may reveal evidence of tremor, vocal volume defi-
ciencies, and/or vocal fatigue. Examination of the larynx
Voice Changes Related to Neurological Aging and vocal folds via flexible endoscopy as well as strobo-
The central and peripheral degeneration and con- videolaryngoscopy is essential to reveal vocal use pat-
comitant regenerative neural changes that occur with terns, asymmetrical vibration, scarring, tremor (of the
neurological aging may result in a number of voice dis- larynx or other structures), atrophy, or lesions. In the
orders. Excluding vocal fold paralysis, these neurological absence of suspected malignancies or frank aspiration
changes account for disorders of voice quality and over- due to lack of glottic closure, voice therapy is the treat-
all vocal output. Murry and Rosen reported on 205 ment of choice for most elderly patients with neurologi-
patients 65 years of age and older. Table 1 shows the cal aging-related dysphonias.
diagnosis of this group (Murry and Rosen, 1999).
The most common symptoms reported by this group Treatments for Neurological Aging-Related
of patients are shown in Table 2. Voice Disorders
Neurological changes to the voice accompanying
aging are related to decreased neurological structure and Treatments for elderly patients with mobile vocal folds
presenting with dysphonia include behavioral, pharma-
cological, and surgical approaches. A review of surgical
treatments can be found in Ford (1986), Koufman
Table 2. The Most Common Voice Symptoms Reported by (2000), Postma (1998), and Durson (1996). The use of
Patients 65 Years of Age and Older medications and their relationship to vocal production
Symptom % of Patients and vocal aging can be found in the work of Satalo¤ and
colleagues (1997) and Vogel (1995).
Loss of volume 28
Raspy or hoarse voice 24 Voice therapy for neurological aging-related voice
Vocal fatigue 22 disorders varies, depending on the patient’s complaints,
Di‰culty breathing during speech 18 diagnosis, and vocal use requirements. The most com-
Talking in noisy environments 15 mon needs of patients with neurological age-related
Loss of clarity 16 voice disorders are to increase loudness and endurance,
Tremor 7 to reduce hoarseness or breathy voice qualities, and to
Intermittent voice loss 6 maintain a broad pitch range for singing. These needs
Articulation-related problems 5 are met with vocal education including awareness of
Total exceeds 100% as some individuals reported more than vocal hygiene; direct vocal exercises; and management of
one complaint. the vocal environment. Prior to voice therapy, and as
Voice Therapy for Neurological Aging-Related Voice Disorders 93
part of the diagnostic process, a thorough audiological early postoperative period. It is not appropriate for
assessment of the patient should be done. If the patient treatment of vocal fold paralysis, conditions with in-
wears a hearing aid or aids, he or she should wear them complete glottal closure, or a scarred vocal fold.
for the therapy sessions. Resonant voice, or voice with forward focus, usually
refers to an easy voice associated with vibratory sensa-
Vocal Education tions in facial bones (Verdolini-Marston, Burke, and
Lessass, 1995). Therapy focuses on the production of
Vocal education coupled with vocal hygiene provides the this voice primarily through feeling and hearing. Exer-
patient with an understanding of the aging process as it
cises to place the vocal mechanism in a specific manner
relates to voice use. An understanding of how all body
coupled with humming help the patient identify to opti-
organ systems are a¤ected by normal aging helps to ex- mum pitch/placement for maximum voice quality. Res-
plain why the voice may not have the same quality, pitch
onant voice therapy is described as being useful in the
range, endurance, or loudness that was present in earlier
treatment of vocal fold lesions, functional voice prob-
years. Since the voice is the product of respiratory and
lems, mild vocal atrophy, and paralysis.
vocal tract functions, all of the systems that contribute
Manual circumlaryngeal massage (manual laryngeal
to the aging of these organs are responsible for the
musculoskeletal tension reduction) is a direct, hands-on
final vocal output. Recently, Murry and Rosen pub-
approach in which the clinician massages and manipu-
lished a vocal education and hygiene program for
lates the laryngeal area in a particular manner while
patients (Murry and Rosen, 2000). This program is
observing changes in voice quality as the patient pho-
an excellent guide for all aging patients with neurologi-
nates. The technique was first proposed by Aronson
cal voice disorders. Nursing homes, senior citizen resi- (1990) and later elaborated by Morrison and Rammage
dences, and geriatric specialists should consider o¤ering
(1993). Roy and colleagues reported on their use of
this outline as the first step in patient education when
the massage technique in controlled studies (Roy, Bless,
patients complain of voice disorders.
and Heisey, 1997). They reported almost normal voice
Direct Voice Therapy following a single session in 93% of 17 subjects with
hyperfunctional dysphonia.
Voice therapy is one treatment modality for almost General body massage in which muscles are kneaded
all types of neurological aging-related voice disorders. and manipulated is known to reduce muscle tensions.
The recent explosion of knowledge about the larynx is This concept is adapted to massage the muscles in the
matched by an equal growth of interest in its physiology, laryngeal area. One focus of the circumlaryngeal mas-
its disorders, and their treatment. Increased use of la- sage is to relieve the contraction of those muscles and
ryngeal imaging and knowledge of laryngeal physiology allow the larynx to lower. This technique is most often
have provided a base for behavioral therapy that is in- used with patients who report neck or upper body ten-
creasingly focused on the specific nature of the observed sion or sti¤ness, tenderness in the neck muscles, odyno-
pathophysiology. While treatment is designed to restore phonia, or those who demonstrate rigid postures. Vocal
maximum vocal function, the aging process of weakness, function exercises are designed to pinpoint and exercise
muscle wasting, and system endurance may not restore specific laryngeal muscles. The four steps address warm-
the voice to its youthful characteristics. Rather, the up of the muscles, stretching and contracting of muscles,
desired goals should be e¤ective vocal communication and building muscle power. The softness of the pro-
and forestalling continued vocal deterioration (Satalo¤ ductions is said to increase muscular and respiratory
et al., 1997). e¤ort and control. The exercises are hypothesized to
Specific techniques for the aging voice have evolved restrengthen and balance laryngeal musculature, to im-
from our understanding of the aging neuromuscular prove vocal fold flexibility and movement, and to re-
process. Confidential voice is the voice that one might balance airflow (Stemple, Glaze, and Gerdeman, 1995).
typically use to describe or discuss confidential matters. Ramig and colleagues developed a structured inten-
Theoretically, it is produced with minimal vocal fold sive therapy program, the Lee Silverman voice treat-
contact. The confidential voice technique is used to (1) ment program, of four sessions per week for 4 weeks
eliminate hyperfunctional and traumatic behaviors; (2) specifically for patients with idiopathic Parkinson’s dis-
allow lesions such as vocal nodules to heal in the absence ease (Ramig, Bonitati, and Lemke, 1994). Since then,
of continued pounding; (3) eliminate excessive muscular the e‰cacy of the treatment for this population has been
tension and vocal fatigue; (4) reset the internal volume extended to include aging patients and patients with
meter; and (5) force a heightened awareness of voice other forms of progressive neurological disease. This
use and the vocal environment. The goal is to create treatment method of voice therapy may be the most
healthier vocal folds and a neutral state from which promising of all for neurological aging-related voice
healthy voice use can be taught and developed through a disorders.
variety of other techniques (Verdolini-Marston, Burke, The Lee Silverman voice treatment program is based
and Lessass, 1995; Leddy, Samlan, and Poburka, 1997). on the principle that, to counteract the physical e¤ects of
The confidential voice technique is appropriately reduced amplitude of motor acts including voice and
used to treat benign lesions, muscle tension dysphonia, speech production, rigidity, bradykinesia, and reduction
hyperfunctional dysphonia, and vocal fatigue in the in respiratory e¤ort, it is necessary to push the entire
94 Part I: Voice
phonatory mechanism to exert greater e¤ort by focus- maintain voice or retard vocal weakness, fatigue, and
ing on loudness. To increase loudness, the respiratory loss of clear voice quality.
system must provide more driving power, and the The vocal environment should not be ignored as a
vocal folds must adduct more completely. Indeed, it factor in communication for patients with neurological
seems that the respiratory, the laryngeal, and the articu- aging-related voice disorders. Voice use is maximized in
latory mechanisms all benefit from the e¤ort to increase environments where background noise is minimal, sound
loudness. absorption materials such as rugs and cushions are used
The program is highly structured and involves five in large meeting rooms, and proper lighting is available
essential concepts: (1) voice is the focus; (2) a high degree to help with visual components of communication.
of e¤ort is required; (3) treatment is intensive; (4) the
patient’s self-perception must be calibrated; and (5) pro- Summary
ductions and outcomes are quantified. The scope of this Progressive neurological aging-related disorders o¤er a
article does not permit inclusion of the extensive litera- challenge to the speech-language pathologist. In the ab-
ture available on this method or an extensive description sence of surgery for vocal fold paralysis, the patient with
of the therapy protocol, which is available in published mobile vocal folds and a neurological disorder may
form. benefit from specific exercises to maintain vocal com-
The accent method, originally developed by Smith, munication. Diagnosis, which identifies the vocal use
has been used to treat all types of dysphonias (Smith and habits of the patient is critical to identify strategies and
Thyme, 1976). It has been adopted more widely abroad the specific exercises needed to maintain and/or improve
than in the United States and focuses on breathing as the voice production.
underlying control mechanism of vocal output and uses See also voice disorders of aging.
accentuated and rhythmic movements of the body and
then of voicing. Easy voice production with an open- —Thomas Murry
throat feeling is stressed, and attention is paid primarily
to an abdominal/diaphragmatic breathing pattern. This References
method is useful for treating those individuals with Aronson, A. E. (1990). Clinical voice disorders. New York:
vocal fatigue, endurance problems, or overall volume Thieme-Stratton.
weakness. Brody, H. (1992). The aging brain. Acta Neurologica Scanda-
All voice therapy is a directed way of changing a navica (Supplement), 137, 40–44.
particular behavior or set of behaviors. Regardless of the Chodzko-Zajko, W. J., and Ringel, R. L. (1987). Physiological
methods used, voice therapy demands the cooperation aspects of aging. Journal of Voice, 18–26.
of the patient in ways that may be novel and unusual. Doherty, T., and Brown, W. (1993). The estimated numbers
Voice therapy di¤ers from the medical approach, which and relative sizes of the nar motor units as selected by mul-
requires only that a pill be taken or an injection received. tiple point stimulation in young and older adults. Muscle
and Nerve, 16, 355–366.
It di¤ers from the surgical approach, wherein the sur- Durson, G., Satalo¤, R. T., and Speigel, J. R. (1996). Superior
geon does the work. It di¤ers from the work of voice and laryngeal nerve paresis and paralysis. Journal of Voice, 10,
acting coaches, who work to enhance and strengthen a 206–211.
normal voice. Voice clinicians work with individuals Ford, C. N. (1986). Histologic studies on the fate of soluble
who never thought about the voice until they acquired a collagen injected into canine vocal folds. Laryngoscope, 96,
voice disorder. They are primarily interested in rapid 1248–1257.
restoration of normal voice, a task that cannot always be Hirano, M., Kurita, S., and Yukizane, K. (1989). Asymmetry
accomplished. of the laryngeal framework: A morphologic study of ca-
daver larynges. Annals of Otology, Rhinology, and Laryn-
gology, 98, 135–140.
Vocal Tremor Hollien, H., and Shipp, T. (1972). Speaking fundamental fre-
quency and chronological age in males. Journal of Speech
One neurological aging-related disorder that often resists and Hearing Research, 15, 155–159.
change is vocal tremor. Tremor often accompanies many Kahane, J. C. (1987). Connective tissue changes in the larynx
voice disorders having a neurological component. Vocal and their e¤ects on voice. Journal of Voice, 1, 27–30.
tremor has been treated in the past with medications, Koufman, J. A., Postma, G. N., Cummins, M. M., and Bla-
and in some cases with laryngeal framework surgery, lock, P. D. (2000). Vocal fold paresis. Otolaryngology–Head
when vocal fold atrophy is also diagnosed. The specific and Neck Surgery, 122, 537–541.
therapeutic techniques presented in this article may also Leddy, M., Samlan, R., and Poburka, B. (1997). E¤ective
be helpful in reducing the perception of tremor especially treatments for hyperfunctional voice disorders. Advances for
those that focus on increasing vocal fold closure (i.e., Speech-Language Pathologists and Audiologists, 14, 18.
Leonard, C., Matsumoto, T., Diedrich, P., and McMillan, J.
Lee Silverman voice treatment and the Accent Method). (1997). Changes in neural modulation and motor control
Finally, the treatment of aging-related dysphonias during voluntary movements of older individuals. Journal of
should include techniques used in training singers Gerontology, Medical Sciences, 52A, M320–M325.
and actors (Satalo¤ et al., 1997). General physical con- Lexell, J. (1997). Evidence for nervous system degeneration
ditioning, warmup, increased respiratory function exer- with advancing age. Journal of Nutrition, 127, 1011S–
cises, and the ability to monitor voice change help to 1013S.
Voice Therapy for Professional Voice Users 95
Luchsinger, R., and Arnold, G. E. (1965). Voice, speech, lan- Vogel, D., and Carter, J. (1995). The e¤ects of drugs on com-
guage: Clinical communicology—Its physiology and pathol- munication disorders. San Diego, CA: Singular Publishing
ogy. Belmont, CA: Wadsworth. Group.
McGlone, R., and Hollien, H. (1963). Vocal pitch character- Woo, P., Casper, J., Colton, R., and Brewer, D. (1992). Dsy-
istics of aged women. Journal of Speech and Hearing Re- phonia in the aging: Physiology versus disease. Laryngo-
search, 6, 164–170. scope, 102, 139–144.
Morgan, D., and Finch, C. (1988). Dopaminergic changes in
the basal ganglia: A generalized phenomenon of aging in
mammals. Annuals of the New York Academy of Sciences,
Part III. Central Neuronal Alterations Related to Motor Voice Therapy for Professional
Behavior Control in Normal Aging: Basil Ganglia, 145–159. Voice Users
Morrison, M. D., and Ramage, L. A. (1993). Muscle misuse
voice disorders: Description and classification. Acta Otola-
ryngologica, 113, 428–434. A professional voice user is a person whose job function
Murry, T., Rosen, C. A. (1999). Vocal aging: Treatment critically depends on use of the voice. Not only singers
options (abstr.). In Proceedings of 2nd World Voice Con- and actors but teachers, lawyers, clergy, counselors, air
gress. São Paulo, Brazil.
Murry, T., and Rosen, C. A. (2000). Vocal education for the tra‰c controllers, telemarketers, firefighters, police, and
professional voice user and singer. In C. A. Rosen and auctioneers are among those who use their voices sig-
T. Murry (Eds.), Otolaryngologic Clinics of North America, nificantly in their line of work.
33, 967–982. Probably the preponderance of professional voice
Orliko¤, R. F. (1990). The relationship of age and cardio- users who seek treatment for voice problems have voice-
vascular health to certain acoustic characteristics of induced conditions. Typically, such conditions involve
male voices. Journal of Speech and Hearing Research, 33, either phonotrauma or functional problems. The full
450–457. range of non-use-related vocal pathologies may occur in
Postma, G. N., Blalock, P. D., and Koufman, J. A. (1998). professional voice users as well, at about the same rate
Bilateral medialization laryngoplasty. Laryngoscope, 108,
as in the population at large. However, special consid-
1429–1434.
Ramig, L., and Ringel, R. L. (1983). E¤ects of physiological erations may be required in therapy for professional
aging on selected acoustic characteristics of voice. Journal voice users because of their job demands.
of Speech and Hearing Research, 26, 22–30. The teaching profession is at highest risk for voice
Ramig, L. O., Bonitati, C. M., and Lemke, J. H. (1994). Voice problems. In 1999, teachers made up between 5% and
treatment for patients with Parkinson’s disease: Develop- 6% of the employed population in the United States.
ment of an approach and preliminary e‰cacy data. Journal At any given time, between one-fifth and one-half of
of Medical Speech-Language Pathology, 2, 191–209. teachers in the United States and elsewhere are experi-
Rosenburg, S., Malmgren, L. T., and Woo, P. (1989). Age- encing a voice problem (Sapir et al., 1993; Russell,
related change in the internal branch of the rat superior Oates, and Greenwood, 1998; E. Smith et al., 1998).
laryngeal nerve. Archives of Otolaryngology–Head and
Voice problems appear to occur at about the same rates
Roy, N., Bless, D. M., and Heisey, D. (1997). Manual circu- among singers. Other occupations at risk for voice
laryngeal therapy for functional dysphonia: An evaluation problems are lawyers, clergy, telemarketers, and possibly
of short- and long-term treatment outcomes. Journal of even counselors and social workers. Increasingly, pho-
Voice, 11, 321–331. notrauma is considered an occupational hazard in these
Satalo¤, R. T., Emerich, K. A., and Hoover, C. A. (1997). populations (Villkman, 2000).
Endocrine dysfunction. In R. T. Satalo¤ (Ed.), Professional A new occupational hazard for voice problems has
voice (pp. 291–299). San Diego, CA: Singular Publishing recently surfaced in the form of repetitive strain injury.
Group. This condition, one of the fastest growing occupational
Satalo¤, R. T., Rosen, D. C., Hawkshaw, M., and Spiegel, injuries in the United States in general, involves weak-
J. R. (1997). The aging adult voice. Journal of Voice, 11,
ness and pain from somatic overuse. Symptoms of re-
156–160.
Scheibel, M., and Scheibel, A. (1975). Structural changes in the petitive strain injury typically begin in the fingers after
aging brain. In H. Brody (Ed.), Clinical morphological and keyboard use. However, laryngeal symptoms may de-
neuromechanical aspects of the aging nervous system. New velop if the individual replaces the keyboard with voice
York: Raven Press. recognition software.
Smith, S., and Thyme, K. (1976). Statistic research on changes The consequences of voice problems for professionals
in speech due to pedagogic treatment (the accent method). are not trivial and may include temporary or permanent
Folia Phoniatrica, 28, 98–103. loss of work. Conservative estimates of costs associated
Stemple, J. C., Glaze, L. E., and Gerdeman, B. K. (1995). with voice problems in teachers alone are on the order of
Clinical voice pathology (2nd ed.). San Diego, CA: Singular $2 billion annually in the United States (Verdolini and
Publishing Group.
Ramig, 2001). Thus, voice problems can be devastating
Tanaka, S., Hirano, M., and Chijina, K. (1994). Some aspects
of vocal fold bowing. Annals of Otology, Rhinology, and both occupationally and personally to many professional
Laryngology, 103, 357–362. voice users.
Verdolini-Marston, K., Burke, M. K., and Lessas, A. (1995). A The goal of treatment for professional voice users is
preliminary study on two methods of treatment for laryn- to restore the best possible voice use—and, where rele-
geal nodules. Journal of Voice, 9, 74–85. vant, anatomy and physical function—relative to the job
96 Part I: Voice
in question. Vocal hygiene, including hydration and However, exceptions exist. Also, theatre and increas-
reflux control, plays a role in most treatment programs ingly singing training and voice therapy incorporate
for professional voice users (see vocal hygiene). Surgi- general body work (alignment, movement) as a central
cal management may be appropriate in selected cases. part of training.
However, for most professional voice users, the mainstay In respect to training modalities, traditional speech-
of intervention for voice problems is behavioral work on language pathology models tend to be more analytical
voice production, or voice therapy. and less experiential than typical performing arts models
Traditional therapy for phonotrauma in professional of training. The motor learning literature indicates that
groups that use the voice quantitatively (e.g., teachers, the performers may be right. The literature describes a
clergy, attorneys), that is, over an extended period of critical dependence of motor learning on sensory pro-
time or at sustained loudness, has emphasized voice cessing and deemphasizes mechanical instruction (Ver-
conservation. In this approach, however, individuals are dolini, 1997; see also Wulf, Höß, and Prinz, 1998). The
limited at least as much by the treatment as by the dis- motor learning literature also clearly indicates the need
ease. The current emphasis is on training individuals to for special attention to transfer in training. Skills ac-
meet their voice needs while they recover from existing quired in a clinic or studio may transfer poorly to
problems, and to prevent new ones. An intermediate untrained stimuli in untrained environments if less spe-
vocal fold configuration, involving slight separation of cific transfer exercises are used. Biofeedback may be a
the vocal processes during phonation, appears relevant useful adjunct to voice therapy and training; however,
to this goal (Berry et al., 2001). A variety of training cautions exist. Terminal biofeedback, provided after
methods are available for this approach to vocalization, the completion of performance, contributes to greater
including Lessac-Madsen resonant voice therapy (Ver- learning than on-line feedback, which occurs during on-
dolini, 2000), Vocal Function Exercises (Stemple et al., going performance (Armstrong, 1970, cited in Schmidt
1994), the Accent Method (S. Smith and Thyme, 1976), and Lee, 1999, pp. 316–317). Also, systematic fading of
and flow mode therapy (see, e.g., Gau‰n and Sundberg, biofeedback support appears critical for transfer.
1989). Training in this general laryngeal configuration The voice therapist may need to address special chal-
appears to be more e¤ective than vocal hygiene in- lenges in the physical and political environments of per-
tervention alone and more e¤ective than intensive res- formers. Stage environments can be frankly toxic, and
piratory training in reducing self-reported functional compromising to vocal and overall physical health. Spe-
problems due to voice in at least one class of professional cific noxious substances that have been measured on
voice users, teachers (Roy et al., 2001). stages include aromatic diisocyanates, Penicillium fre-
Therapy for individuals with qualitative both quali- quentans and formaldehyde in cork granulate, cobalt
tative voice needs recognizes that a special sound of and aluminum (pigment components), and alveolar-size
the voice is required occupationally. Therapy for per- quartz sand (Richter et al., 2002). Open-air performing
formers—singers and actors—with voice problems is environments can present particular vocal challenges to
conceptually challenging, for many reasons. Vocal per- performers, especially if these are unmiked. Heavy cos-
formers have exacting voice needs, which may be com- tumes weighing 80–90 lb or more and unusual, con-
plicated by pathology; the voice training of singers and torted postures required during vocal performance may
actors is not standardized; few scientific studies on add further challenges and may even contribute to
training e‰cacy exist; and performers are subject to a injury.
suite of special personality, career, and lifestyle issues. Politically, performers may find themselves contrac-
All of these factors make many speech-language practi- tually linked to heavy performance schedules without
tioners feel that a specialty focus on vocology is impor- the possibility of rest if they are ill or vocally indisposed.
tant in working with performing artists. Performers are threatened with loss of income, loss of
Voice therapy for performers often replicates voice health care benefits, and loss of professional reputation if
pedagogy methods. The primary di¤erences are an em- they refuse to perform when they should not. Another
phasis on injury reduction and a shorter-term interven- political issue has to do with directors’ drive toward
tion, with specific, measurable goals, in voice therapy. meeting commercial goals. Such goals may dictate vocal
The most comprehensive technical framework for pro- practices that are at odds with performers’ best interest.
fessional voice training in general has been proposed by Directors and producers may sometimes show little con-
Estill (2000). The system identifies 11 or 12 physical cern for performers’ vocal health, because numerous
‘‘degrees of freedom,’’ such as voice onset type, false vocalists are available to replace injured ones who are
vocal fold position, laryngeal height, palatal position, unwilling or unable to perform.
and aryepiglottic space, that are independently varied to It is probably safe to say that individuals who are
create ‘‘recipes’’ for a variety of sung and spoken voice drawn to vocal performance are more extroverted, and
qualities. Research conducted thus far has corroborated more emotionally variable, on average, than many indi-
some aspects of the approach (e.g., Titze, 2002). The viduals in the population at large. The vocal practitioner
system recently has gained currency in voice therapy as should be comfortable dealing with performers’ individ-
well as vocal pedagogy. Voice training for acting tends ual personal styles. Moreover, mental attitude toward
to be less technically oriented and more ‘‘meaning performance plays a central role in the performing do-
driven’’ than singing training (e.g. Linklater, 1997). main. The principles of sports psychology fully apply to
Voice Therapy for Professional Voice Users 97
the performing arts. A robust finding is that intermediate Linklater, K. (1997). Thoughts on theatre, therapy and the art
anxiety levels, as opposed to low or high anxiety, tend to of voice. In M. Hampton and B. Acker (Eds.), The vocal
maximize physical performance. Performers need to find vision (pp. 3–12). New York: Applause.
ways to establish intermediate arousal states and stay Richter, B., Löhle, E., Knapp, B., Weikert, M., Schlömicher-
Their, J., and Verdolini, K. (2002). Harmful substances on
there even in high-stress situations. Also, the direction of
the opera stage: Possible negative e¤ects on singers’ respi-
attention appears key for distinguishing ‘‘chokers’’ ratory tract. Journal of Voice, 16, 72–80.
(people who tend to perform poorly under pressure) Roy, N., Gray, S. D., Simon, M., Dove, H., Corbin-Lewis, K.,
from persons who perform well under high stress. and Stemple, J. C. (2001). An evaluation of the e¤ects of
According to some reports, chokers tend to show a pre- two treatment approaches for teachers with voice disorders:
dominance of left hemisphere activation when under the A prospective randomized clinical trial. Journal of Speech-
gun, implying verbal analytic thinking and evaluative Language-Hearing Research, 44, 286–296.
self-awareness. High-level performers tend to show more Russell, A., Oates, J., and Greenwood, K. M. (1998). Prevalence
distributed brain activation, including right-hemisphere of voice problems in teachers. Journal of Voice, 12, 467–479.
activity consistent with imagery and target awareness Sapir, S., Keidar, A., and Marthers-Schmidt, B. (1993). Vocal
attrition in teachers: Survey findings. European Journal of
(Crews, 2001). Many other findings from the sports psy-
Disorders of Communication, 27, 129–135.
chology literature are applicable to attitude issues in Schmidt, R. A., and Lee, T. D. (1999). Motor control and
vocal performance. learning: A behavioral emphasis (3rd ed.). Champaign, IL:
Vocal performers may have erratic lifestyles that are Human Kinetics.
linked to their jobs. Touring groups literally may live on Smith, E., Lemke, J., Taylor, M., Kirchner, H. L., and Ho¤man,
buses. Exercise and fresh air may be restricted. Daily H. (1998). Frequency of voice problems among teachers and
routines may be nonexistent. Pay may be poor and spo- other occupations. Journal of Voice, 12, 480–499.
radic. Benefits often are not provided unless the per- Smith, S., and Thyme, K. (1976). Statistic research on changes
formers belong to a union. Vocal performers with voice in speech due to pedagogic treatment (the accent method).
problems often cannot pay for treatment because their Folia Phoniatrica, 28, 98–103.
Stemple, J. C., Lee, L., D’Amico, B., and Pickup, B. (1994).
voice problems lead to lack of employment and thus lack
E‰cacy of vocal function exercises as a method of improv-
of income and benefits. Clinics wishing to work with ing voice production. Journal of Voice, 83, 271–278.
professional voice users should be equipped to provide Titze, I. R. (2002). Regulating glottal airflow in phonation:
some form of fiscal support for treatment. Application of the maximum power transfer theorem to a
Practitioners working with vocal performers agree low dimensional phonation model. Journal of the Acoustical
that no single individual can fully assist a vocalist with Society of America, 111(1 Pt 1), 367–376.
voice problems. Rather, convergent e¤orts are required Verdolini, K. (1997). Principles of skill acquisition: Implica-
across specialities, to minimally include an otolaryn- tions for voice training. In M. Hampton and B. Acker
gologist, speech-language pathologist, voice teacher or (Eds.), The vocal vision: Views on voice (pp. 65–80). New
coach, and, patient. Di¤erent individuals take the lead, York: Applause.
Verdolini, K. (2000). Case study: Resonant voice therapy.
depending on the issues at hand. The physician is re-
In J. Stemple (Ed.), Voice therapy: Clinical studies (2nd ed.,
sponsible for medical issues. The speech-language pa- pp. 46–62). San Diego, CA: Singular Publishing Group.
thologist and voice teacher generally work together on Verdolini, K., and Ramig, L. O. (2001). Review: Occupational
technical issues. The voice teacher is the most appropri- risks for voice problems. Logopedics, Phoniatrics, Vocology,
ate person to address career issues with the performer, 26, 37–46.
particularly issues that bear on a potential mismatch be- Villkman, E. (2000). Voice problems at work: A challenge for
tween the individual’s aspirations and capabilities. The occupational safety and health arrangement. Folia Pho-
importance of communication across individuals within niatrica et Logopaedica, 52, 120–125.
the team cannot be overemphasized. Wulf, G., Höß, M., and Prinz, W. (1998). Instructions for
motor learning: Di¤erential e¤ects of internal versus ex-
—Katherine Verdolini ternal focus of attention. Journal of Motor Behavior, 30,
169–179.
References
Further Readings
Berry, D. A., Verdolini, K., Montequin, D. W., Hess, M. M.,
Chan, R. W., and Titze, I. R. (2001). A quantitative output- DeVore, K., Verdolini, K. (1998). Professional speaking voice
cost ratio in voice production. Journal of Speech-Language- training and applications to speech-language pathology.
Hearing Research, 44, 29–37. Current Opinion in Otolaryngology–Head and Neck Sur-
Crews, D. (2001). First prize: Putting under stress. Golf, 43, gery, 6, 145–150.
94–96. Hampton, M., and Acker, B. (Eds.), The vocal vision: Views on
Estill, J. (2000). Level one primer of basic figures. Santa Rosa, voice. New York: Applause.
CA: Estill Voice Training Systems. Lessac, A. (1996). The use and training of the human voice: A
Gau‰n, J., and Sundberg, J. (1989). Spectral correlates of practical approach to speech and voice dynamics (3rd ed.).
glottal voice source waveform characteristics. Journal of Mountain View, CA: Mayfield.
Speech and Hearing Research, 32, 556–565. Linklater, K. (1976). Freeing the natural voice. New York:
Lessac, A. (1997). The use and training of the human voice: Drama Book Specialists.
Bio-dynamic approach to vocal life. Mountain View, CA: Machlin, E. (1992). Speech for the stage (rev. ed.). New York:
Mayfield. Theatre Arts Books.
98 Part I: Voice
Maisel, E. (1974). The Alexander technique: The essential writings Satalo¤, R. T. (1997). Professional voice: The science and art of
of F. Matthias Alexander. London: Thames and Hudson. clinical care (2nd ed.). San Diego, CA: Singular Publishing
Miller, R. (1977). English, French, German and Italian tech- Group.
niques of singing: A study in national tonal preferences and Skinner, E., Monich, T., and Mansell, L. (1990). Speak with
how they relate to functional e‰ciency. Metuchen, NJ: distinction (rev. ed.). New York: Applause.
Scarecrow Press. Sundberg, J. (1977, March). The acoustics of the singing voice.
Nair, G. (1999). Voice tradition and technology: A state-of-the- Scientific American, 236, 82–91.
art studio. San Diego, CA: Singular Publishing Group. Sundberg, J. (1987). The science of the singing voice. DeKalb,
Orlick, T. (2000). In pursuit of excellence. Champaign, IL: IL: Northern Illinois University Press.
Human Kinetics. Thurman, L., and Welch, G. (1997). Body mind and voice:
Raphael, B. N. (1997). Special considerations relating to Foundations of voice education. Collegeville, MN: Voice-
members of the acting profession. In R. T. Satalo¤ (Ed.), Care Network.
Professional voice: The science and art of clinical care (2nd ed., Verdolini, K., Ostrem, J., DeVore, K., and McCoy, S.
pp. 203–205). San Diego, CA: Singular Publishing Group. (1998). National Center for Voice and Speech’s guide to
Rodenburg, P. (1997). The actor speaks. London: Methuen vocology. Iowa City, IA: National Center for Voice and
Drama. Speech.
Schmidt, R. A., and Lee, T. D. (1999). Motor control and Verdolini, K., and Ramig, L. O. (2001). Review: Occupational
learning: A behavioral emphasis (3rd ed.). Champaign, IL: risks for voice problems. Logopedics, Phoniatrics, Vocology,
Human Kinetics. 26, 37–46.
Part II: Speech
Apraxia of Speech: Nature and cent presented with a seizure disorder and 16% had a
diagnosis of degenerative disease, including Creutzfeldt-
Phenomenology Jakob disease and leukoencephalopathy (of the
remaining, 9% were unspecified, 4% were associated
Apraxia of speech is with dementia, and 3% were associated with primary
progressive aphasia). In 15% of cases the AOS was
a phonetic-motoric disorder of speech production caused by traumatically induced (12% neurosurgically and 3%
ine‰ciencies in the translation of a well-formed and filled concomitant with closed head injury), and in the
phonologic frame to previously learned kinematic parameters remaining cases the cause was undetermined or was of
assembled for carrying out the intended movement, resulting in
intra- and inter-articulator temporal and spatial segmental and
mixed etiology. Without doubt, these proportions are
prosodic distortions. It is characterized by distortions of seg- influenced by the type of patients typically seen at the
ments, intersegment transitionalization resulting in extended Mayo Clinic, and may not be representative of other
durations of consonants, vowels and time between sounds, syl- patient care sites.
lables and words. These distortions are often perceived as Among all of the acquired speech and language
sound substitutions and the misassignment of stress and other pathologies of neurological origin, AOS may be the
phrasal and sentence-level prosodic abnormalities. Errors are most infrequent. Its occurrence unaccompanied by
relatively consistent in location within the utterance and in- dysarthria, aphasia, limb apraxia, or oral-nonspeech
variable in type. It is not attributable to deficits of muscle tone apraxia is extremely rare. Comorbidity estimates
or reflexes, nor to deficits in the processing of auditory, tactile, averaged across studies and summarized by McNeil,
kinesthetic, proprioceptive, or language information. (McNeil,
Robin, and Schmidt, 1997, p. 329)
Doyle, and Wambaugh (2000) indicated an AOS/oral-
nonspeech apraxia comorbidity of 68%, an AOS/limb
The kernel perceptual behaviors that di¤erentiate apraxia comorbidity of 67%, an AOS/limb apraxia and
apraxia of speech (AOS) from other motor speech dis- oral-nonspeech apraxia comorbidity of 83%, an AOS/
orders and from phonological paraphasia are (1) length- aphasia comorbidity of 81%, and an AOS/dysarthria
ened segment (slow movements) and intersegment comorbidity of 31%. Its frequent co-occurrence with
(segment segregation) durations (overall slowed speech), other disorders and its frequent diagnostic confusion
resulting in (2) abnormal prosody across multisyllable with those disorders that share surface features with it
words and phrases, with a tendency to make errors on suggest that the occurrence of AOS in isolation (pure
more stressed than unstressed syllables; (3) relatively AOS) is extremely rare.
consistent trial-to-trial location of errors and relatively The lesion responsible for AOS has been studied since
nonvariable error types; and (4) impaired measures of Darley (1968) and Darley, Aronson, and Brown (1975)
coarticulation. Although apraxic speakers may produce proposed it as a neurogenic speech pathology that is
a preponderance of sound substitutions, these sub- theoretically and clinically di¤erent from aphasia and
stitutions do not serve as evidence of either AOS or the dysarthrias. Because Darley defined AOS as a disor-
phonemic paraphasia. Sound distortions serve as evi- der of motor programming, the responsible lesion has
dence of a motor-level mechanism or influence in the been sought in the motor circuitry, especially in Broca’s
absence of an anatomical explanation; however, they area. Luria (1966) proposed that the frontal lobe mech-
are not localizable to one part of the motor control ar- anisms for storing and accessing motor plans or pro-
chitecture and, taken alone, do not di¤erentiate AOS grams for limb gestures or for speech segments were
from the dysarthrias. Acoustically well-produced (non- represented in Broca’s area. He also proposed the facial
distorted) sound-level serial order (e.g., perseverative, region of the postcentral gyrus in the parietal lobe as a
anticipatory, and exchange) errors that cross word critical area governing coordinated movement between
boundaries are not compatible with motor planning- or gestures (speech or nonspeech). AOS-producing lesions
programming-generated mechanisms and are attribut- subtending Broca’s area (Mohr et al., 1978) as well as
able to the phonological encoding mechanism. those in the postcentral gyrus (Square, Darley, and
Although this motor speech disorder has a languor- Sommers, 1982; Marquardt and Sussman, 1984; McNeil
ous and tortuous theoretical and clinical history and is et al., 1990) have received support. Retrospective studies
frequently confused with other motor speech disorders of admittedly poorly defined and poorly described per-
and with phonemic paraphasia, a first-pass estimate of sons purported to have AOS (e.g., Kertesz, 1984) do not
some of its epidemiological characteristics has been pre- show a single site or common cluster of lesion sites re-
sented by McNeil, Doyle, and Wambaugh (2000). sponsible for the disorder. Prospective studies of the
Based on retrospective analysis of the records of AOS-producing lesion have been undertaken by a num-
3417 individuals evaluated at the Mayo Clinic for ber of investigators. Deutsch (1984) was perhaps the first
acquired neurogenic communication disorders, including to conduct a prospective search, with a result that set the
dysarthria, AOS, aphasia, and other neurogenic speech, stage for most of the rest of the results to follow. He
language, and cognitive disorders, Du¤y (1995) reported found that 50% of his AOS subjects (N ¼ 18) had a
a 4.6% prevalence of AOS. Based on this same retro- lesion in the frontal lobe and 50% had posterior le-
spective analysis of 107 patient records indicating a di- sions. Marquardt and Sussman’s (1984) prospective
agnosis of AOS, Du¤y reported that 58% had a vascular study of 12 subjects with AOS also failed to reveal a
etiology and 6% presented with a neoplasm. One per- consistent relationship among lesion location (cortical
102 Part II: Speech
versus subcortical, anterior versus posterior), lesion vol- smaller verbal motor patterns) by an indirect route.
ume, and the presence or absence of AOS. Dronkers Speech produced by normal speakers for infrequently
(1997) reported that 100% of 25 individuals with AOS occurring syllables, using the indirect route, are said to
had a discrete left hemispheric cortical lesion in the pre- share many of the core features of apraxic speakers, such
central gyrus of the insula. One hundred percent of a as (1) articulatory prolongation, (2) syllable segregation,
control group of 19 individuals with left hemispheric (3) inability to increase the speech rate and maintain
lesions in the same arterial distribution as the AOS sub- articulatory integrity, and (4) reduced coarticulation.
jects but without the presence of AOS were reported to AOS is therefore proposed to be a deficit of the direct
have had a complete sparing of this specific region of the encoding route, with a reliance on the indirect encoding
insula. McNeil et al. (1990) reported computed tomo- route.
graphic lesion data from four individuals with AOS un- Based on experimental evidence that the phonologi-
accompanied by other neurogenic speech or language cal similarity e¤ect should not be present in persons
pathologies. The only common lesion site for these with AOS, Rogers and Storkel (1998, 1999) hypothe-
‘‘pure’’ apraxic speakers was in the facial region of the sized a reduced bu¤er capacity as the mechanism re-
left postcentral gyrus. Two of the four AOS subjects had sponsible for AOS. In this account, the apraxic speaker
involvement of the insula, while two of three subjects with a reduced bu¤er capacity is required to reload or
with phonemic paraphasia (diagnosed with conduction reprogram the appropriate (unspecified) bu¤er in a
aphasia) had a lesion in the insula. Two of the four sub- feature-by-feature, sound-by-sound, syllable-by-syllable,
jects with AOS and one of the three subjects with con- or motor-control-variable-by-motor-control-variable
duction aphasia evinced involvement of Broca’s area. fashion. This requirement would give rise to essentially
The unambiguous results of the Dronkers study not- the same observable features of AOS as those commonly
withstanding, the lesions responsible for AOS remain used to define the entity and consistent with the observ-
open to study. It is clear, however, that the major able features discussed earlier.
anterior/posterior divisions common to aphasiology and Van der Merwe (1997) proposed a model of sensori-
traditional neurology as sites responsible for nonfluent/ motor speech disorders in which AOS is defined as a
fluent (respectively) disorders of speech production are disorder of motor planning. Critical to this view is the
challenged by the AOS/lesion data that are available separation of motor plans from motor programs. In this
to date. model, motor plans carry information (e.g., lip round-
ing, jaw depression, glottal closure, raising or lowering
Theoretical Accounts of the tongue tip, interarticulator phasing/coarticulation)
that is articulator-specific, not muscle-specific. Motor
The study of and clinical approach to AOS operate plans are derived from specific speech sounds and specify
under a scientific paradigm generally consistent with the the spatial and temporal goals of the planned unit.
mechanisms ascribed to apraxia. That is, the majority of Motor programs, on the other hand, specify the move-
practitioners view AOS as a disorder of previously ment parameters (e.g., muscle tone, direction, force,
learned movements that is di¤erent from other speech range, and rate of movement) of specific muscles or
movement disorders (i.e., the dysarthrias). The diagnosis muscle groups. For Van der Merwe, disorders of motor
can be confidently applied when assurance can be programs result in the dysarthrias and cannot account
obtained that the person has the cognitive or linguistic for the di¤erent set of physiological and behavioral signs
knowledge underlying the intended movement and the of AOS. The attributes ascribed to motor plans in this
fundamental structural and sensorimotor abilities to model are consistent with the array of cardinal behav-
carry out the movement. Additionally, most definitions ioral features of AOS.
of apraxia suggest an impairment of movements carried Though their view is expanded from the traditional
out volitionally but executed successfully when per- view of AOS as simply a disorder of motor program-
formed automatically. The diagnosis requires that ming, McNeil, Doyle, and Wambaugh (2000) argue that
patients display the ability to process the language un- a combined motor planning and motor programming
derlying the movement. These criteria are generally impairment as specified by Van der Merwe (1997) is
consistent with those used for the identification of other required to account for the array of well-established
apraxias, including oral nonspeech (buccofacial), writing perceptual, acoustic, and physiological features.
(agraphic), and limb apraxia. Other theoretical accounts of AOS include the over-
Although AOS is predominantly viewed as a disorder specification of phonological representations theory of
of motor programming (Wertz, LaPointe, and Rosen- Dogil, Mayer, and Vollmer (1994) and the coalitional/
bek, 1984), derived from its historical roots based in dynamical systems breakdown theory of Kelso and
other apraxias (particularly limb-kinetic apraxia; Tuller (1981). These accounts have received consider-
McNeil, Doyle, and Wambaugh, 2000), there are com- ably less examination in the literature and will not be
peting theories. Whiteside and Varley (1998) proposed a described here.
deficit of the direct phonetic encoding route to account AOS is an infrequently occurring pathology that is
for AOS. In this theory, normal speech production in- clinically recognized by most professionals dedicated to
volves the retrieval from storage of verbal motor pat- the management of speech production disorders. It is
terns for frequently used syllables (the direct route), or classified as a motor speech disorder in the scientific
the patterns are calculated anew (presumably from literature. When it occurs, it is frequently accompanied
Apraxia of Speech: Nature and Phenomenology 103
by other speech, language, and apraxic disorders. Its Rogers, M. A., and Storkel, H. (1998). Reprogramming pho-
defining features are not widely agreed upon; however, nologically similar utterances: The role of phonetic features
evidence from perceptual, acoustic, kinematic, aerody- in pre-motor encoding. Journal of Speech and Hearing Re-
namic, and electromyographic studies, informed by research, 41, 258–274.
Rogers, M. A., and Storkel, H. L. (1999). Planning speech one
cent models of speech motor control and phonological
syllable at a time: The reduced bu¤er capacity hypothesis in
encoding, have led to clearer criteria for subject/patient apraxia of speech. Aphasiology, 13, 793–805.
selection and a resurgence of interest in its proposed Square, P. A., Darley, F. L., and Sommers, R. I. (1982). An
mechanisms. The lesions responsible for acquired AOS analysis of the productive errors made by pure apractic
remain a matter for future study. speakers with di¤ering loci of lesions. Clinical Aphasiology,
See also apraxia of speech: treatment; devel- 10, 245–250.
opmental apraxia of speech. Van der Merwe, A. (1997). A theoretical framework for the
characterization of pathological speech sensorimotor con-
—Malcolm R. McNeil and Patrick J. Doyle trol. In M. R. McNeil (Ed.), Clinical management of sen-
sorimotor speech disorders (pp. 1–25). New York: Thieme.
References Wertz, R. T., LaPointe, L. L., and Rosenbek, J. C. (1984).
Apraxia of speech in adults: The disorder and its manage-
Darley, F. L. (1968). Apraxia of speech: 107 years of termino- ment. Orlando, FL: Grune and Stratton.
logical confusion. Paper presented at the American Speech Whiteside, S. P., and Varley, R. A. (1998). A reconceptualisa-
and Hearing Association Convention, Denver. tion of apraxia of speech: A synthesis of evidence. Cortex,
Darley, F. L., Aronson, A. E., and Brown, J. (1975). Motor 34, 221–231.
speech disorders. Philadelphia: Saunders.
Deutsch, S. (1984). Prediction of site of lesion from speech Further Readings
apraxic error patterns. In J. C. Rosenbek, M. R. McNeil,
and A. E. Aronson (Eds.), Apraxia of speech: Physiology, Ballard, K. J., Granier, J. P., and Robin, D. A. (2000). Un-
acoustics, linguistics, management (pp. 113–134). San Di- derstanding the nature of apraxia of speech: Theory, analy-
ego, CA: College-Hill Press. sis and treatment. Aphasiology, 14, 969–995.
Dogil, G., Mayer, J., and Vollmer, K. (1994). A representa- Code, C. (1998). Models, theories and heuristics in apraxia of
tional account for apraxia of speech. Paper presented at the speech. Clinical Linguistics and Phonetics, 12, 47–65.
Fourth Symposium of the International Clinical Phonetics Croot, K. (2001). Integrating the investigation of apraxic,
and Linguistics Association, New Orleans, LA. aphasic and articulatory disorders in speech production:
Dronkers, N. F. (1997). A new brain region for coordinating A move towards sound theory. Aphasiology, 15, 58–62.
speech coordination. Nature, 384, 159–161. Dogil, G., and Mayer, J. (1998). Selective phonological im-
Du¤y, J. R. (1995). Motor speech disorders: Substrates, di¤er- pairment: A case of apraxia of speech. Phonology, 15,
ential diagnosis and management. St. Louis: Mosby. 143–188.
Kelso, J. A. S., and Tuller, B. (1981). Toward a theory of Geshwind, N. (1975). The apraxias: Neural mechanisms of
apractic syndromes. Brain and Language, 12, 224–245. disorders of learned movement. American Scientist, 63,
Kertesz, A. (1984). Subcortical lesions and verbal apraxia. 188–195.
In J. C. Rosenbek, M. R. McNeil, and A. E. Aronson Goodglass, H. (1992). Diagnosis of conduction aphasia. In
(Eds.), Apraxia of speech: Physiology, acoustics, linguistics, S. E. Kohn (Ed.), Conduction aphasia (pp. 3–50). Hillsdale,
management (pp. 73–90). San Diego, CA: College-Hill NJ: Erlbaum.
Press. Keele, S. W., Cohen, A., and Ivry, R. (1990). Motor programs:
Luria, A. R. (1966). Human brain and psychological processes. Concepts and issues. In M. Jeannerod (Ed.), Attention
New York: Harper. and performance: XIII. Motor representation and control
Marquardt, T., and Sussman, H. (1984). The elusive lesion- (pp. 77–110). Hillsdale, NJ: Erlbaum.
apraxia of speech link in Broca’s aphasia. In J. C. Rosen- Kent, R. D., and McNeil, M. R. (1987). Relative timing of
bek, M. R. McNeil, and A. E. Aronson (Eds.), Apraxia sentence repetition in apraxia of speech and conduction
of speech: Physiology, acoustics, linguistics, management aphasia. In J. H. Ryalls (Ed.), Phonetic approaches to
(pp. 91–112). San Diego, CA: College-Hill Press. speech production in aphasia and related disorders (pp.
McNeil, M. R., Doyle, P. J., and Wambaugh, J. (2000). 181–220). Boston: College-Hill Press.
Apraxia of speech: A treatable disorder of motor planning Kent, R. D., and Rosenbek, J. C. (1983). Acoustic patterns of
and programming. In S. E. Nadeau, L. J. Gonzalez Rothi, apraxia of speech. Journal of Speech and Hearing Research,
and B. Crosson (Eds.), Aphasia and language: Theory to 26, 231–249.
practice (pp. 221–266). New York: Guilford Press. Lebrun, Y. (1990). Apraxia of speech: A critical review. Neu-
McNeil, M. R., Robin, D. A., and Schmidt, R. A. (1997). rolinguistics, 5, 379–406.
Apraxia of speech: Definition, di¤erentiation, and treat- Martin, A. D. (1974). Some objections to the term apraxia of
ment. In M. R. McNeil (Ed.), Clinical management of sen- speech. Journal of Speech and Hearing Disorders, 39, 53–
sorimotor speech disorders (pp. 311–344). New York: 64.
Thieme. McNeil, M. R., and Kent, R. D. (1990). Motoric character-
McNeil, M. R., Weismer, G., Adams, S., and Mulligan, M. istics of adult aphasic and apraxic speakers. In G. E. Ham-
(1990). Oral structure nonspeech motor control in normal mond (Ed.), Cerebral control of speech and limb movements
dysarthric aphasic and apraxic speakers: Isometric force (pp. 349–386). Amsterdam: North-Holland.
and static position control. Journal of Speech and Hearing Mlcoch, A. G., and Noll, J. D. (1980). Speech production
Research, 33, 255–268. models as related to the concept of apraxia of speech. In
Mohr, J. P., Pessin, M. S., Finkelstein, S., Funkenstein, H., N. J. Lass (Ed.), Speech and language: Advances in basic
Duncan, G. W., and Davis, K. R. (1978). Broca aphasia: research and practice (vol. 4, pp. 201–238). New York:
Pathological and clinical. Neurology, 28, 311–324. Academic Press.
104 Part II: Speech
Miller, N. (1986). Dyspraxia and its management. Rockville, phasing of the articulators at the segmental and syllable
MD: Aspen. levels, and (2) segmental sequencing of longer speech
Miller, N. (2001). Dual or duel route? Aphasiology, 15, 62–68. units (Square-Storer and Hayden, 1989).
Rogers, M. A., and Spencer, K. A. (2001). Spoken word pro- More recently, arguments supporting the application
duction without assembly: Is it possible? Aphasiology, 15,
of motor learning principles (Schmidt, 1988) for the
68–74.
Rosenbek, J. C. (2001). Darley and apraxia of speech in adults. purposes of specifying the structure of AOS treatment
Aphasiology, 15, 261–273. sessions have been proposed, based on evidence that
Rosenbek, J. C., Kent, R. D., and LaPointe, L. L. (1984). such principles facilitate learning and retention of motor
Apraxia of speech: An overview and some perspectives. In routines involved in skilled limb movements (Schmidt,
J. C. Rosenbek, M. R. McNeil, and A. E. Aronson (Eds.), 1991). The empirical support for each approach to
Apraxia of speech: Physiology, acoustics, linguistics, man- treatment is reviewed here.
agement (pp. 1–72). San Diego, CA: College-Hill Press.
Rothi, L. J. G., and Heilman, K. M. (Eds.). (1997). Apraxia: Enhancing Articulatory Kinematics at the Segmental
The neuropsychology of action. East Sussex, UK: Psychol- Level. Several facilitative techniques have been recom-
ogy Press.
mended to enhance postural shaping and phasing of the
Roy, E. A. (Ed.). (1985). Neuropsychological studies of apraxia
and related disorders. Amsterdam: North-Holland. articulators at the segmental and syllable levels and have
Square-Storer, P. A. (Ed.). (1989). Acquired apraxia of speech been described in detail by Wertz, LaPointe, and Rosen-
in aphasic adults. London: Taylor and Francis. bek (1984). These techniques include (1) phonetic deri-
Varley, R., and Whiteside, S. P. (2001). What is the underlying vation, which refers to the shaping of speech sounds
impairment in acquired apraxia of speech? Aphasiology, 15, based on corresponding nonspeech postures, (2) pro-
39–84. gressive approximation, which involves the gradual
Ziegler, W. (2001). Apraxia of speech is not a lexical disorder. shaping of targeted speech segments from other speech
Aphasiology, 15, 74–77. segments, (3) integral stimulation and phonetic placement,
Ziegler, W., and von Cramon, D. (1985). Anticipatory coartic- which employ visual models, verbal descriptions and
ulation in a patient with apraxia of speech. Brain and
physical manipulations to achieve the desired articu-
Language, 26, 117–130.
Ziegler, W., and von Cramon, D. (1986). Disturbed coarticulatory posture, and movement, and (4) minimal pairs
lation in apraxia of speech: Acoustic evidence. Brain and contrasts, which requires patients to produce syllable or
Language, 29, 3–47. word pairs in which one member of the pair di¤ers min-
imally with respect to manner, place, or voicing features
from the other member of the pair.
Apraxia of Speech: Treatment Several early studies examined, in isolation or in
various combinations, the e¤ects of these facilitative
techniques on speech production, and reported positive
In the years since Darley (1968) first described apraxia of treatment responses (Rosenbek et al., 1973; Holtzapple
speech (AOS) as an articulatory programming disorder and Marshall, 1977; Deal and Florance, 1978; Thomp-
that could not be accounted for by disrupted linguistic or son and Young, 1983; LaPointe, 1984; Wertz, 1984).
fundamental motor processes, considerable work has However, most of these studies su¤ered from method-
been done to elucidate the perceptual, acoustic, kine- ological limitations, including inadequate subject selec-
matic, aerodynamic, and electromyographic features tion criteria, nonreplicable treatment protocols, and
that characterize AOS (cf. McNeil, Robin, and Schmidt, pre-experimental research designs, which precluded firm
1997; McNeil, Doyle, and Wambaugh, 2000). Explana- conclusions regarding the validity and generalizability of
tory models consistent with these observations have been the reported treatment e¤ects. Contemporary investiga-
proposed (Van der Merwe, 1997). tions have addressed these methodological shortcomings
Overwhelmingly, the evidence supports a conceptual- and support earlier findings regarding the positive e¤ects
ization of AOS as a of treatment techniques aimed at enhancing articulatory
neurogenic speech disorder caused by ine‰ciencies in the spec- kinematic aspects of speech at the sound, syllable, and
ification of intended articulatory movement parameters or word levels.
motor programs which result in intra- and interarticulator Specifically, in a series of investigations using single-
temporal and spatial segmental and prosodic distortions. Such subject experimental designs Wambaugh and colleagues
movement distortions are realized as extended segmental, in- examined the e¤ects of a procedurally explicit treatment
tersegmental transitionalization, syllable and word durations, protocol employing the facilitative techniques of integral
and are frequently perceived as sound substitutions, the mis- stimulation, phonetic placement, and minimal pair con-
assignment of stress, and other phrasal and sentence-level trasts in 11 well-described subjects with AOS (Wam-
prosodic abnormalities. (McNeil, Robin, and Schmidt, 1997,
p. 329)
baugh et al., 1996, 1998, 1999; Wambaugh, West, and
Doyle, 1998; Wambaugh and Cort, 1998; Wambaugh,
Traditional and contemporary conceptualizations of 2000). These studies revealed positive treatment e¤ects
the disorder have resulted in specific assumptions re- on targeted phonemes in trained and untrained words
garding appropriate tactics and targets of intervention, for all subjects across all studies, and positive main-
and a number of treatment approaches have been pro- tenance e¤ects of targeted sounds at 6 weeks post-
posed that seek to enhance (1) postural shaping and treatment. In addition, two subjects showed positive
Apraxia of Speech: Treatment 105
generalization of trained sounds to novel stimulus con- units. However, until these findings can be systematically
texts (i.e., untrained phrases), and one subject showed replicated, their generalizability remains unknown.
positive generalization to untrained sounds within the
same sound class (voiced stops). These results provide General Principles of Motor Learning. The contempo-
initial experimental evidence that treatment strategies rary explication of AOS as a disorder of motor planning
designed to enhance postural shaping and phasing of and programming has given rise to a call for the appli-
the articulators are e‰cacious in improving sound cation of motor learning principles in the treatment of
production of treated and untreated words. Further, AOS (McNeil et al., 1997, 2000; Ballard, 2001). The
there is limited evidence that for some patients and some habituation, transfer, and retention of skilled movements
sounds, generalization to untrained contexts may be (i.e., motor learning) and their controlling variables have
expected. been studied extensively in limb systems from the per-
spective of schema theory (Schmidt, 1975). This research
Enhancing Segmental Sequencing of Longer Speech has led to the specification of several principles regarding
Units. Several facilitative techniques have been recom- the structure of practice and feedback that were found
mended to improve speech production in persons with to enhance retention of skilled limb movements post-
AOS, based on the premise that the sequencing and co- treatment, and greater transfer of treatment e¤ects to
ordination of movement parameters required for the novel movements (Schmidt, 1991). Three such principles
production of longer speech units (and other complex are particularly relevant to the treatment of AOS: (1) the
motor behaviors) are governed by internal oscillatory need for intensive and repeated practice of the targeted
mechanisms (Gracco, 1990) and temporal constraints skilled movements, (2) the order in which targeted
(Kent and Adams, 1989). Treatment programs and tac- movements are practiced, and (3) the nature and sched-
tics grounded in this framework employ techniques ule of feedback.
designed to reduce or control speech rate while enhanc- With respect to the first of these principles, clinical
ing the natural rhythm and stress contours of the tar- management of AOS has long espoused intensive drill of
geted speech unit. The e¤ects of several such specific targeted speech behaviors (Rosenbek, 1978; Wertz et al.,
facilitative techniques have been studied. These include 1984). However, no studies have examined the e¤ects of
metronomic pacing (Shane and Darley, 1978; Dworkin, manipulating the number of treatment trials on the ac-
Abkarian, and Johns, 1988; Dworkin and Abkarian, quisition and retention of speech targets in AOS, and
1996; Wambaugh and Martinez, 1999), prolonged little attention has been paid to the structure of drills
speech (Southwood, 1987), vibrotactile stimulation used in treatment. That is, research on motor learning
(Rubow et al., 1982), and intersystemic facilitation (i.e., in limb systems has shown that practicing several di¤er-
finger counting) (Simmons, 1978). In addition, the e¤ects ent skilled actions in random order within training ses-
of similarly motivated treatment programs, melodic in- sions facilitates greater retention and transfer of targeted
tonation therapy (Sparks, 2001) and surface prompts actions than does blocked practice of skilled movements
(Square, Chumpelik, and Adams, 1985), have also been (Schmidt, 1991). This finding has been replicated by
reported. Knock et al. (2000) in two adult subjects with AOS in
As with studies examining the e¤ects of techniques the only study to date to experimentally manipulate
designed to enhance articulatory kinematic aspects of random versus blocked practice to examine acquisition,
speech at the segmental level, the empirical evidence retention, and transfer of speech movements.
supporting the facilitative e¤ects of rhythmic pacing, The final principle to be discussed concerns the nature
rate control, and stress manipulations on the production and schedule of feedback employed in the training of
of longer speech units in adults with AOS is limited. skilled movements. Two types of feedback have been
That is, among the reports cited, only five subjects were studied, knowledge of results (KR) and knowledge of
studied under conditions that permit valid conclusions performance (KP). KR provides information only with
to be drawn regarding the relationship between applica- respect to whether the intended movement was per-
tion of the facilitative technique and the dependent formed accurately or not. KP provides information re-
measures reported (Southwood, 1987; Dworkin et al., garding aspects of the movement that deviate from the
1988; Dworkin and Abkarian, 1996; Wambaugh et al., intended action and how the intended action is to be
1999). Whereas each of these studies reported positive performed. Schmidt and Lee (1999) argue that KP is
results, it is di‰cult to compare them because of di¤er- most beneficial during the early stages of training but
ences in the severity of the disorder, in the frequency, that KR administered at low response frequencies pro-
duration, and context in which the various facilitative motes greater retention of skilled movements. Both types
techniques were applied, in the behaviors targeted for of feedback are frequently employed in the treatment
intervention, and in the extent to which important of AOS. Indeed, the facilitative techniques of integral
aspects of treatment e¤ectiveness (i.e., generalized stimulation and phonetic placement provide the type
e¤ects) were evaluated. As such, the limited available of information that is consistent with the concept of
evidence suggests that techniques that reduce the rate of KP. However, these facilitative techniques are most fre-
articulatory movements and highlight rhythmic and quently used as antecedent conditions to enhance target
prosodic aspects of speech production may be e‰cacious performance, and response-contingent feedback fre-
in improving segmental coordination in longer speech quently takes the form of KR. The e¤ects of the nature,
106 Part II: Speech
schedule, and timing of performance feedback have not sorimotor speech disorders (pp. 311–344). New York:
been systematically investigated in AOS. Thieme.
In summary, AOS is a treatable disorder of motor Rosenbek, J. C. (1978). Treating apraxia of speech. In D. F.
planning and programming. Studies examining the ef- Johns (Ed.), Clinical Management of neurogenic communi-
cative disorders (pp. 191–241). Boston: Little, Brown.
fects of facilitative techniques aimed at improving pos-
Rosenbek, J. C., Lemme, M. L., Ahern, M. B., Harris, E. H.,
tural shaping and phasing of the articulators at the and Wertz, R. T. (1973). A treatment for apraxia of speech
segmental level and sequencing and coordination of seg- in adults. Journal of Speech and Hearing Disorders, 38,
ments into long utterances have reported positive out- 462–472.
comes. These studies are in need of carefully controlled Rubow, R. T., Rosenbek, J. C., Collins, M. J., and Longstreth,
systematic replications before generalizability can be in- D. (1982). Vibrotactile stimulation for intersystemic re-
ferred. Further, the e¤ects of motor learning principles organization in the treatment of apraxia of speech. Archives
(Schmidt and Lee, 1999) on the habituation, mainte- of Physical Medicine Rehabilitation, 63, 150–153.
nance, and transfer of speech behaviors require system- Schmidt, R. A. (1975). A schema theory of discrete motor skill
atic evaluation in persons with AOS. learning. Psychological Review, 82, 225–260.
Schmidt, R. A. (1988). Motor control and learning: A be-
—Patrick J. Doyle and Malcolm R. McNeil havioral emphasis (2nd ed.). Champaign, IL: Human
Kinetics.
References Schmidt, R. A. (1991). Frequent augmented feedback can de-
grade learning: Evidence and interpretations. In R. Requin
Ballard, K. J. (2001). Principles of motor learning and treatment and G. E. Stelmach (Eds.), Tutorials in neuroscience. Dor-
for AOS. Paper presented at the biennial Motor Speech drecht: Kluwer Academic.
Conference, Williamsburg, VA. Schmidt, R. A., and Lee, T. D. (1999). Motor control and
Darley, F. L. (1968). Apraxia of speech: 107 years of termino- learning: A behavioral emphasis (3rd ed.). Champaign, IL:
logical confusion. Paper presented at the annual convention Human Kinetics.
of the American Speech and Hearing Association, Denver, Shane, H. C., and Darley, F. L. (1978). The e¤ect of auditory
CO. rhythmic stimulation on articulatory accuracy in apraxia of
Deal, J. L., and Florance, C. L. (1978). Modification of the speech. Cortex, 14, 444–450.
eight-step continuum for treatment of apraxia of speech in Simmons, N. N. (1978). Finger counting as an intersystemic
adults. Journal of Speech and Hearing Disorders, 43, 89–95. reorganizer in apraxia of speech. In R. H. Brookshire (Ed.),
Dworkin, J. P., and Abkarian, G. G. (1996). Treatment of Clinical Aphasiology Conference proceedings (pp. 174–179).
phonation in a patient with apraxia and dysarthria second- Minneapolis: BRK.
ary to severe closed head injury. Journal of Medical Speech- Southwood, H. (1987). The use of prolonged speech in the
Language Pathology, 4, 105–115. treatment of apraxia of speech. In R. H. Brookshire (Ed.),
Dworkin, J. P., Abkarian, G. G., and Johns, D. F. (1988). Clinical Aphasiology Conference proceedings (pp. 277–287).
Apraxia of speech: The e¤ectiveness of a treatment regime. Minneapolis: BRK.
Journal of Speech and Hearing Disorders, 53, 280–294. Sparks, R. W. (2001). Melodic intonation therapy. In R. Cha-
Gracco, V. L. (1990). Characteristics of speech as a motor pey (Ed.), Language intervention strategies in adult aphasia
control system. In G. E. Hammond (Ed.), Cerebral control (4th ed., pp. 703–717). Baltimore: Lippincott Williams and
of speech and limb movements. Amsterdam: North-Holland. Wilkins.
Holtzapple, P., and Marshall, N. (1977). The application of Square, P. A., Chumpelik, D., and Adams, S. (1985). E‰cacy
multiphonemic articulation therapy with apraxic patients. of the PROMPT system of therapy for the treatment of
In R. H. Brookshire (Ed.), Clinical Aphasiology Conference acquired apraxia of speech. In R. H. Brookshire (Ed.),
proceedings (pp. 46–58). Minneapolis: BRK. Clinical Aphasiology Conference proceedings (pp. 319–320).
Kent, R. D., and Adams, S. G. (1989). The concept and Minneapolis: BRK.
measurement of coordination in speech disorders. In Square-Storer, P., and Hayden, D. C. (1989). PROMPT treat-
S. A. Wallace (Ed.), Advances in psychology: Perspectives ment. In P. Square-Storer (Ed.), Acquired apraxia of speech
on the coordination of movement (pp. 415–450). New York: in aphasic adults (pp. 190–219). London: Taylor and
Elsevier. Francis.
Knock, T. R., Ballard, K. J., Robin, D. A., and Schmidt, R. Thompson, C. K., and Young, E. C. (1983). A phonological
(2000). Influence of order of stimulus presentation on process approach to apraxia of speech: An experimental
speech motor learning: A principled approach to treatment analysis of cluster reduction. Paper presented at a meeting of
for apraxia of speech. Aphasiology, 14, 653–668. the American Speech Language and Hearing Association,
LaPointe, L. L. (1984). Sequential treatment of split lists: A Cincinnati, OH.
case report. In J. Rosenbek, M. McNeil, and A. Aronson Van der Merwe, A. (1997). A theoretical framework for the
(Eds.), Apraxia of speech: Physiology, acoustics, linguistics, characterization of pathological speech sensorimotor con-
management (pp. 277–286). San Diego, CA: College-Hill trol. In McNeil (Ed.), Clinical management of sensorimotor
Press. speech disorders (pp. 1–25). New York: Thieme.
McNeil, M. R., Doyle, P. J., and Wambaugh, J. (2000). Wambaugh, J. L. (2000). Stimulus generalization e¤ects of
Apraxia of speech: A treatable disorder of motor planning treatment for articulatory errors in apraxia of speech. Poster
and programming. In S. E. Nadeau, L. J. Gonzalez-Rothi, session presented at the biennial Motor Speech Conference,
and B. Crosson (Eds.), Aphasia and language: From theory San Antonio, TX.
to practice (pp. 221–265). New York: Guilford Press. Wambaugh, J. L., and Cort, R. (1998). Treatment for AOS:
McNeil, M. R., Robin, D. A., and Schmidt, R. A. (1997). Perceptual and VOT changes in sound production. Poster
Apraxia of speech: Definition, di¤erentiation, and treat- session presented at the biennial Motor Speech Conference,
ment. In M. R. McNeil (Ed.), Clinical management of sen- Tucson, AZ.
Aprosodia 107
Wambaugh, J. L., Doyle, P. J., Kalinyak, M., and West, J. apraxia of speech. In T. E. Prescott (Ed.), Clinical Aphasi-
(1996). A minimal contrast treatment for apraxia of speech. ology, 20, 285–298. Austin, TX: Pro-Ed.
Clinical Aphasiology, 24, 97–108. Robin, D. A. (1992). Developmental apraxia of speech:
Wambaugh, J. L., Kalinyak-Flizar, M. M., West, J. E., and Just another motor problem. American Journal of Speech-
Doyle, P. J. (1998). E¤ects of treatment for sound errors in Language Pathology: A Journal of Clinical Practice, 1,
apraxia of speech and aphasia. Journal of Speech Language 19–22.
and Hearing Research, 41, 725–743. Schmidt, R. A., and Bjork, R. A. (1992). New conceptualiza-
Wambaugh, J. L., and Martinez, A. L. (1999). E¤ects of rate tions of practice: Common principles in three paradigms
and rhythm control on sound production in apraxia of suggest new concepts for training. Psychological Science, 3,
speech. Aphasiology, 14, 603–617. 207–217.
Wambaugh, J. L., Martinez, A. L., McNeil, M. R., and Shea, C. H., and Morgan, R. L. (1979). Contextual interfer-
Rogers, M. (1999). Sound production treatment for apraxia ence e¤ects on the acquisition retention, and transfer of a
of speech: Overgeneralization and maintenance e¤ects. motor skill. Journal of Experimental Psychology: Human
Aphasiology, 13, 821–837. Learning and Memory, 5, 179–187.
Wambaugh, J. L., West, J. E., and Doyle, P. J. (1998). Treat- Shea, J. B., and Wright, D. L. (1991). When forgetting benefits
ment for apraxia of speech: E¤ects of targeting sound motor retention. Research Quarterly for Exercise and Sport,
groups. Aphasiology, 12, 731–743. 62, 293–301.
Wertz, R. T. (1984). Response to treatment in patients with Simmons, N. N. (1980). Choice of stimulus modes in treating
apraxia of speech. In J. C. Rosenbek, M. R. McNeil, and apraxia of speech: A case study. In R. H. Brookshire (Ed.),
A. E. Aronson (Eds.), Apraxia of speech: Physiology, acous- Clinical Aphasiology Conference proceedings (pp. 302–307).
tics, linguistics, management (pp. 257–276). San Diego, CA: Minneapolis: BRK.
College-Hill Press. Square, P. A., Chumpelik, D., Morningstar, D., and Adams, S.
Wertz, R. T., LaPointe, L. L., and Rosenbek, J. C. (1984). (1986). E‰cacy of the PROMPT system of therapy for
Apraxia of speech in adults: The disorder and its manage- the treatment of acquired apraxia of speech: A follow-up
ment. Orlando, FL: Grune and Stratton. investigation. In R. H. Brookshire (Ed.), Clinical Aphasiol-
ogy Conference proceedings (pp. 221–226). Minneapolis:
Further Readings BRK.
Square-Storer, P. (1989). Traditional therapies for AOS:
Dabul, B., and Bollier, B. (1976). Therapeutic approaches to Reviewed and rationalized. In P. Square-Storer (Ed.),
apraxia. Journal of Speech and Hearing Disorders, 41, Acquired apraxia of speech in aphasic adults. London:
268–276. Taylor and Francis.
Freed, D. B., Marshall, R. C., and Frazier, K. E. (1977). The Square-Storer, P., and Martin, R. E. (1994). The nature and
long-term e¤ectiveness of PROMPT treatment in a severely treatment of neuromotor speech disorders in aphasia. In
apractic-aphasic speaker. Aphasiology, 11, 365–372. R. Chapey (Ed.), Language intervention strategies in adult
Florance, C. L., Rabidoux, P. L., and McCauslin, L. S. (1980). aphasia (pp. 467–499). Baltimore: Williams and Wilkins.
An environmental manipulation approach to treating Wambaugh, J. L., and Doyle, P. D. (1994). Treatment for
apraxia of speech. In R. H. Brookshire (Ed.), Clinical acquired apraxia of speech: A review of e‰cacy reports.
Aphasiology Conference proceedings (pp. 285–293). Minne- Clinical Aphasiology, 22, 231–243.
apolis: BRK. Wulf, G., and Schmidt, R. A. (1994). Feedback-induced vari-
Howard, S., and Varley, R. (1995). EPG in therapy: Using ability and the learning of generalized motor programs.
electropalatography to treat severe acquired apraxia of Journal of Motor Behavior, 26, 348–361.
speech. European Journal of Disorders of Communication,
30, 246–255.
Keith, R. L., and Aronson, A. E. (1975). Singing as therapy for Aprosodia
apraxia of speech and aphasia: Report of a case. Brain and
Language, 2, 483–488.
Lane, V. W., and Samples, J. M. (1981). Facilitating commu- Prosody consists of alterations in pitch, stress, and du-
nication skills in adult apraxics: Application of blisssymbols ration across words, phrases, and sentences. These same
in a group setting. Journal of Communication Disorders, 14,
parameters are defined acoustically as fundamental fre-
157–167.
Lee, T. D., and Magill, R. A. (1983). The locus of contextual quency, intensity, and timing. It is the variation in these
interference in motor-skill acquisition. Journal of Experi- parameters that not only provides the melodic contour
mental Psychology: Learning, Memory, and Cognition, 9, of speech, but also invests spoken language with linguis-
730–746. tic and emotional meaning. Prosody is thus crucial to
McNeil, M. R., Prescott, T. E., and Lemme, M. L. (1976). An conveying and understanding communicative intent.
application of electro-myographic biofeedback to aphasia/ The term ‘‘aprosodia’’ was first used by Monrad-
apraxia treatment. In R. H. Brookshire (Ed.), Clinical Krohn (1947) to describe loss of the prosodic features
Aphasiology Conference proceedings (pp. 151–171). Minne- of speech. It resurfaced in the 1980s in the work of
apolis: BRK. Ross and his colleagues to refer to the attenuated use
Rabidoux, P., Florance, C., and McCauslin, L. (1980). The use
of a Handi Voice in the treatment of a severely apractic
of and decreased sensitivity to prosodic cues by right
nonverbal patient. In R. H. Brookshire (Ed.), Clinical hemisphere damaged patients (Ross and Mesulam, 1979;
Aphasiology Conference proceedings (pp. 294–301). Minne- Ross, 1981; Gorelick and Ross, 1987).
apolis: BRK. Prosodic deficits in expression or comprehension can
Raymer, A. M., and Thompson, C. K. (1991). E¤ects of verbal accompany a variety of cognitive, linguistic, and psychi-
plus gestural treatment in a patient with aphasia and severe atric conditions, including dysarthria and other motor
108 Part II: Speech
speech disorders, aphasia, chronic alcoholism, schizo- been somewhat limited by uncertainty about the under-
phrenia, depression, and mania, as well as right hemi- lying mechanisms of aprosodia. It is not clear the extent
sphere damage (RHD) (Du¤y, 1995; Myers, 1998; to which expressive aprosodia is a motor problem, a
Monnot, Nixon, Lovallo, and Ross, 2001). The term pragmatic problem, a resource allocation problem, or
aprosodia, however, typically refers to the prosodic some combination of conditions. Similarly, it is not clear
impairments that can accompany RHD from stroke, whether receptive aprosodia is due to perceptual inter-
head injury, or progressive neurologic disease with a ference in decoding prosodic features, to restricted at-
right hemisphere focus. Even the disturbed prosody of tention (which may reduce sensitivity to prosodic cues),
other illnesses, such as schizophrenia, may be the result or to some as yet unspecified mechanism.
of alterations in right frontal and extrapyramidal areas, Much of the research in prosodic processing has been
areas considered important to prosodic impairment sub- conducted to answer questions about the laterality of
sequent to RHD (Sweet, Primeau, Fichtner et al., 1998; brain function. Subjects with unilateral left or right brain
Ross et al., 2001). damage have been asked to produce linguistic and emo-
The clinical presentation of expressive aprosodia is a tional prosody in spontaneous speech, in imitation, and
flattened, monotonic, somewhat robotic, stilted prosodic in reading tasks at the single word, phrase, and sentence
production characterized by reduced variation in proso- level. In receptive tasks they have been asked to deter-
dic features and somewhat uniform intersyllable pause mine the emotional valence of expressive speech and to
time. The condition often, but not always, accompanies discriminate between various linguistic forms and emo-
flat a¤ect, a more general form of reduced environmen- tional content in normal and in filtered-speech para-
tal responsivity, reduced sensitivity to the paralinguistic digms. Linguistic tasks include discriminating between
features of communication (gesture, body language, fa- nouns and noun phrases based on contrastive stress pat-
cial expression), and attenuated animation in facial ex- terns (e.g., greenhouse versus green house); using stress
pression subsequent to RHD. Aprosodia can occur in patterns to identify sentence meaning (Joe gave Ella
the absence of dysarthria and other motor speech dis- flowers versus Joe gave Ella flowers); and identifying
orders, in the absence of depression or other psychiatric sentence types based on prosodic contour (e.g., the rising
disturbances, and in the absence of motor programming intonation pattern for interrogatives versus the flatter
deficits typically associated with apraxia of speech. Be- pattern for declaratives).
cause it is associated with damage to the right side of The emphasis on laterality of function has helped to
the brain, it usually occurs in the absence of linguistic establish that both hemispheres as well as some sub-
impairments (Du¤y, 1995; Myers, 1998). cortical structures contribute to normal prosodic pro-
Expressive aprosodia is easily recognized in patients cessing. The extensive literature supporting a particular
with flat a¤ect. Deficits in prosodic perception and role for the right hemisphere in processing content gen-
comprehension are less apparent in clinical presenta- erated the central hypotheses guiding prosodic laterality
tion. It is important to note that receptive and expres- research. The first hypothesis suggests that a¤ective or
sive prosodic processing can be di¤erentially a¤ected in emotional prosody is in the domain of the right hemi-
aprosodia. sphere (Heilman, Scholes, and Watson, 1975; Borod,
First observed in the emotional domain, aprosodia Ko¤, Lorch et al., 1985; Blonder, Bowers, and Heilman,
has also been found to occur in the linguistic domain. 1991). Another hypothesis holds that prosodic cues
Thus, patients may have problems both encoding and themselves are lateralized, independent of their function
decoding the tone of spoken messages and the intention (emotional or linguistic) (Van Lancker and Sidtis, 1992).
behind the message as conveyed through both linguistic Finally, lesion localization studies have found that cer-
and emotional prosody. tain subcortical structures, the basal ganglia in particu-
In the acute stage, patients with aprosodia are usually lar, play a role in prosodic processing (Cancelliere and
unaware of the problem until it is pointed out to them. Kertesz, 1990; Bradvik, Dravins, Holtas et al., 1991).
Even then, they may deny it, particularly if they su¤er Cancelliere and Kertesz (1990) speculated that the basal
from other forms of denial of deficit. Severity of neglect, ganglia may be important not only because of their role
for example, has been found to correlate with prosodic in motor control, but also because of their limbic and
deficits (Starkstein, Federo¤, Price et al., 1994). In rare frontal connections which may influence the expression
cases, aprosodia may last for months and even years of emotion in motor action.
when other signs of RHD have abated. Patients with Research findings have varied as a function of task
persistent aprosodia may be aware of the problem but type, subject selection criteria, and methods of data
feel incapable of correcting it. analysis. Subjects across and within studies may vary in
Treatment of aprosodia is often limited to training terms of time post-onset, the presence or absence of ne-
patients to adopt compensatory techniques. Patients glect and dysarthria, intrahemispheric site of lesion, and
may be taught to attend more carefully to other forms of severity of attentional and other cognitive deficits. With
emotional expression (e.g., gesture, facial expression) the exception of site of lesion, these variables have rarely
and to signal mood by explicitly stating their mood to been taken into account in research design. Data analy-
the listener. There has, however, been at least one report sis has varied across studies. In some studies it has been
of successful symptomatic treatment using pitch bio- based on perceptual judgments by one or more listeners,
feedback and modeling (Stringer, 1996). Treatment has which adds a subjective component. In others, data are
Aprosodia 109
submitted to acoustic analysis, which a¤ords increased sodic deficits can occur in both left as well as right
objective control but in some cases may not match lis- hemisphere damage, and has furthered our understand-
tener perception of severity of impairment (Ryalls, Joa- ing of the mechanisms and di¤erences in prosodic pro-
nette, and Feldman, 1987). cessing across the hemispheres. However, the focus on
In general, acoustic analyses of prosodic productions laterality has had some drawbacks for understanding
by RHD patients supports the theory that prosody is aprosodia per se. The main problem is that while sub-
lateralized according to individual prosodic cues rather jects in laterality studies are selected for unilateral brain
than according to the function prosody serves (emo- damage, they are not screened for prosodic impairment.
tional versus linguistic). In particular, pitch cues are Thus, the data pool on which we rely for conclusions
considered to be in the domain of the right hemisphere. about the nature of RHD prosodic deficits consists
Duration and timing cues are considered to be in the largely of subjects with and subjects without prosodic
domain of the left hemisphere (Robin, Tranel, and impairment. The characteristics of aprosodia, its mecha-
Damasio, 1990; Van Lancker and Sidtis, 1992; Baum nisms, duration, frequency of occurrence in the general
and Pell, 1997). RHD population, and the presence/absence of other
Research suggests that reduced pitch variation and a RHD deficits that may accompany it have yet to be
somewhat restricted pitch range appear to be significant clearly delineated. These issues will remain unclear until
factors in the impaired prosodic production of RHD a working definition of aprosodia is established and de-
subjects (Colsher, Cooper, and Gra¤-Radford, 1987; scriptive studies using that definition as a means of
Behrens, 1989; Baum and Pell, 1997; Pell, 1999a). RHD screening patients are undertaken.
patients are minimally if at all impaired in the produc- See also prosodic deficits, right hemisphere lan-
tion of emphatic stress. However, they may have an guage and communication functions in adults;
abnormally flat pitch pattern in declarative sentences, right hemisphere language disorder.
less than normal variation in pitch for interrogative sen-
tences, and may produce emotionally toned sentences —Penelope S. Myers
with less than normal acoustic variation (Behrens, 1988;
Emmory, 1987; Pell, 1999). Pitch variation is crucial References
to signaling emotions, which may explain why impaired Baum, S. R., and Pell, M. D. (1997). Production of a¤ective
production of emotional prosody appears particularly and linguistic prosody by brain-damaged patients. Aphasi-
prominent in aprosodia. Interestingly, in the case of ology, 11, 177–198.
tonal languages (e.g., Chinese, Thai, and Norwegian) in Behrens, S. J. (1988). The role of the right hemisphere in the
which pitch patterns in individual words serve a seman- production of linguistic stress. Brain and Language, 33,
tic role, pitch has been found to be a left hemisphere 104–127.
function (Packard, 1986; Ryalls and Reinvang, 1986; Behrens, S. J. (1989). Characterizing sentence intonation in a
Gandour et al., 1992). right hemisphere-damaged population. Brain and Language,
37, 181–200.
Prosodic perception or comprehension deficits asso- Blonder, L. X., Bowers, D., and Heilman, K. M. (1991). The
ciated with aprosodia tend to follow the pattern found in role of the right hemisphere in emotional communication.
production. Non-temporal properties such as pitch Brain, 114, 1115–1127.
appear to be more problematic than time-dependent Borod, J. C., Ko¤, E., Lorch, M. P., and Nicholas, M. (1985).
properties such as duration and timing (Divenyi and Channels of emotional expression in patients with unilateral
Robinson, 1989; Robin et al., 1990; Van Lancker and brain damage. Archives of Neurology, 42, 345–348.
Sidtis, 1992). For example, Van Lancker and Sidtis Bradvik, B., Dravins, C., Holtas, S., Rosen, I., Ryding, E., and
(1992) found that right- and left-hemisphere-damaged Ingvar, D. H. (1991). Disturbances of speech prosody fol-
patients used di¤erent cues to identify emotional stimuli. lowing right hemisphere infarcts. Acta Neurologica Scandi-
Patients with RHD tended to base their decisions on navica, 84, 114–126.
Cancelliere, A. E. B., and Kertesz, A. (1990). Lesion localiza-
durational cues rather than on fundamental frequency tion in acquired deficits of emotional expression and com-
variability while left-hemisphere-damaged patients did prehension. Brain and Cognition, 13, 133–147.
the opposite. These data suggest a perceptual, rather Chobor, K. L., and Brown, J. W. (1987). Phoneme and tim-
than a functional (linguistic versus emotional), impair- bre monitoring in left and right cerebrovascular accident
ment. Although a study by Pell and Baum (1997) failed patients. Brain and Language, 30, 78–284.
to replicate these results, the data are supported by data Colsher, P. L., Cooper, W. E., and Gra¤-Radford, N. (1987).
from dichotic listening and other studies that have Intonation variability in the speech of right-hemisphere
investigated temporal versus time-independent cues such damaged patients. Brain and Language, 32, 379–383.
as pitch information (Chobor and Brown, 1987; Sidtis Divenyi, P. L., and Robinson, A. J. (1989). Nonlinguistic au-
and Volpe, 1988; Divenyi and Robinson, 1989; Robin ditory compatabilities in aphasia. Brain and Language, 37,
290–396.
et al., 1990). Du¤y, J. R. (1995). Motor speech disorders. St. Louis: Mosby.
Almost all studies of prosodic deficits have focused on Emmory, K. D. (1987). The neurological substrates for pro-
whether unilateral brain damage produces prosodic def- sodic aspects of speech. Brain and Language, 30, 305–
icits, rather than describing the characteristics of proso- 320.
dic problems in patients known to have prosodic deficits. Gandour, J., Ponglorpisit, S., Khunadorn, F., Dechongkit, S.,
The body of laterality research has established that pro- Boongird, P., Boonklam, R., and Potisuk, S. (1992). Lexical
110 Part II: Speech
tones in Thai after unilateral brain damage. Brain and Lan- Van Lancker, D., and Sidtis, J. J. (1992). The identification
guage, 43, 275–307. of a¤ective-prosodic stimuli by left- and right-hemisphere-
Gorelick, P. B., and Ross, E. D. (1987). The aprosodias: Fur- damaged subjects: All errors are not created equal. Journal
ther functional-anatomical evidence for the organization of of Speech and Hearing Research, 35, 963–970.
a¤ective language in the right hemisphere. Journal of Neu-
rology, Neurosurgery, and Psychiatry, 50, 553–560. Further Reading
Heilman, K. M., Scholes, R., and Watson, R. T. (1975). Au-
ditory a¤ective agnosia: Disturbed comprehension of af- Baum, S. R., and Pell, M. D. (1999). The neural bases of
fective speech. Journal of Neurology, Neurosurgery, and prosody: Insights from lesion studies and neuroimaging.
Psychiatry, 38, 69–72. Aphasiology, 13, 581–608.
Mohrad-Krohn, G. H. (1947). Dysprosody or altered ‘‘melody
of language.’’ Brain, 70, 405–415.
Monnot, M., Nixon, S., Lovallo, W., and Ross, E. (2001). Augmentative and Alternative
Altered emotional perception in alcoholics: Deficits in af-
fective prosody comprehension. Alcoholism: Clinical and Communication Approaches in Adults
Experimental Research, 25, 362–369.
Myers, P. S. (1998). Prosodic deficits. In Right hemisphere
damage: Disorders of communication and cognition (pp. 73– An augmentative and alternative communication (AAC)
90). San Diego, CA: Singular. system is an integrated group of components used by
Packard, J. (1986). Tone production deficits in nonfluent individuals with severe communication disorders to
aphasic Chinese speech. Brain and Language, 29, 212–223. enhance their competent communication. Competent
Pell, M. D. (1999a). Fundamental frequency encoding of lin- communication serves a variety of functions, of which
guistic and emotional prosody by right hemisphere dam- we can isolate four: (1) communication of wants and
aged speakers. Brain and Language, 62, 161–192. needs, (2) information transfer, (3) social closeness, and
Pell, M. D. (1999b). The temporal organization of a¤ective (4) social etiquette (Light, 1988). These four functions
and non-a¤ective speech in patients with right-hemisphere broadly encompass all communicative interactions. An
infarcts. Cortex, 35, 163–182.
appropriate AAC system addresses not only basic com-
Pell, M. D., and Baum, S. R. (1997). Unilateral brain damage,
prosodic comprehension deficits, and the acoustic cues to munication of wants and needs, but also the establish-
prosody. Brain and Language, 57, 195–214. ment, maintenance, and development of interpersonal
Robin, D. A., Tranel, D., and Damasio, H. (1990). Auditory relationships using information transfer, social closeness,
perception of temporal and spectral events in patients with and social etiquette.
focal left and right cerebral lesions. Brain and Language, 39, AAC is considered multimodal, and as such it incor-
539–555. porates the full communication abilities of the adult. It
Ross, E. D. (1981). The aprosodias: Functional-anatomic or- includes any existing natural speech or vocalizations,
ganization of the a¤ective components of language in the gestures, formal sign language, and aided communica-
right hemisphere. Archives of Neurology, 38, 561–569. tion. ‘‘AAC allows individuals to use every mode possi-
Ross, E. D., and Mesulam, M.-M. (1979). Dominant language
functions of the right hemisphere? Prosody and emotional
ble to communicate’’ (Light and Binger, 1998, p. 1).
gesturing. Archives of Neurology, 36, 144–148. AAC systems are typically described as high-
Ross, E. D., Orbelo, D. M., Cartwright, J., Hansel, S., Bur- technology, low- or light-technology, and no-technology
gard, M., Testa, J. A., and Buck, R. (2001). A¤ective- in respect to the aids used in implementation. High-
prosodic deficits in schizophrenia: Profiles of patients with technology AAC systems use electronic devices to sup-
brain damage and comparison with relation schizophrenic port digitized or synthesized communication strategies.
symptoms. Journal of Neurology, Neurosurgery, and Psy- Low- or light-technology systems include items such
chiatry, 70, 597–604. as communication boards (symbols), communication
Ryalls, J., Joanette, Y., and Feldman, L. (1987). An acoustic books, and light pointing devices. A no-technology sys-
comparison of normal and right-hemisphere-damage speech tem involves the use of strategies and techniques, such as
prosody. Cortex, 23, 685–694.
Ryalls, J., and Reinvang, I. (1986). Functional lateralization of
body movements, gestures, and sign language, without
linguistic tones: Acoustic evidence from Norwegian. Lan- the use of specific aids or devices.
guage and Speech, 29, 389–398. AAC is used to assist adults with a wide range of
Sidtis, J. J., and Volpe, B. T. (1988). Selective loss of complex disabilities, including congenital disabilities (e.g., cere-
pitch or speech discrimination after unilateral lesion. Brain bral palsy, mental retardation), acquired disabilities
and Language, 34, 235–245. (e.g., traumatic head injury, stroke), and degenerative
Starkstein, S. E., Federo¤, J. P., Price, T. R., Leiguarda, R. C., conditions (e.g., multiple sclerosis, amyotrophic lateral
and Robinsn, R. G. (1994). Neuropsychological and neu- sclerosis) (American Speech-Language-Hearing Associ-
roradiologic correlates of emotional prosody comprehen- ation [ASHA], 1989). Individuals at any point across the
sion. Neurology, 44, 516–522. life span and in any stage of communication ability may
Stringer, A. Y. (1996). Treatment of motor aprosodia with
pitch biofeedback and expression modelling. Brain Injury,
use AAC (see the companion entry, augmentative and
10, 583–590. alternative communication approaches in children).
Sweet, L. H., Primeau, M., Fichtner, C. G., Lutz, G. (1998). Adults with severe communication disorders benefit
Dissociation of a¤ect recognition and mood state from from AAC. ASHA (1991, p. 10) describes these people
blunting in patients with schizophrenia. Psychiatry Re- as ‘‘those for whom natural gestural, speech, and/or
search, 81, 301–308. written communication is temporarily or permanently
Augmentative and Alternative Communication Approaches in Adults 111
inadequate to meet all of their communication needs.’’ tion should be completed when an individual reaches
An important consideration is that ‘‘although some 50% of his or her habitual speaking rate (approximately
individuals may be able to produce a limited amount 100 words per minute) on a standard intelligibility as-
of speech, it is inadequate to meet their varied com- sessment (such as the Sentence Intelligibility Test; York-
munication needs’’ (ASHA, 1991, p. 10). AAC may also ston, Beukelman, and Tice, 1996). Frequent objective
be used to support comprehension and cognitive abilities measurement of speaking rate is important to provide
by capitalizing on residual skills and thus facilitating timely AAC intervention. Access to a communication
communication. system is increasingly important as ALS advances
Many adults with severe communication disorders (Mathy, Yorkston, and Gutmann, 2000).
demonstrate some ability to communicate using natural Traumatic brain injury (TBI) refers to injuries to the
speech. Natural speech is more time-e‰cient and lin- brain that involve rapid acceleration and deceleration,
guistically flexible than other modes (involving AAC). whereby the brain is whipped back and forth in a quick
Speech supplementation AAC techniques (alphabet motion, which results in compromised neurological
and topic supplementation) used in conjunction with function (Levin, Benton, and Grossman, 1982). The goal
natural speech can provide extensive contextual knowl- of AAC in TBI is to provide a series of communication
edge to increase the listener’s ability to understand systems and strategies so that individuals can communi-
a message. As the quality of the acoustic signal and cate at the level at which they are currently functioning
the quality of environmental information improve, (Doyle et al., 2000). Generally, recovery of cognitive
comprehensibility—intelligibility in context—of mes- functioning is categorized into phases (Blackstone,
sages is enhanced (Lindblom, 1990). Similarly, poor- 1989). In the early phase, the person is minimally re-
quality acoustic signals and poor environmental sponsive to external stimuli. AAC goals include provid-
information result in a deterioration in message coming support to respond to one-step motor commands and
prehensibility. When a speaker experiences reduced discriminate one of an array of choices (objects, people,
acoustic speech quality, optimizing any available con- locations). AAC applications during this phase include
textual information through AAC techniques will in- low-technology pictures and communication boards and
crease the comprehensibility of the message. ‘‘Given that choices of real objects to support communication. In the
communication e¤ectiveness varies across social sit- middle phase, the person exhibits improved consistency
uations and listeners, it is important that individuals who of responses to external stimuli. It is in this phase that
use natural speech, speech-supplementation, and AAC persons who are unable to speak because of severe cog-
strategies learn to switch communication modes de- nitive confusion become able to speak. If they do not
pending on the situation and the listener’’ (Hustad and become speakers by the end of this phase, it is likely a
Beukelman, 2000, p. 103). result of chronic motor control and language impair-
The patterns of communication disorders in adults ments. AAC goals during this phase address providing a
vary from condition to condition. Persons with aphasia, way to indicate basic needs and giving a response
traumatic brain injury, Parkinson’s disease, Guillain- modality that increases participation in the evaluation
Barré syndrome, multiple sclerosis, and numerous motor and treatment process. AAC intervention strategies
speech impairments benefit from using AAC (Beukel- usually involve nonelectronic, low-technology, or no-
man and Mirenda, 1998). ACC approaches for a few technology interventions to express needs. In the late
adult severe communication disorders are described phase, if the person continues to be nonspeaking, it is
here. likely the result of specific motor or language impair-
Amyotrophic lateral sclerosis (ALS) is a disease of ment. AAC intervention may be complicated by co-
rapid degeneration involving the motor neurons of the existing cognitive deficits. Intervention goals address
brain and spinal cord that leaves cognitive abilities gen- provision of functional ways to interact with listeners in
erally intact. The cause is unknown, and there is no a variety of settings and to assist the individual to par-
known cure. For those whose initial impairments are in ticipate in social, vocational, educational, and recre-
the brainstem, speech symptoms typically occur early in ational settings. AAC intervention makes use of both
the disease progression. On average, speech intelligibility low- and high-technology strategies in this phase of
in this (bulbar) group declines precipitously approxi- recovery.
mately 10 months after diagnosis. For those whose im- Brainstem stroke (cerebrovascular accident, or CVA)
pairment begins in the lower spine, speech intelligibility disrupts the circulation serving the lower brainstem. The
declines precipitously approximately 25 months after di- result is often severe dysarthria or anarthria, and re-
agnosis. Some individuals maintain functional speech duced ability to control the muscles of the face, mouth,
much longer. Clinically, a drop in speaking rate predicts and larynx voluntarily or reflexively. Communication
the onset of the abrupt drop in speech intelligibility symptoms vary considerably with the extent of damage.
(Ball, Beukelman, and Pattee, 2001). As a group, 80% of Some individuals are dysarthric but able to communi-
individuals with ALS eventually require use of AAC. cate partial or complete messages, while others may be
Because the drop in intelligibility is so sudden, intelligi- unable to speak. AAC intervention is typically described
bility is not a good measure to use in determining the in five stages (Beukelman and Mirenda, 1998). In stage
timing of an AAC evaluation. Rather, because the 1, the person exhibits no functional speech. The goal of
speaking rate declines more gradually, an AAC evalua- intervention is to provide early communication so that
112 Part II: Speech
the person can respond to yes/no questions, initial choice Lindblom, B. (1990). On the communication process: Speaker-
making, pointing, and introduction of a multipurpose listener interaction and the development of speech. Aug-
AAC device. In stage 2, the goal is to reestablish speech mentative and Alternative Communication, 6, 220–230.
by working directly to develop control over the respira- Mathy, P., Yorkston, K., and Gutmann, M. (2000). AAC for
individuals with amyotrophic lateral sclerosis. In K. York-
tory, phonatory, velopharyngeal, and articulatory sub-
ston, D. Beukelman, and J. Reichle (Eds.), Augmentative
systems. Early in this stage, the AAC system will support and alternative communication for adults with acquired
the majority of interactions; however, late in this stage neurologic disorders (pp. 183–229). Baltimore: Paul H.
persons are able to convey an increasing percentage of Brookes.
messages with natural speech. In stage 3, the person Yorkston, K., Beukelman, D., and Tice, R. (1996). Sentence
exhibits independent use of natural speech. The AAC Intelligibility Test. Lincoln, NE: Tice Technologies.
intervention focuses on intelligibility, with alphabet sup-
plementation used early, but later only to resolve com- Further Readings
munication breakdowns. In this stage, the use of AAC
may become necessary only to support writing. In stages Beukelman, D., Yorkston, K., and Reichle, J. (2000). Aug-
mentative and alternative communication for adults with
4 and 5, the person no longer needs to use an AAC acquired neurologic disorders. Baltimore: Paul H. Brookes.
system. Collier, B. (1997). See what we say: Vocabulary and tips for
In summary, adults with severe communication dis- adults who use augmentative and alternative communication.
orders are able to take advantage of increased commu- North York, Ontario, Canada: William Bobek Productions.
nication through the use of AAC. The staging of AAC DeRuyter, F., and Kennedy, M. (1991). Augmentative com-
interventions is influenced by the individual’s communi- munication following traumatic brain injury. In D. Beukel-
cation abilities and the natural course of the disorder, man and K. Yorkston (Eds.), Communication disorders
whether advancing, remitting, or stable. following traumatic brain injury: Management of cognitive,
language, and motor impairments (pp. 317–365). Austin,
—Laura J. Ball TX: Pro-Ed.
Fried-Oken, M., and Bersani, H. (2000). Speaking up and
References spelling it out: Personal essays on augmentative and alterna-
tive communication. Baltimore: Paul H. Brookes.
American Speech-Language-Hearing Association. (1989). Garrett, K., and Kimelman, M. (2000). AAC and aphasia:
Competencies for speech-language pathologists provid- Cognitive-linguistic considerations. In D. Beukelman, K.
ing services in augmentative communication. ASHA, 31, Yorkston, and J. Reichle (Eds.), Augmentative and alterna-
107–110. tive communication for adults with acquired neurologic dis-
American Speech-Language-Hearing Association. (1991). Re- orders (pp. 339–374). Baltimore: Paul H. Brookes.
port: Augmentative and alternative communication. ASHA, Klasner, E., and Yorkston, K. (2000). AAC for Huntington
33(Suppl. 5), 9–12. disease and Parkinson’s disease: Planning for change. In K.
Ball, L., Beukelman, D., and Pattee, G. (2001). A protocol for Yorkston, D. Beukelman, and J. Reichle (Eds.), Augmen-
identification of early bulbar signs in ALS. Journal of Neu- tative and alternative communication for adults with acquired
rological Sciences, 191, 43–53. neurologic disorders (pp. 233–270). Baltimore: Paul H.
Beukelman, D., and Mirenda, P. (1998). Augmentative and al- Brookes.
ternative communication: Management of severe communi- Ladtkow, M., and Culp, D. (1992). Augmentative communi-
cation disorders in children and adults. Baltimore: Paul H. cation with the traumatically brain injured population. In
Brookes. K. Yorkston (Ed.), Augmentative communication in the
Blackstone, S. (1989). For consumer: Societal rehabilitation. medical setting (pp. 139–243). Tucson, AZ: Communication
Augmentative Communication News, 2(3), 1–3. Skill Builders.
Doyle, M., Kennedy, M., Jausalaitis, G., and Phillips, B. Yorkston, K. (1992). Augmentative communication in the med-
(2000). AAC and traumatic brain injury: Influence of cog- ical setting. San Antonio, TX: Psychological Corporation.
nition on system design and use. In K. Yorkston, D. Beu- Yorkston, K., Miller, R., and Strand, E. (1995). Management
kelman, and J. Reichle (Eds.), Augmentative and alternative of speech and swallowing in degenerative diseases. Tucson,
communication for adults with acquired neurologic disorders AZ: Communication Skill Builders.
(pp. 271–304). Baltimore: Paul H. Brookes.
Hustad, K., and Beukelman, D. (2000). Integrating AAC
strategies with natural speech in adults. In K. Yorkston, D.
Beukelman, and J. Reichle (Eds.), Augmentative and alter- Augmentative and Alternative
native communication for adults with acquired neurologic Communication Approaches in
disorders (pp. 83–106). Baltimore: Paul H. Brookes. Children
Levin, H., Benton, A., and Grossman, R. (1982). Neuro-
behavioral consequences of closed head injury. New York:
Oxford University Press. The acquisition of communication skills is a dynamic,
Light, J. (1988). Interaction involving individuals using aug-
bidirectional process of interactions between speaker and
mentative and alternative communication systems: State of
the art and future directions. Augmentative and Alternative listener. Children who are unable to meet their daily
Communication, 5(3), 66–82. needs using their own speech require alternative sys-
Light, J., and Binger, C. (1998). Building communicative com- tems to support their communication interaction e¤orts
petence with individuals who use augmentative and alternative (Reichle, Beukelman, and Light, 2001). An augmenta-
communication. Baltimore: Paul H. Brookes. tive and alternative communication (AAC) system is an
Augmentative and Alternative Communication Approaches in Children 113
integrated group of components used by a child to en- that can be used without change in interaction with a
hance or develop competent communication. It includes variety of di¤erent listeners. Examples include ‘‘Hello’’;
any existing natural speech or vocalizations, gestures, ‘‘What are you doing?’’; ‘‘What’s that?’’; ‘‘I like that!’’;
formal sign language, and aided communication. ‘‘AAC and ‘‘Leave me alone!’’
allows individuals to use every mode possible to com- Extensive instructional resources are available to
municate’’ (Light and Drager, 1998, p. 1). school-age children. In the United States, the federal
The goal of AAC support is to provide children with government has mandated publicly funded education for
access to the power of communication, language, and children with disabilities, in the form of the Individuals
literacy. This power allows them to express their needs with Disabilities Education Act, and provides a legal
and wants, develop social closeness, exchange informa- basis for AAC interventions. Public policy changes have
tion, and participate in social, educational, and com- been adopted in numerous other countries to address
munity activities (Beukelman and Mirenda, 1998). In resources available to children with disabilities.
addition, it provides a foundation for language develop- AAC interventionists facilitate transitions from the
ment and facilitates literacy development (Light and preschool setting to the school setting by ensuring com-
Drager, 2001). Timeliness in implementing an AAC prehensive communication through systematic planning
system is paramount (Reichle, Beukelman, and Light, and establishing a foundation for communication. A
2001). The earlier that graphic and gestural mode framework for integrating children into general educa-
supports can be put into place, the greater will be the tion programs may be implemented by following the
child’s ability to advance in communication develop- participation model (Beukelman and Mirenda, 1998),
ment. which includes four variables that can be manipulated to
Children experience significant cognitive, linguistic, achieve appropriate participation for any child. Children
and physical growth throughout their formative years, transitioning from preschool to elementary school, self-
from preschool through high school. AAC support for contained to departmentalized programs, or school to
children must address both their current communication post-school (vocational) will attain optimal participation
needs as well as predict future communication needs and when consideration is made for integration, social par-
abilities, so that they will be prepared to communicate ticipation, academic participation, and independence.
e¤ectively as they mature. Because participation in the An AAC system must be designed to support literacy
general classroom requires many kinds of extensive and other academic skill development as well as peer
communication, e¤ective AAC systems that are age ap- interactions. It must be appealing to children so that
propriate and context appropriate serve as critical tools they find the system attractive and will continue using it
for academic success (Sturm, 1998, p. 391). Early inter- (Light and Drager, 2001).
ventions allow children to develop the linguistic, opera- AAC systems are used by children with a variety of
tional, and social competencies necessary to support severe communication disorders. Cerebral palsy is a
their participation in academic settings. developmental neuromuscular disorder resulting from a
Many young children and those with severe multiple nonprogressive abnormality of the brain. Children with
disabilities cannot use traditional spelling and reading severe cerebral palsy primarily experience motor control
skills to access their AAC systems. Very young children, problems that impair their control of their speech mech-
who are preliterate, have not yet developed reading and anisms. The resulting motor speech disorder (dysarthria)
writing skills, while older children with severe cognitive may be so severe that AAC technology is required to
impairments may remain nonliterate. For individuals support communication. Large numbers of persons with
who are not literate, messages within their AAC sys- cerebral palsy successfully use AAC technology (Beu-
tems must be represented by one or more symbols or kelman, Yorkston, and Smith, 1985; Mirenda and
codes. With children, early communication develop- Mathy-Laikko, 1989). Typically, AAC support is pro-
ment focuses on vocabulary that is needed to communi- vided to these children by a team of interventionists. In
cate essential messages and to develop language skills. addition to the communication/AAC specialist, the pri-
Careful analysis of environmental and communication mary team often includes occupational and physical
needs is used to develop vocabulary for the child’s AAC therapists, technologists, teachers, and parents. A sec-
system. This vocabulary selection assessment includes ondary support team might include orthotists, rehabili-
examination of the ongoing process of vocabulary and tation engineers, and pediatric ophthalmologists.
message maintenance. Intellectual disability, or mental retardation, is char-
The vocabulary needs of children comprise contextual acterized by significantly subaverage intellectual func-
variations, including school talk, in which they speak tioning coexisting with limitations in adaptive skills
with relatively unfamiliar adults in order to acquire (communication, self-care, home living, social skills,
knowledge. Home talk is used with familiar persons to community use, self-direction, health and safety, aca-
meet needs and develop social closeness, as well as to demics, leisure, and work) that appear before the age of
assist parents in understanding their child. An example 18 (Luckasson et al., 1992). For children with commu-
of vocabulary needs is exhibited by preschool children, nication impairments, it is important to engage in AAC
who have been found to use generic small talk for nearly instruction and interactions in natural rather than segre-
half of their utterances, when in preschool and at home gated environments. Calculator and Bedrosian noted
(Ball et al., 1999). Generic small talk refers to messages that ‘‘communication is neither any more nor less than
114 Part II: Speech
a tool that facilitates individuals’ abilities to function tion has been documented extensively (Kent, 1993;
in the various activities of daily living’’ (1988, p. 104). Camarata, 1996). The use of AAC strategies to support
Children who are unable to speak because of cognitive the communicative interactions of children with such se-
limitations, with and without accompanying physical vere DAS that their speech is unintelligible is receiving
impairments, have demonstrated considerable success increased attention (Culp, 1989; Cumley and Swanson,
using AAC strategies involving high-technology (elec- 2000).
tronic devices) and low-technology (communication In summary, children with severe communication
boards and books) options (Light, Collier, and Parnes, disorders benefit from using AAC systems, from a vari-
1985a, 1985b, 1985c). ety of perspectives. Children with an assortment of clin-
Autism and pervasive developmental disorders are ical disorders are able to take advantage of increased
described with three main diagnostic features: (1) im- communication through the use of AAC. The provi-
paired social interaction, (2) impaired communication, sion of AAC intervention is influenced by the child’s
and (3) restricted, repetitive, and stereotypical patterns communication abilities and access to memberships.
of behaviors, interests, and activities (American Psy- Membership involves integration, social participation,
chiatric Association, 1994). These disorders occur as academic participation, and ultimately independence. A
a spectrum of impairments of di¤erent causes (Wing, web site (http://aac.unl.edu) provides current informa-
1996). Children with a pervasive developmental disorder tion about AAC resources for children and adults.
may have cognitive, social/communicative, language,
and processing impairments. Early intervention with an —Laura J. Ball
emphasis on speech, language, and communication is References
extremely important (Dawson and Osterling, 1997). A
range of intervention approaches has been suggested, American Psychiatric Association. (1994). Diagnostic and sta-
and as a result, AAC interventionists may need to work tistical manual for mental disorders (4th ed.). Washington,
with professionals whose views di¤er from their own, DC: Author.
thus necessitating considerable collaboration (Simeons- Ball, L., Marvin, C., Beukelman, D., Lasker, J., and Rupp,
D. (1999). Generic small talk use by preschool children.
son, Olley, and Rosenthal, 1987; Dawson and Osterling,
Augmentative and Alternative Communication, 15, 145–155.
1997; Freeman, 1997). Bernthal, J., and Bankson, N. (1993). Articulation and phono-
Developmental apraxia of speech (DAS) results in logical disorders (3rd ed.). Englewood Cli¤s, NJ: Prentice
language delays, communication problems that influence Hall.
academic performance, communication problems that Beukelman, D., and Mirenda, P. (1998). Augmentative and al-
limit e¤ective social interaction, and significant speech ternative communication: Management of severe communi-
production disorder. Children with suspected DAS have cation disorders in children and adults (2nd ed.). Baltimore:
di‰culty performing purposeful voluntary movements Paul H. Brookes.
for speech (Caruso and Strand, 1999). Their phonologi- Beukelman, D., Yorkston, K., and Smith, K. (1985). Third-
cal systems are impaired because of their di‰culties in party payer response to requests for purchase of com-
munication augmentation systems: A study of Washington
managing the intense motor demands of connected state. Augmentative and Alternative Communication, 1, 5–
speech (Strand and McCauley, 1999). Children with 9.
DAS have a guarded prognosis for the acquisition of Calculator, S., and Bedrosian, J. (1988). Communication as-
intelligible speech (Bernthal and Bankson, 1993). DAS- sessment and intervention for adults with mental retardation.
related speech disorders may result in prolonged periods San Diego, CA: College-Hill Press.
of unintelligibility, particularly during the early elemen- Camarata, S. (1996). On the importance of integrating
tary grades. naturalistic language, social intervention, and speech-
There is ongoing debate over the best way to manage intelligibility training. In L. Koegel and R. Koegel (Eds.),
suspected DAS. Some children with DAS have been Positive behavioral support: Including people with di‰cult
treated with phonologically based interventions and behavior in the community (pp. 333–351). Baltimore: Paul
H. Brookes.
others with motor learning tasks. Some interventionists Caruso, A., and Strand, E. (1999). Motor speech disorders in
support very intense schedules of interventions. These children: Definitions, background, and a theoretical frame-
arguments have changed little in the last 20 years. How- work. In A. Caruso and E. Strand (Eds.), Clinical manage-
ever, the need to provide these children with some means ment of motor speech disorders in children (pp. 1–28). New
to communicate so that they can successfully participate York: Thieme.
socially and in educational activities is becoming in- Culp, D. (1989). Developmental apraxia of speech and aug-
creasingly accepted. Cumley (1997) studied children with mentative communication: A case example. Augmentative
DAS who were provided with AAC technology. He and Alternative Communication, 5, 27–34.
reported that the group of children with lower speech Cumley, G. (1997). Introduction of augmentative and alternative
intelligibility scores used their AAC technology more modality: E¤ects on the quality and quantity of communica-
tion interactions of children with severe phonological dis-
frequently than children with higher intelligibility. When orders. Unpublished doctoral dissertation, University of
children with DAS with low intelligibility scores used Nebraska, Lincoln.
AAC technology, they did not reduce their speech ef- Cumley, G., and Swanson, S. (2000). Augmentative and
forts, but rather used the technology to resolve commu- alternative communication options for children with devel-
nication breakdowns. The negative e¤ect of reduced opmental apraxia of speech: Three case studies. Augmenta-
speech intelligibility on social and educational participa- tive and Alternative Communication, 15, 110–125.
Autism 115
Dawson, G., and Osterling, J. (1997). Early intervention Bristow, D., and Fristoe, M. (1988). E¤ects of test adaptations
in autism. Journal of Applied Behavioral Analysis, 24, on test performance. Augmentative and Alternative Com-
369–378. munication, 4, 171.
Freeman, B. (1997). Guidelines for evaluating intervention Culp, D. E. (1996). ChalkTalk: Augmentative communication in
programs for children with autism. Journal of Autism and the classroom. Anchorage: Assistive Technology Library of
Developmental Disorders, 27(6), 641–651. Alaska.
Kent, R. (1993). Speech intelligibility and communicative Cumley, G., and Swanson, S. (1999). Augmentative and
competence in children. In A. Kaiser and D. Gray (Eds.), alternative communication options for children with devel-
Enhancing children’s communication: Research foundations opmental apraxia of speech: Three Case Studies. Augmen-
for intervention. Baltimore: Paul H. Brookes. tative and Alternative Communication, 15(2), 110–125.
Light, J., Collier, B., and Parnes, P. (1985a). Communication Dowden, P. (1997). Augmentative and alternative communi-
interaction between young nonspeaking physically disabled cation decision making for children with severely unintelli-
children and their primary caregivers: Part I. Discourse gible speech. Augmentative and Alternative Communication,
patterns. Augmentative and Alternative Communication, 13, 48–58.
1(2), 74–83. Elder, P. G. (1996). Engineering training environments for
Light, J., Collier, B., and Parnes, P. (1985b). Communication interactive augmentative communication: Strategies for
interaction between young nonspeaking physically disabled adolescents and adults who are moderately/severely develop-
children and their primary caregivers: Part II. Communica- mentally delayed. Solana Beach, CA: Mayer-Johnson.
tive function. Augmentative and Alternative Communication, Goossens, C., Crain, S., and Elder, P. (1992). Engineering the
1(3), 98–107. preschool environment for interactive, symbolic communica-
Light, J., Collier, B., and Parnes, P. (1985c). Communication tion. Birmingham, AL: Southeast Augmentative Communi-
interaction between young nonspeaking physically disabled cation Conference Publications.
children and their primary caregivers: Part III. Modes of King-Debaun, P. (1993). Storytime just for fun: Stories, sym-
communication. Augmentative and Alternative Communica- bols, and emergent literacy activities for young, special needs
tion, 1(4), 125–133. children. Park City, UT: Creative Communicating.
Light, J., and Drager, K. (1998). Building communicative com- Light, J. B. (1998). Building communicative competence with
petence with individuals who use augmentative and alternative individuals who use augmentative and alternative communi-
communication. Baltimore: Paul H. Brookes. cation. Baltimore: Paul H. Brookes.
Light, J., and Drager, K. (2001, Aug. 2). Improving the design Mirenda, P., Iacono, T., and Williams, R. (1990). Communi-
of AAC technologies for young children. Paper presented at a cation options for persons with severe and profound dis-
meeting of the United States Society of Augmentative and abilities: State of the art and future directions. Journal of the
Alternative Communication, St. Paul, MN. Association for Persons with Severe Handicaps, 15, 3–21.
Luckasson, R., Coulter, D., Polloway, E., Reiss, S., Schalock, Reichle, J. (1993). Communication alternatives to challenging
R., Snell, M., et al. (1992). Mental retardation: Definition, behavior: Integrating functional assessment and intervention
classification, and systems of support (9th ed.). Washington, strategies (vol. 3). Baltimore: Paul H. Brookes.
DC: American Association on Mental Retardation. Reichle, J., Beukelman, D., and Light, J. (2002). Exemplary
Mirenda, P., and Mathy-Laikko, P. (1989). Augmentative and practices for beginning communicators: Implications for
alternative communication applications for persons with AAC (vol. 2). Baltimore: Paul H. Brookes.
severe congenital communication disorders: An introduction.
Augmentative and Alternative Communication, 5(1), 3–14.
Reichle, J., Beukelman, D., and Light, J. (2001). Exemplary Autism
practices for beginning communicators: Implications for
AAC (vol. 2). Baltimore: Paul H. Brookes.
Simeonsson, R., Olley, J., and Rosenthal, S. (1987). Early The term autism was first used in 1943 by Leo Kanner
intervention for children with autism. In M. Guralnick and to describe a syndrome of ‘‘disturbances in a¤ective
F. Bennett (Eds.), The e¤ectiveness of early intervention for contact,’’ which he observed in 11 boys who lacked the
at-risk and handicapped children (pp. 275–293). New York: dysmorphology often seen in mental retardation, but
Academic Press. who were missing the social motivation toward commu-
Strand, E., and McCauley, R. (1999). Assessment procedures
nication and interaction that is typically present even in
for treatment planning in children with phonologic and
motor speech disorders. In A. Caruso and E. Strand (Eds.), children with severe intellectual deficits. Despite their
Clinical management of motor speech disorders in children obvious impairments in social communication, the chil-
(pp. 73–108). New York: Thieme. dren Kanner observed did surprisingly well on some
Sturm, J. (1998). Educational inclusion of AAC users. In parts of IQ tests, leading Kanner to believe they did not
D. Beukelman and P. Mirenda (Eds.), Augmentative and have mental retardation.
alternative communication: Management of severe com- Kanner’s observation about intelligence has been
munication disorders in children and adults (2nd ed., pp. modified by subsequent research. When developmentally
391–424). Baltimore: Paul H. Brookes. appropriate, individually administered IQ testing is
Wing, L. (1996). The autistic spectrum: A guide for parents and administered, approximately 80% of people with autism
professionals. London: Constable.
score in the mentally retarded range, and scores remain
stable over time (Rutter et al., 1994). However, individ-
uals with autism do show unusual scatter in their abili-
Further Readings ties, with nonverbal, visually based performance often
Bedrosian, J. (1997). Language acquisition in young AAC sys- significantly exceeding verbal skills; unlike the perfor-
tem users: Issues and directions for future research. Aug- mance seen in children with other kinds of retardation,
mentative and Alternative Communication, 13, 179–185. whose scores are comparable across all kinds of tasks.
116 Part II: Speech
Recent research on the genetics of autism suggests Mutism. Approximately half of people with autism
that there are heritable factors that may convey suscep- never develop speech. Nonverbal communication, too,
tibility (Rutter et al., 1997). This vulnerability may be is greatly restricted (Paul, 1987). The range of com-
expressed in a range of social, communicative, and cog- municative intentions expressed is limited to requesting
nitive di‰culties expressed in varying degrees in parents, and protesting. Showing o¤, labeling, acknowledging,
siblings, and other relatives of individuals with autism. and establishing joint attention, seen in normal pre-
Although genetic factors appear to contribute to some verbal children, are absent in this population. Wants
degree to the appearance of autism, the condition can and needs are expressed preverbally, but forms for ex-
also be associated with other medical conditions. Dykens pression are aberrant. Some examples are pulling a
and Volkmar (1997) reported the following: person toward a desired object without making eye
contact, instead of pointing, and the use of mala-
Approximately 25% of individuals with autism develop daptive and self-injurious behaviors to express desires
seizures. (Donnellan et al., 1984). Pointing, showing, and turn-
Tuberous sclerosis (a disease characterized by abnor- taking are significantly reduced.
mal tissue growth) is associated with autism with For people with autism who do develop speech, both
higher than expected prevalence. verbal and nonverbal forms of communication are
The co-occurrence of autism and fragile X syndrome impaired. Forty percent of people with autism exhibit
(the most common heritable form of mental retarda- echolalia, an imitation of speech they have heard—
tion) is also higher than would be expected by chance. either immediate echolalia, a direct parroting of speech
directed to them, or delayed echolalia, in which they
Autism is considered one of a class of disabilities repeat snatches of language they have heard earlier,
referred to as pervasive developmental disorders, ac- from other people or on TV, radio, and so on. Both
cording to the Diagnostic and Statistical Manual of the kinds of echolalia are used to serve communicative
American Psychiatric Association (4th ed., 1994). The functions, such as responding to questions they do not
diagnostic criteria for autism are more explicitly stated understand (Prizant and Duchan, 1981). Echolalia
in DSM-IV than the criteria for other pervasive devel- decreases, as in normal development, with increases in
opmental disorders. The criteria for autism are the result language comprehension.
of a large field study conducted by Volkmar et al. (1994). A significant delay in comprehension is one of the
The field trial showed that the criteria specified in strongest distinctions between people with autism and
DSM-IV exhibit reliability and temporal stability. Simi- those with other developmental disabilities (Rutter,
lar research on diagnostic criteria for other pervasive Maywood, and Howlin, 1992). Formal aspects of
developmental disorders is not yet available. language production are on par with developmental
The primary diagnostic criteria for autism include the level. Children with autism are similar to mental age–
following: matched children in the acquisition of rule-governed
Early onset. Many parents first become concerned at syntax, but language development lags behind non-
the end of the first year of life, when a child does not verbal mental age (Lord and Paul, 1997). Articulation
start talking. At this period of development, children is on par with mental age in children with autism who
with autism also show reduced interest in other people; speak; however, high-functioning adults with autism
less use of communicative gestures such as pointing, show higher than expected rates of speech distortions
showing, and waving; and noncommunicative sound (Shriberg et al., 2001).
making, perhaps including echoing that is far in advance Word use is a major area of deficit in those who
of what can be produced in spontaneous or meaningful speak (Tager-Flusberg, 1995). Words are assigned to
contexts. There may also be unusual preoccupations the same categories that others use (Minshew and
with objects (e.g., an intense interest in vacuum cleaners) Goldsein, 1993), and scores on vocabulary tests are
or actions (such as twanging rubber bands) that are not often a strength. However, words may be used with
like the preoccupations of other children at this age. idiosyncratic meanings, and di‰culty is seen with
Impairment in social interaction. This is the hallmark deictic terms (i.e., you/I, here/there), whose meaning
of the autistic syndrome. Children with autism do not changes, depending on the point of view of the
use facial expressions, eye contact, body posture, or ges- speaker). This was first thought to reflect a lack of self,
tures to engage in social interaction as other children do. as evidenced by di‰culty with saying I. More recent
They are less interested in sharing attention to objects research suggests that the flexibility required to shift
and to other people, and they rarely attempt to direct referents and di‰culty assessing others’ state of
others’ attention to objects or events they want to point knowledge are more likely to account for this obser-
out. They show only fleeting interest in peers, and often vation (Lee, Hobson, and Chiat, 1994).
appear content to be left on their own to pursue their Pragmatic, interpersonal uses of language present
solitary preferred activities. the greatest challenges to speakers with autism. The
Impairment in communication. Language and com- rate of initiation of communication is low (Stone and
municative di‰culties are also core symptoms in Caro-Martinez, 1990), and speech is often idiosyn-
autism. Communicative di¤erences in autism include the cratic and contextually inappropriate (Lord and Paul,
following: 1997). Few references are made to mental states, and
Autism 117
people with autism have di‰culty inferring the mental trum, prevalence estimates rose to 1 per 1000 (Bryson,
states of others (Tager-Flusberg, 1995). Deficits are 1997).
seen in providing relevant responses or adding new in- Currently, there is a great deal of debate about inci-
formation to established topics; primitive strategies dence and prevalence, particularly about whether inci-
such as imitation are used to continue conversations dence is rising significantly. Although clinicians see more
(Tager-Flusberg and Anderson, 1991). children today who receive a label of autism than they
For individuals at the highest levels of functioning, did 10 years ago, this is likely to be due to a broadening
conversation is often restricted to obessive interests. of the definition of the disorder to include children who
There is little awareness of listeners’ lack of interest in show some subset of symptoms without the full-blown
extended talk about these topics. Di‰culty is seen in syndrome. Using this broad definition, current preva-
adapting conversation to take into account all partic- lence estimates range from 1 in 500 to as low as 1 in 300
ipants’ purposes. Very talkative people with autism are (Fombonne, 1999). Although there is some debate about
impaired in their ability to use language in functional, the precise ratio, autism is more prevalent in males than
communicative ways (Lord and Paul, 1997), unlike in females (Bryson, 1997).
other kinds of children with language impairments, In the vast majority of cases, children with autism
whose language use improves with increased amount grow up to be adults with autism. Only 1%–2% of cases
of speech. have a fully normal outcome (Paul, 1987). The classic
Paralinguistic features such as voice quality, into- image of the autistic child—mute or echolalic, with
nation, and stress are frequently impaired in speakers stereotypic behaviors and a great need to preserve
with autism. Monotonic intonation is one of the most sameness—is most characteristic of the preschool pe-
frequently recognized aspects of speech in autism. It is riod. As children with autism grow older, they gener-
a major contributor to listeners’ perception of oddness ally progress toward more, though still aberrant, social
(Mesibov, 1992). The use of pragmatic stress in spon- involvement. In adolescence, 10%–35% of children with
taneous speech and speech fluency are also impaired autism show some degree of regression (Gillberg and
(Shriberg et al., 2001). Schaumann, 1981). Still, with continued intervention,
growth in both language and cognitive skills can be seen
Stereotypic patterns of behavior. Abnormal preoc- (Howlin and Goode, 1998).
cupations with objects or parts of objects are character- Approximately 75% of adults with autism require
istic of autism, as is a need for routines and rituals high degrees of support in living, with only about 20%
always to be carried out in precisely the same way. gainfully employed (Howlin and Goode, 1998). Out-
Children with autism become exceedingly agitated over come in adulthood is related to IQ, with good outcomes
small changes in routine. Stereotyped motor behaviors, almost always associated with IQs above 60 (Rutter,
such as hand flapping, are also typical but are related to Greenfield, and Lockyer, 1967). The development of
developmental level and are likely to emerge in the pre- functional speech by age 5 is also a strong predictor of
school period. good outcome (DeMyer et al., 1973).
Delays in imaginative play. Children with autism are Major changes have taken place in the treatments
more impaired in symbolic play behaviors than in other used to address autistic behaviors. Although a variety of
aspects of cognition, although strengths are seen in con- pharmacological agents have been tried, and some are
structive play, such as stacking and nesting (Schuler, e¤ective at treating certain symptoms (see McDougle,
Prizant, and Wetherby, 1997). 1997, for a review), the primary forms of treatment for
There is no medical or biological profile that can be autism are behavioral and educational. Early interven-
used to diagnose autism, nor is there one diagnostic test tion, when provided with a high degree of intensity
that definitely identifies this syndrome. Current assess- (at least 20 hours per week), has proved particularly
ment methods make use primarily of multidimensional e¤ective (Rogers, 1996). There is ongoing debate about
scales, either interview or observational, that provide the best methods of treatment, particularly for lower
separate documentation of aberrant behaviors in each of functioning children. There are proponents of operant
the three areas that are known to be characteristic of applied behavior treatments (Lovaas, 1987), of natural-
the syndrome: social reciprocity, communication, and istic child-centered approaches (Greenspan and Wieder,
restricted, repetitive behaviors. The most widely used for 1997), and of approaches that are some hybrid of the
research purposes are the Autism Diagnostic Interview two (Prizant and Wetherby, 1998). Recent innovations
(Lord, Rutter, and Le Conteur, 1994) and the Autism focus on the use of alternative communication systems
Diagnostic Observation Scale (Lord et al., 2000). (e.g., Bondi and Frost, 1998) and on the use of environ-
Until recently, autism was thought to be a rare disor- mental compensatory supports, such as visual calendars,
der, with prevalence estimates of 4–5 per 10,000 (Lotter, to facilitate communication and learning (Quill, 1998).
1966). However, these prevalence figures were based on Although all of these approaches have been shown to be
identifying the disorder in children who, like the classic associated with growth in young children with autism,
patients described by Kanner, had IQs within normal no definitive study has yet compared approaches or
range. As it became recognized that the social and com- measured long-term change.
municative deficits characteristic of autism could be For higher functioning and older individuals with
found in children along the full range of the IQ spec- autism, most interventions are derived from more
118 Part II: Speech
general strategies used in children with language impair- Lord, C., Rutter, M., and Le Conteur, A. (1994). Autism Di-
ments. These strategies focus on the development of agnostic Interview—Revised: A revised version of a diag-
conversational skills, the use of scripts to support com- nostic interview for caregivers of individuals with possible
munication, strategies for communicative repair, and pervasive developmental disorders. Journal of Autism and
Developmental Disorders, 24, 659–685.
the use of reading to support social interaction (Prizant
Lotter, V. (1966). Epidemiology of autistic conditions in young
et al., 1997). ‘‘Social stories,’’ in which anecdotal narra- children. Social Psychiatry, 1, 124–137.
tives are encouraged to support social understanding and Lovaas, O. (1987). Behavioral treatment and normal educa-
participation, is a new method that is often used with tional and intellectual functioning in young autistic chil-
higher functioning individuals (Gray, 1995). dren. Journal of Consulting and Clinical Psychology, 55,
3–9.
—Rhea Paul McDougle, C. (1997). Psychopharmocology. In D. Cohen and
F. Volkmar (Eds.), Handbook of autism and pervasive
References developmental disorders (pp. 707–729). New York: Wiley.
Mesibov, G. (1992). Treatment issues with high functioning
American Psychiatric Association. (1994). Diagnostic and sta- adolescents and adults with autism. In E. Schopler and
tistical manual of the American Psychiatric Association G. Mesibov (Eds.), High functioning individuals with autism
(4th ed.). Washington, DC: Author. (pp. 143–156). New York: Plenum Press.
Bondi, A., and Frost, L. (1998). The picture exchange com- Minshew, N., and Goldsein, G. (1993). Is autism an amnesic
munication system. Seminars in Speech and Language, 19, disorder? Evidence from the California Verbal Learning
373–389. Test. Neuropsychology, 7, 261–170.
Bryson, S. (1997). Epidemiology of autism: Overview and Paul, R. (1987). Communication. In D. J. Cohen and A. M.
issues outstanding. In D. Cohen and F. Volkmar (Eds.), Donnellan (Eds.), Handbook of autism and pervasive devel-
Handbook of autism and pervasive developmental disorders opmental disorders (pp. 61–84). New York: Wiley.
(pp. 41–46). New York: Wiley. Prizant, B., and Duchan, J. (1981). The functions of immediate
DeMyer, M., Barton, S., DeMyer, W., Norton, J., Allen, J., and echolalia in autistic children. Journal of Speech and Hearing
Steele, R. (1973). Prognosis in autism: A follow-up study. Disorders, 46, 241–249.
Journal of Autism and Childhood Schizophrenia, 3, 199–246. Prizant, B., and Wetherby, A. (1998). Understanding the con-
Donnellan, A., Mirenda, P., Mesaros, P., and Fassbender, L. tinuum of discrete-trial traditional behavioral to social-
(1984). Analyzing the communicative function of aberrant pragmatic developmental approaches in communication
behavior. Journal of the Association for Persons with Severe enhancement for young children with autism/PDD. Semi-
Handicaps, 9, 202–212. nars in Speech and Language, 19, 329–354.
Dykens, E., and Volkmar, F. (1997). Medical conditions asso- Prizant, B., Schuler, A., Wetherby, A., and Rydell, P. (1997).
ciated with autism. In D. Cohen and F. Volkmar (Eds.), Enhancing language and communication development:
Handbook of autism and pervasive developmental disorders Language approaches. In D. Cohen and F. Volkmar (Eds.),
(pp. 388–410). New York: Wiley. Handbook of autism and pervasive developmental disorders
Fombonne, E. (1999). The epidemiology of autism: A review. (pp. 572–605). New York: Wiley.
Psychological Medicine, 29, 769–786. Quill, K. (1998). Environmental supports to enhance social-
Gillberg, C., and Schaumann, H. (1981). Infantile autism and communication. Seminars in Speech and Language, 19,
puberty. Journal of Autism and Developmental Disorders, 407–423.
11, 365–371. Rogers, S. (1996). Brief report: Early intervention in autism.
Gray, C. (1995). The original social story book. Arlington, TX: Journal of Autism and Developmental Disorders, 26, 243–
Future Horizons. 246.
Greenspan, S., and Wieder, S. (1997). An integrated devel- Rutter, M., Bailey, A., Bolton, P., and Le Conteur, A. (1994).
opmental approach to interventions for young children with Autism and known medical conditions: Myth and sub-
severe di‰culties relating and communicating. Zero to stance. Journal of Child Psychology and Psychiatry and
Three, 18, 5–17. Allied Disciplines, 35, 311–322.
Howlin, P., and Goode, S. (1998). Outcome in adult life for Rutter, M., Bailey, A., Simono¤, E., Pickles, A. (1997). Ge-
people with autism and Asperger syndrome. In F. Volkmar netic influences of autism. In D. Cohen and F. Volkmar
(Ed.), Autism and pervasive developmental disorders (pp. (Eds.), Handbook of autism and pervasive developmental
209–241). Cambridge, UK: Cambridge University Press. disorders (pp. 370–387). New York: Wiley.
Kanner, L. (1943). Autistic disturbances of a¤ective contact. Rutter, M., Greenfield, D., and Lockyer, L. (1967). A five to
Nervous Child, 2, 416–426. fifteen year follow-up study of infantile psychosis: II. Social
Lee, A., Hobson, R., and Chiat, S. (1994). I, you, me, and and behavioral outcome. British Journal of Psychiatry, 113,
autism: An experimental study. Journal of Autism and 1183–1199.
Developmental Disorders, 24, 155–176. Rutter, M., Maywood, L., and Howlin, P. (1992). Language
Lord, C., and Paul, R. (1997). Language and communication delay and social development. In P. Fletcher and D. Hall
in autism. In D. Cohen and F. Volkmar (Eds.), Hand- (Eds.), Specific speech and language disorders in children
book of autism and pervasive developmental disorders (pp. (pp. 63–78). London: Whurr.
195–225). New York: Wiley. Schuler, A., Prizant, B., and Wetherby, A. (1997). Enhancing
Lord, C., Risi, S., Lambrecht, L., Cook, E. H., Jr., Leventhal, language and communication development: Prelinguistic
B. L., DiLavore, P. C., Pickles, A., and Rutter, M. (2000). approaches. In D. Cohen and F. Volkmar (Eds.), Hand-
The Autism Diagnostic Observation Schedule—Generic: A book of autism and pervasive developmental disorders (pp.
standard measure of social and communication deficits 539–571). New York: Wiley.
associated with the spectrum of autism. Journal of Autism Shriberg, L., Paul, R., McSweeney, J., Klin, A., Cohen, D.,
and Developmental Disorders, 30(3), 205–223. and Volkmar, F. (2001). Speech and prosody in high func-
Bilingualism, Speech Issues in 119
tioning adolescents and adults with autism and Asperger Bilingualism, Speech Issues in
syndrome. Journal of Speech, Language, and Hearing Re-
search, 44, 1097–1115.
Stone, W., and Caro-Martinez, L. (1990). Naturalistic obser- In evaluating the properties of bilingual speech, an an-
vations of spontaneous communication in autistic chil-
terior question that must be answered is who qualifies as
dren. Journal of Autism and Developmental Disorders, 20,
437–453. bilingual. Scholars have struggled with this question for
Tager-Flusberg, H. (1995). Dissociations in form and function decades. Bilingualism defies delimitation and is open to a
in the acquisition of language by autistic children. In H. variety of descriptions and interpretations. For example,
Tager-Flusberg (Ed.), Constraints on language acquisition: Bloomfield (1933) required native-like control of two
Studies of atypical children (pp. 175–194). Hillsdale, NJ: languages, while Weinreich (1968) and Mackey (1970)
Erlbaum. considered as bilingual an individual who alternately
Tager-Flusberg, H., and Anderson, M. (1991). The develop- used two languages. Beatens-Beardsmore (1982), ob-
ment of contingent discourse ability in autistic children. serving a wide range of variations in di¤erent contexts,
Journal of Child Psychology and Psychiatry, 32, 1123–1143. concluded that it is not possible to formulate a single
Volkmar, F. (1998). Autism and pervasive developmental dis-
neat definition, and stated that bilingualism as a concept
orders. Cambridge, UK: Cambridge University Press.
Volkmar, F., Klin, A., Siegel, B., Szatmari, P., Lord, C., et al. has ‘‘open-ended semantics.’’
(1994). Field trial for autistic disorder in DSM-IV. Ameri- It has long been recognized that bilingual indi-
can Journal of Psychiatry, 151, 1361–1367. viduals form a heterogeneous population in that their
abilities in their two languages are not uniform. Al-
Further Readings though some bilingual speakers may have attained a
native-like production in each language, the great ma-
Baron-Cohen, S. (1995). Mindblindness. Cambridge, MA: MIT jority are not balanced between the two languages. The
Press. result is interference from the dominant language.
Baron-Cohen, S., and Bolton, P. (1993). Autism: The facts. Whether a child becomes bilingual simultaneously (two
Oxford, UK: Oxford University Press.
Bauman, M., and Kempter, T. (1994). The neurobiology of
languages are acquired simultaneously) or successively
autism. Baltimore: Johns Hopkins University Press. (one language, generally the home language, is acquired
Catalano, R. (1998). When autism strikes. New York: Plenum earlier, and the other language is acquired, for example,
Press. when the child goes to school), it is impossible to rule
Cohen, D., and Volkmar, F. (Eds.). (1997). Handbook of out interference. In the former case, this may happen
autism and pervasive developmental disorders. New York: because of di¤erent degrees of exposure to the two lan-
Wiley. guages; in the latter, the earlier acquired language may
Dawson, G. (1989). Autism: Nature, diagnosis, and treatment. put its imprint on the one acquired later. More children
New York: Guilford Press. become bilingual successively, and the influence of one
Donnellan, A. (1985). Classic readings in autism. New York: language on the other is more evident. Yet even here
Teachers’ College Press.
Frith, U. (1989). Autism: Explaining the enigma. Oxford, UK:
there is no uniformity among speakers, and the range
Blackwell. of interference from the dominant language forms a
Glidden, L. (2000). International review of research in mental continuum.
retardation: Autism. San Diego, CA: Academic Press. The di¤erent patterns that a bilingual child reveals in
Happe, F. (1995). Autism: An introduction to psychological speech may not necessarily be the result of interference.
theory. Cambridge, MA: Harvard University Press. Bilingual children, like their monolingual counterparts,
Hogdon, L. (1999). Solving behavior problems in autism. Troy, may su¤er speech and/or language disorders. Thus,
MI: QuirkRoberts Publishing. when children who grow up with more than one lan-
Matson, J. (1994). Autism in children and adults. Pacific Grove, guage produce patterns that are erroneous with respect
CA: Brooks/Cole. to the speech of monolingual speakers, it is crucial to
Morgan, H. (1996). Adults with autism. Cambridge, UK:
Cambridge University Press.
determine whether these nonconforming patterns are due
Powers, M. (1989). Children with autism: A parents’ guide. to the influence of the child’s other language or are
Bethesda, MD: Woodbine House. indications of a speech-language disorder.
Quill, K. (1995). Teaching children with autism. Albany, NY: To be able to make accurate diagnoses, speech-
Delmar. language pathologists must use information from inter-
Rimland, B. (1962). Infantile autism. New York: Appleton- ference patterns, normal and disordered phonological
Century-Crofts. development in general as well as in the two languages,
Rutter, M., and Schopler, M. (1978). Autism. New York: Ple- and the specific dialect features. In assessing the phono-
num Press. logical development of a bilingual child, both languages
Schopler, E., and Mesibov, G. (1985–2000). Current issues in should be the focus of attention, and each should be
autism (series). New York: Plenum Press.
Sigman, M., and Capps, L. (1997). Children with autism.
examined in detail, even if the child seems to be a domi-
Cambridge, MA: Harvard University Press. nant speaker of one of the languages. To this end, all
Sperry, V. (2000). Fragile success. Baltimore: Paul H. Brookes. phonemes of the languages should be assessed in dif-
Wetherby, A., and Prizant, B. (2000). Autistic spectrum dis- ferent word positions, and phonotactic patterns should
orders. Baltimore: Paul H. Brookes. be evaluated. Assessment tools that are designed for
Wing, L. (1972). Autistic children. New York: Brunner/Mazel. English, no matter how perfect they are, will not be
120 Part II: Speech
appropriate for the other language and may be the cause whose primary language is Japanese, Italian, Spanish, or
of over- or underdiagnosis. Portuguese.
Certain cases lend themselves to obvious identifica- Besides the interference patterns and common devel-
tion of interference. For example, if we encounter in the opmental processes, speech-language pathologists must
English language productions of a Portuguese-English be watchful for some unusual (idiosyncratic) processes
bilingual child forms like [tSiz] for ‘‘tease’’ and [tSIp] for that are observed in children (Grunwell, 1987; Dodd,
‘‘tip,’’ whereby target /t/ turns into [tS ], we can, with 1993). Processes such as unusual cluster reduction, as in
confidence, say that these renditions were due to Portu- [ren] for train (instead of the expected [ten]), fricative
guese interference, as such substitutions are not com- gliding, as in [wIg] for fig, frication of stops, as in [v0n]
monly observed in developmental phonologies, and the for ban, and backing, as in [p0k] for pat, may occur in
change of /t/ to [tS ] before /i/ is a rule of Portuguese children with phonological disorders.
phonology. Studies that have examined the phonological patterns
The decision is not always so straightforward, how- in normally developing bilingual children (Gildersleeve,
ever. For example, substitutions may reflect certain Davis, and Stubbe, 1996) and bilingual children with a
developmental simplification processes that are univer- suspected speech disorder (Dodd, Holm, and Wei, 1997)
sally phonetically motivated and shared by many lan- indicate that children in both groups exhibit patterns
guages. If a bilingual child’s speech reveals any such di¤erent from matched, monolingual peers. Compared
processes, and if the first language of the child does not with their monolingual peers, normally developing bi-
have the opportunities for such processes to surface, then lingual children and bilingual children with phonological
it would be very di‰cult to identify the dominant lan- disorders had a lower overall intelligibility rating, made
guage as the culprit and label the situation as one of in- more errors overall, distorted more sounds, and pro-
terference. For example, if a 6-year-old child bilingual in duced more uncommon error patterns. As for the di¤er-
Spanish and English reveals processes such as final ence between normally developing bilingual children and
obstruent devoicing (e.g., [b0k] for ‘‘bag,’’ [bEt] for bilingual children with phonological disorders, it appears
‘‘bed’’) and/or deletion of clusters that do not follow so- that children with phonological disorders manifest more
nority sequencing ([tap] for ‘‘stop,’’ [pIt] for ‘‘spit’’), we common simplification patterns, suppress such patterns
cannot claim that these changes are due to Spanish in- over time more slowly, and are likely to have uncommon
terference. Rather, these processes are among the com- processes.
monly occurring developmental processes that occur in As speech-language pathologists become more adept
the speech of children in many languages. However, be- at di¤erentiating common and uncommon phonological
cause these common simplification processes are usually patterns and interference patterns in bilingual children,
suppressed in normally developing children by age 6, this they will also need to consider not only the languages of
particular situation suggests a delay or disorder. In this the client, but also the specific dialects of those lan-
case, these processes may not have surfaced until age 6 guages. Just as there are several varieties of English
because none of these patterns are demanded by the spoken in di¤erent countries (e.g., British, American,
structure of Spanish. In other words, because Spanish Australian, South African, Canadian, Indian) and even
has no voiced obstruents in final position and no con- within one country (New England variety, Southern
sonant clusters that do not follow sonority sequencing, it variety, General American, and African American Ver-
is impossible to refer to the first language as the expla- nacular in the United States), other languages also show
nation. In such instances we must attribute these pat- dialectal variation. Because none of these varieties or
terns to universally motivated developmental processes dialects of a given language is or can be considered a
that have not been eliminated according to the expected disordered form of that language, the child’s dialectal
timetable. information is essential. Any assessment of the child’s
We may also encounter a third situation in which speech must be made with respect to the norm of the
the seemingly clear distinction between interference particular variety she or he is learning. Not accounting
and the developmental processes is blurred. This occurs for dialect features may either result in the misdiagnosis
when one or more of the developmental processes are of a phonological disorder or escalate the child’s severity
also the patterns followed by the first (dominant) lan- rating.
guage. An example is final obstruent devoicing in the Last but definitely not least is the desperate need for
English language productions of a child with German, information on phonological development in bilingual
Russian, Polish, or Turkish as the first language. Al- children and assessment procedures unique to these in-
though final obstruent devoicing is a natural process dividuals. Language skills in bilingual persons have al-
that even occurs in the early speech of monolingual most always been appraised in reference to monolingual
English-speaking children, it is also a feature of the lan- standards (Grosjean, 1992). Accordingly, a bilingual
guages listed. Thus, the result is a natural tendency that child is assessed with two procedures, one for each
receives extra impetus from the rule of the primary sys- language, that are designed to evaluate monolingual
tem. Other examples that could be included in the same speakers of these languages. This assumes that a bilin-
category would be consonant cluster reduction in chil- gual individual is two monolingual individuals in one
dren whose primary language is Japanese, Turkish, or person. However, because of the constant interaction of
Finnish, and single obstruent coda deletion in children the two languages, each phonological system of a bilin-
Developmental Apraxia of Speech 121
gual child may, and in most cases will, not necessarily be Goldstein, B., and Iglesias, A. (1996a). Phonological patterns
acquired in a way identical to that of a monolingual in normally developing Spanish speaking 3- and 4-year-olds
child (Watson, 1991). of Puerto Rican descent. Language, Speech, and Hearing
In order to characterize bilingual phonology accu- Services in the Schools, 27, 82–90.
Goldstein, B., and Iglesias, A. (1996b). Phonological patterns
rately, detailed information on both languages being
in Puerto Rican Spanish-speaking children with phonologi-
acquired by the children is indispensable. However, data cal disorders. Journal of Communication Disorders, 29, 367–
on the developmental patterns in two languages sepa- 387.
rately would not be adequate, as information on phono- Grunwell, P. (1985). Phonological assessment of child speech.
logical development in bilingual children is the real key London: Nfer-Nelson.
to understanding bilingual phonology. Because bilingual Hodson, B. W. (1986). Assessment of phonological processes in
speakers’ abilities in the two languages vary immensely Spanish. San Diego, CA: Los Amigos.
from one individual to another, developing assessment Hodson, B. W., and Paden, E. (1991). Targeting intelligible
tools for phonological development is a huge task, per- speech (2nd ed.). Austin, TX: Pro-Ed.
haps the biggest challenge for the field. Keyser, H. (Ed.). (1995). Bilingual speech-language pathology:
An Hispanic focus. San Diego, CA: Singular Publishing
See also bilingualism and language impairment.
Group.
—Mehmet Yavas Magnusson, E. (1983). The phonology of language disordered
children: Production, perception and awareness. Lund, Swe-
den: Gleerup.
References Mann, D., and Hodson, B. (1994). Spanish-speaking children’s
Beatens-Beardsmore, H. (1982). Bilingualism: Basic principles. phonologies: Assessment and remediation of disorders.
Clevedon, UK: Tieto. Seminars in Speech and Language, 15, 137–147.
Bloomfield, L. (1933). Language. New York: Wiley. Mason, M., Smith, M., and Hinshaw, M. (1976). Medida
Dodd, B. (1993). Speech disordered children. In G. Blanken, Espanola de Articulacion. San Ysidoro, CA: San Ysidoro
J. Dittman, H. Grimm, J. Marshall, and C. Wallesh (Eds.), School District.
Linguistics: Disorders and pathologies (pp. 825–834). Berlin: Nettelbladt, U. (1983). Developmental studies of dysphonology
De Gruyter. in children. Lund, Sweden: Gleerup.
Dodd, B., Holm, A., and Wei, L. (1997). Speech disorder in So, L. (1992). Cantonese Segmental Phonology Test. Hong
preschool children exposed to Cantonese and English. Kong University, Department of Speech and Hearing
Clinical Linguistics and Phonetics, 11, 229–243. Sciences.
Gildersleeve, C., Davis, B., and Stubbe, E. (1996). When mono- So, L., and Dodd, B. (1995). The acquisition of phonology by
lingual rules don’t apply: Speech development in a bilingual Cantonese speaking children. Journal of Child Language,
environment. Paper presented at the annual convention of 22, 473–495.
the American Speech-Language-Hearing Association, Seat- Stoel-Gammon, C., and Dunn, C. (1985). Normal and dis-
tle, WA. ordered phonology in children. Austin, TX: Pro-Ed.
Grosjean, F. (1992). Another view of bilingualism. In R. J. Toronto, A. (1977). Southwest Spanish Articulation Test. Aus-
Harris (Ed.), Cognitive processing in bilinguals (pp. 51–62). tin, TX: National Education Laboratory Publishers.
New York: Elsevier. Yavas, M. (1998). Phonology: Development and disorders. San
Grunwell, P. (1987). Clinical phonology (2nd ed.). London: Diego, CA: Singular Publishing Group.
Croom Helm. Yavas, M. (2002). VOT patterns in bilingual phonological de-
Mackey, W. (1970). Interference, integration and the syn- velopment. In F. Windsor, L. Kelly, and N. Hewlett (Eds.),
chronic fallacy. In J. Alatis (Ed.), Bilingualism and language Investigations in clinical linguistics (pp. 341–350). Mahwah,
contact. Washington, DC: Georgetown University Press. NJ: Erlbaum.
Watson, I. (1991). Phonological processing in two languages. Yavas, M., and Goldstein, B. (1998). Speech sound di¤erences
In E. Bialystok (Ed.), Language processing in bilingual chil- and disorders in first and second language acquisition:
dren. Cambridge, UK: Cambridge University Press. Theoretical issues and clinical applications. American Jour-
Weinreich, U. (1968). Languages in contact. The Hague: nal of Speech and Language Pathology, 7.
Mouton. Yavas, M., Hernandorena, C., and Lamprecht, R. (1991).
Avaliacao fonologica da Crianca (Phonological assessment
tool for Brazilian Portuguese). Porto Alegre, Brazil: Artes
Further Readings Medicas.
Bernthal, J. E., and Bankson, N. W. (Eds.). (1998). Articula-
tion and phonological disorders (4th ed.). Needham Heights,
MA: Allyn and Bacon.
Bortolini, U., and Leonard, L. (1991). The speech of phono-
logically disordered children acquiring Italian. Clinical Lin- Developmental Apraxia of Speech
guistics and Phonetics, 5, 1–12.
Cheng, L. R. (1987). Assessing Asian language performance: Developmental apraxia of speech (DAS) is a devel-
Guidelines for evaluating limited-English-proficient students.
opmental speech disorder frequently defined as di‰culty
Rockville, MD: Aspen.
Deuchar, M., and Quay, S. (2000). Bilingual acquisition. New in programming of sequential speech movements based
York: Oxford University Press. on presumed underlying neurological di¤erences. Theo-
Ferguson, C. A., Menn, L., and Stoel-Gammon, C. (Eds.). retical constructs motivating understanding of DAS
(1992). Phonological development: Models, research, impli- have been quite diverse. Motor-based or pre-motor
cations. Timonium, MD: York Press. planning speech output deficits (e.g., Hall, Jordon, and
122 Part II: Speech
Robin, 1993), phonologically based deficits in represen- appropriate decisions regarding clinical intervention,
tation (e.g., Velleman and Strand, 1993), or deficits in and valid theory building to understand the underlying
neural tissue with organizational consequences (e.g., nature of the disorder. Use of DAS as an ‘‘umbrella term
Crary, 1984; Sussman, 1988) have been posited. Reflect- for children with persisting and serious speech di‰culties
ing these varied views of causality, a variety of terms in the absence of obvious causation, regardless of the
have been employed: developmental apraxia of speech, precise nature of their unintelligibility’’ (Stackhouse,
developmental verbal dyspraxia, and developmental 1992, p. 30) is to be avoided. Such practice continues
articulatory dyspraxia. Clinically, DAS has most often to cloud the issue of precise definition of the pres-
been defined by exclusion from functional speech disor- ence and prevalence of the disorder in child clinical
der or delay using a complex of behavioral symptoms populations.
(e.g., Stackhouse, 1992; Shriberg, Aram, and Kwiat- Accordingly, a review of the range of behavioral cor-
kowski, 1997). relates presently in use is of crucial importance to careful
The characterization of DAS was originally derived definition and understanding of DAS. The relationship
from apraxia of speech in adults, a disorder category of behavioral correlates to di¤erential diagnosis from
based on acquired brain damage resulting in di‰culty in ‘‘functional’’ speech disorder or delay is of primary im-
programming speech movements (Broca, 1861). Morley, portance to discriminating DAS as a subcategory of
Court, and Miller (1954) first applied the term dyspraxia functional speech disorder. If no single defining charac-
to children based on a proposed similarity in behavioral teristic or complex of characteristics emerges to define
correlates with adult apraxic symptoms. A neurological DAS, the utility of the label is seriously questionable for
etiology was implied by the analogy but has not been either clinical or research purposes. In every instance,
conclusively delineated, even with increasingly sophisti- observed behaviors need to be evaluated against devel-
cated instrumental techniques for understanding brain- opmental behaviors appropriate to the client’s chrono-
behavior relations (see Bennett and Netsell, 1999; logical age. In the case of very young clients, the
LeNormand et al., 2000). Little coherence and consensus di¤erential diagnosis of DAS is complicated (Davis
is available in this literature at present. In addition, de- and Velleman, 2000). Some listed characteristics may
spite nearly 40 years of research, di¤erential diagnostic be normal aspects of earliest periods of speech and
correlates and range of severity levels characterizing language development (e.g., predominant use of simple
DAS remain imprecisely defined. Guyette and Deidrich syllable shapes or variability in production patterns
(1981) have suggested that DAS may not be a theoreti- at the onset of meaningful speech; see Vihman, 1997,
cally or clinically definable entity, as current empirical for a review of normal phonetic and phonological
evidence does not produce any behavioral symptom not development).
overlapping with other categories of developmental Before the clinical symptoms presently employed to
speech disorder or delay. In addition, no currently define DAS are outlined, specific issues with available
available theoretical constructs specifically disprove research will be reviewed briefly. It should be empha-
other possible theories for the origins of DAS (see Davis, sized that behavioral inclusion criteria are not con-
Jakielski, and Marquardt, 1998). In contrast to devel- sistently reported and di¤ering criteria are included
opmental disorder categories such as hearing impair- across studies. Criteria for inclusion in studies then be-
ment or cleft palate, lack of a link of underlying cause or come recognized symptoms of involvement, achieving a
theoretical base with behavioral correlates results in an circularity that is not helpful for producing valid char-
‘‘etiological’’ disorder label with no clearly established acterization of the disorder (Stackhouse, 1992). Subject
basis. Evidence for a neurological etiology for DAS is ages vary widely, from preschoolers (Bradford and
based on behavioral correlates that are ascribed to a Dodd, 1996) to adults (Ferry, Hall, and Hicks, 1975).
neurological basis, thus achieving a circular argument Some studies include control populations of functional
structure for neural origins (Marquardt, Sussman, and speech disorders for di¤erential diagnosis (Stackhouse,
Davis, 2000). 1992; Dodd, 1995); others do not (Horowitz, 1984).
Despite the lack of consensus on theoretical motiva- Associated language and praxis behaviors are included
tion, etiology, or empirical evidence precisely defining as di¤erential diagnostic correlates in some studies
behavioral correlates, there is some consensus among (Crary and Towne, 1984), while others explicitly exclude
practicing clinicians as well as researchers (e.g., Shriberg, these deficits (e.g., Hall, Jordon, and Robin, 1993). Se-
Aram, and Kwiatkowski, 1997) that DAS exists. It thus verity is not reported consistently. When it is reported,
represents an incompletely understood disorder that the basis for assigning severity judgments is inconsistent
poses important challenges both to practicing clinicians across studies. A consequence of this inconsistency is
and to the establishment of a consistent research base lack of consensus on severity level appropriate to the
for overall understanding. An ethical di¤erential diag- DAS label. In some reports, the defining characteristic is
nosis for clinical intervention and research investiga- severe and persistent disorder (e.g., Shriberg, Aram, and
tions should, accordingly, be based on awareness of the Kwiatkowski, 1997). In other reports (e.g., Thoonen
current state of empirically established data regarding et al., 1997), a continuum of severity is explored. In the
theories and behavioral correlates defining this disorder. latter conceptualization, DAS can manifest as mild,
Cautious application of the diagnostic label should be moderate, or severe speech disorder.
the norm, founded on a clear understanding of positive Despite the foregoing critique, the large available
benefits to the client in discerning long-term prognosis, literature on DAS suggests some consensus on behav-
Developmental Apraxia of Speech 123
ioral correlates that should be evaluated in establishing a behavioral correlates have been described and studied
di¤erential diagnosis. The range of expression of these does not lend to precision in understanding DAS. Re-
characteristics, although frequently cited, has not been search utilizing consistent subject selection criteria is
specified quantitatively. Accordingly, these behaviors needed to begin to link understanding of DAS to ethical
should not be considered definitive but suggestive of clinical practices in assessment and intervention and to
directions for future research as well as guidelines for the elucidate the underlying causes of this disorder.
practicing clinician based on emerging research. See also motor speech involvement in children.
Exclusionary criteria for a di¤erential diagnosis have
been suggested in the areas of peripheral motor and —Barbara L. Davis
sensory function, cognition, and receptive language. Ex- References
clusionary criteria frequently noted include (1) no peri-
pheral organic disorder (e.g., cleft palate), (2) no sensory Bennett, S., and Netsell, R. W. (1999). Possible roles of the
deficit (i.e., in vision or hearing), (3) no peripheral muscle insula in speech and language processing: Directions for re-
weakness or dysfunction (e.g., dysarthria, cerebral palsy), search. Journal of Medical Speech-Language Pathology, 7,
(4) normal IQ, and (5) normal receptive language. 255–272.
Bradford, A., and Dodd, B. (1996). Do all speech-disordered
Phonological and phonetic correlates have also been
children have motor deficits? Clinical Linguistics and Pho-
listed. Descriptive terminology varies from phonetic netics, 10, 77–101.
(e.g., Murdoch et al., 1995) to phonological (Forrest and Broca, P. (1861). Remarques sur le siege do la faculte du lan-
Morrisette, 1999; Velleman and Shriberg, 1999) accord- guage articule, suives d’une observation d’aphemie (peste de
ing to the theoretical perspective of the researcher, com- la parole). Bulletin de la Societe d’Anatomique, 2nd série, 6,
plicating understanding of the nature of the disorder and 330–337.
comparison across studies. In addition, behavioral cor- Crary, M. A. (1984). Phonological characteristics of devel-
relates have been established across studies with highly opmental verbal dyspraxia. Seminars in Speech and Lan-
varied subject pools and di¤ering exclusionary criteria. guage, 6(2), 71–83.
The range of expression of symptoms is not established Crary, M. A., and Towne, R. (1984). The asynergistic nature of
developmental verbal dyspraxia. Australian Journal of Hu-
(i.e., what types and severity of suprasegmental errors man Communication Disorders, 12, 27–37.
are necessary or su‰cient for the diagnosis?). Some Davis, B. L., Jakielski, K. J., and Marquardt, T. M. (1998).
characteristics are in common with functional disorders Devlopmental apraxia of speech: Determiners of di¤erential
and thus do not constitute a di¤erential diagnostic char- diagnosis. Clinical Linguistics and Phonetics, 12, 25–45.
acteristic (i.e., how limited does the consonant or vowel Davis, B. L., and Velleman, S. L. (2000). Di¤erential diagnosis
repertoire have to be to express DAS?). In addition, of developmental apraxia of speech in infants and toddlers.
not all symptoms are consistently reported as being Infant Toddler Intervention, 10(3), 177–192.
necessary to a diagnosis of DAS (e.g., not all clients Dodd, B. (1995). Procedures for classification of sub-groups of
show ‘‘groping postures of the articulators’’). Long-term speech disorder. In B. Dodd (Ed.), Di¤erential diagnosis and
persistence of clinical symptoms in spite of intensive treatment of children with speech disorders. London: Whurr.
Ferry, P. C., Hall, S. M., and Hicks, J. L. (1975). Dilapidated
therapy has also frequently been associated with DAS. speech: Developmental verbal dyspraxia. Developmental
Phonological/phonetic correlates reported include (1) Medicine and Child Neurology, 17, 432–455.
limited consonant and vowel phonetic inventory, (2) Forrest, K., and Morrisette, M. L. (1999). Feature analysis of
predominant use of simple syllable shapes, (3) frequent segmental errors in children with phonological disorders.
omission of errors, (4) a high incidence of vowel errors, Journal of Hearing, Language, and Speech Research, 42,
(5) altered suprasegmental characteristics (including rate, 187–194.
pitch, loudness, and nasality), (6) variability and lack of Guyette, T. W., and Deidrich, W. M. (1981). A critical review
consistent patterning in speech output, (7) increased of developmental apraxia of speech. In N. J. Lass (Ed.),
errors on longer sequences, (8) groping postures, and (9) Speech and language advances in basic practice (No. 11).
lack of willingness or ability to imitate a model. London: Academic Press.
Hall, P. K., Jordan, L. S., and Robin, D. A. (1993). Devel-
Co-occurring characteristics of DAS in several related opmental apraxia of speech, Austin, TX: Pro-Ed.
areas have also been mentioned frequently. However, Horowitz, J. (1984). Neurological findings in developmental
co-occurrence may be optional for a di¤erential diagno- verbal apraxia. Seminars in Speech and Language, 6(2),
sis, because these characteristics have not been con- 11–18.
sistently tracked across available studies. Co-occurring LeNormand, M.-T., Vaivre-Douret, L., Payan, C., and Cohen,
characteristics frequently cited include (1) delays in gross H. (2000). Neuromotor development and language process-
and fine motor skills, (2) poor volitional oral nonverbal ing in developmental dyspraxia: A follow-up case study.
skills, (3) inconsistent diadokokinetic rates, (4) delay in Journal of Clinical and Experimental Neuropsychology, 22,
syntactic development, and (5) reading and spelling 408–417.
delays. Marquardt, T. M., Sussman, H. M., and Davis, B. L. (2000).
Developmental apraxia of speech: Advances in theory and
Clearly, DAS is a problematic diagnostic category practice. In D. Vogel and M. Cannito (Eds.), Treating dis-
for both research and clinical practice. Although it has orders of speech motor control. Austin, TX: Pro-Ed.
long been a focus of research and a subject of intense Morley, M. E., Court, D., and Miller, H. (1954). Devel-
interest to clinicians, little consensus exists on definition, opmental dysarthria. British Medical Journal, 1, 463–467.
etiology, and characterization of behavioral or neural Murdoch, B. E., Aqttard, M. D., Ozanne, A. E., and Stokes,
correlates. Circularity in the way in which etiology and P. D. (1995). Impaired tongue strength and endurance in
124 Part II: Speech
developmental verbal dyspraxia: A physiological analysis. Shriberg, L. D., Aram, D. M., and Kwiatkowski, J. (1997).
European Journal of Disorders of Communication, 30, 51–64. Developmental apraxia of speech: III. A subtype marked by
Shriberg, L. D., Aram, D. M., and Kwiatkowski, J. (1997). inappropriate stress. Journal of Speech, Language, and
Developmental apraxia of speech: I. Descriptive and theo- Hearing Research, 40, 313–337.
retical perspectives. Journal of Speech, Language, and Skinder, A., Connaghan, K., Strand, E. A., and Betz, S.
Hearing Research, 40, 273–285. (2000). Acoustic correlates of perceived lexical stress errors
Stackhouse, J. (1992). Developmental verbal dyspraxia: I. A in children with developmental apraxia of speech. Journal of
review and critique. European Journal of Disorders of Com- Medical Speech-Language Pathology, 8, 279–284.
munication, 27, 19–34. Skinder, M. S., Strand, E. A., and Mignerey, M. (1999). Per-
Sussman, H. M. (1988). The neurogenesis of phonology. In ceptual and acoustic analysis of lexical and sentential stress
H. Whitaker (Ed.), Phonological processes and brain mech- in children with developmental apraxia of speech. Journal of
anisms. New York: Springer-Verlag. Medical Speech-Language Pathology, 7, 133–144.
Thoonen, G., Maasen, B., Gabreels, F., Schreuder, R., and de Square, P. (1994). Treatment approaches for developmental
Swart, B. (1997). Towards a standardized assessment pro- apraxia of speech. Clinics in Communication Disorders, 4,
cedure for DAS of speech. European Journal of Disorders of 151–161.
Communication, 32, 37–60. Stackhouse, J., and Snowling, M. (1992). Barriers to literacy
Velleman, S. L., and Shriberg, L. D. (1999). Metrical analysis development: I. Two cases of developmental verbal dys-
of the speech of children with suspected developmental praxia. Cognitive Neuropsychology, 9, 273–299.
apraxia of speech. Journal of Speech, Language, and Hear- Strand, E. A., and Debertine, P. (2000). The e‰cacy of integral
ing Research, 42, 1444–1460. stimulation intervention with developmental apraxia of
Velleman, S. L., and Strand, K. (1993). Developmental verbal speech. Journal of Medical Speech-Language Pathology, 8,
dyspraxia. In J. E. Bernthal and N. W. Bankson (Eds.), 295–300.
Child phonology: Characteristics, assessment, and interven- Sussman, H. M., Marquardt, T. P., and Doyle, J. (2000). An
tion with special populations. New York: Thieme. acoustic analysis of phonemic integrity and contrastiveness
Vihman, M. M. (1997). Phonological development. Cambridge, in developmental apraxia of speech. Journal of Medical
UK: Blackwell. Speech-Language Pathology, 8, 301–313.
Thoonen, G., Maasen, B., Wit, J., Gabreels, F., and
Further Readings Schreuder, R. (1994). Feature analysis of singleton conso-
nant errors in developmental verbal dyspraxia (DVD).
Aram, D. M., and Nelson, J. E. (1982). Child language dis- Journal of Speech, Language, and Hearing Research, 37,
orders. St. Louis: Mosby. 1424–1440.
Bradford, A., and Dodd, B. (1994). The motor planning abili- Thoonen, G., Maasen, B., Wit, J., Gabreels, F., and
ties of phonologically disordered children. European Journal Schreuder, R. (1996). The integrated use of maximum per-
of Disorders of Communication, 29, 349–369. formance tasks in di¤erential diagnostic evaluations among
Bridgeman, E., and Snowling, M. (1988). The perception of children with motor speech disorders. Clinical Linguistics
phoneme sequence: A comparison of dyspraxic and normal and Phonetics, 10, 311–336.
children. British Journal of Communication, 23, 25–252. Thoonen, G., Maasen, B., Gabreels, F., and Schreuder, R.
Crary, M. (1993). Developmental motor speech disorders. San (1999). Validity of maximum performance tasks to diagnose
Diego, CA: Singular Publishing Group. motor speech disorders in children. Clinical Linguistics and
Croce, R. (1993). A review of the neural basis of apractic dis- Phonetics, 13, 1–23.
orders with implications for remediation. Adapted Physical Till, J. A., Yorkston, K. M., and Buekelman, D. R. (1994).
Review Quarterly, 10, 173–215. Motor speech disorders: Advances in assessment and treat-
Dewey, D. (1995). What is developmental dyspraxia? Brain and ment. Baltimore: Paul H. Brookes.
Cognition, 29, 254–274. Walton, J. H., and Pollock, K. E. (1993). Acoustic validation
Dewey, D., Roy, E. A., Square-Storer, P. A., and Hayden, D. of vowel error patterns in developmental apraxia of speech.
(1988). Limb and oral praxic abilities of children with ver- Clinical Linguistics and Phonetics, 2, 95–101.
bal sequencing deficits. Developmental Medicine and Child Williams, R., Packman, A., Ingham, R., and Rosenthal, J.
Neurology, 30, 743–751. (1981). Clinician agreement of behaviors that identify
Hall, P. K. (1989). The occurrence of developmental apraxia of developmental articulatory dyspraxia. Australian Journal of
speech in a mild articulation disorder: A case study. Journal Human Communication Disorders, 8, 16–26.
of Communication Disorders, 22, 265–276. Yorkston, K. M., Buekelman, D. R., Strand, E. A., and Bell,
Hayden, D. A. (1994). Di¤erential diagnosis of motor speech K. R. (1999). Management of motor speech disorders.
dysfunction in children. Clinics in Communication Dis- Austin, TX: Pro-Ed.
orders, 4, 119–141.
Hodge, M. (1994). Assessment of children with developmental
apraxia of speech: A rationale. Clinics in Communication
Disorders, 4, 91–101.
Kent, R. D. (2000). Research on speech motor control and its
Dialect, Regional
disorders: A review and prospective. Journal of Communi-
cation Disorders, 33, 391–428. Dialects or language varieties are a result of systematic,
Pollock, K. E., and Hall, P. K. (1991). An analysis of the
internal linguistic changes that occur within a language.
vowel misarticulations of five children with developmental
apraxia of speech. Clinical Linguistics and Phonetics, 5, Unlike accents, in which linguistic changes occur mainly
207–224. at the phonological level, dialects reflect structural
Rosenbek, J. C., and Wertz, R. T. (1972). A review of fifty changes in phonology, morphology, and syntax, as well
cases of developmental apraxia of speech. Language, as lexical or semantic changes. The degree of mutual
Speech, and Hearing Services in the Schools, 5, 23–33. intelligibility that a speaker’s language has with a des-
Dialect, Regional 125
ignated standard linguistic system is often used to dis- niques were used as primary mechanisms of data col-
tinguish dialect from language. Mutual intelligibility lection. A field worker would visit an area and talk to
means that speakers of one dialect can understand residents using predetermined elicitation techniques that
speakers of another dialect. would encourage the speaker to produce the distinctive
Although the construct of mutual intelligibility is fre- items of interest. The field worker would then manually
quently employed to di¤erentiate dialect from language, note whether the individual’s speech contained the dis-
there are counterexamples. On one hand, speakers may tinctive linguistic features of interest. These methods
have the same language, but the dialects may not be generated a number of linguistic atlases that contained
mutually intelligible. For example, Chinese has a num- linguistic maps displaying geographical distributions of
ber of dialects, such as Cantonese and Mandarin, each language characteristics.
spoken in di¤erent geographical regions. Although Can- Data were used to determine where a selected set of
tonese and Mandarin speakers consider these dialects, features was produced and where people stopped using
the two lack mutual intelligibility since those who speak this same set of features. The selected features could in-
only Cantonese do not easily understand those who clude vocabulary, specific sounds, or grammatical forms.
speak only Mandarin, and vice versa. On the other hand, Lines, or isoglosses, were drawn on a map to indicate the
speakers may produce di¤erent languages but have mu- existence of specific features. When multiple isoglosses,
tual intelligibility. For instance, Norwegian, Swedish, or a bundle of isoglosses, surround a specific region, this
and Danish are thought of as di¤erent languages, yet is used to designate dialect boundaries. The bundle of
speakers of these languages can easily understand one isoglosses on a linguistic map would indicate that people
another. on one side produced a number of lexical items and
A dialect continuum or dialect continua may account grammatical forms that were di¤erent from the speech of
for lack of mutual intelligibility in a large territory. A those who lived outside the boundary. Theoretically, the
dialect continuum refers to a distribution of sequentially dialect was more distinctive, with a greater amount of
arranged dialects that progressively change speech or bundling.
linguistic forms across a broad geographical area. Some After the 1950s, audio and, eventually, video record-
speech shifts may be subtle, others may be more dra- ings were made of speakers in designated regions.
matic. Assume widely dispersed territories are labeled Recordings allow a greater depth of analysis because
towns A, B, C, D, and E and are serially adjacent to one they can be repeatedly replayed. Concurrent with these
another, thereby creating a continuum. B is adjacent to technological advances, there was increased interest in
A and C, C is adjacent to B and D, and so on. There will urban dialects, and investigators began to explore di-
be mutual intelligibility between dialects spoken in A versity of dialects within large cities, such as Boston,
and B, between B and C, between C and D, and between Detroit, New York, and London. Technological devel-
D and E. However, the dialects of the two towns at the opments also led to more quantitative studies. Strong
extremes, A and E, may not be mutually intelligible, statistical analysis (dialectometry) has evolved since the
owing to the continuous speech and language shifts that 1970s and allows the investigator to explore large data
have occurred across the region. It is also possible that sets with large numbers of contrasts.
some of the intermediate dialects, such as B and D, may Several factors contribute to the formation of regional
not be mutually intelligible. dialects. Among these factors are settlement and migra-
Because di¤erent conditions influence dialects, it is tion patterns. For instance, regional English varieties
not easy to discriminate dialect precisely from language. began to appear in the United States as speakers immi-
Using mutual intelligibility as a primary marker of dis- grated from di¤erent parts of Britain. Speakers from the
tinction should be considered relative to the territories of eastern region of England settled in New England, and
interest. For example, in the United States, the concept those from Ulster settled in western New England and in
of mutual intelligibility appears valid, whereas it is not Appalachia. Each contributed di¤erent variations to the
completely valid in many other countries. region in which they settled.
Dialects exist in all languages and are often discussed Regional dialect formation may also result from
in terms of social or regional varieties. Social dialects the presence of natural boundaries such as mountains,
represent a speaker’s social stratification within a given rivers, and swamps. Because it was extremely di‰cult to
society or cultural group. Regional dialects are asso- traverse the Appalachian mountain range, inhabitants of
ciated with geographical location or where speakers live. the mountains were isolated and retained older English
Regional and social dialects may co-occur within lan- forms that contributes some of the unique characteristics
guage patterns of the same speaker. In other words, so- of Appalachian English. For example, the morphologi-
cial and regional dialects are not mutually exclusive. cal a-prefix in utterances such as ‘‘He come a-running’’
Regional dialects constitute a unique cluster of lan- or ‘‘She was a-tellin’ the story’’ appears to be a retention
guage characteristics that are distributed across a speci- from older forms of English that were prevalent in the
fied geographical area. Exploration of regional dialectal seventeenth century.
systems is referred to as dialectology, dialect geography, Commerce and culture also play important roles in
or linguistic geography. For many years, dialects spoken influencing regional dialects, as can be observed in the
in cities were thought of as prestigious. Therefore, in unique dialect of people in Baltimore, Maryland.
traditional dialect studies, data were mainly collected in Speakers of ‘‘Bawlamerese’’ live in ‘‘Merlin’’ (Mary-
rural areas. Surveys, questionnaires, and interview tech- land), whose state capitol is ‘‘Napolis’’ (Annapolis),
126 Part II: Speech
located next to ‘‘Warshnin’’ (Washington, D.C.), and but vascular, traumatic, and degenerative diseases are
refer to Bethlehem Steel as ‘‘Bethlum.’’ Because the their most common cause in most clinical settings; neo-
‘‘Bethlum’’ mill, located in Fells Point, has been a pri- plastic, toxic-metabolic, infectious, and inflammatory
mary employer of many individuals, language has causes are also possible.
evolved to discuss employment. Many will say they work Although incidence and prevalence are not precisely
‘‘down a point’’ or ‘‘down a mill,’’ where the boss will known, dysarthria often is present in a number of fre-
‘‘har and far’’ (hire and fire) people. While most people quently occurring neurological diseases, and it probably
working ‘‘down a point’’ live in ‘‘Dundock’’ (Dundalk), represents a significant proportion of all acquired neu-
some may live as far away as ‘‘Norf Abnew’’ (North rological communication disorders. For example, ap-
Avenue), ‘‘Habberdy Grace’’ (Harve de Grace), or even proximately one-third of people with traumatic brain
‘‘Klumya’’ (Columbia). injury may be dysarthric, with nearly double that preva-
Two other types of geolinguistic variables are often lence during the acute phase (Sarno, Buonaguro, and
associated with regional dialects. One variable is a set of Levita, 1986; Yorkston et al., 1999). Dysarthria proba-
linguistic characteristics that are unique to a geographi- bly occurs in 50%–90% of people with Parkinson’s dis-
cal area or that occur only in that area. For instance, ease, with increased prevalence as the disease progresses
unique to western Pennsylvania, speakers say ‘‘youse’’ (Logemann et al., 1978; Mlcoch, 1992), and it can be
(you singular), ‘‘yens’’ (you plural) and ‘‘yens boomers’’ among the most disabling symptoms of the disease in
(a group of people). The second variable is the frequency some cases (Dewey, 2000). Dysarthria emerges very fre-
of occurrence of regional linguistic characteristics in a quently during the course of amyotrophic lateral sclero-
specific geographic area. For example, the expression sis (ALS) and may be among the presenting symptoms
‘‘take ’er easy’’ is known throughout the United States, and signs in over 20% (Rose, 1977; Gubbay et al., 1985).
but mainly used in central and western Pennsylvania. It occurs in 25% of patients with lacunar stroke (Arboix
and Marti-Vilata, 1990). In a large tertiary care center,
—Adele Proctor
dysarthria was the primary communication disorder in
46% of individuals with any acquired neurological dis-
Further Readings ease seen for speech-language pathology evaluation over
Chambers, J. K. (1992). Dialect acquisition. Language, 68, a 4-year period (Du¤y, 1995).
673–705. The clinical diagnosis is based primarily on auditory
Chambers, J. K., and Trudgill, P. (1998). Dialectology. New perceptual judgments of speech during conversation,
York: Cambridge University Press. sentence repetition, and reading, as well as performance
Labov, W. (1994). Principles of linguistic change. Vol. 1. Inter- on tasks such as vowel prolongation and alternating
nal factors. Cambridge, MA: Blackwell. motion rates (AMRs; for example, repetition of ‘‘puh,’’
Linn, M. D. (1998). Handbook of dialects and language varia-
tion. New York: Academic Press.
‘‘tuh,’’ and ‘‘kuh’’ as rapidly and steadily as possible).
Romaine, S. (2000). Language in society: An introduction to Vowel prolongation permits judgments about respira-
sociolinguistics. New York: Oxford University Press. tory support for speech as well as the quality, pitch, and
duration of voice. AMRs permit judgments about the
rate and rhythm of repetitive movements and are quite
useful in distinguishing among certain dysarthria types
(e.g., they are typically slow but regular in spastic dys-
Dysarthrias: Characteristics and arthria, but irregular in ataxic dysarthria). Visual and
physical examination of the speech mechanism at rest
Classification and during nonspeech responses (e.g., observations of
asymmetry, weakness, atrophy, fasciculations, adventi-
The dysarthrias are a group of neurological disorders tious movements, pathological oral reflexes) and in-
that reflect disturbances in the strength, speed, range, formation from instrumental measures (e.g., acoustic,
tone, steadiness, timing, or accuracy of movements nec- endoscopic, videofluorographic) often provide confirma-
essary for prosodically normal, e‰cient and intelligible tory diagnostic evidence.
speech. They result from central or peripheral nervous Dysarthria severity can be indexed in several ways,
system conditions that adversely a¤ect respiratory, pho- but quantitative measures usually focus on intelligibility
natory, resonatory, or articulatory speech movements. and speaking rate. The most commonly used intelligibil-
They are often accompanied by nonspeech impairments ity measures are the Computerized Assessment of Intel-
(e.g., dysphagia, hemiplegia), but sometimes they are the ligibility in Dysarthric Speakers (Yorkston, Beukelman,
only manifestation of neurological disease. Their course and Traynor, 1984) and the Sentence Intelligibility Test
can be transient, improving, exacerbating-remitting, (Yorkston, Beukelman, and Tice, 1996), but other mea-
progressive, or stationary. sures are available for clinical and research purposes
Endogenous or exogenous events as well as genetic (Enderby, 1983; Kent et al., 1989).
influences can cause dysarthrias. Their neurological A wide variety of acoustic, physiological, and ana-
bases can be present congenitally or they can emerge tomical imaging methods are available for assessment.
acutely, subacutely, or insidiously at any time of life. Some are easily used clinically, whereas others are pri-
They are associated with many neurological conditions, marily research tools. Studies using them have often
Dysarthrias: Characteristics and Classification 127
yielded results consistent with predictions about patho- AMRs, inappropriate variations in pitch, loudness, and
physiology from auditory-perceptual classification, but duration, and sometimes excess and equal stress across
discrepancies that have been found make it clear that syllables.
correspondence between perceptual attributes and phys- Hypokinetic dysarthria is associated with basal gan-
iology cannot be assumed (Du¤y and Kent, 2001). glia control circuit pathology, and its features seem
Methods that show promise or that already have refined mostly related to rigidity and reduced range of motion.
what we understand about the anatomical and phys- Parkinson’s disease is the prototypic disorder associated
iological underpinnings of the dysarthrias include with hypokinetic dysarthria, but other conditions can
acoustic, kinematic, and aerodynamic methods, elec- also cause it. Its distinguishing characteristics include
tromyography, electroencephalography, radiography, reduced loudness, breathy-tight dysphonia, monopitch
tomography, computed tomography, magnetic reso- and monoloudness, and imprecise and sometimes rapid,
nance imaging, functional magnetic resonance imaging, accelerating, or ‘‘blurred’’ articulation and AMRs. Dys-
positron emission tomography, single-photon emission fluency and palilalia also may be apparent.
tomography, and magnetoencephalography (McNeil, Hyperkinetic dysarthria is also associated with basal
1997; Kent et al., 2001). ganglia control circuit pathology. Unlike hypokinetic
The dysarthrias can be classified by time of onset, dysarthria, its distinguishing characteristics are a prod-
course, site of lesion, and etiology, but the most widely uct of involuntary movements that interfere with in-
used classification system in use today is based on the tended speech movements. Its manifestations vary across
auditory-perceptual method developed by Darley, several causal movement disorders, which can range
Aronson, and Brown (1969a, 1969b, 1975). Often re- from relatively regular and slow (tremor, palatophar-
ferred to as the Mayo Clinic system, the method iden- yolaryngeal myoclonus), to irregular but relatively sus-
tifies dysarthria types, with each type representing a tained (dystonia), to relatively rapid and predictable or
perceptually distinguishable grouping of speech charac- unpredictable (chorea, action myoclonus, tics). These
teristics that presumably reflect underlying pathophysi- movements may be a nearly constant presence, but
ology and locus of lesion. The following summarizes the sometimes they are worse during speech or activated
major types, their primary distinguishing perceptual only during speech. They may a¤ect any one or all levels
attributes, and their presumed underlying localization of speech production, and their e¤ects on speech can be
and distinguishing neurophysiological deficit. highly variable. Distinguishing characteristics usually
Flaccid dysarthria is due to weakness in muscles reflect regular or unpredictable variability in phrasing,
supplied by cranial or spinal nerves that innervate res- voice, articulation, or prosody.
piratory, laryngeal, velopharyngeal, or articulatory Unilateral upper motor neuron dysarthria has an ana-
structures. Its specific characteristics depend on which tomical rather than pathophysiological label because it
nerves are involved. Trigeminal, facial, or hypoglossal has received little systematic study. It most commonly
nerve lesions are associated with imprecise articulation results from stroke a¤ecting upper motor neuron path-
of phonemes that rely on jaw, face, or lingual movement. ways. Because the damage is unilateral, severity usually
Vagus nerve lesions can lead to hypernasality or weak is rarely worse than mild to moderate. Its characteristics
pressure consonant production when the pharyngeal often overlap with varying combinations of those asso-
branch is a¤ected or to breathiness, hoarseness, dip- ciated with flaccid, spastic, or ataxic dysarthria (Du¤y
lophonia, stridor, or short phrases when the laryngeal and Folger, 1996; Hartman and Abbs, 1992).
branches are involved. When spinal respiratory nerves Mixed dysarthrias reflect combinations of two or
are a¤ected, reduced loudness, short phrases, and alter- more of the single dysarthria types. They occur more
ations in breath patterning for speech may be evident. In frequently than any single dysarthria type in many clini-
general, unilateral lesions and lesions of a single nerve cal settings. Some diseases are associated only with a
produce relatively mild deficits, whereas bilateral lesions specific mix; for example, flaccid-spastic dysarthria is the
or multiple nerve involvement can have devastating only mix expected in ALS. Other diseases, because the
e¤ects on speech. locus of lesions they cause is less predictable (e.g., mul-
Spastic dysarthria is usually associated with bilateral tiple sclerosis, traumatic brain injury), may be associated
lesions of upper motor neuron pathways that innervate with virtually any mix. The presence of mixed dysarthria
relevant cranial and spinal nerves. Its distinguishing is very uncommon or incompatible with some diseases
characteristics are attributed to spasticity, and they often (e.g., myasthenia gravis is associated only with flaccid
include a strained-harsh voice quality, slow rate, slow dysarthria), so sometimes the presence of a mixed dys-
but regular speech AMRs, and restricted pitch and arthria can make a particular disease an unlikely cause
loudness variability. All components of speech produc- or raise the possibility that more than a single disease is
tion are usually a¤ected. present.
Ataxic dysarthria is associated with lesions of the Because of their potential to inform our understand-
cerebellum or cerebellar control circuits. Its distinguishing of the neural control of speech, and because their
ing characteristics are attributed primarily to inco- prevalence in frequently occurring neurological diseases
ordination, and they are perceived most readily in is high and their functional e¤ects are significant, dys-
articulation and prosody. Characteristics often include arthrias draw considerable attention from clinicians and
irregular articulatory breakdowns, irregular speech researchers. The directions of clinical and more basic
128 Part II: Speech
research are broad, but many current e¤orts are aimed Mlcoch, A. G. (1992). Diagnosis and treatment of parkinso-
at the following: refining the di¤erential diagnosis and nian dysarthria. In W. C. Koller (Ed.), Handbook of Par-
indices of severity; delineating acoustic and physiological kinson’s disease (pp. 227–254). New York: Marcel Dekker.
correlates of dysarthria types and intelligibility; more Rose, F. C. (1977). Motor neuron disease. New York: Grune
and Stratton.
precisely establishing the relationships among perceptual
Sarno, M. T., Buonaguro, A., and Levita, E. (1986). Char-
dysarthria types, neural structures and circuitry, and acteristics of verbal impairment in closed head injured
acoustic and pathophysiological correlates; and devel- patients. Archives of Physical Medicine and Rehabilitation,
oping more e¤ective treatments for the underlying 67, 400–405.
impairments and functional limitations imposed by Yorkston, K. M., Beukelman, D. R., Strand, E. A., and Bell,
them. Advances are likely to come from several dis- K. R. (1999). Management of motor speech disorders in
ciplines (e.g., speech-language pathology, speech science, children and adults. Austin, TX: Pro-Ed.
neurology) working in concert to integrate clinical, Yorkston, K. M., Beukelman, D. R., and Tice, R. (1996).
anatomical, and physiological observations into a co- Sentence Intelligibility Test. Lincoln, NE: Tice Technology
herent understanding of the clinical disorders and their Services.
Yorkston, K. M., Beukelman, D. R., and Traynor, C. (1984).
underpinnings.
Computerized Assessment of Intelligibility of Dysarthric
See also dysarthrias: management. Speech. Austin, TX: Pro-Ed.
—Joseph R. Du¤y
Further Readings
References
Bunton, K., and Weismer, G. (2001). The relationship between
Arboix, A., and Marti-Vilata, J. L. (1990). Lacunar infarctions perception and acoustics for a high-low vowel contrast
and dysarthria. Archives of Neurology, 47, 127. produced by speakers with dysarthria. Journal of Speech,
Darley, F. L., Aronson, A. E., and Brown, J. R. (1969a). Dif- Language, and Hearing Research, 44, 1215–1228.
ferential diagnostic patterns of dysarthria. Journal of Speech Cannito, M. P., Yorkston, K. M., and Beukelman, D. R.
and Hearing Research, 12, 249–269. (Eds.). (1998). Neuromotor speech disorders: Nature, assess-
Darley, F. L., Aronson, A. E., and Brown, J. R. (1969b). ment, and management. Baltimore: Paul H. Brookes.
Clusters of deviant dimensions in the dysarthrias. Journal of Du¤y, J. R. (1994). Emerging and future concerns in motor
Speech and Hearing Research, 12, 462–496. speech disorders. American Journal of Speech-Language
Darley, F. L., Aronson, A. E., and Brown, J. R. (1975). Motor Pathology, 3, 36–39.
speech disorders. Philadelphia: Saunders. Gentil, M., and Pollack, P. (1995). Some aspects of parkinso-
Dewey, R. B. (2000). Clinical features of Parkinson’s disease. nian dysarthria. Journal of Medical Speech-Language Pa-
In C. H. Adler and J. E. Ahlskog (Eds.), Parkinson’s disease thology, 3, 221–238.
and movement disorders (pp. 77–84). Totowa, NJ: Humana Goozee, J. V., Murdoch, B. E., and Theodoros, D. G. (1999).
Press. Electropalatographic assessment of articulatory timing
Du¤y, J. R. (1995). Motor speech disorders: Substrates, di¤er- characteristics in dysarthria following traumatic brain in-
ential diagnosis, and management. St. Louis: Mosby. jury. Journal of Medical Speech-Language Pathology, 7,
Du¤y, J. R., and Folger, W. N. (1996). Dysarthria associated 209–222.
with unilateral central nervous system lesions: A retrospec- Hammen, V. L., and Yorkston, K. M. (1994). Respiratory
tive study. Journal of Medical Speech-Language Pathology, patterning and variability in dysarthric speech. Journal of
4, 57–70. Medical Speech-Language Pathology, 2, 253–262.
Du¤y, J. R., and Kent, R. D. (2001). Darley’s contributions Kent, R. D., Du¤y, J. R., Kent, J. F., Vorperian, H. K., and
to the understanding, di¤erential diagnosis, and scientific Thomas, J. E. (1999). Quantification of motor speech abili-
study of the dysarthrias. Aphasiology, 15, 275–289. ties in stroke: Time-energy analysis of syllable and word
Enderby, P. (1983). Frenchay dysarthria assessment. San Diego, repetition. Journal of Medical Speech-Language Pathology,
CA: College-Hill Press. 7, 83–90.
Gubbay, S. S., Kahana, E., Zilber, N., and Cooper, G. (1985). Kent, R. D., Kent, J. F., Du¤y, J. R., and Weismer, G. (1998).
Amyotrophic lateral sclerosis: A study of its presentation The dysarthrias: Speech-voice profiles, related dysfunctions,
and prognosis. Journal of Neurology, 232, 295–300. and neuropathology. Journal of Medical Speech-Language
Hartman, D. E., and Abbs, J. H. (1992). Dysarthria associated Pathology, 6, 165–211.
with focal unilateral upper motor neuron lesion. European Kent, R. D., Kent, J. F., Du¤y, J. R., Weismer, G., and Stun-
Journal of Disorders of Communication, 27, 187–196. tebeck, S. (2000). Ataxic dysarthria. Journal of Speech-
Kent, R. D., Du¤y, J. R., Slama, A., Kent, J. F., and Clift, A. Language-Hearing Research, 43, 1275–1289.
(2001). Clinicoanatomic studies in dysarthria: Review, cri- Kent, R. D., Kent, J. F., Weismer, G., and Du¤y, J. R. (2000).
tique, and directions for research. Journal of Speech, Lan- What dysarthrias can tell us about the neural control of
guage, and Hearing Research, 44, 535–551. speech. Journal of Phonetics, 28, 273–302.
Kent, R. D., Weismer, G., Kent, J. F., and Rosenbek, J. C. Kent, R. D., Kim, H., Weismer, H. G., Kent, J. F., Rosenbek,
(1989). Toward phonetic intelligibility testing in dysarthria. J., Brooks, B. R., and Workinger, M. (1994). Laryngeal
Journal of Speech and Hearing Disorders, 54, 482–499. dysfunction in neurological disease: Amyotrophic lateral
Logemann, J. A., Fischer, H. B., Boshes, B., and Blonsky, E. sclerosis, Parkinson’s disease, and stroke. Journal of Medi-
(1978). Frequency and cooccurence of vocal tract dysfunc- cal Speech-Language Pathology, 2, 157–176.
tion in the speech of a large sample of Parkinson patients. Kent, R. D., Weismer, G., Kent, J. F., Vorperian, H. K., and
Journal of Speech and Hearing Disorders, 43, 47–57. Du¤y, J. R. (1999). Acoustic studies of dysarthric speech:
McNeil, M. R. (Ed.). (1997). Clinical management of sensori- Methods, progress, and potential. Journal of Communica-
motor speech disorders. New York: Thieme. tion Disorders, 32, 141–186.
Dysarthrias: Management 129
King, J. B., Ramig, L. O., Lemke, J. H., and Horii, Y. (1994). nicative intent using speech plus the environment, con-
Parkinson’s disease: Longitudinal changes in acoustic text, and augmentative aids. Intelligibility refers to the
parameters of phonation. Journal of Medical Speech- degree to which the listener is able to understand the
Language Pathology, 2, 29–42. acoustic signal (Kent et al., 1989). Comprehensibility
Kleinow, J., Smith, A., and Ramig, L. O. (2001). Speech motor
refers to the dynamic process by which individuals con-
stability in IPD: E¤ects of rate and loudness manipulations.
Journal of Speech, Language, and Hearing Research, 44, vey communicative intent, using the acoustic signal plus
1041–1051. all information available from the environment (York-
LaPointe, L. L. (1999). Journal of Medical Speech-Language ston, Strand, and Kennedy, 1996). In conversational
Pathology, 7(2) (entire issue). interaction, listeners take advantage of environmental
LaPointe, L. L. (2000). Journal of Medical Speech-Language cues such as facial expression, gestures, the situation, the
Pathology, 8(4) (entire issue). topic, and so on. As the acoustic speech signal becomes
Lefkowitz, D., and Netsell, R. (1994). Correlation of clinical more degraded, contextual information becomes more
deficits with anatomical lesions: Post-traumatic speech dis- critical for maintaining comprehensibility.
orders and MRI. Journal of Medical Speech-Language Pa- Decisions regarding whether to focus treatment on
thology, 2, 1–14.
intelligibility or on comprehensibility depend largely
Moore, C. A., Yorkston, K. M., and Beukelman, D. R. (Eds.).
(1991). Dysarthria and apraxia of speech: Perspectives on on the severity of the dysarthria. Management for
management. Baltimore: Paul H. Brookes. mildly dysarthric individuals focuses on improving in-
Murdoch, B. E., Thompson, E. C., and Stokes, P. D. (1994). telligibility and naturalness. Individuals with moderate
Phonatory and laryngeal dysfunction following upper mo- levels of severity benefit from both intelligibility and
tor neuron vascular lesions. Journal of Medical Speech- comprehensibility approaches. Finally, management of
Language Pathology, 2, 177–190. very severe dysarthria often focuses on augmentative
Robin, D. A., Yorkston, K. M., and Beukelman, D. R. (Eds.). communication.
(1996). Disorders of motor speech: Assessment, treatment, Management focus also depends on whether the dys-
and clinical characterization. Baltimore: Paul H. Brookes. arthria is associated with a condition in which physio-
Samlan, R., and Weismer, G. (1995). The relationship of
logical recovery is likely to occur (e.g., cerebrovascular
selected perceptual measures of diadochokinesis in speech
intelligibility in dysarthric speakers with amyotrophic lat- accident) versus one in which the dysarthria is likely to
eral sclerosis. American Journal of Speech-Language Pa- get progressively worse (e.g., amyotrophic lateral sclero-
thology, 4, 9–13. sis [ALS]). For patients with degenerative diseases such
Solomon, N. P., Lorell, D. M., Robin, D. R., Rodnitzky, R. L., as ALS, early treatment may focus on maintaining in-
and Luschei, E. S. (1994). Tongue strength and endurance telligibility. Later in the disease progression, the focus of
in mild to moderate Parkinson’s disease. Journal of Medical treatment is less on the acoustic signal and more on
Speech-Language Pathology, 3, 15–26. communicative interaction, maximizing listener support
Thompson, E. C., Murdoch, B. E., and Stokes, P. D. (1995). and environmental cues, allowing the patient to continue
Tongue function in subjects with upper motor neuron type to use speech for a much longer period of time before
dysarthria following cerebrovascular accident. Journal of
having to use augmentative and alternative commu-
Medical Speech-Language Pathology, 3, 27–40.
Till, J. A., Yorkston, K. M., and Beukelman, D. R. (Eds.). nication. Yorkston (1996) provides a comprehensive
(1994). Motor speech disorders: Advances in assessment and review of the treatment e‰cacy literature for the dys-
treatment. Baltimore: Paul H. Brookes. arthrias associated with a number of di¤erent neurolog-
Ziegler, W., and Hartmann, E. (1996). Perceptual and acoustic ical disorders.
methods in the evaluation of dysarthric speech. In M. J.
Ball and M. Duckworth (Eds.), Advances in clinical pho-
netics (pp. 91–114). Amsterdam: John Benjamins. Intelligibility
Deficits in intelligibility vary according to the type of
dysarthria as well as the relative contribution of the basic
Dysarthrias: Management physiological mechanisms involved in speech: respira-
tion, phonation, resonance, and articulation. Medical
(e.g., surgical, pharmacological), prosthetic (e.g., palatal
Dysarthria is a collective term for a group of neurologi- lift), and behavioral interventions are used to improve
cal speech disorders caused by damage to mechanisms the function of those physiological systems.
of motor control in the central or peripheral nervous Behavioral intervention for respiratory support fo-
system. The dysarthrias vary in nature, depending on cuses on achieving and maintaining a consistent sub-
the particular neuromotor systems involved. Conse- glottal air pressure level, allowing adequate loudness and
quently, a number of issues are considered when devising length of breath groups (Yorkston et al., 1999). Methods
a management approach for a particular patient. These to improve respiratory support (Netsell and Daniel,
issues include the type of dysarthria (reflecting the under- 1979; Hixon, Hawley, and Wilson, 1982) often involve
lying neuromuscular status), the physiological processes having the speaker blow and maintain target levels of
involved, severity, and the expected course. water pressure (i.e., 5 cm H2 O) for 5 seconds. Sustained
Management of the dysarthrias is generally focused phonation tasks are also used, giving the speaker feed-
on improving the intelligibility and naturalness of back on maintained loudness. Finally, individuals are
speech, or on helping the speaker convey more commu- encouraged to produce sentences with appropriate
130 Part II: Speech
phrase lengths, focusing on maintaining adequate respi- of the soft palate, but researchers and clinicians disagree
ratory pressure. In each case, the clinician works to focus as to their e¤ectiveness. Kuehn and Wachtel (1994) sug-
the speaker’s attention and e¤ort toward taking in more gest the use of continuous positive airway pressure in a
air and using more force with exhaled air. Occasion- resistance exercise program to strengthen the velopha-
ally speakers may release too much airflow during ryngeal muscles. A common prosthetic approach is to
speech. Netsell (1995) has suggested the use of inspir- use a palatal lift, which is a rigid appliance that covers
atory checking, in which patients are taught to use the the hard palate and extends along the surface of the soft
inspiratory muscles to counter the elastic recoil forces of palate, raising it to the pharyngeal wall. Palatal lifts
the respiratory system. For individuals who exhibit dis- should be considered for patients who are consistently
coordination (e.g., ataxic dysarthria), respiratory treat- unable to achieve velopharyngeal closure and who have
ment is focused on helping the speaker consistently relatively isolated velopharyngeal impairment.
initiate phonation at appropriate inspiratory lung vol- Treatment focused on improving articulation often
ume levels, taking the next breath at the appropriate uses the hierarchical practice of selected syllable, words
phrase boundary, given the expiratory lung volume and phrases (Robertson, 2001). However, because artic-
level. ulatory imprecision may be due to reduced respiratory
Laryngeal system impairment frequently results in support, velopharyngeal insu‰ciency, or rate control,
either hypophonia, in which the vocal folds do not the treatment of articulation is not always focused on
achieve adequate closure for phonation (as in flaccid improving the place and manner of articulatory con-
dysarthria, or the hypokinetic dysarthria that accom- tacts. When specific work on improving articulatory
panies Parkinson’s disease), or hyperphonia, in which function is warranted, behavioral approaches involve
the vocal folds exhibit too much closure (as in spastic focusing the speaker’s attention on increased e¤ort for
dysarthria). Individuals with lower motor neuron deficits bigger and stronger movements. Compensatory strat-
involving the laryngeal muscles may benefit from surgi- egies such as using a di¤erent place of articulation or
cal intervention either to medialize the vocal fold or to exaggerating selected articulatory movements may be
augment the bulk of the fold. The most common proce- used (DeFao and Schaefer, 1983). The use of minimal
dure for medialization is a type I thyroplasty, often with contrasts (tie/sigh) or intelligibility drills (having the
arytenoid adduction (Isshiki, Okamura, and Ishikawa, speaker produce a carefully selected set of stimulus
1975; Nasseri and Maragos, 2000). Teflon and autoge- words) focus the speaker’s attention on making specific
nous fat are also used to increase the bulk of a paralyzed sound contrasts salient and clear. Strength training is
or atrophied fold (Heikki, 1998). Patients with myas- sometimes advocated, but only when the speaker is ha-
thenia gravis are typically successfully treated with anti- bitually generating less force than is necessary for speech
cholinesterase drugs or with a thymectomy. This medical and has the capacity to increase strength with e¤ort.
management usually results in improvement in their Strengthening is most appropriate for speakers with
voice and vocal fatigue. flaccid dysarthria; it is contraindicated for patients with
Behavioral treatment for hypophonia focuses on disorders such as myasthenia gravis, in which muscular
increasing glottal closure, but it also requires that the activity causes increasing weakness, and for patients
patient maximize respiratory pressures. For mild weak- with degenerative disorders such as ALS.
ness, exercises to increase the patient’s awareness of e‰- Surgical and medical management may also improve
cient glottal adduction, without extraneous supraglottic articulation. Neural anastomosis is sometimes used to
tension, are helpful. E¤ort closure techniques such as improve function to a damaged nerve, usually the sev-
pushing and grunting may maximize vocal fold adduc- enth cranial nerve (Daniel and Guitar, 1978). Botulinum
tion (Rosenbek and LaPoint, 1985). The Lee Silver- toxin has been used to improve speech in speakers with
man Voice Therapy Program (Ramig et al., 1995, 1996) orofacial and mandibular dystonias (Schulz and Ludlow,
has been shown to be e‰cacious for individuals with 1991). Although pharmacological treatment is frequently
Parkinson’s disease and is a commonly used therapy used to decrease limb spasticity, its e¤ects on articulation
technique to reduce the hypophonic aspects of their are less clear (Du¤y, 1995). Medications to decrease
dysarthria. tremor or chorea sometimes help improve speech by
Treatment of phonation due to laryngeal spasticity reducing the extraneous movement.
is di‰cult, and behavioral intervention typically is Rate control is frequently the focus of treatment for
not successful for this group of patients. Techniques to dysarthric individuals. Yorkston et al. (1999) point out
facilitate head and neck relaxation as well as laryngeal that this variable alone may result in the most dramatic
relaxation, strategies to maximize e‰ciency of the respi- changes in speech intelligibility for some individuals.
ratory system, and the use of postural control may be Rate control is most e¤ective for individuals with hypo-
helpful. Patients with phonatory deficits due to laryngeal kinetic or ataxic dysarthria, but it may be appropriate
dystonia pose similar problems. Medical management, for individuals with other types of dysarthria as well.
such as botulinum toxin injection, is frequently used to Rate reduction improves intelligibility by facilitating
improve the vocal quality of individuals with spasmodic increased precision of movement through the full range,
dysphonia and laryngeal dystonias. by facilitating more appropriate breath group units, and
Behavioral approaches to the treatment of resonance by allowing listeners more time to process the degraded
problems focus on increasing the strength and function acoustic signal.
Dysarthrias: Management 131
Behavioral approaches are geared toward slowing the driving pressure: A note on making the simple even simpler.
rate by increasing consonant and vowel duration, in- Journal of Speech and Hearing Disorders, 48, 413–415.
creasing interword interval durations, and increasing Isshiki, N., Okamura, H., and Ishikawa, T. (1975). Thyro-
pause time at phrasal boundaries, while working to plasty type I (lateral compression) for dysphonia due to
vocal cord paralysis or atrophy. Acta Oto-Laryngologica,
avoid any diminution in speech naturalness. Instrumen-
80, 465–473.
tation can also be helpful in rate control. In delayed au- Kent, R., Weismer, G., Kent, J., and Rosenbek, J. (1989). To-
ditory feedback, the speaker’s own voice is fed back to ward phonetic intelligibility testing in dysarthria. Journal of
the speaker through earphones after an interval delay. Speech and Hearing Disorders, 54, 482–499.
This technique typically slows the rate of speech and Kuehn, D., and Wachtel, J. (1994). CPAP therapy for treating
improves intelligibility. Visual biofeedback is used to fa- hypernasality following closed head injury. In J. A. Till,
cilitate the speaker’s use of pause time and to slow the K. Yorkston, and D. Beukelman (Eds.), Motor speech dis-
rate. Oscilloscopes can provide real-time feedback re- orders: Advances in assessment and treatment (pp. 207–212).
garding rate over time. Computer screens can be used Baltimore: Paul H. Brookes.
that cue the speaker to a target rate and mark the loca- Nasseri, S. S., and Maragos, N. E. (2000). Combination thy-
roplasty and the ‘‘twisted larynx’’: Combined type IV and
tion of pauses (Beukelman, Yorkston, and Tice, 1997).
type I thyroplasty for superior laryngeal nerve weakness.
Comprehensibility Netsell, R. (1995). Speech rehabilitation for individuals with
When there is evidence that the individual is able to im- unintelligible speech and dysarthria: The respiratory and
prove respiratory, phonatory, articulatory, or resonating velopharyngeal systems. Neurophysiology and Neurogenic
aspects of speech through behavioral, prosthetic, or Speech Language Disorders, 5, 6–9.
Netsell, R., and Daniel, B. (1979). Dysarthria in adults: Physi-
medical management, intelligibility is the primary focus
ologic approach to rehabilitation. Archives of Physical
of management. However, as the severity of dysarthria Medicine and Rehabilitation, 60, 502.
increases, management focuses more on the communi- Ramig, L., Countryman, S., O’Brien, C., Hoehn, M., and
cation interaction between the dysarthric speaker and his Thompson, L. (1996). Intensive speech treatment for pa-
or her communicative partners. Management strategies tients with Parkinson’s disease: Short- and long-term com-
are designed to help the listener maximize the use of parison of two techniques. Neurology, 47, 1496–1504.
context to improve ability to understand even a very Ramig, L., Countryman, S., Thompson, L., and Horii, Y.
degraded acoustic signal. Such strategies include being (1995). Comparison of two forms of intensive speech treat-
sure to have the listener’s attention, making eye contact, ment for Parkinson disease. Journal of Speech and Hearing
providing (or asking for) the topic, signaling topic Research, 38, 1232–1251.
Robertson, S. (2001). The e‰cacy of oro-facial and articulation
changes, reducing environmental noise, using simple but
exercises in dysarthria following stroke. International Jour-
complete grammatical constructs, using predictable nal of Language and Communication Disorders, 36(Suppl.),
wording, adding gestures if possible, and using alphabet 292–297.
board supplementation. Also important is to adopt a Rosenbek, J., and LaPoint, L. (1985). The dysarthrias: De-
consistent strategy for communication repair that is scription, diagnosis, and treatment. In D. Johns (Ed.),
agreed upon by both speaker and listener. By working Clinical management of neurogenic communication disorders
on communication interaction between speaker and lis- (pp. 97–152). Boston: Little, Brown.
tener, the dysarthric individual is often able to continue Schulz, G., and Ludlow, C. (1991). Botulinum treatment
to use speech as a primary mode of communication. for orolingual-mandibular dystonia: Speech e¤ects. In C.
See also dysarthrias: characteristics and classi- Moore, K. Yorkston, and D. Beukelman (Eds.), Dysarthria
and apraxia of speech: Perspectives in management (pp.
fication.
227–242). Baltimore: Paul H. Brookes.
—Edythe A. Strand Yorkston, K. (1996). Treatment e‰cacy in dysarthria. Jour-
nal of Speech, Language, and Hearing Research, 39, S46–
References S57.
Yorkston, K. M., Beukelman, D., Strand, E., and Bell, K.
Beukelman, D. R., Yorkston, K. M., and Tice, R. (1997). (1999). Clinical management of motor speech disorders.
Pacer/tally rate measurement software. Lincoln, NE: Tice Boston: Little, Brown.
Technology Services. Yorkston, K., Strand, E., and Kennedy, M. (1996). Com-
Daniel, R., and Guitar, B. (1978). EMG feedback and recovery prehensibility of dysarthric speech: Implications for as-
of facial and speech gestures following neural anastomosis. sessment and treatment planning. American Journal of
Journal of Speech and Hearing Disorders, 43, 9–20. Speech-Language Pathology, 5, 55–66.
DeFao, A., and Schaefer, C. (1983). Bilateral facial paralysis in
a preschool child: Oral-facial and articulatory character- Further Readings
istics. A case study. In W. Berry (Ed.), Clinical dysarthria
(pp. 165–190). Autsin, TX: Pro-Ed. Adams, S. (1994). Accelerating speech in a case of hypokinetic
Du¤y, J. (1995). Motor speech disorders: Substrates, di¤erential dysarthria: Descriptions and treatment. In J. Till, K. York-
diagnosis, and management. St. Louis: Mosby. ston, and D. Beukelman (Eds.), Motor speech disorders:
Heiki, R. (1998). Vocal fold augmentation by injection of Advances in assessment and treatment. Baltimore: Paul H.
autologous fascia. Laryngoscope, 108, 51–54. Brookes.
Hixon, T., Hawley, J., and Wilson, K. (1982). An around-the- Aronson, A. E. (1990). Clinical voice disorders. New York:
house device for the clinical determination of respiratory Thieme.
132 Part II: Speech
Bellaire, K., Yorkston, K., and Beukelman, D. (1986). Modi- (Eds.), Treating disorders speech motor control: For clini-
fication of breath patterning to increase naturalness of a cian, by clinicians. Austin, TX: Pro-Ed.
mildly dysarthric speaker. Journal of Communicative Dis- Yorkston, K., and Beukelman, D. (Eds.). (1989). Recent
orders, 19, 271–280. advances in clinical dysarthria. Boston: College-Hill Press.
Cannito, M., Yorkston, K., and Beukelman, D. (1998). Neu- Yorkston, K., Hammen, V., Beukelman, D., and Traynor, C.
romotor speech disorders. Baltimore: Paul H. Brookes. (1990). The e¤ect of rate control on the intelligibility and
Darley, F., Aronson, A., and Brown, J. (1969a). Di¤erential naturalness of dysarthric speech. Journal of Speech and
diagnostic patterns of dysarthria. Journal of Speech and Hearing Disorders, 55, 550–561.
Hearing Research, 12, 246–269. Yorkston, K., Honsinger, M., Beukelman, D., and Taylor, T.
Darley, F., Aronson, A., and Brown, J. (1969b). Clusters of (1989). The e¤ects of palatal lift fitting on the perceived
deviant speech dimensions in the dysarthrias. Journal of articulatory adequacy of dysarthric speakers. In K. York-
Speech and Hearing Research, 12, 462–496. ston and D. Beukelman (Eds.), Recent advances in clinical
Darley, F., Aronson, A., and Brown, J. (1975). Motor speech dysarthria. Boston: College-Hill Press.
disorders. Philadelphia: Saunders. Yorkston, K., Miller, R., and Strand, E. (1995). Management
Dworkin, J. (1991). Motor speech disorders: A treatment guide. of speech and swallowing in degenerative neurologic disease.
St. Louis: Mosby. Tucson, AZ: Communication Skill Builders.
Hanson, W., and Metter, E. (1983). DAF speech modifica-
tion in Parkinson’s disease: A report of two cases. In W.
Berry (Ed.), Clinical dysarthria. Boston: College Hill Press.
Jankovic, J. (1989). Blepharospasm and oromandibular- Dysphagia, Oral and Pharyngeal
laryngeal-cervical dystonia: A controlled trial of botulinum
A toxin therapy. In S. Fahn (Ed.), Advances in neurology,
Vol. 50. New York: Raven Press. ‘‘Dysphagia’’ is an impaired ability to swallow. Dys-
Liebson, E., Walsh, M., Jankowiak, J., and Albert, M. (1994). phagia can result from anatomic variation or neuro-
Pharmacotherapy for posttraumatic dysarthria. Neuro- muscular impairment anywhere from the lips to the
psychiatry, Neuropsychology, and Behavioral Neurology, 7, stomach. Although some investigators choose to con-
122–124. sider the voluntary oral preparatory stage of deglutition
McHenry, M., Wilson, R., and Minton, J. (1994). Manage-
as a separate stage, swallowing is traditionally described
ment of multiple physiologic system deficits following trau-
matic brain injury. Journal of Medical Speech-Language as a three-stage event (oral, pharyngeal, and esopha-
Pathology, 2, 58–74. geal). Historically, research as well as evaluation and
McNeil, M. (Ed.). (1997). Clinical management of sensorimotor treatment of dysphagia were directed primarily toward
speech disorders. New York: Thieme. the esophageal stage, which is generally treated by a
McNeil, M., Rosenbek, J., and Aronson, A. (Eds.). (1984). gastroenterologist. However, over the past few decades,
The dysarthrias: Physiology, acoustic perception, and man- speech-language pathologists have become increasingly
agement, San Diego: Singular Publishing Group. responsible for the research in, as well as the diagnosis
Moore, C., Yorkston, K., and Beukelman, D. (1991). Dys- and treatment of, the oral and pharyngeal aspects of
arthria and apraxia of speech: Perspectives on management. deglutition.
Baltimore: Paul H. Brookes.
Netsell, R., and Rosenbek, J. (1985). Treating the dysarthrias:
The neuroanatomical substrate of dysphagia reflects
Speech and language evaluation in neurology. Adult Dis- lower motor neuron innervation by cranial nerves V,
orders. New York: Grune and Stratton. VII, IX, X, and XII. Dysphagia can result from uni-
Ramig, L. (1992). The role of phonation in speech intelligibil- lateral or bilateral cortical insult. Within the cortex,
ity: A review and preliminary data from patients with Par- primary sites that contribute to deglution include the
kinson’s disease. In R. Kent (Ed.), Intelligibility in speech premotor cortex, primary motor cortex, primary soma-
disorders. Amsterdam: John Benjamin. tosensory cortex, insula, and the ventroposterior medial
Ramig, L., and Dromey, C. (1996). Aerodynamic mechanisms nucleus of the thalamus (Alberts et al., 1992; Daniels,
underlying treatment-related changes in vocal intensity in Foundas, Iglesia, et al., 1996; Daniels and Foundas,
patients with Parkinson’s disease. Journal of Speech and 1997). Other portions of the cortical system have also
Robin, D., Yorkston, K., and Beukelman, D. (1996). Disorder
been found to be active during swallowing (Hamdy et
of motor speech. Baltimore: Paul H. Brookes. al., 1999, 2001; Martin et al., 2001).
Rubow, R. (1984). The role of feedback, reinforcement, and Dysphagia is associated with an increased risk of
compliance on training and transfer in biofeedback-based developing malnutrition and respiratory complications
rehabilitation of motor speech disorders. In M. McNeil, such as aspiration pneumonia. In a study by Schmidt
J. Rosenbek, and A. Aronson (Eds.), The dysarthrias: et al. (1994), the odds ratio that pneumonia would de-
Physiology, acoustic perception, and management. San velop was 7.6 times greater for stroke patients who were
Diego, CA: Singular Publishing Group. identified as aspirators than for stroke patients who did
Rubow, R., and Swift, E. (1985). A microcomputer-based not aspirate. Furthermore, the odds ratio of dying was
wearable biofeedback device to improve transfer of treat- 9.2 times greater for patients who aspirated thickened
ment in parkinsonian dysarthria. Journal of Speech and
Hearing Disorders, 49, 26.
viscosities than for those who did not aspirate or who
Till, J., Yorkston, K., and Beukelman, D. (1994). Motor speech aspirated only thin fluids. Davalos et al. (1996) studied
disorders: Advances in assessment and treatment. Baltimore: the e¤ects of dysphagia on nutritional status in stroke
Paul H. Brookes. patients who had similar nutritional status at the time of
Vogel, D., and Miller, L. (1991). A top down approach to hospital admission. One week after the stroke, 48.3%
treatment of dysarthric speech. In D. Vogel and M. Cannito of patients who developed dysphagia while in the hospi-
Dysphagia, Oral and Pharyngeal 133
Table 1. Clinical Signs Suggestive of Dysphagia in Adults patients with dysphagia, 276 (59%) of the 469 patients
Di‰culty triggering the swallow who aspirated were found to have silent aspiration
Di‰culty managing oral secretions, with or without drooling (Smith et al., 1999). (3) When 47 stroke patients with
Abnormal or absent laryngeal elevation during swallow mixed sites of lesions were examined, 24 (51%) of the
attempts patients were found to aspirate; of those 24 patients, 11
Choking or coughing during or after intake of food or liquid (46%) were silent aspirators (Horner and Massey, 1988).
Wet-sounding cough (4) In a study of 107 patients in a rehabilitation facility,
Wet, gurgly voice quality 43 (40%) were found to aspirate on videofluoroscopic
Decreased oral sensation examination; however, clinical evaluation identified only
Weak sign of the cough
18 (42%) of the aspirators (Splaingard et al., 1988). Be-
Prolonged oral preparation with food
Inability to clear the mouth of food after intake cause of the additional expense encountered in caring for
Absent gag reflex patients with respiratory or nutritional complications,
Food or liquid leaking from a tracheostomy site studies such as these support the argument that money,
Fullness or tightness in the throat (globus sensation) as well as life, can be saved when patients are properly
Food or liquid leaking from the nose evaluated.
Regurgitation of food Dysphagia can occur at any age across the life span.
Sensation of food sticking in the throat or sternal region Among young adults, traumatic brain injury is a not
Xerostomia (dry mouth) uncommon cause of acquired dysphagia, whereas elderly
Odynophagia (pain on swallowing) individuals are more likely to acquire dysphagia as a
Repeated incidents of upper respiratory infections with or
result of illness. However, young adults are also suscep-
without a diagnosis of aspiration pneumonia
Tightness or pain in the chest, particularly after eating or tible to the same causes of dysphagia as the elderly.
when lying down Neurological disorders take a particular toll: it has been
Heartburn or indigestion estimated that 300,000–600,000 persons per year experi-
Unintended weight loss not related to disease ence dysphagia secondary to neurological disorders, and
the greatest percentage of these experience dysphagia
secondary to stroke (Doggett et al., 2001). After stroke
tal were malnourished, while only 13.6% of patients and neurological disease, the most frequent causes of
without dysphagia were malnourished. In a study of the dysphagia in adults include muscle disease, head and
nutritional status of patients admitted to a rehabilitation neck surgery, radiation to the head and neck, dementia,
service, 65% of patients admitted with stroke and dys- motor end-plate disease, traumatic brain injury, systemic
phagia were malnourished (Finestone et al., 1995). disease, cervical spine disease, medication e¤ects, and
Inadequate nutrition negatively a¤ects the ability of senescent changes in the sensorimotor system.
the immune system to fight disease and contributes to the
development of respiratory and cardiac insu‰ciency, the
Evaluation and Treatment
formation of decubitus ulcers, and impaired gastro-
intestinal function. The already comprised patient can There are various methods for studying the swallow. The
become increasingly comprised, which prolongs the hos- choice of method for a particular patient depends on the
pital length of stay and increases medical costs. information that is sought. When a patient is first seen,
Certain clinical signs help to alert health care pro- the assessment begins with a clinical examination (Perl-
viders to the likely presence of dysphagia. Table 1 lists man et al., 1991). The clinical examination provides im-
commonly observed clinical signs that are suggestive of portant information that assists in the decision-making
dysphagia in the adult population. The absence of any or process, but it is not intended to identify the underlying
all of these signs does not indicate that a patient has a variables that result in di‰culty with oral intake. Fur-
safe swallow or that the patient is able to ingest an ade- thermore, this examination provides no information rel-
quate number of calories by mouth to remain properly ative to the pharyngeal stage of the swallow and does
nourished. For example, a diminished or absent gag has not elicit adequate information to determine proper
not been found to distinguish aspirators from nontherapy. Therefore, the clinician will turn to one or more
aspirators (Horner and Massey, 1988). Many of these imaging modalities or other specific techniques.
signs can be indicative of a serious medical illness. Videofluoroscopy is the most frequently used assess-
Therefore, patients who exhibit these signs and who have ment technique because it provides the most complete
not been seen by a physician should be referred for body of information. Interpretation of this examination
medical examination. is performed after observation of no less than two dozen
Although clinical indicators have been found to have events within the oral cavity, pharynx, and larynx. For
a relationship to laryngeal penetration, a significant most patients, this is the only instrumental procedure
number of patients who aspirate do so with no clinical that will be performed.
indication. The incidence of silent aspiration is very Endoscopy permits the examiner to evaluate the
high, and the di‰culty of detecting it is suggested by the status of vocal fold function, the extent of vallecular or
following: (1) Discriminant analysis of 11 clinical indi- pyriform sinus stasis, and the presence of spillover or of
cators resulted in identification of the presence of aspi- delayed initiation of the pharyngeal stage of the swallow
ration in only 66% of patients (Linden, Kuhlemeier, and (Langmore, Schatz, and Olsen, 1988; Aviv et al., 2000).
Patterson, 1993). (2) In a heterogeneous group of 1101 Additionally, the view of velar function is superior to
134 Part II: Speech
that obtained with videofluoroscopy. Information relat- Davalos, A., Ricart, W., Gonzalez-Huix, F., Soler, S., Marru-
ing to the oral stage of deglutition, the extent of eleva- gat, J., Molins, A., et al. (1996). E¤ect of malnutrition after
tion of the hyoid bone, or information on the larynx or acute stroke on clinical outcome. Stroke, 27, 1028–1032.
pharynx during the moment of swallow is not observed Doggett, D., Tappe, K., Mitchell, M., Chapell, R., and Coates,
V. (2001). Prevention of pneumonia in elderly stroke
with endoscopy.
patients by systematic diagnosis and treatment of dyspha-
Ultrasound allows for observation of the motion of gia: An evidence-based comprehensive analysis of the liter-
the tongue (Sonies, 1991). Additionally, the shadow ature. Dysphagia, 16, 279–295.
reflected from the hyoid bone permits the examiner to Finestone, H., Greene-Finestone, L., Wilson, E., and Teasell,
observe and to measure the displacement of the hyoid. R. (1995). Malnutrition in stroke patients on the reha-
The advantages to using ultrasound for assessing the bilitation service and at follow-up: Prevalence and pre-
oral stage of the swallow are the absence of exposure to dictors. Archives of Physical Medicine and Rehabilitation,
ionizing radiation and the fact that the parent can hold 76, 310–316.
an infant or small child and feed the child a familiar Hamdy, S., Aziz, Q., Thompson, D., and Rothwell, J. (2001).
food while the examination is being performed. The in- Physiology and pathophysiology of the swallowing area of
the human motor cortex. Neural Plast., 8, 91–97.
formation obtainable with ultrasound is restricted to the
Hamdy, S., Mikulis, D., Crawley, A., Xue, S., Lau, H., Henry,
oral stage of deglutition. When a small child has a tra- S., et al. (1999). Cortical activation during human volitional
cheotomy tube, it is often extremely di‰cult to obtain a swallowing: An event-related fMRI study. American Jour-
good ultrasound image, because the tracheostomy tube nal of Physiology, 277, G219–G225.
prohibits good transducer placement. Horner, J., and Massey, E. W. (1988). Silent aspiration fol-
Muscle paralysis is best determined with intramuscu- lowing stroke. Neurology, 38, 317–319.
lar electromyography (Cooper and Perlman, 1997; Perl- Langmore, S. E., Schatz, K., and Olsen, N. (1988). Fiberoptic
man et al., 1999). In the examination of swallowing, it is endoscopic examination of swallowing safety: A new pro-
advisable to use bipolar hooked wire electrodes, because cedure. Dysphagia, 2, 216–219.
needle electrodes can cause discomfort and the subject Linden, P., Kuhlemeier, K., and Patterson, C. (1993). The
probability of correctly predicting subglottic penetration
may alter the swallowing pattern.
from clinical observations. Dysphagia, 8, 170–179.
Respirodeglutometry (RDG) is a method for assess- Martin, R. E., Goodyear, B. G., Gati, J. S., and Raiv, S. M.
ing the coordination of respiration and deglutition (2001). Cerebral cortical representation of automatic and
(Perlman, Ettema, and Barkmeier, 2000). This technique volitional swallowing in humans. Journal of Neurophysio-
is presently being investigated to determine the physiology, 85, 938–950.
logical correlates of RDG output and to determine Perlman, A., Ettema, S., and Barkmeier, J. (2000). Respiratory
changes in the respiratory-swallowing pattern as a func- and acoustic signals associated with bolus passage through
tion of age and various medical diagnoses. the pharynx. Dysphagia, 15, 89–94.
Decisions regarding behavioral, medical, or surgical Perlman, A. L., Langmore, S., Milianti, F., Miller, R., Mills,
intervention are made once the evaluation has been H., and Zenner, P. (1991). Comprehensive clinical exami-
nation of orophayngeal swallowing function: Veterans Ad-
completed. Therapeutic intervention is determined as a
ministration procedure. Seminars in Speech and Language,
function of the anatomical and physiological observa- 12, 246–254.
tions that were made during the evaluation process. Perlman, A., Palmer, P., McCulloch, T., and VanDaele, D.
Specific treatments are beyond the scope of this discus- (1999). Electromyographic activity from human submental,
sion but can be found in textbooks listed in Further laryngeal, and pharyngeal muscles during swallowing.
Readings. Journal of Applied Physiology, 86, 1663–1669.
Schmidt, J., Holas, M., Halvorson, K., and Reding, M. (1994).
—Adrienne L. Perlman Videofluoroscopic evidence of aspiration predicts pneumo-
nia and death but not dehydration following stroke. Dys-
References phagia, 9, 7–11.
Smith, C. H., Logemann, J. A., Colangelo, L. A., Rademaker,
Alberts, M. J., Horner, J., Gray, L., and Brazer, S. R. (1992). A. W., and Pauloski, B. R. (1999). Incidence and patient
Aspiration after stroke: Lesion analysis by brain MRI. characteristics associated with silent aspiration in the acute
Dysphagia, 7, 170–173. care setting. Dysphagia, 14, 1–7.
Aviv, J., Kaplan, S., Thomson, J., Spitzer, J., Diamond, B., Sonies, B. (1991). Normal and abnormal swallowing: Imaging in
and Close, L. (2000). The safety of flexible endoscopic diagnosis and therapy. New York: Springer-Verlag.
evaluation of swallowing with sensory testing (FEESST): Splaingard, M. L., Hutchins, B., Sulton, L. D., and Chaund-
An analysis of 500 consecutive evaluations. Dysphagia, 15, huri, G. (1988). Aspiration in rehabilitation patients: Vid-
39–44. eofluoroscopy vs bedside clinical assessment. Archives of
Cooper, D., and Perlman, A. (1997). Electromyography in the Physical Medicine and Rehabilitation, 69, 637–640.
functional and diagnostic testing of deglutition. In A. Perl-
man and K. Schulze-Delrieu (Eds.), Deglutition and its dis- Further Readings
orders. San Diego, CA: Singular Publishing Group.
Daniels, S., and Foundas, A. (1997). The role of the insular Huckabee, M., and Pelletier, C. (1999). Management of adult
cortex in dysphagia. Dysphagia, 12, 146–156. neurogenic dysphagia. San Diego, CA: Singular Publishing
Daniels, S., Foundas, A., Iglesia, G., and Sullivan, M. (1996). Group.
Lesion site in unilateral stroke patients with dysphagia. Logemann, J. (1998). Evaluation and treatment of swallowing
Journal of Stroke and Cerebrovascular Disease, 6, 30–34. disorders (2nd ed.). Austin, TX: Pro-Ed.
Early Recurrent Otitis Media and Speech Development 135
Logemann, J. A. (1999). Behavioral management for orophar- e¤ects, no e¤ects, and small beneficial e¤ects (e.g., Shri-
yngeal dysphagia. Folia Phoniatrica et Logopedica, 51, berg, Friel-Patti, et al., 2000), sometimes within the same
199–212. study. Substantive methodological di¤erences may ac-
Lundy, D. S., Smith, C., Colangelo, L., Sullivan, P. A., Loge- count for much of the disparity in findings, with the
mann, J. A., Lazarus, C. L., et al. (1999). Aspiration: Cause
method by which OME is diagnosed in di¤erent studies
and implications. Otolaryngology–Head and Neck Surgery,
120, 474–478. being critically important. The gold standard for diag-
McHorney, C. A., Bricker, D. E., Robbins, J., Kramer, A. E., nosing OME is an examination of the tympanic mem-
Rosenbek, J. C., Chignell, K. A. (2000). The SWAL-QOL brane via pneumatic otoscopy, after any necessary
outcomes tool for oropharyngeal dysphagia in adults: II. removal of cerumen, to determine whether indicators of
Item reduction and preliminary scaling. Dysphagia, 15, e¤usion such as bulging, retraction, bubbling, or abnor-
122–133. mal mobility are present (Stool et al., 1994; Bluestone
Perlman, A., and Schulze-Delrieu, K. (Eds.). (1997). Degluti- and Klein, 1996b). Behavioral symptoms such as irrita-
tion and its disorders. San Diego, CA: Singular Publishing bility are neither sensitive nor specific to the condition,
Group. and the validity of parental judgments concerning the
Preiksaitis, H., Mayrand, S., Robins, K., and Diamant, N. E.
frequency or duration of episodes of OME is poor even
(1992). Coordination of respiration and swallowing: E¤ect
of bolus volume in normal adults. American Journal of when repeated feedback is provided (Anteunis et al.,
Physiology, 263, R624–R630. 1999). In addition, it is important that OME be docu-
Rosenbeck, J., Robbins, J., Roecker, J., Coyle, J., and Wood, mented prospectively rather than via retrospective re-
J. (1996). A penetration-aspiration scale. Dysphagia, 11, ports or chart reviews, because a substantial percentage
93–98. of apparently healthy and symptom-free children are
Sonies, B. (Ed.). (1997). Instrumental imaging technologies and found to have OME on otoscopic assessment (Bluestone
procedures. Gaithersburg, MD: Aspen. and Klein, 1996a).
A second important di¤erence among studies of
OME and speech development is the extent to which
Early Recurrent Otitis Media and hearing levels are documented. The hypothesis that
Speech Development OME poses a threat to speech or language development
is typically linked to the assumption that e¤usion causes
conductive hearing loss, which prevents children from
Otitis media can be defined as inflammation of the perceiving and processing speech input in the usual
middle ear mucosa, resulting from an infectious process fashion (e.g., K. Roberts, 1997). However, the presence
(Scheidt and Kavanagh, 1986). When the inflammation of e¤usion is a poor predictor of hearing loss. Although
results in the secretion of e¤usion, or liquid, into the hearing thresholds for the majority of children with
middle ear cavity, the terms otitis media with e¤usion MEE fall between 21 and 30 dB (mild to moderate
(OME) and middle ear e¤usion (MEE) are often used. degrees of impairment), thresholds from 0 to 50 dB are
Middle ear e¤usion may be present during the period of not uncommon (Bess, 1986). Hearing thresholds must be
acute inflammation (when it is known as acute otitis measured directly to determine whether OME has e¤ects
media), and it may persist for some time after the acute on development independent of its variable e¤ects on
inflammation has subsided (Bluestone and Klein, hearing (e.g., Shriberg, Friel-Patti, et al., 2000).
1996a). Studies also vary with respect to their ability to sepa-
The prevalence of OME in young children is remark- rate the contribution of OME to poor developmental
ably high. OME has been described as one of the most outcome from the e¤ects of other variables with which
common infectious diseases of childhood (Bluestone and OME is known to be associated, such as sex and socio-
Klein, 1996a) and evidence from several large studies economic status. As noted earlier, OME is significantly
supports this conclusion. For example, in a prospective more prevalent in males than in females, and in children
epidemiologic study of 2253 children enrolled by age 2 from less privileged backgrounds than in their more
months, Paradise et al. (1997) reported that nearly 80% privileged counterparts (Paradise et al., 1997; Peters
of children had at least one episode of OME by 12 et al., 1997). Statistical procedures are necessary to con-
months of age; more than 90% had an episode by age 24 trol for such confounding in order to distinguish the
months. The mean cumulative percentage of days with e¤ects of OME from those of other variables. Several
MEE was 20.4% between 2 and 12 months of age, and recent studies have shown that after controlling for so-
16.6% between 12 and 24 months. Low socioeconomic cioeconomic confounds, OME accounts for little if any
status, male sex, and amount of exposure to other chil- of the variance in developmental outcome measures
dren were associated with an increased prevalence of (e.g., J. E. Roberts et al., 1998; Paradise et al., 2000).
OME during the first 2 years of life (Paradise et al., Finally, studies have also di¤ered substantially in the
1997). measures used to document the outcome variable of
The literature addressing the hypothesis that early speech development and in the extent to which e¤ect
recurrent OME poses a threat to children’s speech and sizes for significant di¤erences on outcome measures are
language development is large and contentious (for reported (cf. Casby, 2001). No accepted standard metric
reviews, see Stool et al., 1994; Shriberg, Flipsen, et al., for speech delay or disorder currently exists, although
2000). OME has been reported to result in adverse the Speech Disorders Classification System developed by
136 Part II: Speech
Shriberg et al. (1997) represents an important advance findings, given that increased risk was not found across
toward meeting this need. Instead, the e¤ects of OME all speech metrics and that confidence intervals for risk
on speech have been sought on a wide range of articu- estimates were wide. In addition, the results of structural
latory and phonological measures, not all of which are equation modeling suggested that hearing loss did not
known to be predictive of eventual speech outcome (e.g., operate directly to lower speech performance, but rather
Rvachew et al., 1999). was mediated significantly by language performance,
When these cautions are borne in mind, the literature providing another indication of the need for multi-
on early recurrent otitis media and speech development factorial approaches to identifying the factors and
suggests converging evidence that OME in and of itself pathways involved in normal and abnormal speech
has a negligible relationship to early speech develop- development.
ment. Several prospective investigations have shown Although the best available current evidence sug-
little or no relationship between cumulative duration of gests that OME itself does not represent a significant risk
otitis media (documented otoscopically) and measures of to speech development in otherwise healthy children,
speech production in otherwise healthy children. In a the question of whether OME may contribute inde-
longitudinal study of 55 low-SES children, J. E. Roberts pendently to outcome when it occurs in conjunction with
et al. (1988) found no correlation between OME and other risk factors or health conditions (e.g., Wallace
number of consonant errors or phonological processes et al., 1996; J. E. Roberts et al., 1998; Shriberg, Flipsen,
on a single-word test of articulation at ages 3, 4, 5, 6, 7, et al., 2000) remains open. Additional investigations that
or 8 years. Shriberg, Friel-Patti, et al. (2000) examined include prospective otoscopic diagnosis of OME; fre-
ten speech measures derived from spontaneous speech quent and independent assessments of hearing; valid,
samples obtained from 70 otherwise healthy, middle reliable assessments of both medical and sociodemo-
to upper middle class 3-year-olds who were classified graphic risk factors; and a multifactorial analytic strat-
according to the number of episodes of OME from 6 to egy will be needed to answer this question.
18 months of age; only one significant speech di¤erence See also otitis media: effects on children’s
was found, in which the group with more OME para- language.
doxically obtained higher intelligibility scores than the
group with fewer bouts of OME. Paradise et al. (2000) —Christine Dollaghan and Thomas Campbell
likewise found no relationship between cumulative du-
ration of MEE and scores on the Percentage of Con-
sonants Correct–Revised measure (PCC-R; Shriberg,
References
1993) in 241 sociodemographically diverse children at Anteunis, L. J. C., Engel, J. A. M., Hendriks, J. J. T., and
age 3 years. Paradise et al. (2001) reported that PCC-R Manni, J. J. (1999). A longitudinal study of the validity of
scores from children with even more persistent MEE parental reporting in the detection of otitis media and re-
from 2 to 36 months of age did not di¤er significantly lated hearing impairment in infancy. Audiology, 38, 75–82.
from those of children with the less persistent levels of Bess, F. H. (1986). Audiometric approaches used in the identi-
fication of middle ear disease in children. In J. F. Kavanagh
e¤usion reported by Paradise et al. (2000). Further, (Ed.), Otitis media and child development (pp. 70–82).
children with persistent and substantial MEE who were Parkton, MD: York Press.
randomly assigned to undergo prompt tympanostomy Bluestone, C. D., and Klein, J. O. (1996a). Otitis media, ate-
tube placement had no better PCC-R scores at age 3 lectasis, and eustachian tube dysfunction. In C. D. Blue-
than children who underwent tube placement after a stone, S. E. Stool, and M. A. Kenna (Eds.), Pediatric
delay of 6–9 months, during which their MEE persisted otolaryngology (3rd ed., vol. 1, pp. 388–582). Philadelphia:
(Paradise et al., 2001). These findings of little or no Saunders.
relationship between OME and speech development Bluestone, C. D., and Klein, J. O. (1996b). Methods of exami-
mirror those of several recent reports showing negli- nation: Clinical examination. In C. D. Bluestone, S. E.
gible associations between early OME and later oral Stool, and M. A. Kenna (Eds.), Pediatric otolaryngology
(3rd ed., vol. 1, pp. 150–164). Philadelphia: Saunders.
and written language performance (Peters et al., 1997; Casby, M. W. (2001). Otitis media and language development:
Casby, 2001). A meta-analysis. American Journal of Speech-Language
By contrast with these negative findings concerning Pathology, 10, 65–80.
the impact of OME, several studies in which hearing was Paradise, J. L., Dollaghan, C. A., Campbell, T. F., Feldman,
documented showed poorer speech outcomes for chil- H. M., Bernard, B. S., Colborn, D. K., et al. (2000). Lan-
dren with elevated hearing thresholds. In a sample of 70 guage, speech sound production, and cognition in three-
middle to upper middle class 3-year-olds who received year-old children in relation to otitis media in their first
otoscopic evaluations every 6 weeks and hearing evalu- three years of life. Pediatrics, 105, 1119–1130.
ations every 6 months between 6 and 18 months of age, Paradise, J. L., Feldman, H. M., Campbell, T. F., Dollaghan,
Shriberg, Friel-Patti, et al. (2000) reported that children C. A., Colborn, D. K., Bernard, B. S., et al. (2001). E¤ect
of early or delayed insertion of tympanostomy tubes for
with hearing loss, defined as average thresholds b20 dB persistent otitis media on developmental outcomes at the
(HL) during one evaluation between 6 and 18 months of age of three years. New England Journal of Medicine, 344,
age, had a significantly increased risk of scoring more 1179–1187.
than 1.3 standard deviations below the sample mean on Paradise, J. L., Rockette, H. E., Colborn, D. K., Bernard,
several percentage-consonants-correct metrics. Shriberg B. S., Smith, C. G., Kurs-Lasky, M., et al. (1997). Otitis
et al. note the need for some caution in interpreting these media in 2253 Pittsburgh-area infants: Prevalence and
Laryngectomy 137
risk factors during the first two years of life. Pediatrics, 9, mental, and educational considerations. Baltimore: Paul H.
318–333. Brookes.
Peters, S. A. F., Grievink, E. H., van Bon, W. H. J., van den Teele, D. W., Klein, J. O., Chase, C., Menyuk, P., and Rosner,
Bercken, J. H. L., and Schilder, A. G. M. (1997). The B. A. (1990). Otitis media in infancy and intellectual ability,
contribution of risk factors to the e¤ect of early otitis school achievement, speech, and language at age 7 years.
media with e¤usion on later language, reading and spelling. Journal of Infectious Diseases, 162, 685–694.
Developmental Medicine and Child Neurology, 39, 31–39. Teele, D. W., Klein, J. O., Rosner, B. A., and the Greater
Roberts, J. E., Burchinal, M. R., Koch, M. A., Footo, M. M., Boston Otitis Media Study Group. (1984). Otitis media with
and Henderson, F. W. (1988). Otitis media in early child- e¤usion during the first three years of life and development
hood and its relationship to later phonological develop- of speech and language. Pediatrics, 74, 282–287.
ment. Journal of Speech and Hearing Disorders, 53, 416–424.
Roberts, J. E., Burchinal, M. R., Zeisel, S. A., Neebe, E. C.,
Hooper, S. R., Roush, J., et al. (1998). Otitis media, the Laryngectomy
caregiving environment, and language and cognitive out-
comes at 2 years. Pediatrics, 102, 346–354.
Roberts, K. (1997). A preliminary account of the e¤ects of Total laryngectomy is a surgical procedure to remove
otitis media on 15-month-olds’ categorization and some
implications for early language learning. Journal of Speech, the larynx. Located in the neck, where it is commonly
Language, and Hearing Research, 40, 508–518. referred to as the Adam’s apple, the larynx contains the
Rvachew, S., Slawinski, E. B., Williams, M., and Green, C. L. vocal folds for production of voice for speech. Addi-
(1999). The impact of early onset otitis media on babbling tionally, the larynx serves as a valve during swallowing
and early language development. Journal of the Acoustical to prevent food and liquids from entering the airway and
Society of America, 105, 467–475. lungs. When a total laryngectomy is performed, the pa-
Scheidt, P. C., and Kavanagh, J. F. (1986). Common termi- tient loses his or her voice and must breathe through an
nology for conditions of the middle ear. In J. F. Kavanagh opening created in the neck called a tracheostoma.
(Ed.), Otitis media and child development (pp. xv–xvii). Total laryngectomy is usually performed to remove
Parkton, MD: York Press.
advanced cancers of the larynx, most of which arise from
Shriberg, L. D. (1993). Four new speech and prosody-voice
measures for genetics research and other studies in devel- prolonged smoking or a combination of tobacco use and
opmental phonological disorders. Journal of Speech and alcohol consumption. Laryngeal cancers account for less
Hearing Research, 36, 105–140. than 1% of all cancers. About 10,000 new cases of la-
Shriberg, L. D., Austin, D., Lewis, B. A., McSweeny, J. L., ryngeal cancer are diagnosed each year in the United
and Wilson, D. L. (1997). The Speech Disorders Classifica- States, with a male-female ratio approximately 4 to 1
tion System (SDCS): Extensions and lifespan reference data. (American Cancer Society, 2000). The Surveillance,
Journal of Speech, Language, and Hearing Research, 40, Epidemiology and End Results (SEER) program of the
723–740. National Cancer Institute (Ries et al., 2000) reports that
Shriberg, L. D., Flipsen, P., Thielke, H., Kwiatkowski, J., laryngeal cancer rates rise sharply in the fifth, sixth, and
Kertoy, M. K., Katcher, M. L., et al. (2000). Risk for
first half of the seventh decades of life (Casper and Col-
speech disorder associated with early recurrent otitis media
with e¤usion: Two retrospective studies. Journal of Speech, ton, 1998). The typical person diagnosed with cancer
Language, and Hearing Research, 43, 79–99. of the larynx is a 60-year-old man who is a heavy smoker
Shriberg, L. D., Friel-Patti, S., Flipsen, P., and Brown, R. L. with moderate to heavy alcohol intake (Casper and
(2000). Otitis media, fluctuant hearing loss, and speech- Colton, 1998). Symptoms of laryngeal cancer vary,
language outcomes: A preliminary structural equation depending on the exact site of the disease, but persistent
model. Journal of Speech, Language, and Hearing Research, hoarseness is common. Other signs include lowered
43, 100–120. pitch, sore throat, a lump in the throat, a lump in the
Stool, S. E., Berg, A. O., Berman, S., et al. (1994, July). Otitis neck, earache, di‰culty swallowing, coughing, di‰culty
media with e¤usion in young children: Clinical practice breathing, and audible breathing (National Cancer In-
guideline No. 12 (AHCPR Publication No. 94-0622). Rock-
stitute, 1995). It is estimated that there are 50,000 lar-
ville, MD: Agency for Health Care Policy and Research.
Wallace, I. F., Gravel, J. S., Schwartz, R. G., and Ruben, R. J. yngectomees (laryngectomized people) living in the
(1996). Otitis media, communication style of primary care- United States today.
givers, and language skills of 2 year olds: A preliminary As a treatment of laryngeal cancer, total laryngec-
report. Developmental and Behavioral Pediatrics, 17, 27–35. tomy is a proven technique to control disease. The pri-
mary disadvantages of total laryngectomy are the loss of
Further Readings the vocal folds that produce voice for speech and the
need for a permanent tracheostomy for breathing. Be-
Creps, C. L., and Vernon-Feagans, L. (2000). Infant daycare fore the introduction of the extended partial laryngec-
and otitis media: Multiple influences on children’s later de- tomy, patients with cancer of the larynx were treated
velopment. Journal of Applied Developmental Psychology, primarily with total laryngectomy (Weber, 1998). To-
21, 357–378.
Friel-Patti, S., and Finitzo, T. (1990). Language learning in a
day, early and intermediate laryngeal cancers can be
prospective study of otitis media with e¤usion in the first cured with conservation operations that preserve voice,
two years of life. Journal of Speech and Hearing Research, swallowing, and nasal breathing, and total laryngectomy
33, 188–194. is performed only in cases of very advanced cancers that
Roberts, J. E., Wallace, I. F., and Henderson, F. W. (Eds.). are bilateral, extensive, and deeply invasive (Pearson,
(1997). Otitis media in young children: Medical, develop- 1998). Radiation therapy is often administered before or
138 Part II: Speech
after total laryngectomy. In addition, radiation therapy tomized person can make speech movements with the
alone and sometimes in combination with chemotherapy tongue and lips as before the surgery, but without voice.
has proved to be curative treatment for laryngeal cancer, This silent speech is commonly referred to as ‘‘mouthing
depending on the site and stage of the disease (Chyle, words’’ or ‘‘whispering’’; however, unlike a normal
1998). Controversy and research continue over nonsur- whisper, air from the lungs does not move through
gical versus surgical intervention or a combination of the mouth after a laryngectomy. The e¤ectiveness of
these for advanced laryngeal cancer, weighing the issues the technique is variable and depends largely on the
of survival, preservation of function, and quality of life laryngectomee’s ability to precisely articulate speech
(Weber, 1998). movements and the ability of others to recognize or
A person with laryngeal cancer and the family mem- ‘‘read’’ them.
bers have many questions about survival, treatment Artificial larynges have been used since the first
options, and the long-term consequences and outcomes recorded laryngectomy in 1873 (Billroth and Gussen-
of various treatments. An otolaryngologist is the physi- bauer, 1874). Speech with an artificial larynx, also
cian who usually diagnoses cancer of the larynx and known as an electrolarynx, can be an e¤ective method of
provides information about possible surgical interven- communicating after laryngectomy, and many people
tions. A radiation oncologist is a physician consulted for can use one of these instruments as early as a day or two
opinions about radiation and chemotherapy approaches after surgery. Most modern instruments are battery
to management. If the patient decides to have a total powered and produce a mechanical tone. Usually the
laryngectomy, a speech pathologist meets with the pa- device is pressed against the neck or under the chin at a
tient and family before the operation to provide in- location where it produces the best sound, and the per-
formation on basic anatomy and physiology of normal son articulates this ‘‘voice’’ into speech. If the neck is too
breathing, swallowing, and speaking, and how these will swollen after surgery or the skin is hard as a result of
change after removal of the larynx (Keith, 1995). Also, radiation therapy, the tone of the artificial larynx may
the patient is informed that, after a period of recovery not be conducted into the throat su‰ciently for produc-
and rehabilitation, and with a few modifications, most tion of speech. In this circumstance it may be possible to
laryngectomized people return to the same vocational, use an oral artificial larynx with a plastic tube to place
home, and recreational activities they participated in the tone directly into the mouth, where it is articulated
prior to the laryngectomy. into speech. Speech with an artificial larynx has a sound
Besides voicelessness, laryngectomized persons expe- quality that is mechanical, yet a person who uses an
rience other changes. Since the nasal and oral tracts artificial larynx well can produce intelligible speech in
filter the air as well as provide moisture and warmth in practically all communication situations, including over
normal breathing, laryngectomees often require an envi- the telephone. Most laryngectomized people require
ronment with increased humidity, and they may wear training by a speech pathologist to use an artificial
heat- and moisture-exchanging filters over their trache- larynx optimally.
ostoma to replicate the functions of nasal and oral A laryngectomized person may be able to learn to use
breathing (Grolman and Schouwenberg, 1998). There is esophageal voice, also known as esophageal speech. For
no concern for aspiration of food and liquids into the this method, commonly known as ‘‘burp speech,’’ the
lungs after total laryngectomy, because the respiratory person learns to use the esophagus (food tube) to pro-
and digestive tracts are completely separated and no duce voice. First the laryngectomee pumps or sucks air
longer share the pharynx as a common tract. Unless the into the esophagus. Sound or ‘‘voice’’ is generated as
tongue is surgically altered or extensive pharyngeal or the air trapped in the upper esophagus moves back up
esophageal reconstruction beyond total laryngectomy is through the narrow junction of the pharynx and esoph-
performed, most laryngectomees return to a normal diet agus known as the PE segment. Then the voice is
and have few complaints about swallowing other than articulated into speech by the tongue and lips.
that it may require additional e¤ort (Logemann, Pau- A key to producing successful esophageal speech is
loski, and Rademaker, 1997). getting air into the esophagus consistently and e‰-
There are nonspeech methods of communicating that ciently, followed by immediate sound production for
can be used immediately after total laryngectomy. These speech. Esophageal speech has distinct advantages over
include writing on paper or on a slate, pointing to letters other alaryngeal speech techniques. The esophageal
or words or pictures on a speech or communication speaker requires no special equipment or devices, and
board, gesturing with pantomimes that are universally the speaker’s hands are not monopolized during conver-
recognizable, using e-mail, typing on portable keyboards sation. A significant disadvantage of esophageal speech
or speech-generating devices, and using life-line emer- is that it takes a relatively long time to learn to produce
gency telephone monitoring systems. None of these voice that is adequate for everyday speech purposes.
methods of communicating is as e‰cient or as personal Additionally, insu‰cient loudness and a speaking rate
as one’s own speech. that is usually slower than before laryngectomy are
A common fear of the laryngectomee is that he or she common concerns of esophageal speakers. Although
will never be able to speak without vocal folds. There some become excellent esophageal speakers, many do
are several methods of alaryngeal (without a larynx) not attain a level of fluent speech su‰cient for all com-
speech. Immediately or soon after surgery, a laryngec- municative situations.
Laryngectomy 139
Tracheoesophageal puncture with a voice prosthesis is People who undergo total laryngectomy experience
another method of alaryngeal voice production (Singer the same emotions of shock, fear, stress, loss, depres-
and Blom, 1980; Blom, 1998). During the surgery to re- sion, and grief as others with life-threatening illnesses.
move the larynx or at a later time, the surgeon makes an Along with regular medical follow-up to monitor for
opening (puncture) just inside and inferior to the supe- possible recurrence of cancer and to review all the body
rior edge of the tracheostoma. The opening is a tract systems, laryngectomized persons may benefit from re-
through the posterior wall of the trachea and the ante- ferral to other professionals and resources for psycho-
rior wall of the esophagus. Usually a catheter is placed logical, marital, nutritional, rehabilitation, and financial
in the opening and a prosthesis is placed a few days concerns.
later. A speech pathologist specially trained in trache- The International Association of Laryngectomees
oesophageal voice restoration measures the length of the and the American Cancer Society provide services to
tract between the trachea and the esophagus, and a sili- laryngectomized persons. They sponsor peer support
cone tube with a one-way valve—a voice prosthesis— groups, provide speech therapy, and distribute educa-
is placed in the puncture site. The prosthesis is non- tional materials on topics of interest, such as car-
permanent and must be replaced periodically. It does not diopulmonary resuscitation for neck breathers, smoking
generate voice itself. When the person exhales and the cessation, and specialized products and equipment for
tracheostoma is covered with a thumb, finger, or special laryngectomized persons.
valve, air from the lungs moves up the trachea, through See also alaryngeal voice and speech rehabilita-
the prosthesis, into the upper esophagus, and through tion.
the PE segment to produce voice. Because lung air is
used to produce voice with a tracheoesophageal punc- —Jack E. Thomas
ture, the speech characteristics of pitch and loudness, References
rate, phrasing, and timing more closely resemble the
laryngectomee’s presurgical speech qualities than can be American Cancer Society. (2000). Cancer facts and figures
2000. Washington, DC: Author.
achieved with other forms of alaryngeal speech. For
Billroth, C. A. T., and Gussenbauer, C. (1874). Über die erste
many laryngectomees, fluent speech can be achieved durch Th. Billroth am Menschen ausgeführte Kehlkopf Ex-
soon after placement of a voice prosthesis. tirpation und die Anwendung eines kunstlichen Kehlkopes.
There are disadvantages associated with tracheoe- Archiv fur die Klinische Chirurgie, 17, 343–356. Cited by
sophageal puncture. The laryngectomee may dislike R. L. Keith and J. C. Shanks in Historical highlights: Lar-
using a thumb or finger to cover the tracheostoma when yngectomee rehabilitation. In R. L. Keith and F. L. Darley
speaking, and use of a tracheostoma valve for hands- (Eds.), Laryngectomee rehabilitation (pp. 1–48). Austin,
free speaking may not be possible. Expenses associated TX: Pro-Ed.
with tracheoesophageal puncture include those for initial Blom, E. D. (1998). Evolution of tracheoesophageal voice
training in the use and maintenance of the prosthesis prosthesis. In E. D. Blom, M. I. Singer, and R. C. Hamaker
(Eds.), Tracheoesophageal voice restoration following total
with a speech pathologist and subsequent clinical visits laryngectomy (pp. 1–8). San Diego, CA: Singular Publish-
for modification or replacement of the voice prosthesis, ing Group.
and ongoing costs of prosthesis-related supplies. If the Casper, J. K., and Colton, R. H. (1998). Clinical manual for
PE segment is hypertonic and the tracheoesophageal laryngectomy and head/neck cancer rehabilitation (2nd ed.).
voice is not satisfactorily fluent for conversation, or if it San Diego, CA: Singular/Thomson Learning.
requires considerable e¤ort to produce, injection of bot- Chyle, V. (1998). Toxicity of primary radiotherapy for early
ulinum neurotoxin, commonly known as Botox, may be stage cancers of the larynx. In T. Robbins and T. Murry
required (Ho¤man and McCulloch, 1998; Lewin et al., (Eds.), Head and neck cancer: Organ preservation, function,
2001), or myotomy of the pharyngeal constrictor mus- and rehabilitation (pp. 19–21). San Diego, CA: Singular
cles may be considered (Hamaker and Chessman, 1998). Publishing Group.
Grolman, W., and Schouwenberg, P. F. (1998). Post-
Historically, laryngectomees and speech pathologists laryngectomy airway humidification and air filtration. In
have felt strongly about one form of alaryngeal speech E. D. Blom, M. I. Singer, and R. C. Hamaker (Eds.), Trache-
being superior to others. In the 1960s, newly laryngec- oesophageal voice restoration following total laryngectomy
tomized persons were discouraged from using artificial (pp. 109–121). San Diego, CA: Singular Publishing Group.
larynges, which were thought to delay or interfere with Hamaker, R. C., and Cheesman, A. D. (1998). Surgical man-
the learning of esophageal speech (Lauder, 1968). Today agement of pharyngeal constrictor muscle hypertonicity.
some think tracheoesophageal speech is superior because In E. D. Blom, M. I. Singer, and R. C. Hamaker (Eds.),
many laryngectomees are able to speak fluently and Tracheoesophageal voice restoration following total laryn-
fairly naturally with this method only a few weeks after gectomy (pp. 33–39). San Diego, CA: Singular Publishing
surgery. Others maintain that esophageal speech, with Group.
Ho¤man, H. T., and McCulloch, T. M. (1998). Botulinum
no reliance on a prosthesis or other devices, is the gold neurotoxin for tracheoesophageal voice failure. In E. D.
standard against which all other methods should be Blom, M. I. Singer, and R. C. Hamaker (Eds.), Trache-
compared (Stone, 1998). Most believe any form of oesophageal voice restoration following total laryngectomy
speech after laryngectomy is acceptable and should be (pp. 83–87). San Diego, CA: Singular Publishing Group.
encouraged, since speaking is a fundamental and essen- Keith, R. L. (1995). Looking forward: A guidebook for the laryn-
tial part of being human. gectomee (3rd ed.). New York: Thieme.
140 Part II: Speech
Lauder, E. (1968). The laryngectomee and the artificial larynx. United States is 7.8 people per thousand; if institution-
Journal of Speech and Hearing Disorders, 33, 147–157. alized individuals are included in the prevalence rates,
Lewin, J. S., Bishop-Leone, J. K., Forman, A. D., and Diaz, the number increases to 8.73 per thousand. Mental re-
E. M. (2001). Further experience with Botox injection tardation is associated with limitations in learning and in
for tracheoesophageal speech failure. Head and Neck, 23,
the ability to communicate, and has a profound e¤ect on
456–460.
Logemann, J. A., Pauloski, B. R., and Rademaker, A. W. a child’s ability to learn to talk. At one time it was
(1997). Speech and swallowing rehabilitation for head and believed that the language acquisition of all persons with
neck cancer patients. Oncology, 11, 651–659. mental retardation represented a slow-motion version of
National Cancer Institute. (1995). What you need to know normal language development. This hypothesis has two
about cancer of the larynx (NIH Publication No. 95-1568). major flaws: first, patterns of language development vary
Bethesda, MD: Author. across types of mental retardation, and second, within a
Pearson, B. W. (1998). Surgical options in laryngeal cancer. In single type of mental retardation, there is considerable
T. Robbins and T. Murry (Eds.), Head and neck cancer: heterogeneity.
Organ preservation, function, and rehabilitation (pp. 37–44). The majority of research on children with mental re-
San Diego, CA: Singular Publishing Group.
tardation has involved children with Down syndrome
Ries, L. A. G., Eisner, M. P., Kosary, C., Hankey, B. F.,
Miller, B. A., Clegg, L., et al. (Eds.). (2000). SEER cancer (or trisomy 21). This syndrome is the most common
statistics review, 1973–1997. Bethesda, MD: National Can- genetic cause of mental retardation, occurring in ap-
cer Institute. proximately one out of every 800 births. Because Down
Singer, M. I., and Blom, E. D. (1980). An endoscopic tech- syndrome is identifiable at birth, researchers have been
nique for restoration of voice after laryngectomy. Annals of able to trace developmental patterns from the first
Otology, Rhinology, and Laryngology, 89, 529–533. months of life. The development of speech and language
Stone, R. E. (1998). Oral communication after laryngectomy. is severely a¤ected in children with Down syndrome,
In T. Robbins and T. Murry (Eds.), Head and neck cancer: with levels lower than would be expected, given mental
Organ preservation, function, and rehabilitation (pp. 59–71). age (Miller, 1988). Speech intelligibility is compromised
throughout the life span because of problems with artic-
Weber, R. S. (1998). Advanced laryngeal cancer: Defining
the issues. In T. Robbins and T. Murry (Eds.), Head and ulation, prosody, and voice.
neck cancer: Organ preservation, function, and rehabilita- Children with Down syndrome di¤er from the normal
tion (pp. 33–36). San Diego, CA: Singular Publishing population in respect to a variety of anatomical and
Group. physiological features that may a¤ect speech production.
These features include di¤erences in the vocal cords, the
Further Readings presence of a high palatal vault and a larger than normal
tongue in relation to the oral cavity, weak facial muscles,
Doyle, P. C. (1994). Foundations of voice and speech rehabili- and general hypotonicity. Although the precise e¤ect of
tation following laryngeal cancer. San Diego, CA: Singular
these di¤erences is di‰cult to determine, they undoubt-
Publishing Group.
Graham, M. S. (1997). The clinician’s guide to alaryngeal edly influence speech-motor development and thus the
speech therapy. Boston: Butterworth-Heinemann. articulatory and phonatory abilities of children with
Keith, R. L., and Thomas, J. E. (1995). A handbook for the Down syndrome. An additional factor a¤ecting the
laryngectomee (4th ed.). Austin, TX: Pro-Ed. speech of children with Down syndrome is fluctuating
Rosenbaum, E. H., and Rosenbaum, I. (1998). Cancer sup- hearing loss associated with otitis media and middle ear
portive care: A comprehensive guide for patients and their pathologies.
families. Toronto: Somerville House Books. Fragile X syndrome, the most common known cause
Salmon, S. J., and Mount, K. H. (Eds.). (1998). Alaryngeal of inherited mental retardation (Down syndrome is
speech rehabilitation: For clinicians by clinicians (2nd ed.). more common but is not inherited), has an estimated
Austin, TX: Pro-Ed.
prevalence of approximately one per 1250 in males and
Shanks, J. C., and Stone, R. E. (Eds.). (1983). Help employ
laryngectomized persons. Stillwater, OK: National Clear- one per 2500 in females, with males exhibiting more se-
inghouse of Rehabilitation Disorders, Oklahoma State vere e¤ects. Little research has been done on the speech
University. of young children with fragile X syndrome. Available
reports on older children (Abbeduto and Hagerman,
1997) indicate abnormalities in articulatory develop-
ment, disfluncies, and the presence of atypical rate and
rhythm. These abnormalities may be attributed, in part,
Mental Retardation and Speech in to di¤erences in the structure and function of the oral-
Children motor systems of boys with fragile X syndrome, in-
cluding excessive drooling, hypotonia involving the
oral-facial muscles, and the presence of a narrow, high-
Mental retardation is defined by the American Associa- arched palate. Like their peers with Down syndrome,
tion on Mental Retardation as significantly subaverage children with fragile X syndrome have a high incidence
intellectual functions with related limitations in social of otitis media and intermittent hearing loss.
and behavioral skills. According to the most recent esti- Autism is a developmental disorder with prevalence
mates (Larson et al., 2001), the prevalence of mental re- estimates ranging from two to five per 10,000 (3 : 1
tardation in the noninstitutionalized population of the males). This disorder is characterized by deficits in social
Mental Retardation and Speech in Children 141
interaction, communication, and play; two out of three populations, primarily fricatives, a¤ricates, and liquids,
children with autism are mentally retarded (Pennington also pose di‰culties for children with mental retarda-
and Bennetto, 1998). Although in phonetic form, the tion. Among nonretarded children acquiring English,
prelinguistic vocalizations are like those of nonretarded correct pronunciation of all phonemes is achieved by the
infants, social communication skills in the prelinguistic age of 8 years. Some reports suggest that the phonolo-
period are atypical. About 50% of autistic children fail gies of children with Williams syndrome may be rela-
to develop spoken language; the other 50% exhibit tively adultlike by the (chronological) age of 8.
delays in acquiring language, although not to the same In contrast, individuals with Down syndrome, even
extent as children with Down syndrome do. Speech when they have a mental age of 8, exhibit many articu-
production is characterized by echolalia and abnormal lation errors. Moreover, comparisons of phonological
prosody (see autism). development in three populations matched for mental
Williams syndrome, a genetic disorder that includes age, Down syndrome, non-Down syndrome with mental
mental retardation, is relatively rare, occurring in one in retardation, and typically developing, revealed a greater
25,000 live births. One of the most striking aspects of number and variety of error types in the children with
Williams syndrome is that, in spite of marked impair- Down syndrome (Dodd, 1976). A persistent problem in
ments in cognition, linguistic skills appear to be rela- children with Down syndrome, is that their speech is
tively normal (Bellugi, Lai, and Wang, 1997; Mervis and hard to understand (Kumin, 1994). Parents report low
Bertrand, 1997). This dissociation of language and cog- levels of intelligibility through adolescence as a result of
nition underscores the importance of examining the re- speech sound errors, rate of speech, disfluencies, abnor-
lationship between mental retardation and speech in a mal voice quality, and unusual voice quality. There is
variety of mentally retarded populations. some indication that children with fragile X syndrome
The foundations for speech development are laid in also su¤er from low levels of intelligibility (Abbeduto
the first year of life, with the emergence of nonmean- and Hagerman, 1997).
ingful vocal types that serve as precursors for the pro- For many children with mental retardation, delays
duction of words and phrases. Of particular importance in the acquisition of speech and language may serve
is the production of consonant-vowel syllables, such as as the first indication of a cognitive delay (except for
[baba], which generally appear around age 6–7 months. Down syndrome, which is easily diagnosed at birth).
Phonetically, these ‘‘canonical’’ babbles are similar or Parents may be the first to raise concerns about atypical
even identical to the forms used in first words; thus, the patterns of development, and pediatricians and social
production [mama] may be a nonmeaningful babble at 8 workers should be aware of the link between linguistic
months and a word at 14 months. The di¤erence is rec- and cognitive development. Once mental retardation
ognition of the sound-meaning relationships that are the has been confirmed, assessment typically adheres to
basis for words. In general, prelinguistic vocal develop- traditional practices in speech-language pathology. In
ment of infants with mental retardation resembles that the prelinguistic period, which may be quite protracted
of their nonretarded peers in terms of types of vocal- for some children, assessment is initially based on
izations and schedule for emergence. Infants with retar- unstructured observations and parental report. If lan-
dation begin to produce canonical babble within the guage is slow to emerge, it is important to assess hear-
normal time frame or with minor delays. ing and oral-motor function. More formal assessments
Despite the nearly normal onset of canonical babble, are done in two ways: by means of standardized tests
however, the emergence of words is often delayed among that focus on the individual sounds and structures of
infants with mental retardation, particularly those with a predetermined set of words (i.e., a normed articula-
Down syndrome (Stoel-Gammon, 1997). Research sug- tion test) and by analyzing samples of conversational
gests great variability among children in this domain, speech to determine intelligibility and overall speech
with a few reports of word use in the second year of characteristics.
life for a few children with Down syndrome but the ma- Recommendations for the treatment of speech deficits
jority showing first words appearing between 30 and 60 in children with mental retardation range from inter-
months. The magnitude of the delay cannot be easily vention directed toward underlying causes such as hear-
predicted from the degree of retardation. Moreover, ing loss and deficits in speech-motor skills (Yarter, 1980)
once words appear, vocabulary growth is relatively slow. to programs aimed at modifying parent-to-child speech
Whereas nonretarded children have a vocabulary of in order to provide optimal input in the face of delayed
250 words at 24 months, this milestone is not reached language acquisition. Most phonological interventions
until the age of 4–6 years for most children with Down focus on increasing the phonetic repertoire and reducing
syndrome. the number of errors, using therapy techniques similar to
In terms of phonemic development, acquisition pat- those for children with phonological delay or disorder.
terns for children with mental retardation are similar to In some cases, therapy may occur at home, with the
those documented for nonretarded children (Rondal and parents, as well as in the clinic (Dodd and Leahy, 1989;
Edwards, 1997). In the early stages, words are ‘‘sim- Cholmain, 1994).
plified’’ in terms of their structure: consonant clusters are See also communication skills of people with
reduced to single consonants, unstressed syllables are down syndrome; mental retardation.
deleted, and consonants at the ends of words may be
omitted. Phonemes that are later-acquired in normal —Carol Stoel-Gammon
142 Part II: Speech
References broader than and encompasses that of developmental

apraxia of speech (DAS), which refers specifically to
Abbeduto, L., and Hagerman, R. J. (1997). Language and impaired planning, or praxis. Classically, developmental
communication in fragile X syndrome. Mental Retardation
speech production deficits have been categorized as ei-
and Developmental Disabilities Research Reviews, 3, 313–
322. ther phonological or DAS. However, recent empirical
Bellugi, U., Lai, Z., and Wang, P. (1997). Language, com- evidence suggests that a wider range of children (e.g.,
munication, and neural systems in Williams syndrome. those with specific language impairment [SLI] or incon-
Mental Retardation and Developmental Disabilities Research sistent speech errors) may exhibit deficits that are influ-
Reviews, 3, 334–342. enced by motor variables and, in these cases, may be
Cholmain, C. N. (1994). Working on phonology with young classified as motor speech involved.
children with Down syndrome. Journal of Clinical Speech Although the underlying causes of motor speech
and Language Studies, 1, 14–35. involvement are unclear, there is general evidence that
Dodd, B. J. (1976). A comparison of the phonological systems motor and cognitive deficits often co-occur (Diamond,
of mental age matched normal, severely abnormal and
Down’s syndrome children. British Journal of Disorders of
2000). Neurophysiological findings support the inter-
Communication, 11, 27–42. action of cognitive and motor development, most nota-
Dodd, B. J., and Leahy, P. (1989). Phonological disorders and bly in common brain mechanisms in the lateral
mental handicap. In M. Beveridge, G. Conti-Ramsden, and perisylvian cortex, the neocerebellum, and the dorso-
I. Leudar (Eds.), Language and communication in mentally lateral prefrontal cortex (Diamond, 2000; Hill, 2001).
handicapped people (pp. 33–56). London: Chapman and Apparently, speech motor and language domains co-
Hall. develop and mutually influence one another across
Kumin, L. (1994). Intelligibility of speech in children with development.
Down syndrome in natural settings: Parents’ perspective. In late infancy, basic movement patterns observed in
Perceptual and Motor Skills, 78, 307–313. babbling are linked to emerging intents and words
Larson, S. A., Lakin, K. C., Anderson, L., Kwak, N., Lee,
J. H., and Anderson, D. (2001). Prevalence of mental re-
(de Boysson-Bardis and Vihman, 1991; Levelt, Roelofs,
tardation and developmental disabilities: Estimates from and Meyer, 1999). At this level, it is apparent how lan-
the 1994/1995 National Health Interview Survey Disability guage and motor levels constrain one another. How-
Supplements. American Journal on Mental Retardation, 106, ever, the relations between language and motor levels in
231–252. later periods of development have not been specified.
Mervis, C. B., and Bertrand, J. (1997). Developmental rela- Language models include categories such as concepts,
tions between cognition and language: Evidence from Wil- semantics, syntax, and phonology (Levelt, Roelofs, and
liams syndrome. In L. B. Adamson and M. A. Romski Meyer, 1999). Motor systems are discussed in the very
(Eds.), Communication and language acquisition: Discoveries di¤erent terms of cortical inputs to pattern generators in
from atypical development (pp. 75–106). Baltimore: Paul H. the brainstem, which in turn provide inputs to motor
Brookes.
Miller, J. F. (1998). The developmental asynchrony of lan-
neuron pools for the generation of muscle activity. Sen-
guage development in children with down syndrome. In sory feedback is also a necessary component of motor
L. Nadel (Ed.), The psychobiology of Down syndrome (pp. systems (A. Smith, Go¤man, and Stark, 1995). Although
167–198). Cambridge, MA: MIT Press. it is established that motor and language domains both
Pennington, B. F., and Bennetto, L. (1998). Toward a neuro- show a protracted developmental time course, speech
psychology of mental retardation. In J. A. Burack, R. M. production models are not explicit about the nature of
Hodapp, and E. Zigler (Eds.), Handbook of mental retar- the linkages. The general view is that increasingly com-
dation and development (pp. 80–114). Cambridge, UK: plex linguistic structures are linked to increasingly com-
Cambridge University Press. plex movements in the course of development. Motor
Rondal, J., and Edwards, S. (1997). Language in mental retar- speech deficits occur when movement variables interfere
dation. London: Whurr.
Stoel-Gammon, C. (1997). Phonological development in Down
with the acquisition of speech and language production.
syndrome. Mental Retardation and Developmental Dis- A large range of speech and language characteristics
abilities Research Reviews, 3, 300–306. have been reported in children diagnosed with motor
Yarter, B. H. (1980). Speech and language programs for the speech disorders. In the following summary, emphasis
Down’s population. Seminars in Speech, Language and is placed on those that are at least partially motor in
Hearing, 1, 49–61. origin.
Variability. Children with motor speech disorders

Motor Speech Involvement in Children have been reported to produce highly variable errors,
even across multiple productions of the same word
(Davis, Jakielski, and Marquardt, 1998). When the defi-
Motor speech involvement of unknown origin is a rela- cit involves movement planning, imitation and repeti-
tively new diagnostic category that is applied when tion may not aid performance (Bradford and Dodd,
children’s speech production deficits are predominantly 1996). Although variability is observed in speech motor
linked to sensorimotor planning, programming, or exe- (A. Smith and Go¤man, 1998) and phonetic output of
cution (Caruso and Strand, 1999). The disorder occurs in young children who are normally developing, it is ex-
the absence of obvious neuromotor causes and often treme and persistent in disordered children. Usually,
includes concomitant language deficits. This category is variability is discussed as a phonetic error type. How-
Motor Speech Involvement in Children 143
ever, kinematic analysis of lip and jaw movement reveals babbling is hypothesized to consist of jaw oscillation
that children with SLI show movement output that is without independent control of the lips and tongue
less stable than that of their normally developing peers, (MacNeilage and Davis, 2000). More complex syllable
even when producing an accurate phonetic segment structures probably require increased movement control,
(Go¤man, 1999). Thus, both phonological and motor such as the homing movement for final consonant pro-
factors may contribute. Deficits in planning and imple- duction (Kent, 1992).
menting spatially and temporally organized movements
may influence the acquisition of stable phonological Prosodic Movement Organization and Sequencing. One
units (Hall, Jordan, and Robin, 1993). major aspect of motor development that has been
emphasized in motor speech disorders is rhythmicity.
Duration. Increased movement durations are a hall- Rhythmicity is thought to have origins in prelinguistic
mark of immature motor systems (B. L. Smith, 1978; babbling (and, perhaps, in early stereotypic movements,
Kent and Forner, 1980). In children with motor speech such as kicking and banging objects) (e.g., Thelen and
involvement, the slow implementation of movement may Smith, 1994). Rhythmicity underlies the prosodic struc-
lead to decreased performance on a nonlinguistic dia- ture of speech, which is used to convey word and sen-
dochokinetic task (Crary, 1993) as well as increased tence meaning as well as a¤ect. Children with motor
error rates on longer and more complex utterances. An speech disorders display particular deficits in prosodic
additional error type that may also be related to timing aspects of speech. Shriberg and his colleagues (Shriberg,
is poor movement coordination across speech subsys- Aram, and Kwiatkowski, 1997) found that a significant
tems. Such timing deficits in articulatory and laryngeal proportion of children diagnosed with DAS demon-
coordination may lead to voicing and nasality errors. strated errors characterized by even or misplaced stress
Hence, these errors may have origins in movement in their spontaneous speech. In a study using direct
planning and implementation. A decreased speech rate measures of lip and jaw movement during the produc-
provides the child with time to process, plan, and im- tion of di¤erent stress patterns, Go¤man (1999) reported
plement movement (Hall, Jordan, and Robin, 1993), that children with a diagnosis of SLI, who also demon-
but it may also negatively influence speech motor strated speech production and morphological errors,
performance. were poor at producing large and small movements
sequentially across di¤erent stress contexts. For exam-
Phonetic Movement Organization and Sequencing. As ple, in the problematic weak-strong prosodic sequence,
they develop, children produce increasingly di¤erenti- these children had di‰culty producing small move-
ated speech movements, both within and across articu- ments corresponding to unstressed syllables. Overall, the
latory, laryngeal, and respiratory subsystems (Gibbon, control of movement for the production of stress is a
1999; Moore, 2001). A lack of di¤erentiated and co- frequently cited deficit in children with motor speech
ordinated movement leads to a collapsing of phonetic disorders.
distinctions. It follows that segmental and syllabic in-
ventories are reduced for children with motor speech General Motor Development. In the clinical literature,
deficits (Davis, Jakielski, and Marquardt, 1998). Vowel general neuromotor status has long been implicated as
and consonant errors may be considered in reference contributing to even relatively subtle speech and lan-
to articulatory complexity. Vowel production requires guage deficits (Morris and Klein, 1987). Empirical
highly specified movements of the tongue and jaw (Pol- studies have provided evidence that aspects of gross and
lock and Hall, 1991). Consonant sounds that are early- fine motor (e.g., peg moving, gesture imitation) perfor-
developing and that are most frequently seen in the mance are below expected levels in children with vari-
phonetic inventories of children with motor speech defi- able speech errors, DAS (Bradford and Dodd, 1996),
cits make relatively few demands on the motor system and many diagnosed with SLI (Bishop and Edmundson,
(Hall, Jordan, and Robin, 1993). Kent (1992) suggests 1987; Hill, 2001). Such findings suggest that many
that early-developing stop consonants such as [b] and [d] speech production disorders include a general motor
are produced with rapid, ballistic movements. Fricatives component.
require fine force control and are acquired later. Liquids, As is apparent, an understanding of speech motor
which require highly controlled tongue movements, are contributions to the acquisition of speech and language
learned quite late in the developmental process. Using is in its infancy. However, it is clear that intervention
electropalatography, Gibbon (1999) has provided direct approaches for these children need to incorporate motor
evidence that children with speech deficits contact the as well as language components. Although e‰cacy
entire palate with the tongue, not just the anterior re- studies are scarce, several investigators have proposed
gion, in their production of alveolar consonants. Such techniques for the treatment of motor speech disorders
data indicate that motor control of di¤erentiated tongue in children. Although the emphasis has been on DAS,
movements has not developed in these children. Overall, these approaches could be tailored to more general
as proposed by Kent (1992), motor variables account for motor speech deficits. Major approaches to intervention
many aspects of the developmental sequence frequently have focused on motor programming (Hall, Jordan,
reported in speech- and language-impaired children. and Robin, 1993) and tactile-kinesthetic and rhythmic
Syllable shapes may also be influenced by motor (Square, 1994) deficits. Hierarchical language organiza-
factors. The earliest consonant-vowel structure seen in tion has also been emphasized, supporting the intimate
144 Part II: Speech
links between linguistic and movement variables (Velle- Levelt, W., Roelofs, A., and Meyer, A. (1999). A theory of
man and Strand, 1994). lexical access in speech production. Behavioral and Brain
New models of speech and language development Sciences, 22, 1–75.
are needed that integrate motor and language variables MacNeilage, P. F., and Davis, B. L. (2000). On the origin of
internal structure of word forms. Science, 288, 527–531.
in a way that is consistent with recent neurophysiological
Moore, C. A. (2001). Physiologic development of speech pro-
and behavioral evidence. Further, new methods of reduction. In B. Maasen, W. Hulstijn, R. Kent, H. F. M.
cording respiratory, laryngeal, and articulatory behav- Peters, and P. H. M. M. van Lieshout (Eds.), Speech motor
iors of infants and young children during the production control in normal and disordered speech (pp. 40–43). Nijme-
of meaningful linguistic activity should provide crucial gen, Netherlands: Uitgeverij Vantilt.
data for understanding how language and motor com- Morris, S. E., and Klein, M. D. (1987). Pre-feeding skills.
ponents of development interact across normal and dis- Tucson, AZ: Therapy Skill Builders.
ordered development. Such tools should also help Pollock, K., and Hall, P. (1991). An analysis of vowel
answer questions about appropriate interventions for misarticulations of five children with developmental
children whose deficits are influenced by atypical motor apraxia of speech. Clinical Linguistics and Phonetics, 5,
207–224.
control processes.
Shriberg, L. D., Aram, D. M., and Kwiatkowski, J. (1997).
See also developmental apraxia of speech. Developmental apraxia of speech: III. A subtype marked
—Lisa Go¤man by inappropriate stress. Journal of Speech, Language, and
Smith, A., and Go¤man, L. (1998). Stability and patterning of
References speech movement sequences in children and adults. Journal
of Speech, Language, and Hearing Research, 41, 18–30.
Bishop, D. V. M., and Edmundson, A. (1987). Specific lan- Smith, A., Go¤man, L., and Stark, R. E. (1995). Speech motor
guage impairment as a maturational lag: Evidence from development. Seminars in Speech and Language, 16, 87–99.
longitudinal data on language and motor development. Smith, B. L. (1978). Temporal aspects of English speech pro-
Developmental Medicine and Child Neurology, 29, 442– duction: A developmental perspective. Journal of Phonetics,
459. 6, 37–67.
Bradford, A., and Dodd, B. (1996). Do all speech-disordered Square, P. A. (1994). Treatment approaches for developmental
children have motor deficits? Clinical Linguistics and Pho- apraxia of speech. Journal of Communication Disorders, 4,
netics, 10, 77–101. 151–161.
Caruso, A. J., and Strand, E. A. (1999). Clinical management Thelen, E., and Smith, L. B. (1994). A dynamic systems
of motor speech disorders in children. New York: Thieme. approach to the development of cognition and action. Cam-
Crary, M. A. (1993). Developmental motor speech disorders. bridge, MA: MIT Press.
San Diego, CA: Singular Publishing Group. Velleman, S., and Strand, C. (1994). Developmental verbal
Davis, B. L., Jakielski, K. J., and Marquardt, T. M. (1998). dyspraxia. In J. Bernthal and N. Bankson (Eds.), Child
Developmental apraxia of speech: Determiners of di¤er- phonology: Characteristics, assessment, and intervention with
ential diagnosis. Clinical Linguistics and Phonetics, 12, 25– special populations. New York: Thieme.
45.
de Boysson-Bardis, B., and Vihman, M. M. (1991). Adaptation Further Readings
to language: Evidence from babbling and first words in four
languages. Language, 67, 297–318. Bernhardt, B., and Stemberger, J. P. (Eds.). (1998). Hand-
Diamond, A. (2000). Close interaction of motor development book of phonological development from the perspective of
and cognitive development and of the cerebellum and pre- constraint-based nonlinear phonology. San Diego, CA: Aca-
frontal cortex. Child Development, 71, 44–56. demic Press.
Gibbon, F. E. (1999). Undi¤erentiated lingual gestures in chil- Dodd, B. (1995). Di¤erential diagnosis and treatment of children
dren with articulation/phonological disorders. Journal of with speech disorder. San Diego, CA: Singular Publishing
Speech, Language, and Hearing Research, 42, 382–397. Group.
Go¤man, L. (1999). Prosodic influences on speech produc- Finan, D. S., and Barlow, S. M. (1998). Intrinsic dynamics and
tion in children with specific language impairment and mechanosensory modulation of non-nutritive sucking in
speech deficits. Journal of Speech, Language, and Hearing human infants. Early Human Development, 52, 181–197.
Research, 42, 1499–1517. Green, J. R., Moore, C. A., Higashikawa, M., and Steeve,
Hall, P. K., Jordan, L. S., and Robin, D. A. (1993). Devel- R. W. (2000). The physiologic development of speech motor
opmental apraxia of speech: Theory and clinical practice. control: Lip and jaw coordination. Journal of Speech, Lan-
Austin, TX: Pro-Ed. guage, and Hearing Research, 43, 239–255.
Hill, E. L. (2001). Non-specific nature of specific language Hayden, D. A., and Square, P. A. (1999). The Verbal Motor
impairment: A review of the literature with regard to con- Production Assessment for Children. San Antonio, TX: Psy-
comitant motor impairments. International Journal of Lan- chological Corp.
guage and Communication Disorders, 36, 149–171. Hodge, M. (1995). Assessment of children with developmental
Kent, R. D. (1992). The biology of phonological development. apraxia of speech: A rationale. Clinics in Communication
In C. A. Ferguson, L. Menn, and C. Stoel-Gammon (Eds.), Disorders, 4, 91–101.
Phonological development: Models, research, implications Locke, J. (2000). Movement patterns in spoken language.
(pp. 65–90). Timonium, MD: York Press. Science, 288, 449–450.
Kent, R. D., and Forner, L. L. (1980). Speech segment dura- Robbins, J., and Klee, T. (1987). Clinical assessment of oro-
tions in sentence recitations by children and adults. Journal pharyngeal motor development in young children. Journal
of Phonetics, 8, 157–168. of Speech and Hearing Research, 52, 272–277.
Mutism, Neurogenic 145
Shriberg, L. D., Aram, D. M., and Kwiatkowski, J. (1997). occasionally unilateral left or right anterior cerebral
Developmental apraxia of speech: I. Descriptive and theo- artery occlusion with involvement of the anterior cingu-
retical markers. Journal of Speech, Language, and Hearing late gyrus or supplementary motor area is frequently
Research, 40, 273–285. implicated (Nicolai, van Putten, and Tavy, 2001). Re-
Statholpoulos, E. T. (1995). Variability revisited: An acoustic,
cent data suggest that the critical areas are the portions
aerodynamic, and respiratory kinematic comparison of
children and adults during speech. Journal of Phonetics, 23, of the medial frontal lobes immediately anterior to the
67–80. supplementary motor area and portions of the anterior
Strand, E. A. (1995). Treatment of motor speech disorders in cingulate gyrus above the most anterior body of the
children. Seminars in Speech and Language, 16, 126–139. corpus callosum. These regions appear to be involved in
Velleman, S., and Shriberg, L. D. (1999). Metrical analysis of gating intention (plans of action) (Picard and Strick,
speech in children with suspected developmental apraxia of 1996; Cohen et al., 1999). Lesions of the globus pallidus,
speech. Journal of Speech, Language, and Hearing Research, thalamus, and other subcortical structures can also result
42, 1444–1460. in AM. Schi¤ and Plum (2000) advocate for a com-
panion syndrome of ‘‘hyperkinetic mutism’’ resulting
most frequently from bilateral temporal, parietal, and
occipital junction involvement in which the patient is
Mutism, Neurogenic speechless but moving. The cause may be any nervous
system–altering condition, including degenerative dis-
eases such as Creutzfeldt-Jacob disease (Otto et al.,
Mutism is speechlessness. It can be neurologic or be- 1998), that alters what Schi¤ and Plum (2000) posit
havioral. Neurogenic mutism is a sign and can result to be a series of corticostriatopallidal-thalamocortical
from many developmental or acquired nervous system (CSPTC) loops. CSPTC loops are critical to triggering
diseases and conditions. It usually accompanies other or initiating vocalization (Mega and Cohenour, 1997)
signs, but in rare cases it appears in isolation. All and to the drive or will to speak. AM is to be di¤er-
levels of the neuroaxis from the brainstem to the cortex entiated from persistent vegetative state, which reflects
have been implicated. Damage to any of the putative extensive damage to all cerebral structures, most criti-
processes critical to speech, including intention, motor cally the thalamus, with preservation of brainstem func-
programming, and execution, as well as linguistic and tion (Kinney et al., 1994).
prelinguistic processes have been invoked to explain
mutism’s appearance. A reasonably traditional review of Mutism in Aphasia. Mutism can be a feature of severe
the syndromes and conditions of which mutism is a fre- global aphasia. Patients with severe anomia, most often
quent part and traditional and emerging explanations in relation to thalamic lesions, may initially exhibit no
for mutism’s appearance are o¤ered here. capacity for spontaneous language and little for naming,
but they can repeat. Certain types of transcortical motor
Definition. Du¤y defines mutism traditionally as ‘‘the aphasia (Alexander, Benson, and Stuss, 1989), most
absence of speech’’ (1995, p. 282). Von Cramon (1981) particularly the adynamic aphasia of Luria (1970), may
adds the inability to produce nonverbal utterances. be associated with complete absence of spontaneous
Lebrun (1990) requires normal or relatively preserved speech. However, these patients are reasonably fluent
comprehension. Gelabert-Gonzalez and Fernandez-Villa during picture description, and this syndrome likely
(2001) require ‘‘unimpaired consciousness’’ (p. 111). reflects a prelinguistic disorder involving defective spon-
Each of these definitions has strengths, but none domi- taneous engagement of concept representations (Gold
nates the literature. Therefore, the literature can be a bit et al., 1997).
of a muddle. The literature is also challenging because of
the mutism population’s heterogeneity. One response to Mutism in Apraxia. Immediately after stroke, a pro-
this heterogeneity has been to classify mutism according found apraxia of speech (called aphemia and by a vari-
to relatively homogeneous subtypes. ety of other names in the world’s literature) can cause
Traditionally, groupings of mute patients have been mutism, as can primary progressive apraxia of speech.
organized according to the putative pathophysiology In the acute stage of stroke, the mutism is thought to
(Turkstra and Bayles, 1992), by syndrome, by etiology, signal an apraxia of phonation. The hypothesis is that
or by a mix of syndrome and medical etiology (Lebrun, mutism due to apraxia reflects a profound failure of
1990; Du¤y, 1995). This last approach guides the orga- motor programming.
nization of the following discussion.
Mutism in Dysarthria. Mutism can be the final stage
Akinetic Mutism. Akinetic mutism (AM) is a syn- of dysarthria (anarthria) in degenerative diseases such
drome of speechlessness and general akinesia that exists as amyotrophic lateral sclerosis, slowly progressive an-
in the context of residual sensory, motor, and at least arthria (Broussolle et al., 1996), olivopontocerebellar
some cognitive integrity and a normal level of arousal. atrophy, Parkinson’s disease, Shy-Drager syndrome,
The designation abulic state may be a synonym (Du¤y, striatonigral degeneration, and progressive supranuclear
1995), as are apallic state and coma vigil. Persons with palsy (Nath et al., 2001). Speech movements are im-
AM often are silent, despite pain or threat. Bilateral and possible because of upper and lower motor neuron
146 Part II: Speech
destruction. Cognitive changes may hasten the mutism Alzheimer’s disease and vascular dementia. Acta Neuro-
in some of these degenerative diseases. Anarthric mutism logica Scandinavica, 103, 367–378.
can be present at onset and chronically in locked-in Broussole, E., Bakchine, S., Tommasi, M., Laurent, B., Bazin,
syndrome (Plum and Posner, 1966) and the syndrome of B., Cinotti, L., et al. (1996). Slowly progressive anarthria
with late anterior opercular syndrome: A variant of frontal
bilateral infarction of opercular motor cortex. Relatively
cortical atrophy syndromes. Journal of the Neurological
recently, a syndrome beginning with mutism and evolv- Sciences, 144, 44–58.
ing to dysarthria, called temporary mutism followed by Cohen, R. A., Kaplan, R. F., Moser, D. J., Jenkins, M. A.,
dysarthria (TMFD) (Orefice et al., 1999) or mutism and and Wilkinson, H. (1999). Impairments of attention after
subsequent dysarthria syndrome (MSD) (Dunwoody, cingulotomy. Neurology, 53, 819–824.
Alsago¤, and Yuan, 1997), has been described. Ponto- Du¤y, J. R. (1995). Motor speech disorders: Substrates, di¤er-
mesencephalic stroke is one cause. ential diagnosis, and management. St. Louis: Mosby.
Dunwoody, G. W., Alsago¤, Z. S., and Yuan, S. Y. (1997).
Cerebellar mutism with subsequent dysarthria in an
Mutism in Dementia. Mutism has been reported in
adult: Case report. British Journal of Neurosurgery, 11,
Alzheimer’s disease, cerebrovascular dementia, and most 161–163.
frequently and perhaps earliest in frontotemporal de- Gelabert-Gonzalez, M., and Fernandez-Villa, J. (2001).
mentia (Bathgate et al., 2001). It can also occur in other Mutism after posterior fossa surgery: Review of the litera-
corticosubcortical degenerative diseases, including corti- ture. Clinical Neurology and Neurosurgery, 103, 111–114.
cobasal degeneration. Its occurrence in the late stages Gold, M., Nadeau, S. E., Jacobs, D. H., Adair, J. C.,
of these conditions is predictable, based on the hierar- Gonzalez-Rothi, L. J., and Heilman, K. M. (1997). Ady-
chical organization of cognitive, linguistic, and speech namic aphasia: A transcortical motor aphasia with defec-
processes. When cognitive processes are absent or se- tive semantic strategy formation. Brain and Language, 57,
verely degraded, speech does not occur. 374–393.
Kinney, H. C., Korein, J., Panigrahy, A., Dikkes, P., and
Goode, R. (1994). Neuropathological findings in the brain
Mutism Post Surgery. Mutism can occur after neuro- of Karen Ann Quinlan: The role of the thalamus in the
surgery (Pollack, 1997; Si¤ert et al., 2000). So-called persistent vegetative state. New England Journal of Medi-
cerebellar mutism can result from posterior fossa sur- cine, 21, 1469–1475.
gery, for example. It is hypothesized that disruption of Lebrun, Y. (1990). Mutism. London: Whurr.
connections between the cerebellum, thalamus, and sup- Luria, A. R. (1970). Traumatic aphasia: Its syndromes, psy-
plementary motor area causes impaired triggering of chology and treatment. The Hague, Paris: Mouton.
Mega, M. S., and Cohenour, R. C. (1997). Akinetic
vocalization (Gelabert-Gonzalez and Fernandez-Villa,
mutism: Disconnection of frontal-subcortical circuits. Neu-
2001). Mutism may also occur after callosotomy (Suss- ropsychiatry, Neuropsychology, and Behavioral Neurology,
man et al., 1983), perhaps because of damage to frontal 10, 254–259.
lobe structures and the cingulate gyrus. Nath, U., Ben-Shlomo, Y., Thomson, R. G., Morris, H. R.,
Wood, N. W., Lees, A. J., et al. (2001). The prevalence
Mutism in Traumatic Brain Injury. Mutism is frequent of progressive supranuclear palsy (Steele-Richardson-
in traumatic brain injury. Von Cramon (1981) called Olszewski syndrome) in the UK. Brain, 124, 1438–1449.
it the ‘‘traumatic midbrain syndrome.’’ He speculated Nicolai, J., van Putten, M. J. A. M., and Tavy, D. L. J. (2001).
BIPLEDs in akinetic mutism caused by bilateral anterior
that the mechanism is ‘‘temporary inhibition of neu- cerebral artery infarction. Clinical Neurophysiology, 112,
ral activity within the brain stem vocalization center’’ 1726–1728.
(p. 804) within the pontomesencephalic area. Often Orefice, G., Fragassi, N. A., Lanzillo, R., Castellano, A., and
mutism is followed by a period of whispered speech in Grossi, D. (1999). Transient muteness followed by dysarth-
this population. ria in patients with pontomesencephalic stroke. Cerebro-
vascular Diseases, 9, 124–126.
Summary. Speech depends on myriad general and spe- Otto, A., Zerr, I., Lantsch, M., Weidehaas, K., Riedmann, C.,
and Poser, S. (1998). Akinetic mutism as a classification
cific cognitive, motor, and linguistic processes. These
criterion for the diagnosis of Creutzfeldt-Jakob disease.
processes are widely distributed in the nervous system. Journal of Neurology, Neurosurgery, and Psychiatry, 64,
However, the frontal lobes and their connections to 524–528.
subcortical and brainstem structures are the most criti- Picard, N., and Strick, P. L. (1996). Motor areas of the medial
cal. Mutism, therefore, is common, but not inevitable, as wall: A review of their location and functional activation.
an early, late, or chronic sign of damage, regardless of Cerebral Cortex, 6, 342–353.
type, to these mechanisms. Plum, F., and Posner, J. (1966). Diagnosis of stupor and coma.
Philadelphia: Davis.
—John C. Rosenbek Pollack, I. F. (1997). Posterior fossa syndrome. International
Review of Neurobiology, 41, 411–432.
References Schi¤, N. D., and Plum, F. (2000). The role of arousal and
‘‘gating’’ systems in the neurology of impaired conscious-
Alexander, M. P., Benson, D. F., and Stuss, D. T. (1989). ness. Journal of Clinical Neurophysiology, 17, 438–452.
Frontal lobes and language. Brain and Language, 37, Si¤ert, J., Poussaint, T. Y., Goumnerova, L. C., Scott, R. M.,
656–691. LaValley, B., Tarbell, N. J., et al. (2000). Neurologi-
Bathgate, D., Snowden, J. S., Varma, A., Blackshaw, A., and cal dysfunction associated with postoperative cerebellar
Neary, D. (2001). Behavior in frontotemporal dementia, mutism. Journal of Neuro-Oncology, 48, 75–81.
Orofacial Myofunctional Disorders in Children 147
Sussman, N. M., Gur, R. C., Gur, R. E., and O’Connor, M. J. Weller, M. (1993). Anterior opercular cortex lesions cause dis-
(1983). Mutism as a consequence of callosotomy. Journal of sociated lower cranial nerve palsies and anarthria but no
Neurosurgery, 59, 514–519. aphasia: Foix-Chavney-Marie syndrome and ‘‘automatic
Turkstra, L. S., and Bayles, K. A. (1992). Acquired mutism: voluntary dissociation’’ revisited. Journal of Neurology, 240,
Physiopathy and assessment. Archives of Physical Medicine 199–208.
and Rehabilitation, 73, 138–144. Van der Nerwe, A. (1997). A theoretical framework for the
von Cramon, D. (1981). Traumatic mutism and the subsequent characterization of pathological speech sensorimotor con-
reorganization of speech functions. Neuropsychologia, 19, trol. In M. R. McNeil (Ed.), Clinical management of sen-
801–805. sorimotor speech disorders. New York: Thieme.
Further Readings
Bannur, U., and Rajshekhar, V. (2000). Post operative supple-
mentary motor area syndrome: Clinical features and out- Orofacial Myofunctional Disorders in
come. British Journal of Neurosurgery, 14, 204–210.
Bogen, J. E. (1976). Linguistic performance in the short term Children
following cerebral commissurotomy. In H. A. Whitaker and
H. Whitaker (Eds.), Perspectives in neurolinguistics and
psycholinguistics (vol. 2, pp. 193–224). New York: Aca- Orofacial myology is the scientific and clinical knowl-
demic Press. edge related to the structure and function of the muscles
Brun, A., Englund, E., Gustafson, L., Passant, U., Mann, of the mouth and face (orofacial muscles) (American
D. M. A., Neary, D., et al. (1994). Clinical and neuro- Speech-Language-Hearing Association [ASHA], 1993).
pathological criteria for frontotemporal dementia. Journal Orofacial myofunctional disorders are characterized by
of Neurology, Neurosurgery, and Psychiatry, 57, 416–418. abnormal fronting of the tongue during speech or swal-
Cairns, H., Oldfield, R. C., Pennybacker, J. D., and Whitter- lowing, or when the tongue is at rest. ASHA defines
idge, O. (1941). Akinetic mutism with an epidermoid cyst of an orofacial myofunctional disorder as ‘‘any pattern
III ventricle. Brain, 64, 273–290. involving oral and/or orofacial musculature that inter-
Catsman-Berrevoets, C. E., Van Dongen, H. R., Mulder,
feres with normal growth, development, or function of
P. G. H., Paz y Geuze, D., Paquier, P. F., and Lequin,
M. H. (1999). Tumor type and size are high risk factors for structures, or calls attention to itself ’’ (ASHA, 1993,
the syndrome of ‘‘cerebellar’’ mutism and subsequent dys- p. 22). With orofacial myofunctional disorders, the
arthria. Journal of Neurology, Neurosurgery, and Psychia- tongue moves forward in an exaggerated way and may
try, 67, 755–757. protrude between the upper and lower teeth during
Damasio, A. R., and Geschwind, N. (1984). The neural basis speech, swallowing, or at rest. This exaggerated tongue
of language. Annual Review of Neuroscience, 7, 127–147. fronting is also called a tongue thrust or a tongue thrust
Daniel, S. E., de Bruin, V. M., and Lees, A. J. (1995). The swallow and may contribute to malocclusion, lisping, or
clinical and pathological spectrum of Steele-Richardson- both (Young and Vogel, 1983; ASHA, 1989).
Olszewski syndrome (progressive supranuclear palsy): A A tongue thrust type of swallow is normal for infants.
reappraisal. Brain, 118, 759–770.
Kertesz, A., Martinez-Lage, P., Davidson, W., and Munoz,
The forward tongue posture typically diminishes as the
D. G. (2000). The corticobasal degeneration syndrome child grows and matures. Orofacial myofunctional dis-
overlaps progressive aphasia and frontotemporal dementia. orders may also be due to lip incompetence, which is
Neurology, 55, 1368–1375. a ‘‘lips-apart resting posture or the inability to achieve
Minagar, A., and David, N. J. (1999). Bilateral infarction in a lips-together resting posture without muscle strain’’
the territory of the anterior cerebral arteries. Neurology, 52, (ASHA, 1993, p. 22). During normal development, the
886–888. lips are slightly separated in children. With orofacial
Pasquier, F., Lebert, F., Lavenu, I., and Guillaume, B. (1999). myofunctional disorders, a lips-apart posture persists.
The clinical picture of frontotemporal dementia: Diagnosis Orofacial myofunctional disorders may be due to a
and follow-up. Dementia and Geriatric Cognitive Disorders, familial genetic pattern that determines the size of the
10, 10–14.
Penfield, W., and Roberts, L. (1959). Speech and brain mecha-
mouth, the arrangement and number of teeth, and the
nisms. Princeton, NJ: Princeton University Press. strength of the lip, tongue, mouth, or face muscles
Penfield, W., and Welch, K. (1951). The supplementary motor (Hanson and Barrett, 1988). Environmental factors such
area of the cerebral cortex. Archives of Neurology, 66, 289– as allergies may also lead to orofacial myofunctional
317. disorders. For example, an open mouth posture may
Rekate, H. L., Grubb, R. L., Aram, D. M., Hahn, J. F., and result from blocked nasal airways due to allergies or
Ratcheson, R. A. (1985). Muteness of cerebellar origin. enlarged tonsils and adenoids. The open-mouth breath-
Archives of Neurology, 42, 697–698. ing pattern may persist even after medical treatment for
Shuper, A., Stahl, B., and Mimouni, M. (2000). Transient the blocked airway. Other environmental causes of oro-
opercular syndrome: A manifestation of uncontrolled epi- facial myofunctional disorders may be excessive thumb
leptic activity. Acta Neurologica Scandinavica, 101, 335–338.
Steele, J. C., Richardson, J. C., and Olszewski, J. (1964).
or finger sucking, excessive lip licking, teeth clenching,
Progressive supranuclear palsy. Archives of Neurology, 10, and grinding (Van Norman, 1997; Romero, Bravo, and
333–359. Perez, 1998). Thumb sucking, for example, may change
Ure, J., Faccio, E., Videla, H., Caccuri, R., Giudice, F., Ollari, the shape of a child’s upper and lower jaw and teeth,
J., et al. (1998). Akinetic mutism: A report of three cases. requiring speech, dental, and orthodontic intervention
Acta Neurologica Scandinavica, 98, 439–444. (Umberger and Van Reenen, 1995; Van Norman, 1997).
148 Part II: Speech
The severity of the problem depends on how long the are identified with orofacial myofunctional disorders.
habit is maintained. Before speech and swallowing treatment is initiated,
Typically, a team of professionals, including a dentist, medical treatment may be necessary if the airway is
orthodontist, physician, and speech-language patholo- blocked due to enlarged tonsils and adenoids or aller-
gist, is involved in the assessment and treatment of chil- gies. Excessive and persistent oral habits, such as thumb
dren with orofacial myofunctional disorders (Benkert, and finger sucking or lip biting, may also need to be
1997; Green and Green, 1999; Paul-Brown and Clausen, eliminated or reduced before speech and swallowing
1999). Assessment is conducted to diagnose normal and treatments are initiated.
abnormal parameters of oral myofunctional patterns Some speech and swallowing treatment techniques
(ASHA, 1997). The dentist focuses on the e¤ect of pres- include
sure of the tongue against the gums; this kind of tongue
pressure may interfere with the normal process of tooth
Increasing awareness of mouth and facial muscles.
eruption. An orthodontist may be involved when the
Increasing awareness of mouth and tongue postures.
tongue pressure interferes with alignment of the teeth
Completing an individualized oral muscle exercise
and jaw. A physician needs to verify that an airway program to improve muscle strength and coordination.
obstruction is not causing the tongue thrust. Speech- Treatment strategies may include alternation of tongue
language pathologists assess and treat swallowing dis- and lip resting postures and muscle retraining exercises
orders, speech disorders, or lip incompetence that result (ASHA, 1997, p. 69).
from orofacial myofunctional disorders. As with all
Establishing normal speech articulation.
other assessment and treatment processes, speech-
Establishing normal swallowing patterns. Treatment
language pathologists need to have the appropriate strategies may include modification of handling and
training, education, and experience to practice in the swallowing of solids, liquids, and saliva (ASHA, 1997,
area of orofacial myofunctional disorders (ASHA, 2002). p. 69).
An orofacial myofunctional assessment is typically The expected outcome of treatment is to improve
prompted by referral or a failed speech screening for a or correct the patient’s orofacial myofunctional swal-
child older than 4 years of age. Assessment should be lowing and speech patterns. Orofacial myofunctional
based on orofacial myofunctional abilities and educa- treatment may be conducted concurrently with speech
tion, vocation, social, emotional, health, and medical treatment.
status. An orofacial myofunctional assessment by a Oral myofunctional treatment is e¤ective in modify-
speech-language pathologist typically includes the fol- ing tongue and lip posture and movement and in im-
lowing procedures (ASHA, 1997, p. 54): proving dental occlusion and a dental open bite or
Case history overbite (Christensen and Hanson, 1981; ASHA, 1991;
Review of medical/clinical health history and status Benkert, 1997). Lip exercises may be successful in treat-
(including any structural or neurological abnormalities) ing an open-mouth posture (ASHA, 1989; Pedrazzi,
Observation of orofacial myofunctional patterns 1997). A combination treatment approach, with a focus
Instrumental diagnostic procedures on speech correction as well as exercises to treat tongue
Structural assessment, including observation of the posture and swallowing patterns, appears to be the
face, jaw, lips, tongue, teeth, hard palate, soft palate, optimal way to improve speech and tongue thrust
and pharynx (Umberger and Johnston, 1997). The length of treat-
Perceptual and instrumental measures to assess oral ment varies according to the severity of the disorder,
and nasal airway functions as they pertain to oro- the age and maturity of the patient, and the timing of
facial myofunctional patterns and/or speech produc- treatment in relation to orthodontia. Typically 14–20
tion (e.g., speech articulation testing, aerodynamic sessions or more may occur over a period of 3 months to
measures) a year (ASHA, 1989). The value of early treatment is
emphasized in the literature (Pedrazzi, 1997; Van Nor-
Speech may be una¤ected by orofacial myofunctional man, 1997).
disorders (Khinda and Grewal, 1999). However, some ASHA has identified the basic content areas to be
speech sound errors, called speech misarticulations, covered in university curricula to promote competency
may be causally related to orofacial myofunctional dis- in the assessment and treatment of orofacial myo-
orders. The sounds most commonly a¤ected by orofacial functional disorders (ASHA, 1989, p. 92), including the
myofunctional disorders include s, z, sh, zh, ch, and j. following:
Sound substitutions (e.g., th for s, as in ‘‘thun’’ for
‘‘sun’’) or sound distortions may occur. A weak tongue 1. Oral-facial-pharyngeal structure, development, and
tip may result in di‰culties producing the sounds t, d, n, function
and l. 2. Interrelationships among oral-vegetative functions
Speech-language pathologists evaluate speech sound and adaptations, speech, and dental occlusion, using
errors resulting from orofacial myofunctional disorders, interdisciplinary approaches
as well as lip incompetence and swallowing disorders 3. Nature of atypical oral-facial patterns and their rela-
(ASHA, 1991). The assessment information is used to tionship to speech, dentition, airway competency, and
develop appropriate treatment plans for individuals who facial appearance
Orofacial Myofunctional Disorders in Children 149
4. Relevant theories such as those involving oral-motor Benkert, K. K. (1997). The e¤ectiveness of orofacial myofunc-
control and dental malocclusion tional therapy in improving dental occlusion. International
5. Rationale and procedures for assessment of oral Journal of Orofacial Myology, 23, 35–46.
myofunctional patterns, and observation and partici- Christensen, M., and Hanson, M. (1981). An investigation of
the e‰cacy of oral myofunctional therapy precursor to
pation in the evaluation and treatment of patients
articulation therapy for pre-first grade children. Journal of
with orofacial myofunctional disorders Speech and Hearing Disorders, 46, 160–165.
6. Application of current instrumental technologies to Green, H. M., and Green, S. E. (1999). The interrelationship of
document clinical processes and phenomena asso- wind instrument technic, orthodontic treatment, and oro-
ciated with orofacial myofunctional disorders facial myology. International Journal of Orofacial Myology,
7. Treatment options 25, 18–29.
Hanson, M., and Barrett, R. (1988). Fundamentals of orofacial
A Joint Committee of ASHA and the International myology. Springfield, IL: Charles C Thomas.
Association of Orofacial Myology has also delineated Khinda, V., and Grewal, N. (1999). Relationship of tongue-
the knowledge and skills needed to evaluate and treat thrust swallowing and anterior open bite with articulation
persons with orofacial myofunctional disorders (ASHA, disorders: A clinical study. Journal of Indian Society of
1993). The tasks required include the following: Pedodontia and Preventive Dentistry, 17(2), 33–39.
Paul-Brown, D., and Clausen, R. P. (1999, July/August). Col-
Understanding dentofacial patterns and applied physi- laborative approach for identifying and treating speech,
ology pertinent to orofacial myology language, and orofacial myofunctional disorders. Alpha
Understanding factors causing, contributing, or related Omegan, 92(2), 39–44.
to orofacial myology Pedrazzi, M. E. (1997). Treating the open bite. Journal of
Understanding basic orthodontic concepts General Orthodontia, 8, 5–16.
Understanding interrelationships between speech and Romero, M. M., Bravo, G. A., and Perez, L. L. (1998). Open
orofacial myofunctional disorders bite due to lip sucking: A case report. Journal of Clinical
Pediatric Dentistry, 22, 207–210.
Demonstrating competence in comprehensive assess-
Umberger, F., and Johnston, R. G. (1997). The e‰ciency of
ment procedures and in identifying factors a¤ecting oral myofunctional and coarticulation therapy. Interna-
prognosis tional Journal of Orofacial Myology, 23, 3–9.
Demonstrating competence in selecting an appropri- Umberger, F., and Van Reenen, J. (1995). Thumb sucking
ate, individualized, criterion-based treatment plan management: A review. International Journal of Orofacial
Demonstrating a clinical environment appropriate to Myology, 21, 41–45.
the provision of professional services Van Norman, R. (1997). Digit sucking: A review of the lit-
Demonstrating appropriate documentation of all clini- erature, clinical observations, and treatment recom-
cal services mendations. International Journal of Orofacial Myology, 22,
Demonstrating professional conduct within the scope 14–33.
Young, L. D., and Vogel, V. (1983). The use of cueing and
of practice for speech-language pathology (ASHA,
positive practice in the treatment of tongue thrust swallow-
2001) ing. Journal of Behavior Therapy and Experimental Psychi-
Further information on oral myofunction and oral atry, 14, 73–77.
myofunctional disorders is available from ASHA’s Spe-
cial Interest Division on Speech Science and Orofacial Further Readings
Disorders (www.asha.org) and the International Associ- Alexander, S., and Sudha, P. (1997). Genioglossis muscle elec-
ation of Orofacial Myology (www.iaom.com). trical activity and associated arch dim changes in simple
—Diane Paul-Brown tongue thrust swallow pattern. Journal of Clinical Pediatric
Dentistry, 21, 213–222.
References Andrianopoulos, M. V., and Hanson, M. L. (1987). Tongue
thrust and the stability of overjet correction. Angle Ortho-
American Speech-Language-Hearing Association. (1989, No- dontist, 57, 121–135.
vember). Report: Ad hoc Committee on Labial-Lingual Bresolin, D., Shapiro, P. A., Shapiro, G. G., Chapko, M. K.,
Posturing Function. ASHA, 31, 92–94. and Dassel, S. (1983). Mouth breathing in allergic children:
American Speech-Language-Hearing Association. (1991). The Its relationship to dentofacial development. American Jour-
role of the speech-language pathologist in management of nal of Orthodontics, 83, 334–340.
oral myofunctional disorders. ASHA, 33(Suppl. 5), 7. Cayley, A. S., Tindall, A. P., Sampson, W. J., and Butcher,
American Speech-Language-Hearing Association. (1993). A. R. (2000). Electropalatographic and cephalometric as-
Orofacial myofunctional disorders: Knowledge and skills. sessment of myofunctional therapy bite subjects. Australian
ASHA, 35(Suppl. 10), 21–23. Orthodontic Journal, 16, 23–33.
American Speech-Language-Hearing Association. (1997). Pre- Cayley, A. S., Tindall, A. P., Sampson, W. J., and Butcher,
ferred practice patterns for the profession of speech-language A. R. (2000). Electropalatographic and cephalometric as-
pathology. Rockville, MD: Author. sessment of tongue function in open and non-open bite
American Speech-Language-Hearing Association. (2001). subjects. European Journal of Orthodontics, 22, 463–474.
Scope of practice in speech-language pathology. Rockville, Christensen, M., and Hanson, M. (1981). An investigation of
MD: Author. the e‰cacy of oral myofunctional therapy as a precursor to
American Speech-Language-Hearing Association. (2002). articulation therapy for pre-first grade children. Journal of
Code of ethics. Rockville, MD: Author. Speech and Hearing Disorders, 46, 160–167.
150 Part II: Speech
Dworkin, J. P., and Culatta, K. H. (1980). Tongue strength: Its and [S ] are phonemically distinct in some languages,
relationship to tongue thrusting, open-bite, and articulatory such as English, they are represented di¤erently in the
proficiency. Journal of Speech and Hearing Disorders, 45, phonetic alphabet. Thus, the elongated s is used for
277–282. the voiceless palatoalveolar fricative, as in [Su]), to dif-
Gommerman, S. L., and Hodge, M. M. (1995). E¤ects of oral
ferentiate it from the voiceless alveolar fricative, as in
myofunctional therapy on swallowing and sibilant pro-
duction. International Journal of Orofacial Myology, 21, 9– [su].
22. The IPA has undergone several revisions since its in-
Hanson, J. L., and Andrianopoulos, M. V. (1982). Tongue ception but remains essentially unchanged. In the famil-
thrust and malocclusion. International Journal of Orthodon- iar consonant chart, symbols for pulmonic consonants
tics, 29, 9–18. are organized according to place of articulation, manner
Hanson, J. L., and Cohen, M. S. (1973). E¤ects of form and of articulation, and voicing. Nonpulmonic consonants,
function on swallowing and the developing dentition. such as clicks and ejectives, are listed separately, as are
American Journal of Orthodontics, 64, 63–82. vowels, which are shown in a typical vowel quadrangle.
Hanson, M. L. (1988). Orofacial myofunctional disorders: Symbols for suprasegmentals, such as length and tone,
Guidelines for assessment and treatment. International
are also provided, as are numerous diacritics, such as [s" ]
Journal of Orofacial Myology, 14, 27–32.
Hanson, M. L., and Peachey, G. (1991). Current issues in oro- for a dentalized [s].
facial myology. International Journal of Orofacial Myology, The most recent version of the complete IPA chart
17(2), 4–7. can be found in the Handbook of the International
Khinda, V., and Grewel, N. (1999). Relationship of tongue- Phonetic Association (IPA, 1999) as well as in a num-
thrust swallowing and anterior open bite with articulation ber of phonetics books (e.g., Ladefoged, 2001; Small,
disorders: A clinical study. Journal of the Indian Society of 1999). Illustrations of the sounds of the IPA are avail-
Pedodontics and Preventive Dentistry, 17(2), 33–39. able through various sources, such as Ladefoged (2001)
Martin, R. E., and Sessle, B. J. (1993). The role of the cerebral and Wells and House (1995). In addition, training
cortex in swallowing. Dysphagia, 8, 195–202. materials and phonetic fonts can be downloaded from
Mason, R. M. (1988). Orthodontic perspectives on orofacial
the Internet. Some new computers now come equipped
myofunctional therapy. International Journal of Orofacial
Myology, 14(1), 49–55. with ‘‘Unicode’’ phonetic symbols.
Nevia, F. C., and Wertzner, H. F. (1996). A protocol for oral Although extensive, the IPA does not capture all of
myofunctional assessment: For application with children. the variations that have been observed in children’s
International Journal of Oral Myology, 22, 8–19. speech. For this reason, some child/clinical phonologists
Pedrazzi, M. E. (1997). Treating the open bite. Journal of have proposed additional symbols and diacritics (e.g.,
General Orthodontics, 8, 5–16. Bush et al., 1973; Edwards, 1986; Shriberg and Kent,
Pierce, R. B. (1988). Treatment for the young child. Interna- 2003). The extended IPA (extIPA) was adopted by the
tional Journal of Oral Myology, 14, 33–39. International Clinical Phonetics and Linguistics Associ-
Pierce, R. B. (1996). Age and articulation characteristics: A ation (ICPLA) Executive Committee in 1994 to assist in
survey of patient records on 100 patients referred for
and standardize the transcription of atypical speech (e.g.,
‘‘tongue thrust therapy’’ January 1990–June 1996. Interna-
tional Journal of Orofacial Myology, 22, 32–33. Duckworth et al., 1990). The extIPA includes symbols
Saito, M. (2001). A study on improving tongue functions of for sounds that do not occur in ‘‘natural’’ languages,
open-bite children mixed dentition period: Modifications of such as labiodental and interdental plosives, as well as
a removable habit-breaker appliance and their sonographic many diacritics, such as for denasalized and unaspirated
analysis. Kokubyo Gakkai Zasshi, 68, 193–207. sounds. It also includes symbols for transcribing con-
Umberger, F. G., Weld, G. L., and Van Rennen, J. S. (1985). nected speech (e.g., sequences of quiet speech, fast or
Tongue thrust: Attitudes and practices of speech patholo- slow speech), as well as ways to mark features such as
gists and orthodontists. International Journal of Orofacial silent articulation. Descriptions and examples can be
Myology, 11(3), 5–13. found in Ball, Rahilly, and Tench (1996) and Powell
Wasson, J. L. (1989). Correction of tongue-thrust swallowing
(2001).
habits. Journal of Clinical Orthodontics, 12(1), 27–29.
When transcribing child or disordered speech, it is
sometimes impossible to identify the exact nature of a
segment. In such cases, ‘‘cover symbols’’ may be used.
Phonetic Transcription of Children’s These symbols consist of capital letters to represent ma-
Speech jor sound classes, modified with appropriate diacritics.
Thus, an unidentifiable voiceless fricative can be tran-
scribed with a capital F and a small under-ring for
Phonetic transcription entails using special symbols to voicelessness (e.g., Stoel-Gammon, 2001).
create a precise written record of an individual’s speech. Relatively little attention has been paid to the tran-
The symbols that are most commonly used are those scription of vowels in children’s speech (see, however,
of the International Phonetic Alphabet (IPA), first Pollock and Berni, 2001). Even less attention has been
developed in the 1880s by European phoneticians. Their paid to the transcription of suprasegmentals or prosodic
goal was to provide a di¤erent symbol for each unique features. Examples of relevant IPA and extIPA symbols
sound, that is, to achieve a one-to-one correspondence appear in Powell (2001), and Snow (2001) illustrates
between sound and symbol. For example, because [s] special symbols for intonation.
Phonetic Transcription of Children’s Speech 151
Broad or ‘‘phonemic’’ transcriptions, which capture phonological assessment procedures require whole word
only the basic segments, are customarily written in transcription (e.g., Hodson, 1980; Khan and Lewis,
slashes (virgules), as in /paI/ or /tElPfon/. ‘‘Narrow’’ or 1986), so that phonological processes involving more
‘‘close’’ transcriptions, which often include diacritics, are than one segment, such as assimilation (as in [gvk] for
written in square brackets. A narrow transcription more truck), can be more easily discerned. (In fact, Shriberg
accurately represents actual pronunciation, whether cor- and Kwiatkowski, 1980, use continuous speech samples,
rect or incorrect, as in [p h aI ] for pie, with aspiration on necessitating transcription of entire utterances.)
the initial voiceless stop, or a young child’s rendition of To facilitate whole word transcription, some clinical
star as [t¼aU ] or fish as [FIs]. phonologists, such as Hodson (1980) and Louko and
How narrow a transcription needs to be in any given Edwards (2001), recommend writing out broad tran-
situation depends on factors such as the purpose of the scriptions of target words (e.g., /trvk/) ahead of time and
transcription, the skill of the transcriber, and the amount modifying them ‘‘on line’’ for a tentative live transcrip-
of time available. As Powell (2001) points out, basic IPA tion that can be verified or refined by reviewing a tape of
symbols are su‰cient for some clinical purposes, for ex- the session. Although this makes the transcription pro-
ample, if a client’s consonant repertoire is a subset of the cess more e‰cient, it can also lead the transcriber to
standard inventory. A broad transcription is generally mishear sounds or to ‘‘hear’’ sounds that are not there
adequate to capture error patterns that involve deletion, (Oller and Eilers, 1975). Louko and Edwards (2001)
such as final consonant deletion or cluster reduction, as provide suggestions for counteracting the negative e¤ects
well as those that involve substitutions of one sound of such expectation.
class for another, such as gliding of liquids or stopping If a speech-language pathologist is going to expend
of fricatives. the time and energy necessary to complete a phonologi-
If no detail is included in a transcription, however, the cal analysis that is maximally useful, the transcription on
analyst may miss potentially important aspects of the which it is based must be as accurate and reliable as
production. For instance, if a child fails to aspirate ini- possible. Ideally, the testing session should be audio- or
tial voiceless stops, the unaspirated stops should be video-recorded on high-quality tapes and using the best
transcribed with the appropriate (extIPA) diacritic (e.g., equipment available, and it should take place in a quiet
[p¼], [t¼], as in [p¼i] for pea). Such stops can easily be environment, free of distractions (see Stoel-Gammon,
mistaken for the corresponding voiced stops and erro- 2001). Because some sounds are di‰cult to transcribe
neously transcribed as [b], [d], and so on. The clinician accurately from an audiotape (e.g., unreleased final
might then decide to work on initial voicing, using min- stops), it is advisable to do some transcribing on-line.
imal pairs such as pea and bee. This could be frustrating One way to enhance the accuracy of a transcription is
for a child who is already making a subtle (but incorrect) to transcribe with a partner or to find a colleague who is
contrast, for example, between [p¼] and [b]. willing to provide input on di‰cult items. ‘‘Transcrip-
To give another example, a child who is deleting final tion by consensus’’ (Shriberg, Kwiatkowski, and Ho¤-
consonants may retain some features of the deleted con- man, 1984), although impractical in some settings, is an
sonants as ‘‘marking’’ on the preceding vowel, for in- excellent way to derive a transcription and to sharpen
stance, vowel lengthening (if voiced obstruents are one’s skills. This involves two or more people transcrib-
deleted) or nasalization (if nasal consonants are deleted). ing a sample at the same time, working independently,
Unless the vowels are transcribed narrowly, the analyst then listening together to resolve disagreements.
may miss important distinctions, such as between [bi] Sometimes it is desirable to assess the reliability of
(beet), [bi:] (bead ), and [bi~] (bean). a transcription. For intrajudge reliability, the tran-
Stoel-Gammon (2001) suggests using diacritics only scriber relistens to a portion of the sample at some later
when they provide additional information, not when time and compares the two transcriptions on a sound-
they represent adultlike use of sounds. For example, if a by-sound basis, determining a percent of ‘‘point-to-
vowel is nasalized preceding a nasal consonant, the na- point’’ agreement. The same procedure may be used for
salization would not need to be transcribed. However, if determining interjudge reliability, except that a second
a vowel is nasalized in the absence of a nasal consonant, listener’s judgments are compared with those of the first
as in the preceding example, or if inappropriate nasal- transcriber. Reliability rates for children’s speech vary
ization is observed, a narrow transcription is crucial. greatly, depending on factors such as the type of sample
Phonetic transcription became increasingly important (connected speech or single words) and how narrow the
for speech-language pathologists with the widespread transcription is, with reliability rates being higher for
acceptance of phonological assessment procedures in the broad transcription (see Cucchiarini, 1996; Shriberg
1980s and 1990s. Traditional articulation tests (e.g., and Lof, 1991). Alternative methods of assessing tran-
Goldman and Fristoe, 1969) did not require much transcription agreement may sometimes be appropriate. For
scription. Errors were classified as substitutions, omis- instance, in assessing the phonetic inventories of young
sions, or distortions, and only the substitutions were children, Stoel-Gammon (2001) suggests measuring
transcribed. Therefore, no narrow transcription was agreement of features (place or manner) rather than
involved. identity of segments.
In order to describe patterns in children’s speech, it is People who spend long hours transcribing children’s
necessary to transcribe their errors. Moreover, most speech often look forward to the day when accurate
152 Part II: Speech
computer transcription will become a reality. Although Shriberg, L. D., Kwiatkowski, J., and Ho¤man, K. (1984). A
computer programs may be developed to make tran- procedure for phonetic transcription by consensus. Journal
scription more objective and time-e‰cient, speech- of Speech and Hearing Research, 27, 456–465.
language pathologists will continue to engage in the Shriberg, L. D., and Lof, G. L. (1991). Reliability studies in
broad and narrow phonetic transcription. Clinical Linguis-
transcription process because of what can be learned
tics and Phonetics, 5, 225–279.
through carefully listening to and trying to capture Small, L. H. (1999). Phonetics: A practical guide for students.
the subtleties of a person’s speech. Therefore, phonetic Boston: Allyn and Bacon.
transcription is likely to remain an essential skill for Snow, D. (2001). Transcription of suprasegmentals. Topics in
anyone engaged in assessing and remediating speech Language Disorders, 21(4), 42–52.
sound disorders. Stoel-Gammon, C. (2001). Transcribing the speech of young
children. Topics in Language Disorders, 21(4), 12–21.
—Mary Louise Edwards Wells, J., and House, J. (1995). The sounds of the International
Phonetic Alphabet. London: Department of Phonetics and
References Linguistics, University College London.
Ball, M. J., Rahilly, J., and Tench, P. (1996). The phonetic

transcription of disordered speech. San Diego, CA: Singular Further Readings
Publishing Group.
Bush, C. N., Edwards, M. L., Luckau, J. M., Stoel, C. M., Amorosa, H., von Benda, U., and Keck, A. (1985). Tran-
Macken, M. A., and Peterson, J. D. (1973). On specifying a scribing phonetic detail in the speech of unintelligible chil-
system for transcribing consonants in child language. dren: A comparison of procedures. British Journal of
Unpublished manuscript, Committee on Linguistics, Stan- Disorders of Communication, 20, 281–287.
ford University, Stanford, CA. Ball, M. J. (1991). Recent developments in the transcription of
Cucchiarini, C. (1996). Assessing transcription agreement: non-normal speech. Journal of Communication Disorders,
Methodological aspects. Clinical Linguistics and Phonetics, 24, 59–78.
10, 131–155. Ball, M. J. (1993). Phonetics for speech pathology (2nd ed.).
Duckworth, M., Allen, G., Hardcastle, W. J., and Ball, M. J. London: Whurr.
(1990). Extensions to the International Phonetic Alphabet Ball, M. J., Code, C., Rahilly, J., and Hazelett, D. (1994).
for the transcription of atypical speech. Clinical Linguistics Non-segmental aspects of disordered speech: Develop-
and Phonetics, 4, 273–280. ments in transcription. Clinical Linguistics and Phonetics, 8,
Edwards, M. L. (1986). Introduction to applied phonetics: Lab- 67–83.
oratory workbook. Needham Heights, MA: Allyn and Bernhardt, B., and Ball, M. J. (1993). Characteristics of atypi-
Bacon. cal speech currently not included in the Extension to the
Goldman, R., and Fristoe, M. (1969). Goldman-Fristoe Test IPA. Journal of the International Phonetic Association, 23,
of Articulation. Circle Pines, MN: American Guidance 35–38.
Service. Bronstein, A. J. (Ed.). (1988). Conference papers on American
Hodson, B. W. (1980). The Assessment of Phonological Pro- English and the International Phonetic Alphabet. Tusca-
cesses. Danville, IL: Interstate Printers and Publishers. loosa, AL: University of Alabama Press.
International Clinical Phonetics and Linguistics Association Compton, A. J., and Hutton, S. (1980). Phonetics for children’s
Executive Committee. (1994). The extIPA chart. Journal of misarticulations. San Francisco: Carousel House.
the International Phonetic Association, 24, 95–98. Edwards, H. T. (1997). Applied Phonetics: The Sounds of
International Phonetic Association. (1999). Handbook of the American English (2nd ed.). San Diego, CA: Singular Pub-
International Phonetic Association: A guide to the use of the lishing Group.
International Phonetic Alphabet. Cambridge, UK: Cam- Grunwell, P., and Harding, A. (1996). A note on describing
bridge University Press. types of nasality. Clinical Linguistics and Phonetics, 10,
Khan, L. M. L., and Lewis, N. (1986). Khan-Lewis pho- 157–161.
nological analysis. Circle Pines, MN: American Guidance Johnson, K. (1997). Acoustic and auditory phonetics. Oxford,
Service. UK: Blackwell.
Ladefoged, P. (2001). A course in phonetics (4th ed.). Fort Ladefoged, P., and Maddieson, I. (1996). The sounds of the
Worth, TX: Harcourt College Publishers. world’s languages. Oxford, UK: Blackwell.
Ladefoged, P. (2001). Vowels and consonants: An introduction Laver, J. (1994). Principles of phonetics. Cambridge, UK:
to the sounds of languages. Oxford, UK: Blackwell. Cambridge University Press.
Louko, L. J., and Edwards, M. L. (2001). Issues in collecting Lehiste, I. (1970). Suprasegmentals. Cambridge, MA: MIT
and transcribing speech samples. Topics in Language Dis- Press.
orders, 21, 1–11. Louko, L. J., and Edwards, M. L. (Eds.). (2001). Collecting
Oller, D. K., and Eilers, R. E. (1975). Phonetic expectation and and transcribing speech samples: Enhancing phonological
transcription validity. Phonetica, 31, 288–304. analysis. Topics in Language Disorders, 21(4).
Pollock, K. E., and Berni, M. C. (2001). Transcription of Maassen, B., O¤erninga, S., Vieregge, W., and Thoonen, G.
vowels. Topics in Language Disorders, 21, 22–41. (1996). Transcription of pathological speech in children by
Powell, T. (2001). Phonetic transcription of disordered speech. means of extIPA: Agreement and relevance. In T. Powell
Topics in Language Disorders, 21, 53–73. (Ed.), Pathologies of speech and language: Contributions of
Shriberg, L. D., and Kent, R. D. (2003). Clinical phonetics (3rd clinical phonetics and linguistics (pp. 37–43). New Orleans,
ed.). Needham Heights, MA: Allyn and Bacon. LA: ICPLA.
Shriberg, L. D., and Kwiatkowski, J. (1980). Natural process Mackay, I. (1987). Phonetics: The science of speech production
analysis. New York: Wiley. (2nd ed.). Boston: Little, Brown.
Phonological Awareness Intervention for Children with Expressive Phonological Impairments 153
Paden, E. P. (1989). Excercises in phonetic transcription: A are children with expressive phonological impairments
programmed workbook (2nd ed.). Woburn, MA: Butter- (EPIs) (Webster and Plante, 1992).
worth-Heinemann.
PRDS. (1983). The Phonetic Representation of Disordered Relationship Between Expressive Phonological Impair-
Speech: Final report. London: King’s Fund.
ment (EPI) and Phonological Awareness. A growing
Pullum, G. K., and Ladusaw, W. A. (1996). Phonetic symbol
guide (2nd ed.). Chicago: University of Illinois Press. body of evidence indicates that young children with se-
Shriberg, L., Hinke, R., and Trost-Ste¤en, C. (1987). A proce- vere EPI go on to experience problems in literacy. As
dure to select and train persons for narrow phonetic tran- well, results from another line of research indicate that
scription by consensus. Clinical Linguistics and Phonetics, 1, individuals with reading disabilities evidence more pho-
171–189. nological production di‰culties (e.g., with multisyllabic
Vieregge, W. H., and Maassen, B. (1999). ExtIPA transcrip- words) than their peers with typical reading abilities
tions of consonants and vowels spoken by dyspractic (Catts, 1986). Bird, Bishop, and Freeman (1995) found
children: Agreement and validity. In B. Maassen and P. that the children who had severe EPI experienced greater
Groenen (Eds.), Pathologies of speech and language: di‰culty with phonological awareness tasks than their
Advances in clinical phonetics and linguistics (pp. 275–284).
ability-matched peers, even when the tasks did not
London: Whurr.
require a verbal response. Clarke-Klein and Hodson
(1995) obtained similar results for spelling. Larivee and
On-line Resources Catts (1999), who tested children first in kindergarten
International Phonetic Association (IPA) home page and again 1 year later, found that expressive phonology
http://www2.arts.gla.ac.uk/IPA/ipa.html (measured by a multisyllabic word and nonword pro-
SIL International home page duction task) and phonological awareness scores in kin-
http://www.sil.org dergarten accounted for significant amounts of variance
IPA learning materials in first-grade reading.
http://www2.arts.gla.ac.uk/IPA/cassettes.html
Several investigators (e.g., Bishop and Adams, 1990;
http://hctv.humnet.ucla.edu/departments/linguistics/
VowelsandConsonants Catts, 1993), however, have reported that phonological
IPA Fonts impairments alone do not have as great an impact on
http://www2.arts.gla.ac.uk/IPA/ipafonts.html literacy as language impairments do. A possible expla-
http://www.sil.org/computing/fonts/encore-ipa.html nation for this discrepancy may be the level of EPI se-
http://www.chass.utoronto.ca/~rogers/fonts.html verity in the participants in their studies.
Severity Considerations. A common practice in the

articulation/phonology literature is to report the number
of errors on an articulation test. Not all speech sound
errors are equal, however. For example, if two children
Phonological Awareness Intervention have 16 errors on the same test, some examiners might
for Children with Expressive view them as equal. If, however, one child evidences a
Phonological Impairments lisp for all sibilants and the other has 16 omissions, the
impact on intelligibility will be vastly di¤erent. More-
over, the child with extensive omissions might be identi-
Phonological awareness refers to an individual’s aware- fied as having a language impairment because of the
ness of the sound structure of a language. Results from a omission of final consonants (which would a¤ect the
number of studies indicate that phonological awareness production of word-final morphemes on an expressive
skills are highly correlated with reading success (see language measure). Some highly unintelligible children
Stanovich, 1980) and that phonological awareness can who are considered to have a language impairment may,
be enhanced by direct instruction (see Blachman et al., in fact, have a severe phonological impairment with in-
1994). Some scientists prefer using the terms phono- tact receptive language abilities. Typically such children
logical sensitivity or metaphonology rather than phono- produce final morphemes as they learn the phonological
logical awareness. These three terms are generally pattern of word-final consonants.
considered comparable in meaning, except that meta-
phonology implies that the awareness is at a more Phonological Awareness Treatment Studies for Children
conscious level. A fourth term, phonemic awareness, with EPI. Although there have been numerous studies
refers only to phonemes, whereas phonological aware- reporting the results of phonological awareness treat-
ness includes syllables and intrasyllabic units (onset and ment, only a few investigators have focused on children
rime). Phonological processing, the most encompassing with phonological or language impairments. van Kleeck,
of these related terms, includes phonological production, Gillam, and McFadden (1998) provided classroom-
verbal working memory, word retrieval, spelling, and based phonological awareness treatment (15 minutes
writing, as well as phonological awareness. Among the twice a week) to 16 children with speech and/or lan-
individuals who have been identified most consistently guage disorders (8 in a preschool class and 8 in a pre-
as being ‘‘at risk’’ for failure to develop appropriate kindergarten class). The small-group sessions focused on
phonological awareness skills, and ultimately literacy, rhyming during the first semester and on phoneme
154 Part II: Speech
awareness during the second semester. The treatment treatment focused on the development of skills at the
groups and a nontreatment comparison group all made phonemic level and integrated phonological aware-
substantial gains in rhyming. Children in the treatment ness activities with grapheme-phoneme correspondence
groups, however, made markedly greater gains on pho- training. Activities included (a) picture Bingo and oddity
nemic awareness tasks than children in the nontreatment games for rhyme awareness, (b) identification of initial
group. Information on changes in expressive phonology and final sounds, and sometimes medial sounds, (c)
or language was not provided by the investigators. phoneme segmentation, (d) phoneme blending, and (e)
Howell and Dean (1994) used their Metaphon pro- linking speech to print. The children in this group made
gram to provide both phonological awareness and pro- significantly greater gains in phonological awareness and
duction treatment for 13 preschool children with EPI in reading scores than the children in the other groups.
Scotland. In phase 1 of this program, children progress Moreover, the children also made greater gains in pho-
from the concept/sound (not speech) level to the phonological production than children in the other groups
neme level to the word level. Minimal pairs are used with EPI. The results of this investigation lend support
extensively during phase 1. In phase 2, the progression to the contention that it is important to incorporate
is from word level to sentence level. The children phonological awareness tasks into treatment sessions for
attended between 11 and 34 30-minute sessions weekly. children with EPI.
Single subject case study results indicated that the chil-
dren improved on both phonological production and Enhancing Phonological Awareness Skills. Available
phonological awareness tasks (sentence and phoneme tasks range in di‰culty from simple ‘‘yes-no’’ judgments
segmentation). regarding whether two words rhyme to complex phono-
Harbers, Paden, and Halle (1999) provided individual logical manipulation activities (e.g., pig Latin, spooner-
treatment to four preschool children with EPI for 6–9 isms). Moreover, many activities that are commonly
months that focused on both feature awareness and used in treatment sessions have phonological awareness
production for three phonological patterns that the chil- components. When children are taught how a sound is
dren lacked. All four children targeted /s/ clusters. Three produced and how it feels, they develop awareness about
targeted strident singletons, two targeted velars, two place, manner, and voicing aspects of the sounds in their
targeted liquids, and one targeted final consonants. phonological system. One phonological awareness treat-
The investigators used a combination of the Metaphon ment program (Lindamood and Lindamood, 1998) has
(Howell and Dean, 1994) and Cycles (Hodson and a component that specifically addresses teaching the
Paden, 1991) treatment approaches. Improvement in the articulatory characteristics of phonemes to all children
production of /s/ clusters coincided with gains in recog- with reading disabilities, even when there are no phono-
nizing /s/ cluster features for two of the four children logical production problems. Moreover, when children
targeting /s/ clusters. Both of the children targeting learn about where a sound is located in a word (initial,
velars also evidenced concomitant gains in production medial, or final position), they develop awareness about
and awareness. For the remaining targets, there was a word positions.
slight tendency for the two variables (phonological One phonological awareness activity that has proved
awareness and production) to move in similar directions, to be particularly e¤ective is the ‘‘Say-It-And-Move-It’’
but inconsistencies occurred. task, using Elkonin cards (Ball and Blachman, 1991,
Gillon (2000) conducted a phonological awareness adapted from Elkonin, 1963). Children are taught to
treatment study in New Zealand that involved 91 chil- represent the sounds in one- (e.g., a), two- (e.g., up), or
dren with ‘‘spoken language impairment’’ between the three-phoneme (e.g., cat) words by using manipulatives.
ages of 5 and 7 years. Twenty-three children participated Initially blank tiles or blocks are used. Tiles with graph-
in an experimental ‘‘integrated’’ treatment program. A emes are incorporated after the child demonstrates
second group of 23 children received traditional speech- recognition of the sounds for the letters. The top half of
sound treatment. Two additional groups served as con- the paper has a picture of a word. The bottom half has
trols. One treatment group of 15 children who received the appropriate number of boxes for the phonemes
‘‘minimal’’ intervention, and the other consisted of 30 needed for the word. Children are taught to say each
phonologically normal children. Children in the first word slowly and to move one manipulative for each
treatment group received two 60-minute sessions per sound into the boxes from left to right.
week until a total of 20 hours of intervention had been Another phonological awareness activity that is
completed. The second group participated in phoneme- widely used both for assessment and for segmentation
oriented sessions for the same amount of time. All of the practice is categorization. This task requires matching
children continued participating in their regular class- and oddity awareness skills. Typically the child is given
room literacy instruction, which was based on a ‘‘Whole four pictures and is to identify the one that does not
Language’’ model. match the others in some aspect (e.g., rhyme) and thus is
The children in the first group did not receive direct the ‘‘odd one out.’’ Categorization also is used for indi-
production treatment for EPI during the course of the vidual sounds (e.g., initial consonants).
study. Additional stimulus items for children’s indi- Learning to blend phonological segments to make
vidual speech sound errors were integrated into some words is another important task and one that is ex-
of the activities, however. The phonological awareness tremely di‰cult for some children. Blending tasks com-
Phonological Awareness Intervention for Children with Expressive Phonological Impairments 155
monly start at the word level with compound words Catts, H. W. (1993). The relationship between speech-language
(e.g., ice plus cream) followed by blending syllables (e.g., impairments and reading disabilities. Journal of Speech and
can plus dee; candy). Blending intrasyllabic units (e.g., Hearing Research, 6, 948–958.
onset and rime, as sh plus eep; and body and coda, as Clarke-Klein, S., and Hodson, B. (1995). A phonologically
based analysis of misspellings by third graders with
shee plus p) should precede blending individual pho-
disordered-phonology histories. Journal of Speech and
nemes (e.g., sh plus ee plus p). Hearing Research, 38, 839–849.
Another task that has been found to be highly corre- Elkonin, D. B. (1963). The psychology of mastering the ele-
lated with success in reading is deletion (e.g., elision task, ments of reading. In B. Simon and J. Simon (Eds.), Educa-
Rosner and Simon, 1971). As with blending, it is impor- tional psychology in the USSR (pp. 165–179). London:
tant to begin with the larger segments (e.g., compound Routledge.
words). The child says the word (e.g., cowboy), and then, Gillon, G. T. (2000). The e‰cacy of phonological awareness
after part of the word is removed (e.g., boy), says the intervention for children with spoken language impairment.
new word (cow). After a child demonstrates success at Language, Speech, and Hearing Services in Schools, 31,
the larger unit levels, individual phonemes are deleted 126–141.
Harbers, H. M., Paden, E. P., and Halle, J. W. (1999). Pho-
(e.g., take away /t/ from note/, leaving no).
nological awareness and production: Changes during inter-
The task that consistently has accounted for the vention. Language, Speech, and Hearing Services in Schools,
greatest amount of variance in predicting decoding suc- 30, 50–60.
cess is manipulation. Children who are most successful Hodson, B., and Paden, E. (1991). Targeting intelligible speech
performing phoneme manipulation tasks such as spoo- (2nd ed.). Austin, TX: Pro-Ed.
nerisms typically are the best decoders (Strattman, Howell, J., and Dean, E. (1994). Treating phonological dis-
2001). Phonological manipulation in ‘‘pattern’’ songs orders in children (2nd ed.). London: Whurr.
(e.g., ‘‘Apples and Bananas’’) seems to be an extremely Larrivee, L. S., and Catts, H. W. (1999). Early reading
enjoyable task for very young children and can help achievement in children with expressive phonological dis-
them be more aware of sounds and word structures. orders. American Journal of Speech-Language Pathology, 8,
118–128.
Lindamood, P. C., and Lindamood, P. D. (1998). The Linda-
Implications for Best Practices. Because children with mood Phoneme Sequencing program for reading, spelling,
EPI appear to be at risk for the development of normal and speech (LiPS). Austin, TX: Pro-Ed.
reading and writing skills even after they no longer have Rosner, J., and Simon, D. (1971). The auditory analysis test.
intelligibility issues, it seems prudent to incorporate Journal of Learning Disabilities, 4, 384–392.
activities to enhance phonological awareness skills while Stanovich, K. E. (1980). Toward an interactive compensatory
they are receiving treatment for phonological produc- model of individual di¤erences in the development of read-
tion. Moreover, results from Gillon’s (2000) study indi- ing fluency. Reading Research Quarterly, 16, 32–71.
cate that enhancing phonological awareness skills leads Strattman, K. (2001). Predictors of second graders’ reading and
to improvement in phonological production. Thus, en- spelling scores. Unpublished doctoral dissertation. Wichita
State University, Wichita, KS.
hancing phonological awareness skills appears to serve a
van Kleeck, A., Gillam, R. B., and McFadden, T. U. (1998). A
dual purpose for children with expressive phonological study of classroom-based phonological awareness training
impairments. for preschoolers with speech and/or language disorders.
—Barbara Hodson and Kathy Strattman American Journal of Speech-Language Pathology, 7, 65–76.
Webster, P., and Plante, A. (1992). E¤ects of phonological
impairment on word, syllable, and phoneme segmentation
References and reading. Language, Speech, and Hearing Services in
Schools, 23, 176–182.
Ball, E., and Blachman, B. (1991). Does phoneme awareness
training in kindergarten make a di¤erence in early word Further Readings
recognition and developmental spelling? Reading Research
Quarterly, 26, 49–66. Adams, M. J., Foorman, B. R., Lundberg, I., and Beeler, T.
Bird, J., Bishop, D. V. M., and Freeman, M. H. (1995). Pho- (1998). Phonemic awareness in young children: A classroom
nological awareness and literacy development in children curriculum. Baltimore: Paul H. Brookes.
with expressive phonological impairments. Journal of Badian, N. (Ed.). (2000). Prediction and prevention of reading
Speech and Hearing Research, 38, 446–462. failure. Baltimore: York Press.
Bishop, D. V. M., and Adams, C. (1990). A prospective study Blachman, B. (Ed.). (1997). Foundations of reading acquisition
of the relationship between specific language impairment, and dyslexia. Mahwah, NJ: Erlbaum.
phonological disorders, and reading retardation. Journal of Blachman, B., Ball, E., Black, R., and Tangel, D. (2000). Road
Child Psychology and Psychiatry, 31, 1027–1050. to the code. Baltimore: Paul H. Brookes.
Blachman, B., Ball, E., Black, R., and Tangle, D. (1994). Catts, H. W., and Kamhi, A. G. (Eds.). (1999). Language and
Kindergarten teachers develop phonemic awareness in reading disabilities. Needham Heights, MA: Allyn and Bacon.
low-income, inner-city classrooms. Does it make a di¤er- Catts, H., and Olsen, T. (1993). Sounds abound: Listening,
ence? Reading and Writing: An Interdisciplinary Journal, 6, rhyming, and reading. East Moline, IL: LinguiSystems.
1–18. Chafouleas, S., VanAuken, T., and Dunham, K. (2001). Not
Catts, H. W. (1986). Speech production/phonological defi- all phonemes are created equal: The e¤ects of linguistic
cits in reading-disordered children. Journal of Learning manipulations on phonological awareness tasks. Journal of
Disabilities, 19, 504–508. Psychoeducational Assessment, 19, 216–226.
156 Part II: Speech
Cunningham, A. (1990). Explicit vs. implicit instruction in life span, despite having received treatment for the pho-
phonological awareness. Journal of Experimental Child nological disorder (Shriberg et al., 1997).
Psychology, 50, 429–444. Residual phonological errors are a subtype of devel-
Hatcher, P. (1994). Sound linkage: An integrated programme opmental phonological disorders that persist beyond the
for overcoming reading di‰culties. London: Whurr.
expected period of speech-sound development or nor-
Hodson, B. (Ed.). (1994). From phonology to metaphonology:
Issues, assessment, and intervention [special issue]. Topics in malization (Shriberg, 1997). They are present in the
Language Disorders, 14. speech of older school-age children and adults. Individ-
Hulme, E., and Joshi, R. (Eds.). (1998). Reading and spelling: uals with residual errors can be further classified into
Development and disorders. Mahwah, NJ: Erlbaum. subgroups of those with a history of speech delay and
Lencher, O., and Podhajski, B. (1998). The sounds abound pro- those without a history of speech delay (i.e., individuals
gram. East Moline, IL: LinguiSystems. in whom a speech delay was diagnosed at some time
Lonigan, C., Burgess, S., Anthony, J., and Barker, T. (1998). during the developmental period and those who were not
Development of phonological sensitivity in 2- to 5-year-old so diagnosed). It is postulated that the two groups di¤er
children. Journal of Educational Psychology, 90, 294–311. with respect to causal factors. The residual errors of
Moats, L. C. (2000). Speech to print. Baltimore: Paul H.
the first group are thought to reflect environmental
Brookes.
Snow, C., Burns, M., and Gri‰n, P. (Eds.). (1998). Preventing influences, while nonenvironmental causal factors such
reading di‰culties in young children. Washington, DC: Na- as genetic transmission are thought to be responsible for
tional Academy Press. the phonological errors of the second group.
Spector, C. (1999). Sound e¤ects: Activities for developing pho- Most residual errors have been identified as distor-
nological awareness. Eau Claire, WI: Thinking Publications. tions (Smit et al., 1990; Shriberg, 1993) of the expected
Stackhouse, J., and Wells, B. (1997). Children’s speech and lit- allophones of a particular phoneme. Distortions are
eracy di‰culties: A psycholinguistic framework. London: variant productions that do not fall within the percep-
Whurr. tual boundaries of a specific target phoneme (Danilo¤,
Stackhouse, J., and Wells, B. (Eds.). (2001). Children’s speech Wilcox, and Stephens, 1980; Bernthal and Bankson,
and literacy di‰culties: Vol. 2. Identification and interven-
1998). It has been hypothesized that distortions reflect
tion. London: Whurr.
Stackhouse, J., Wells, B., and Pascoe, M. (2002). From pho- incorrect allophonic rules or sensorimotor processing
nological therapy to phonological awareness. Seminars in limitations. That is, such productions are either perma-
Speech and Language, 23, 27–42. nent or temporary manifestations of inappropriate allo-
Stanovich, K. (2000). Progress in understanding reading. New phonic representation and/or the sensorimotor control
York: Guilford Press. of articulatory accuracy. It has been suggested that chil-
Torgesen, J., and Mathes, P. (2000). A basic guide to under- dren initially delete and substitute sounds and then pro-
standing, assessing, and teaching phonological awareness. duce distortions of sounds such as /r/, /l/, and /s/ when
Austin, TX: Pro-Ed. normalizing sound production; however, investigative
Wolf, M. (Ed.). (2001). Dyslexia, fluency, and the brain. study has not supported this hypothesis as a generality in
Timonium, MD: York Press.
children who normalize their phonological skills with
treatment (Shriberg and Kwiatkowski, 1988). Ohde and
Phonological Errors, Residual Sharf (1992) provide excellent descriptions of the acous-
tic and physiologic parameters of common distortion
errors.
Shriberg (1994) has conceptualized developmental pho- Smit et al. (1990) conducted a large-scale investiga-
nological disorders as speech disorders that originate tion of speech sound acquisition and reported that the
during the developmental period. In most cases the cause distortion errors noted in the speech of their older test
of such disorders cannot be attributed to significant subjects varied with respect to judged clinical impact or
involvement of a child’s speech or hearing processes, severity. Some productions were judged to be minor
cognitive-linguistic functions, or psychosocial processes distortions, while others were designated as clinically
(Bernthal and Bankson, 1998), but causal origins may be significant. Shriberg (1993) also noted such di¤erences
related to genetic or environmental di¤erences (Shriberg, in his study of children with developmental phonolog-
1994; Shriberg and Kwiatkowski, 1994). Children with ical disorders. He classified the errors into nonclinical
developmental phonological disorders are heterogeneous and clinical distortion types. Nonclinical distortions are
and exhibit a range in the severity of their phonological thought to reflect dialect or other factors such as speech-
disorders. Generally, the expected developmental period motor constraints and are not targeted for therapy.
for speech sound acquisition ends at approximately 9 Clinical distortions are potential targets for treatment
years of age, thus encompassing birth through the early and have been categorized by prevalence into common
school years. In sum, it is posited that children who ex- and uncommon types. The most common and uncom-
hibit phonological disorders di¤er with regard to the mon types are listed in Figure 1. The most common re-
etiology and severity of the disorder and include both sidual errors include distortions of the sound classes of
preschool and school-age children (Deputy and Weston, liquids, fricatives, and a¤ricates. Uncommon distortion
1998). Some individuals with developmental phonologi- errors include errors such as weak or imprecise conso-
cal disorders acquire normal speech, while others con- nant production and di‰culty maintaining nasal and
tinue to exhibit a phonological disorder throughout the voicing features. In most cases, residual errors constitute
Phonological Errors, Residual 157
cated that the two subjects were able to acquire correct

Common Distortion Errors
production of the former residual error.
1. Dentalization of voiced/voiceless sibilant fricatives In summary, residual errors are a distinct subtype of
or a¤ricates developmental phonological errors that are present in
2. Derhotacized /r/, /E/, /F/ the speech of older children and adults who are beyond
3. Lateralization of voiced/voiceless sibilant fricatives the period of normal sound acquisition. Most residual
or a¤ricates errors are described as distortions, which are sound vari-
4. Velarized /l/ or /r/ ations that are not within the phonetic boundaries of the
5. Labialized /l/ or /r/ intended target sound. Generally, residual errors are mi-
Uncommon Distortion Errors nor in terms of severity and do not interfere with intelli-
gibility, but normal speakers do react negatively to such
1. Weak consonant productions minor speech variations. An exact estimate of children
2. Imprecise articulation of consonants and vowels and adults with residual errors is unknown, but it is
3. Inability to maintain oral/nasal contrasts thought that there are substantial numbers of individuals
4. Di‰culty in maintaining correct voicing contrasts with such a phonological disorder.
Figure 1. Common and uncommon distortion errors as reported —Dennis M. Ruscello
by Shriberg (1993).
References
minor involvement of phonological production and do Bauman-Waengler, J. (2000). Articulatory and phonological
not have a significant impact on intelligibility, but re- impairments: A clinical focus. Boston: Allyn and Bacon.
search indicates that normal speakers react negatively to Bernthal, J. E., and Bankson, N. W. (1998). Articulation and
persons with even minor residual errors (Mowrer, Wahl, phonological disorders (4th ed.). Boston: Allyn and Bacon.
Crowe Hall, B. J. (1991). Attitudes of fourth and sixth graders
and Doolan, 1978; Silverman and Paulus, 1989; Crowe
toward peers with mild articulation disorders. Language,
Hall, 1991). Speech and Hearing Services in Schools, 22, 334–339.
Treatment for persons with residual errors is gener- Dagenais, P. A. (1995). Electropalatography in the treatment
ally carried out using approaches that have been used of articulation/phonological disorders. Journal of Commu-
with younger children. The treatment approaches are nication Disorders, 28, 303–330.
based on motor learning or cognitive-linguistic concepts Danilo¤, R., Wilcox, K., and Stephens, M. I. (1980). An
(Lowe, 1994; Bauman-Waengler, 2000); however, in acoustic-articulatory description of children’s defective
most cases a motor learning approach is utilized (Gierut, /s/ productions. Journal of Communication Disorders, 13,
1998). Although most individuals normalize their resid- 347–363.
ual errors with intervention, some individuals do not Deputy, P. N., and Weston, A. D. (1998). A framework for
di¤erential diagnosis of developmental phonologic dis-
(Dagenais, 1995; Shuster, Ruscello, and Toth, 1995).
orders. In B. J. Philips and D. M. Ruscello (Eds.), Di¤er-
The actual number of clients in the respective categories ential diagnosis in speech-language pathology (pp. 113–158).
is unknown, but survey data of school practitioners Boston: Butterworth-Heinemann.
reported by Ruscello (1995a) indicate that a subgroup Gibbon, F., Stewart, F., Hardcastle, W. J., and Crampin, L.
of clients do not improve with traditional treatment (1999). Widening access to electropalatography for children
methods. Respondents indicated that children either with persistent sound system disorders. American Journal of
were unable to achieve correct production of an error Speech-Language Pathology, 8, 319–334.
sound or achieved correct production but were unable Gierut, J. A. (1998). Treatment e‰cacy: Functional phonolog-
to incorporate the sound into spontaneous speech. The ical disorders in children. Journal of Speech, Language, and
respondents did not list the types of sound errors, but the Hearing Research, 41, S85–S100.
Lowe, R. J. (1994). Phonology: Assessment and intervention
error sounds reported are in agreement with the residual applications in speech pathology. Baltimore: Williams and
errors identified by both Shriberg (1993) and Smit et al. Wilkins.
(1990). Mowrer, D. E., Wahl, P., and Doolan, S. J. (1978). E¤ect of
In some cases, specially designed treatments are nec- lisping on audience evaluation of male speakers. Journal of
essary to facilitate remediation of residual errors. For Speech and Hearing Disorders, 43, 140–148.
example, principles from biofeedback and speech physi- Ohde, R. N., and Sharf, D. J. (1992). Phonetic analysis of nor-
ology have been incorporated into treatments (Dag- mal and abnormal speech. New York: Merrill.
enais, 1995; Ruscello, 1995b; Gibbon et al., 1999). Ruscello, D. M. (1995a). Visual feedback in treatment of re-
Di¤erent forms of sensory information other than audi- sidual phonological disorders. Journal of Communication
tory input have been provided to assist the individual in Disorders, 28, 279–302.
Ruscello, D. M. (1995b). Speech appliances in the treatment
developing appropriate target productions. Shuster, of phonological disorders. Journal of Communication Dis-
Ruscello, and Toth (1995) identified two older children orders, 28, 331–353.
with residual /r/ errors who had received traditional Shriberg, L. D. (1993). Four new speech and prosody-voice
long-term phonological treatment without success. A measures for genetics research and other studies in devel-
biofeedback treatment utilizing real-time spectrography opmental phonological disorders. Journal of Speech and
was implemented for both subjects, and the results indi- Hearing Research, 36, 105–140.
158 Part II: Speech
Shriberg, L. D. (1994). Five subtypes of developmental pho- case in point is the native English dialect spoken by
nological disorders. Clinics in Communication Disorders, 4, many African Americans, a populous ethnic minority
38–53. group. This dialect is labeled in various ways but is re-
Shriberg, L. D. (1997). Developmental phonological disorders: ferred to here as African American Vernacular English
One or many? In B. W. Hodson and M. L. Edwards (Eds.),
(AAVE). As a result of litigation, legislation, and social
Perspectives in Applied Phonology (pp. 105–132). Gaithers-
burg, MD: Aspen. changes beginning in the 1960s, best clinical practice
Shriberg, L. D., Austin, D., Lewis, B., McSweeny, J. L., and now requires speech clinicians to regard social dialect
Wilson, D. L. (1997). The speech disorders classification di¤erences in defining speech norms for clinical service
system (SDCS): Extensions and lifespan reference data. delivery. This mandate has created challenges for clinical
Journal of Speech, Language, and Hearing Research, 40, practices.
723–740. One clinical issue is how to identify AAVE speakers.
Shriberg, L. D., and Kwiatkowski, J. (1988). A follow-up study African Americans are racially, ethnically, and linguisti-
of children with phonologic disorders of unknown origin. cally diverse. Not all learn AAVE, and among those
Journal of Speech and Hearing Disorders, 53, 144–155. who do, the density of use varies. This discussion con-
Shriberg, L. D., and Kwiatkowski, J. (1994). Developmental
siders only those African Americans with an indigenous
phonological disorders: I. A clinical profile. Journal of
Speech and Hearing Research, 37, 1100–1126. slave history in the United States and ancestral ties to
Shuster, L. I., Ruscello, D. M., and Toth, A. (1995). The use of Subsaharan Africa. The native English spoken today is
visual feedback to elicit correct /r/. American Journal of rooted partly in a pidgin-creole origin. Since slavery was
Speech-Language Pathology, 4, 37–44. abolished, the continuing physical and social segregation
Silverman, F. H., and Paulus, P. G. (1989). Peer reactions to of African Americans has sustained large AAVE com-
teenagers who substitute /w/ for /r/. Language, Speech, and munities, particularly in southern states.
Hearing Services in Schools, 20, 219–221. Contemporary AAVE pronunciation is both like
Smit, A. B., Hand, L., Freilinger, J. J., Bernthal, J. E., and and unlike Standard English (SE). In both dialects, the
Bird, A. (1990). The Iowa articulation norms project and its vowel and consonant sounds are the same (with a few
Nebraska replication. Journal of Speech and Hearing Dis-
exceptions), but their use in words di¤ers (Wolfram,
orders, 55, 779–798.
1994; Stockman, 1996b). Word-initial single and clus-
Further Readings tered consonants in AAVE typically match those in SE
except for interdental fricatives (e.g., this > /dIs/). The
Bankson, N. W., and Bernthal, J. E. (1982). Articulation as- dialects di¤er in their distributions of word-final con-
sessment. In N. J. Lass, L. V. McReynolds, J. L. Northern, sonants. Some final consonants in AAVE are replaced
and D. E. Yoder (Eds.), Speech, language, and hearing (pp.
(cf. bath and bathe > /f/ and /v/, respectively). Others
572–590). Philadelphia: Saunders.
Bleile, K. M. (1995). Manual of articulation and phonological are absent as single sounds (e.g., man) or in consonant
disorders. San Diego, CA: Singular Publishing Group. clusters (test > /tEs/). Yet AAVE is not an open-syllable
Flipsen, P., Shriberg, L., Weismer, G., Karlsson, H., and dialect. Final consonants are variably absent in predict-
McSweeny, J. (1999). Acoustic characteristics of /s/ in ado- able or rule-governed ways. They are more likely to be
lescents. Journal of Speech, Language, and Hearing Re- absent or reduced in clusters when the following word
search, 42, 663–677. or syllable begins with another consonant rather than
Ruscello, D. M., St. Louis, K. O., and Mason, N. (1991). a vowel (Wolfram, 1994) or when a consonant is an
School-aged children with phonologic disorders: Coexis- alveolar as opposed to a labial or velar stop (Stockman,
tence with other speech/language disorders. Journal of 1991). In multisyllabic words, unstressed syllables (e.g.,
away > /-weI/) in any position may be absent, depend-
Shriberg, L. D., and Kwiatkowski, J. (1981). Phonological
disorders: I. A diagnostic classification system. Journal of ing on grammatical and semantic factors (Vaughn-
Speech and Hearing Disorders, 47, 226–241. Cooke, 1986). Consonants may also be reordered in
Shriberg, L. D., Tomblin, J. B., and McSweeny, J. L. (1999). some words (e.g., ask > /s/), and multiple words may
Prevalence of speech delay in 6-year-old children and be merged phonetically (e.g., fixing to > finna; sup-
comorbidity with language impairment. Journal of Speech, pose to > sposta) to function as separate words. These
Language, and Hearing Research, 42, 1461–1481. broadly predictable AAVE pronunciation patterns di¤er
enough from SE to compromise its intelligibility for
unfamiliar listeners. Intelligibility can be decreased fur-
Phonology: Clinical Issues in Serving ther by co-occurring dialect di¤erences in prosodic or
Speakers of African-American nonsegmental (rhythmic and vocal pitch) features (Tar-
Vernacular English one, 1975; Dejarnette and Holland, 1993), coupled with
known grammatical, semantic, and pragmatic ones.
Consider just the number of grammatical and phono-
Word pronunciation is an overt speech characteristic logical di¤erences between SE and AAVE in the follow-
that readily identifies dialect di¤erences among normal ing example:
speakers even when other aspects of their spoken lan-
guage do not. Although regional pronunciation di¤er- SE: They are not fixing to ask for the car /
ences in the United States were recognized historically, AAVE: They not finna ask for the car /
social dialects were not. Nonprestige social dialects in the ar nat fIksIn tu aesk fOr thP kaF /
particular were viewed simply as disordered speech. A deI na fInP 0ks fv dP ka: /
Phonology: Clinical Issues in Serving Speakers of African-American Vernacular English 159
Enough is known about the complex perceptual judg- or speech-language clinicians (Stockman and Settle,
ments of speech intelligibility to predict that the more 1991; Wilcox, 1996).
work listeners have to do to figure out what is being said, AAVE speakers with disorders can di¤er from their
the more likely is speech to be judged as unclear. nondisordered peers on speech sounds that are like SE
Identifying atypical AAVE speakers can be di‰cult, (Type I error, e.g., word-initial single and clustered
especially if known causes of disordered speech— consonants). They can also di¤er on sounds that are not
hearing loss, brain damage, and so on—are absent, as is like SE either qualitatively (Type II error, e.g., inter-
often the case. Clinicians must know a lot about the dental fricative substitutions) or quantitatively (Type
dialect to defend a diagnosis. But most clinicians (95%) III error, e.g., more frequent final consonant absence
are not African American and have little exposure to in abutting consonant sequences). Wolfram (1994) sug-
AAVE (Campbell and Taylor, 1992). Misdiagnosing gested that these three error categories provide a heu-
normal AAVE speakers as abnormal is encouraged ristic for scaling the severity of the pronunciation
further by the similarity of their typical pronunciation di‰culty and selecting targets for treatment.
patterns (e.g., final consonant deletion, cluster reduc- Two service delivery tracks are within the scope of
tion, and interdental fricative substitutions) to those practice for speech clinicians. One remediates atypical
commonly observed among immature or disordered speech relative to a client’s native dialect. The other one
SE speakers. However, typically developing African- expands the pronunciation patterns of normal speakers
American speakers make fewer errors on standardized who want to speak SE when AAVE is judged to be so-
articulation tests as they get older (Ratusnik and Koe- cially or professionally handicapping (Terrell and Ter-
nigsknecht, 1976; Simmons, 1988; Haynes and Moran, rell, 1983). For both client populations, e¤ective service
1989). Still, they make more errors than their pre- delivery requires clinician sensitivity to cultural factors
dominantly white, age-matched peers (Ratusnik and that impact (1) verbal and nonverbal interactions with
Koenigsknecht, 1976; Seymour and Seymour, 1981; clients, (2) selection of stimuli (e.g., games and objects)
Simmons, 1988; Cole and Taylor, 1990), and they do so for therapy activities, and (3) scheduling of sessions
beyond the age expected for developmental errors (Seymour, 1986; Proctor, 1994). However, the service
(Haynes and Moran, 1989). Therefore it is unknown delivery goals do di¤er for these two populations. For
whether the overrepresentation of African Americans in abnormal speakers, the goal is to eradicate and replace
clinical caseloads is due to practitioner ignorance, test existing patterns that decrease intelligible speech in the
bias, or an actual higher prevalence of speech disorders native dialect. This means that the pronunciation of
as a result of economic poverty and its associated risks bath/b0T/ as /baf/ should not be targeted for change, if
for development in all areas. it conforms to the client’s target dialect. But a deviation
The accuracy in identifying articulation/phonological from this expected pronunciation, such as bath/b0T/ >
disorders improves when test scores are adjusted for di- /b0t/ or /b0s/, is targeted, if observed at an age when
alect di¤erences (Cole and Taylor, 1990), or when the developmental errors are not expected. In contrast, the
pronunciation patterns for a child and caregiver are service delivery goal for normal AAVE speakers is to
compared on the same test words (Terrell, Arensberg, expand rather than eradicate the existing linguistic rep-
and Rosa, 1992). However, tests of isolated word pro- ertoire (Taylor, 1986). An additive approach assumes
nunciation are not entirely useful, even when nonstan- that speakers can learn to switch SE and AAVE codes as
dard dialect use is not penalized. They typically provide the communicative situation demands, just as bilingual
no contexts for sampling AAVE’s variable pronuncia- speakers switch languages. This means that a speaker’s
tion rules, which can cross word boundaries, as in the bidialectal repertoire includes both the SE and AAVE
case of final consonant absence. Although standardized pronunciation of ‘‘bath’’ (cf. bath > /baT/ and /b0f/).
deep tests of articulation (McDonald, 1968) do elicit Meeting these two di¤erent service goals requires at-
paired word combinations, they favor the sampling of tention to some issues that are not the same. They a¤ect
abutting consonant sequences (e.g., bus fish), which pe- which patterns are targeted and how change is facili-
nalize AAVE speakers even more, given their tendency tated. For typical AAVE speakers learning SE, second
to delete final consonants that precede other consonants language acquisition principles are relevant. Besides the
as opposed to vowels (Stockman, 1993, 1996b). These production practice, service delivery requires contrastive
issues have encouraged the use of criterion-referenced analysis of the two dialects and attention to sociocultural
evaluations of spontaneous speech samples for assess- issues that a¤ect code switching. Correct or target pro-
ment (see Stockman, 1996a, and Schraeder et al., 1999). ductions are judged relative to SE.
Despite the assessment challenges, it is readily In contrast, for speakers with abnormal pronuncia-
agreed that some AAVE speakers do have genuine tion, AAVE should be targeted. Which features to target
phonological/articulatory disorders (Seymour and Sey- in therapy and how to model the input become issues,
mour, 1981; Taylor and Peters, 1986). They di¤er from because most clinicians do not speak AAVE. They also
typically developing community peers in both the fre- may resist modeling a low social prestige dialect because
quency and patterning of speech sound error. This is true of negative social attitudes towards it. Wolfram (1994)
whether the clinical and nonclinical groups are distin- reminded us that AAVE and SE share many of the same
guished by the judgments of community informants, target features (e.g., most word-initial consonants).
such as Head Start teachers (Bleile and Wallach, 1992), Errors on shared features (Type I) should be targeted
other classroom teachers (Washington and Craig, 1992), first in treatment. They are likely to impair intelligibility
160 Part II: Speech
even more than the smaller sets of qualitative (Type II) English. In O. Taylor (Ed.), Treatment of communication
errors, such as stop replacement of interdental fricatives disorders in culturally and linguistically diverse populations
(cf. this /dIs/ > /bIs/), or quantitative (Type III) errors, (pp. 153–178). San Diego, CA: College-Hill Press.
such as final consonant deletion in more than the allow- Seymour, H., and Seymour, C. (1981). Black English and
Standard American English contrasts in consonantal devel-
able context number and types. AAVE features should
opment for four- and five-year-old children. Journal of
be targeted for treatment only when pronunciation pat- Speech and Hearing Disorders, 46, 276–280.
terns di¤er from AAVE norms. Articulatory patterns Simmons, J. O. (1988). Fluharty Preschool and Language
would not be modified if they di¤ered from the clin- Screening Test: Analysis of construct validity. Journal of
ician’s SE-modeled pattern but matched expected AAVE Speech and Hearing Disorders, 53, 168–174.
patterns. Stockman, I. (1991, November). Constraints on final consonant
Legitimizing social dialects like AAVE in the United deletion in Black English. Poster presented at the annual
States has required researchers and clinicians to (1) convention of the American Speech-Language-Hearing As-
broaden the reference point for normalcy and (2) explore sociation, Atlanta, GA.
alternative strategies for identifying service needs and Stockman, I. (1993). Variable word initial and medial conso-
nant relationships in children’s speech sound articulation.
modifying word pronunciation. The issues singled out in
Perceptual and Motor Skills, 76, 675–689.
this entry are not unique to phonological/articulatory Stockman, I. (1996a). The promises and pitfalls of language
problems. However, given their typically higher fre- sample analysis as an assessment tool for linguistic mi-
quency of occurrence relative to other domains of nority children. Language, Speech, and Hearing Services in
spoken language in all groups, they may turn up more Schools, 27, 355–366.
often in clinical work. Stockman, I. (1996b). Phonological development and disorders
See also dialect speakers; dialect versus disorder; in African American children. In A. Kamhi, K. Pollock,
language disorders in african-american children. and J. Harris (Eds.), Communication development and dis-
orders in African American children: Research, assessment
—Ida J. Stockman and intervention. Baltimore: Paul H. Brookes.
Stockman, I., and Settle, S. (1991, November). Initial con-
References sonants in young black children’s conversational speech.
Poster presented at the annual convention of the American
Bleile, K., and Wallach, H. (1992). A sociolinguistic investiga- Speech-Language-Hearing Association, Atlanta, GA.
tion of the speech of African-American preschoolers. Tarone, E. (1975). Aspects of intonation in Black English.
American Journal of Speech-Language Pathology, 1, 54–62. American Speech, 48, 29–36.
Campbell, L., and Taylor, O. (1992). ASHA certified speech- Taylor, O. (1986). Teaching English as a second dialect. In O.
language pathologists: Perceived competency levels with se- Taylor (Ed.), Treatment of communication disorders in cul-
lected skills. Howard Journal of Communication, 3, 163–176. turally and linguistically diverse populations (pp. 153–178).
Cole, P., and Taylor, O. (1990). Performance of working-class San Diego, CA: College-Hill Press.
African-American children on three tests of articulation. Taylor, O., and Peters, C. (1986). Speech and language dis-
Language, Speech, and Hearing Services in Schools, 21, orders in blacks. In O. Taylor (Ed.), The nature of commu-
171–176. nication disorders in culturally and linguistically diverse
Dejarnette, G., and Holland, W. (1993). Voice and voice dis- populations (pp. 157–180). San Diego, CA: College-Hill
orders. In D. Battle (Ed.), Communication disorders in mul- Press.
ticultural populations (pp. 212–238). Boston: Andover. Terrell, S., Arensberg, K., and Rosa, M. (1992). Parent-child
Haynes, W., and Moran, M. (1989). A cross-sectional devel- comparative analysis: A criterion-referenced method for the
opmental study of final consonant production in southern nondiscriminatory assessment of a child who spoke a rela-
black children from preschool through third grade. Lan- tively uncommon dialect of English. Language, Speech, and
guage, Speech, and Hearing Services in Schools, 20, 400–406. Hearing Services in Schools, 23, 34–42.
McDonald, E. (1968). The Screening Deep Test of Articulation. Terrell, S., and Terrell, F. (1983). E¤ects of speaking Black
Pittsburgh, PA: Stanwix House. English on employment opportunities. ASHA, 25, 27–
Moran, M. (1993). Final consonant deletion in African Amer- 29.
ican children speaking Black English: A closer look. Vaughn-Cooke, F. (1986). Lexical di¤usion: Evidence from a
Language, Speech, and Hearing Services in Schools, 24, decreolizing variety of Black English. In M. Montgomery
161–166. and G. Bailey (Eds.), Language variety in the South (pp.
Proctor, A. (1994). Phonology and cultural diversity. In R. J. 111–130). Tuscaloosa: University of Alabama Press.
Lowe (Ed.), Phonology: Assessment and intervention. Appli- Washington, J., and Craig, H. (1992). Articulation test per-
cations in speech pathology (pp. 207–245). Baltimore: Wil- formances of low-income, African-American preschoolers
liams and Wilkins. with communication impairments. Language, Speech, and
Ratusnik, D., and Koenigsknecht, R. (1976). Influence of age Hearing Services in Schools, 23, 203–207.
on black preschoolers’ nonstandard performance of certain Wilcox, D. L. (1996). Distinguishing between phonological dif-
phonological and grammatical forms. Perceptual and Motor ference and disorder in children who speak African-American
Skills, 42, 199–206. English. Unpublished master’s thesis, Indiana University,
Schraeder, P., Quinn, M., Stockman, I., and Miller, J. (1999). Bloomington.
Authentic assessment as an approach to preschool speech- Wolfram, W. (1994). The phonology of a sociocultural variety:
language screening. American Journal of Speech-Language The case of African American vernacular English. In J.
Pathology, 8, 95–200. Bernthal and N. Bankson (Eds.), Child phonology: Charac-
Seymour, H. (1986). Clinical principles for language interven- teristics, assessment, and intervention with special popula-
tion for language disorders among nonstandard speakers of tions (pp. 227–244). New York: Thieme.
Psychosocial Problems Associated with Communicative Disorders 161
Further Readings Vaughn-Cooke, F. (1987). Are black and white vernaculars

diverging? American Speech, 62, 12–32.
Bailey, G. (1999). Phonological characteristics of African Wolfram, W. (1989). Structural variability in phonological
American Vernacular English. In S. S. Mufwene, J. R. development: Final nasals in vernacular Black English. In
Rickford, G. Bailey, and J. Baugh (Eds.), Structure of Af- R. Fasold and D. Schi¤ren (Eds.), Current issues in linguis-
rican American vernacular English. New York: Routledge. tic theory: Language change and variation (pp. 301–332).
Battle, D. E. (1998). Communication disorders in multicultural Amsterdam: John Benjamins.
populations. Boston: Butterworth-Heinemann. Wolfram, W., and Schilling-Estes, N. (1998). American
Baugh, J. (1983). Black street speech: Its history, structure and English: Dialects and variation. Malden, MA: Blackwell
survival. Austin, TX: University of Texas Press. Publishers.
Dillard, J. L. (1972). Black English: Its history and usage in the
United States. New York: Random House.
Fasold, R. W., and Shuy, R. W. (1970). Teaching Standard Psychosocial Problems Associated with
English in the inner city. Washington, DC: Center for Ap-
plied Linguistics. Communicative Disorders
Fasold, R. W. (1981). The relation between black and white
speech in the South. American Speech, 56, 163–189.
Fudala, J. B. (1974). Arizona Articulation Proficiency Scale Individuals who study communicative disorders have
(2nd ed.). Los Angeles: Western Psychological Services. long been interested in the psychosocial di‰culties asso-
Goldman, R., and Fristoe, M. (1986). Goldman-Fristoe Test ciated with these problems. This interest has taken dif-
of Articulation. Circle Pines, MN: American Guidance ferent faces over the years as researchers and clinicians
Service. have focused on various aspects of the relationship
Labov, W. (1972). Language in the inner city. Philadelphia: between communicative impairment and psychological
University of Pennysylvania Press. and social di‰culties. For example, relatively early in
Luelsdor¤, P. A. (1975). A segmental phonology of Black En- the development of the profession of speech-language
glish. The Hague: Mouton. pathology, some investigators approached specific com-
Mowrer, D., and Burger, S. (1991). A comparative analysis of
phonological acquisition of consonants in the speech of 2 1/2-
municative disorders, such as stuttering, as manifes-
to 6-year-old Xhosa- and English-speaking children. Clini- tations of underlying psychological dysfunction (e.g.,
cal Linguistics and Phonetics, 5, 139–164. Travis, 1957). More recent approaches have moved
Myers-Jennings, C. C. (2000). Phonological disorders in cul- away from considering psychiatric dysfunction as the
turally diverse populations. In T. J. Coleman (Ed.), Clinical basis for most speech and language impairment (an ex-
management of communication disorders in culturally diverse ception is alexithymia). Despite this reorientation, there
children (pp. 173–196). Boston: Allyn and Bacon. is still considerable interest in the psychosocial aspects of
Newman, P. W., and Craighead, N. A. (1989). Assessment of communicative disorders. The literature is both exten-
articulatory and phonological disorders. In N. Craighead, sive and wide-ranging, and much of it focuses on specific
P. Newman, and W. Secord (Eds.), Assessment and reme- types of impairment (e.g., stuttering, language impair-
diation of articulatory and phonological disorders. Colum-
ment). There are two general areas of study, however,
bus, OH: Merrill.
Seymour, H., Green, L., and Huntley, R. (1991). Phonological that are of particular interest. The first is the frequent
patterns in the conversational speech of African-American co-occurrence of speech and language impairment and
Children. Poster presented at the national convention of the socioemotional problems. A great deal of research has
American Speech-Language-Hearing Association, Atlanta, been directed toward exploring this relationship as well
GA. as toward determining what mechanisms might underlie
Seymour, H., and Ralabate, P. (1985). The acquisition of a this comorbidity. A second area of interest concerns the
phonologic feature of Black English. Journal of Communi- long-term outcomes of communicative problems across
cation Disorders, 18, 139–148. various areas of psychosocial development (e.g., peer
Stockman, I., and Stephenson, L. (1981). Children’s articula- relations, socioemotional status). Both of these lines of
tion of medial consonant clusters: Implications for syllabi-
fication. Language and Speech, 24, 185–204.
work are briefly discussed here.
Taylor, O. L. (Ed.). (1986). Nature of communication disorders
in culturally and linguistically diverse populations. San Di- Co-occurrence of Disorders. Numerous investigators
ego, CA: College-Hill Press. have reported a high level of co-occurrence between
Taylor, O. (Ed.). (1986). Treatment of communication disorders communicative disorders and socioemotional problems.
in culturally and linguistically diverse populations. San Di- This high level of co-occurrence has been observed in
ego, CA: College-Hill Press. various groups of children, including both those with a
Tull, B. M. (1973). Analysis of selected prosodic features in the primary diagnosis of speech and language impairment
speech of black and white children. Unpublished dissertation, and those with a primary diagnosis of psychiatric im-
Ohio State University, Columbus. pairment or behavior disorder. Illustrative of these find-
Van Keulen, J. E., Weddington, G. T., and Debose, C. E.
(1998). Speech, language, learning and the African American
ings is the work of Baker and Cantwell (1987). These
child. Boston: Allyn and Bacon. researchers performed psychiatric evaluations on 600
Vaughn-Cooke, F. (1976). The implementation of a phono- consecutive patients seen at a community speech, lan-
logical change: The case for re-syllabification in Black guage, and hearing clinic. Children were divided into
English. Dissertation Abstracts International, 38(01), 234A. three subgroups of communication problems: speech
(University Microfilms No. AAC7714537) (children with disorders of articulation, voice, and
162 Part II: Speech
fluency), language (children with problems in language The work of Beitchman and colleagues provides one
expression, comprehension, and pragmatics), and a example of a research program examining long-term
speech and language group (children with a mixture of psychosocial outcomes of individuals with communica-
problems). Of these children, approximately 50% were tive impairment. These researchers followed children
diagnosed as having a psychiatric disorder. These prob- with speech impairment and language impairment and
lems were categorized into two general groups of be- their typical controls longitudinally over a 14-year pe-
havior disorder and emotional disorder. riod (Beitchman et al., 2001). At age 5, the children in
Several researchers have speculated on the basis for the group with speech impairment and the group with
this high level of co-occurrence between communication language impairment had a higher rate of behavioral
and socioemotional disorders. For example, Beitchman, problems than the control group. At age 12, socioemo-
Brownlie, and Wilson (1996) proposed several potential tional status was closely linked to status at age 5. At
relationships, including the following: (1) impaired com- age 14 years and at age 19 years, individuals in the group
municative skills lead to socioemotional impairment, (2) with language impairment had significantly higher rates
impaired communicative skills result in academic prob- of psychiatric involvement than the control group. Chil-
lems, which in turn lead to behavioral problems, (3) dren in the group with speech impairment did not di¤er
other variables (e.g., socioeconomic status) explain, in from the controls.
part or in whole, the relationship between communica- A few studies have examined the long-term psycho-
tive problems and socioemotional di‰culties, and (4) social outcomes of individuals with speech and/or lan-
an underlying factor (e.g., neurodevelopmental status) guage impairment as they enter adulthood. For example,
accounts for both types of problems. Records, Tomblin, and Freese (1992) examined quality
Further research is needed to clarify the relationship of life in a group of 29 young adults (mean age, 21.6
between speech and language ability and socioemotional years) with specific language impairment and 29 con-
status. One approach to this problem has been to inves- trols. The groups did not significantly di¤er on reported
tigate various child factors that may contribute to personal happiness or life satisfaction. Additionally, dif-
developmental risk. For example, Tomblin et al. (2000) ferences were not observed with respect to satisfaction in
reported that reading disability is a key mediating factor relation to specific aspects of life, such as employment or
predicting whether children with language impairment social relationships.
demonstrate behavioral di‰culties. Howlin, Mawhood, and Rutter (2000) reported a
Of particular interest is the relationship between so- bleaker picture. They reexamined two groups of young
cial competence, communicative competence, and soci- men, 23–24 years of age, who had first been evaluated at
oemotional functioning. It is clear that speech and 7–8 years of age. One group was identified with autism
language skills play a critical role in social interaction and the other with language impairment. At follow-up,
and that children who have di‰culty communicating the group with language impairment showed fewer social
are likely to have di‰culty interacting with others. The and behavioral problems than the group with autism.
way in which various components of behavior interact, The two groups had converged over the years, however,
however, is not as straightforward as might initially be and di¤erences between the two were not qualitative.
thought. For example, Fujiki et al. (1999) found that The young men with language impairment showed a
children with language impairment were more with- high incidence of social di‰culties, including problems
drawn and less sociable than their typical peers, consis- with social interaction, limited social contacts, and di‰-
tent with much of the existing literature. More specific culty establishing friendships. Most still lived with their
evaluation revealed that these di¤erences were based on parents and had unstable employment histories in man-
particular types of withdrawal (reticence, solitary active ual or unskilled jobs. Neither childhood language ability
withdrawal). Further, severity of language impairment, nor current language ability predicted social functioning
at least as measured by a formal test of language, was in adulthood. Howlin et al. (2000) concluded that in
not related to severity of withdrawal. Further clarifica- language impairment, ‘‘as in autism, a broader deficit
tion is needed to determine how these areas of devel- underlies both the language delay and the social impair-
opment interact to produce social outcomes, and what ments’’ (p. 573).
factors may exacerbate or moderate socioemotional Given some of the data cited above, it would appear
status. that children with speech di‰culties achieve better
psychosocial outcomes than children with language dif-
Long-Term Consequences. A related line of work has ficulties (see also Toppelberg and Shapiro, 2000). Al-
focused on the long-term psychosocial and socio- though this may generally be the case, generalizations
behavioral consequences of speech and language im- across individuals with di¤erent types of speech impair-
pairment. In summarizing numerous studies looking at ment must be made with caution. Some types of speech
the outcomes of communication disorders, Aram and problems, such as stuttering, are likely to have impor-
Hall (1989) stated that children with language impair- tant psychosocial implications, but also have relatively
ment have frequently been found to have high rates of low incidence rates. Thus, in large group design studies
persistent social and behavioral problems. Children with where individuals are categorized together under the
speech impairment tend to have more favorable long- general heading of ‘‘speech,’’ the unique psychosocial
term outcomes. di‰culties associated with such disorders may be masked
Psychosocial Problems Associated with Communicative Disorders 163
by the psychosocial profiles associated with more com- atric outcome. Journal of the American Academy of Child
monly occurring communication problems. and Adolescent Psychiatry, 40, 75–82.
It should also be noted that speech impairments may Fujiki, M., Brinton, B., Morgan, M., and Hart, C. H. (1999).
vary from having no outward manifestations aside from Withdrawn and sociable behavior of children with spe-
cific language impairment. Language, Speech, and Hearing
those involved in talking to relatively severe physical or
Services in Schools, 30, 183–195.
cognitive deficits. The impact of associated problems on Howlin, P., Mawhood, L., and Rutter, M. (2000). Austism and
the psychosocial development of children with di¤ering developmental receptive language disorder: A follow-up
types of communicative impairment is di‰cult to sum- comparison in early adult life. II. Social, behavioural, and
marize briefly. Illustrative of the complexity even within psychiatric outcomes. Journal of Child Psychology and Psy-
a specific category of speech impairment are children chiatry and Allied Disciplines, 41, 561–578.
with cleft lip and palate. These children may have artic- Richman, L. C., and Eliason, M. J. (1992). Disorders of com-
ulation problems and hypernasality secondary to specific munication: Developmental language disorders and cleft
physical anomalies. These physical anomalies may be palate. In C. E. Walker and M. C. Roberts (Eds.), Hand-
resolved, to various degrees, with surgery. Speech may book of clinical child psychology (2nd ed., pp. 537–552).
New York: Wiley.
also vary considerably. No specific personality type has
Richman, L. C., and Millard, T. (1997). Brief report: Cleft lip
been associated with children with cleft palate (Richman and palate: Longitudinal behavior and relationships of cleft
and Eliason, 1992). Individual studies, however, have conditions to behavior and achievement. Journal of Pediat-
found these children to exhibit higher than expected ric Psychology, 22, 487–494.
rates of both internalizing and externalizing behavior Records, N. L., Tomblin, J. B., and Freese, P. R. (1992). The
(Richman and Millard, 1997). It appears that factors quality of life of young adults with histories of specific lan-
such as family support, degree of disfigurement, and self- guage impairment. American Journal of Speech-Language
appraisal interact in complex ways to produce psycho- Pathology, 1, 44–53.
social outcomes in children with cleft lip and palate. Tomblin, J. B., Zhang, X., Buckwalter, P., and Catts, H.
In summary, it is clear that individuals with commu- (2000). The association of reading disability, behavioral
disorders, and language impairment among second-grade
nicative disorders often have di‰culty with aspects of
children. Journal of Child Psychology and Psychiatry, 41,
psychosocial behavior and that these problems can have 475–482.
long-term implications. There is also evidence that chil- Toppelberg, C. O., and Shapiro, T. (2000). Language dis-
dren with language impairment have more psychosocial orders: A 10-year research update review. Journal of the
di‰culties than children with speech impairment. It must American Academy of Child and Adolescent Psychiatry, 39,
be remembered, however, that speech problems di¤er by 143–152.
type of impairment, severity, and other variables. Thus, Travis, L. E. (1957). The unspeakable feelings of people with
generalizations must be made with caution. Given the special reference to stuttering. In L. E. Travis (Ed.), Hand-
accumulated evidence, there is good reason to believe book of speech pathology (pp. 916–946). New York:
that parents, educators, and clinicians working with Appleton-Century-Crofts.
children with speech and language impairment should
give serious consideration to psychosocial status in
planning a comprehensive intervention program.
Further Readings
See also poverty: effects on language; social de- Aram, D. M., Ekelman, B., and Nation, J. (1984). Preschoolers
velopment and language impairment. with language disorders: 10 years later. Journal of Speech
—Martin Fujiki and Bonnie Brinton Baker, L., and Cantwell, D. P. (1982). Psychiatric disorder in
children with di¤erent types of communication disorders.
References Journal of Communication Disorders, 15, 113–126.
Baker, L., and Cantwell, D. P. (1987). A prospective psychiat-
Aram, D. M., and Hall, N. E. (1989). Longitudinal follow-up ric follow-up of children with speech/language disorders.
of children with preschool communication disorders: Treat- Journal of the American Academy of Child and Adolescent
ment implications. School Psychology Review, 18, 487– Psychiatry, 26, 546–553.
501. Beitchman, J. H., Cohen, N. J., Konstantareas, M. M., and
Baker, L., and Cantwell, D. P. (1987). Comparison of well, Tannock, R. (1996). Language, learning, and behavior dis-
emotionally disordered, and behaviorally disordered chil- orders: Developmental, biological, and clinical perspectives.
dren with linguistic problems. Journal of the American New York: Cambridge University Press.
Academy of Child Adolescent Psychiatry, 26, 193–196. Beitchman, J. H., Nair, R., Clegg, M. A., Ferguson, B., and
Beitchman, J. H., Brownlie, E. B., and Wilson, B. (1996). Lin- Patel, P. G. (1986). Prevalence of psychiatric disorders in
guistic impairment to psychiatric disorder: Pathways to children with speech and language disorders. Journal of the
outcome. In J. H. Beitchman, N. Cohen, M. M. Kon- American Academy of Child Psychiatry, 25, 528–535.
stantareas, and R. Tannock (Eds.), Language, learning, and Beitchman, J. H., Wilson, B., Brownlie, B., Walters, H., Inglis,
behavior disorders: Developmental, biological, and clinical A., and Lancee, W. (1996). Long-term consistency in
perspectives (pp. 493–514). New York: Cambridge Univer- speech/language profiles: II. Behavioral, emotional, and
sity Press. social outcomes. Journal of the American Academy of Child
Beitchman, J. H., Wilson, B., Johnson, C. J., Atkinson, L., and Adolescent Psychiatry, 35, 815–825.
Young, A., Adlaf, E., et al. (2001). Fourteen-year follow-up Benasich, A. A., Curtiss, S., and Tallal, P. (1993). Lan-
of speech/language-impaired and control children: Psychi- guage, learning behavioral disturbances in childhood: A
164 Part II: Speech
longitudinal perspective. Journal of the American Academy sion in a client for whom traditional response modes,
of Child and Adolescent Psychiatry, 32, 585–594. such as speaking or even pointing, are not possible.
Cantwell, D., and Baker, L. (1988). Clinical significance of Computers can also be used to administer or score a
childhood communication disorders: Perspectives from a formal test (Cochran and Masterson, 1995; Hallowell
longitudinal study. Journal of Child Neurology, 2, 257–264.
and Katz, 1999; Long, 1999). Computer-based scoring
Cochrane, V. M., and Slade, P. (1999). Appraisal and coping in
adults with cleft lip: Associations with well-being and social systems allow the input of raw scores, which are then
anxiety. British Journal of Medical Psychology, 72, 485–503. converted to profiles or derived scores of interest (Long,
Cohen, N. J., Menna, R., Vallance, D., Barwick, M. A., Im, 1999). The value of such programs is inversely related to
N., and Horodezky, N. B. (1998). Language, social cogni- the ease of obtaining the derived scores by hand. If the
tive processing, and behavioral characteristics of psychiat- translation of raw scores to derived scores is tedious and
rically disturbed children with previously identified and time-consuming, clinicians might find the software tools
unsuspected language impairments. Journal of Child Psy- worth their investment in time and money.
chology and Psychiatry and Allied Disciplines, 39, 853–864. Although few computerized tests are currently avail-
Felsenfeld, S., Broen, P. A., and McGue, M. (1992). A 28-year able, the potential for such instruments is quite high.
follow-up of adults with a history of moderate phonolog-
Hallowell and Katz (1999) point out that computerized
ical disorder: Linguistic and personality results. Journal of
Speech and Hearing Research, 35, 1114–1125. test administration could allow tighter standardization
Johnson, C. L., Beitchman, J. H., Young, A., Escobar, M., of administration conditions and procedures, tracking of
Atkinson, L., Wilson, B., et al. (1999). Fourteen year follow- response latency, and automated interfacing with alter-
up of children with and without speech/language impair- native response mode systems. Of particular promise are
ments: Speech/language stability and outcomes. Journal of the computerized tests that adapt to a specific client’s
Speech, Language, and Hearing Research, 42, 744–760. profile. That is, stimuli are presented in a manner that is
Mawhood, L., Howlin, P., and Rutter, M. (2000). Autism and contingent on the individual’s prior responses (Letz,
developmental receptive language disorder: A comparative Green, and Woodard, 1996). The type of task or specific
follow-up in early adult life. I. Cognitive and language items that are administered can be automatically deter-
outcomes. Journal of Child Psychology and Psychiatry and
mined by a client’s ongoing performance (e.g., Master-
Allied Disciplines, 41, 547–559.
Noterdaeme, M., and Amorosa, H. (1999). Evaluation of son and Bernhardt, 2001), which makes individualized
emotional and behavioral problems in language impaired assessment more feasible than ever. Incorporation of
children using the Child Behavior Checklist. European Child some principles from artificial intelligence also makes the
and Adolescent Psychiatry, 8, 71–77. future of computers in assessment exciting. For example,
Paul, R., and Kellogg, L. (1997). Temperament in late talkers. Masterson, Apel, and Wasowicz (2001) developed a tool
Journal of Child Psychology and Psychiatry and Allied Dis- for spelling assessment that employs complex algorithms
ciplines, 38, 803–811. for parsing spelling words into target orthographic
Rapin, I. (1996). Practitioner review: Developmental language structures and then aligning a student’s spelling with the
disorder: A clinical update. Journal of Child Psychology and appropriate correct forms. Based on the type of mis-
Psychiatry, 6, 643–655.
spellings exhibited by each individual student, the sys-
tem identifies related skills that need testing, such as
phonological awareness or morphological knowledge.
Speech and Language Disorders in This system makes possible a comprehensive description
Children: Computer-Based Approaches of a student’s spelling abilities that would otherwise be
prohibitive because of the time required to perform the
analyses by hand and administer the individualized
Computers can be used e¤ectively in the assessment follow-ups.
of children’s speech and language. Biofeedback instru- Computerized language and phonological sample
mentation allows the clinician to obtain relatively objec- analysis (CL/PSA) has been in use since the 1980s
tive measures of certain aspects of speech production. (Evans and Miller, 1999; Long, 1999; Masterson and
For example, measures of jitter and shimmer can be Oller, 1999; Long and Channell, 2001). These programs
recorded, along with perceptual judgments about a cli- allow researchers and clinicians to perform complex, in-
ent’s pitch and intensity perturbations (Case, 1999). depth analyses that would likely be impossible without
Acoustic analyses (Kent and Read, 1992) can be used to the technology. They provide instant analysis of a wide
supplement the clinician’s perceptions of phonological range of phonological and linguistic measures, and some
contrasts (Masterson, Long, and Buder, 1998). For the provide tools that reduce and simplify the time-consum-
evaluation of a client suspected of having a fluency dis- ing process of transcribing samples (Long, 1999). Many
order, recent software developments allow the clinician of the CL/PSA programs also include comparison data-
to gather measures of both the number and type of bases of language samples from both typical and clinical
speech disfluencies and to document signs of e¤ort, populations (Evans and Miller, 1999). Despite the power
struggle, or disruption of airflow and phonation (Bak- of CL/PSA programs, their use in clinical settings re-
ker, 1999a). Hallowell (1999) discusses the use of instru- mains limited, for unclear reasons. It is possible that
mentation for detecting and measuring eye movements funding for software and hardware is insu‰cient; how-
for the purpose of comprehension assessment. This ever, data from recent surveys (McRay and Fitch, 1996;
exciting tool allows the clinician to evaluate comprehen- ASHA, 1997) do not support this conjecture, since most
Speech and Language Disorders in Children: Computer-Based Approaches 165
respondents do report owning and using computers for Computers add a new twist to an old standard in
other purposes. Lack of use is more likely related to in- phonological treatment. Instead of having to sort and
su‰cient familiarity with many of the measures derived carry numerous picture cards from one treatment ses-
from language sample analysis and failure to recognize sion to the next, clinicians can choose one of several
the benefits of these measures for treatment planning software packages that allow access and display of
(Cochran and Masterson, 1995; Fitch and McRay, multimedia stimuli on the basis of phonological charac-
1997). In an e¤ort to address this problem, Long estab- teristics (Masterson and Rvachew, 1999). New tech-
lished the Computerized Profiling Website (http:// nologies, such as the palatometer, provide clients with
www.computerizedprofiling.org) in 1999. Clinicians can critical feedback for sound production when tactile or
visit the web site and obtain free versions of this CL/ kinesthetic feedback has not been su‰cient. Similarly,
PSA software as well as instructional materials regarding computer programs can be used to provide objective
its use and application. feedback regarding the frequency of stutterings, which
Computer software for use in speech and language might be considered less confrontational than feedback
intervention has progressed significantly from the early provided by the clinician (Bakker, 1999b). One particu-
versions, which were based primarily on a drill-and- larly promising technology, the Speech Enhancer, incor-
practice format. Cochran and Nelson (1999) cite litera- porates real-time processing of an individual’s speech
ture that confirms what many clinicians knew intuitively: production and selectively boosts energy only in those
software that allows the child to be in control and to frequencies necessary for maximum intelligibility. Car-
independently explore based on personal interests is iski and Rosenbek (1999) collected data from a single
more beneficial than computer programs based on the subject and found that intelligibility scores were higher
drill-and-practice model. Improvements in multimedia when using the Speech Enhancer than when using a
capacities and an appreciation for maximally e¤ective high-fidelity amplifier. The authors suggested that their
designs have resulted in a proliferation of software results supported the notion that the device did indeed
packages that can be e¤ectively used in language inter- do more than simply amplify the speech output.
vention with young children. As with any tool, the focus The decision to use computers in both assessment and
must remain on the target linguistic structures rather treatment activities will continue to be based on the
than the toys or activities that are used to elicit or model clinician’s judgment as to the added value of the tech-
productions. In addition to therapeutic benefits, com- nology application. If a clinician can do an activity just
puters o¤er reasonable compensatory strategies for as well without a computer, it is unlikely that she or he
older, school-age students with language-learning dis- will go to the expense in terms of time and money to in-
abilities (Wood and Masterson, 1999; Masterson, Apel, vest in the computer tool. On the other hand, for those
and Wood, 2002). For example, word processors with tasks that cannot be done as well or even at all, clinicians
text-to-speech capabilities allow students to check their will likely turn to the computer if they are convinced that
own work by listening to as well as reading their text. the tasks themselves are worth it.
Spell and grammar checkers can be helpful, as long as See also aphasia treatment: computer-aided
students have been su‰ciently trained in the optimal use rehabilitation.
of these tools, including an appreciation of their limita-
tions. Speech recognition systems continue to improve, —Julie J. Masterson
and perhaps someday they will free writers with lan-
guage disorders from the burden of text entry, which References
requires choices regarding spelling, and spelling can be
so challenging for students with language disorders that American Speech-Language-Hearing Association. (1997). Om-
it interferes with text construction. Currently, speech nibus Survey Results: 1997 edition. Rockville, MD: Author.
recognition technology remains limited in recognition Bakker, K. (1999a). Clinical technologies for the reduction of
stuttering and enhancement of speech fluency. Seminars in
accuracy for students with language disorders (Wetzel, Speech and Language, 20, 271–280.
1996). Even when accuracy improves to an acceptable Bakker, K. (1999b). Technical solutions for quantitative and
level, students will still need specific training in the opti- qualitative assessments of speech fluency. Seminars in
mal use of the technology. Optimal writing involves Speech and Language, 20, 185–196.
more than a simple, direct translation of spoken lan- Cariski, D., and Rosenbek, J. (1999). The e¤ectiveness of the
guage to written form. Students who employ speech rec- Speech Enhancer. Journal of Medical Speech-Language Pa-
ognition software to construct written texts will need thology, 7, 315–322.
focused instruction regarding the di¤erences between Case, J. L. (1999). Technology in the assessment of voice dis-
the styles of spoken and written language. Finally, the order. Seminars in Speech and Language, 20, 169–184.
Internet provides not only a context for language inter- Cochran, P., and Masterson, J. (1995). Not using a computer
in language assessment/intervention: In defense of the re-
vention, but a potential source of motivation as well. luctant clinician. Language, Speech, and Hearing Services in
The percentage of school-age children who use the Schools, 26, 260–262.
Internet on a daily basis for social as well as academic Cochran, P. S., and Nelson, L. K. (1999). Technology applica-
purposes continues to increase, and it is likely that tions in intervention for preschool-age children with langu-
speech-language pathologists will capitalize on this age disorders. Seminars in Speech and Language, 20, 203–
trend. 218.
166 Part II: Speech
Evans, J. L., and Miller, J. (1999). Language sample analysis in Case, J. L. (1999). Technology in the treatment of voice dis-
the 21st century. Seminars in Speech and Language, 20, orders. Seminars in Speech and Language, 20, 281–295.
101–116. Farrall, J. L., and Parsons, C. L. (1992). A comparison of a
Fitch, J. L., and McRay, L. B. (1997). Integrating technology traditional test format vs. a computerized administration of
into school programs. Language, Speech, and Hearing the Carrow-Woolfolk Test for Auditory Comprehension
Services in Schools, 28, 134–136. of Language. Australian Journal of Human Communication
Hallowell, B. (1999). A new way of looking at auditory Disorders, 20, 33–48.
linguistic comprehension. In Becker, W., Deubel, H., and Fitch, J. L., and McRay, L. B. (1997). Integrating technology
Mergner, T. (Eds.). Current oculomotor research: Physio- into school programs. Language, Speech, and Hearing
logical and psychological aspects (pp. 287–291). New York: Services in Schools, 28, 134–136.
Plenum Press. Friel-Patti, S., DesBarres, K., and Thibodeau, L. (2001). Case
Hallowell, B., and Katz, R. C. (1999). Technological applica- studies of children using Fast ForWord. Journal of Speech-
tions in the assessment of acquired neurogenic communica- Language Pathology, 10, 203–215.
tion and swallowing disorders in adults. Seminars in Speech Jamieson, D. G., and Rvachew, S. (1992). Remediation of
and Language, 20, 149–167. speech production errors with sound identification training.
Kent, R., and Read, C. (1992). The acoustic analysis of speech. Journal of Speech-Language Pathology and Audiology, 16,
San Diego, CA: Singular Publishing Group. 201–210.
Letz, R., Green, R. C., and Woodard, J. L. (1996). Develop- Katz, R. C., and Hallowell, B. (1999). Technological applica-
ment of a computer-based battery designed to screen adults tions in the treatment of acquired neurogenic communica-
for neuropsychological impairment. Neurotoxicology and tion and swallowing disorders in adults. Seminars in Speech
Teratology, 18, 365–370. and Language, 20, 251–269.
Long, S. H. (1999). Technology applications in the assessment Long, S. H., and Channell, R. W. (2001). Accuracy of four
of children’s language. Seminars in Speech and Language, language analysis procedures performed automatically.
20, 117–132. American Journal of Speech-Language Pathology, 10, 180–
Long, S. H., and Channell, R. W. (2001). Accuracy of four 188.
language analysis procedures performed automatically. Long, S. H., Fey, M. E., and Channell, R. W. (1998). Com-
American Journal of Speech-Language Pathology, 10, puterized Profiling (CP) (Version 9.0) (MS-DOS)
180–188. [Computer program]. Cleveland, OH: Department of Com-
Masterson, J., Apel, K., and Wasowicz, J. (in press). Spelling munication Sciences, Case Western Reserve University.
Evaluation for Language and Literacy (SPELL) [computer MacWhinney, B. (1998). The CHILDES project: Computa-
software]. Evanston, IL: Learning by Design. tional tools for analyzing talk. Mahwah, NJ: Erlbaum.
Masterson, J., Apel, K., and Wood, L. (2002). Linking soft- Masterson, J., and Crede, L. (1999). Learning to spell: Impli-
ware and hardware applications to what we know about cations for assessment and intervention. Language, Speech,
literacy development. In K. Butler and E. Silliman (Eds.), and Hearing Services in Schools, 30, 243–254.
Speaking, reading, and writing in children with language- Masterson, J., and Perrey, C. (1999). Training analogical rea-
learning disabilities: New paradigms for research and prac- soning skills in children with language disorders. American
tice (pp. 273–293). Mahwah, NJ: Erlbaum. Journal of Speech-Language Pathology, 8, 53–61.
Masterson, J., and Bernhardt, B. (2001). Computerized Ar- Masterson, J., Wynne, M., Kuster, J., and Stierwalt, J. (1999).
ticulation and Phonology Evaluation System (CAPES) New and emerging technologies: Going where we’ve never
[computer software]. San Antonio, TX: Psychological Cor- gone before. ASHA, 41, 16–20.
poration. Matesich, J., Porch, B., and Katz, R. (1996). PICApad PC
Masterson, J., Long, S., and Buder, E. (1998). Instrumentation [Computer software]. Scottsdale, AZ: Sunset Software.
in clinical phonology. In J. Bernthal and N. Bankson Miller, J., Freiberg, C., Rolland, M.-B., and Reeves, M.
(Eds.), Articulation and phonological disorders (4th ed., (1992). Implementing computerized language sample anal-
pp. 378–406). Englewood Cli¤s, NJ: Prentice-Hall. ysis in the public school. C. Dollaghan (Ed.), Topics in lan-
Masterson, J. J., and Oller, D. K. (1999). Use of technology in guage disorders. Rockville, MD: Aspen Press.
phonological assessment: Evaluation of early meaningful Miller, J. F., and Chapman, R. S. (1998). Systematic analysis
speech and prelinguistic vocalizations. Seminars in Speech of language transcripts (SALT) (Version 4.0, MS-DOS)
and Language, 20, 133–148. [Computer program]. Madison, WI: Language Analysis
Masterson, J. J., and Rvachew, S. (1999). Use of technology in Laboratory, Waisman Center on Mental Retardation and
phonological intervention. Seminars in Speech and Lan- Human Development, University of Wisconsin.
guage, 20, 233–250. Nelson, L., and Masterson, J. (1999). Using microcomputer
McRay, L. B., and Fitch, J. L. (1996). A survey of com- technology to advance assessment and intervention for
puter use of public school speech-language pathologists. children with language disorders. Topics in Language Dis-
Language, Speech, and Hearing Services in Schools, 27, 40– orders, 19, 68–86.
47. Rvachew, S. (1994). Speech perception training can facilitate
Wetzel, K. (1996). Speech-recognizing computers: A written- sound production learning. Journal of Speech and Hearing
communication tool for students with learning disabilities? Research, 37, 347–357.
Journal of Learning Disabilities, 29, 371–380. Tye-Murray, N. (1992). Laser videodisc technology in the
Wood, L. A., and Masterson, J. J. (1999). The use of technol- aural rehabilitation setting: Good news for people with se-
ogy to facilitate language skills in school-age children. vere and profound hearing impairments. American Journal
Seminars in Speech and Language, 20, 219–232. of Audiology, 1, 33–36.
Wilkinson, G. S. (1993). Wide Range Achievement Test 3
Further Readings (WRAT3) Scoring Program [Computer software]. Odessa,
FL: Psychological Assessment Resources.
American Guidance Service. (1997). PPVT-III ASSIST [Com- Woodcock, R. W. (1998). Woodcock Scoring and Interpretive
puter software]. Circle Pines, MN: Author. Program [Computer software]. Itasca, IL: Riverside.
Speech and Language Issues in Children from Asian-Pacific Backgrounds 167
Speech and Language Issues in Table 1. Cultural Di¤erences Between Asian-Pacific

Americans and Western Groups
Children from Asian-Pacific
Eastern Tendencies Western Tendencies
Backgrounds
A person is not autonomous. A person is autonomous.
A person is part of society. A person is unique and
Asian-Pacific Americans originate from Pacific Asia individualistic.
or are descendants of Asian-Pacific island immigrants. A person needs to maintain A person makes rational
Numbering 10,477,000 in the United States, Asian- relationships and have choices.
constraints.
Pacific Americans are the fastest-growing segment of
A person is oriented toward A person is active in
the U.S. population, representing 3.8% of the nation’s harmony. decision making.
population and 10% of California’s population (Popu- A person is a partner in the A person is responsible
lation Reference Bureau, 2001). By the year 2020, Asian- community where people are for own actions and
American children in U.S. schools will total about 4.4 mutually responsible for takes the
million. behaviors and consequences. consequences.
The recent Asian influx represents a diverse group A person needs to be humble, A person is di¤erent,
from Southeast Asia, China, India, Pakistan, Malaysia, improve, and master skills. unique, and special.
Indonesia, and other Pacific Rim areas. In general, Pa- A person needs to endure A person needs to feel
cific Asia is divided into the following regions: East Asia hardships and persevere. good about self.
A person needs to self-reflect. A person needs to toot
(China, Taiwan, Japan, and Korea), Southeast Asia
own horn.
(Philippines, Vietnam, Cambodia, Laos, Malaysia, Sin-
gapore, Indonesia, Thailand), the Indian subcontinent,
or South Asia (India, Pakistan, Bangladesh, Sri Lanka),
and the Pacific islands (Polynesia, Micronesia, Mela-
nesia, New Zealand, and Australia). Asian-Pacific pop-
ulations speak many languages, and their English is and families encourage children to initiate and continue
influenced by various dialects and languages. a verbal interaction. However, socioeconomic and indi-
Asian-Pacific Americans are extremely diverse in all vidual di¤erences must always be considered.
aspects of life, including attitudes toward disability and
treatment, childrearing practices, languages, and culture. Languages. The hundreds of di¤erent languages and
The Asian-Pacific island cultures, however, have inter- dialects that are spoken in East and Southeast Asia and
acted with and influenced each other for many gen- the Pacific islands can be classified into five major fami-
erations, and therefore share many similarities. The lies: (1) Malayo-Polynesian (Austronesian), including
following information is presented to provide an under- Chamorro, Ilocano, and Tagalog; (2) Sino-Tibetan,
standing of Asian-Pacific Americans in order to assist including Thai, Yao, Mandarin, and Cantonese; (3)
speech-language pathologists and audiologists in pro- Austro-Asiatic, including Khmer, Vietnamese, and
viding services to these culturally and linguistically di- Hmong; (4) Papuan, including New Guinean; and (5)
verse populations. Recommended assessment procedures Altaic, including Japanese and Korean (Ma, 1985). Ad-
and intervention strategies are provided. ditionally, there are 15 major languages in India from
four language families, Indo-Aryan, Dravidian, Austro-
Attitudes Toward Disability and Treatment Methods. Asiatic, and Tibeto-Burman (Shekar and Hegde, 1995).
What constitutes a disability depends on the values of
the cultural group. In general, Eastern cultures may view Cultural Tendencies. Cultural tendencies of Asian-
a disabling condition as the result of wrongdoing of the Pacific Americans may be quite di¤erent from those of
individual’s ancestors, resulting in guilt and shame. Dis- individuals born and raised in a Western culture. Table 1
abilities may be explained by a variety of spiritual or provides a sampling of these di¤erences. However, cau-
cultural beliefs, such as an imbalance in inner forces, bad tion should be taken not to overgeneralize this informa-
wind, spoiled foods, gods, demons, or spirits, hot or cold tion in relation to a particular client or family.
forces, or fright. Some believe disability is caused by
karma (fate) or a curse. All over the world, people use Recommended Assessment Procedures
di¤erent methods to treat illnesses and diseases, includ- The following assessment guidelines are often referred to
ing consulting with priests, healers, herbalists, Qi-Gong as the RIOT (review, interview, observe, test) protocol
specialists, clansmen, shamans, elders, and physicians. (Cheng, 1995, 2002). They are adapted here for Asian-
Among the Hmong, for example, surgical intervention is Pacific American populations:
viewed as invasive and harmful. 1. Review all pertinent documents and background
information. Many Asian countries do not have medical
Childrearing Practices. Childrearing practices and records or cumulative school records. Oral reports are
expectations of children vary widely from culture to sometimes unreliable. A cultural informant or an inter-
culture (Westby, 1990; Van Kleeck, 1994). There are preter is generally needed to obtain this information
di¤erences in how parents respond to their children’s because of the lack of English language proficiency of
language, who interacts with children, and how parents the parents or guardians. Pregnancy and delivery records
168 Part II: Speech
might not have been kept, especially if the birth was a Lack of participation and lack of volunteering infor-
home birth or in a refugee camp. mation
2. Interview teachers, peers, family members, and Di¤erent nonverbal messages
other informants and work with them to collect data Embarrassment over praise
regarding the client and the home environment. The Di¤erent greeting rituals, which may appear impolite,
family can provide valuable information about the such as looking down when the teacher approaches
communicative competence of the client at home and in Use of Asian-language-influenced English, such as the
the community, as well as historical and comparative deletion of plural and past tense
data on the client’s language development. The clinician
These are just a few examples of the observed behav-
needs information regarding whether or not the client is
iors that may be misinterpreted. Asian-Pacific American
proficient in the home language. The family’s home lan-
children may be fluent in English but use the discourse
guage, its proficiency in di¤erent languages, the patterns
styles of their home culture, such as speaking softly to
of language usage, and the ways the family spends time
persons in authority, looking down or away, and avoid-
together are some areas for investigation. Interview
ing close physical contact. Surface analysis of linguistic
questions are available from multiple sources (Cheng,
and pragmatic functions is not su‰cient to determine the
1990, 1991, 2002; Langdon and Saenz, 1996). Questions
communicative competence of children and might even
should focus on obtaining information on how the client
misguide the decision-making process.
functions in his or her natural environment in relation to
Sociol and psychological di‰culties arise in the con-
age peers who have had the same or similar exposure to
flict of culture, language, and ideology between Asian
their home language or to English.
students, their parents, and the American educational
3. Observe the client over time in multiple contexts
system. These di‰culties can include the background of
with multiple communication partners. Observe inter-
traditions, religions, and histories of the Asian-Pacific
actions at school, both in and outside the classroom, and
population, problems of acculturation, the understand-
at home. This cognitive-ecological-functional model
ing of social rules, contrasting influences from home and
takes into account the fact that clients often behave dif-
the classroom, confusion regarding one’s sense of iden-
ferently in di¤erent settings (Cheng, 1991). Direct ob-
tity relating to culture, society, and family, the definition
servation of social behavior with multiple participants
of disability, and the implications of receiving special
allows the evaluator to observe the ways members of
education services.
di¤erent cultures view their environment and organize
Intervention activities and materials can be selected
their behavior within it.
based on the client’s family and cultural background,
4. Test the client using informal and dynamic assess-
using activities that are culturally and socially relevant.
ment procedures in both the school language and the
In addition to traditional intervention techniques of
home language. Use the portfolio approach by keeping
modeling and expansion, speech-language pathologists
records of the client’s performance over time. Interact
can include activities such as those discussed by Cheng
with the client, being sensitive to his or her needs to cre-
(1989).
ate meaning based on what is perceived as important,
Alternative strategies should be o¤ered when clients
the client’s frame of reference, and experiences.
or caregivers are reluctant to accept the treatment pro-
What clinicians learn from the assessment should be
gram recommended by the speech-language pathologist
integrated into their intervention strategies. Intervention
or audiologist. Inviting them to special classes or speech
should be constructed based on what is most productive
and language sessions is a useful way to provide the
for promoting communication and should incorporate
needed information. Seeking assistance from community
the client’s personal and cultural experiences. Salient and
leaders and social service providers may also be neces-
relevant features of the client’s culture should be high-
sary to convince the clients of the importance of ther-
lighted to enhance and empower the client.
apy or recommended programs. The clients or caregivers
There are many challenges professionals face in
may also be asked to talk with other Asian-Pacific
working with the Asian-Pacific American populations.
Americans who have experiences with treatment pro-
The discourse styles in e¤ect in American homes and
grams. Other individuals can be e¤ective in sharing their
schools may di¤er from those that are practiced in Asian
personal stories about their experiences with therapy.
classrooms. Clinicians need to be doubly careful and not
The clinician should be patient with the clients, letting
interpret these di¤erences as deficient, disordered, aber-
them think through a problem and waiting for them
rant, and undesirable. Behaviors that can be easily mis-
to make the decision to participate in the treatment
understood include the following (Cheng, 1999):
program.
Following are some suggestions to create an optimal
Delay or hesitation in response
language learning environment and to reduce di‰cult
Frequent topic shifts and poor topic maintenance
communication (Cheng, 1996):
Confused facial expressions, such as a frown signaling
concentration rather than displeasure Make no assumptions about what students know or do
Short responses not know.
Use of a soft-spoken voice Anticipate their needs and greatest challenges.
Taking few risks Expect frustration and possible misunderstanding.
Speech Assessment, Instrumental 169
Encourage students to join social activities such as Cheng, L. L. (1999). Sociocultural adjustment of Chinese-
student government, clubs, and organizations to in- American students. In C. C. Park and M. Chi (Eds.), Asian-
crease their exposure to di¤erent types of discourse, as American education. London: Bergin and Garvey.
language is a social tool and should be used for fulfill- Cheng, L. L. (2002). Asian and Pacific American cultures. In
D. E. Battle (Ed.), Communication disorders in multicultural
ing multiple social needs and requirements.
populations. Boston, MA: Butterworth-Heinemann.
Facilitate the transition into mainstream culture Langdon, H. W., and Saenz, T. I. (1996). Language Assessment
through such activities as role-playing (preparing and Intervention with Multicultural Students: A Guide for
scripts for commonly occurring activities, using cul- Speech-Language-Hearing Professionals. Oceanside, CA:
turally unique experiences as topics for discussions) Academic Communication Associates.
and conducting social/pragmatic activities (Cheng, Ma, L. J. (1985). Cultural diversity. In A. K. Dutt (Ed.),
1989), such as a birthday party and a Thanksgiving Southeast Asia: Realm of contrast, Boulder, CO: Westview
celebration. Press.
Nurture bicultural/multicultural identity. Introduce Population Reference Bureau. (2001). On-line. Washington,
multicultural elements not only in phonology, mor- DC: U.S. Bureau of the Census.
Shekar, C., and Hegde, M. N. (1995). India: Its people, culture,
phology, and syntax, but also in pragmatics, seman-
and languages. In L. L. Cheng (Ed.), Integrating language
tics, and ritualized patterns. and learning for inclusion. San Diego, CA: Singular Pub-
lishing Group.
Providing speech-language and hearing services to Van Kleeck, A. (1994). Potential bias in training parents as
Asian-Pacific Americans is challenging. Preassessment conversational partners with their children who have delays
information on the language, culture, and personal his- in language development. American Journal of Speech-
tory of the individual lays a solid foundation to further Language Pathology, 3, 67–68.
explore the client’s strengths and weaknesses. Assess- Westby, C. (1990). Ethnographic interviewing: Asking the right
ment procedures need to be guided by the general prin- questions to the right people in the right way. Journal of
ciples of being fair to the culture and nonbiased. The Childhood Communication Disorders, 13, 101–111.
results of assessment should take into consideration the
cultural and pragmatic variables of the individual. In- On-line Resources
tervention can be extremely rewarding when culturally http://www.krysstal.com/langfams.html
relevant and appropriate approaches are used. The goals http://www.travlang.com/languages
of intervention must include the enhancement of appro- http://www.zompist.com/lang8.html
priate language and communication behaviors, home
language, and literacy. Clinicians need to be creative and
sensitive in their intervention to provide comfortable,
productive, and enriching services for all clients. Some Speech Assessment, Instrumental
publishers that have developed materials for use with
Asian-Pacific American populations include Academic
Communications Associates (Oceanside, CA), Commu- The instrumental analysis of speech can be approached
nication Skill Builders (Tucson, AZ), Thinking Pub- through the three main stages of the speech chain: artic-
lications (Eau Claire, WI), Newbury House (Rowley, ulatory, acoustic, and auditory phonetics. This entry
MA). reviews the instrumentation used to assess the articu-
latory and acoustic phases. Auditory phonetic tech-
—Li-Rong Lilly Cheng niques are covered elsewhere in this volume.
Although speech planning in the brain (neuro-
phonetics) lies outside the traditional tripartite speech
References chain, the neurological aspects of both speech produc-
tion and perception can be studied through the use of
Cheng, L. L. (1989). Service delivery to Asian/Pacific LEP
children: A cross-cultural framework. Topics in Language brain imaging techniques. Speech articulation proper
Disorders, 9, 1–14. is deemed here to begin with the movement of muscles
Cheng, L. L. (1990). The identification of communicative dis- required to produce aerodynamic changes resulting in
orders in Asian-Pacific students. Journal of Child Commu- the flow of an airstream (see Laver, 1994; Ball and
nicative Disorders, 13, 113–119. Rahilly, 1999). Of course, muscle movements also occur
Cheng, L. L. (1991). Assessing Asian language performance: throughout articulation. This area has been investigated
Guidelines for evaluating LEP students. Oceanside, CA: using electromyography (EMG). In EMG, electrodes of
Academic Communication Associates. di¤erent types (surface, needle, and hooked wire) are
Cheng, L. L. (1995, July). The bilingual language-delayed child: used to gather data on electrical activity within target
Diagnosis and intervention with the older school-age bilingual
muscles, and these data are matched with a simulta-
child. Paper presented at the Israeli Speech and Hearing
Association International Symposium on Bilingualism, neously recorded speech signal (Stone, 1996; Gentil and
Haifa, Israel. Moore, 1997). In this way the timing of muscle activity
Cheng, L. L. (1996). Beyond bilingualism: A quest for com- in relation to di¤erent aspects of speech can be inves-
municative competence. Topics in Language Disorders, 16, tigated. This technique has been used to examine both
9–21. normal and disordered speech. Areas studied include
170 Part II: Speech
movements of the folds. This technology is often coupled

to a stroboscopic light source, as stroboscopic endoscopy
allows the viewer to see individual movements of the
folds (see Abberton and Fourcin, 1997). Endoscopy,
however, is invasive, and use of a rigid endoscope pre-
cludes normal speech. Indirect investigation of vocal fold
activity is undertaken with electroglottography (EGG),
also termed electrolaryngography (see Stone, 1996;
Abberton and Fourcin, 1997). This technique allows
vocal fold movement to be extrapolated from measuring
the varying electrical resistance across the larynx. Both
approaches have been used in the investigation of nor-
mal and disordered voice.
Velic action and associated di¤erences in oral and
nasal airflow (and hence in nasal resonance) can also be
measured directly or indirectly. The velotrace is an in-
strument designed to indicate directly the height of the
velum (see Bell-Berti et al., 1993), while nasometric
devices of varying sophistication measure oral versus
nasal airflow (see Zajac and Yates, 1997). The velotrace
is invasive, as part of the device must be inserted into
the nasal cavity to sit on the roof of the velum. Nas-
ometers measure indirectly using, for example, two
external microphones to measure airflow di¤erences.
Figure 1. Averaged integrated EMG signals for the mentalis Figure 2 shows a trace from the Kay Elemetrics naso-
muscle (MENT), orbicularis oris inferior (OOI), orbicularis meter of hypernasal speech.
oris superior (OOS), anterior belly of the digastric (ABD), and The next step in the speech production chain is
the depressor labii inferior (DLI) for a patient with Frie- the articulation of sounds. Most important here is the
dreich’s ataxia uttering /epapap/. (Courtesy of Michèle Gentil.) placement of the individual articulators, and electro-
palatography (EPG) has proved to be a vital develop-
ment in this area of study. Hardcastle and Gibbon
the respiratory and laryngeal muscles, muscle groups in

the lips, tongue, and soft palate, and various disorders,
including disorders of voice and fluency, and certain
acquired neurological problems. Figure 1 shows EMG
traces from a patient with Friedreich’s ataxia.
Aerodynamic activity in speech is studied through
aerometry. A variety of devices have been used to
measure speech aerodynamics (Zajac and Yates, 1997).
Many systems have employed an airtight mask that is
placed over the subject’s face and attached to a pneu-
motachograph. The mask contains sensors to measure
pressure changes and airflow at the nose and mouth, and
generally also a microphone to record the speech signal,
against which the airflow can be plotted. If the focus of
attention is lung volume changes, then a plethysmograph
may be employed. This is an airtight box that houses the
subject, and any changes to the air pressure within the
box (caused by changes in the subject’s lung volume) are
recorded. A simpler plethysmograph (the respitrace; see
Stone, 1996) consists of a wire band placed around the
subject’s chest that measures changes in cross-sectional
area during inhalation and exhalation.
In normal pulmonic egressive speech, airflow from the
lungs passes through the larynx, where a variety of pho-
nation types may be implemented. The study of laryn-
geal activity (more particularly, of vocal fold activity) Figure 2. Trace adapted from a Kay Elemetrics nasometer
can be direct or indirect. In direct study, a rigid or flexi- showing normal and hypernasal versions of ‘‘eighteen, nine-
ble endoscope connected to a camera is used to view the teen, twenty.’’
Figure 3. Reading EPG3 system stylized palate diagram (left) showing misarticulated /s/ with wide channel; Kay Palatometer stylized
system palate diagram (right) showing target /s/ articulated at the postalveolar region.
(1997) describe this technique. A thin acrylic artificial magnetic fields are measured and recorded by computer.
palate is made to fit the subject. This palate has a large As with x-ray microbeam imaging, the tracked points
number of electrodes embedded in it (from 62 to 96, can be used to infer the shape and movements of articu-
depending on the system employed) to cover important lators within the vocal tract.
areas for speech (e.g., the alveolar region). When the The final imaging technique to be considered is mag-
tongue touches these electrodes, they fire, and the resul- netic resonance imaging (MRI). The imager surrounds a
tant tongue-palate contact patterns can be shown on a subject with electromagnets, creating an electromagnetic
computer screen. The electrodes are normally sampled field. This field causes hydrogen protons (abundant in
100 times per second, and the patterns are displayed in human tissue) to align but also to precess, or wobble. If a
real time. This allows the technique to be used both for brief radio pulse is introduced at the same frequency as
research and for feedback in therapy. EPG has been the precessing, the protons are moved out of alignment
used to study normal speech and a wide range of dis- and then back again. As they realign, they emit weak
ordered speech patterns. Figure 3 shows tongue-palate radio signals, which can be used to construct an image of
contact patterns in a stylized way for two di¤erent EPG the tissue involved. MRI can provide good images of the
systems. vocal tract but currently not at su‰cient frequency to
Other ways of examining articulation (and indeed a allow analysis of continuous speech. All of these imaging
whole range of speech-related activity) can be subsumed techniques have been used to study aspects of both nor-
under the overall heading of speech imaging (see Stone, mal and disordered speech. Figure 4 shows ultrasound
1996; Ball and Gröne, 1997). The oldest of these tech- diagrams for two vowels and two consonants.
niques is x-radiography. A variety of di¤erent x-ray tech- Acoustic analyses via sound spectrography are now
niques have been used in speech research, among them easily undertaken with a range of software programs on
videofluorography, which uses low doses of radiation personal computers as well as on dedicated hardware-
to give clear pictures of the vocal tract, and x-ray software configurations such as the Kay Elemetrics
microbeam imaging, in which the movements of pellets Sonagraph. Reliable analysis depends on good record-
attached to relevant points of the tongue and palate are ings (see Tatham and Morton, 1997). Spectrographic
tracked. Because of the dangers of radiation, alternative analysis packages currently allow users to analyze tem-
imaging techniques have been sought. Among these is poral, frequency, amplitude, and intensity aspects of a
ultrasound, which uses the time taken for sound waves speech signal (see Baken and Danilo¤, 1991; Farmer,
to bounce o¤ a structure and return to a receiver to map 1997). For example, a waveform displays amplitude
structures in the vocal tract. Because ultrasound waves versus time; a wideband spectrogram displays frequency
do not travel through the air, mapping of the tongue versus time using a wideband pass filter (around 200–
(from below) is possible, but mapping of tongue–palate 300 Hz), giving good time resolution but poor frequency
distances is not, as the palate cannot be mapped through resolution; a narrow-band spectrogram shows frequency
the air space of the oral cavity. Electromagnetic articu- versus time using a narrowband pass filter (around
lography (EMA) is another tracking technique. In this 29 Hz), which provides good frequency resolution but
technique the subject is placed within alternating mag- poor time resolution; and spectral envelopes show fre-
netic fields generated by transmitter coils in a helmet quency versus intensity at a point in time (produced either
assembly. Small receiver coils are placed at articulatorily by fast Fourier transform or linear predictive coding).
important sites (e.g., tongue tip, tongue body). The Speech analysis research has generally concentrated on
movements of the receiver coils through the alternating wideband spectrograms and spectral envelopes. These
172 Part II: Speech
Figure 4. Ultrasound images of two vowels and two consonants. (Courtesy of Maureen Stone.)
Figure 5. Wideband spectrogram of a disfluent speaker producing ‘‘(provin)cial t(owns).’’ (Courtesy of Joan Rahilly.)
both show formant frequencies (bands of high intensity Baken, R., and Orliko¤, R. (2000). Clinical measurement of
at certain frequency levels), which are useful in the speech and voice (2nd ed.). San Diego, CA: Singular Pub-
identification of and discrimination between vowels and lishing Group.
other sonorants. Fricatives are distinguishable from the Ball, M. J., Gracco, V., and Stone, M. (2001). Imaging tech-
niques for the investigation of normal and disordered speech
boundaries of the broad areas of frequency seen clearly
production. Advances in Speech-Language Pathology, 3,
on a spectrogram, while plosives can be noted from the 13–24.
lack of acoustic activity during the closure stage and the Ball, M. J., and Lowry, O. (2001). Methods in clinical phonet-
coarticulatory e¤ects on the formants of neighboring ics. London: Whurr.
sounds. Segment duration can easily be measured from Borden, G., Harris, K., and Raphael, L. (1994). Speech science
spectrograms in modern analysis packages. Various primer: Physiology, acoustics and perception of speech (3rd
pitch extraction algorithms are provided for the investi- ed.). Philadelphia: Lippincott Williams and Wilkins.
gation of intonation. Farmer (1997) provides an exten- Dagenais, P. (1995). Electropalatography in the treatment of
sive review of acoustic analysis work in a range of articulation/phonological disorders. Journal of Communica-
disorders: voice, fluency, aphasia, apraxia and dysarth- tion Disorders, 28, 303–329.
Gentil, M. (1990). EMG analysis of speech production of
ria, child speech disorders, and the speech of the hearing-
patients with Friedreich disease. Clinical Linguistics and
impaired. Figure 5 shows a wideband spectrogram of Phonetics, 4, 107–120.
disfluent speech. Hardcastle, W. (1976). Physiology of speech production. Lon-
—Martin J. Ball don: Academic Press.
Hardcastle, W., Gibbon, F., and Nicolaidis, K. (1991). EPG
data reduction methods and their implications for studies of
lingual coarticulation. Journal of Phonetics, 19, 251–266.
References Hardcastle, W., and Laver, J. (Eds.). (1997). The handbook of
Abberton, E., and Fourcin, A. (1997). Electrolaryngography. phonetic sciences. Oxford, UK: Blackwell.
In M. J. Ball and C. Code (Eds.), Instrumental clinical pho- Hirano, M., and Bless, D. (1993). Videostroboscopy of the
netics (pp. 119–148). London: Whurr. larynx. London: Whurr.
Baken, R., and Danilo¤, R. (1991). Readings in clinical spec- Kent, R. D., Atal, B., and Miller, J. (Eds.). (1991). Papers in
trography of speech. San Diego, CA: Singular Publishing speech communication: Speech production. Woodbury, NY:
Group. Acoustical Society of America.
Ball, M. J., and Gröne, B. (1997). Imaging techniques. In M. J. Kent, R. D., and Read, C. (2002). The acoustic analysis of
Ball and C. Code (Eds.), Instrumental clinical phonetics, speech. San Diego, CA: Singular Publishing Group.
(pp. 194–227). London: Whurr. Lass, N. (Ed.). (1996). Principles of experimental phonetics. St.
Ball, M. J., and Rahilly, J. (1999). Phonetics: The science of Louis: Mosby.
speech. London: Arnold. Lieberman, A. (1996). Speech: A special code. Cambridge,
Bell-Berti, F., Krakow, R., Ross, D., and Horiguchi, S. (1993). MA: MIT Press.
The rise and fall of the soft palate: The velotrace. Journal of Masaki, S., Tiede, M., Honda, K., Shimada, Y., Fujimoto, I.,
the Acoustical Society of America, 93, 2416A. Nakamura, Y., et al. (1999). MRI-based speech production
Farmer, A. (1977). Spectrography. In M. J. Ball and C. Code study using a synchronized sampling method. Journal of the
(Eds.), Instrumental clinical phonetics (pp. 22–63). London: Acoustical Society of Japan, 20, 375–379.
Whurr. Miller, J., Kent, R. D., and Atal, B. (Eds.). (1991). Papers in
Gentil, M., and Moore, W. H., Jr. (1997). Electromyography. speech communication: Speech perception. Woodbury, NY:
In M. J. Ball and C. Code (Eds.), Instrumental clinical pho- Acoustical Society of America.
netics (pp. 64–86). London: Whurr. Perkell, J., Cohen, M., Svirsky, M., Matthies, M., Garabieta,
Hardcastle, W., and Gibbon, F. (1997). Electropalatography I., and Jackson, M. (1992). Electro-magnetic midsagittal
and its clinical applications. In M. J. Ball and C. Code articulometer (EMMA) systems for transducing speech
(Eds.), Instrumental clinical phonetics (pp. 149–193). Lon- articulatory movements. Journal of the Acoustical Society of
don: Whurr. America, 92, 3078–3096.
Laver, J. (1994). Principles of phonetics. Cambridge, UK: Rothenburg, M. (1977). Measurement of airflow in speech.
Cambridge University Press. Journal of Speech and Hearing Research, 20, 155–176.
Stone, M. (1996). Instrumentation for the study of speech Rothenburg, M. (1992). A multichannel electroglottograph.
physiology. In N. J. Lass (Ed.), Principles of experimental Journal of Voice, 6, 36–43.
phonetics (pp. 495–524). St. Louis: Mosby. Speaks, C. (1999). Introduction to sound (3rd ed.). San Diego,
Tatham, M., and Morton, K. (1997). Recording and displaying CA: Singular Publishing Group.
speech. In M. J. Ball and C. Code (Eds.), Instrumental Stevens, K. (1998). Acoustic phonetics. Cambridge, MA: MIT
clinical phonetics (pp. 1–21). London: Whurr. Press.
Zajac, D., and Yates, C. (1997). Speech aerodynamics. In M. J. Stone, M. (1990). A three-dimensional model of tongue move-
Ball and C. Code (Eds.), Instrumental clinical phonetics ment based on ultrasound and x-ray microbeam data.
(pp. 87–118). London: Whurr. Journal of the Acoustical Society of America, 87, 2207–2217.
Stone, M., and Lundberg, A. (1996). Three-dimensional
tongue surface shapes of English consonants and vowels.
Further Readings Journal of the Acoustical Society of America, 99, 1–10.
Westbury, J. (1991). The significance and measurement of head
Atal, B., Miller, J., and Kent, R. D. (Eds.). (1991). Papers in position during speech production experiments using the
speech communication: Speech processing. Woodbury, NY: x-ray microbeam system. Journal of the Acoustical Society
Acoustical Society of America. of America, 89, 1782–1791.
174 Part II: Speech
Speech Assessment in Children: guage. In linguistic terminology, these gaps would be

characterized as a type of phonotactic constraint (Dinn-
Descriptive Linguistic Methods sen, 1984).
Descriptive linguistic methods have long been used in

The Distribution of Sounds. Distribution refers to
the analysis of fully developed primary languages.
where sounds (phones or phonemes) occur in words
These same methods are also well-suited to the study
and is determined by examining context. For children,
of language development, particularly the analysis of
sounds may be used in all word positions, initial, inter-
children’s speech sound systems. Descriptive methods
vocalic, and final, or they may be limited to certain
are a preferred analytic tool because they are designed to
contexts. In development, obstruent stops commonly
gather evidence that reveals the hallmark and defining
occur word-initially but not postvocalically; whereas fri-
characteristics of a sound system, independent of theo-
catives and liquids commonly occur postvocalically but
retical orientation, age, or population of study. The
not word-initially (Smith, 1973). As with the phonemic
defining properties of descriptive linguistic analyses of
inventory, restrictions on the distribution of sounds cor-
children’s sound systems are discussed in this article.
respond to markedness, with children having a tendency
toward unmarked as opposed to marked structure.
The Phonetic Inventory. A phonetic inventory com-
prises all sounds produced or used by a child, regardless
of whether those sounds are correct relative to the in- Rule-Governed Alternations. Asymmetries in the distri-
tended (adult) target. In the acquisition literature, the bution of sounds may be further indicative of systematic
conventional criterion for determining the phonetic rule-governed alternations in sound production (Ken-
status of sounds is a two-time occurrence independent stowicz, 1994). Rule-governed alternations occur when
of the target or context; that is, any sound produced morphologically related words are produced in di¤erent
twice is included in a child’s phonetic repertoire (Stoel- ways, for example, ‘‘electric’’ but ‘‘electricity.’’ Alter-
Gammon, 1985). Children’s phonetic inventories reflect nations are typically sampled by adding either a prefix or
the range of individual variability expected in develop- su‰x to a base word in order to change the context in
ment. As such, complementary methods have been which a sound occurs. There are two general types of
designed to further depict developmental variation, rule-governed change: allophonic variation and neutral-
including the phone tree methodology (Ferguson and ization. Allophonic variation occurs when a single pho-
Farwell, 1975) and the typology of phonetic complexity neme has multiple corresponding phonetic outputs that
(Dinnsen, 1992). For children with speech sound dis- vary by context. An example is /t/ produced as aspirated
orders, the phonetic inventory may be quite large despite in word-initial position ‘‘tap,’’ as flap in intervocalic po-
errors of production, and may consist of sounds that do sition ‘‘bitter,’’ and as unreleased in word-final position
not occur in the ambient language. ‘‘it.’’ In each case, the target sound is /t/, but the pho-
netic characteristics of the output di¤er predictably by
The Phonemic Inventory. Phonemes are used to signal word position. Thus, there is a one-to-many mapping
meaning di¤erences in a language. Phonemes are con- between phoneme and phones in allophonic variation.
ventionally determined by the occurrence of minimal Neutralization occurs when two or more phonemes are
pairs. A minimal pair is defined as two words identical merged into one phonetic output in a well-defined con-
except for one sound, for example ‘‘pat’’ and ‘‘bat’’ or text. An example is /t/ and /d/ both produced as flap
‘‘cap’’ and ‘‘cab.’’ Here, the consonants /p/ and /b/ are in intervocalic position ‘‘writer’’ and ‘‘rider.’’ In neu-
the only point of di¤erence in each pair of words; there- tralization, the contrast between phonemes is no longer
fore, these would be said to function as phonemes in the apparent at the phonetic (surface) level. Consequently,
di¤erentiation of meaning. For children, the phonemic there is a many-to-one mapping between phonemes and
inventory is generally smaller than the phonetic inven- phone. In children, the emergence of target-appropriate
tory (Gierut, Simmerman, and Neumann, 1994). Gaps morphophonemics occurs later in language development.
in the phonemic repertoire often a¤ect the sound classes For children with speech sound disorders, nontarget
of fricatives, a¤ricates, and liquids. From a linguistic allophonic variation and neutralization have been
perspective, the nonoccurrence of these sound classes observed and parallel the rules of fully developed lan-
in children’s speech parallels markedness. Markedness guages of the world (Camarata and Gandour, 1984).
defines lawful relationships among sound categories that Together, these four properties define the most basic
have been found to hold universally across languages elements of a sound system at a segmental level of
of the world. One type of markedness is implicational structure. In addition to examining these properties, de-
in nature, such that the occurrence of property X in a scriptive linguistic methods may evaluate prosodic levels
language implies property Y, but not vice versa. The of structure by examining units larger than the sound,
implying property X is taken to be marked, and is pre- such as permissible syllable types and combinations and
sumably more di‰cult to acquire, whereas the implied the overlay of primary and secondary stress on these in
property Y is unmarked and predictably easier to learn. the formation of words and phrases (Lleó and Prinz,
In development, then, phonemic gaps in the inventory 1996; Kehoe and Stoel-Gammon, 1997). As with seg-
correspond to more marked (di‰cult) structures of lan- mental structure, children typically use unmarked pro-
Speech Assessment in Children: Descriptive Linguistic Methods 175
sodic structure, with preferences for open syllables and variability within and across children’s sound systems
trochaic (strong-weak) stress assignment. within and across points in time (Chomsky, 1999). These
For children with speech sound disorders, there are challenges have been handled in di¤erent ways by dif-
other methods of analysis that may be relevant to a ferent linguistic theories, but at the core, they have
comprehensive characterization of the sound system served to outline a well-defined set of research issues
(Fey, 1992; see speech sound disorders in children: about children’s speech sound development that as of yet
description and classification). Relational analyses remain unresolved. Central questions bear on the nature
establish a one-to-one correspondence between a child’s of children’s mental (internal) representation of sound,
errored outputs and intended target sounds. These anal- the relationship between perception and production in
yses are intended to capture the patterns of a child’s speech sound development, and the contribution of
errors, and to descriptively label these patterns as innateness and maturation to language acquisition.
phonological processes. Four main categories of pho-
nological processes characterize children’s commonly —Judith A. Gierut
occurring developmental errors (Ingram, 1989). These
categories are substitution processes, involving di¤erent References
manners or places of production than the target; syllable Anderson, S. R. (1981). Why phonology isn’t natural. Linguis-
shape processes, involving di¤erent canonical (conso- tic Inquiry, 12, 493–539.
nant-vowel) shapes than the target; assimilatory pro- Camarata, S. M., and Gandour, J. (1984). On describing idio-
cesses, involving sounds produced more alike in a word syncratic phonologic systems. Journal of Speech and Hear-
than in the target; and other processes, such as reversals ing Disorders, 49, 262–266.
in the sequencing of sounds or articulatory di¤erences in Chomsky, N. (1999). On the nature, use, and acquisition of
sound production such as lisping. Children with speech child language. In W. Ritchie and T. Bhatia (Eds.), Hand-
sound errors are likely to use other unusual phonological book of child language acquisition (pp. 33–54). New York:
processes and to persist in their use of these processes for Academic Press.
Dinnsen, D. A. (1984). Methods and empirical issues in ana-
longer durations than are typical (Leonard, 1992). lyzing functional misarticulation. In M. Elbert, D. A.
Supplementary clinical methods have also been de- Dinnsen, and G. Weismer (Eds.), Phonological theory and
signed to evaluate perceptual or metalinguistic skills, the misarticulating child (pp. 5–17). Rockville, MD: Amer-
as these skills may a¤ect a child’s knowledge of the am- ican Speech-Language-Hearing Association.
bient sound system. The Speech Production-Perception Dinnsen, D. A. (1992). Variation in developing and fully
Task is one clinical technique that establishes a child’s developed phonologies. In C. A. Ferguson, L. Menn, and
ability to perceptually di¤erentiate target sounds from C. Stoel-Gammon (Eds.), Phonological development: Mod-
their corresponding substitutes (Locke, 1980). Other els, research, implications (pp. 191–210). Timonium, MD:
metalinguistic procedures employ categorization tasks York Press.
that evaluate a child’s judgment of the similarity of tar- Ferguson, C. A., and Farwell, C. B. (1975). Words and sounds
in early language acquisition: English initial consonants in
get sounds and their substitutes (Klein, Lederer, and the first fifty words. Language, 51, 419–439.
Cortese, 1991). Although these methods may have clini- Fey, M. E. (1992). Articulation and phonology: Inextricable
cal utility in isolating the source of breakdown and in constructs in speech pathology. Language, Speech and
designing appropriate intervention for a child’s speech Hearing Services in Schools, 23, 225–232.
disorder, they are considered external (not primary) evi- Gierut, J. A., Simmerman, C. L., and Neumann, H. J. (1994).
dence in conventional linguistic analyses of sound sys- Phonemic structures of delayed phonological systems.
tems (Anderson, 1981), because these skills lie outside Journal of Child Language, 21, 291–316.
the domain of phonology in particular and language in Goldsmith, J. A. (Ed.). (1995). The handbook of phonological
general. theory. Cambridge, MA: Blackwell.
Finally, one of the most central aspects of a descrip- Ingram, D. (1989). Phonological disability in children. London:
Cole and Whurr.
tive linguistic analysis of a sound system is the interpre- Kehoe, M., and Stoel-Gammon, C. (1997). The acquisition of
tation or theoretical account of the data. A number of prosodic structure: An investigation of current accounts of
theories have been advanced to account for the funda- children’s prosodic development. Language, 73, 113–144.
mental properties of sound systems. Each relies on a Kenstowicz, M. (1994). Phonology in generative grammar.
unique set of assumptions about the structure, function, Cambridge, MA: Blackwell.
and organization of sounds in a speaker’s mental lexi- Klein, H. B., Lederer, S. H., and Cortese, E. E. (1991). Chil-
con. Among the most recognized frameworks are linear dren’s knowledge of auditory/articulatory correspondences:
phonology, including standard generative and natural Phonologic and metaphonologic. Journal of Speech and
frameworks; nonlinear phonology, including autoseg- Hearing Research, 34, 559–564.
mental, metrical, underspecification, and feature geome- Leonard, L. B. (1992). Models of phonological development
and children with phonological disorders. In C. A. Fergu-
try frameworks (Goldsmith, 1995); and, most recently, son, L. Menn, and C. Stoel-Gammon (Eds.), Phonological
optimality theory (Prince and Smolensky, 1997). Any development: Models, research, implications (pp. 495–508).
formal theory of language must account for the facts of Timonium, MD: York Press.
acquisition, including those pertinent to children with Lleó, C., and Prinz, M. (1996). Consonant clusters in child
speech sound disorders. Acquisition data present unique phonology and the directionality of syllable structure as-
challenges for linguistic theory because of the inherent signment. Journal of Child Language, 23, 31–56.
176 Part II: Speech
Locke, J. L. (1980). The inference of speech perception in the Locke, J. L. (1983). Phonological acquisition and change. New
phonologically disordered child: Part II. Some clinically York: Academic Press.
novel procedures, their use, some findings. Journal of Macken, M. A. (1980). The child’s lexical representation: The
Speech and Hearing Disorders, 45, 445–468. ‘‘puzzle-puddle-pickle’’ evidence. Journal of Linguistics, 16,
Prince, A., and Smolensky, P. (1997). Optimality: From neural 1–17.
networks to universal grammar. Science, 275, 1604–1610. McGregor, K. K., and Schwartz, R. G. (1992). Converging
Smith, N. V. (1973). The acquisition of phonology: A case evidence for underlying phonological representations in a
study. Cambridge, UK: Cambridge University Press. child who misarticulates. Journal of Speech and Hearing
Stoel-Gammon, C. (1985). Phonetic inventories, 15–24 Research, 35, 596–603.
months: A longitudinal study. Journal of Speech and Hear- Schwartz, R. G. (1992). Clinical applications of recent ad-
ing Research, 28, 505–512. vances in phonological theory. Language, Speech and Hear-
ing Services in Schools, 23, 269–276.
Further Readings Stemberger, J. P., and Stoel-Gammon, C. (1991). The under-
specification of coronals: Evidence from language acquisi-
Barlow, J. A. (Ed.). (2001). Clinical forum: Recent advances in tion and performance errors. In C. Paradis and J.-F. Prunet
phonological theory and treatment. Language, Speech, and (Eds.), Phonetics and phonology: Vol. 2: The Special Status
Hearing Services in Schools, 32, 225–297. of Coronals (pp. 181–199). San Diego, CA: Academic Press.
Barlow, J. A., and Gierut, J. A. (1999). Optimality theory in Velten, H. (1943). The growth of phonemic and lexical patterns
phonological acquisition. Journal of Speech, Language, and in infant speech. Language, 19, 281–292.
Hearing Research, 42, 1482–1498. Vihman, M. M. (1996). Phonological development: The origins
Bernhardt, B. H., and Stemberger, J. P. (1998). Handbook of of language in the child. Cambridge, MA: Blackwell.
phonological development from the perspective of constraint-
based non-linear phonology. San Diego, CA: Academic
Press. Speech Development in Infants and
Bernhardt, B., and Stoel-Gammon, C. (1994). Nonlinear pho-
nology: Introduction and clinical application. Journal of Young Children with a Tracheostomy
Chin, S. B., and Dinnsen, D. A. (1991). Feature geometry in
disordered phonologies. Clinical Linguistics and Phonetics, A tracheostomy is a permanent opening of the trachea to
5, 329–337. outside air. It most often requires a surgical procedure
Donegan, P. J., and Stampe, D. (1979). The study of natural for closure. The primary reason for performing a surgi-
phonology. In D. A. Dinnsen (Ed.), Current approaches to cal tracheostomy is for long-term airway management
phonological theory (pp. 126–173). Bloomington: Indiana in cases of chronic upper airway obstruction or central
University Press. or obstructive sleep apnea, or to provide long-term me-
Edwards, M. L., and Shriberg, L. D. (1983). Phonology: chanical ventilatory support. The use of assisted ventila-
Applications in communicative disorders. San Diego, CA: tion for more than 1 month in the first year of life has
College-Hill Press. been considered to constitute a chronic tracheostomy
Gandour, J. (1981). The nondeviant nature of deviant phono-
logical systems. Journal of Communication Disorders, 14,
(Bleile, 1993). Most of the estimated 900–2,000 infants
11–29. and children per year who need a tracheostomy, a ven-
Gerken, L. (1996). Prosodic structure in young children’s lan- tilator, or both for a month or more are, in fact, less than
guage production. Language, 72, 683–712. a year old (Singer et al., 1989). Although the mortality
Gierut, J. A. (1999). Syllable onsets: Clusters and adjuncts associated with a chronic tracheostomy in young chil-
in acquisition. Journal of Speech, Language, and Hearing dren is twice that in adults, the procedure is invaluable
Research, 42, 708–726. for acute and long-term airway management (Fry et al.,
Gierut, J. A., Elbert, M., and Dinnsen, D. A. (1987). A func- 1985).
tional analysis of phonological knowledge and generaliza- When a tracheostomy or mechanical ventilation is
tion learning in misarticulating children. Journal of Speech used over a long period, the impact on communication
Goad, H., and Ingram, D. (1987). Individual variation and its
and feeding behavior can be significant. Oral communi-
relevance to a theory of phonological acquisition. Journal of cation in children occurs in tandem with growth and
Child Language, 14, 419–432. maturation of the structures of the speech apparatus.
Grunwell, P. (1985). Phonological assessment of child speech. Neuromuscular or biomechanical di‰culties resulting
Windsor, UK: NFER-Nelson. from altered patterns of growth or structural problems
Haas, W. (1963). Phonological analysis of a case of dyslalia. can negatively a¤ect the development of oral commu-
Journal of Speech and Hearing Disorders, 28, 239–246. nication. In particular, the respiratory, laryngeal, and
Ingram, D. (1981). Procedures for the phonological analysis of articulatory subsystems of the speech apparatus are at
children’s language. Baltimore: University Park Press. risk for pathological changes a¤ecting the development
Jakobson, R. (1968). Child language, aphasia and phonological of speech.
universals. The Hague, Netherlands: Mouton.
Kent, R. D., and Ball, M. J. (Eds.). (1997). The new phonolo-
The respiratory subsystem of the speech apparatus
gies: Developments in clinical linguistics. San Diego, CA: comprises the lower airways, rib cage, diaphragm, and
Singular Publishing Group. abdominal structures. The lower airways consist of the
Kiparsky, P., and Menn, L. (1977). On the acquisition of pho- trachea, the right and left mainstem bronchi, and the
nology. In J. MacNamara (Ed.), Language Learning and lungs. The tracheobronchial tree is much smaller in
Thought (pp. 47–78). New York: Academic Press. children than in adults, and di¤ers in shape and bio-
Speech Development in Infants and Young Children with a Tracheostomy 177
mechanics as well. The trachea in young children has the appropriate tracheostomy tube. Driver (2000) has
been described as the size of a soda straw, and is highly compiled a list of the critical factors in tracheostomy
malleable (Fry et al., 1985). The infant tracheal diameter tube selection. These factors include the child’s respira-
is approximately 0.5 cm, whereas adult tracheal diame- tory requirements, age and weight, tracheal diameter,
ters are 1.5–2.5 cm. C-shaped cartilage rings joined by distance from the tracheal opening to the carina, and
connective tissue help to keep the trachea from collaps- anatomical features of the neck for selection of a neck
ing against the flow of air during breathing. Because plate or flange. In addition, decisions must be made re-
the membranes of the infant trachea are soft and frag- garding whether or not there should be an inner cannula,
ile, there is a risk of tracheal compromise secondary the flexibility of the cannula, whether or not there should
to a tracheostomy. Complications may include reactive be a cu¤ (an air-inflatable outer bladder used to create a
granulation at the site of cannula, edema and scaring, seal against the outer wall of the tracheal tube and tra-
chronic irritation of the tracheal lumen, and tracheal chea), and what external adapters might be used (Driver,
collapse as a result of increased negative pressure pulling 2000).
air through a compromised structure (Fry et al., 1985). Tracheostomy tubes are selected primarily on the ba-
During the first several years of life, significant sis of the ventilatory needs of the infant or young child.
changes occur in the structure and mechanics of the Tracheostomy tubes will be larger in diameter and may
respiratory system. The airways increase in radius and have a cu¤ in the event the child needs high ventilator
length and the lungs increase in size and weight. The pressures with frequent suctioning. However, because
thoracic cavity enlarges and changes in shape, and over- of a young child’s susceptibility to trauma of the speech
all chest wall compliance decreases with upright pos- apparatus, it is optimal to have a smaller-diameter, flex-
ture. Airway resistance decreases, and pleural pressure ible tube without a cu¤. When a smaller tube is selected,
becomes more subatmospheric (Beckerman, Brouillette, air leakage around the tube and through the upper air-
and Hunt, 1992). Tidal volume, inspiratory capacity, vi- way will be available to the infant for voicing.
tal capacity, and minute ventilation increase with age. Many pediatric upper airway management problems
Besides providing phonation, the larynx, along with can be successfully addressed with a tracheostomy alone.
the epiglottis and soft palate, protects the lower airway. However, chronic respiratory failure will require some
The infant larynx is located high in the neck, close to the form of mechanical ventilation. The type of mechanical
base of the tongue. The thyroid cartilage is located ventilation support required will depend on the type
directly below the hyoid bone, whereas the cricoid carti- of disorder, the degree of respiratory dependence, and
lage is the lowest part of the laryngeal structure. Because whether the child will ultimately be weaned from ven-
of its location and size, the infant larynx, like the tra- tilatory support. There are two types of mechanical
chea, is susceptible to trauma during airway manage- ventilation systems commonly used with children. Neg-
ment procedures. The laryngeal structures become less ative pressure ventilation is noninvasive and uses nega-
susceptible to injury as they change shape and descend tive (below atmospheric) pressure by exerting suction on
during the first year of life. the outside of the chest and abdomen. As a result, intra-
The articulatory subsystem, composed of the phar- thoracic pressure is reduced and induces airflow into the
ynx, mouth, and nose, also undergoes significant lungs. Expiration is accomplished by passive recoil of the
changes in growth and function during infancy and early lungs. Negative pressure ventilators work well with chil-
childhood. The pharynx plays a critical role in both res- dren who have relatively normal airways and compliant
piration and swallowing. The infant pharynx lacks a chest walls. Negative pressure ventilators are associated
rigid framework and can collapse if external suction is with fewer complications than positive pressure ven-
applied within the airway. If the airway-maintaining tilators and do not require a tracheostomy (Splaingard
muscles are weak or paralyzed, normal negative pres- et al., 1983). However, they are cumbersome and are
sures associated with inspiratory e¤orts also can cause not adequate for children with severe respiratory disease
airway collapse at the level of the pharynx (Thach, or rigid chest walls (Driver, 2000). Positive pressure
1992). Movement of the pharyngeal walls, elevation of ventilation is invasive and applies positive (above atmo-
the soft palate, and elevation of posterior portion of the spheric) pressure to force air into the lungs via a venti-
tongue are important maneuvers for achieving velo- lator connected to a tracheostomy tube (for long-term
pharyngeal closure. The infant tongue is proportionately use). Expiration is accomplished by passive recoil of
larger in relation to mouth size than the adult’s; thus, the lungs. The primary advantages are the flexibility to
tongue retraction can cause upper airway blockage and individualize respiratory support and to deliver various
respiratory distress. Various craniofacial abnormalities concentrations of oxygen (Metz, 1993). There are two
may result in structural or neurological situations that major types of positive pressure ventilators, volume
require airway management interventions, including ventilators and pressure ventilators. In addition, ven-
tracheostomy. tilators are set in a mode (e.g., assist-control, synchron-
The decision to use long-term airway maintenance in ized intermittent mandatory ventilation) to deliver a
the form of a tracheostomy requires consideration of certain number of breaths per minute, based on the tidal
many factors. Even the type of incision can make a dif- volume and minute ventilation.
ference in overall outcome for the infant or young child Infants and young children with tracheostomies can
(Fry et al., 1985). Other information is needed to select become oral communicators if oral motor control is
178 Part II: Speech
su‰cient, the velopharynx is competent, the upper air- breathing probably due to increases in resistance on both
way is in reasonably good condition (i.e., there is no inspiration and expiration. Extra e¤ort from expiratory
significant vocal fold paresis or paralysis and no signifi- muscles during phonation also must be generated to
cant airway obstruction), the ability to deliver airflow force air around the tracheostomy tube to the vocal
and pressure to the vocal folds and supporting larngeal folds. Finally, young children may not be familiar with
structures is su‰cient, and chest wall muscular support coughing up secretions through the oral cavities and
for speech breathing is su‰cient (not a prerequisite for show distress until this skill is acquired (McGowan et al.,
ventilator-supported speech). If oral communication is 1993). Initially, the young child may be able to tolerate
possible, then the type of tracheostomy tube and the the speaking valve for only 5 minutes at a time. With
various valve configurations must be selected on the encouragement and appropriate reinforcement, the child
basis of both e¤ective airway management and oral will likely tolerate the speaking valve for increasing
communication criteria. Driver (2000) suggests that the amounts of time. When a speaking valve is placed in line
best results for oral communication are achieved if the with mechanical ventilation, various volume or pressure
smallest, simplest tracheostomy tube is selected. The size adjustments may be made to maximize the timing of
and nature (fenestrated vs. non-fenestrated) of the tra- phonation and the natural characteristics of the breath-
cheostomy tube also will a¤ect the e¤ort required to ing and speaking cycles. Only a few general guidelines
move gas across the airway (Hussey and Bishop, 1996). on mechanical ventilation and speech in infants and
Respiratory e¤ort to breathe will impact on the addi- young children have been published (e.g., Lohmeier and
tional e¤ort required to vocalize and speak. The most Boliek, 1999).
e‰cient tracheostomy tube is one that has flow charac- A chronic tracheostomy interferes with the devel-
teristics similar to those of the upper respiratory system opment of oral motor skills and experimentation with
for the maintenance of respiratory homeostasis, but with sound production by limiting movements of the jaw,
some trade-o¤ for air leakage necessary for vocalization tongue, and lips. In addition, long-term intubation may
(Mullins et al., 1993). When tracheostomy tubes have a result in a significantly high-vaulted palate and vocal
cu¤, extreme caution should be taken to ensure that cu¤ cord injury (Driver, 2000). Adequate breath support for
deflation is accomplished prior to any attempts to sup- speech also may be a¤ected because of neuromuscular
port oral communication. Tracheostomy tubes with cu¤s weakness, hypotonia, hypertonia, or paralysis. Conse-
are not recommended for infants and very young chil- quently, infants and young children may have one or
dren because of increased risk of tracheal wall trauma. several issues a¤ecting the speech mechanism. Infants
When a su‰cient air leak around the tracheostomy and young children who need mechanical ventilation
tube exists, then a unidirectional speaking valve can be may not vocalize until near the end of the first year of
attached to the hub of the tube. When the child inspires, life and may not be able to appropriately time their
air enters through a diaphragm that closes on expiration, vocalizations to ventilator cycle until well after 12
thus forcing air to exit through the upper airway. The months of age (Lohmeier and Boliek, 1999). The speech
same e¤ect can be accomplished by manually occluding characteristics of infants and children with tracheos-
the hub of the tracheostomy tube. Speaking valves can tomies, using speaking valves or manual occlusion, in-
be used with ventilator-assisted breathing as well. clude a smaller lung volume initiations, terminations,
Several factors should be considered when selecting a and excursions, fewer syllables per breath group, vari-
pediatric speaking valve. These factors include the type able chest wall configurations during vocalization
of diaphragm construction (bias open or bias closed), the including rib cage or abdomen paradoxical movements,
amount of resistance inherent in the valve type, and the breathy or pressed voice quality that reflects available
amount of air loss during vocalization and speech pro- airflow and tracheal pressures, intermittent voice stop-
duction. First, speaking valves can be either bias open pages, hypernasality, and poor intelligibility (Lohmeier
or bias closed at atmospheric pressure. A biased closed and Boliek, 1999). In addition, experimentation with
valve remains closed until negative air pressure is applied vocal play, feeding, and oral-motor exploration may be
during inspiration. In this case, the valve will open dur- limited during a sensitive period for speech and language
ing inspiration and close during expiration. A bias open acquisition. Therefore, all e¤orts should be made to
valve remains open and only closes during the expiratory support phonation and other communicative oppor-
phase of the breath cycle. A bias closed valve may re- tunities.
quire greater e¤ort to achieve airflow (Zajac, Fronataro- Only a handful of group and single case studies have
Clerici, and Roop, 1999). Di¤erences in resistance have assessed the developmental outcomes of speech and lan-
been found among valve types, especially during low guage following the long-term use of a tracheostomy or
flows (.450 liters/sec), however, all valves recently tested mechanical ventilation. These studies su¤er from prob-
have resistances in the range of nasal resistance reported lems such as sample heterogeneity and the prelinguistic
for normal adults. Whereas speaking valves have similar or linguistic status of the child at the time the inter-
resistances, bias-open valves consistently show air loss vention is performed, but they do suggest some general
during the rise in pressure associated with the /p/ con- trends and outcomes. Fairly obviously, these children
sonant (Zajac, Frontaro-Clerici, and Roop, 1999). are at risk for delay in speech and language development
Introducing a speaking valve to a young child can (Simon and Handler, 1981; Simon, Fowler, and Han-
be challenging. The valve changes the sensation of dler, 1983; Kaslon and Stein, 1985; Simon and Mc-
Speech Development in Infants and Young Children with a Tracheostomy 179
Gowan, 1989; Bleile and Miller, 1994). Major gains in presented at the Annual Meeting of the American Speech-
the development of speech and language can sometimes Language and Hearing Association, San Francisco, No-
be made during and after decannulation with total com- vember 1999.
munication intervention approaches. However, the data McGowan, J. S., Bleile, K. M., Fus, L., and Barnas, E. (1993).
Communication disorders. In K. M. Bleile (Ed.), The care
suggest that residual e¤ects of long-term tracheostomy
of children with long-term tracheostomies (pp. 113–137). San
can be measured long after decannulation. Most Diego, CA: Singular Publishing Group.
reported delays seem to be articulatory in nature; voice Metz, S. (1993). Ventilator assistance. In K. M. Bleile (Ed.),
and respiratory dysfunction are rarely reported in chil- The care of children with long-term tracheostomies (pp.
dren after decannulation (Singer, Wood, and Lambert, 41–55). San Diego, CA: Singular Publishing Group.
1985; Singer et al., 1989; Hill and Singer, 1990; Kamen Mullins, J. B., Templer, J. W., Kong, J., Davis, W. E., and
and Watson, 199l; Kertoy et al., 1999). Taken together, Hinson, J., Jr. (1993). Airway resistance and work of
these studies indicate possible residual e¤ects of long- breathing in tracheostomy tubes. Laryngoscope, 103,
term tracheostomy on the speech mechanism. These 1367–1372.
e¤ects appear unrelated to the time of intervention (i.e., Simon, B. M., Fowler, S. M., and Handler, S. D. (1983).
Communication development in young children with long-
prelinguistic or linguistic) but may be related to length
term tracheostomies: Preliminary report. International
of cannulation and the general constellation of medical Journal of Pediatric Otorhinolaryngology, 6, 37–50.
conditions associated with long-term tracheostomy use. Simon, B. M., and Handler, S. D. (1981). The speech patholo-
—Carol A. Boliek gist and management of children with tracheostomies.
Journal of Otolaryngology, 10, 440–448.
References Simon, B. M., and McGowan, J. (1989). Tracheostomy in
young children: Implications for assessment and treatment
Beckerman, R. C., Brouillette, R. T., and Hunt, C. E. (1992). of communication and feeding disorders. Infants and Young
Respiratory control disorders in infants and children. Balti- Children, 1, 1–9.
more: Williams and Wilkins. Singer, L. T., Kercsmar, C., Legris, G., Orlowski, J. P., Hill,
Bleile, K. M. (1993). Children with long-term tracheostomies. B., and Doershuk, C. (1989). Developmental sequelae of
In K. M. Bleile (Ed.), The care of children with long-term long-term infant tracheostomy. Developmental Medicine
tracheostomies (pp. 3–19). San Diego, CA: Singular Pub- and Child Neurology, 31, 224–230.
lishing Group. Singer, L. T., Wood, R., and Lambert, S. (1985). Devel-
Bleile, K. M., and Miller, S. A. (1994). Toddlers with medical opmental follow-up of long-term tracheostomy: A prelimi-
needs. In J. E. Bernthal and N. W. Bankson (Eds.), Child nary report. Developmental and Behavioral Pediatrics, 6,
phonology: Characteristics, assessment, and intervention with 132–136.
special populations (pp. 81–108). New York: Thieme. Splaingard, M. L., Frates, R. C., Jr., Je¤erson, L. S., Rosen,
Driver, L. E. (2000). Pediatric considerations. In D. C. Tippett C. L., and Harrison, D. M. (1983). Home negative pressure
(Ed.), Tracheostomy and ventilator dependency (pp. 193– ventilation: Report of 20 years of experience in patients
235). New York: Thieme. with neuromuscular disease. Archives of Physical Medicine
Fry, T. L., Jones, R. O., Fischer, N. D., and Pillsbury, H. C. and Rehabilitation, 66, 239–242.
(1985). Comparisons of tracheostomy incisions in a pediat- Thach, B. T. (1992). Neuromuscular control of the upper air-
ric model. Annals of Orolaryngology, Rhinology, and Lar- way. In R. C. Beckerman, R. T. Brouillette, and C. E. Hunt
yngology, 94, 450–453. (Eds.), Respiratory control disorders in infants and children
Hill, B. P., and Singer, L. T. (1990). Speech and language de- (pp. 47–60). Baltimore: Williams and Wilkins.
velopment after infant tracheostomy. Journal of Speech and Zajac, D. J., Fronataro-Clerici, L., and Roop, T. A. (1999).
Hearing Research, 55, 15–20. Aerodynamic characteristics of tracheostomy speaking
Hoit, J. D., Shea, S. A., and Banzett, R. B. (1994). Speech valves: An updated report. Journal of Speech, Language,
production during mechanical ventilation in tracheosto- and Hearing Research, 42, 92–100.
mized individuals. Journal of Speech and Hearing Research,
37, 53–63. Further Readings
Hussey, J. D., and Bishop, M. J. (1996). Pressures required to
move gas through the native airway in the presence of a Adamson, L. B., and Dunbar, B. (1991). Communication
fenestrated vs a nonfenestrated tracheostomy tube. Chest, development of young children with tracheostomies. Aug-
110, 494–497. mentative and Alternative Communication, 7, 275–283.
Kamen, R. S., and Watson, B. C. (1991). E¤ects of long- Ahman, E., and Lipski, K. (1991). Early intervention for tech-
term tracheostomy on spectral characteristics of vowel nology dependent infants and young children. Infants and
production. Journal of Speech and Hearing Research, 34, Young Children, 3, 67–77.
1057–1065. Brodsky, L., and Volk, M. (1993). The airway and swallowing.
Kaslon, K., and Stein, R. (1985). Chronic pediatric trache- In J. C. Arvedson and L. Brodsky (Eds.), Pediatric swal-
ostomy: Assessment and implication for habilitation of lowing and feeding assessment and management. San Diego,
voice, speech and language in young children. International CA: Singular Publishing Group.
Journal of Pediatric Otorhinolaryngology, 6, 37–50. Citta-Pietrolungo, T. J., Alexander, M. A., Cook, S. P., and
Kertoy, M. K., Guest, C. M., Quart, E., and Lieh-Lai, M. Padman, R. (1993). Complications of tracheostomy and
(1999). Speech and phonological characteristics of individ- decannulation in pediatric and young patients with trau-
ual children with a history of tracheostomy. Journal of matic brain injury. Archives of Physical Medicine and
Speech, Language, and Hearing Research, 42, 621–635. Rehabilitation, 74, 905–909.
Lohmeier, H. L., and Boliek, C. A. (1999). Infant respiration Cox, J. Z., and VanDeinse, S. D. (1997). Special needs of the
and speech production on a ventilator: A case study. Paper prelinguistic ventilator-assisted child. In L. E. Driver, V. S.
180 Part II: Speech
Nelson, and S. A. Warschausky (Eds.), The ventilator- phrase repetitions (‘‘I want—I want that’’), and revisions
assisted child: A practical resource guide. San Antonio, TX: (‘‘I want—I need that’’), which are relatively common
Communication Skill Builders. in the speech of normally developing children, repre-
Eliachar, I. (2000). Unaided speech in long-term tube free tra- sent normal aspects of the speaking process. These
cheostomy. Laryngoscope, 110, 749–760.
disfluencies arise when a speaker experiences an error in
Leder, S. B. (1994). Perceptual rankings of speech quality pro-
duced with one-way tracheostomy speaking valves. Journal language formulation or speech production or needs
of Speech and Hearing Research, 37, 1308–1312. more time to prepare a message. Other types of dis-
Leder, S. B. (1990). Verbal communication for the ventilator fluencies, which occur relatively infrequently in the
dependent patient: Voice intensity with the portex ‘‘Talk’’ speech of normally developing children, may be in-
tracheostomy tube. Laryngoscope, 100, 1116–1121. dicative of a developing stuttering disorder. These dis-
Leder, S. B., and Traquina, D. N. (1989). Voice intensity of fluencies, often called ‘‘atypical’’ disfluencies, ‘‘stuttered’’
patients using a communi-trach I cu¤ed speaking trache- disfluencies, or ‘‘stutter-like’’ disfluencies, include whole-
ostomy tube. Laryngoscope, 99, 744–747. word repetitions (‘‘I-I-I want that’’) and, particularly,
Lichtman, S. W., Birnbaum, I. L., Sanfilippo, M. R., Pellicone, fragmentations within a word unit, such as part-word
J. T., Damon, W. J., and King, M. L. (1995). E¤ect of a
repetitions (‘‘li-li-like this’’), sound prolongations (‘‘lllllike
tracheostomy speaking valve on secretions, arterial oxygen-
ation, and olfaction: A quantitative evaluation. Journal of this’’), and blocks (‘‘l—ike this’’) (Ambrose and Yairi,
Speech and Hearing Research, 38, 549–555. 1999).
Line, W. S., Jr., Hawkins, D. B., Kahlstrom, E. J., Mc- Stuttered speech disfluencies can also be accompanied
Laughlin, E., and Ensley, J. (1986). Tracheostomy in infants by a¤ective, behavioral, and cognitive reactions to the
and young children: The changing perspective. Laryngo- di‰culties with speech production. These reactions are
scope, 96, 510–515. distinct from the speech disfluencies themselves but are
Manley, S. B., Frank, E. M., Melvin, C. F. (1999). Preparation part of the overall stuttering disorder (Yaruss, 1998).
of speech-language pathologist to provide services to Examples of behavioral reactions, which rapidly become
patients with a tracheostomy tube: A survey. American incorporated into the child’s stuttering pattern, include
Journal of Speech-Language Pathology, 8, 171–180.
physical tension and struggle in the speech mechanism
McGowan, J. S., Kerwin, M. L., and Bleile, K. M. (1993).
Oral-motor and feeding problems. In K. M. Bleile (Ed.), as children attempt to control their speech. A¤ective
The care of children with long-term tracheostomies. San Di- and cognitive reactions include feelings of anxiety, em-
ego, CA: Singular Publishing Group. barrassment, and frustration. As stuttering continues,
Nash, M. (1988). Swallowing problems in the tracheosto- children may develop shame, low self-esteem, and avoid-
mized patient. Otolaryngology Clinics of North America, 21, ance of words, sounds, or speaking situations. These
701–709. negative reactions can lead to increased stuttering sever-
Ramsey, A. M., and Grady, E. A. (1997). Long-term airway ity and greatly exacerbate the child’s communication
management for the ventilator-assisted child. In L. E. problems.
Driver, V. S. Nelson, and S. A. Warschausky (Eds.), The
ventilator-assisted child: A practical resource guide. San
Etiology. Numerous theories about the etiology of
Antonio, TX: Communication Skill Builders.
Thompson-Henry, S., Braddock, B. (1995). The modified stuttering have been proposed (Bloodstein, 1993). His-
Evan’s blue dye procedure fails to detect aspiration in the torically, these theories tended to focus on single causes
tracheostomized patient: Five case reports. Dysphagia, 10, acting in isolation. Examples include psychological
172–174. explanations based on supposed neuroses, physiologi-
Tippett, D. C., and Siebens, A. A. (1995). Reconsidering cal explanations involving muscle spasms, neurological
the value of the modified Evan’s blue dye test: A comment explanations focusing on ticlike behaviors, and environ-
on Thompson-Henry and Braddock (1995). Dysphagia, 11, mental explanations suggesting that normal disfluency
78–81. was misidentified as stuttering. None of these theories
VanDeinse, S. D., and Cox, J. Z. (1997). Feeding and swal- has proved satisfactory, though, for the phenomenology
lowing issues in the ventilator-assisted child. In L. E.
of stuttering is complex and highly individualized. As
Driver, V. S. Nelson, and S. A. Warschausky (Eds.), The
ventilator-assisted child: A practical resource guide. San a result, current theories focus on multiple etiological
Antonio, TX: Communication Skill Builders. factors that interact in complex ways for di¤erent chil-
dren who stutter (e.g., Smith and Kelly, 1997). These
interactions involve not only genetic and environmental
factors, but also various aspects of the child’s overall
Speech Disfluency and Stuttering in development.
Children Pedigree and twin studies have shown a genetic
component to childhood stuttering—a family history of
stuttering can be identified for approximately 60%–70%
Childhood stuttering (also called developmental stutter- of children who stutter (Ambrose, Yairi, and Cox, 1993).
ing) is a communication disorder that is generally char- The precise nature of that genetic inheritance is not fully
acterized by interruptions, or speech disfluencies, in understood, however, and studies are currently under
the smooth forward flow of speech. Speech disfluencies way to evaluate di¤erent models of genetic transmission.
can take many forms, and not all are considered to be It is also likely that environmental factors, such as the
atypical. Disfluencies such as interjections (‘‘um’’, ‘‘er’’), model the child hears when learning to speak and the
Speech Disfluency and Stuttering in Children 181
demands placed on the child to speak quickly or pre- likely to experience recovery than boys. This is particu-
cisely, may play a role in determining whether stuttering larly true for girls who have other females in their family
will be expressed in a particular child (e.g., Starkweather with a history of recovery from stuttering (Ambrose,
and Givens-Ackerman, 1997). Yairi, and Cox, 1997).
There are several aspects of children’s overall devel-
opment that a¤ect children’s speech fluency and the Diagnosis and Assessment. The high rate of recovery
development of childhood stuttering. For example, chil- from early stuttering indicates a positive prognosis for
dren are more likely to be disfluent when producing many preschool children who stutter; however, it also
longer, more syntactically complex sentences (Yaruss, complicates the diagnostic process and makes it di‰cult
1999). It is not clear whether this increase is associated to evaluate the e‰cacy of early intervention. There is
with greater demands on the child’s language formula- general agreement among practitioners that it is best to
tion abilities or speech production abilities; however, it evaluate young children soon after the onset of stuttering
is likely that stuttering arises due to the interaction to estimate the likelihood of recovery. Often, however, it
between linguistic and motoric functions. As a group, is di‰cult to make this determination, and there is con-
children who stutter have been shown to exhibit lan- siderable disagreement about whether it is best to enroll
guage formulation and speech production abilities that children in treatment immediately or wait to see whether
are slightly lower than their typically fluent peers; they will recover without intervention (e.g., Bernstein
however, these di¤erences do not generally represent Ratner, 1997b; Curlee and Yairi, 1997).
clinically identifiable deficits in speech or language de- Based on the understanding that the etiology of
velopment (Bernstein Ratner, 1997a). Finally, tempera- childhood stuttering involves multiple interacting fac-
ment, and specifically the child’s sensitivity to stimuli in tors, the diagnostic assessment of a preschool child who
the environment as well as to speaking mistakes, has stutters involves evaluation of several aspects of the
been implicated as a factor contributing to the likelihood child’s speech, language, and overall development, as
that a child will react negatively to speech disfluencies well as selected aspects of the child’s environment. Spe-
(Conture, 2001). cifically, a complete diagnostic assessment includes the
following: (1) a detailed interview with parents or care-
Onset, Development, and Distribution. The onset of givers about factors such as family history of stuttering,
childhood stuttering typically occurs between the ages of the family’s reactions to the child’s speaking di‰culties,
2½ and 5, though later onset is sometimes reported. the child’s reactions to stuttering, the speech and lan-
Stuttering can develop gradually (with increasing fre- guage models the child is exposed to at home, and any
quency of disfluency and growing severity of individual other information about communication or other stres-
instances of disfluency), or it can appear relatively sud- sors the child may be experiencing, such as competition
denly (with the rapid development of more severe stut- for talking time with siblings; (2) assessment of the ob-
tering behaviors). Stuttering often begins during a period servable characteristics of the child’s fluency, including
of otherwise normal or possibly even advanced speech- the frequency, duration, and type of disfluencies and a
language development, although many children who rating of stuttering severity; (3) assessment of the child’s
stutter exhibit concomitant deficits in other aspects of speaking abilities, including an assessment of speech
speech and language development. For example, 30%– sound production/phonological development and oral-
40% of preschool children who stutter also exhibit a motor skills; (4) assessment of the child’s receptive and
disorder of speech sound production (articulation or expressive language development, including morpholog-
phonological disorder), though the exact nature of the ical structures, vocabulary, syntax, and pragmatic inter-
relationship between these communication disorders is action; and, increasingly, (5) assessment of the child’s
not clear (Yaruss, LaSalle, and Conture, 1998). temperament, including sensitivity to stimuli in the en-
The lifetime incidence of stuttering may be as high vironment and concerns about speaking di‰culties.
as 5%, although the prevalence is only approximately Together, these factors can be used to estimate the
1%, suggesting that the majority of young children who likelihood that the child will recover from early stut-
stutter—perhaps as many as 75%—recover from stuttering without intervention or whether treatment is
tering and develop normal speech fluency (Yairi and indicated. Although it is impossible to determine with
Ambrose, 1999). Children who recover typically do so certainty which children will recover from stuttering
within the first several months after onset; however, re- without intervention, some diagnostic signs that may in-
covery is also common within the first 2 years after onset dicate an increased risk of continued stuttering and the
(Yairi and Ambrose, 1999). After this time, natural or need for treatment include a family history of chronic
unaided recovery is less common, and children appear to stuttering, significant physical tension or struggle during
be significantly less likely to experience a complete re- stuttered or fluent speech, concomitant disorders of
covery if they have been stuttering for longer than 2 to 3 speech or language, and a high degree of concern about
years, or if they are still stuttering after approximately speaking di‰culties on the part of the child or the family
age 7 (Andrews and Harris, 1964). Boys are a¤ected (e.g., Yairi et al., 1996). Importantly, some practitioners
more frequently than girls: in adults, the male to female also recommend treatment even in cases where the esti-
ratio is approximately 4 or 5 to 1, though at onset, the mated risk for continued stuttering is low, either in an
ratio is closer to 2 to 1, suggesting that girls are more attempt to speed up the natural recovery process or to
182 Part II: Speech
help concerned parents reduce their worries about their modify individual instances of stuttering so they are
child’s fluency. less disruptive to communication (Ramig and Bennett,
1997). Other direct approaches include operant treat-
Treatment. As theories about stuttering have changed, ments based on reinforcing fluent speech in a hierarchy
so too have preferred treatment approaches, particularly of utterances of increasing syntactic complexity and
for older children and adults who stutter. At present, length (Bothe, 2002).
there are two primary approaches to treatment for A critical component of treatment for older children
young children who stutter, traditionally labeled ‘‘indi- who stutter, for whom complete recovery is less likely—
rect’’ and ‘‘direct’’ therapy. Indirect therapy is based and even for preschool children who are concerned
on the notion that children’s fluency is a¤ected by spe- about their speech or who have significant risk factors
cific characteristics of their speech, such as speaking rate, indicating a likelihood of continued stuttering (Logan
time allowed for pausing between words and phrases, and Yaruss, 1999)—is learning to accept stuttering and
and the length and complexity of utterances. Specifically, to minimize the impact of stuttering in daily activities.
it appears that children are less likely to stutter if they As children learn to cope with their stuttering, they are
speak more slowly, allow more time for pausing, or use less likely to develop the negative reactions that charac-
shorter, simpler utterances. If children can learn to use terize more advanced stuttering, so the disorder is less
these ‘‘fluency facilitating’’ strategies, they are more likely to become debilitating for them. In addition to
likely to be fluent and, presumably, less likely to develop pursuing treatment, many older children and families
a chronic stuttering disorder. of children who stutter also find meaningful support
A key assumption underlying the indirect treatment through self-help groups, and clinicians are increasingly
approach is that these parameters of children’s speech recommending support group participation for their
are influenced by the communication model of the peo- young clients and their families.
ple in the child’s environment. Thus, in indirect therapy,
clinicians teach parents and caregivers to use a slower Summary. Stuttering is a complex communication
rate of speech, to increase their rate of pausing, and, in disorder involving the production of certain types of
some instances, to modify the length and complexity of speech disfluencies, as well as the a¤ective, behavioral,
their utterances, although there is increasing concern and cognitive reactions that may result. There is no one
among some researchers that restricting the language known cause of stuttering. Instead, childhood stuttering
input children receive may have unintended negative appears to arise because of a complex interaction among
consequence for children’s overall language develop- several factors that are both genetically and environ-
ment (e.g., Bernstein Ratner and Silverman, 2000). Fur- mentally determined, such as the child’s linguistic abili-
thermore, although it is clear that children do learn ties, motoric abilities, and temperament. Therefore,
certain aspects of communication from their environ- diagnostic evaluations of preschoolers who stutter must
ment, there is relatively little empirical support for the examine the child’s environment and several aspects of
notion that changing parents’ speech characteristics di- child’s development. Treatment options include both in-
rectly influences children’s speech characteristics or their direct and direct approaches designed not only to mini-
speech fluency. Even in the absence of clear e‰cacy mize the occurrence of speech disfluencies, but also to
data, however, the indirect approach is favored by many minimize the impact of those disfluencies on the lives of
clinicians who are hesitant to draw attention to the children who stutter.
child’s speech or increase the child’s concerns about their See also language in children who stutter;
speech fluency. stuttering.
In recent years, a competing form of treatment for —J. Scott Yaruss
preschool children who stutter has gained popularity
(Harrison and Onslow, 1999). This behavioral approach References
is based on parent-administered intermittent reinforce-
Ambrose, N., and Yairi, E. (1999). Normative disfluency data
ment of fluent speech and occasional, mild, supportive for early childhood stuttering. Journal of Speech, Language,
correction of stuttered speech. Specifically, when a child and Hearing Research, 42, 895–909.
stutters, the parent labels the stuttering as ‘‘bumpy’’ Ambrose, N., Yairi, E., and Cox, N. (1993). Genetic aspects of
speech and encourages the child to repeat the sentence early childhood stuttering. Journal of Speech, Language,
‘‘without the bumps.’’ E‰cacy data indicate that this and Hearing Research, 36, 701–706.
form of treatment is highly successful at reducing the Ambrose, N., Yairi, E., and Cox, N. (1997). The genetic basis
observable characteristics of stuttering, although ques- of persistence and recovery in stuttering. Journal of Speech,
tions remain about the mechanism responsible for this Language, and Hearing Research, 40, 567–580.
improved fluency. Andrews, G., and Harris, M. (1964). The syndrome of stutter-
Regardless of the approach used in the preschool ing. London: Heinman Medical Books.
Bernstein Ratner, N. (1997a). Stuttering: A psycholinguistic
years, clinicians generally shift from indirect to more di- perspective. In R. F. Curlee and G. M. Siegel (Eds.), Nature
rect forms of treatment as children grow older and their and treatment of stuttering: New directions (2nd ed., pp.
awareness of their speaking di‰culties increases. Direct 99–127). Needham Heights, MA: Allyn and Bacon.
treatment strategies include specifically teaching children Bernstein Ratner, N. (1997b). Leaving Las Vegas: Clinical
to use a slower speaking rate or reduced physical tension odds and individual outcomes. American Journal of Speech-
to smooth out their speech and helping them learn to Language Pathology, 6, 29–33.
Speech Disorders: Genetic Transmission 183
Bernstein Ratner, N., and Silverman, S. (2000). Parental per- Manning, W. H. (2001). Clinical decision making in fluency
ceptions of children’s communicative development at stut- disorders (2nd ed.). San Diego, CA: Singular Publishing
tering onset. Journal of Speech, Language, and Hearing Company.
Research, 43, 1252–1263. Shapiro, D. A. (1999). Stuttering intervention: A collaborative
Bloodstein, O. (1993). Stuttering: The search for a cause and a journey to fluency freedom. Austin, TX: Pro-Ed.
cure. Needham Heights, MA: Allyn and Bacon. Starkweather, C. W., and Givens-Ackerman, J. (1997). Stut-
Bothe, A. (2002). Speech modification approaches to stuttering tering. Austin, TX: Pro-Ed.
treatment in schools. Seminars in Speech and Language, 23,
181–186.
Conture, E. G. (2001). Stuttering: Its nature, assessment, and
treatment. Needham Heights, MA: Allyn and Bacon.
Curlee, R., and Yairi, E. (1997). Early intervention with early
childhood stuttering: A critical examination of the data. Speech Disorders: Genetic
American Journal of Speech-Language Pathology, 6, 8–18. Transmission
Harrison, E., and Onslow, M. (1999). Early intervention for
stuttering: The Lidcombe program. In R. F. Curlee (Ed.),
Stuttering and Related Disorders of Fluency (2nd ed., pp.
65–79). New York: Thieme. Advances in behavioral and molecular genetics over the
Logan, K. J., and Yaruss, J. S. (1999). Helping parents address past decade have made it possible to investigate genetic
attitudinal and emotional factors with young children who factors that may contribute to speech sound disorders.
stutter. Contemporary Issues in Communication Science and These speech sound disorders of unknown etiology were
Disorders, 26, 69–81. often considered to be ‘‘functional’’ or learned. Mount-
Ramig, P. R., and Bennett, E. M. (1997). Clinical management ing evidence suggests that at least some speech sound
of children: Direct management strategies. In R. F. Curlee disorders may in part be genetic in origin. However, the
and G. M. Siegel (Eds.), Nature and treatment of stuttering: search for a genetic basis of speech sound disorders has
New directions (2nd ed., pp. 292–312). Needham Heights, been complicated by definitional and methodological
MA: Allyn and Bacon.
problems. Issues that are critical to understanding the
Smith, A., and Kelly, E. (1997). Stuttering: A dynamic, multi-
factorial model. In R. F. Curlee and G. M. Siegel (Eds.), genetic transmission of speech disorders include preva-
Nature and treatment of stuttering: New directions (2nd ed., lence data, phenotype definitions, sex as a risk factor,
pp. 204–217). Needham Heights, MA: Allyn and Bacon. familial aggregation of disorders, and behavioral and
Starkweather, C. W., and Givens-Ackerman, J. (1997). Stut- molecular genetic findings.
tering, Austin, TX: Pro-Ed.
Yairi, E., and Ambrose, N. (1999). Early childhood stuttering: Prevalence Estimates. Prevalence estimates of speech
I. Persistence and recovery rates. Journal of Speech, Lan- sound disorders are essential in conducting behavioral
guage, and Hearing Research, 42, 1098–1112. and molecular genetic studies. They are used to calculate
Yairi, E., Ambrose, N., Paden, E. P., and Throneburg, R. N. an individual’s risk of having a disorder as well as to test
(1996). Predictive factors of persistence and recovery: Path-
di¤erent genetic models of transmission. Prevalence rates
ways of childhood stuttering. Journal of Communication
Disorders, 29, 51–77. for speech disorders may vary based on the age and sex
Yaruss, J. S. (1998). Describing the consequences of disorders: of the individual, the type of disorder, and the comorbid
Stuttering and the International Classification of Impair- conditions associated with it. In an epidemiological
ments, Disabilities, and Handicaps. Journal of Speech, sample, Shriberg and Austin (1998) reported the preva-
Language, and Hearing Research, 49, 249–257. lence of speech delay in 6-year-old children to be 3.8%.
Yaruss, J. S. (1999). Utterance length, syntactic complexity, Speech delay was approximately 1.5 times more preva-
and childhood stuttering. Journal of Speech, Language, and lent in boys than in girls. Shriberg and Austin (1998) also
Hearing Research, 42, 329–344. found that children with speech involvement have a two
Yaruss, J. S., LaSalle, L. R., and Conture, E. G. (1998). Eval- to three times greater risk for expressive language prob-
uating stuttering in young children: Diagnostic data. Amer-
lems than for receptive language problems. Estimates
ican Journal of Speech-Language Pathology, 7, 62–76.
of the comorbidity of receptive language disorders with
speech disorders ranged from 6% to 21%, based on
Further Readings whether receptive language was assessed by vocabulary,
Bloodstein, O. (1993). Stuttering: The search for a cause and a grammar, or both. Similarly, estimates of comorbidity
cure. Needham Heights, MA: Allyn and Bacon. of expressive language disorders with speech disorders
Bloodstein, O. (1995). A handbook on stuttering (5th ed.). San ranged from 38% to 62%, depending on the methods
Diego, CA: Singular Publishing Group. used to assess expressive skills.
Conture, E. G. (2001). Stuttering: Its nature, assessment, and
treatment. Needham Heights, MA: Allyn and Bacon. Phenotype Definitions. Phenotype definitions (i.e., the
Curlee, R. F. (1999). Stuttering and related disorders of fluency. behavior that is under study) are also crucial for genetic
New York: Thieme.
Curlee, R. F., and Siegel, G. M. (Eds.). (1997). Nature and
studies of speech disorders. Phenotype definitions may
treatment of stuttering: New directions (2nd ed.). Needham be broadly or narrowly defined, according to the hy-
Heights, MA: Allyn and Bacon. pothesis to be tested. A broad phenotype may include
Guitar, B. (1998). Stuttering: An integrated approach to its language as well as speech disorders and sometimes re-
nature and treatment (2nd ed.). Baltimore: Williams and lated language learning di‰culties such as reading and
Wilkins. spelling disorders (Tallal, Ross, and Curtiss, 1989). An
184 Part II: Speech
residual errors. These two subtypes may have di¤erent

genetic or environmental causes.
Sex as a Risk Factor. A robust finding in studies of fa-

milial speech and language disorders has been a higher
prevalence of disorders in males than in females, ranging
from a 2 : 1 to a 3 : 1 ratio (Neils and Aram, 1986; Tallal
et al., 1989; Tomblin, 1989; Lewis, 1992). Explanations
for this increased prevalence in males include referral
bias (Shaywitz et al., 1990), immunoreactive theories
(Robinson, 1991), di¤erences in rates and patterns of
neurological maturation (Plante, 1996), variation in
cognitive phenotypes (Bishop, North, and Donlan,
1995), and di¤erences in genetic transmission of the dis-
orders. An X-linked mode of transmission of speech and
Figure 1. A typical family pedigree of a child with a speech dis- language disorders has not been supported by pedigree
order. The arrow indicates the proband child. Male family studies (Lewis, 1992; Beitchman, Hood, and Inglis,
members are represented by squares and female family mem- 1992). However, a sex-specific threshold hypothesis that
bers are represented by circles. Individuals who are a¤ected proposes that girls have a higher threshold for expres-
with speech disorders are shaded in black. Other disorders are sion of the disorder, and therefore require a higher
coded as follows: Read ¼ reading disorder, Spell ¼ spelling genetic loading (more risk genes) before the disorder is
disorder, Lang ¼ language disorder, LD ¼ learning disability, expressed, has been supported (Tomblin, 1989; Lewis,
Apraxia ¼ apraxia of speech, Artic ¼ articulation disorder.
1992; Beitchman, Hood, and Inglis, 1992). Consistent
with this hypothesis, a higher percentage of a¤ected rel-
atives are reported for female (38%) than for male pro-
bands (26%). Di¤ering sex ratios may be found for
individual exhibiting a single disorder or a combination
various subtypes of phonology disorders (Shriberg and
of disorders is considered to be a¤ected. Such a broad
Austin, 1998). Hall, Jordan, and Robin (1993) reported
phenotype may test a general verbal trait deficit hypoth-
a 3 : 1 male to female ratio for developmental apraxia of
esis which holds that there is a common underlying
speech. Similarly, boys with phonology disorders were
genetic and cognitive basis for speech and language dis-
found to have a higher rate of comorbid language dis-
orders that is expressed di¤erently in individual family
orders than girls with phonology disorders (Shriberg and
members (i.e., variable expression). An alternative ex-
Austin, 1998). Lewis et al. (1999) found that probands
planation is that each disorder has a unique underlying
with phonology disorders alone demonstrated a more
genetic and cognitive basis. Some investigators have
equal sex ratio (59% male and 41% female) than pro-
narrowly defined the phenotype as a specific speech dis-
bands with phonology disorders with language disorders
order, such as phonology (Lewis, Ekelman, and Aram,
(71% male and 29% female).
1989). Even if the proband (i.e., the child with a dis-
order from whom other family members are identified)
is selected by a well-defined criterion, nuclear family Familial Aggregation. Familial aggregation refers to
members often present a varied spectrum of disorders. the percentage of family members demonstrating a dis-
Some studies, while narrowly defining the phenotype for order. Familial aggregation or family resemblance may
the proband, have used a broad phenotype definition for be due to heredity, shared family environment, or both.
family members. Since older siblings and parents often Research has supported the conclusion that speech and
do not demonstrate speech sound errors in their conver- language disorders aggregate within families (Neils and
sational speech, researchers have relied on historical Aram, 1986; Tallal, Ross, and Curtiss, 1989; Tomblin,
reports, rather than direct observations of the speech 1989; Gopnik and Crago, 1991; Lewis, 1992; Felsenfeld,
disorder. Figure 1 shows a typical family pedigree of a McGue, and Broen, 1995; Lahey and Edwards, 1995;
child with a speech disorder. Spitz et al., 1997; Rice, Haney, and Wexler, 1998).
Narrow phenotype definitions may examine subtypes Reports indicate that 23%–40% of first-degree family
of phonology disorders with postulated distinct genetic members of individuals with speech and language dis-
bases. One schema for the subtyping of phonology disorders are a¤ected. Di¤erences in reported rates of
orders may be based on whether or not the phonology a¤ected family members again may be attributed to dif-
disorder is accompanied by more pervasive language ferences in definitional criteria for probands and family
disorders (Lewis and Freebairn, 1997). Children with members.
isolated phonology disorders experience fewer aca- Two studies specifically examined familial aggrega-
demic di‰culties than children with phonology disorders tion of phonology disorders (Lewis, Ekelman, and
accompanied by other language disorders (Aram and Aram, 1989; Felsenfeld, McGue, and Broen, 1995). Both
Hall, 1989). Shriberg et al. (1997) propose at least two studies reported 33% of first-degree family members
forms of speech sound disorders of unknown origin: (nuclear family members) to have had speech-language
those with speech delay and those with questionable di‰culties. Brothers were most frequently a¤ected.
Speech Disorders: Genetic Transmission 185
Behavioral and Molecular Genetic Studies. The twin study that provided direct evidence for a genetic basis for
study paradigm has been employed to identify genetic a speech sound disorder associated with a neurological
and environmental contributions to speech and language abnormality. It was the initial step in the application of
disorders. Twin studies compare the similarity (concor- molecular genetic techniques to the study of speech and
dance) of identical or monozygotic twins to fraternal language disorders. Further studies are needed to deter-
or dizygotic twins. If monozygotic twins are more con- mine if FOXP2 is found in other families with speech
cordant than dizygotic twins, a genetic basis is implied. disorders.
To date, a twin study specifically examining speech See also speech disorders in children: descriptive
sound disorders has not been conducted. Rather, twin linguistic approaches; developmental apraxia of
studies have employed a broad phenotype definition speech; phonological errors, residual.
that includes both speech and language disorders. Twin
studies of speech and language disorders (Lewis and —Barbara A. Lewis
Thompson, 1992; Bishop, North, and Donlan, 1995;
Tomblin and Buckwalter, 1998) have consistently re- References
ported higher concordance rates for monozygotic than
Aram, D. M., and Hall, N. E. (1989). Longitudinal follow-up
for dizygotic twin pairs, confirming a genetic contribu-
of children with preschool communication disorders: Treat-
tion to these disorders. Concordance rates for mono- ment implications. School Psychology Review, 19, 487–501.
zygotic twins range from .70 (Bishop, North, and Bishop, D. V. M., North, T., and Donlan, C. (1995). Non-
Donlan, 1995) to .86 (Lewis and Thompson, 1992) and word repetition as a phenotypic marker for inherited lan-
.96 (Tomblin and Buckwalter, 1998). Concordance guage impairment: Evidence from a twin study. Journal of
rates reported for dizygotic twin pairs are as follows: Child Psychology and Psychiatry, 37, 391–403.
.46 (Bishop, North, and Donlan, 1995), .48 (Lewis and Beitchman, J. H., Hood, J., and Inglis, A. (1992). Familial
Thompson, 1992), and .69 (Tomblin and Buckwalter, transmission of speech and language impairment: A pre-
1998). A large twin study (3000 pairs of twins) suggested liminary investigation. Canadian Journal of Psychiatry, 37,
that genetic factors may exert more influence at the 151–156.
Dale, P. S., Simono¤, E., Bishop, D. V. M., Eley, T., Oliver,
lower extreme of language abilities, whereas environ- B., Price, T. S., et al. (1998). Genetic influence on language
mental factors may influence normal language abilities delay in two-year-old children. Nature Neuroscience, 1,
more (Dale et al., 1998). These studies, while supporting 324–328.
a genetic contribution to speech and language skills, also Felsenfeld, S., McGue, M., and Broen, P. A. (1995). Familial
indicate a moderate environmental influence. Environ- aggregation of phonological disorders: Results from a 28-
mental factors working with genetics may determine year follow-up. Journal of Speech and Hearing Research, 38,
speech and language impairment in an individual. 1091–1107.
Consistent with these findings, an adoption study by Felsenfeld, S., and Plomin, R. (1997). Epidemiological and
Felsenfeld and Plomin (1997) demonstrated that a his- o¤spring analyses of developmental speech disorders using
tory of speech and language disorders in the biological data from the Colorado Adoption Project. Journal of
parent best predicted whether or not a child was a¤ected. Fisher, S. E., Vargha-Khadem, F., Watkins, K. E., Monacol,
This relationship was not found when the family history A., and Pembrey, M. E. (1998). Localization of a gene
of the adoptive parents was considered. As with twin implicated in a severe speech and language disorder. Nature
studies, a broad phenotype definition that encompassed Genetics, 18, 168–170.
both speech and language disorders was employed. Gopnik, M., and Crago, M. (1991). Familial aggregation of a
Adoption studies have not been conducted for speech developmental language disorder. Cognition, 39, 1–50.
sound disorders alone. Hall, P. K., Jordan, L. S., and Robin, D. A. (1993). Devel-
Segregation analyses examine the mode of transmis- opmental dyspraxia of speech: Theory and clinical practice.
sion of the disorder within a family. Segregation analyses Austin, TX: Pro-Ed.
have confirmed familial aggregation of speech and lan- Lahey, M., and Edwards, J. (1995). Specific language impair-
ment: Preliminary investigation of factors associated with
guage disorders and supported both a major locus model family history and with patterns of language performance.
and a polygenic model of transmission of the disorder Journal of Speech and Hearing Research, 38, 643–657.
(Lewis, Cox, and Byard, 1993). The failure to define a de- Lai, C. S. L., Fisher, S. E., Hurst, J. A., Vargha-Khadem,
finitive mode of transmission may be due to genetic heter- F., and Monaco, P. (2001). A forkhead-domain gene is
ogeneity (i.e., more than a single underlying genetic basis). mutated in a severe speech and language disorder. Nature,
Only a single study to date has reported a molecular 413, 519–523.
genetic analysis of a family with apraxia of speech and Lewis, B. A. (1992). Pedigree analysis of children with phono-
other language impairments. Genetic studies of a single logy disorders. Journal of Learning Disabilities, 25, 586–597.
large pedigree, known as the K.E. family, revealed link- Lewis, B. A., Cox, N. J., and Byard, P. J. (1993). Segregation
age to a region of chromosome 7 (Vargha-Khadem et analyses of speech and language disorders. Behavior Genet-
ics, 23, 291–297.
al., 1995; Fisher et al., 1998). Subsequently, the FOXP2 Lewis, B. A., Ekelman, B. L., and Aram, D. M. (1989). A
gene that is postulated to result in the development of familial study of severe phonological disorders. Journal of
abnormal neural structures for speech and language was Speech and Hearing Research, 32, 713–724.
identified (Lai et al., 2001). Neuroimaging of family Lewis, B. A., and Freebairn, L. (1997). Subgrouping children
members indicated abnormalities in regions of the fron- with familial phonology disorders. Journal of Communica-
tal lobe and associated motor systems. This was the first tion Disorders, 30, 385–402.
186 Part II: Speech
Lewis, B. A., Freebairn, L. A., and Taylor, H. G. (1999). 1981). Speech-language pathologists also consider the
School-age follow-up of children with familial phonology contributions of psychopathology in the evaluation and
disorders. Paper presented at the annual convention of the management of acquired adult speech disorders (Sapir
American Speech, Language, and Hearing Association. and Aronson, 1990). This is important because depres-
November 1999, San Francisco, CA.
sion and/or anxiety are common in stroke, traumatic
Lewis, B. A., and Thompson, L. A. (1992). A study of devel-
opmental speech and language disorders in twins. Journal of brain injury, and progressive neurological disease
Speech and Hearing Research, 35, 1086–1094. (Giannoti, 1972). Depression frequently occurs after a
Neils, J., and Aram, D. M. (1986). Family history of children laryngectomy and may interfere with rehabilitation
with developmental language disorders. Perceptual and e¤orts (Rollin, 1987). Individuals subjected to prolonged
Motor Skills, 63, 655–658. stress may speak with excessive tension in the vocal
Plante, E. (1996). Phenotypic variability in brain-behavior mechanism. When misuse of the vocal apparatus occurs
studies of specific language impairment. In M. Rice (Ed.), for a long period of time, it can lead to the formation of
Toward a genetics of language. Mahwah, NJ: Erlbaum. vocal fold lesions (e.g., nodules) and long-term dyspho-
Rice, M. L., Haney, K. R., and Wexler, K. (1998). Family nia (Aronson, 1991; Case, 1991). One of the responsibil-
histories of children with SLI who show extended optional
ities of the speech-language pathologist is to determine
infinitives. Journal of Speech, Language, and Hearing Re-
search, 41, 419–432. if and how psychogenic components contribute to
Robinson, R. J. (1991). Causes and associations of severe acquired speech disorders in adults secondary to struc-
and persistent specific speech and language disorders in tural lesions and neurological disease to enhance di¤er-
children. Developmental Medicine and Child Neurology, 33, ential diagnosis and to plan appropriate intervention
943. (Sapir and Aronson, 1990).
Shaywitz, S. E., Shaywitz, B. A., Fletcher, J. M., and Escobar, In general, some degree of psychopathology is present
M. D. (1990). Prevalence of reading disability in boys and in most acquired adult speech disorders. It is reasonable
girls. JAMA, 264, 998–1002. that persons who previously communicated normally
Shriberg, L. D., and Austin, D. (1998). Comorbidity of speech- would be a¤ected psychologically when their ability to
language disorders: Implications for a phenotype marker
communicate was disrupted. In such cases, psychopa-
for speech delays. In R. Paul (Ed.), The speech/language
connection. Baltimore: Paul H. Brookes. thology (e.g., depression and anxiety) contributes to and
Shriberg, L. D., Austin, D., Lewis, B. A., McSweeny, J. L., possibly exacerbates the speech disorder, but it is not
and Wilson, D. L. (1997). The percentage of consonants the cause of the disorder. Therefore, while the speech-
correct (PCC) metric: Extensions and reliability data. Jour- language pathologist must be alert to the role of psy-
nal of Speech and Hearing Research, 40, 708–722. chopathology in assessment and management of these
Spitz, R., Tallal, P., Flax, J., and Benasich, A. A. (1997). Look cases, these disorders are not ‘‘purely’’ psychogenic in
who’s talking: A prospective study of familial transmission nature.
of language impairments. Journal of Speech and Hearing Purely psychogenic speech disorders, the subject of
Research, 40, 990–1001. this chapter, are rare in clinical practice. With psycho-
Tallal, P., Ross, R., and Curtiss, S. (1989). Familial aggrega-
genic speech disorders, the communication breakdown
tion in specific language impairment. Journal of Speech and
Hearing Disorders, 54, 167–173. stems from a conversion disorder. Conversion disorders
Tomblin, J. B. (1989). Familial concentration of developmental are included within a larger family of psychiatric dis-
language impairment. Journal of Speech and Hearing Dis- orders, somatoform illnesses. These tend to be associated
orders, 54, 287–295. with pathologic beliefs and attitudes on the part of the
Tomblin, J. B., and Buckwalter, P. (1998). The heritability of patient that results in somatic symptoms. The American
poor language achievement among twins. Journal of Speech Psychiatric Association (1987) defines a conversion dis-
and Hearing Research, 41, 188–199. order as ‘‘an alteration or loss of physical functioning
Vargha-Khadem, F., Watkins, K., Alcock, K., Fletcher, P., that suggests a physical disorder, that actually represents
and Passingham, R. (1995). Praxic and nonverbal cognitive an expression of a psychological conflict or need.’’ An
deficits in a large family with a genetically transmitted
example might be a woman who suddenly loses her voice
speech and language disorder. Proceedings of the National
Academy of Sciences of the United States of America, 92, because she cannot face the psychological conflict of a
930–933. spouse’s a¤air. Here the symptom (voice loss) constitutes
a lesser threat to her psychological equilibrium than
confronting the husband with his infidelity. A partial list
Speech Disorders in Adults, of psychogenic speech disorders includes partial (dys-
Psychogenic phonia) or complete (aphonia) loss of voice (Andersson
and Schalen, 1998), dysarthria (Kallen, Marshall, and
Casey, 1986), mutism (Kalman and Granet, 1981), and
The human communication system is vulnerable to stuttering (Wallen, 1961; Deal, 1982). There are a few
changes in the individual’s emotional or psychological reports that indicate patients can also develop psycho-
state. Several studies show the human voice to be a sen- genic language disorders, specifically aphasia (Iddings
sitive indicator of di¤erent emotions (Aronson, 1991). and Wilson, 1981; Sevush and Brooks, 1983) and dys-
Psychiatrists routinely evaluate vocal (intensity, pitch), lexia and dysgraphia (Master and Lishman, 1984).
prosodic (e.g., rhythm, rate, pauses), and other features Reports of psychogenic swallowing disorders (dyspha-
of communication to diagnosis neurotic states (Brodnitz, gia) also exist (Carstens, 1982).
Speech Disorders in Adults, Psychogenic 187
Diagnosis valuable diagnostic information is gleaned from the

clinical examination and the interview. PSD symptoms
Before proceeding with the evaluation and treatment of fluctuate widely across patients and within the same pa-
a patient with a suspected psychogenic speech disorder tient. Examples of PSDs a¤ecting voice include conver-
(PSD), the speech-language pathologist should be sure sion aphonia (no voiced but articulated air stream),
that the patient has been seen by the appropriate medical conversion dysphonia (some voice but abnormal pitch,
specialist to rule out an organic cause of the problem. loudness, or quality), and conversion muteness (no voice
Usually this is not a problem because the PSD patient but moving of the lips as though articulating) (Case,
will have sought care from several specialists, sometimes 1991). Individual PSD patients may report speaking
simultaneously, and may have a complicated medical better in some situations than others. The patient may
history in which multiple diagnoses are tenable (Case, even exhibit variability in symptoms within the context
1991). Therefore, definitive diagnosis of a speech dis- of the clinical examination (Case, 1991). Aphonic or
order as ‘‘psychogenic’’ is an inexact process relying dysphonic patients may vocalize normally when laugh-
greatly on the examiner’s skill, experience, and ability to ing, coughing, or clearing of the throat (Morrison and
synthesize information from the clinical interview and Rammage, 1994). Further, distraction techniques such
examination. as asking the patient to hum, grunt, or say ‘‘uh-huh’’ as
an a‰rmation might produce a normal-sounding voice.
Interview and History PSD patients may react unfavorably when it is pointed
The PSD patient often has a history of other conversion out that he or she has produced a normal-sounding voice
disorders and prior psychological stress unrelated to the and insist that the examiner identify a physical cause of
symptom (Pincus and Tucker, 1974). PSD onset is sud- the problem.
den and tends to be linked to a specific traumatic event
(e.g., surgery) or painful emotional experiences (e.g., Intervention
death of a family member). In the patient interview, the
speech-language pathologist determines what the patient After the examining physician has ruled out an organic
might gain from the presenting speech disorder. Primary cause for the speech disorder and the problem defini-
gain refers to the reduction of anxiety, tension, and tively diagnosed as psychogenic, an appropriate inter-
conflict provided by the speech disorder. This could be vention plan can be selected. Intervention approaches
related to a breakdown in communication between the for the patient with a PSD vary widely and largely de-
patient and some person of importance, such as a pend on the work setting, skill, training, and philosophy
spouse, a boss, or parent. Here the speech problem con- of the clinician (see also neurogenic mutism, laryn-
stitutes a lesser dilemma for the individual than the in- gectomy). For the most part, interventions used by
terpersonal problems from which it arose. Secondary speech-language pathologists focus on removal of the
gain refers to those benefits received by the individual symptoms (e.g., aphonia, dysphonia, dysarthria) found
from the external environment. This could take the form to be abnormal in the evaluation, and give limited at-
of monetary compensation, attention, and sympathy tention to psychological or psychiatric issues.
from others over perceived distress, being given fewer Several studies report results of successful interven-
responsibilities (e.g., work, child care), release from so- tions for PSD patients (Aronson, 1969; Marshall and
cial obligations, and satisfaction of dependency needs. Watts, 1975; Kalman and Granet, 1981; Carstens, 1982;
Sometimes secondary gains reinforce the PSD and pro- Kallen, Marshall, and Casey, 1986; Andersson and
long its duration (Morrison and Rammage, 1994). Un- Schalen, 1998). These usually begin with the clinician
like individuals with acquired adult communication acknowledging the patient’s distress and assuring the
disorders resulting from structural damage or neurologi- patient that there is no known organic reason for the
cal disease, the PSD patient may show unusual calmness problem. Potential factors that might contribute to
and lack of concern over the speech disorder, a phe- the patient’s distress are discussed in a nonthreatening
nomenon called la belle indi¤erence. It should be under- manner. Here the focus of attention is on how the
stood, however, that PSD patients do not consciously symptom or symptoms are disrupting communication.
produce the symptom a¤ecting communication but Individually designed intervention programs are then
truly believe they are ill. As a consequence, many PSD initiated and typically proceed in small steps or behav-
patients display excessive concern about their bodies and ioral increments. For example, a sequence of interven-
overreact to normal somatic stimuli. tion steps for a patient with psychogenic aphonia may
include the following: (1) eliciting a normal vocal tone
with a cough, grunt, or hum; (2) prolonging the normal
Clinical Examination
vocalization into a vowel; (3) turning the vowel into a
Results of the oral-peripheral and laryngeal examination VC word; (4) linking two VC words together; (5) pro-
are often normal or fail to account for the PSD patient’s ducing sentences using several VC words; and so forth.
symptoms. Instrumental measures have limited value in Relaxation exercises, patient education, and counseling
assessment and diagnosis of assessing the individual with may be included in the intervention programs for some
a PSD, since these contribute to the patient’s fixation PSD patients. In general, symptomatic interventions are
that his or her problem is organically based. The most successful with two to 10 sessions of treatment.
188 Part II: Speech
Three factors—duration of time elapsing between Master, D. R., and Lishman, W. A. (1984). Seizures, dyslexia,
onset of symptoms and beginning of therapy, severity of and dysgraphia of psychogenic origin. Archives of Neurol-
the speech disorder, and the degree to which the patient’s ogy, 41, 889–890.
speech disorder is distinguishable from the emotional Morrison, M., and Rammage, L. (1994). Management of voice
disorders. San Diego, CA: Singular.
disturbance—are related to intervention success (Case,
Pincus, J. H., and Tucker, G. (1974). Behavioral neurology.
1991). Generally, intervention is more apt to be success- New York: Oxford Press.
ful if the time between the onset of symptoms and the Rollin, W. J. (1987). The psychology of communication dis-
start of intervention is short, if the problem is severe, orders in individuals and their families. Englewood Cli¤s,
and if a speech disorder is clearly distinguishable from NJ: Prentice-Hall.
the emotional disturbance responsible for the problem Sapir, S., and Aronson, A. E. (1990). The relationship between
(Case, 1991). However, certain patients may be more psychopathology and speech and language disorders in
anxious for help and receptive to therapy if they have neurological patients. Journal of Speech and Hearing Dis-
been without voice for a considerable period of time be- orders, 55, 503–509.
fore consulting a speech-language pathologist (Freeman, Sevush, S., and Brooks, J. (1983). Aphasia versus functional
disorder: Factors in di¤erential diagnosis. Psychosomatics,
1986). In addition, patients who react unfavorably to
24, 847–848.
intervention success or refuse to accept or acknowledge Wallen, V. (1961). Primary stuttering in a 28-year-old adult.
their ‘‘improved communicative status’’ do not respond Journal of Speech and Hearing Disorders, 26, 394–395.
well to behavioral treatments and have a poor prog-
nosis for symptom-based interventions. In such cases the
speech-language pathologist must refer the patient to the Further Readings
psychologist or the psychiatrist.
Aronson, A. E., Peterson, H. W., Jr., and Liten, E. M. (1964).
—Robert C. Marshall Voice symptomatology in functional dysphonia. Journal of
Speech and Hearing Disorders, 29, 367–380.
References Aronson, A. E., Peterson, H. W., Jr., and Liten, E. M. (1966).
Psychiatric symptomatology in functional dysphonia and
American Psychiatric Association. (1987). Diagnostic and sta- aphonia. Journal of Speech and Hearing Disorders, 31, 115–
tistical manual of mental disorders (3rd ed., rev.). Washing- 127.
ton, DC: American Psychiatric Press. Bangs, J. L., and Freidiinger, A. (1950). A case of hysterical
Andersson, K., and Schalen, L. (1998). Etiology and treatment dysphonia in an adult. Journal of Speech and Hearing Dis-
of psychogenic voice disorder: Results of a follow-up study orders, 15, 316–323.
of thirty patients. Journal of Voice, 12, 96–106. Boone, D. (1966). Treatment of functional aphonia in a child
Aronson, A. E. (1969). Speech pathology and symptom ther- and an adult. Journal of Speech and Hearing Disorders, 31,
apy in the interdisciplinary treatment of psychogenic apho- 69–74.
nia. Journal of Speech and Hearing Disorders, 34, 324– Dickes, R. A. (1974). Brief therapy of conversion reactions: An
341. in-hospital technique. American Journal of Psychiatry, 131,
Aronson, A. E. (1991). Clinical voice disorders. New York: 584–586.
Thieme. Elias, A., Raven, R., and Butcher, P. (1989). Speech therapy
Brodnitz, F. E. (1981). Psychological considerations in vocal for psychogenic voice disorder: Survey of current practice
rehabilitation. Journal of Speech and Hearing Disorders, 4, and training. British Journal of Disorders of Communication,
21–26. 24, 61–76.
Carstens, C. (1982). Behavioral treatment of functional dys- Gray, B. F., England, G., and Mahoney, J. (1965). Treatment
phagia in a 12-year-old boy. Psychomatics, 23, 195–196. of benign vocal nodules by reciprocal inhibition. Behavior
Case, J. E. (1991). Clinical management of voice disorders. Research and Therapy, 3, 187–193.
Austin, TX: Pro-Ed. Guze, S. B., Woodru¤, R. A., and Clayton, P. J. (1971). A
Deal, J. L. (1982). Sudden onset of stuttering: A case report. study of conversion symptoms in psychiatric outpatients.
Journal of Speech and Hearing Disorders, 47, 301–304. American Journal of Psychiatry, 128, 135–138.
Freeman, M. (1986). Psychogenic voice disorders. In M. Faw- Helm, N. A., Butler, R. B., and Benson, D. F. (1978).
cus (Ed.), Voice disorders and their management (pp. 204– Acquired stuttering. Neurology, 28, 1159–1165.
207). London: Croom-Helm. Horsley, I. A. (1982). Hypnosis and self-hypnosis in the treat-
Gianotti, G. (1972). Emotional behavior and side of lesion. ment of psychogenic dysphonia: A case report. American
Cortex, 8, 41–55. Journal of Clinical Hypnosis, 24, 277–283.
Iddings, D. T., and Wilson, L. G. (1981). Unrecognized Kintzl, J., Biebl, W., and Rauchegger, H. (1988). Functional
anomic aphasia in an elderly woman. Psychosomatics, 22, aphonia: Psychological aspects of diagnosis and therapy.
710–711. Folia Phoniatrica, 40, 131–137.
Kallen, D., Marshall, R. C., and Casey, D. E. (1986). Atypical Koon, R. E. (1983). Conversion dysphagia in children. Psy-
dysarthria in Munchausen syndrome. British Journal of chomatics, 24, 182–184.
Disorders of Communication, 21, 377–380. Mace, C. J. (1994). Reversible cognitive impairment related
Kalman, T. P., and Granet, R. B. (1981). The written interview to conversion disorder. Journal of Nervous and Mental Dis-
in hysterical mutism. Psychosomatics, 22, 362–364. eases, 182, 186–187.
Marshall, R. C., and Watts, M. T. (1975). Behavioral treat- Marshall, R. C., and Starch, S. A. (1984). Behavioral treat-
ment of functional aphonia. Journal of Behavioral Therapy ment of acquired stuttering. Australian Journal of Human
and Experimental Psychiatry, 6, 75–78. Communication Disorders, 12, 17–24.
Speech Disorders in Children: A Psycholinguistic Perspective 189
Matas, M. (1991). Psychogenic voice disorders: Literature re- even if a neuroanatomical correlate or genetic basis for a
view and case report. Canadian Journal of Psychiatry, 36, speech and language impairment can be identified, the
363–365. medical diagnosis does not predict with any precision the
Neelman, J., and Mann, A. H. (1993). Treatment of hysterical speech and language di‰culties that an individual child
aphonia with hypnosis and prokaletic therapy. British
will experience, so the diagnosis will not significantly
Journal of Psychiatry, 163, 816–819.
Oberfield, R. A., Reuben, R. N., and Burkes, L. J. (1983). In- a¤ect the details of a day-to-day intervention program.
terdisciplinary approach of conversion disorders in adoles- To plan appropriate therapy, the medical model needs to
cent girls. Psychomatics, 24, 983–989. be supplemented by a linguistic approach.
Schalen, L., and Andersson, K. (1992). Di¤erential diagnosis The linguistic perspective is primarily concerned with
of psychogenic dysphonia. Clinical Otolaryngology, 35, the description of language behavior at di¤erent levels
225–230. of analysis. If a child is said to have a phonetic or artic-
Teitelbaum, M. L., and Kettl, P. (1985). Psychiatric consulta- ulatory di‰culty, the implication is that the child has
tion with a noncooperative, depressed stroke patient. Psy- problems with the production of speech sounds. A pho-
chomatics, 26, 145–146. nological di‰culty refers to inability to use sounds con-
Walton, D. A., and Black, H. (1959). The application of mod-
trastively to convey meaning. For example, a child may
ern learning theory to the treatment of aphonia. Journal of
Psychosomatic Research, 3, 303–311. use [t] for [s] at the beginning of words, even though the
child can produce a [s] sound in isolation perfectly well.
Thus, the child fails to distinguish between target words
(e.g., ‘‘sea’’ versus ‘‘tea’’) and is likely to be misunder-
Speech Disorders in Children: stood by the listener. The cause of this di‰culty may not
A Psycholinguistic Perspective be obvious.
The linguistic sciences have provided an indispensable
foundation for the assessment of speech and language
The terminology used to describe speech problems is di‰culties (Ingram, 1976; Grunwell, 1987). However,
rooted in classificatory systems derived from di¤erent this assessment is still a description and not an explana-
academic disciplines. In order to understand the ratio- tion of the disorder. Specifically, a linguistic analysis
nale behind the psycholinguistic approach, it is helpful focuses on the child’s speech output but does not take
to examine other approaches and compare how speech account of underlying cognitive processes. For this, a
problems have been classified from di¤erent perspec- psycholinguistic approach is needed.
tives. Three perspectives that have been particularly in- The psycholinguistic approach attempts to make
fluential are the medical, linguistic, and psycholinguistic good some of the shortcomings of the other approaches
perspectives. by viewing children’s speech problems as being derived
In a medical perspective, speech and language prob- from a breakdown in an underlying speech processing
lems are classified according to clinical entity. Com- system. This assumes that the child receives information
monly used labels include dyspraxia, dysarthria, and of di¤erent kinds (auditory, visual) about an utterance,
stuttering. Causes of speech di‰culties can be identified remembers it, and stores it in a variety of lexical rep-
(e.g., cleft palate, hearing loss, neurological impairment) resentations (a means for keeping information about
or an associated medical condition is known (e.g., words, which may be semantic, grammatical, phonolog-
autism, learning di‰culties, Down syndrome). ical, motor, or orthographic) within the lexicon (a store
Viewing speech and language disorders from a medi- of words), then selects and produces spoken and written
cal perspective can be helpful in various ways. First, words. Figure 1 illustrates the basic essentials of a psy-
through the medical exercise of constructing a di¤eren- cholinguistic model of speech processing. On the left
tial diagnosis, a condition may be defined when symp- there is a channel for the input of information via the ear
toms commonly associated with that condition are and on the right a channel for the output of information
identified; two examples are dyspraxia and dysarthria. through the mouth. The lexical representations at the
Second, for some conditions, medical management can top of the model store previously processed information.
contribute significantly to the prevention or remedia- In psycholinguistic terms, top-down processing refers
tion of the speech or language di‰culty, such as by in- to an activity whereby previously stored information
sertion of a cochlear implant to remediate hearing loss or (i.e., in the lexical representations) is helpful and used,
by surgical repair of a cleft palate. Third, the medical for example, in naming objects in pictures. A bottom-up
perspective may be helpful when considering the progno- processing activity requires no such prior knowledge and
sis for a child’s speech and language development, such can be completed without accessing stored linguistic
as when a progressive neurological condition is present. knowledge from the lexical representations; an example
However, the medical approach has major limitations is repeating sounds.
as a basis for the principled remediation of speech prob- A number of models have been developed from this
lems in individual children. A medical diagnosis cannot basic structure (e.g., Dodd, 1995; Stackhouse and Wells,
always be made. More often the term ‘‘specific speech 1997; Hewlett, Gibbon, and Cohen-McKenzie, 1998;
and/or language impairment’’ is used once all other Chiat, 2000). Although these models di¤er in their pre-
possible medical labels have been ruled out. Moreover, sentation, they share the premise that children’s speech
190 Part II: Speech
facets of unintelligibility in an individual child can

LEXI CAL
be related to di¤erent underlying processing deficits
REPRESENT ATI O N S (Chiat, 1983, 1989; Stackhouse and Wells, 1993). Ex-
tending the psycholinguistic approach to word finding
di‰culties, Constable (2001) has discovered that such
di‰culties are long-term consequences of underlying
speech processing problems that a¤ect how the lexical
representations are stored, and in particular how the
phonological, semantic, and motor representations are
interconnected.
The psycholinguistic approach has also been used to
investigate the relationship between spoken and written
language and to predict which children may have long-
term di‰culties (Dodd, 1995; Stackhouse, 2001). Those
children who fail to progress to a level of consistent
speech output, age-appropriate phonological awareness,
and letter knowledge skills are at risk for literacy
problems, particularly when spelling. Psycholinguistic
analysis of popular phonological awareness tasks (e.g.,
rhyme, syllable/sound segmentation and completion,
INPUT O UTPUT blending, spoonerisms) has shown that the development
of phonological awareness skills depends on an intact
Figure 1. The essentials of a psycholinguistic model of speech speech processing system (Stackhouse and Wells, 1997).
processing. (From Stackhouse, J., and Wells, B. [1997]. Chil- Thus, children with speech di‰culties are disadvantaged
dren’s speech and literacy di‰culties 1: A psycholinguistic in school, since developing phonological awareness is a
framework. London: Whurr. Reproduced with permission.) necessary stage in dealing with alphabetic scripts such
as English. Further, these phonological awareness skills
are needed not just for an isolated activity, such as a
di‰culties arise from one or more points in a faulty rhyme game, but also to participate in the interactions
speech processing system. The aim of the psycholin- typical of phonological intervention sessions delivered
guistic approach is to find out exactly where a child’s by a teacher or clinician (Stackhouse et al., 2002).
speech processing skills are breaking down and how An individual child’s psycholinguistic profile of
these deficits might be compensated for by coexisting speech processing skills provides an important basis for
strengths. The investigative procedure to do so entails planning a targeted remediation program (Stackhouse
generating hypotheses, normally from linguistic data, and Wells, 2001). There is no prescription for delivering
about the levels of breakdown that give rise to dis- this program, nor is there a bag of special activities. All
ordered speech output. These hypotheses are then tested intervention materials have the potential to be used in
systematically through carefully constructed tasks that a psycholinguistic way if analyzed appropriately. Princi-
provide su‰cient data to assemble a child’s profile of pled intervention is based on setting clear aims (Rees,
speech processing strengths and weaknesses (Stackhouse 2001a). Tasks are chosen or designed for their psycho-
and Wells, 1997; Chiat, 2000). linguistic properties and manipulated to ensure appro-
Collation of these profiles shows that some children priate targeting and monitoring of intervention. To this
with speech di‰culties have problems only on the output end, each task is analyzed into its components, as fol-
side of the model. However, many children with persist- lows:
ing speech problems have pervasive speech processing
Task ¼ Materials þ Procedure þ Feedback G Technique
di‰culties (in input, output, and lexical representations)
that impede progress. For example, when rehearsing new Rees (2001b) presents seven questions for examining
words for speech or spelling, it is usual to repeat them these four components. She demonstrates how altering
verbally. An inconsistent or distorted output, normally any one of them can change the nature of the task and
the result of more than one level of breakdown, may in thus the psycholinguistic demands made on the child.
turn a¤ect auditory processing skills, memory, and the In summary, a psycholinguistic approach to interven-
developing lexicon. It is therefore not surprising that tion puts the emphasis first on the rationale behind the
children with dyspraxic speech di‰culties often have design and selection of tasks for a particular child, and
associated input (Bridgeman and Snowling, 1988) and then on the order in which the tasks are to be presented
spelling di‰culties (Clarke-Klein and Hodson, 1995; to a child so that strengths are exploited and weaknesses
McCormick, 1995). supported (Vance, 1997; Corrin, 2001a, 2001b; Waters,
The case study research of children with develop- 2001). The approach tackles the issues of what to do,
mental speech disorders, typical of the psycholinguistic with whom, why, when, and how.
approach, has shown that not only are children un- In a review of psycholinguistic models of speech
intelligible for di¤erent reasons but also that di¤erent development, Baker et al. (2001) present both box-
Speech Disorders in Children: A Psycholinguistic Perspective 191
and-arrow and connectionist models as new ways of Hewlett, N., Gibbon, F., and Cohen-McKenzie, W. (1998).
conceptualizing speech impairment in children. This dis- When is a velar an alveolar? Evidence supporting a revised
cussion has focused on the former, since to date, box- psycholinguistic model of speech production in children.
and-arrow models have arguably had the most impact International Journal of Language and Communication Dis-
orders, 2, 161–176.
on clinical practice by adding to our repertoire of
Holm, A., and Dodd, B. (1999). An intervention case study of a
assessment and treatment approaches, and also by pro- bilingual child with phonological disorder. Child Language
moting communication and collaboration between Teaching and Therapy, 15, 139–158.
teachers and clinicians (Popple and Wellington, 2001). Ingram, D. (1976). Phonological disability in children. London:
The success of the psycholinguistic approach may lie Edward Arnold.
in the fact that it targets the underlying sources of McCormick, M. (1995). The relationship between the phono-
di‰culties rather than the symptoms alone (Holm and logical processes in early speech development and later spel-
Dodd, 1999). Although it is true that the outcome of ling strategies. In B. Dodd (Ed.), Di¤erential diagnosis
intervention depends on more than a child’s speech and treatment of children with speech disorder. London:
processing profile (Goldstein and Geirut, 1998), the Whurr.
Popple, J., and Wellington, W. (2001). Working together: The
development of targeted therapy through the setting
psycholinguistic approach within a school setting. In J.
of realistic aims and quantifiable objectives should Stackhouse and B. Wells (Eds.), Children’s speech and liter-
make a contribution to the measurement of the e‰cacy acy di‰culties: Vol. 2. Identification and intervention. Lon-
of intervention. don: Whurr.
Rees, R. (2001a). Principles of psycholinguistic intervention. In
—Joy Stackhouse
J. Stackhouse and B. Wells (Eds.), Children’s speech and
literacy di‰culties: Vol. 2. Identification and intervention.
References London: Whurr.
Rees, R. (2001b). What do tasks really tap? In J. Stackhouse
Baker, E., Croot, K., Mcleod, S., and Paul, R. (2001). Psy- and B. Wells (Eds.), Children’s speech and literacy di‰cul-
cholinguistic models of speech development and their ap- ties: Vol. 2. Identification and intervention. London: Whurr.
plication to clinical practice. Journal of Speech, Language, Stackhouse, J. (2001). Identifying children at risk for literacy
and Hearing Research, 44, 685–702. problems. In J. Stackhouse and B. Wells (Eds.), Children’s
Bridgeman, E., and Snowling, M. (1988). The perception of speech and literacy di‰culties: Vol. 2. Identification and in-
phoneme sequence: A comparison of dyspraxic and normal tervention. London: Whurr.
children. British Journal of Disorders of Communication, 23, Stackhouse, J., and Wells, B. (1993). Psycholinguistic assess-
245–252. ment of developmental speech disorders. European Journal
Chiat, S. (1983). Why Mikey’s right and my key’s wrong: The of Disorders of Communication, 28, 331–348.
significance of stress and word boundaries in a child’s out- Stackhouse, J., and Wells, B. (1997). Children’s speech and
put system. Cognition, 14, 275–300. literacy di‰culties 1: A psycholinguistic framework. London:
Chiat, S. (1989). The relation between prosodic structure, syl- Whurr.
labification and segmental realization: Evidence from a Stackhouse, J., and Wells, B. (Eds.). (2001). Children’s speech
child with fricative stopping. Clinical Linguistics and Pho- and literacy di‰culties 2: Identification and intervention.
netics, 3, 223–242. London: Whurr.
Chiat, S. (2000). Understanding children with language prob- Stackhouse, J., Wells, B., Pascoe, M., and Rees, R. (2002).
lems. Cambridge, UK: Cambridge University Press. From phonological therapy to phonological awareness.
Clarke-Klein, S., and Hodson, B. (1995). A phonologically Seminars in Speech and Language: Updates in Phonological
based analysis of misspellings by third graders with dis- Intervention, 23, 27–42.
ordered phonology histories. Journal of Speech and Hearing Vance, M. (1997). Christopher Lumpship: Developing phono-
Research, 38, 839–849. logical representations in a child with auditory processing
Constable, A. (2001). A psycholinguistic approach to word deficit. In S. W. Chiat, J. Law, and J. Marshall (Eds.),
finding di‰culties. In J. Stackhouse and B. Wells (Eds.), Language disorders in children and adults. London: Whurr.
Children’s speech and literacy di‰culties: Vol. 2. Identifica- Waters, D. (2001). Using input processing strengths to over-
tion and intervention. London: Whurr. come speech output di‰culties. In J. Stackhouse and B.
Corrin, J. (2001a). From profile to programme: Steps 1–2. In Wells (Eds.), Children’s speech and literacy di‰culties: Vol.
J. Stackhouse and B. Wells (Eds.), Children’s speech and 2. Identification and intervention. London: Whurr.
literacy di‰culties: Vol. 2. Identification and intervention.
London: Whurr. Further Readings
Corrin, J. (2001b). From profile to programme: Steps 3–6. In
J. Stackhouse and B. Wells (Eds.), Children’s speech and Bishop, D. V. M. (1997). Uncommon understanding: Develop-
literacy di‰culties: Vol. 2. Identification and intervention. ment and disorders of language comprehension in children.
London: Whurr. Hove, UK: Psychology Press.
Dodd, B. (1995). Di¤erential diagnosis and treatment of chil- Bishop, D. V. M., and Leonard, L. (2000). Speech and lan-
dren with speech disorder. London: Whurr. guage impairments in children: Causes, characteristics, inter-
Goldstein, H., and Geirut, J. (1998). Outcomes measurement vention, and outcome. Hove, UK: Psychology Press.
in child language and phonological disorders. In M. Frattali Chiat, S., Law, J., and Marshall, J. (1997). Language disorders
(Ed.), Measuring outcomes in speech-language pathology. in children and adults: Psycholinguistic approaches to ther-
New York: Thiene. apy. London: Whurr.
Grunwell, P. (1987). Clinical phonology (2nd ed.). London: Constable, A., Stackhouse, J., and Wells, B. (1997). Develop-
Croom Helm. mental word-finding di‰culties and phonological processing:
192 Part II: Speech
The case of the missing handcu¤s. Applied Psycholinguis- sures to establish their stability. For instance, to estab-
tics, 18, 507–536. lish the baserates of stuttering in a child, the clinician
Dent, H. (2001). Electropalatography: A tool for psycholin- should measure stuttering in at least three consecutive
guistic therapy. In J. Stackhouse and B. Wells (Eds.), speech samples. Baserates also should sample responses
Children’s speech and literacy di‰culties: Vol. 2. Identifica-
tion and intervention. London: Whurr.
adequately. For instance, to establish the baserate of
Ebbels, S. (2000). Psycholinguistic profiling of a hearing impaired production of a phoneme in a child, 15–20 words,
child. Child Language Teaching and Therapy, 16, 3–22. phrases, or sentences that contain the target phoneme
Hodson, B., and Edwards, M. L. (1997). Perspectives in applied should be used. Baserates also may be established for
phonology. Gaithersburg, MD: Aspen. di¤erent settings, such as the clinic, classroom, and
Lees, J. (1993). Children with acquired aphasias. London: home. In each setting, multiple measurements would be
Whurr. made.
Simpson, S. (2000). Dyslexia: A developmental language dis-
order. Child Care, Health and Development, 26, 355–380. Positive Reinforcement and Reinforcement Schedules.
Snowling, M. (2000). Dyslexia (2nd ed.). Oxford: Blackwell. Positive reinforcement is a powerful method of shaping
Snowling, M., and Stackhouse, J. (1996). Dyslexia, speech lan-
guage: A practitioner’s handbook. London: Whurr.
new behaviors or increasing the frequency of low-
Williams, P., and Stackhouse, J. (2000). Rate, accuracy and frequency but desired behaviors. It is a method of
consistency: Diadochokinetic performance of young, nor- selecting and strengthening an individual’s behaviors by
mally developing children. Clinical Linguistics and Phonet- arranging for certain consequences to occur immediately
ics, 14, 267–293. follow the behavior (Skinner, 1953, 1969, 1974). In using
Wood, J., Wright, J. A., and Stackhouse, J. (2000). Language positive reinforcement, the clinician arranges a behav-
and literacy: Joining together—An early years training ioral contingency, which is an interdependent relation-
package. Reading, UK: British Dyslexia Association. ship between a response made in the context of a
stimulus array and the consequence that immediately
follows it. Therefore, technically, behavioral contingency
is the heart of behavioral treatment.
Speech Disorders in Children: Positive reinforcers are specific events or objects that,
Behavioral Approaches to Remediation following a behavior, increase the future probability of
that behavior. Speech-language pathologists routinely
use a variety of positive reinforcers in teaching speech
Speech disorders in children include articulation and skills to children and adults. Praise is a common posi-
phonological disorders, stuttering, cluttering, develop- tive reinforcer. Other positive reinforces include tokens,
mental apraxia of speech, and a variety of disorders given for correct responses, that may be exchanged for
associated with organic conditions such as brain injury small gifts. Biofeedback or computer feedback as to
(including cerebral palsy), cleft palate, and genetic syn- the accuracy of response are other forms of positive
dromes. Despite obvious linguistic influences on the reinforcement.
analysis, classification, and theoretical understanding Positive reinforcers are initially o¤ered for every cor-
of speech disorders in children, most current treatment rect response, resulting in a continuous reinforcement
methods use behavioral techniques. The e¤ectiveness of schedule. When the new response or skill has somewhat
behavioral treatment techniques in remediating speech stabilized, the reinforcer may be o¤ered for every nth
disorders in children has been well documented (Onslow, response, resulting in a fixed ratio (FR) schedule. For
1993; Bernthal and Bankson, 1998; Hegde, 1998, 2001; instance, a child may receive reinforcer for every fifth
Pena-Brooks and Hegde, 2000). Behavioral techniques correct phoneme production (an FR5). Gradually
that apply to all speech discords—and indeed to most reducing the number of reinforcers with the use of pro-
disorders of communication—include positive reinforce- gressively larger ratios will help maintain a skill taught
ment and reinforcement schedules, negative reinforce- in clinical settings.
ment, instructions, demonstrations, modeling, shaping,
prompting, fading, corrective feedback to reduce unde- Antecedent Control of Target Behaviors. A standard
sirable responses, and techniques to promote generalized behavioral method is to carefully set the stage for a skill
productions and response maintenance. to be taught in treatment sessions. This technique,
A basic procedure in implementing behavioral inter- known as antecedent control, increases the likelihood
vention is establishing the baserates of target behaviors. of a target response by providing stimuli that evoke it.
Baserates, or baselines, are systematically measured Stimulus manipulations include a variety of procedures,
values of specified behaviors or skills in the absence of such as modeling, shaping, prompting, and fading.
planned intervention. Baserates are the natural rates of
response when nothing special (such as modeling or ex- Modeling. In modeling, the clinician produces the tar-
plicit positive reinforcement) is programmed. Baserates get response, which is then expected to be followed by
help establish a stable and reliable response rate against at least an attempt to produce the same response by
which the e¤ects of a planned intervention or an experi- the client. The clinician’s behavior is the model, which
mental treatment can be evaluated. The baserate of any the client attempts to imitate (response). In most cases,
parameter should be determined by at least three mea- modeling is preceded by instructions to the client on
Speech Disorders in Children: Behavioral Approaches to Remediation 193
how to produce a response, and demonstrations of target production. Eventually, the influence of such special
responses. Although instructions and demonstrations are stimuli as prompts is reduced by a technique called
part of behavioral treatment procedures, much formal fading.
research has focused on modeling as a special stimulus to
help establish a new target response. Fading. Fading is a technique used to gradually with-
A child’s initial attempt to imitate a target response draw a special stimulus, such as models or prompts,
modeled by the clinician may be more or less correct; while still maintaining the correct response rate. Abrupt
nonetheless, the clinician might wish to reinforce all withdrawal of a controlling stimulus will result in failure
attempts in the right direction. In gradual steps, the cli- to respond. In fading, a modeled stimulus or prompt
nician may then require responses that are more like the may be reduced in various ways. For instance, a model-
modeled response. To achieve this final result of an imi- ing such as ‘‘Say I see sun’’ (in training the production of
tated response that matches the modeled stimulus, shap- phoneme /s/) may be shortened by the clinician to ‘‘Say I
ing is often used. see . . .’’; or the vocal intensity of the prompter or mod-
eler may be reduced such that the modeling becomes
Shaping. Whereas straightforward positive reinforce- progressively softer and eventually inaudible, with only
ment is e¤ective in increasing the frequency of a low- articulatory movements (e.g., correct tongue position)
frequency response, shaping is necessary to create skills being shown.
that are absent. Shaping, also known as successive
approximations, is a procedure to teach new responses Corrective Feedback. Various forms of corrective feed-
in gradual steps. The entire procedure typically includes back may be provided to reduce the frequency of incor-
instructions, demonstrations, modeling, and positive re- rect responses. Verbal feedback such as ‘‘that is not
inforcement. A crucial aspect of shaping is specifying correct,’’ ‘‘that was bumpy speech’’ (to correct stuttering
the individual components of a complex response and in young children), ‘‘that was too fast,’’ and so forth is
teaching the components sequentially in a manner that part of all behavioral treatment programs. Additional
will result in the final target response. For instance, in corrective procedures include token loss for incorrect
teaching the correct production of /s/ to a child who has responses (tokens earned for correct responses and lost
an articulation disorder, the clinician may identify such for incorrect ones), and time-out, which includes a brief
simplified components of the response as raising the period of no interaction made contingent on incorrect
tongue tip to the alveolar ridge, creating a groove along responses. For instance, every stuttering may be fol-
the tongue tip, approximating the two dental arches, lowed by a signal to stop talking for 5 seconds.
blowing air through the tongue-tip groove, and so forth.
The child’s production of each component response is Generalization and Maintenance. Behavioral tech-
positively reinforced and practiced several times. Finally, niques to promote generalized production of speech
the components are put together to produce the approx- skills in natural environments and maintenance of those
imation of /s/. Subsequently, and in progressive steps, skills over time are important in all clinical work.
better approximations of the modeled sound production Teaching clients to self-monitor the production of their
are reinforced, resulting in an acceptable form of the newly acquired skills and training significant others to
target response. To further strengthen a newly learned prompt and reinforce those skills at home are among the
response, the clinician may use the prompting procedure. most e¤ective of the generalization and maintenance
techniques.
Prompting. The probability of a target response that —M. N. Hegde
has just emerged with assistance from the previously
described procedures may fluctuate from moment to References
moment. The child may appear unsure and the response
Bernthal, J. E., and Bankson, N. W. (1998). Articulation and
rate may be inconsistent. In such cases, prompting will phonological disorders (4th ed.). Boston: Allyn and Bacon.
help stabilize that response and increase its frequency. Hegde, M. N. (1998). Treatment procedures in communicative
Prompting is a special cue or a stimulus that will help disorders (3rd ed.). Austin, TX: Pro-Ed.
evoke a response from an unsure client. Such cues take Hegde, M. N. (2001). Hegde’s pocketguide to treatment in
various verbal and nonverbal forms. Examples of verbal speech-language pathology (2nd ed.). San Diego, CA: Sin-
prompts include such statements as ‘‘What do you say to gular Publishing Group.
this picture?’’ ‘‘The word starts with a /p/’’ (both prompt Pena-Brooks, A., and Hegde, M. N. (2000). Assessment and
a correct naming or articulatory response). Nonverbal treatment of articulation and phonological disorders in chil-
prompts include a variety of facial and hand gestures dren. Austin, TX: Pro-Ed.
that suggest a particular target response; the meaning of Onslow, M. (1996). Behavioral management of stuttering. San
Diego, CA: Singular Publishing Group.
some gestures may first have to be taught to the child. Skinner, B. F. (1953). Science and human behavior. New York:
For instance, a clinician might tell a child who stutters Macmillan.
that his or her speech should be slowed down when a Skinner, B. F. (1969). Contingencies of reinforcement: A theo-
particular hand gesture is made. In prompting the pro- retical analysis. New York: Appleton-Century-Crofts.
duction of a phone such as /p/, the clinician may press Skinner, B. F. (1974). About behaviorism. New York: Vintage
the two lips together, which may lead to the correct Books.
194 Part II: Speech
Further Readings Table 1. Conditions and Factors Present at Birth or Shortly

Thereafter with a High Probability of Resulting in Future
Axelrod, S., and Hall, R. V. (1999). Behavior modification (2nd Developmental Delay
ed.). Austin, TX: Pro-Ed.
Maag, J. W. (1999). Behavior management. San Diego, CA: Chromosomal abnormalities such as Down syndrome
Singular Publishing Group. Genetic or congenital disorders
Martin, G., and Pear, J. (1999). Behavior modification: What Severe sensory impairments, including hearing and vision
it is and how to do it (6th ed.). Upper Saddle River, NJ: Inborn errors of metabolism
Prentice-Hall. Disorders reflecting disturbance of the development of the
nervous system
Intracranial hemorrhage
Hyperbilirubinemia at levels exceeding the need for exchange
Speech Disorders in Children: transfusion
Birth-Related Risk Factors Major congenital anomalies
Congenital infections
Disorders secondary to exposure to toxic substances, including
The youngest clients who receive services from speech- fetal alcohol syndrome
language pathologists are neonates with medical needs. Low birth weight
This area of care came into being during the last several Respiratory distress
decades as the survival rate of infants with medical needs Lack of oxygen
improved and it became apparent that developmental Brain hemorrhage
delays would likely be prevalent among the survivors. Nutritional deprivation
Since 1970, for example, the survival rate of infants in
some low birth weight categories jumped from 30%
to 75%; among the survivors the occurrence of mental term stays in the hospital may limit opportunities for
retardation is 22%–24% (Bernbaum and Ho¤man- learning, including in the speech domain.
Williamson, 1991). Children with hearing impairment and those with
The prevalence of developmental delay among pre- Down syndrome are two relatively large populations
viously medically needy neonates resulted in federal and with birth-related conditions and factors that are likely
state laws that give these children legal rights to devel- to experience future speech disorders (see mental re-
opmental services (Kern, Delaney, and Taylor, 1996). tardation and speech in children). Two additional
These laws identify the physical and mental conditions relatively large populations of newborn children likely to
and the biological and environmental factors present experience future speech disorders are those born under-
at birth that are most likely to result in future devel- weight and those whose mother engaged in substance
opmental delay. The purpose in making this identifica- abuse during pregnancy.
tion is to permit an infant to receive intervention services In the United States, approximately 8.5% of infants
early in life, when brain development is most active and are born underweight (Guyer et al., 1995). The major
before the negative social consequences of having a birth weight categories are low birth weight, very low
developmental delay, including one in the speech do- birth weight, extremely low birth weight, and micro-
main, can occur (Bleile and Miller, 1994). premie. Birth weight categories as measured in grams
The most important federal legal foundation for and pounds are shown in Table 2. As the category of
developmental services for young children with medical micropremie suggests, many low birth weight children
needs is the Individuals with Disabilities Education Act are born prematurely. A typical pregnancy lasts 40
(IDEA). This law gives individual states the authority to weeks from first day of the last normal menstrual cycle;
determine which conditions and factors present at birth a preterm birth is defined as one occurring before the
place a child at su‰cient risk for future developmental completion of 37 weeks of gestation. The co-occurrence
delay that the child qualifies for education services. of low birth weight and prematurity varies by country;
Examples of conditions and factors that the act indicates in the United States, 70% of low birth weight babies are
place an infant at risk for future developmental delay are also born prematurely.
listed in Table 1. A single child might have several con- Approximately 36%–41% of women in the United
ditions and risk factors. For example, a child born with States abuse illicit drugs, alcohol, or nicotine sometime
Down syndrome might also experience respiratory dis- during pregnancy (Center on Addiction and Substance
tress as a consequence of the chromosomal abnormality, Abuse, 1996). Illicit drugs account for 11% of this
as well as an unrelated congenital infection.
Many populations of newborn children with medical
needs are at high risk for future speech disorders. In Table 2. Birth Weight Categories, in Grams and Pounds
part this is because developmental speech disorders are
Categories Grams Pounds
common among all children (Slater, 1992). However,
the medical condition or factor itself may contribute to Low birth weight <2,500 5.5
the child having a speech disorder, as occurs with chil- Very low birth weight <1,500 3.3
dren with cerebral palsy, a tracheotomy, or cleft palate Extremely low birth weight <1,000 2.2
(Bleile, 1993), and the combination of illness and long- Micropremies <800 1.76
Speech Disorders in Children: Birth-Related Risk Factors 195
substance abuse, and heavy use of alcohol or nicotine ing occupational and physical therapy, social services,
accounts for the other 25%–30%. Approximately three- and speech-language pathology. The role of the speech-
quarters of pregnant women who abuse one substance language pathologist includes assessing communication
also abuse other substances (Center on Addiction and development and implementing an early intervention
Substance Abuse, 1996). For example, a pregnant program to facilitate the child’s communication abilities.
woman who abuses cocaine might also drink heavily. Interacting with the child’s caregivers assumes increasing
When alcohol is an abused substance, more severely importance as medical issues resolve, allowing the family
a¤ected children are considered to have fetal alcohol to give greater attention to developmental concerns.
syndrome. The hallmarks of fetal alcohol syndrome are The vast majority of children born with medical needs
mental retardation and physical deformities (Streissguth, grow to possess the cognitive and physical capacity for
1997). Children with milder cognitive impairments and speech. For higher functioning children, the clinician’s
without physical deformities are considered to have fetal role includes providing the evaluation and treatment
alcohol e¤ect or alcohol-related neurodevelopmental services to facilitate speech and language development.
disorder. The speech disorders of many high-functioning children
Regardless of the specific cause of the disorder, resolve by the end of the preschool years or during the
speech-disordered clients with birth-related conditions early grade school years. Such children are at risk for
and factors receive services similar to those given other future reading problems and other learning di‰culties,
children. The clinician’s primary responsibility is to pro- and their progress in communication should continue to
vide evaluation and intervention services appropriate to be monitored even after the speech disorder has resolved.
the child’s developmental abilities. A di¤erence in care Speech may prove challenging for children with more
provision is that the child’s speech disorder is likely to extensive developmental problems. A general clinical
occur as part of a larger picture of medical problems and rule of thumb is that a child who will speak typically will
developmental delay. This may make it di‰cult to diag- do so by 5 years of age (Bleile, 1995). Children with
nose and treat the speech disorder, especially when the more limited speech potential may be taught to com-
child is younger and medical problems may predomi- municate through a combination of speech and non-
nate. In addition to having thorough training in typi- oral options. Lower functioning children might be
cal speech and language development, a speech-language taught to communicate through an alternative commu-
clinician working with these children should possess the nication system (see augmentative and alternative
following: communication approaches in children). Some useful
web sites for further information include www.asha.org;
Basic knowledge of medical concepts and terminology
www.autism.org; www.cdc.gov; www.intelehealth.com;
Ability to access and understand information about
www.mayohealth.org; www.med.harvard.edu; www
unfamiliar conditions and factors as need arises
.modimes.org; www.ncbi.nlm.nih.gov/PubMed/; www
Knowledge of safety procedures and health precau-
.ndss.org; and www.nih.gov.
tions
Ability to work well with teams that include the child’s —Ken Bleile and Angela Burda
caregivers and professionals
A neonate identified as at risk for future develop-
References
mental delay typically first receives developmental ser- Bernbaum, J., and Ho¤man-Williamson, M. (1991). Primary
vices in a hospital intensive care unit. Medical and care of the preterm infant. St. Louis: Mosby–Year Book.
developmental services are provided by a team of health Bleile, K. (1993). Children with long-term tracheostomies. In
care professionals. Often, a primary role of the speech- K. Bleile (Ed.), The care of children with long-term trache-
language clinician is to assess the oral mechanism to ostomies. San Diego, CA: Singular Publishing Group.
determine readiness to feed. Such evaluations are par- Bleile, K. (1995). Manual of articulation and phonological dis-
orders. San Diego, CA: Singular Publishing Group.
ticularly important for clients at risk for aspiration.
Bleile, K., and Miller, S. (1994). Toddlers with medical needs.
These include children with neurological and physical In J. Bernthal and N. Bankson (Eds.), Child phonology:
handicaps, as well as those born prematurely, whose Characteristics, assessment, and intervention with special
immature systems of neurological control often do not populations (pp. 81–109). New York: Thieme.
allow orally presented food to be managed safely. The Center on Addiction and Substance Abuse. (1996). Substance
speech-language pathologist may also counsel the child’s abuse and the American woman. New York: Columbia
caregivers and o¤er suggestions about ways to facilitate University.
communication development. Guyer, B., Strobino, D., Ventura, S., et al. (1995). Annual
An early intervention program is initiated shortly af- summary of vital statistics: 1994. Pediatrics, 96, 1029–1039.
ter the child is born and the risk factor has been identi- Kern, L., Delaney, B., and Taylor, B. (1996). Laws and issues
concerning education and related services. In L. Kurtz,
fied. The exception is a child born prematurely, whose P. Dowrick, S. Levy, and M. Batshaw (Eds.), Handbook
nervous system may not yet be able to manage environ- of developmental disabilities (pp. 218–228). Gaithersburg,
mental stimulation. Such a child typically receives mini- MD: Aspen.
mal stimulation until the time he or she would have been Slater, S. (1992). Portrait of the professions. ASHA, 34, 61–65.
born if the pregnancy had been full term. Early inter- Streissguth, A. (1997). Fetal alcohol syndrome: A guide for
vention typically includes a package of services, includ- families and communities. Baltimore: Paul H. Brookes.
196 Part II: Speech
Further Readings not self-evident. Locke (1983), for example, has argued
that children will be similar cross-linguistically until
Batshaw, M. (1997). Children with disabilities (4th ed.). Balti- some point after the acquisition of the first 50 words.
more: Paul H. Brookes.
Bernbaum, J., and Ho¤man-Williamson, M. (1991). Primary
Elsewhere, however, it has been argued that such cross-
care of the preterm infant. St. Louis: Mosby–Year Book. linguistic di¤erences are evident at the very earliest
Kurtz, L., Dowrick, P., Levy, S., and Batshaw, M. (Eds.). stages of phonological development (Ingram, 1989).
(1996). Handbook of developmental disabilities. Gaithers- Resolution of this issue will require extensive research
burg, MD: Aspen. into early typical phonological development in a range
Streissguth, A. (1997). Fetal alcohol syndrome: A guide for of languages. To date, the data support early cross-
families and communities. Baltimore: Paul H. Brookes. linguistic di¤erences.
The second, critical step is to determine whether chil-
dren with phonological delay look like their typically
Speech Disorders in Children: developing peers or like children with phonological im-
Cross-Linguistic Data pairment in other linguistic communities. The data on
this issue are even more sparse than for the first step, but
some preliminary data exist, and those data support the
Since the 1960s, the term articulation disorder has been linguistic account of phonological delay over the articu-
replaced in many circles by the term phonological dis- latory one.
order. This shift has been driven by the recognition that These two steps can be summarized as follows:
children with articulation disorders show general pat-
terns in their speech that are not easily identified by an Step 1: Verify that typically developing children vary
articulatory defect approach. Further, these patterns cross-linguistically.
appear similar to those used by younger, typically
Step 2: Determine whether children with a phonological
developing children. The notion of phonological versus
impairment look like their typically developing peers
articulatory impairment, however, has not been exam-
or like children with phonological impairments in
ined in depth through comparisons of typically develop-
other linguistic communities.
ing children and children with phonological impairment
across a range of languages. Such comparisons would Examination of a range of studies on early phono-
provide the most revealing evidence in support of one logical development shows that children in di¤erent
view over the other. If articulatory factors are behind linguistic environments converge in their acquisition to-
children’s phonological impairment, children with such ward a basic or core phonetic inventory of speech sounds
impairments should show patterns somewhat indepen- that is di¤erent for each language. This is demonstrated
dent of their linguistic environment. They should look here by examining the production of word-initial con-
more like one another than like their linguistically sonants in English, French, K’iche’, and Dutch.
matched peers. If there is a linguistic basis to phonolog- Below is an inventory of the English consonants typ-
ical impairment, such children should look more like ically used in the early stage of phonological acquisition
their typically developing linguistic peers than like chil- (Ingram, 1981). English-speaking children show early
dren with phonological delay in other linguistic envi- acquisition of three place features, a voicing contrast
ronments. The basic structure of this research line is among stops, and a series of voiceless consonants.
presented below, using English and Italian children as
examples (TD ¼ typically developing, PI ¼ phonologi- English
cally impaired): m n
b d g
Cross-Linguistic Predictions
p t k
Articulatory Deficit
f s h
English TD þ Italian TD ¼ di¤erent w
English PI þ Italian PI ¼ same
English TD þ English PI ¼ di¤erent French shows some similarities to English, but also
Italian TD þ Italian PI ¼ di¤erent two striking di¤erences, based on my analysis of selected
diary studies. French-speaking children tend to acquire
Linguistic Deficit velar consonants later, yet show an early use of /l/, a
English TD þ Italian TD ¼ di¤erent sound that appears later in English.
English PI þ Italian PI ¼ di¤erent
French
English TD þ English PI ¼ same
Italian TD þ Italian PI ¼ same m
b d
The pursuit of this line of research requires two steps.
First, it needs to be verified that there are indeed cross- p t
linguistic di¤erences in phonological acquisition between f s
typically developing children. The truth of this claim is l
Speech Disorders in Children: Cross-Linguistic Data 197
K’iche’ (formerly spelled Quiché) is a Mayan lan- Topbas (1992) provides data on Turkish for both
guage spoken in Guatemala. K’iche’ children (Pye, typically developing children and children with phono-
Ingram, and List, 1987) show an early /l/, as in French, logical impairment. The Turkish inventory of the typi-
and an a¤ricate, /tS/, which is one of the most frequent cally developing children is noteworthy for its lack of
early sounds acquired. The first fricative tends to be the early fricatives, despite a system of eight fricatives, both
velar /x/, despite the fact that the language has both /s/ voiced and voiceless, in the adult language. For English-
and /S/. speaking children, the lack of fricatives is often a sign of
phonological impairment, but this appears to be ex-
K’iche’
pected for typically developing Turkish-speaking chil-
m n dren. The data on phonological impairment are from a
p t tS " single child at age 6 years, 0 months. This child shows
x the same lack of fricatives and the early a¤ricate, just as
w l in the typically developing data.
Lastly, below is an intermediate stage in Dutch, based Turkish
on data in Beers (1994). The Dutch inventory shows Typically Developing Phonologically Impaired
early use of the velar fricative /x/, as in K’iche’, but also
m n m n
the full range of Dutch fricatives, which appear at more
or less the same time. b d b d
p t tS k p t tS
Dutch
j j
m n
p t k Lastly, Swedish data are available to pursue this issue
further (Magnussen, 1983; Nettelbladt, 1983). The data
f s x h
on typically developing Swedish children are based on a
w j case study of a child at age 2 years, 2 months. This child
Data such as these from English, French, K’iche’, lacked the velar stop and the voiced fricative /v/. The
and Dutch show the widely di¤erent ways that children voiced stops were also missing in a group of ten children
may acquire their early consonantal inventories. It has with phonological impairment; however, these children
been proposed that these di¤erences result from the did show early use of /v/, just as the Italian-speaking
varying roles these consonants play in the phonologies of children did.
the languages discussed (Ingram, 1989). The more fre- Swedish
quently a consonant is used in a wide range of words, the
Typically Developing Phonologically Impaired
more likely it is that it will be in the early inventory.
The question now is, what do the inventories of m n m n
children with phonological impairment look like when p t p t
comparisons like those above are done? The limited evi-
b d b d
dence to date indicates that they look like their same
language typically developing peers. Data from Italian, f u h f s h
Turkish, and Swedish suggest that this is the case. v j
The first language examined here is Italian, using the Thus, preliminary data from a range of languages
data reported in Bortolini, Ingram, and Dykstra (1993). support the phonological rather than the articulatory
Typically developing Italian children use the a¤ricate account of phonological impairment. This results in two
/tS/ and the fricative /v/, both later acquisitions for preliminary conclusions: (1) Typically developing chil-
English-speaking children. Italian-speaking children dren show early phonological inventories unique to their
with phonological impairment have an inventory that is linguistic environment. (2) Children with phonological
a subset of the one used by typically developing children. impairment show systems more similar to those of their
They do not have the a¤ricate but do show the early typically developing peers in their own linguistic enviro-
acquisition of /v/. The early use of /v/ can be traced ment than to those of children with phonological im-
back to the fact that the voiced labiodental fricative is a pairment in other language environments.
much more common sound in the vocabulary of Italian-
speaking children than it is in the vocabulary of their —David Ingram
English-speaking peers.
References
Italian
Typically Developing Phonologically Impaired Beers, M. (1994, May). Classification of phonological problems:
Comparison of segmental inventories. Paper presented at the
p t tS k p t k European Symposium on Child Language Disorders, Gar-
b d g b d deren, the Netherlands.
Bortolini, U., Ingram, D., and Dykstra, K. (1993, May). The
f s f s acquisition of the feature [voice] in normal and phono-
v v logically delayed Italian children. Paper presented at the
198 Part II: Speech
Symposium on Research in Child Language Disorders, are all members of the category stops. They are non-
University of Wisconsin, Madison. continuant because the airflow is discontinued in the oral
Ingram, D. (1981). Procedures for the phonological analysis of cavity during their production. Some stops also have the
children’s language. Baltimore, MD: University Park Press. feature voiceless because they are produced without
Ingram, D. (1989). First language acquisition: Method, descrip-
glottal vibration. Each speech sound can be categorized
tion, and explanation. Cambridge, UK: Cambridge Univer-
sity Press. in di¤erent ways according to such features. These fea-
Locke, J. (1983). Phonological acquisition and change. New tures are called distinctive because they di¤erentiate each
York: Academic Press. sound from all others in the language. The (implicit)
Magnusson, E. (1983). The phonology of language disordered knowledge of distinctive features is presumed to be one
children: Production, perception, and awareness. Travaux de organizational basis for phonological systems.
l’Institut de Linguistique de Lund, 17. Lund, Sweden: Distinctive features therapy (McReynolds and Eng-
Gleerup. mann, 1975) focuses on features that a child’s system
Nettelbladt, U. (1983). Developmental studies of dysphonology lacks. A child who produces no fricatives lacks the con-
in children. Travaux de l’Institut de Linguistique de Lund, tinuant feature, so distinctive features therapy would
19. Lund, Sweden: Gleerup.
focus on this feature. In theory, establishing a continuant
Pye, C., Ingram, D., and List, H. (1987). A comparison of ini-
tial consonant acquisition in English and Quiché. In K. E. in any place of articulation (e.g., [s]) would lead to the
Nelson and A. van Kleeck (Eds.), Children’s language (vol. child’s generalization of the feature to other, untrained
6). Hillsdale, NJ: Erlbaum. fricatives. Once the feature is included in the child’s
Topbas, S. (1992, August). A pilot study of phonological acquisi- feature inventory, it can be combined with other fea-
tion by Turkish children and its implications for phonological tures (e.g., voice, palatal) to yield the remaining English
disorders. Paper presented at the 6th International Confer- fricatives.
erence on Turkish Linguistics, Anadolu University, Turkey. The function of distinctive features is to provide
communicative contrast. The more features we use, the
more lexical distinctions we make and the more mean-
ings we express. A child who produces several target
phonemes identically (e.g., all fricatives as [f ]) will have
Speech Disorders in Children: too many homonyms. Therapy will focus on using more
Descriptive Linguistic Approaches distinctive features, for example, producing di¤erent
target phonemes distinctively. Williams (2000a, 2000b)
recommends simultaneously contrasting all target
Linguistic approaches to treating children’s speech dis- sounds with the overgeneralized phoneme. In some dis-
orders are motivated by the fact that a phonology is a ordered phonologies, features are noncontrastive be-
communication system (Stoel-Gammon and Dunn, cause they are used in limited positions. For a child who
1985; Ingram, 1990; Grunwell, 1997). Within this sys- uses voiced stops only in initial position and voiceless
tem, patterns are detectable within and among various stops only in final position, for example, ‘‘bad,’’ ‘‘bat,’’
subcomponents: (1) syllable and word shapes (phono- ‘‘pad,’’ and ‘‘pat’’ are homonyms because they are all
tactic repertoire), (2) speech sounds (phonetic reper- pronounced as [b0t]. Conversely, some children with
toire), (3) the manner in which sounds contrast with each disordered phonology may maintain a contrast, but
other (phonemic repertoire), and (4) the behaviors of without the expected feature. A child who does not voice
di¤erent sounds in di¤erent contexts (phonological pro- final stops may lengthen the preceding vowel to indicate
cesses). Each of these subcomponents may influence the voicing; ‘‘bat’’ as [b0t] and ‘‘bad’’ as [b0:t]. Such a child
others; each may interfere with successful communica- has phonological ‘‘knowledge’’ of the contrast and may
tion. Therefore, individual sound or structure errors are independently develop voicing skills. Therefore, Elbert
treated within the context of the child’s whole phono- and Gierut (1986) suggest that features of which children
logical system rather than one by one; remediation have little knowledge are a higher priority for interven-
begins at the level of communicative function—the tion. In other studies (e.g., Rvachew and Nowak, 2001),
word. however, subjects have made more progress when most-
The assumption underlying this type of treatment is knowledge features were addressed first.
that the child’s phonological system—his or her (sub- Often, distinctive features are remediated with an
conscious) mental organization of the sounds of the emphasis on contrast, through minimal pair therapy.
language—is not developing in the appropriate manner Treatment focuses on words di¤ering by one distinctive
for the child’s language or at a rate appropriate for the feature. For example, the continuant feature could be
child’s age. The goal is for the child to adjust his or her taught by contrasting ‘‘tap’’ versus ‘‘sap’’ and ‘‘met’’
phonological system in the needed direction. Initiating a versus ‘‘mess.’’ Typically the sound that the child sub-
change in one part of the phonological system is ex- stitutes for the target sound (e.g., [t] for /s/) is compared
pected to have a more general impact on the whole to the target sound ([s]). Therapy may begin with dis-
system. crimination activities; the child indicates the picture that
The sounds of a language are organizable into various corresponds to the word produced by the clinician (e.g.,
categories according to their articulatory or acoustic a hammer for ‘‘tap’’ versus an oozing tree for ‘‘sap’’).
features. The consonants /p, t, k, b, d, g/, for example, This highlights the confusion that may result if the
Speech Disorders in Children: Descriptive Linguistic Approaches 199
wrong sound is used. Communication-oriented produc- fricatives produced as stops, in stopping; liquids pro-
tion activities are designed to encourage the child to duced as glides, in gliding).
produce the feature that had been missing from his Phonotactic processes: sounds or syllables are omitted,
system (e.g., to say ‘‘mess’’ rather than ‘‘met’’). (See added, or moved. The process changes the shape of the
phonological awareness intervention for children word or syllable. As examples, a CVC word becomes
with expressive phonological impairments for a dis- CV; a CCVC word becomes CVC, CVCVC, or CVCC;
cussion of approaches in which the contrastive role of ‘‘smoke’’ becomes [moks].
phonological features and structures is even more Assimilation processes: two sounds or two syllables
explicitly addressed.) become more alike. For example, a child who does not
Gierut (1990) has tested the use of maximal pair ther- typically front velar consonants in words such as ‘‘go’’
apy, in which the contrasting sounds di¤er on many may nonetheless say [dOd] for ‘‘dog.’’ The final velar
features (e.g., [s] versus [m]). She has found that children consonant becomes alveolar in accord with the initial
may be able to focus better on the missing feature (e.g., consonant. Similarly, a two-syllable word such as
continuant) when it is not contrasted with the substitut- ‘‘popcorn’’ may be reduplicated as [kOkO]; the first syl-
ing feature (e.g., noncontinuant) than when it is. lable changes to match the second.
Markedness is another phonological concept with
As in distinctive feature therapy, in phonological
implications for remediation of sounds. Markedness re-
process therapy classes of sounds or structures that pat-
flects ease of production and perception: [y] is marked,
tern together are targeted together. Again, either the
due to low perceptual salience; [b] is unmarked because
entire class can be directly addressed in therapy or some
it is easy to produce and to perceive. Children typically
acquire the least marked sound classes (e.g., stops) and representative members of the class may be selected for
treatment, in the expectation that treatment e¤ects will
structures (e.g., open CV syllables) of their languages
generalize to the entire class. The goal is to reduce the
first (Dinnsen et al., 1990). It is unusual for a child to
child’s use of that process, with resultant changes in her
master a more marked sound or structure before a less
phonological system. For example, if the child’s phono-
marked one. Gierut’s (1998) research suggests that tar-
logical system is expanded to include a few final con-
geting a more marked structure or sound in therapy
sonants, it is expected that she or he will begin to
facilitates the acquisition of the less marked one ‘‘for
produce a variety of final consonants, not just those that
free.’’ However, other researchers (e.g., Rvachew and
were targeted in therapy.
Nowak, 2001) report more success addressing less
Some therapists use traditional production activities
marked sounds first.
Therapy based on nonlinear phonology stresses the (beginning at the word level) to decrease a child’s use
of a phonological process; others use a minimal pair
importance of syllable and word structures as well as
approach, comparing the targeted class (e.g., velars) with
segments (Bernhardt, 1994). Minimal pair therapy is
often used to highlight the importance of structure (e.g., the substituting class (e.g., alveolars). The cycles ap-
proach (Hodson and Paden, 1991) to process therapy
‘‘go’’ versus ‘‘goat,’’ ‘‘Kate’’ versus ‘‘skate,’’ ‘‘monkey’’
has some unique features. First, each session includes a
versus ‘‘monk’’). The goal of this therapy is to expand
period of auditory bombardment, during which the child
the child’s phonotactic repertoire. For example, a child
listens (passively) to a list of words that contain the tar-
who previously omitted final consonants may or may
geted sound class or structure. Second, each pattern is
not produce the correct final consonant in a given word,
the focus for a predetermined length of time, regardless
but will produce some final consonant. Structural and
of progress. Then, treatment moves on to another pat-
segmental deficits often interact in such a way that a
tern. Hodson and Paden argue that cycling is more sim-
child can produce a sound in certain positions but not
ilar to the phonological development of children without
others (Edwards, 1996).
The approaches described above focus primarily on phonological disorders.
In summary, the goal of linguistically based ap-
what’s missing from the child’s phonological system.
proaches to phonological therapy is to make as broad an
Another set of approaches focuses on what’s happening
instead of the target production. For example, a child impact as possible on the child’s phonological system,
making strategic choices of treatment goals that will
whose phonological system lacks the /y, D, s, z/ fricative
trigger changes in untreated as well as treated sounds or
phonemes may substitute stops (stopping), substitute la-
structures.
bial fricatives ([f, v]; fronting), substitute palatal frica-
tives ([S, Z ]; backing or palatalization), or omit /y, D, s, —Shelley L. Velleman
z/ in various word positions (initial consonant deletion,
final consonant deletion, consonant cluster reduction). References
These patterns are referred to as phonological processes
Bernhardt, B. (1994). Phonological intervention techniques for
in speech-language pathology. syllable and word structure development. Clinics in Com-
Phonological process therapy addresses three types of munication Disorders, 4, 54–65.
error patterns: Dinnsen, D. A., Chin, S. B., Elbert, M., and Powell, T. (1990).
Some constraints on functionally disordered phonologies:
Substitution processes: sounds with a certain feature Phonetic inventories and phonotactics. Journal of Speech
are substituted by sounds with a di¤erent feature (e.g., and Hearing Research, 33, 28–37.
200 Part II: Speech
Edwards, M. L. (1996). Word position e¤ects in the production Gierut, J. (1992). The conditions and course of clinically
of fricatives. In B. Bernhardt, J. Gilbert, and D. Ingram induced phonological change. Journal of Speech and Hear-
(Eds.), UBC International Conference on Phonological Ac- ing Research, 35, 1049–1063.
quisition (pp. 149–158). Vancouver, BC: Cascadilla Press. Hodson, B. W., and Edwards, M. L. (1997). Perspectives in
Elbert, M., and Gierut, J. (1986). Handbook of clinical phonol- applied phonology. Gaithersburg, MD: Aspen.
ogy: Approaches to assessment and treatment. San Diego, McLeod, S., van Doorn, J., and Reed, V. A. (1997). Realiza-
CA: College-Hill Press. tions of consonant clusters by children with phonological
Gierut, J. (1990). Di¤erential learning of phonological opposi- impairments. Clinical Linguistics and Phonetics, 11, 85–113.
tions. Journal of Speech and Hearing Research, 33, 540– Pena-Brooks, A., and Hegde, M. N. (2000). Assessment and
549. treatment of articulation and phonological disorders in chil-
Gierut, J. A. (1998). Treatment e‰cacy for functional phono- dren. Austin, TX: Pro-Ed.
logical disorders in children. Journal of Speech, Language Schwartz, R. G., Leonard, L. B., Froem-Loeb, D., and Swan-
and Hearing Research, 41, S85–S100. son, L. A. (1987). Attempted sounds are sometimes not: An
Grunwell, P. (1997). Developmental phonological disability: expanded view of phonological selection and avoidance.
Order in disorder. In B. W. Hodson and M. L. Edwards Journal of Child Language, 14, 411–418.
(Eds.), Perspectives in applied phonology (pp. 61–103). Stackhouse, J., and Wells, B. (1997). Children’s speech and
Gaithersburg, MD: Aspen. literacy di‰culties: A psycholinguistic framework. London:
Hodson, B. W., and Paden, E. P. (1991). Targeting intelligible Thomson International.
speech: A phonological approach to remediation (2nd ed.). Stoel-Gammon, C., and Herrington, P. (1990). Vowel systems
Austin, TX: Pro-Ed. of normally developing and phonologically disordered chil-
Ingram, D. (1990). Phonological disability in children (2nd ed.). dren. Clinical Linguistics and Phonetics, 4, 145–160.
San Diego, CA: Singular Publishing Group. Stoel-Gammon, C., and Stone, J. R. (1991). Assessing phonol-
McReynolds, L. V., and Engmann, D. (1975). Distinctive fea- ogy in young children. Clinics in Communication Disorders,
ture analysis of misarticulations. Baltimore: University Park 1, 25–39.
Press. Velleman, S. L. (1998). Making phonology functional: What do
Rvachew, S., and Nowak, M. (2001). The e¤ect of target- I do first? Boston: Butterworth-Heinemann.
selection strategy on phonological learning. Journal of Velleman, S. L. (Ed.). (2002). Updates in phonological inter-
Speech Language and Hearing Research, 44, 610–623. vention [Special issue]. Seminars in Speech and Language,
Stoel-Gammon, C., and Dunn, C. (1985). Normal and dis- 23(1).
ordered phonology in children. Austin, TX: Pro-Ed. Vihman, M. M. (1993). Variable paths to early word produc-
Williams, A. L. (2000a). Multiple oppositions: Theoretical tion. Journal of Phonetics, 21, 61–82.
foundations for an alternative contrastive intervention Yavas, M. (Ed.). (1994). First and second language phonology.
framework. American Journal of Speech-Language Pathol- San Diego, CA: Singular Publishing Group.
ogy, 9, 282–288. Yavas, M. (Ed.). (1998). Phonology: Development and dis-
Williams, A. L. (2000b). Multiple oppositions: Case studies of orders. San Diego, CA: Singular Publishing Group.
variables in phonological intervention. American Journal of
Speech-Language Pathology, 9, 289–299.
Speech Disorders in Children: Motor
Further Readings Speech Disorders of Known Origin
Ball, M. J., and Kent, R. D. (Eds.). (1997). The new phonolo-
gies: Developments in clinical linguistics. San Diego, CA:
Singular Publishing Group. By definition, children with a communication diagnosis
Barlow, J. A. (1996). Variability and phonological knowledge. of motor speech disorder have brain dysgenesis or have
In T. W. Powell (Ed.), Pathologies of speech and language: sustained pre-, peri-, or postnatal damage or disease to
Contributions of clinical phonetics and linguistics (pp. the central or peripheral nervous system or to muscle
125–133). New Orleans, LA: International Clinical Phonet- tissue that impairs control of speech production pro-
ics Association. cesses and subsequent actions of the muscle groups used
Bernhardt, B. (1992). Developmental implications of nonlinear
to speak (respiratory, laryngeal, velopharyngeal, jaw,
phonological theory. Clinical Linguistics and Phonetics, 6,
259–282. lip, and tongue) (Hodge and Wellman, 1999). This im-
Bernhardt, B., and Stemberger, J. P. (1998). Handbook of pho- pairment may manifest with one or more of the follow-
nological development: From the perspective of constraint- ing: weakness, tone alterations (hypertonia, hypotonia),
based nonlinear phonology. San Diego, CA: Academic Press. reduced endurance and coordination, and involuntary
Bernthal, J. E., and Bankson, N. W. (1998). Articulation and movements of a¤ected speech muscle groups (dysarth-
phonological disorders (4th ed.). Needham Heights, MA: ria), or as di‰culty in positioning muscle groups and
Allyn and Bacon. sequencing their actions to produce speech that cannot
Bleile, K. M. (1995). Manual of articulation and phonological be explained by muscle weakness and tone abnormalities
disorders: Infancy through adulthood. San Diego, CA: Sin- (apraxia of speech). Disturbances a¤ecting higher mental
gular Publishing Group.
processes of speech motor planning and programming
Chin, S. B., and Dinnsen, D. A. (1992). Consonant clusters in
disordered speech: Constraints and correspondence pat- underlie the motor speech diagnosis, apraxia of speech.
terns. Journal of Child Language, 19, 259–286. To date, apraxia of speech of known origin in childhood
Ferguson, C. A., Menn, L., and Stoel-Gammon, C. (Eds.). is rare. Conditions in which it may appear are seizure
(1992). Phonological development: Models, research, impli- disorders (e.g., Landau-Kle¤ner syndrome; Love and
cations. Timonium, MD: York Press. Webb, 2001), focal ischemic events (Murdoch, Ozanne,
Speech Disorders in Children: Motor Speech Disorders of Known Origin 201
and Cross, 1990), and traumatic brain injury. In these speech muscles to produce voice or recognizable speech
cases the speech apraxia is typically accompanied by ex- sounds. These children’s psychosocial development is
pressive and receptive language deficits. Oropharyngeal also at risk because of limitations imposed by the speech
apraxia and mutism have been reported following pos- disorder on social interactions, which in turn may limit
terior fossa tumor resection in children (Dailey and their academic progress because of fewer opportunities
McKhann, 1995). Depending on the severity and dura- to gain experience using language (Hodge and Wellman,
tion of the neurological insult, signs of childhood- 1999).
acquired apraxia of speech remit and may disappear. When the cause of a childhood motor speech disorder
Disturbances a¤ecting the execution of speech actions is known, it is typically because a physician specializing
are diagnosed as dysarthrias, with subtypes identified by in pediatric neurology has diagnosed it. If this neuro-
site of lesion, accompanying pathophysiological signs, logical diagnosis (e.g., cerebral palsy, Möbius syndrome,
and e¤ects on speech production. Dysarthrias are the muscular dystrophy) is made during the prelinguistic
more common type of childhood motor speech disorder. period, there is some expectation that if neuromuscular
A known neurological condition a¤ecting neuromuscu- dysfunction was observed in earlier nonspeech activities
lar function, including that of muscles used in speech, is (e.g., sucking, chewing, swallowing, control of saliva,
a key factor leading to the diagnosis of dysarthria. A facial expressions) of muscle groups that are involved
useful taxonomy of subtypes of childhood dysarthrias in speech production, speech development will also be
(with associated sites of lesion) was described by Love delayed and disrupted (Love, 2000). Congenital supra-
(2000) and includes spastic (upper motor neuron), dys- bulbar paresis, or Worster-Drought syndrome, which
kinetic (basal ganglia control circuit), ataxic (cerebellar has been classified by Clark et al. (2000) as a mild spastic
control circuit), flaccid (lower motor neuron and asso- quadriplegic type of cerebral palsy resulting from dam-
ciated muscle fibers), and mixed (two or more sites in age to the perisylvian areas of the cortex, often is not
the previous categories); the mixed type is the most diagnosed until the child is older, if at all. Early diagno-
common. sis of this condition is important to speech-language
Limited research has been published on the nature pathologists (Crary, 1993; Clark et al., 2000) because it
of neuromuscular impairment in the various child- is predominantly characterized by persisting signs of
hood dysarthrias and how this correlates with perceived abnormal neuromuscular dysfunction in oropharyngeal
speech abnormalities (Workinger and Kent, 1991). muscles during infants’ feeding and swallowing and later
Solomon and Charron (1998) reviewed the literature on control of speech movements. Closer examination of
speech breathing in children with cerebral palsy. Love children with congenital suprabulbar paresis also reveals
(2000) summarized the literature on respiratory, laryn- evidence of persisting neuromuscular abnormalities af-
geal, velopharyngeal, tongue, jaw, and lip impairment in fecting gross and fine motor development and learning
children with dysarthria. It is di‰cult to generalize from di‰culties. These children need coordinated, multidisci-
the literature to individual cases because of the relatively plinary services like those a¤orded children with other
small numbers of children studied and the range of indi- subtypes of cerebral palsy.
vidual di¤erences in children with neurogenic conditions, In other cases of childhood motor speech disorders,
even when they share the same neurological diagnosis. no abnormal neurological signs are observed in the
Furthermore, it cannot be assumed that because a child’s early development, and signs of the motor speech
neurological diagnosis has implicated a certain site of disorder may be the first or only indication of neurolog-
lesion (e.g., cranial nerve or muscle group), other muscle ical abnormality (Arvedson and Simon, 1998). In some
groups are not also impaired. Murdoch, Johnson, and of these cases, subsequent neurological investigation
Theodoros (1997) described the case of a child with with electromyography, electroencephalography, neural
Möbius syndrome (commonly held to result from dam- imaging procedures such as magnetic resonance imag-
age to cranial nerves VI and VII) and also identified ing, and metabolic testing identifies a neurological con-
impaired function at the level of the velopharyngeal, dition or lesion as the cause of the speech disturbance
laryngeal, and respiratory subsystems using perceptual (e.g., seizures and brain dysmorphology in the bilateral
and instrumental evaluation. However, a reduction in perisylvian region, infection, tumor, progressive con-
maximal range of performance of speech muscle groups, ditions such as facioscapulohumeral muscular dystro-
persistent dependencies between muscle groups (e.g., lip phy) (Hodge and Wellman, 1999). In rare cases, a motor
with jaw, jaw with tongue, tongue body with tongue tip), speech disorder may result from treatment for another
and reductions in speed, precision, and consistency of medical condition, such as surgery for cerebellar tumors
speech movements are common themes in the literature or drug treatment for a debilitating movement disorder
on the nature of impairment in childhood motor speech such as Tourette’s syndrome. In still other cases neuro-
disorders. logical investigation reveals no identifiable cause.
The e¤ects of most childhood motor speech disorders Many conditions that result in childhood motor
are to reduce the rate and quality of a¤ected children’s speech disorders (e.g., cerebral palsy, traumatic brain
speech development, frequency of speech use, speech in- injury, chromosomal abnormalities) also a¤ect other
telligibility, speaking rate, and overall speech accept- areas of brain function. Therefore, children may show a
ability. The speech disorder can range from mild to so mixed dysarthria or characteristics of both dysarthria
severe that the child never gains su‰cient control over and apraxia of speech, and they have a high probability
202 Part II: Speech
of comorbidities a¤ecting higher cognitive functions cooperation allows (Murdoch, Johnson, and Theodoros,
of language, thought, attention, and memory, sensory- 1997). Procedures for assessing various aspects of speech
perceptual processes and control of other motor systems function (e.g., phonological system and phonetic ade-
(e.g., eyes, limbs, trunk, head). quacy, intelligibility, prosody) and speech ability that
Multidisciplinary assessment by members of a pedi- can be repeated across time to index change are de-
atric rehabilitation team is accepted clinical practice to scribed in Hodge and Wellman (1999) and Yorkston et
determine the presence and severity of comorbid con- al. (1999). In addition to perceptual measures, acoustic
ditions that may negatively a¤ect the child’s develop- measures such as second formant onset and extent and
ment. Specific to the motor speech disorder, assessment vowel area have been shown to be sensitive to both ef-
goals may include one or more of the following: estab- fective and ine¤ective compensatory articulation strat-
lishing a di¤erential diagnosis; identifying the nature and egies used by children with motor speech disorders (e.g.,
severity of impaired movement control in each of the Nelson and Hodge, 2000).
muscle groups used to produce speech; describing the When making decisions about intervention, the clini-
nature and extent of limitations imposed by the im- cian also needs to assess the child’s feeding behavior,
pairment on the child’s speech function in terms of cognitive and receptive and expressive language skills,
articulatory adequacy, prosody and voice, and speech hearing, psychosocial status and motivation to speak
intelligibility, quality, and rate; determining the child’s and communicate, gross and fine motor skills, general
ability to use speech, together with other modes, to health and stamina, and the child’s family situation,
communicate with others in various contexts of daily including family members’ goals and perspectives. A
life; and making decisions about appropriate short- and multidisciplinary team approach to assessment and in-
long-term management (Hodge and Wellman, 1999; tervention planning, with ongoing involvement of the
Yorkston et al., 1999; Love, 2000). At young ages or in family in selecting, coordinating, and evaluating treat-
severe cases, making a di¤erential diagnosis can be di‰- ment service options, is critical to the successful man-
cult if the child has insu‰cient speech behaviors to ana- agement of children with motor speech disorders
lyze and lacks the attentional, memory, and cognitive (Mitchell and Mahoney, 1995).
abilities to execute tasks that are classically used to dif- At present, most childhood motor speech disorders
ferentiate dysarthria from praxis disturbances. Hodge are considered to be chronic because they are the result
(1991) summarized strategies for assessing speech motor of brain damage or dysgenesis, for which there is no
function in children 0–3 years old. Love and Webb cure. It is expected that the neurological diagnosis asso-
(2001) reviewed primitive and oropharyngeal reflexes ciated with the motor speech disorder will be chronic
that may signal abnormal motor development at young and may a¤ect the child’s academic, social, and voca-
ages. Hayden and Square (1999) developed a stan- tional future. However, appropriate and su‰cient
dardized, normed protocol to aid in the systematic as- treatment can significantly improve these children’s
sessment of neuromotor integrity of the motor speech communication e¤ectiveness (Hodge and Wellman,
system at rest and when engaged in vegetative and voli- 1999; Love, 2000). The overall goal of treatment is to
tional nonspeech and speech tasks for children ages 3–12 help these children communicate in the most successful
years. Their protocol was designed to identify and dif- and independent manner possible and includes helping
ferentially diagnose childhood motor speech disorders them become desirable communication partners. It is
and is built on their seven-stage hierarchical model of unrealistic to expect that these children will be provided
speech motor development and control. Thoonen et al. with or benefit from continuous treatment from infancy
(1999) described the use of several maximal performance to adulthood. Families have identified the preschool and
tasks (diadochokinetic rates for repeated mono- and tri- early school years, and school transitions (i.e., change in
syllables, sustained vowel and fricatives) in a decision schools due to family relocation, elementary to junior
tree model to identify and diagnose motor speech dis- high, junior to senior high, and then to college), as times
orders; they normed their model on children age 6 years when the knowledge and support of speech-language
and older. pathologists is particularly needed. Children with a con-
Assessment of impairment should include evaluation genital or early-onset condition that results in a motor
of the integrity of structural as well as functional aspects speech disorder must learn how to produce the phono-
of the speech mechanism, because abnormal resting logical system of their language with reduced control of
postures and actions of oropharyngeal muscles can lead the muscle groups used to produce and shape the sound
to abnormalities in the dental arches, poor control of contrasts that signal meaning to their listeners. Explicit,
oral secretions, and an increased risk for middle ear goal-directed opportunities for extensive practice in pro-
infections. The use of detailed, comprehensive protocols ducing and combining speech sounds in meaningful
to guide assessment of speech mechanism impairment utterances are typically required for these children to
and interpretation of findings is recommended (e.g., attain their potential for learning their phonological
Dworkin and Culatta, 1995; St. Louis and Ruscello, system, its phonetic realization, and making their
2000). Instrumental procedures that assess the function speech intelligible. In addition to speech development,
of the various speech subsystems (respiratory, laryngeal, the child’s language, cognitive, and social development is
velopharyngeal, and oral articulatory) also help to de- also at risk because of the important role that speech
termine the nature and severity of impairment if child plays in development. Children who su¤er neurological
Speech Disorders in Children: Motor Speech Disorders of Known Origin 203
insult after the primary period of speech development creasing speech intelligibility and quality. This includes
has occurred (e.g., after 3–4 years of age) are faced with working with other members of pediatric rehabilitation
the task of relearning a system that has been acquired teams to optimize the child’s seating and positioning
with normal speech motor control, so they have internal for speech production (Solomon and Charron, 1998;
models of their phonological-phonetic system in place. Love, 2000).
For these children, the focus is on relearning these, and Develop the child’s phonological and phonetic reper-
then monitoring and providing support as needed for toire with attention to level of development as well as
acquisition of new spoken language skills that had not the specific profile of muscle group impairment; also,
been acquired at the time of the neurological insult. develop the child’s phonological awareness and pre-
Treatment planning for all children with motor literacy and literacy skills.
speech disorders should include a team approach that Improve vocal loudness and quality through behav-
promotes active family involvement in making decisions ioral training.
and implementing treatment, attention to principles of Increase speech intelligibility and naturalness through
motor learning at a level that is developmentally appro- prosodic training.
priate to the child (Hodge and Wellman, 1999), consid- Use behavioral training to maximize the e¤ects of drug
eration of a variety of service delivery forms, and a treatments, prosthetic compensation (e.g., palatal lift),
holistic view of the child with a motor speech problem or surgery (e.g., pharyngoplasty).
(Mitchell and Mahoney, 1995). Training tasks need to Identify and promote the use of e¤ective speech pro-
be goal-directed and should actively engage and involve duction compensatory behaviors.
the child as a problem solver. Because learning is
context-specific, training activities should simulate real- Communication E¤ectiveness
Teach the e¤ective use of interaction enhancement
world tasks as much as possible and be enjoyed by the
child. Training goals should build on previously learned strategies.
Model and promote the use of e¤ective conversational
skills and behaviors. The child must have multiple
opportunities to practice attaining each goal, and should repair strategies and speech production self-monitoring
have knowledge of results. skills.
Teach e¤ective cognitive strategies so the child can use
A combination of treatment approaches that address
multiple levels of the communication disorder (impair- word choice and syntactic structure to maximize lis-
ment, speech functional impairment, and ability to teners’ comprehension.
Promote maintenance of speech production skills that
communicate in various contexts) is typically used in
management programs for children with motor speech have been established and self-monitoring of commu-
disorders. The particular approaches change with the nication skills.
Implement strategies to increase the child’s self-esteem
child’s and family’s needs and as the child’s abilities
change. Possible approaches include the following and self-confidence in initiating and participating in
(Hodge and Wellman, 1999; Yorkston et al., 1999; communication interactions.
Love, 2000): See also developmental apraxia of speech; dys-
General arthrias: characteristics and classification; motor
Educate family members, other caregivers, and peers
speech involvement in children.
about the nature of the child’s speech disorder and —Megan M. Hodge
ways to communicate e¤ectively with the child.
Augment speech with developmentally appropriate al-
References
ternative communication modes.
Provide receptive and expressive language treatment Arvedson, J., and Simon, D. (1998). Acquired neurologic defi-
cits in young children: A diagnostic journey with Dora.
(both spoken and written) as appropriate, and inte-
Seminars in Speech and Language, 19, 71–80.
grate this with speech training activities when possible. Clark, M., Carr, L., Reilly, S., and Neville, B. (2000). Worster-
Address related issues as necessary, including manage-
Drought syndrome, a mild tetraplegic perisylvian cerebral
ment of any interfering behaviors (e.g., attention, lack palsy: Review of 47 cases. Brain, 123, 2160–2170.
of motivation), control of oral secretions and swallow- Crary, M. (1993). Developmental motor speech disorders. San
ing, oral-dental status, and sensory (auditory-visual) Diego, CA: Singular Publishing Group.
status. Dailey, A., and McKhann, G. (1995). The pathophysiology of
oral pharyngeal apraxia and mutism following posterior
Speech-Specific fossa tumor resection in children. Journal of Neurosurgery,
Increase the child’s physiological support for speech by 83, 467–475.
Dworkin, J., and Culatta, R. (1995). Dworkin-Culatta Oral
increasing spatiotemporal control and coordination
Mechanism Exam—Treatment (D-COME-T). Nicholas-
of speech muscle groups (respiratory, laryngeal, velo- ville, KY: Edgewood Press.
pharyngeal, lips, tongue, jaw). The objective is to in- Hayden, D., and Square, P. (1999). Verbal motor production
crease movement control for speech production, and assessment for children. Toronto: Psychological Corporation.
the selection, implementation, and evaluation of these Hodge, M. (1991). Assessing early speech motor function.
techniques must be made relative to their e¤ect on in- Clinics in Communication Disorders, 1, 69–85.
204 Part II: Speech
Hodge, M., and Wellman, L. (1999). Clinical management of with progressive dysarthria. Canadian Acoustics, 27, 84–
children with dysarthria. In A. Caruso and E. Stand (Eds.), 85.
Clinical management of children with motor speech disorders Horton, S., Murdoch, B., Theodoros, D., and Thompson, E.
(pp. 209–280). New York: Thieme. (1997). Motor speech impairment in a case of childhood
Love, R. J. (2000). Childhood motor speech disability (2nd ed.). basilar artery stroke: Treatment directions derived from
Toronto: Allyn and Bacon. physiological and perceptual assessment. Pediatric Rehabil-
Love, R. J., and Webb, W. G. (2001). Neurology for the speech- itation, 1, 163–177.
language pathologist (4th ed.). Boston: Butterworth- Kent, R. (1997). The perceptual sensorimotor examination for
Heinemann. motor speech disorders. In M. R. McNeil (Ed.), Clinical
Mitchell, P., and Mahoney, G. (1995). Team management for management of sensorimotor speech disorders (pp. 27–48).
young children with motor speech disorders. Seminars in New York: Thieme.
Speech and Language, 16, 159–172. Law, M. (1993). Perspectives on understanding and changing
Murdoch, B., Ozanne, A., and Cross, J. (1990). Acquired the environments of children with disabilities. Physical and
childhood motor speech disorders: Dysarthria and dys- Occupational Therapy in Pediatrics, 13, 1–17.
praxia. In B. Murdoch (Ed.), Acquired neurological speech/ Mitchell, P. (1995). A dynamic-interactive developmental view
language disorders in childhood (pp. 308–342). London: of early speech and language production: Application to
Taylor and Francis. clinical practice in motor speech disorders. Seminars in
Murdoch, B. E., Johnson, B. M., and Theodoros, D. G. (1997). Speech and Language, 16, 100–109.
Physiological and perceptual features of dysarthria in Moe- Murdoch, B., and Hudson-Tennant, L. (1994). Speech dis-
bius syndrome: Directions for treatment. Pediatric Rehabil- orders in children treated for posterior fossa tumors: Ataxic
itation, 1, 83–97. and developmental features. European Journal of Disorders
Nelson, M., and Hodge, M. (2000). E¤ects of facial paralysis of Communication, 29, 379–397.
and audiovisual information on stop place identification. Redmond, S., and Johnston, S. (2001). Evaluating the mor-
Journal of Speech, Language, and Hearing Research, 43, phological competence of children with severe speech and
158–171. physical impairments. Journal of Speech, Language, and
Solomon, N., and Charron, S. (1998). Speech breathing in Hearing Research, 44, 1362–1375.
able-bodied children and children with cerebral palsy: A Thompson, C. (1988). Articulation disorders in children. In N.
review of the literature and implications for clinical inter- Lass, L. McReynolds, J. Northern, and D. Yoder (Eds.),
vention. American Journal of Speech-Language Pathology, Handbook of speech pathology and audiology (pp. 548–590).
7, 61–78. Toronto: B.C. Decker.
St. Louis, K. O., and Ruscello, D. M. (2000). Oral Van Mourik, M., Catsman-Berrevoets, C., van Dongen, H.,
Speech Mechanism Screening Examination—Third Edition and Neville, B. (1997). Complex orofacial movements and
(OSME-3). Austin, TX: Pro-Ed. the disappearance of cerebellar mutism: Report of five
Thoonen, G., Maassen, B., Gabreels, F., and Schreuder, R. cases. Developmental Medicine and Child Neurology, 39,
(1999). Validity of maximum performance tasks to diagnose 686–690.
motor speech disorders in children. Clinical Linguistics and
Phonetics, 13, 1–23.
Workinger, M., and Kent, R. (1991). Perceptual analysis of Speech Disorders in Children:
the dysarthrias in children with athetoid and spastic cere-
bral palsy. In C. Moore, K. Yorkston, and D. Beukelman Speech-Language Approaches
(Eds.), Dysarthria and apraxia of speech: Perspectives on
management (pp. 109–126). Baltimore: Paul H. Brookes.
Yorkston, K., Beukelman, D., Strand, E., and Bell, K. (1999). Children with speech disorders often display di‰culty
Management of motor speech disorders in children and adults in other domains of language, suggesting that they ex-
(2nd ed.). Austin, TX: Pro-Ed. perience di‰culty with the language learning process
in general. A theoretical shift from viewing children’s
Further Readings speech disorders as articulatory-based to viewing them
from a linguistic perspective was precipitated by the ap-
Chapman-Bahr, D. (2001). Oral motor assessment and treat-
plication of phonological theories and principles to the
ment: Ages and stages. Needham Heights, MA: Allyn and
Bacon. field of speech pathology, beginning in the late 1970s.
Foley, B. (1993). The development of literacy in individuals Implicit in this shift was the recognition that acquisition
with severe congenital speech and motor impairments. of a linguistic system is a gradual, primarily auditory-
Topics in Language Disorders, 13, 16–22. perceptually based process involving the development of
Goldblatt, D., and Williams, D. (1986). ‘‘I an sniling’’: Moe- receptive knowledge first (Hodson and Paden, 1991;
bius syndrome inside and out. Journal of Child Neurology, Ingram, 1997). Another implication from research on
1, 71–78. normal phonological acquisition is that linguistic input
Gordon, N. (2001). Mutism: Elective, selective, and acquired. should demonstrate a sound’s contrastive role in the
Brain and Development, 23, 83–87. ambient phonology because what children are develop-
Hardy, J. (1983). Cerebral palsy. Englewood Cli¤s, NJ:
ing is phonological oppositions. As a result of applying
Prentice-Hall.
Hayden, D., and Square, P. (1994). Motor speech treatment a linguistic model to intervention, speech-language ap-
hierarchy: A systems approach. Clinics in Communication proaches have flourished in the past two decades for
Disorders, 4, 162–174. treating children’s speech disorders. These approaches
Hodge, M. (1999). Relationship between F2/F1 vowel are characterized by: (1) an emphasis on the function
quadrilateral area and speech intelligibility in a child of the phonological system to support communication,
Speech Disorders in Children: Speech-Language Approaches 205
and thus on the pragmatic limitations of unintelligible this approach e¤ectively reduced the application of
speech; (2) a focus on the contrastive nature of pho- selected phonological processes.
nemes and the use of minimal pair contrast training to The cycles approach, proposed by Hodson and
facilitate reorganization of the system; (3) the use of Paden’s (1991), involves a goal attack strategy that cap-
phonological analysis and description to identify error italizes on observations concerning the gradual manner
patterns; (4) the selection of error patterns for elimina- in which normally developing children acquire their
tion and of sound classes and features for acquisition; phonological systems. Groups of sounds a¤ected by an
and (5) the use of a small set of sounds as exemplars of error pattern are introduced for only 1 or 2 weeks, then a
those patterns/features for acquisition. What separates new error pattern is introduced. Thus, the criterion for
speech-language approaches from other articulation advancement to a new goal or target is time-based rather
approaches is the use of phonological analysis to identify than accuracy-based. A cycle can range from 5 to 15
error patterns a¤ecting sound classes and sequences weeks, depending on the number of deficient patterns,
rather than the selection of isolated sounds to be trained and once completed, the sequence is recycled for the
in each word position. Speech-language approaches error patterns that still remain in the child’s speech.
di¤er from articulation approaches in their focus on Hodson and Paden’s cycle approach involves auditory
modifications of groups of sounds via a small set of bombardment and production practice in the form of
exemplars (e.g., /f, s/ to represent all fricatives) and their picture- and object-naming activities, and reportedly
emphasis on contrastivity for successful communication eliminates most of a child’s phonological error patterns
in a social context. in 1–2 years of intervention (Tyler, Edwards, and Sax-
As such, minimal pair contrast treatment is a speech- man, 1987; Hodson and Paden, 1991; Hodson, 1997).
language approach that highlights the semantic confu- In contrast to the three approaches just described,
sion caused when the child produces a sound error that which focus on speech within a linguistic framework,
results in a pair of homonyms (e.g., ‘‘sun’’ and ‘‘ton’’ there are language-based approaches in which little at-
both produced as /tvn/). This technique involves contention may be drawn to sound errors and these errors
trasting a pair of words in which one word contains the may not be specific targets of intervention. Instead, the
child’s error production and the other contains the target entire language system (syntax, semantic, phonology,
production (with phonemes di¤ering by only one fea- pragmatics) is targeted as a tool for communication, and
ture). In minimal pair approaches, the child is instructed improvements in phonology are expected from a process
to make perceptual and productive contrasts involving of ‘‘whole to part learning’’ (Norris and Ho¤man, 1990;
the target sound and his or her error. The goal of treat- Ho¤man, 1992). Phonological changes might be ex-
ment is to help the child learn to produce the target pected to occur because phonemes are practiced as parts
sound in the word pair to signal a di¤erence in meaning of larger wholes within the script for an entire event.
between the two words. Minimal pair contrast inter- Language-based approaches involve a variety of natu-
ventions have been shown to be e¤ective in eliminating ralistic, conversationally embedded techniques such as
error patterns and increasing the accuracy of target and sca¤olding narratives, focused stimulation in the form
related error sounds (Ferrier and Davis, 1973; Elbert, of expansions and recasts, and elicited production de-
Rockman, and Saltzman, 1980; Blache, Parsons, and vices such as forced-choice questions, cloze tasks, and
Humphreys, 1981; Weiner, 1981; Elbert, 1983; Tyler, preparatory sets.
Edwards, and Saxman, 1987; Saben and Ingham, 1991). Norris and Ho¤man’s (1990, 1993) language-based
Minimal pair approaches that involve solely perception approach focuses on sca¤olding narratives in the form of
of word contrasts, however, have not resulted in as much expansions, expatiations, and turn assistance devices to
change as when production practice with models and help the child talk about picture sequences with higher
phonetic cues is also included (Saben and Ingham, levels of discourse and semantic complexity. Ho¤man,
1991). Norris, and Monjure (1990) contrasted their sca¤olded
Minimal pair contrasts are used somewhat di¤erently narrative approach to a phonological process approach
in Metaphon (Howell and Dean, 1994; Dean et al., in two brothers with comparable phonological and lan-
1995), a ‘‘cognitive-linguistic’’ approach. This approach guage deficits. The narrative intervention facilitated
is considered cognitive-linguistic because it facilitates gains in phonology that were similar to the phonological
conceptual development and cognitive reorganization of approach, and greater gains in syntactic, semantic, and
linguistic information. Its aim is to increase, at the meta- pragmatic performance.
linguistic level, awareness and understanding of sound Other language-based approaches reported in the lit-
class di¤erences primarily through classification tech- erature have focused primarily on morphosyntactic goals
niques, with little emphasis on production. For example, (e.g., finite morphemes, pronouns, complex sentences)
to contrast alveolars and velars, the concepts of front using focused stimulation designed to provide multiple
and back are introduced and applied first to nonspeech models of target morphosyntactic structures in a natu-
sorting and classification activities. Next, these concepts ral communicative context. Procedures have involved
are transferred to the speech domain by having the child recasts and expansions of child utterances, and oppor-
listen to minimal pairs and judge whether or not words tunities to use target forms in response to forced-choice
begin with front or back sounds. Dean et al. (1995) questions, sentence fill-ins, requests for elaboration, or
present evidence from several children suggesting that false assertions in pragmatically appropriate contexts
206 Part II: Speech
(Cleave and Fey, 1997). Researchers have been inter- gest that some children, especially those with both
ested in the cross-domain e¤ects of these procedures on speech and language impairments, will show improve-
improvement in children’s speech disorders. Fey et al. ment in speech when the intervention focuses on lan-
(1994) examined the e¤ects of language intervention in guage. Determining exactly who these children are is
25 children with moderate to severe language and speech di‰cult. Preliminary evidence suggests that children
impairments who were randomly assigned to a clinician whose phonological systems are highly inconsistent may
treatment group, a parent treatment group, or a delayed- be good candidates for a language-based approach
treatment control group. The treatment groups made (Tyler, 2002). Finally, service delivery restrictions may
large gains in grammar after 5 months of intervention, dictate the use of language-based approaches in class-
but improvement in speech was no greater than that room or collaborative settings. These approaches de-
achieved by the control group. Tyler and Sandoval serve further investigation for their possible benefit in
(1994) examined the e¤ects of treatment focused only on remediating both speech and language di‰culties.
speech, only on morphosyntax, and on both domains in In summary, a variety of speech-language approaches
six preschool children. The two children who received have been shown to be e¤ective in improving speech
morphosyntactic intervention showed improvements in intelligibility and reducing the number and severity of
language but negligible improvement in phonology. error patterns in children with speech disorders. Al-
In contrast to these findings that language-based in- though these approaches employ di¤erent teaching
tervention focused on morphosyntax does not lead to methods, they originate in a linguistic model and share
gains in speech, Tyler, Lewis, Haskill, and Tolbert an emphasis on the function of the phonological system
(2002) found that a morphosyntax intervention ad- to support communication, and on the contrastive na-
dressing finite morphemes led to improvement in speech ture of phonemes to reduce the pragmatic limitations of
in comparison to a control group. Tyler et al. (2002) unintelligible speech.
investigated the e‰cacy and cross-domain e¤ects of both
a morphosyntax and a phonological intervention. Ten —Ann A. Tyler
preschool children were assigned at random to an inter-
vention of two 12-week blocks, beginning with either a References
block focused on speech first or a block focused on Blache, S. E., Parsons, C. L., and Humphreys, J. M. (1981). A
morphosyntax first. Treatment e‰cacy was evaluated minimal-word-pair model for teaching the linguistic signifi-
after one block in the sequence was applied. Not only cance of distinctive feature properties. Journal of Speech and
was the morphosyntax intervention e¤ective in promot- Hearing Disorders, 46, 291–296.
ing change in morphemes marking tense and agreement Cleave, P., and Fey, M. (1997). Two approaches to the facili-
in comparison to the no-treatment control group, but tation of grammar in children with language impairments:
it led to improvement in speech that was similar to Rationale and description. American Journal of Speech-
that achieved by the phonology intervention. Thus, for Language Pathology, 6, 22–32.
Dean, E. C., Howell, J., Waters, D., and Reid, J. (1995).
children who received language intervention, the amount Methaphon: A metalinguistic approach to the treatment of
of speech improvement was significantly greater than phonological disorder in children. Clinical Linguistics and
that observed for the control group. In a similar study, Phonetics, 9, 1–19.
Matheny and Panagos (1978) examined the e¤ect of Elbert, M. (1983). A case study of phonological acquisition.
highly structured interventions focused on syntax only Topics in Language Disorders, 3, 1–9.
and articulation only in children with deficits in both Elbert, M., Rockman, B. K., and Saltzman, D. (1980). Con-
domains, compared with a control group. Each group trasts: The use of minimal pairs in articulation training.
made significant gains in the treated domain in compar- Austin, TX: Exceptional Resources.
ison with the control group, but also made improve- Ferrier, E., and Davis, M. (1973). A lexical approach to the
ments in the untreated domain. Thus, a language-based remediation of sound omissions. Journal of Speech and
Hearing Disorders, 38, 126–130.
intervention focused on complex sentence structures led Fey, M., Cleave, P. L., Ravida, A., Long, S. H., Dejmal, A. E.,
to improved speech. and Easton, D. L. (1994). E¤ects of grammar facilitation on
Findings regarding the e¤ects of language interven- the phonological performance of children with speech and
tion on speech are equivocal, particularly results from language impairments. Journal of Speech and Hearing Re-
methodologically rigorous studies with control groups search, 37, 594–607.
(Matheny and Panagos, 1978; Fey et al., 1994; Tyler Hodson, B. W. (1997). Disordered phonologies: What have we
et al., 2002). One variable that may account for these learned about assessment and treatment? In B. W. Hodson
di¤ering results is the use of di¤erent measures to docu- and M. L. Edwards (Eds.), Perspectives in applied phonol-
ment change. Matheny and Panagos used before and ogy (pp. 197–224). Gaithersburg, MD: Aspen.
after standardized test scores, whereas Fey et al. used Hodson, B. W., and Paden, E. P. (1991). Targeting intelligible
speech: A phonological approach to remediation (2nd ed.).
Percent of Consonants Correct (PCC; Shriberg and Austin, TX: Pro-Ed.
Kwiatkowski, 1982), a general measure of consonant Ho¤man, P. R. (1992). Synergistic development of phonetic
accuracy, and Tyler et al. used a more discrete measure skill. Language, Speech, and Hearing Services in Schools, 23,
of target and generalization phoneme accuracy. None- 254–260.
theless, the collective findings from studies of the e¤ects Ho¤man, P. R., Norris, J. A., and Monjure, J. (1990). Com-
of di¤erent language-based approaches on speech sug- parison of process targeting and whole language treatment
Speech Disorders Secondary to Hearing Impairment Acquired in Adulthood 207
for phonologically delayed preschool children. Language, terns. Journal of Speech and Hearing Disorders, 49, 309–
Speech, and Hearing Services in Schools, 21, 102–109. 317.
Howell, J., and Dean, E. (1994). Treating phonological dis- Elbert, M., and McReynolds, L. V. (1985). The generalization
orders in children: Metaphon—theory to practice (2nd ed.). hypothesis: Final consonant deletion. Language and Speech,
London: Whurr. 28, 281–294.
Ingram, D. (1997). The categorization of phonological impair- Fey, M. (1992). Articulation and phonology: Inextricable con-
ment. In B. W. Hodson and M. L. Edwards (Eds.), Per- structs in speech pathology. Language, Speech, and Hearing
spectives in applied phonology (pp. 19–41). Gaithersburg, Services in Schools, 23, 225–232.
MD: Aspen. Gierut, J. A. (1989). Maximal opposition approach to phono-
Matheny, N., and Panagos, J. M. (1978). Comparing the logical treatment. Journal of Speech and Hearing Disorders,
e¤ects of articulation and syntax programs on syntax and 54, 9–19.
articulation improvement. Language, Speech, and Hearing Gierut, J. A. (1998). Treatment e‰cacy: Functional phonolog-
Services in Schools, 9, 57–61. ical disorders in children. Journal of Speech, Language, and
Norris, J. A., and Ho¤man, P. R. (1990). Language interven- Hearing Research, 41, S85–S100.
tion within naturalistic environments. Language, Speech, Hodson, B. W. (1986). The Assessment of Phonological
and Hearing Services in Schools, 21, 72–84. Processes—Revised. Austin, TX: Pro-Ed.
Norris, J. A., and Ho¤man, P. R. (1993). Language interven- Kelman, M. E., and Edwards, M. L. (1994). Phonogroup: A
tion for school-age children. San Diego, CA: Singular Pub- practical guide for enhancing phonological remediation. Eau
lishing Group. Claire, WI: Thinking Publications.
Saben, C. B., and Ingham, J. C. (1991). The e¤ects of minimal Locke, J. L. (1993). The child’s path to spoken language. Cam-
pairs treatment on the speech-sound production of two bridge, MA: Harvard University Press.
children with phonologic disorders. Journal of Speech and Monahan, D. (1986). Remediation of common phonological
Hearing Research, 34, 1023–1040. processes: Four case studies. Language, Speech, and Hear-
Shriberg, L. D., and Kwiatkowski, J. (1982). Phonological ing Services in Schools, 17, 199–206.
disorders: III. A procedure for assessing severity of in- Olswang, L. B., and Bain, B. (1994). Data collection: Mon-
volvement. Journal of Speech and Hearing Disorders, 17, itoring children’s treatment progress. American Journal of
256–270. Speech-Language Pathology, 3, 55–66.
Tyler, A. A. (2002). Language-based intervention for phono- Paul, R., and Shriberg, L. D. (1982). Associations between
logical disorders. Seminars in Speech and Language, 23, 69– phonology and syntax in speech-delayed children. Journal
81. of Speech and Hearing Research, 25, 536–547.
Tyler, A. A., Edwards, M. L., and Saxman, J. H. (1987). Powell, T. W. (1991). Planning for phonological generaliza-
Clinical application of two phonologically based treatment tion: An approach to treatment target selection. American
procedures. Journal of Speech and Hearing Disorders, 52, Journal of Speech-Language Pathology, 1, 21–27.
393–409. Schwartz, R. G. (1992). Clinical applications of recent ad-
Tyler, A. A., and Sandoval, K. T. (1994). Preschoolers with vances in phonological theory. Language, Speech, and Hear-
phonologic and language disorders: Treating di¤erent lining Services in Schools, 23, 269–276.
guistic domains. Language, Speech, and Hearing Services in Tyler, A. A. (1997). Evidence of linguistic interactions in in-
Schools, 25, 215–234. tervention. Topics in Language Disorders, 17, 23–40.
Tyler, A. A., Lewis, K., Haskill, A., and Tolbert, L. (2002). Tyler, A. A., and Watterson, K. (1991). E¤ects of phonological
E‰cacy and cross-domain e¤ects of a morphosyntax and a versus language intervention in preschoolers with both
phonology intervention. Language, Speech, and Hearing phonological and language impairment. Child Language
Services in Schools, 33, 52–66. Teaching and Therapy, 7, 141–160.
Weiner, F. F. (1981). Treatment of phonological disability Wilcox, K. A., and Morris, S. E. (1995a). Speech inter-
using the method of meaningful minimal contrast: Two vention in a language-focused curriculum. In M. Rice and
case studies. Journal of Speech and Hearing Disorders, 46, K. Wilcox (Eds.), Building a language-focused curriculum for
97–103. the preschool classroom: A foundation for lifelong communi-
cation (pp. 73–89). Baltimore: Paul H. Brookes.
Further Readings Wilcox, K. A., and Morris, S. E. (1995b). Speech outcomes of
the language-focused curriculum. In M. Rice and K. Wilcox
Chaney, C. (1990). Evaluating the whole language approach to (Eds.), Building a language-focused curriculum for the pre-
language arts: The pros and cons. Language, Speech, and school classroom: A foundation for lifelong communication
Hearing Services in Schools, 21, 244–249. (pp. 171–180). Baltimore: Paul H. Brookes.
Crystal, D. (1987). Towards a ‘‘bucket’’ theory of language
disability: Taking account of interaction between linguistic
levels. Clinical Linguistics and Phonetics, 1, 7–22.
Dinnsen, D. A., and Elbert, M. (1984). On the relationship Speech Disorders Secondary to Hearing
between phonology and learning. In M. Elbert, D. A. Impairment Acquired in Adulthood
Dinnsen, and G. Weismer (Eds.), Phonological theory and
the misarticulating child (pp. 59–68). (ASHA Monograph
No. 22.) Rockville, MD: American Speech-Language-
Hearing loss is very common in the general population,
Hearing Association.
Edwards, M. L. (1983). Selection criteria for developing ther- with a prevalence of 82.9 per 1000 (U.S. Public Health
apy goals. Journal of Childhood Communication Disorders, Service, 1990). It becomes more common with age as a
7, 36–45. result of noise exposure, vascular disease, ototoxic
Elbert, M., Dinnsen, D. A., and Powell, T. W. (1984). On agents, and other otological diseases. After arthritis and
the prediction of phonologic generalization learning pat- hypertension, hearing loss is the third most common
208 Part II: Speech
chronic condition in persons over 65 (National Center Reduced phonemic contrast also characterizes the
for Health Statistics, 1982). In a study of 3556 adults speech of some adventitiously deafened adults. Lane
from Beaver Dam, Wisconsin, Cruickshanks et al. et al. (1995) observed that voice-onset time tends to de-
(1998) found prevalence rates for hearing loss of 21% in crease for both voiced and voiceless stop consonants,
adults ages 48–59 years, 44% for those ages 60–69, 66% while Waldstein (1990) observed this e¤ect only with
for those ages 70–79, and 90% for those ages 80–92. The voiceless stop consonants. Vowel, plosive, and sibilant
prevalence of perceived hearing handicap, however, is spectra become less distinct, and vowel formant spacing
lower than the true prevalence of hearing loss. By age 70, for some speakers becomes more restricted and central-
approximately 30% of the population perceives them- ized (Waldstein, 1990; Lane and Webster, 1991; Matth-
selves as hearing impaired, and by 80 years, 50% report ies et al., 1994, 1996; Lane et al., 1995). The first vowel
being hearing impaired (Desai et al., 2001). There is also formant commonly is elevated, with some speakers also
some indication that the prevalence of hearing impair- exhibiting a reduction in second formant frequency
ment in persons 45–69 years old is increasing, especially (Perkell et al., 1992; Kishon-Rabin et al., 1999). A
among men (Wallhagen et al., 1997). greater overlap in articulator postures and placements
The typical hearing loss configuration in adults is a has also been observed, with a tendency for the conso-
bilateral high-frequency sensory loss with normal or near nant place of articulation to be displaced forward and
normal hearing in the low frequencies (Moscicki et al., vowel postures to be neutralized (Matthies et al., 1996).
1985; Cruickshanks et al., 1998). Men tend to have more Fricatives and a¤ricates appear particularly prone to
hearing loss than women, and white individuals report deterioration with profound hearing loss (Lane and
greater hearing impairment than African Americans Webtser, 1991; Matthies et al., 1996). Many of these
(Cruickshanks et al., 1998; Desai et al., 2001). changes, although subtle in many cases and variable in
Although hearing loss is common in the general pop- expression across this population, are consistent with the
ulation, its e¤ects on speech production are most pro- speech di¤erences common to speakers with prelingual
nounced in individuals who have congenital hearing loss hearing loss. Although the evidence is limited, owing to
or hearing losses acquired in early childhood. For indi- the small numbers of subjects examined, the data across
viduals who acquire hearing loss as adults, the impact on studies suggest that the e¤ects of hearing loss on speech
speech production is limited and usually does not result production are most pronounced if the hearing loss
in any perceptible speech di¤erences (Goehl and Kauf- occurs in the teens and early twenties than if it occurs in
man, 1984). The preservation of speech in most adults later adulthood.
with hearing loss likely is a consequence of residual The primary management procedure for adults with
hearing su‰cient for auditory feedback. acquired hearing loss severe enough to compromise
Speech di¤erences have, however, been reported for speech is to restore some degree of auditory feedback.
some persons with complete loss or nearly complete loss The initial intervention typically consists of fitting tradi-
of hearing. These individuals tend to remain intelligible, tional amplification in the form of hearing aids. For
although the speaking rate may be reduced by about a first-time hearing aid wearers, postfitting rehabilitation
third when compared with normal-hearing speakers consisting of counseling and auditory training im-
(Leder, Spitzer, Kirchner, et al., 1987). A decreased proves auditory performance and retention of the hear-
speaking rate is reflected in increased sentence and pause ing aid, although secondary benefit in respect to speech
durations as well as increased word, syllable, and vowel production in adults has not been studied systematically
durations (Kirk and Edgerton, 1983; Leder et al., 1986; (Walden et al., 1981).
Leder, Spitzer, Kirchner, et al., 1987; Waldstein, 1990; A sensory implant often is recommended for adults
Lane et al., 1998). Movement durations associated with who do not receive su‰cient benefit from hearing aids or
articulatory gestures also are prolonged in some adven- who cannot wear hearing aids (see cochlear implants).
titiously deafened adults (Matthies et al., 1996), and it Individuals with an intact auditory nerve and a patent
has been suggested that this overall decrease in rate cochlea are usually candidates for a cochlear implant.
contributes to a reduction in speech quality and com- Persons who cannot be fitted with a cochlear implant,
munication e¤ectiveness (Leder, Spitzer, Kirchner, et al., such as persons with severed auditory nerves or ossified
1987). cochleae, may be candidates for a device that stimulates
Changes in respiratory and vocal control have been the auditory system intracranially, such as a brainstem
noted, as evidenced by abnormal airflow, glottal aper- implant. As with hearing aids, adult patients receiv-
ture, and air expenditure per syllable as well as frequent ing sensory implants benefit from pre- and postfitting
encroachment on respiratory reserve (Lane et al., 1991, counseling and frequent monitoring. With current tech-
1998). Adventitiously deafened adults also tend to ex- nologies, many patients show substantial improvement
hibit increased breathiness, vocal intensity, and mean in auditory and speech function within months of device
fundamental frequency (Leder, Spitzer, and Kirchner, activation with little or no additional intervention, al-
1987; Leder, Spitzer, Milner, et al., 1987; Lane et al., though normalization of all speech parameters may
1991; Lane and Webster, 1991; Perkell et al., 1992). In never occur or may take years to achieve (Kishon-Rabin
addition, the fundamental frequency tends to be more et al., 1999). Some speech parameters, such as vocal in-
variable, particularly on stressed vowels (Lane and tensity and fundamental frequency, show some degree of
Webster, 1991). reversal when an implant is temporarily turned o¤ and
Speech Disorders Secondary to Hearing Impairment Acquired in Adulthood 209
then on, but the extent and time course of overall re- of voice-onset-time. Journal of the Acoustical Society of
covery after initial activation vary with the individual. America, 98, 3096–3106.
The variability in speech recovery after initial implant Leder, S., Spitzer, J., and Kirchner, J. (1987). Speaking funda-
activation appears to result from a number of factors, mental frequency of postlingually profoundly deaf adult
men. Annals of Otology, Rhinology, and Laryngology, 96,
among them age at onset of hearing loss, improvement
322–324.
in auditory skills after implant activation, extent of Leder, S., Spitzer, J., Kirchner, J. C., Flevaris-Phillips, C.,
speech deterioration prior to activation, and the speech Milner, P., and Richardson, F. (1987). Speaking rate of
parameters a¤ected (Perkell et al., 1992; Lane et al., adventitiously deaf male cochlear implant candidates. Jour-
1995, 1998; Kishon-Rabin et al., 1999; Vick et al., 2001). nal of the Acoustical Society of America, 82, 843–846.
As a result, some patients may benefit from behavioral Leder, S., Spitzer, J., Milner, P., Flevaris-Phillips, C., Kirch-
intervention to facilitate recovery. In particular, persons ner, J., and Richardson, F. (1987). Voice intensity of pro-
with poor speech quality prior to receiving an implant, spective cochlear implant candidates and normal-hearing
as well as persons with central auditory deficits and adult males. Laryngoscope, 97, 224–227.
compromised devices, might benefit from systematic au- Leder, S., Spitzer, J., Milner, P., Flevaris-Phillips, C.,
Richardson, F., and Kirchner, J. (1986). Reacquisition of
ditory, speech, and communication skills training, al-
contrastive stress in an adventitiously deaf speaker using a
though the relationship between speech recovery and single-channel cochlear implant. Journal of the Acoustical
behavioral treatment has received little investigation in Society of America, 79, 1967–1974.
these patients. Matthies, M., Svirsky, M., Lane, H., and Perkell, J. (1994). A
See also auditory training; cochlear implants; preliminary study of the e¤ects of cochlear implants on the
hearing loss screening: the school-age child; production of sibilants. Journal of the Acoustical Society of
noise-induced hearing loss; ototoxic medications; America, 96, 1367–1373.
presbycusis; speechreading training and visual Matthies, M., Svirsky, M., Perkell, J., and Lane, H. (1996).
tracking. Acoustic and articulatory measures of sibilant production
with and without auditory feedback from a cochlear implant.
—Sheila Pratt Journal of Speech and Hearing Research, 39, 936–946.
Moscicki, E., Elkins, E., Baum, H., and McNamara, P. (1985).
Hearing loss in the elderly: An epidemiologic study of
References the Framingham Heart Study cohort. Ear and Hearing, 6,
184–190.
Cruickshanks, K., Wiley, T., Tweed, T., Klein, B., Klein, R., National Center for Health Statistics. (1982). Hearing ability of
Mares-Perlman, J., et al. (1998). Prevalence of hearing loss persons by sociodemographic and health characteristics:
in older adults in Beaver Dam, Wisconsin. American Jour- United States. Vital and Health Statistics (Series 10, No.
nal of Epidemiology, 148, 879–886. 140, DHS Publication No. PHS 82-1568). Washington, DC:
Desai, M., Pratt, L., Lentzner, H., and Robinson, K. (2001). U.S. Public Health Service.
Trends in vision and hearing among older Americans. Aging Perkell, J., Lane, H., Svirsky, M., and Webster, J. (1992).
Trends, 2. Hyattsville, MD: National Center of Health Sta- Speech of cochlear implant patients: A longitudinal study
tistics. of vowel production. Journal of the Acoustical Society of
Goehl, H., and Kaufman, D. (1984). Do the e¤ects of adven- America, 91, 2961–2978.
titious deafness include disordered speech? Journal of U.S. Public Health Service. (1990). Healthy People 2000.
Speech and Hearing Disorders, 49, 58–64. Washington, DC: U.S. Government Printing O‰ce.
Kirk, K., and Edgerton, B. (1983). The e¤ects of cochlear im- Vick, J. C., Lane, H., Perkell, J. S., Matthies, M. L., Gould, J.,
plant use on voice parameters. Otolaryngologic Clinics of and Zandipour, M. (2001). Covariation of cochlear implant
North America, 16, 281–292. users’ perception and production of vowel contrasts and
Kishon-Rabin, L., Taitelbaum, R., Tobin, Y., and Hilde- their identification by listeners with normal hearing. Journal
sheimer, M. (1999). The e¤ect of partially restored hearing of Speech, Language, and Hearing Research, 44, 1257–
on speech production of postlingually deafened adults with 1267.
multichannel cochlear implants. Journal of the Acoustical Walden, B., Erdman, S., Montgomery, A., Schwartz, D., and
Society of America, 106, 2843–2857. Prosek, R. (1981). Some e¤ects of training on speech rec-
Lane, H., Perkell, J., Svirsky, M., and Webster, J. (1991). ognition by hearing-impaired adults. Journal of Speech and
Changes in speech breathing following cochlear implant in Hearing Research, 24, 207–216.
postlingually deafened adults. Journal of Speech and Hear- Waldstein, R. (1990). E¤ects of postlingual deafness on speech
ing Research, 34, 526–533. production: Implications for the role of auditory feedback.
Lane, H., Perkell, J., Wozniak, J., Mansella, J., Guiod, P., Journal of the Acoustical Society of America, 88, 2099–
Matthies, M., et al. (1998). The e¤ect of changes in hearing 2114.
status on speech sound level and speech breathing: A study Wallhagen, M., Strawbridge, W., Cohen, R., and Kaplan, G.
conducted with cochlear implant users and NF-2 patients. (1997). An increasing prevalence of hearing impairment
Journal of the Acoustical Society of America, 104, 3059– and associated risk factors over three decades of the Ala-
3069. meda County study. American Journal of Public Health, 87,
Lane, H., and Webster, J. W. (1991). Speech deterioration 440–442.
in postlingually deafened adults. Journal of the Acoustical
Society of America, 89, 859–866. Further Readings
Lane, H., Wozniak, J., Matthies, M., Svirsky, M., and Perkell,
J. (1995). Phonemic resetting versus postural adjustments Bennie, C., Danilo¤, R., and Buckingham, H. (1982). Phonetic
in the speech of cochlear implant users: An exploration disintegration in a five-year old following sudden hearing
210 Part II: Speech
loss. Journal of Speech and Hearing Disorders, 47, 181– This entry summarizes information on phonological
189. development and disorders in Latino children, focusing
Cowie, R., Douglas-Cowie, E., Phil, D., and Kerr, A. (1982). on those who are Spanish-speaking. Spanish phonology
Speech changes following reimplantation from a single- and phonological development in typically developing
channel to a multichannel cochlear implant. Journal of
Spanish-speaking children, Spanish-speaking children
Laryngology and Otology, 96, 101–112.
Economou, A., Tartter, V., Chute, P., and Hellman, S. (1992). with phonological disorders, typically developing bilin-
Speech changes following reimplantation from a single- gual (Spanish-English) children, and bilingual (Spanish-
channel to a multichannel cochlear implant. Journal of the English) children with phonological disorders will be
Acoustical Society of America, 92, 1310–1323. reviewed.
Lane, H., and Tranel, B. (1971). The Lombard sign and the There are five primary vowels in Spanish, the two
role of hearing in speech. Journal of Speech and Hearing front vowels, /i/ and /e/, and the three back vowels, /u/,
Research, 14, 677–709. /o/, and /a/. There are 18 phonemes in General Spanish
Langereis, M., Bosman, A., van Olphen, A., and Smooren- (Núñez-Cedeño and Morales-Front, 1999): the voiceless
burg, G. (1997). Changes in vowel quality in post-lingually unaspirated stops, /p/, /t/, and /k/; the voiced stops, /b/,
deafened cochlear implant users. Audiology, 36, 279–297.
/d/, and /g/; the voiceless fricatives, /f/, /x/, and /s/; the
Langereis, M., Bosman, A., van Olphen, A., and Smooren-
burg, G. (1998). E¤ect of cochlear implantation on voice a¤ricate, /tS/; the glides, /w/ and /j/; the lateral, /l/; the
fundamental frequency in post-lingually deafened adults. flap /Q/, the trill /r/; and the nasals, /m/, /n/, and /O/.
Audiology, 37, 219–230. The three voiced stops /b, d, g/ are in complementary
Plant, G. (1984). The e¤ects of an acquired profound hearing distribution with the spirants [b] (voiced bilabial), [D]
loss on speech production. British Journal of Audiology, 18, (voiced interdental), and [f] (voiced velar), respectively.
39–48. The spirant allophones most generally occur intervocali-
Pratt, S., and Tye-Murray, N. (1997). Speech impairment sec- cally both within and across word boundaries (e.g.,
ondary to hearing loss. In M. R. McNeil (Ed.), Clinical /dedo/ (finger) ! [deDo]; /la boka/ (the mouth) ! [la
Management of Sensorimotor Speech Disorders (pp. 345– boka]) and in word internal consonant clusters (e.g.,
387). New York: Thieme.
/ablaQ/ (to talk) ! [aßlaQ]).
Smyth, V., Murdoch, B., McCormack, P., and Marshall, I.
(1991). Objective and subjective evaluation of subjects fitted The phonetic inventory of Spanish di¤ers from that of
with the cochlear multi-channel cochlear prostheses: 3 English. Spanish contains some sounds that are not part
studies. Australian Journal of Human Communication Dis- of the English phonetic system, including the voiced
orders, 19, 32–52. palatal nasal [O], as in [niOo] (boy), the voiceless bilabial
Spir, S., and Canter, G. (1991). Postlingual deaf speech and the fricative [F], as in [emFeQmo] (sick), the voiceless velar
role of audition in speech production: Comments on Wald- fricative [x], as in [relox] (watch), the voiced spirants [ß],
stein’s paper. Journal of the Acoustical Society of America, as in [klaßo] (nail), and [f], as in [lafo] (lake), the
90, 1672–1678. alveolar trill [r], as in [pero] (dog), and the voiced uvular
Svirsky, M., Lane H., Perkell, J., and Wozniak, J. (1992). trill [R], as in [Roto] (broken).
E¤ects of short-term auditory deprivation on speech pro-
As in English, there are a number of dialectal varieties
duction in adult cochlear implant users. Journal of the
Acoustical Society of America, 92, 1284–1300. associated with Spanish. In the United States, the two
Svirsky, M., and Tobey, E. (1991). E¤ect of di¤erent types of most prevalent dialect groups of Spanish are Southwest-
auditory stimulation on vowel formant frequencies in mul- ern United States (e.g., Mexican Spanish) and Caribbean
tichannel cochlear implant users. Journal of the Acoustical (e.g., Puerto Rican Spanish) (Iglesias and Goldstein,
Society of America, 89, 2895–2904. 1998). Unlike English, in which dialectal variations are
Zimmermann, G., and Rettaliata, P. (1981). Articulatory pat- generally defined by alterations in vowels, Spanish dia-
terns of an adventitiously deaf speaker: Implications for the lectal di¤erences primarily a¤ect consonants. Specifi-
role of auditory information in speech production. Journal cally, fricatives and liquids (in particular /s/, /Q/, and /r/)
of Speech and Hearing Research, 24, 169–178. tend to show more variation than stops, glides, or the
a¤ricate.
Common dialectal variations include deletion and/or
Speech Issues in Children from Latino aspiration of /s/ (e.g., /dos/ (two) ! [do] or [do h ]); dele-
Backgrounds tion of /Q/ (e.g., /koQtar/ (cut) ! [kottaQ]); substitution
of [l] or [i] for /Q/ (e.g., /koQtar/ ! [koltaQ/[koitaQ]);
and substitution of [x] or [R] for /r/ (e.g., /pero/
Research into English phonological development in typ- (dog) ! [pexo/peRo]. It should be noted that not every
ically developing children and children with phonologi- feature is always evidenced in the same manner and that
cal disorders has been occurring since the 1930s (e.g., not every speaker of a particular dialect uses each and
Wellman et al., 1931; Hawk, 1936). There is limited in- every dialectal feature.
formation on phonological development in Latino chil-
dren, particularly those who are monolingual Spanish Phonological Development in Monolingual Spanish-
speakers and bilingual (Spanish and English) speakers. Speaking Children. Most of the developmental phono-
Over the past 15 years, however, phonological informa- logical data on Spanish have been collected from
tion collected on monolingual Spanish speakers and bi- typically developing, monolingual children. Data from
lingual (Spanish-English) speakers has increased greatly. segment-based studies suggest that typically developing,
Speech Issues in Children from Latino Backgrounds 211
preschool, Spanish-speaking children accurately produce phonological systems of bilingual (English-Spanish)

most segments by age 3½ years (Maez, 1981). By 5 speakers develop somewhat di¤erently from the phono-
years, the following phonemes were found not to be logical system of monolingual speakers of either lan-
mastered (produced accurately at least 90% of the time): guage. Gildersleeve, Davis, and Stubbe (1996) and
/g/, /f/, /s/, /O/, /Q/, and /r/ (e.g., de la Fuente, 1985; Gildersleeve-Neumann and Davis (1998) found that in
Anderson and Smith, 1987; Acevedo, 1993). By the time English, typically developing, bilingual preschoolers
Spanish-speaking children reached first grade, there were showed an overall lower intelligibility rating, made more
only a few specific phones on which typically developing errors overall (on both consonants and vowels), distorted
children were likely to show any errors at all: the frica- more sounds, and produced more uncommon error
tives [x], [s], and [D], the a¤ricate [tS ], the flap [Q], the trill patterns than monolingual children of the same age.
[r], the lateral [l], and consonant clusters (Evans, 1974; Gildersleeve, Davis, and Stubbe (1996) and Gildersleeve-
M. M. Gonzalez, 1978; Bailey, 1982). Neumann and Davis (1998) also found higher percent-
Studies examining phonological processes indicate ages of occurrence (7%–10% higher) for typically
that Spanish-speaking children have suppressed (i.e., are developing, bilingual children (in comparison to their
no longer productively using) the majority of phonolog- monolingual peers) on a number of phonological pro-
ical processes by the time they reach 3½ years of age cesses, including cluster reduction, final consonant dele-
(e.g., A. Gonzalez, 1981; Mann et al., 1992; Goldstein tion, and initial voicing. This discrepancy between
and Iglesias, 1996a). Commonly occurring phonological monolingual and bilingual speakers, however, does not
processes (percentages of occurrence greater than 10%) seem to be absolute across the range of phonological
included postvocalic singleton omission, stridency dele- processes commonly exhibited in children of this age.
tion, tap/trill /r/, consonant sequence reduction, and Goldstein and Iglesias (1999) examined English and
final consonant deletion. Less commonly occurring pro- Spanish phonological skills in 4-, 5-, and 6-year-old,
cesses (percentages of occurrence greater than 10%) were typically developing bilingual children and found that
fronting (both velar and palatal), prevocalic singleton some phonological patterns (e.g., initial consonant dele-
omission, assimilation, and stopping. tion and dea¤rication) were exhibited at somewhat lower
Although there have been quite a number of studies rates in bilingual children with phonological disorders
characterizing phonological patterns in typically devel- than has been reported for monolingual, Spanish-
oping Spanish-speaking children, this information speaking children with phonological disorders (Goldstein
remains sparse for Spanish-speaking children with and Iglesias, 1996b). Thus, although the average per-
phonological disorders. Goldstein and Iglesias (1993) centage-of-occurrence di¤erence is not large between
examined consonant production in Spanish-speaking monolingual and bilingual speakers, the results indicate
preschoolers with phonological disorders and found that that bilingual children will not always exhibit higher
all stops, the fricative [f ], the glides, and the nasals were percentages of occurrence on phonological processes
produced accurately more than 75% of the time. The than monolingual children.
spirants [b] and [D], the a¤ricate, the flap [Q], the trill [r], Goldstein and Washington (2001) indicated that the
and the lateral [l] were produced accurately 50%–74% of phonological skills of 4-year-old bilingual children were
the time. Finally, the fricative [s], the spirant [f], and clus- similar to their monolingual counterparts; however, the
ters were produced accurately less than 50% of the time. substitution patterns used for the target sounds flap /Q/
Phonological development in Spanish-speaking pre- and trill /r/ did vary somewhat between bilingual and
school children with phonological disorders has also monolingual speakers. For example, in bilingual chil-
been examined (Meza, 1983; Goldstein and Iglesias, dren [l] was a common substitute for the trill, but it was
1996b). Meza (1983) found that these children showed a relatively rare substitute for the trill in monolingual
errors on liquids, stridents, and bilabials in more than children. All four studies also found that bilingual chil-
30% of possible occurrences. Goldstein and Iglesias dren exhibited error patterns found in both languages
(1996b) found that low-frequency phonological pro- (e.g., cluster reduction) as well as those, like liquid glid-
cesses (percentages of occurrence less than 15%) were ing, that were typical in one language (English) but
palatal fronting, final consonant deletion, assimilation, atypical in the other (Spanish).
velar fronting, and weak syllable deletion. Moderate Data from bilingual children with phonological dis-
frequency processes (percentages of occurrence between orders indicate that they exhibit more errors, lower rates
15% and 30% for three processes) were initial consonant of accuracy on consonants, and higher percentages of
deletion, liquid simplification, and stopping. The high- occurrence for phonological processes than either typi-
frequency process (percentages of occurrence greater cally developing, bilingual children (Goldstein and Igle-
than 30%) was cluster reduction. Other error types sias, 1999) or monolingual, Spanish-speaking children
exhibited by children with phonological disorders that with phonological disorders (Goldstein and Iglesias,
were not usually observed in typically developing, 1996b). The types of errors exhibited by the children,
Spanish-speaking children were dea¤rication, lisping, however, are similar regardless of target language (i.e.,
and backing. Spanish versus English). Specifically, bilingual children
with phonological disorders showed higher error rates
Phonological Development in Bilingual (Spanish- on clusters, fricatives, and liquids than other classes of
English) Children. There is increasing evidence that the sounds. Finally, percentages of occurrence for phono-
212 Part II: Speech
logical processes were higher overall for bilingual chil- Gonzalez, A. (1981). A descriptive study of phonological devel-
dren with phonological disorders than for monolingual, opment in normal speaking Puerto Rican preschoolers.
Spanish-speaking children with phonological disorders. Unpublished doctoral dissertation, Pennsylvania State Uni-
The number of Latino children who speak Spanish versity, State College, PA.
Gonzalez, M. M. (1978). Cómo detectar al niño con problemas
continues to increase. Developmental phonological data
del habla (Identifying speech disorders in children). México:
collected from typically developing, monolingual Latino Editorial Trillas.
children who speak Spanish indicate that by age 3½, Hawk, S. (1936). Speech defects in handicapped children.
they use the dialectal features of their speech com- Journal of Speech Disorders, 1, 101–106.
munity and have mastered the majority of sounds in Iglesias, A., and Goldstein, B. (1998). Language and dialectal
the language. The phonological development of typi- variations. In J. Bernthal and N. Bankson (Eds.), Articula-
cally developing, bilingual (Spanish-English) speakers is tion and phonological disorders (4th ed., pp. 148–171).
somewhat di¤erent from that of monolingual speakers of Needham Heights, MA: Allyn and Bacon.
either language. Maez, L. (1981). Spanish as a first language. Unpublished
doctoral dissertation, University of California, Santa
—Brian Goldstein Barbara.
Mann, D. P., Kayser, H., Watson, J., and Hodson, B. (1992,
References November). Phonological systems of Spanish-speaking
Texas preschoolers. Paper presented at the annual conven-
Acevedo, M. A. (1993). Development of Spanish consonants in tion of the American Speech-Language-Hearing Associa-
preschool children. Journal of Childhood Communication tion, San Antonio, TX.
Disorders, 15, 9–15. Meza, P. (1983). Phonological analysis of Spanish utterances
Anderson, R., and Smith, B. (1987). Phonological develop- of highly unintelligible Mexican-American children. Unpub-
ment of two-year-old monolingual Puerto Rican Spanish- lished master’s thesis, San Diego State University, San
speaking children. Journal of Child Language, 14, 57–78. Diego, CA.
Bailey, S. (1982). Normative data for Spanish articulatory skills Núñez-Cedeño, R., and Morales-Front, A. (1999). Fonologı́a
of Mexican children between the ages of six and seven. generativa contemporánea de la lengua española (Contem-
Unpublished master’s thesis, San Diego State University, porary generative phonology of the Spanish language).
San Diego, CA. Washington, DC: Georgetown University Press.
de la Fuente, M. T. (1985). The order of acquisition of Spanish Wellman, B., Case, I., Mengert, I., and Bradbury, D. (1931).
consonant phonemes by monolingual Spanish speaking chil- Speech sounds of young children (University of Iowa Studies
dren between the ages of 2.0 and 6.5. Unpublished doctoral in Child Welfare No. 5). Iowa City: University of Iowa
dissertation, Georgetown University, Washington, DC. Press.
Evans, J. S. (1974). Word pair discrimination and imitation
abilities of preschool Spanish-speaking children. Journal of Further Readings
Learning Disabilities, 7, 573–580.
Gildersleeve, C., Davis, B., and Stubbe, E. (1996). When mon- Cotton, E., and Sharp, J. (1988). Spanish in the Americas.
olingual rules don’t apply: Speech development in a bilingual Washington, DC: Georgetown University Press.
environment. Paper presented at the annual convention of Dalbor, J. (1980). Spanish pronunciation: Theory and practice
the American Speech-Language-Hearing Association, Seat- (2nd ed.). New York: Holt, Rinehart and Winston.
tle, WA. Eblen, R. (1982). A study of the acquisition of fricatives by
Gildersleeve-Neumann, C., and Davis, B. (1998). Learning three-year-old children learning Mexican Spanish. Lan-
English in a bilingual preschool environment: Change over guage and Speech, 25, 201–220.
time. Paper presented at the annual convention of the Fantini, A. (1985). Language acquisition of a bilingual child: A
American Speech-Language-Hearing Association, San An- sociolinguistic perspective (to age 10). San Diego, CA: Col-
tonio, TX. lege-Hill Press.
Goldstein, B., and Iglesias, A. (1993). Phonological patterns Goldstein, B. (1995). Spanish phonological development. In
in speech-disordered Spanish-speaking children. Paper pre- H. Kayser (Ed.), Bilingual speech-language pathology: An
sented at a convention of the American Speech-Language- Hispanic focus (pp. 17–38). San Diego, CA: Singular Pub-
Hearing Association, Anaheim, CA. lishing Group.
Goldstein, B., and Iglesias, A. (1996a). Phonological patterns Goldstein, B. (1996). The role of stimulability in the assessment
in normally developing Spanish-speaking 3- and 4-year-olds and treatment of Spanish-speaking children. Journal of
of Puerto Rican descent. Language, Speech, and Hearing Communication Disorders, 29, 299–314.
Services in the Schools, 27, 82–90. Goldstein, B. (2000). Cultural and linguistic diversity resource
Goldstein, B., and Iglesias, A. (1996b). Phonological patterns guide for speech-language pathology. San Diego, CA: Sin-
in Puerto Rican Spanish-speaking children with phono- gular Publishing Group.
logical disorders. Journal of Communication Disorders, 29, Goldstein, B. (2001). Assessing phonological skills in Hispanic/
367–387. Latino children. Seminars in Speech and Language, 22, 39–
Goldstein, B., and Iglesias, A. (1999, February). Phonological 49.
patterns in bilingual (Spanish-English) children. Seminar Goldstein, B., and Cintron, P. (2001). An investigation of
presented at the 1999 Texas Research Symposium on Lan- phonological skills in Puerto Rican Spanish-speaking 2-
guage Diversity, Austin, TX. year-olds. Clinical Linguistics and Phonetics, 15, 343–361.
Goldstein, B., and Washington, P. (2001). An initial in- Goldstein, B., and Pollock, K. (2000). Vowel errors in Spanish-
vestigation of phonological patterns in 4-year-old typically speaking children with phonological disorders: A retrospec-
developing Spanish-English bilingual children. Language, tive, comparative study. Clinical Linguistics and Phonetics,
Speech, and Hearing Services in the Schools, 32, 153–164. 14, 217–234.
Speech Sampling, Articulation Tests, and Intelligibility in Children with Phonological Errors 213
Gonzalez, G. (1983). The acquisition of Spanish sounds in the In most cases, when we assess speech production in a
speech of two-year-old Chicano children. In R. Padilla preschool child, the purposes of the assessment are to
(Ed.), Theory technology and public policy on bilingual edu- describe the child’s phonological system and make deci-
cation (pp. 73–87). Rosslyn, VA: National Clearinghouse sions about management, if needed. A good audio- or
for Bilingual Education.
video-recorded speech sample from play interactions
Macken, M. (1978). Permitted complexity in phonological de-
velopment: One child’s acquisition of Spanish consonants. with parents or the clinician can capture all of the pri-
Lingua, 44, 219–253. mary data needed to describe the system, or it can be
Macken, M., and Barton, D. (1980). The acquisition of the supplemented with a single-word test instrument. Typi-
voicing contrast in Spanish: A phonetic and phonological cally, we define an adequate conversational speech sam-
study of word-initial stop consonants. Journal of Child ple as one that includes at least 100 di¤erent words
Language, 7, 433–458. (Crystal, 1982). Additionally, these words should not be
Mann, D., and Hodson, B. (1994). Spanish-speaking children’s direct or immediate repetitions of an adult model. Fi-
phonologies: Assessment and remediation of disorders. nally, for children with poor intelligibility, it is helpful
Seminars in Speech and Language, 15, 137–147. for the examiner to repeat what he believes the child said
Oller, D. K., and Eilers, R. (1982). Similarity of babbling in
after each utterance so that this spoken gloss is also
Spanish- and English-learning babies. Child Language, 9,
565–577. recorded on the tape.
Pandolfi, A. M., and Herrera, M. O. (1990). Producción fono- Once the recording has been made, the next task is for
logica diastratica de niños menores de tres años (Phono- the examiner to gloss and transcribe the speech sample,
logical production in children less than three years old). ideally using narrow transcription (Edwards, 1986; Shri-
Revista Teorica y Aplicada, 28, 101–122. berg and Kent, 2003). Professionals who frequently do
Perez, E. (1994). Phonological di¤erences among speakers of extensive transcription may wish to use a computer-
Spanish-influenced English. In J. Bernthal and N. Bankson based system. (See Masterson, Long, and Buder, 1998,
(Eds.), Child phonology: Characteristics, assessment, and infor an excellent review of such software.) However, such
tervention with special populations (pp. 245–254). New systems are only as good as the clinician’s memory for
York: Thieme.
the symbols and diacritics and her memory of their lo-
Poplack, S. (1978). Dialect acquisition among Puerto Rican
bilinguals. Language in Society, 7, 89–103. cation on the keyboard; consequently, doing transcrip-
Terrell, T. (1981). Current trends in the investigation of Cuban tion by hand may be the more reliable way to go about
and Puerto Rican phonology. In J. Amastae and L. Elı́as- this task if one does not do it frequently.
Olivares (Eds.), Spanish in the United States: Sociolinguistic With the transcript in hand, the clinician now has a
aspects (pp. 47–70). Cambridge, UK: Cambridge Univer- choice of types of analyses. First of all, one can under-
sity Press. take both independent and relational analyses (Stoel-
Vaquero, M. (1996). Antillas. In M. Alvar (Ed.), Manual de Gammon and Dunn, 1985). Independent analyses treat
dialectologı́a Hispánica (Manual of Hispanic dialectology) the child’s system as self-contained, that is, with no ref-
(pp. 51–67). Barcelona, Spain: Ariel. erence to the adult system. They include a phonetic in-
Watson, I. (1991). Phonological processing in two languages.
ventory for consonants and perhaps for vowels, as well
In E. Bialystok (Ed.), Language processing in bilingual chil-
dren (pp. 25–48). Cambridge, UK: Cambridge University as tallies of syllable or word shapes. Relational analyses,
Press. on the other hand, explicitly compare the child’s pro-
Yavas, M., and Goldstein, B. (1998). Phonological assessment duction to that of the adult, including a segmental
and treatment of bilingual speakers. American Journal of (phonemic) inventory for consonants and perhaps for
Speech-Language Pathology, 7, 49–60. vowels and a list of the phonological processes that the
child uses.
Independent analyses are appropriate for children
who are very young, or who have very poor intelligibil-
Speech Sampling, Articulation Tests, ity, or who appear to use few di¤erentiated speech
and Intelligibility in Children with sounds. Clinicians typically devise their own forms for
Phonological Errors these analyses, although some of the software mentioned
above permits certain of these independent analyses to
be done automatically. Frequency of use is an issue in
Children who have speech sound disorders can reason- independent analyses, so the various phones and syl-
ably be separated into two distinct groups. One group lable structures that appear in the transcript should be
comprises children for whom intelligibility is a primary tallied on the inventory form. It is helpful to structure
issue and who tend to use many phonological processes, these forms into major consonant classes and major
especially deletion processes. These children are gener- vowel classes, as well as syllable position—syllable-
ally in the preschool age range. The second group in- initial, syllable-final, and intervocalic. In addition, for
cludes children who have residual errors, that is, some types of analysis, separate inventories should be
substitution and distortion errors that are relatively few done for one-, two-, and three-syllable utterances or
in number. These children are typically of school age, words.
and intelligibility is better than in the first group. The One kind of relational analysis, the segmental or
first group, namely children with phonological di‰- phonemic inventory, which compares the child’s pro-
culties, is the focus of this entry. duction to the adult target, is more familiar to clinicians
214 Part II: Speech
because it resembles typical published tests of articula- (However, SHAPE has an extensive list of potential
tion and phonology. Phonological process analysis is idiosyncratic processes in an appendix, along with in-
also considered to be relational in nature. If clinicians structions for determining the frequency of use of these
are working from a transcript of conversational speech, processes.)
all of the software mentioned earlier can provide at least Specialized testing using single-word stimuli may be
a list of phonological processes. Alternatively, clinicians required for specific treatment orientations. For exam-
may again devise their own forms for the segmental ple, in order to carry out a generative analysis, Elbert
inventories and the list of phonological processes. Typi- and Gierut (1986) have developed a test with more than
cally, the list of phonological processes will include 300 items in which bound morphemes, such as re-
the 8–10 processes commonly listed in texts and tests (meaning again) or the diminutive -y or -ie, are added to
of phonology, as well as any unique or idiosyncratic the word at either end. The purpose is to determine
processes that the child uses. The examiner then goes whether the child changes an error production of the first
through the transcript noting what the child produces for consonant or the last consonant in the presence of the
each adult form. These productions are also tallied. morphological addition and changes it in a way that
One other important measure that is relational in na- clarifies the child’s underlying representation.
ture is the Percentage of Consonants Correct (PCC; Another example of specialized testing using either
Shriberg and Kwiatkowski, 1982). The number of adult single words or connected speech is the elicitation of
targets that the child attempts is tallied, using the stan- data needed for nonlinear analysis (Bernhardt and Stoel-
dards of colloquial speech, and the number of targets Gammon, 1994). Nonlinear approaches deal with the
that the child produces acceptably is also tallied. Simple hierarchies of representation of words, for example, at
division of these tallies results in the PCC. The PCC is the segmental level, the syllable level, the foot level, and
considered to be a measure of severity. It is a useful so on. Productions of multisyllabic words with varying
measure for assessing change over long periods of treat- stress patterns are needed to complete these analyses. In
ment, such as 6 months. most cases these multisyllabic words must be elicited by
imitation or picture naming.
Tests of Phonology. Several tests published commer- Finally, an altogether di¤erent type of specialized
cially permit analysis of children’s use of phonological testing is the determination of stimulability. A child is
processes on the basis of single-word naming of objects said to be stimulable for an error sound if the clinician
or pictures. They include the Bernthal-Bankson Test of can elicit an acceptable production using models, cues,
Phonology (BBTOP; Bernthal and Bankson, 1990), the or phonetic placement instructions. This part of the as-
Khan-Lewis Phonological Analysis (KLPA; Khan and sessment is often performed informally and usually for
Lewis, 1986), and the Smit-Hand Articulation and Pho- just a few error sounds. However, Perrine, Bain, and
nology Evaluation (SHAPE; Smit and Hand, 1997), all Weston (2000) have devised a systematic way to assess
of which are based on pictures or photos, and the As- stimulability based on a hierarchy of cues and models
sessment of Phonological Processes–Revised (APP-R; that is helpful in planning intervention.
Hodson, 1986), which is based on object naming.
Tests of phonology can complement analyses of the Intelligibility. Intelligibility refers to how well the
conversational sample. One of their virtues is that for the child’s words can be understood by others. There are at
phonological processes that are assessed, they incorpo- least two measures of intelligibility that are based on
rate multiple exemplars of each process, so that there is counts of intelligible words, as well as many scales for
some assurance that the child’s use of the process is not making perceptual judgments of intelligibility. The most
happenstance. For some extremely unintelligible chil- straightforward numerical measure, Percent of Intelligi-
dren, tests of phonology may be the only way to figure ble Words (PIW), has been described by Shriberg and
out the child’s patterns because the clinician at least Kwiatkowski (1982). To determine this measure, a per-
knows what the intended word should be. Some of these son who does not know the child listens to the conver-
tests (BBTOP, KLPA, and SHAPE) also permit com- sational speech sample but does not hear the clinician’s
parisons to be made between the child’s performance comments. This person attempts to gloss the sample.
and normative data. The number of words correctly glossed by the listener
On the other hand, if a test of phonology becomes the is divided by the total number of words to obtain the
primary assessment tool, the clinician needs to be aware PIW.
that measures derived from single-word naming may A second measure based on counts, the Preschool In-
di¤er from measures derived from conversation. The telligibility Measure, has been developed by Morris,
conversational speech of disordered children generally Wilcox, and Schooling (1995). The child is asked to
includes more nonadult productions than does single- imitate a series of one- or two-syllable words that are
word naming. In addition, tests of phonology tend to selected randomly from a large database of words, and
deal with a very circumscribed set of phonological pro- her productions are recorded. Then the audiotape is
cesses, and they do not deal at all with vowel produc- played for listeners, who see 12 foils for each word and
tions. Consequently, if the child uses important but circle the one they think the child said. This measure is
idiosyncratic processes, such as glottal replacement, or better suited for documenting changes in intelligibility
has systematic vowel errors, they may not be picked up. over time than it is for initial evaluation.
Speech Sampling, Articulation Tests, and Intelligibility in Children with Residual Errors 215
Other scales of intelligibility involve judgments on the Shriberg, L. D., Gruber, F. A., and Kwiatkowski, J. (1994).
part of the clinician or significant others in the child’s Developmental disorders: III. Long-term speech-sound
environment about how well the child communicates. normalization. Journal of Speech, Language, and Hearing
For example, teachers might be asked to rate how well Research, 37, 1151–1177.
Shriberg, L. D., and Kent, R. D. (2003). Clinical phonetics (3rd
they understand the child on a 6-point scale, with 1 rep-
ed.). Boston: Allyn and Bacon.
resenting ‘‘all the time’’ and 6 representing ‘‘never.’’ Or a Shriberg, L. D., and Kwiatkowski, J. (1982). Phonological
parent might be asked to rate the di‰culty that family disorders: III. A procedure for assessing severity of in-
members have in understanding the child. volvement. Journal of Speech and Hearing Disorders, 47,
256–270.
Treatment Decisions and Prognosis. Decisions about Smit, A., and Hand, L. (1997). Smit-Hand Articulation and
whether to treat and how often the child should be Phonology Evaluation (SHAPE). Los Angeles, CA: West-
seen are made primarily on the basis of severity and ern Psychological Services.
secondarily on the basis of stimulability. Although there Stoel-Gammon, C., and Dunn, C. (1985). Normal and dis-
is little research on the topic of appropriate treatment ordered phonology in children. Austin, TX: Pro-Ed.
decisions, severity appears to have universal acceptance
among speech-language pathologists as the most impor- Further Readings
tant variable in deciding for or against treatment. Bauman-Waengler, J. (2000). Articulatory and phonological
With respect to prognosis, until recently there has impairments: A clinical focus. Boston, MA: Allyn and Bacon.
been little research about how children normalize Bernthal, J. E., and Bankson, N. W. (1998). Articulation and
(achieve age-appropriate phonology) and how long it phonological disorders (4th ed.). Boston: Allyn and Bacon.
takes. However, work by Shriberg, Gruber, and Kwiat- Perrine, S. L., Bain, B. A., and Weston, A. D. (2000). Dynamic
kowski (1994) and by Gruber (1999) suggests that some assessment of phonological stimulability: Construct valida-
children who receive intervention normalize by about 6 tion of a cueing hierarchy. Paper presented at the annual
convention of the American Speech-Language-Hearing As-
years of age, and that the outer limit for normalization is
sociation, Washington, DC, Nov. 16, 2000.
about 8.5 years. However, the predictors of normaliza- Shriberg, L. D. (1994). Five subtypes of developmental pho-
tion are not yet known. nological disorders. Clinics in Communication Disorders, 4,
—Ann Bosma Smit 38–53.
Shriberg, L. D., and Kent, R. D. (2003). Clinical phonetics (3rd
References ed.). Allyn and Bacon.
Bernhardt, B., and Stoel-Gammon, C. (1994). Nonlinear pho- disorders III: A procedure for assessing severity of in-
nology: Introduction and clinical application. Journal of volvement. Journal of Speech and Hearing Disorders, 47,
Speech and Hearing Research, 37, 123–143. 256–270.
Bernthal, J., and Bankson, N. (1990). Bernthal-Bankson Test of Smit, A., Hand, L., Freilinger, J., Bernthal, J., and Bird, A.
Phonology (BBTOP). Chicago, IL: Riverside Press. (1990). The Iowa Articulation Norms Project and its
Crystal, D. (1982). Profiling linguistic disability. London: Ed- Nebraska replication. Journal of Speech and Hearing Dis-
ward Arnold. orders, 55, 779–798.
Edwards, M. L. (1986). Introduction to applied phonetics: Lab- Stoel-Gammon, C., and Dunn, C. (1985). Normal and dis-
oratory workbook. San Diego, CA: College-Hill Press. ordered phonology in children. Austin, TX: Pro-Ed.
Elbert, M., and Gierut, J. (1986). Handbook of clinical phonol- Tomblin, J. B., Morris, H. L., and Spriestersbach, D. C.,
ogy: Approaches to assessment and treatment. San Diego, (Eds.). (2000). Diagnosis in speech-language pathology (2nd
CA: College-Hill Press. ed.). San Diego, CA: Singular Publishing Group.
Gruber, F. A. (1999). Probability estimates and paths to con-
sonant normalization in children with speech delay. Journal
Hodson, B. (1986). Assessment of Phonological Processes– Speech Sampling, Articulation Tests,
Revised (APP-R). Danville, IL: Interstate Publishers and
Printers. and Intelligibility in Children with
Khan, L., and Lewis, N. (1986). The Khan-Lewis Phonological Residual Errors
Analysis (KLPA). Circle Pines, MN: American Guidance
Service.
Masterson, J., Long, S., and Buder, E. (1998). Instrumentation Children who have speech sound errors that have per-
in clinical phonology. In J. E. Bernthal and N. W. Bankson sisted past the preschool years are considered to have
(Eds.), Articulation and phonological disorders (4th ed., pp. residual errors (Shriberg, 1994). Typically, these school-
378–406). Boston: Allyn and Bacon. age children have substitution and distortion errors
Morris, S. R., Wilcox, K. A., and Schooling, R. L. (1995). The rather than deletions, and intelligibility is not usually a
Preschool Intelligibility Measure. American Journal of
primary issue. Children with residual errors generally
Perrine, S. L., Bain, B. A., and Weston, A. D. (2000). Dynamic have acquired the sound system of their language, but
assessment of phonological stimulability: Construct valida- they have errors that draw attention to the speaking
tion of a cueing hierarchy. Paper presented at the annual pattern. The assumption is usually made that they are
convention of the American Speech-Language-Hearing As- having di‰culty with the articulatory movements needed
sociation, Washington, DC, Nov. 16, 2000. to produce the acceptable sound and with embedding
216 Part II: Speech
that sound into the stream of speech. It should be provided some clues about prognosis. For example, it
noted, however, that in the early days of studying com- appears that if the child reduces substitutions and
munication disorders, authorities such as Van Riper omissions while increasing distortions, that child will
(1978) assumed that the child’s di‰culty was first of all take longer to normalize than the child who decreases
perceptual. all types of errors.
Until recently, there has been little research about
how residual speech sound errors develop, even though Speech Sampling. Speech-language pathologists typi-
some of the earliest research and intervention in com- cally elicit a speech sample using a published test of ar-
munication disorders focused on children with residual ticulation, supplemented with a conversational speech
errors. The profession has uncovered bits and pieces of sample. For a school-age child, the conversational sam-
information about development after the preschool pe- ple should be audio- or video-recorded, with careful
riod, but no coherent picture has emerged to assist in attention to the quality of the recording. This sample
predicting which children will actually make needed should include at least 100 di¤erent words and 3 minutes
changes without intervention. The question is an impor- of child talking time. If the child has a relatively large
tant one, because if we fail to treat a child at age 6 who is number of errors, this speech sample can be transcribed
not going to change spontaneously, and instead we wait phonetically for further analysis. If the child has just a
until age 9, the child has 3 additional years of practice on few sounds in error, then the clinician may decide not to
an error phoneme, and remediation will likely be more transcribe the entire speech sample. Instead, he tallies all
di‰cult. Certain information that can be obtained from instances of a target phoneme in the sample, determines
speech sampling may provide insight into the prediction how many were acceptably produced, and derives a
question: percentage of correctly produced sounds. When these
counts are based on a 3-minute sample, this procedure
Most residual errors a¤ect a subset of the phonemes of results in a TALK sample (Diedrich, 1971), which is a
English (‘‘the big 10’’): /s z S Z tS dZ T D v r/ (Winitz, probe of conversational speech.
1969). These are typically late-acquired phonemes, but
most of them are used correctly by 90% of children by Articulation Tests. Some of the first tests that were
age 8 (Smit et al., 1990). commercially available in communication disorders were
The phoneme in error may make a di¤erence. Re- tests of articulation and were designed to assess the de-
analysis of the Smit et al. (1990) cross-sectional data velopment of speech sounds. Typically, tests of this type
suggests that children may be less likely to self-correct are based on the single-word naming of pictures without
the alveolar and palatal fricatives and a¤ricates than a model from the examiner. Most tests of articulation
they are the /r/ and the /T D/ (Smit, unpublished). assess production of all English consonant phonemes in
The nature and allophonic distribution of the error word-initial and word-final position, and possibly En-
may make a di¤erence. Stephens, Ho¤man, and glish consonant clusters as well. Scoring sheets usually
Danilo¤ (1986) showed that children with lateral allow explicit comparisons between adult targets and the
productions of alveolopalatal fricatives and children children’s productions. Most articulation tests result in a
who substituted back sounds for these fricatives gen- summary numerical score that can be compared to nor-
erally did not improve spontaneously, whereas about mative data. Some currently used tests of articulation
half of the children with dental errors corrected them. include the Templin-Darley Tests of Articulation (Tem-
Ho¤man, Schuckers, and Danilo¤ (1980) showed that plin and Darley, 1969), the Goldman-Fristoe Test of
children who produced the consonantal /r/ allophone Articulation—2 (Goldman and Fristoe, 2000), the Smit-
correctly some of the time were likely to achieve the Hand Articulation and Phonology Evaluation (SHAPE;
other /r/-allophones spontaneously. Smit and Hand, 1997), and the Photo Articulation
The length of time that the child has made the error Test—Third Edition (Lippke et al., 1987).
may make a di¤erence. It is reasonable to assume that Most of the inventory tests do not require that the
if a child’s production of a phoneme has not changed clinician use narrow transcription, the exception being
at all in several years, then spontaneous change is SHAPE. Rather, broad transcription is generally used,
unlikely. even when the test requires only a notation of ‘‘correct,’’
The child’s developmental history may make a di¤er- ‘‘substitution,’’ ‘‘omission,’’ or ‘‘distortion.’’
ence. Shriberg (1994) has pointed out that some chil- The error sounds identified on an inventory test are
dren had phonological errors as preschoolers, while often examined in light of ‘‘ages of acquisition’’ for those
others did not. On logical grounds, children who had sounds. The age of acquisition is the age at which 75% or
phonological problems earlier are less likely to change 90% of children typically say the sound correctly (Tem-
without intervention because they have already dem- plin, 1957; Smit et al., 1990). The guidelines used by
onstrated di‰culties in learning the sound system of many school districts to determine caseload make refer-
their native language. ence to these ages of acquisition.
The pattern of change in the child’s errors may be im- There are other types of tests of articulation besides
portant. Recent research by Gruber (1999) into the inventory tests. In particular, there are several tests of
time taken for children who are receiving intervention contextual variation, among them the McDonald Deep
to normalize (achieve age-appropriate phonology) has Test of Articulation (McDonald, 1964), the Contextual
Speech Sampling, Articulation Tests, and Intelligibility in Children with Residual Errors 217
Test of Articulation (Aase et al., 2000), and the Secord and a Percent of Intelligible Words is calculated (Shri-
Contextual Articulation Tests (S-CAT; Secord and berg and Kwiatkowski, 1982).
Shine, 1997). Contextual variation is a way of manipu- For children with residual speech sound errors, a
lating the phonetic environment of a target sound in more salient issue than intelligibility may be that their
order to see if the client can produce the target sound errors call attention to their speech, that is, to the me-
acceptably in one or more of these novel phonetic envi- dium rather than the message. Listeners may consider
ronments. If the child is able to do so, then the clinician their speech to be babyish, bizarre, or odd. The clinician
can use these facilitating contexts in the first few treat- can develop questionnaires and rating scales and can ask
ment sessions. persons familiar with the child to fill them out in order to
Still another kind of assessment involves determining document this perception, in addition to asking the child
stimulability. Stimulability refers to the ability to elicit about the content of any teasing that may occur.
an acceptable production of a speech sound or structure,
such as a consonant cluster, from the child by presenting Interpreting the Data. Decisions about whether to pro-
instructions, cues, and models. A systematic way to as- vide intervention are often based on multiple factors.
sess stimulability has been proposed by Perrine, Bain, These include the child’s age relative to the age of ac-
and Weston (2000). The implications of stimulability quisition for the child’s error phonemes, whether or not
have been addressed by numerous researchers, but that the child is stimulable for correct production, intelligi-
research can be summed up in a few statements: bility, and the degree to which the child and significant
others consider the speech to be a problem. There is little
1. The child who is not stimulable for a specific pho- research other than that of Gruber (1999) to go by in
neme target is the child who should have the highest establishing prognosis. However, a reasonable assump-
priority for intervention. tion is that the older the child who has residual errors,
2. If a child is stimulable for a target phoneme, the child the longer it will take to achieve normalization in a
may or may not improve without intervention. treatment program.
3. Stimulable phonemes are likely to bring about quick
success in intervention. —Ann Bosma Smit
Finally, the clinician may assess inconsistency. In-

References
consistency refers to variations in the child’s productions Aase, D., Hovre, C., Krause, K., Schelfhout, S., Smith, J., and
of a given phoneme. If the child’s production is charac- Carpenter, L. J. (2000). Contextual Test of Articulation. Eau
terized by ‘‘inconsistency with hits’’ (correct produc- Claire, WI: Thinking Publications.
tions), then the context of the hits can be determined. Diedrich, W. (1971). Procedures for counting and charting a
Just as in the case for contextual variation, a hit can target phoneme. Language, Speech, and Hearing Services in
serve as an entrée into intervention. To look for incon- Schools, 18–32.
Goldman, R., and Fristoe, M. (2000). Goldman-Fristoe Test of
sistencies, the clinician may catalogue the productions Articulation—2. Circle Pines, MN: American Guidance
of the target that are heard in the conversational Service.
speech sample. Alternatively, she can administer the Gruber, F. A. (1999). Probability estimates and paths to con-
Story-Telling Probes of Articulation Competence from sonant normalization in children with speech delay. Journal
the S-CAT (Secord and Shine, 1997). of Speech, Language, and Hearing Research, 42, 448–459.
Ho¤man, P. R., Schuckers, G. H., and Danilo¤, R. G. (1980).
Intelligibility. Intelligibility is defined as a listener’s Developmental trends in correct /r/ articulation as a func-
ability to understand a speaker’s words. Although intel- tion of allophone type. Journal of Speech and Hearing Re-
ligibility of speech can be reduced in children who have search, 23, 746–755.
residual errors, the reduction often is not substantial. Lippke, B. A., Dickey, S. E., Selmar, J. W., and Soder, A. L.
(1987). Photo Articulation Test—Third Edition. Los Ange-
One exception is the child who may have a few dis- les, CA: Western Psychological Services.
tortions of phonemes but who has particular di‰culty in McDonald, E. (1964). A Deep Test of Articulation. Pittsburgh,
stringing sounds together in multisyllabic utterances. PA: Stanwix House.
This di‰culty may manifest itself as weak or imprecise Perrine, S. L., Bain, B. A., and Weston, A. D. (2000). Dynamic
articulations of sounds, along with deletions of some assessment of phonological stimulability: Construct valida-
consonants. In such cases, the examining clinician may tion of a cueing hierarchy. Paper presented at the annual
want to repeat what she understood the child to say convention of the American Speech-Language-Hearing As-
immediately afterward, so that her gloss is also recorded sociation, Washington, DC, Nov. 16, 2000.
on the tape when the conversational speech sample is Secord, W. A., and Shine, R. E. (1997). Secord Contextual
recorded. Articulation Tests. Salt Lake City, UT: Red Rock Publishing.
Shriberg, L. D. (1994). Five subtypes of developmental pho-
The standard way to assess intelligibility in a numeri- nological disorders. Clinics in Communication Disorders, 4,
cal way is to have a person who is not familiar with the 38–53.
child listen to the audio recording of the conversational Shriberg, L. D., and Kwiatkowski, J. (1982). Phonological
sample, but without hearing the examiner’s speech. This disorders: III. A procedure for assessing severity of in-
person writes down the child’s words. This gloss is com- volvement. Journal of Speech and Hearing Disorders, 47,
pared to the one generated by the examining clinician, 256–270.
218 Part II: Speech
Smit, A., and Hand, L. (1997). Smit-Hand Articulation and Proposed classifications of child speech disorders have
Phonology Evaluation. Los Angeles, CA: Western Psycho- been advanced along descriptive, predictive, and clinical
logical Services. grounds (Shriberg, 1994). Three classifications currently
Smit, A., Hand, L., Freilinger, J., Bernthal, J., and Bird, A. warrant particular attention either because of empirical
(1990). The Iowa Articulation Norms Project and its Ne-
support (those associated with Shriberg and Dodd) or
braska replication. Journal of Speech and Hearing Dis-
orders, 55, 779–798. practical significance (the Diagnostic and Statistical
Stephens, I., Ho¤man, P., and Danilo¤, R. (1986). Phonetic Manual of Mental Disorders-IV-TR; American Psychi-
characteristics of delayed /s/ development. Journal of Pho- atric Association, 2000).
netics, 14, 247–256. Currently, the most comprehensive and rigorously
Templin, M. C. (1957). Certain language skills in children. studied classification is the Speech Disorders Classifica-
Westport, CT: Greenwood Press. tion System, developed through a 20-year program of
Templin, M., and Darley, F. (1969). The Templin-Darley Tests research by Shriberg and his colleagues (Shriberg, 1994,
of Articulation. Iowa City: University of Iowa, Bureau of 1999; Shriberg and Kwiatkowski, 1982, 1994a, 1994b,
Education Research and Service. 1994c; Shriberg et al., 1997). This evolving classification
Van Riper, C. (1978). Speech correction: Principles and meth-
is designed to provide a framework for identifying and
ods (6th ed.). Englewood Cli¤s, NJ: Prentice-Hall.
Winitz, H. (1969). Articulatory acquisition and behavior. En- describing subtypes and testing etiological hypotheses.
glewood Cli¤s, NJ: Prentice-Hall, Inc. At this time, it is primarily a research tool.
Within this classification, children’s speech sound
Further Readings disorders of unknown origin are divided into speech de-
lay and residual error categories. Speech delay, with an
See the list for speech sampling, articulation tests, and estimated prevalence of 3.8% among 6-year-olds (Shri-
intelligibility in children with phonological errors. berg, Tomblin, and McSweeney, 1999), is characterized
by reduced intelligibility and increased risk for broader
communication and academic di‰culties. It encom-
Speech Sound Disorders in Children: passes more severe forms of speech disorder. Residual
Description and Classification speech errors, with a tentatively estimated prevalence of
5% among individuals older than age 9 (Shriberg, 1994),
is characterized by the presence of at least one speech
Children with speech sound disorders form a heteroge- sound error (often involving distortion of a sibilant
neous group whose problems di¤er in severity, scope, fricative or liquid) that persists past the developmental
etiology, course of recovery, and social consequences. period. Although the category of residual speech errors
Beyond manifest problems with speech production and encompasses less severe forms of speech disorder that
use, their problems can include reduced intelligibility, are associated with neither reduced intelligibility nor
risk for broader communication disorders, and academic broader communication di‰culties, disorders in this
di‰culties, as well as social stigma. category remain of interest for theoretical reasons (i.e.,
Because of the heterogeneity of children’s speech genetic versus environmental origin) and for their po-
sound disorders, the description and classification of tential social and vocational costs, which for some indi-
these disorders have been attempted from a variety of viduals can continue throughout life.
perspectives, with persisting controversy as a predictable Within the Speech Disorders Classification System,
result. Nonetheless, one distinction that has garnered the major categories of speech delay and residual speech
relatively universal support is the division of children’s errors are further divided according to suspected etio-
speech disorders into those that are developmental (with logical factors or developmental pattern. Five subtypes
onset in early or middle childhood, e.g., before age 9) of speech delay are postulated in relation to the follow-
and those that are nondevelopmental (occurring after ing possible causes: genetic transmission, early history of
that time period and resulting from known causes). recurrent otitis media with e¤usion (Shriberg et al., 2000),
Developmental disorders have received substantially motor speech involvement associated with developmental
more research attention to date. apraxia of speech (Shriberg, Aram, and Kwiatkowski,
A second widely accepted distinction separates devel- 1997), motor speech involvement associated with mild
opmental disorders with known causes from those with- dysarthria, and developmental psychosocial involvement.
out. For developmental speech disorders of known In each case the etiological factor is considered dominant
causes in children, the terminology has been relatively in a mechanism that is suspected to be multifactorial in
stable and has typically referenced etiological factors nature. Two subtypes of residual speech errors are pro-
(e.g., speech disorders due to mental retardation, cleft posed, those found in association with a documented
palate). In contrast, the terminology for children’s history of speech delay (residual error-A) and those for
speech disorders of unknown origin is less stable, re- which no previous history of speech disorder was
flecting uncertainty about their nature and origin. Dur- reported (residual error-B) (e.g., Shriberg et al., 2001).
ing the past 30 years, commonly used terms have Ongoing research is aimed at increasing understanding
included functional articulation disorders, phonological of the causal, developmental, and cognitive processing
disorders (Locke, 1983), articulation and phonological mechanisms underlying each of these five subtypes.
disorders (Bernthal and Bankson, 1998), and persistent The classification description described by Dodd
sound system disorders (Shelton, 1993). (1995) is well motivated from theoretical perspectives
Speech Sound Disorders in Children: Description and Classification 219
and based on processing accounts of child speech dis- In the most recent DSM-IV classification, phonologi-
orders, but has been less thoroughly validated than the cal disorders is defined by the failure to use speech
Speech Disorders Classification System. Nontheless, sounds that are expected given the child’s age and dia-
Dodd’s classification system has been supported by lect. Although subtypes are not described, the American
studies that examine characteristics of clinical popula- Psychiatric Association’s description of the category
tions (Dodd, 1995), error patterns across languages (e.g., phonological disorders acknowledges that errors may
Fox and Dodd, 2001), bilingual children’s generalization reflect di‰culties in peripheral production as well as
patterns (Holm and Dodd, 2001), and treatment e‰cacy more abstract di‰culties in the child’s representation
(Dodd and Bradford, 2000). Thus, its empirical support and use of the sound system of the target language.
is growing rapidly. It is intended primarily to be used Under comments on di¤erential diagnosis, phonological
to aid in di¤erential diagnosis and clinical manage- disorders is described as a possible secondary diagnosis
ment, and was proposed as a system that uniquely com- when speech errors in excess of expectations are noted
bined four historical approaches to classifying speech in association with disorders that might be considered
disorders. These four approaches were based on age at known causes for speech di‰culties (viz., mental re-
onset, severity, causal and maintenance factors, and de- tardation, hearing impairment or other sensory deficit,
scription of symptoms, respectively. speech motor deficit, and severe environmental depriva-
Dodd’s classification system recognizes five subtypes: tion). Speech di‰culties that may be associated with the
articulation disorder, delayed phonological acquisition, term ‘‘Developmental (or Childhood) Apraxia of
consistent deviant disorder, inconsistent disorder, and Speech’’ are addressed neither in the DSM-IV criteria
other. Within this system, an articulation disorder is nor as a subclassification, although that term is de-
defined as inability to produce an undistorted version of scribed as a possible label for some forms of phonologi-
a speech sound or sounds that are expected, given the cal disorder in DSM-IV-TR—a revision designed to
child’s age. English sounds that are often a¤ected in such increase the currency of the DSM without changing the
disorders include /s/, /r/, and the interdental fricatives. actual classificatory categories.
This label is applied regardless of whether the cause is an Classifications of children’s speech disorders are
anatomical anomaly or is unknown. Delayed phonolog- encumbered by demands that they address numerous
ical acquisition is defined in cases where a child’s speech audiences and unresolved controversies. Among the
errors are consistent with those seen in younger, nor- current audiences to be served are clinicians, researchers,
mally developing children. Consistent deviant disorder is and administrators. Each of the three classifications
the label applied when a child demonstrates a reduced described here addresses those audiences to a di¤erent
variety of syllable structure use as well as errors that are degree. One of the unresolved controversies that has
atypical of those seen in normal development. been addressed to some degree by each is the status of
Inconsistent disorder is identified when a child’s clinically postulated entities such as developmental or
productions are inconsistent in ways that cannot be childhood apraxia of speech. A second controversy, and
explained by complex phonological rules or the e¤ects of one that is being addressed by Shriberg and colleagues,
linguistic load on production. Ozanne (1995) described a relates to how child speech sound disorders should be
study suggesting that inconsistent disorder represents conceptualized in relation to other communication dis-
one subgroup associated with developmental verbal orders that frequently co-occur, but are associated with
dyspraxia (the condition referred to by Shriberg and causal mechanisms that are more ill-defined than those
others as developmental [or childhood] apraxia of for conditions that fall under child speech disorders with
speech). Inconsistent disorder is operationally defined known causes (e.g., hearing loss, cleft palate). Two that
using a 25-item word list. The child is asked to produce are of particular interest are specific language impair-
each word three times, with inconsistency noted when ment (Shriberg, Tomblin, and McSweeney, 1999) and
the child produces at least ten of the words di¤erently stuttering (Guitar, 1998). In addition, classifications are
on two of the three elicited productions. Dodd’s ‘‘other’’ ideally consistent with developmental as well as psycho-
category encompasses suspected motor speech disorders. logical processing accounts of the manifest behaviors
The DSM-IV-TR (American Psychiatric Association, associated with child speech disorders. The accounts of
2000) classification represents the most streamlined clas- Shriberg, Dodd, and their colleagues appear to be pur-
sification of children’s speech disorders and one that is suing these challenging issues.
perhaps more familiar than others to a broad range of
speech pathologists, who use it for billing purposes, and —Rebecca J. McCauley
non-speech-language pathologists who come in contact References
with children with childhood speech disorders. Within
this classification, Phonological Disorders 315.39 (for- American Psychiatric Association. (1994). Diagnostic and sta-
merly Developmental Articulation Disorders) is nested tistical manual of mental disorders—Fourth edition. Wash-
ington, DC: Author.
within Communication Disorders. Communication Dis- American Psychiatric Association. (2000). Diagnostic and sta-
orders, in turns, falls under the relatively large category tistical manual of mental disorders—Fourth edition, Text
Disorders Usually First Diagnosed in Infancy, Child- revision (DSM-IV-TR). Washington, DC: American Psy-
hood, or Adolescence. This category includes, among chiatric Publishing.
others, mental retardation, learning disorders (learning Bernthal, J. E., and Bankson, N. W. (1998). Articulation and
disabilities), and pervasive developmental disorders. phonological disorders. Boston: Allyn and Bacon.
220 Part II: Speech
Dodd, B. (1995). Procedures for classification of subgroups of Shriberg, L. D., and Kwiatkowski, J. (1994b). Developmental
speech disorder. In B. Dodd (Ed.), Di¤erential diagnosis and phonological disorders: II. Short-term speech-sound nor-
treatment of children with speech disorders (pp. 49–64). San malization. Journal of Speech and Hearing Research, 37,
Diego, CA: Singular Publishing Group. 1127–1150.
Dodd, B., and Bradford, A. (2000). A comparison of three Shriberg, L. D., and Kwiatkowski, J. (1994c). Developmental
therapy methods for children with di¤erent types of devel- phonological disorders: II. Long-term speech-sound nor-
opmental phonological disorders. International Journal of malization. Journal of Speech and Hearing Research, 37,
Language and Communication Disorders, 35, 189–209. 1151–1177.
Fox, A., and Dodd, B. (2001). Phonologically disordered Shriberg, L. D., Tomblin, J. B., and McSweeney, J. L. (1999).
German-speaking children. American Journal of Speech- Prevalence of speech delay in 6-year-old children and
Language Pathology, 10, 291–307. comorbidity with language impairment. Journal of Speech,
Guitar, B. (1998). Stuttering: An integrated approach to its Language, and Hearing Research, 42, 1461–1481.
nature and treatment (2nd ed.). Baltimore: Williams and
Wilkins. Further Readings
Holm, A., and Dodd, B. (2001). Comparison of cross language
generalisation following speech therapy. Folia Phoniatrica Felsenfeld, S., Broen, P. A., and McGue, M. (1992). A 28-year
et Logopaedica, 53, 166–172. follow-up of adults with a history of moderate phonological
Locke, J. L. (1983). Clinical phonology: The explanation and disorder: Linguistic and personality results. Journal of
treatment of speech sound disorders. Journal of Speech and Speech and Hearing Research, 35, 1114–1125.
Hearing Disorders, 48, 339–341. Felsenfeld, S., Broen, P. A., and McGue, M. (1994). A 28-year
Ozanne, A. (1995). The search for developmental verbal dys- follow-up of adults with a history of moderate phonological
praxia. In B. Dodd (Ed.), Di¤erential diagnosis and treat- disorder: Educational and occupational results. Journal of
ment of children with speech disorders (pp. 231–247). San Speech and Hearing Research, 37, 1341–1353.
Diego, CA: Singular Publishing Group. Goodyer, I. M. (2001). Language di‰culties and psychopa-
Shelton, R. L. (Ed.). (1993). Persistent sound system disorder: thology. In D. V. M. Bishop and L. B. Leonard (Eds.),
Nature and treatment. Seminars in Speech and Language, Speech and language impairments in children: Causes, char-
14. acteristics, intervention, and outcome (pp. 227–244). Phila-
Shriberg, L. D. (1994). Five subtypes of developmental pho- delphia: Taylor and Francis.
nological disorders. Clinics in Communication Disorders, 4, Holm, A., and Dodd, B. (1999). An intervention case study of
38–53. a bilingual child with phonological disorder. Child Lan-
Shriberg, L. D. (1999). Emerging profiles for five phonological guage Teaching and Therapy, 15, 139–158.
disorders. Presented at the Child Phonology Conference, Holm, A., Ozanne, A., and Dodd, B. (1997). E‰cacy of inter-
Bangor, Wales. vention for a bilingual child making articulation and pho-
Shriberg, L. D., Aram, D., and Kwiatkowski, J. (1997a). nological errors. International Journal of Bilingualism, 1,
Developmental apraxia of speech: I. Descriptive perspec- 55–69.
tives. Journal of Speech, Language, and Hearing Research, So, L., and Dodd, B. (1994). Phonologically disordered
40, 273–285. Cantonese-speaking children. Clinical Linguistics and Pho-
Shriberg, L. D., Aram, D., and Kwiatkowski, J. (1997b). netics, 8, 235–255.
Developmental apraxia of speech: II. Toward a diagnostic Stackhouse, J., and Wells, B. (1997). Children’s speech and lit-
marker. Journal of Speech, Language, and Hearing Re- eracy di‰culties. London: Whurr.
search, 40, 286–312. Zhu, H., and Dodd, B. (2000). Putonghua (modern standard
Shriberg, L. D., Aram, D., and Kwiatkowski, J. (1997c). Chinese)-speaking children with speech disorders. Clinical
Developmental apraxia of speech: III. A subtype marked by Linguistics and Phonetics, 14, 15–191.
inappropriate stress. Journal of Speech, Language, and
Hearing Research, 40, 313–337. Stuttering
Shriberg, L. D., Austin, D., Lewis, B. A., McSweeny, J. L.,
and Wilson, D. L. (1997). The Speech Disorders Classifica-
tion System (SDCS): Extensions and lifespan reference data. Stuttering is a developmental disorder of communication
Journal of Speech, Language, and Hearing Research, 40,
723–740.
that a¤ects approximately 5% of children born in the
Shriberg, L. D., Flipsen, P., Jr., Karlsson, H. B., and United States and Western Europe. Children are at
McSween, J. L. (2001). Acoustic phenotypes for speech- highest risk for beginning to stutter between their second
genetic studies: An acoustic marker for residual /er/ and fourth birthdays. The risk decreases gradually
distortions. Clinical Linguistics and Phonetics, 15, 631– thereafter, with few onsets occurring after 9 or 10 years
650. of age (Andrews and Harris, 1964). The percentage of
Shriberg, L. D., Friel-Patti, S., Flipsen, P., Jr., and Brown, older children, adolescents, and adults who stutter is
R. L. (2000). Otitis media, fluctuant hearing loss, and much lower, about 0.5%–1.0% (Andrews, 1984; Blood-
speech-language outcomes: A preliminary structural equa- stein, 1995), and the discrepancy between the percentage
tion model. Journal of Speech, Language, and Hearing Re- of children a¤ected (i.e., incidence) and the percentage of
search, 43, 100–120.
older children and adults who stutter (i.e., prevalence)
disorders: II. A diagnostic classification system. Journal of indicates that 75%–90% of the children who begin to
Speech and Hearing Disorders, 47, 226–241. stutter stop. Complete, untreated remissions of stuttering
Shriberg, L. D., and Kwiatkowski, J. (1994a). Developmental are most likely to occur within 2 years of onset (Andrews
phonological disorders: I. A clinical profile. Journal of and Harris, 1964; Yairi and Ambrose, 1999; Mansson,
Speech and Hearing Research, 37, 1100–1126. 2000), with decreasing frequency after that.
Stuttering 221
Most of the data on the epidemiology of stuttering guistic processes responsible for everyday speech errors
have been obtained from cross-sectional surveys that are also responsible for developmental stuttering; motor
asked informants if they or family members currently control theories, which link stuttering to sensorimotor or
stutter or had ever stuttered. The credibility of such data speech motor processes (e.g., Neilson and Neilson,
is compromised by a number of methodological weak- 1987); and multifactor theories, which attribute stutter-
nesses (Yairi, Ambrose, and Cox, 1996). Prospective, ing to an interplay of the cognitive, linguistic, motor,
longitudinal studies employing trained examiners have and a¤ective processes involved in spoken language
been completed in England (Andrews and Harris, 1964) (e.g., Perkins, Kent, and Curlee, 1991; Smith and Kelly,
and Denmark (Mansson, 2000). The incidence (5.0%) 1997).
and remission rate (>75%) reported by both studies were Adult stuttering typically involves various behaviors
remarkably similar, despite substantial di¤erences in and a¤ective-cognitive reactions that a¤ect, in varying
their designs and the populations studied. degrees, interpersonal communication, vocational op-
The incidence, prevalence, and remission or persis- portunities, and personal-social adjustment. Frequent
tence of stuttering are a¤ected by sex and family his- repetitions and prolongations of sounds and syllables
tories of stuttering. More than two-thirds of the children and blockages of speech occur that cannot be readily
who stutter have first-, second-, or third-degree relatives controlled (Perkins, 1990). These speech disruptions are
who currently or once stuttered (Ambrose, Yairi, and often accompanied by facial grimaces and tremors, dis-
Cox, 1993). Like most speech-language disorders, stut- rhythmic phonation, and extraneous bodily movements
tering a¤ects more males than females, with about twice that seem to involve muscle tension and excessive e¤ort.
as many young male preschoolers a¤ected as females, a Stuttering does not occur randomly, but as if con-
ratio that increases to four or more males to every fe- strained by various linguistic variables (Ratner, 1997).
male among adults (Ambrose, Yairi, and Cox, 1993). For example, it occurs more often than chance would
Similar ratios have been reported in other countries suggest on initial words of phrases, clauses, and sen-
and cultures (Bloodstein, 1995; Ambrose, Cox, and tences, on stressed syllables, and on longer, less famil-
Yairi, 1997; Mansson, 2000). The increase in male- iar words. With the exception of the sentence-initial
female ratio with age reflects, in part, higher rates of position, these are the same loci that attract the speech
remission among females (Ambrose, Cox, and Yairi, errors of nonstuttering speakers (Fromkin, 1993), sug-
1997), whereas family histories of remission and persis- gesting that the same linguistic variables a¤ect both
tence are linked, respectively, to untreated remissions of groups’ speech.
stuttering within 2 years of onset or its persistence for 3 The frequency and severity of adults’ stuttering may
or more years (Ambrose, Cox, and Yairi, 1997). vary substantially across time, social settings, and con-
Findings from family pedigree studies are consistent texts, but it often occurs on specific words (e.g., the per-
with the vertical transmission (i.e., generation to gener- son’s name) and in situations where stuttering has
ation) of a genetic susceptibility or predisposition to frequently occurred in the past (Bloodstein, 1995). Thus,
stutter but are inconsistent with autosomal dominant, adults’ prior stuttering experiences may lead them to
recessive, or sex-linked transmissions (Kidd, Heimbuch, avoid specific words and speaking situations and to de-
and Records, 1981; Yairi, Ambrose, and Cox, 1996). velop attitudes and beliefs about speaking, stuttering,
Twin studies have found that stuttering occurs in both and themselves that can be more disabling or handicap-
members of monozygotic twin pairs much more often ping than their stuttered speech.
than in same-sex, dizygotic twins (e.g., Howie, 1981); Stuttering is substantially reduced, sometimes elimi-
however, the lack of concordance of stuttering in some nated, in a number of verbal activities (Bloodstein,
monozygotic twin pairs indicates that both genetic and 1995). For example, most adults stutter little or not at
environmental factors are involved in some, if not all, all when singing, speaking while alone or with pets
cases. Segregation analyses suggest that a single major or infants, reading or reciting in unison with others,
locus is a primary contributor to stuttering phenotypes pacing speech with a rhythmical stimulus, and reading
but that other genes are involved in determining whether or speaking with auditory masking. Stuttering reappears,
or not stuttering persists (Ambrose, Cox, and Yairi, however, as soon as such activities end.
1997). Research centers in the United States and Europe Much of what is known about adults who stutter has
are currently engaged in linkage analyses designed to been obtained by studies that compared stuttering and
identify the specific genes involved. nonstuttering speakers’ motor, sensory, perceptual, and
Current etiological theories reflect diverse beliefs cognitive abilities, as well as the two groups’ a¤ective
about the nature of stuttering, its origins, and the levels and personality characteristics. A variety of di¤erences
of description that will provide the most useful scientific have been found, but usually with substantial overlaps in
explanation. However, no theory of origin has achieved the data of the two groups. In addition, it is seldom clear
general acceptance in the field. There are, for example, if or how such di¤erences might be functionally related
cognitive theories, such as Bloodstein’s (1997) anticipa- to stuttering. A comprehensive review of this work led
tory struggle theory, which hypothesizes that a child’s Bloodstein (1995) to conclude that the two groups are
belief that speech is di‰cult elicits tension that causes similar, for the most part, except when they speak. Re-
stuttering when he or she tries to speak; psycholinguistic cent advances in brain imaging technology, however,
theories (e.g., Ratner, 1997), which propose that the lin- have allowed investigators to compare the brain activity
222 Part II: Speech
patterns of the two groups while the subjects read aloud, information, and support among members, have be-
and the brain activity of stuttering speakers during stut- come an increasingly important component to success-
tered and fluent speech. ful, long-term management of stuttering in adults in the
There are no reliable di¤erences in cerebral blood United States.
flow of stuttering and nonstuttering adult men when they See also speech disfluency and stuttering in
are not speaking (Ingham et al., 1996; Braun et al., children.
1997). A series of H2 15 O positron emission tomography
(PET) of the two speaker groups during solo and choral —Richard F. Curlee
reading conditions found greater right than left hemi- References
sphere activity in the supplementary motor and pre-
motor areas (BA 6), anterior insula, and cerebellum and Ambrose, N., Yairi, E., and Cox, N. (1993). Genetic aspects of
reduced activity in primary auditory areas (BA 41/42) of early childhood stuttering. Journal of Speech and Hearing
stuttering speakers in the solo condition, but exactly the Research, 36, 701–706.
opposite pattern of activity in nonstuttering speakers Ambrose, N., Cox, N., and Yairi, E. (1997). The genetic basis
of persistence and recovery in stuttering. Journal of Speech,
(Ingham, 2001). These di¤erences decreased, however, Language, and Hearing Research, 40, 567–580.
when fluent speech was induced in stuttering speakers by Andrews, G. (1984). The epidemiology of stuttering. In R. F.
having them read aloud in unison (i.e., choral reading) Curlee and W. H. Perkins (Eds.), Nature and treatment of
with a recording. stuttering: New directions (1st ed.). San Diego, CA: College-
A follow-up PET study of subsets of the same two Hill Press.
groups was conducted while subjects imagined they were Andrews, G., and Harris, M. (1964). The syndrome of stutter-
reading aloud (Ingham et al., 2000). Stuttering subjects ing. Clinics in Developmental Medicine, 17.
were instructed to imagine they were stuttering in the Bloodstein, O. (1995). A handbook on stuttering (5th ed.). San
solo condition and fluent in the choral condition. The Diego, CA: Singular Publishing Group.
patterns of activity that had occurred when each group Bloodstein, O. (1997). Stuttering as an anticipatory struggle re-
action. In R. F. Curlee and G. M. Siegel (Eds.), Nature and
read aloud were similar to those observed when speakers treatment of stuttering: New directions (2nd ed.). Boston:
merely imagined they were reading. As such studies Allyn and Bacon.
continue, a much better understanding of the brain- Braun, A. R., Varga, M., Stager, S., Schulz, G., Selbie, S.,
behavior substrates of stuttered and stutter-free or nor- Maisog, J. M., et al. (1997). Altered patterns of cerebral
mal speech may be achieved. activity during speech and language production in devel-
Adults who have been stuttering most of their lives opmental stuttering. An H2 15 O positron emission tomog-
often develop various situational fears, social anxieties, raphy study. Brain, 120, 761–784.
lowered expectations, diminished self-esteem, and an Fromkin, V. (1993). Speech production. In J. Berko Gleason
array of escape and avoidance behaviors. Prior to ini- and N. Bernstein Ratner (Eds.), Psycholinguistics. Austin,
tiating treatment, clinicians should obtain a thorough TX: Harcourt, Brace and Jovanovich.
Howie, P. (1981). Concordance for stuttering in monozygotic
history of an adult’s stuttering and prior treatment,
and dizygotic twin pairs. Journal of Speech and Hearing
including major current concerns, treatment expecta- Research, 24, 317–321.
tions, and goals, and should assess attitudes, a¤ective Ingham, R. J. (2001). Brain imaging studies of developmental
reactions and behaviors, and self-concepts that may re- stuttering. Journal of Communication Disorders, 34, 493–516.
quire treatment. Analyses of samples recorded in various Ingham, R. J., Fox, P. T., Ingham, J. C., and Zamarripa, F.
speaking situations document the type, frequency, dura- (2000). Is overt stuttering a prerequisite for the neural acti-
tion, and overall severity of stuttering. Such information vations associated with chronic developmental stuttering?
allows clinicians to select appropriate treatment strat- Brain and Language, 75, 163–194.
egies, track progress, and determine when treatment Ingham, R. J., Fox, P. T., Ingham, J. C., Zamarripa, F., Jer-
objectives have been achieved. Current treatment strat- abek, P., and Cotton, J. (1996). A functional lesion investi-
gation of developmental stuttering using positron emission
egies focus either on modifying adults’ a¤ective, cogni- tomography. Journal of Speech and Hearing Research, 39,
tive, and behavioral reactions to stuttering (e.g., Prins, 1208–1227.
1993; Manning, 1996) or on learning speech production Kidd, K., Heimbuch, R., and Records, M. (1981). Vertical
techniques (i.e., fluency training) to reduce or eliminate transmission of susceptibility to stuttering with sex-modified
stuttering (e.g., Neilson and Andrews, 1993; Onslow, expression. Proceedings of the National Academy of Science,
1996). Most clinicians apparently prefer a combined 78, 606–610.
strategy to manage the constellation of speech, a¤ective, Manning, W. H. (1996). Clinical decision making in the diagno-
and cognitive symptoms commonly presented by adults sis and treatment of fluency disorders. Albany, NY: Delmar.
who stutter. No well-controlled clinical trials of stut- Mansson, H. (2000). Childhood stuttering: Incidence and de-
tering treatments have been reported, but some relapse velopment. Journal of Fluency Disorders, 25, 47–57.
Neilson, M., and Andrews, G. (1993). Intensive fluency train-
in treatment gains is common. It is generally agreed, ing of chronic stutterers. In R. F. Curlee (Ed.), Stuttering
therefore, that complete, permanent recovery from and related disorders of fluency. New York: Thieme.
chronic stuttering is rare when stuttering persists into Neilson, M. D., and Neilson, P. D. (1987). Speech motor con-
adult life, regardless of the treatment employed. Conse- trol and stuttering: A computational model of adaptive
quently, local self-help groups of the National Stutter- sensory motor processing. Speech Communication, 6, 325–
ing Association, which promote sharing of experience, 333.
Transsexualism and Sex Reassignment: Speech Di¤erences 223
Onslow, M. (1996). Behavioral management of stuttering. San and syllable production: A PET performance-correlation
Diego, CA: Singular Publishing Group. analysis. Brain, 123, 1985–2004.
Perkins, W. H. (1990). What is stuttering? Journal of Speech Gregory, H. (1997). The speech-language pathologist’s role in
and Hearing Disorders, 55, 370–382. stuttering self-help groups. Seminars in Speech and Lan-
Perkins, W. H., Kent, R. D., and Curlee, R. F. (1991). A guage, 18, 401–409.
theory of neuropsycholinguistic function in stuttering. Hedges, D. W., Umar, F., Mellon, C. D., Herrick, L. C.,
Journal of Speech and Hearing Research, 34, 734–752. Hanson, M. L., and Wahl, M. J. (1995). Direct comparison
Prins, D. (1993). Management of stuttering: Treatment of of the family history method and the family study method
adolescents and adults. In R. F. Curlee (Ed.), Stuttering and using a large stuttering pedigree. Journal of Fluency Dis-
related disorders of fluency. New York: Thieme. orders, 20, 25–33.
Ratner, N. (1997). Stuttering: A psycholinguistic perspective. Ingham, R. J. (1993). Stuttering treatment e‰cacy: Paradigm
In R. F. Curlee and G. M. Siegel (Eds.), Nature and treat- dependent or independent? Journal of Fluency Disorders, 18,
ment of stuttering: New directions (2nd ed.). Boston: Allyn 133–149.
and Bacon. Ingham, R. J., Fox, P. T., Ingham, J. C., Collins, J., and
Smith, A., and Kelly, E. (1997). Stuttering: A dynamic, multi- Pridgen, S. (2000). TMS in developmental stuttering and
factorial model. In R. F. Curlee and G. M. Siegel (Eds.), Tourette’s syndrome. In M. S. George and R. H. Belmaker
Nature and treatment of stuttering: New directions (2nd ed.). (Eds.), Transcranial magnetic stimulation (TMS) in neuro-
Boston: Allyn and Bacon. psychiatry. New York: American Psychiatric Press.
Yairi, E., and Ambrose, N. (1999). Early childhood stuttering: Janssen, P., Kraaiment, F., and Brutten, G. (1990). Rela-
I. Persistence and recovery rates. Journal of Speech, Lan- tionship between stutterers’ genetic history and speech-
guage, and Hearing Research, 42, 1097–1112. associated variables. Journal of Fluency Disorders, 15,
Yairi, E., Ambrose, N., and Cox, N. (1996). Generics of stut- 39–48.
tering: A critical review. Journal of Speech and Hearing Kolk, H., and Postma, A. (1997). Stuttering as a covert repair
Research, 39, 771–784. phenomenon. In R. F. Curlee and G. M. Siegel (Eds.),
Nature and treatment of stuttering: New directions (2nd ed.).
Boston: Allyn and Bacon.
Further Readings Kully, D., and Langevin, M. J. (1999). Intensive treatment for
stuttering adolescents. In R. F. Curlee (Ed.), Stuttering and
Adams, M. R. (1990). The demands and capacities model: I. related disorders of fluency (2nd ed.). New York: Thieme.
Theoretical elaborations. Journal of Fluency Disorders, 15, Ludlow, C., and Braun, A. (1993). Research evaluating the use
135–141. of neuropharmacological agents for treating stuttering:
Boberg, E., and Kully, D. (1994). Long-term results of an Possibilities and problems. Journal of Fluency Disorders, 18,
intensive treatment program for adults and adolescents 169–182.
who stutter. Journal of Speech and Hearing Research, 37, Onslow, M., and Packman, A. (1997). Designing and imple-
1050–1059. menting a strategy to control stuttered speech in adults. In
Brady, J. P. (1991). The pharmacology of stuttering: A critical R. F. Curlee and G. M. Siegel (Eds.), Nature and treatment
review. American Journal of Psychiatry, 148, 1309–1316. of stuttering: New directions (2nd ed.). Boston: Allyn and
Caruso, A. J., Chodzko-Zajko, W. J., Bidinger, D. A., and Bacon.
Sommers, R. K. (1994). Adults who stutter: Responses to Prins, D. (1991). Theories of stuttering as event and disorder:
cognitive stress. Journal of Speech and Hearing Research, Implications for speech production processes. In H. F. M.
37, 746–754. Peters, W. Hulstijn, and C. W. Starkweather (Eds.), Speech
Cordes, A. K. (1998). Current status of the stuttering treatment motor control and stuttering. Amsterdam: Elsevier.
literature. In A. K. Cordes and R. J. Ingham (Eds.), Treat- Prins, D. (1993). Models for treatment e‰cacy studies of adult
ment e‰cacy research: A search for empirical bases. San stutterers. Journal of Fluency Disorders, 18, 333–349.
Diego, CA: Singular Publishing Group. Salmelin, R., Schnitzler, A., Schmitz, F., and Freund, H. J.
Cox, N. (1988). Molecular genetics: The key to the puzzle of (2000). Single word reading in developmental stutterers and
stuttering? ASHA, 4, 36–40. fluent speakers. Brain, 123, 1184–1202.
DeNil, L. F., Kroll, R. M., Kapur, S., and Houle, S. (1998). A Sandak, R., and Fiez, J. L. (2000). Stuttering: A view from
positron emission tomography study of silent and oral neuroimaging. Lancet, 356, 445–446.
single word reading in stuttering and nonstuttering adults. Webster, W. G. (1993). Hurried hands and tangled tongues.
Journal of Speech, Language, and Hearing Research, 43, Implications of current research for the management of
1038–1053. stuttering. In E. Boberg (Ed.), Neuropsychology of stutter-
Denny, M., and Smith, A. (1997). Respiratory and laryngeal ing. Edmonton: University of Alberta Press.
control in stuttering. In R. F. Curlee and G. M. Siegel
(Eds.), Nature and treatment of stuttering: New directions
(2nd ed.). Boston: Allyn and Bacon. Transsexualism and Sex Reassignment:
Folkins, J. W. (1991). Stuttering from a speech motor control
perspective. In H. F. M. Peters, W. Hulstijn, and C. W.
Speech Di¤erences
Starkweather (Eds.), Speech motor control and stuttering.
Amsterdam: Elsevier.
According to the Random House Dictionary, a trans-
Fox, P. T., Ingham, R. J., George, M. S., Mayberg, H., Ing-
ham, J. C., Roby, J., et al. (1997). Imaging human intra- sexual individual is ‘‘A person having a strong desire to
cerebral connectivity by PET during TMS. NeuroReport, 8, assume the physical characteristics and gender role of
2787–2791. the opposite sex; a person who has undergone hormone
Fox, P. T., Ingham, R. J., Ingham, J. C., Zamarripa, F., Xing, treatment and surgery to attain the physical character-
J.-H., and Lancaster, J. (2000). Brain correlates of stuttering istics of the opposite sex’’ (Flexner, 1987). Brown and
224 Part II: Speech
Rounsley (1996) explain, ‘‘Transsexuals are individuals component. Voice therapy for the female-to-male trans-
who strongly feel that they are, or ought to be, the op- sexual is virtually nonexistent. In fact, there appears to
posite sex. The body they were born with does not match be considerable agreement among researchers studying
their own inner conviction and mental image of who the treatment of the transsexual that voice therapy for
they are or want to be. . . . This dilemma causes them the female-to-male transsexual is unnecessary because
intense emotional distress and anxiety and often inter- lowering of the fundamental frequency occurs automati-
feres with their day-to-day functioning’’ (p. 6). cally as a result of androgens administered to the female-
Historically, examples of transsexualism existed in to-male transsexual (Spencer, 1988; Colton and Casper,
Greek and Roman times, during the Middle Ages, and 1996). Van Borsel et al. (2000) conducted a two-part
in the Renaissance (Doctor, 1988). However, the first study of the voice problems of the female-to-male trans-
documented sex reassignment surgery is thought to have sexual. Part 1 was a survey of 16 individuals who had
been performed around 1923. It involved a man who been treated with androgens for at least 1 year by the
married at 20 but came to believe he should have been a Gent University Gender Team in Belgium. Question-
woman, and took the name of Lili (Hoyer, 1933). The naires indicated that 14 of the 16 respondents had expe-
most celebrated case in the United States was that of rienced a ‘‘lower’’ or ‘‘heavier’’ voice. The remaining
Christine Jorgensen, who grew up in New York City as a two reported that they had a lower-pitched voice before
male and had transsexual reassignment surgery per- treatment started. Only one of the subjects was not
formed in Denmark in 1952 (Jorgensen, 1967). pleased with his voice because of what he perceived as
The prevalence of transsexualism is di‰cult to deter- strain in speaking at a lower pitch. Fourteen indicated
mine. The United States does not have a national regis- that voice change was as important as sex reassignment
try to collect information from all possible sources that surgery, although 11 of the 16 did not consider the need
may deal with the transsexual person. Furthermore, for speech therapy important. The study confirmed the
some individuals may remain undiagnosed or may wish view that pitch is lowered as a result of androgen treat-
to remain ‘‘closeted,’’ or they may travel to foreign ment and appears to result in an acceptable male voice.
countries for sex reassignment surgery. Part 2 was a longitudinal study of the voice change
Data from other countries suggest that one in 30,000 of two female-to-male transsexuals who were adminis-
adult males and one in 100,000 adult females seek sex tered androgens. Acoustic measures of fundamental fre-
reassignment surgery. However, the overall prevalence quency, jitter, and shimmer were made of the sustained
figures are greater if one includes transsexuals who do vowel production of /a/ and the reading of a standard
not elect sex reassignment surgery. In the United States, paragraph. The measures for one subject were made
it is estimated that 6000–10,000 transsexuals had under- over 17 months and for the other subject over 13
gone sex reassignment surgery by 1988 (Brown and months. The results confirmed that the fundamental fre-
Rounsley, 1996). Spencer (1988) reported that in 1979, quency was substantially reduced for sustained vowel
more than 4000 U.S. citizens had undergone sex reas- production and reading, although not by more than one
signment surgery and about 50,000 were thought to be octave. Measures of jitter and shimmer were relatively
awaiting surgery. Estimates vary greatly concerning the unchanged over time.
number of male-to-female transsexuals compared to The administration of hormones for the male-to-
the number of female-to-male transsexuals, although female transsexual has little e¤ect on voice. Some studies
all estimates indicate that the number of male-to- have examined the male-to-female transsexual’s changes
female transsexuals exceeds the number of female-to- in fundamental frequency and its relationship to the
male transsexuals by as much as 3 or 4 to 1 (Oates identification of the voice as a female voice (Bradley et
and Dacakis, 1983; Doctor, 1988; Spencer, 1988; Wolfe al., 1978; Spencer, 1988; Dacakis, 2000; Gelfer and
et al., 1990). Schofield, 2000). Although there is some agreement that
The transsexual individual (also referred to as a fundamental frequency is most often perceived as a fe-
transgendered individual) who seeks therapy and pos- male voice at 155–160 Hz and above, it is not su‰cient
sibly surgery is often referred to a clinic specializing alone to identify the male-to-female transsexual as a fe-
in the treatment of gender dysphoria. Programs that male speaker (Bradley et al., 1978; Gelfer and Schofield,
are o¤ered through these clinics include psychological 2000). Mount and Salmon (1988) conducted a long-
counseling, hormone treatments, and other nonsurgical range study of a 63-year-old male-to-female transsexual
procedures, which may then lead to the final surgical who had undergone sex reassignment surgery. The indi-
reassignment surgery. Male-to-female surgery generally vidual was able to increase her speaking fundamental
requires a single operation and is less expensive than frequency after 4 months of therapy. However, she was
female-to-male surgery, which requires at least three not perceived as a female speaker until formant fre-
operations, is considerably more expensive, and is not quencies had increased, particularly F2 values. This was
associated with as good aesthetic and functional results achieved through the modification of resonance and ar-
as male-to-female surgery (Brown and Rounsley, 1988). ticulation. Gelfer and Schofeld (2000) conducted a study
The literature on therapy for the male-to-female of 15 male-to-female transsexuals with a control group
transsexual and the female-to-male transsexual repre- of six biological females and three biological males. All
sents two extremes. Therapy for the male-to-female subjects recorded the Rainbow Passage and produced
transsexual has focused on voice therapy as a major the isolated vowels /a/ and /i/. Twenty undergraduate
Transsexualism and Sex Reassignment: Speech Di¤erences 225
psychology majors served as listeners. The only signifi- (1983) includes vocabulary and language forms and uses
cant di¤erences among subjects were that the ‘‘Subjects videotapes of the male-to-female transsexual to teach the
perceived as female had a higher SFF [speaking funda- individual how to walk, sit down, and enter a room.
mental frequency] and a higher upper limit of SFF than Chaloner (1991) provides case histories, and uses role
subjects perceived as male’’ (p. 30). Although formant playing in group therapy to help the male-to-female
frequencies for /a/ and /i/ were not significantly di¤erent transsexual become more successful in ‘‘living the female
between the male-to-female transsexuals perceived as role’’ (p. 330). Future research on the assessment and
male and those perceived as female, the mean formant treatment of transgendered individuals should provide
frequencies for the perceived female speakers were all the clinician with a larger repertoire of approaches to
higher than those of the transsexual speakers judged to assist the transsexual individual in making the transition
be male. Gunzburger (1995) had six male-to-female to a di¤erent sexual role.
transsexual speakers record a list of Dutch words that
were also combined into prose. Subjects were asked to —John M. Pettit
read the material in a female manner and a male man-
ner. Acoustic analyses indicated that the central fre- References
quency of F3 was systematically higher in the female Batin, R. R. (1983). Treatment of the transsexual voice. In W.
version. Recordings of two male-to-female speakers that Perkins (Ed.), Voice disorders: Current therapy of commu-
were representative of a male speaker and a female nication disorders (pp. 63–66). New York: Thieme-Stratton.
speaker were played to 31 male and female naive lis- Bradley, R. C., Bull, G. L., Gore, C. H., and Edgerton, M. T.
teners, who were asked to identify the sex of the speaker. (1978). Evaluation of vocal pitch in male transsexuals.
The perceptual judgments supported the results of the Journal of Communication Disorders, 11, 443–449.
acoustic analyses. It appeared that the male-to-female Brown, M. L., and Rounsley, C. A. (1996). True selves. San
transsexual speakers judged to be female had F3 for- Francisco: Jossey-Bass.
mants more like those associated with the female voice. Chaloner, J. (1991). The voice of the transsexual. In M. Faw-
cus (Ed.), Voice disorders and their management (2nd ed.,
The shorter vocal tract typically found in females propp. 314–332). London: Chapman and Hall.
duces higher F3 formants than those of males, with a Colton, R. H., and Casper, J. K. (1996). Understanding voice
longer vocal tract (Peterson and Barney, 1952; Fant, problems (2nd ed., pp. 281–282). Baltimore: Lippincott,
1960). Gunzburger (1995) attributed these changes to a Williams and Wilkins.
decreased vocal cavity length in the perceived female Dacakis, G. (2000). Long-term maintenance of fundamental
male-to-female transsexual and pointed out that short- frequency increases in male-to-female transsexuals. Journal
ening the vocal tract can be accomplished through of Voice, 14, 549–556.
changes in articulation and retracting the corners of the de Bruin, M. D., Coerts, M. J., and Greven, A. J. (2000).
mouth (p. 347). Speech therapy in the management of male-to-female
According to Stemple, Glaze, and Gerdeman (1995), transsexuals. Folia Phoniatrica, 52, 220–227.
Doctor, R. F. (1988). Transvestites and transsexuals: Toward
the male-to-female transsexual not only has to increase a theory of cross-gender behavior. New York: Plenum
her fundamental frequency while being careful not to Press.
damage the vocal folds, but also has to learn to modify Fant, G. (1960). Acoustic theory of speech production. The
the resonance, inflection, and intonation to make articu- Hague: Mouton.
lation more precise, and to modify coughing, vocalized Flexner, S. B. (Ed.). (1987). Random House Dictionary of the
pauses, and throat clearing (p. 204). English Language, Unabridged (2nd ed.). New York: Ran-
Therapy for the female-to-male transsexual appears dom House.
to be less of an issue than therapy for the male-to-female Gelfer, M. P., and Schofield, K. J. (2000). Comparison of
transsexual. Many of the textbooks on voice disorders acoustic and perceptual measures of voice in male-to-female
include a discussion of the therapy needs for the male-to- transsexuals perceived as female versus those perceived as
male. Journal of Voice, 14, 22–33.
female transsexual but do not provide any details on Gunzburger, D. (1995). Acoustic and perceptual implications
procedures, techniques, or concerns for the clinician to of the transsexual voice. Archives of Sexual Behavior, 24,
consider in the therapy process. De Bruin, Coerts, and 339–348.
Greven (2000) provide the clinician with a detailed ap- Hoyer, N. (1933). Man into woman: An authentic record of a
proach, including specific goals and subgoals, to follow change of sex. New York: Dutton.
in therapy for the male-to-female transsexual. Among Jorgenson, C. (1967). Christine Jorgensen: A personal auto-
the major goals are minimizing chest resonance; mod- biography. New York: Bantam Books.
ifying intonation patterns, articulation, intensity, and Mount, K. H., and Salmon, S. J. (1988). Changing the vocal
rate; and feminizing laughing and coughing. In addition, characteristics of a postoperative transsexual patient: A
they address other verbal and nonverbal aspects of longitudinal study. Journal of Communication Disorders, 21,
229–238.
feminine communication such as gestures, movements, Oates, J. M., and Dacakis, G. (1983). Speech pathology con-
greetings, shaking hands, dress, and hairdo. The authors siderations in the management of transsexualism: A review.
briefly discuss laryngeal surgery but conclude that it only British Journal of Disorders of Communication, 18, 139–151.
results in raising the fundamental frequency (which is Peterson, G., and Barney, H. (1952). Control methods used in
not in itself su‰cient to guarantee a feminine voice) and the study of the vowels. Journal of the Acoustical Society of
that the results of this surgery are not predictable. Batin America, 24, 175–184.
226 Part II: Speech
Spencer, L. E. (1988). Speech characteristics of male-to-female bon dioxide in the blood). Many medical conditions
transsexuals: A perceptual and acoustic study. Folia Pho- can cause severe respiratory insu‰ciency requiring ven-
niatrica, 40, 31–42. tilatory support. Examples include cervical spinal cord
Stemple, J. C., Glaze, L. E., and Gerdeman, B. K. (1995). injury (rostral enough to impair diaphragm function),
Clinical voice pathology: Theory and management (2nd ed.).
muscular dystrophy, amyotrophic lateral sclerosis, and
Van Borsel, J., DeCuypere, G., Rubens, R., and Destaerke, B. chronic obstructive pulmonary disease.
(2000). Voice problems in female-to-male transsexuals. In- Several types of ventilator systems are available
ternational Journal of Language and Communication Dis- for individuals with respiratory insu‰ciency, includ-
orders, 35, 427–442. ing positive-pressure ventilators, negative-pressure ven-
Wolfe, V. I., Ratusnik, D. L., Smith, F. H., and Northrop, G. tilators, phrenic nerve pacers, abdominal pneumobelts,
(1990). Intonation and fundamental frequency in male-to- and rocking beds (Banner, Blanch, and Desautels, 1990;
female transsexuals. Journal of Speech and Hearing Dis- Hill, 1994; Levine and Henson, 1994). Positive-pressure
orders, 55, 43–50. ventilators operate by ‘‘pushing’’ air into the pulmonary
system for inspiration, whereas negative-pressure ven-
Further Readings tilators work to lower the pressure around the respira-
Andrews, M. L., and Schmidt, C. P. (1997). Gender presenta- tory system and expand it for inspiration. Phrenic nerve
tion: Perceptual and acoustical analyses of voice. Journal of pacers stimulate the phrenic nerves and cause the dia-
Voice, 11, 307–313. phragm to contract to generate inspiration. Abdominal
Boone, D. R., and McFarlane, S. C. (2000). The voice and pneumobelts displace the abdomen inward (by inflation
voice therapy (6th ed.). Boston: Allyn and Bacon. of a bladder) to push air out of the pulmonary system for
Brown, M., Perry, A., Cheesman, A. D., and Pring, T. (2000). expiration, and then allow the abdomen to return to
Pitch change in male-to-female transsexuals: Has phono- its resting position (by deflation of the bladder) for in-
surgery a role to play? International Journal of Language spiration. Rocking beds are designed to move an in-
and Communication Disorders, 35, 129–136.
dividual upward toward standing and downward toward
Case, J. L. (1991). Clinical management of voice disorders (2nd
ed.). Austin, TX: Pro-Ed. supine to drive inspiration and expiration, respectively,
Chivers, M. L., and Bailey, J. M. (2000). Sexual orientation of using gravitational force to displace the abdomen and
female-to-male transsexuals: A comparison of homosexual diaphragm. All of these systems are currently used
and nonhomosexual types. Archives of Sexual Behavior, 29, (Make et al., 1998); however, the most commonly used
259–278. one today and the one that the speech-language pathol-
Donald, P. J. (1982). Voice change surgery in the transsexual. ogist is most likely to encounter in clinical practice is
Head and Neck Surgery, 4, 433–437. the positive-pressure ventilator (Spearman and Sanders,
Feinbloom, D. H. (1976). Transvestites and transsexuals: 1990).
Mixed views. New York: Delecorte Press. The positive-pressure ventilator uses a positive-
Gross, M. (1999). Pitch-raising surgery in male-to-female
pressure pump to drive air through a tube into the pul-
transsexuals. Journal of Voice, 13, 246–250.
Gunzburger, D. (1993). An acoustic analysis and some per- monary system. The tube can be routed through (1) the
ceptual data concerning voice change in male-to-female larynx (in this case, it is called an endotracheal tube),
transsexuals. European Journal of Disorders of Communica- such as during surgery or acute respiratory failure; (2)
tion, 28, 13–21. the upper airway, via a nose mask, face mask, or
Kunachak, S., Prakunhungsit, S., and Sujjalak, K. (2000). mouthpiece (this is called noninvasive ventilation); or (3)
Thyroid cartilage and vocal fold reduction: A new phono- a tracheostoma (a surgically fashioned entry through the
surgical method for male-to-female transsexuals. Annals of anterior neck to the tracheal airway). The latter two
Otology, Rhinology, and Laryngology, 109, 10826–10829. modes of delivery are used in individuals who need long-
Rogers, A. (1993). Legal implications of transsexualism. term ventilatory support. With noninvasive positive-
Lancet, 341, 1085–1086.
pressure ventilation, speech is produced in a relatively
normal manner. That is, after inspiratory air from the
ventilator flows into the nose and/or mouth, expiration
Ventilator-Supported Speech begins and speech can be produced until the next in-
Production spiration is delivered. The situation is quite di¤erent,
however, when inspiratory air is delivered via a trache-
ostoma. In some cases it is not possible to produce
When breathing becomes di‰cult or impossible, it may speech with the ventilator-delivered air because the air is
be necessary to use a ventilator to sustain life. Usually not allowed to reach the larynx. This occurs when the
the need for a ventilator is temporary, such as during a tracheostomy tube, which is secured in the tracheostoma
surgical procedure. However, if breathing di‰culty is and provides a connection to the ventilator’s tubing, is
chronic, ventilatory support may be required for an configured so as to block airflow to the larynx. This is
extended period, sometimes a lifetime. The main indica- done by inflating a small cu¤ that surrounds the tube
tions for ventilatory support are respiratory insu‰ciency where it lies within the trachea. However, if the cu¤ is
resulting in hypoventilation (not enough gas moving into deflated (or if there is no cu¤ ), it is possible to speak
and out of the lungs), hypoxemia (not enough oxygen using the ventilator-delivered air. Because the air from
in the arterial blood), or hypercapnea (too much car- the ventilator enters below the larynx, speech can be
Ventilator-Supported Speech Production 227
ing normal speech production is positive (i.e., above

atmospheric pressure), generally low in amplitude (i.e.,
in the range of 5–10 cm H2 O), and relatively unchang-
ing throughout the expiratory phase of the breathing
cycle, tracheal pressure during ventilator-supported
speech production is generally fast-changing (i.e., rapidly
rising during the inspiratory phase of the ventilator cycle
and rapidly falling during the expiratory phase of the
cycle), high-peaked (approximately 35 cm H2 O in the
figure), and not always above atmospheric pressure (i.e.,
during the latter 2 s of the cycle in the figure). These
Figure 1. Inspiration and expiration during positive-pressure
waveforms can also be examined relative to the mini-
ventilation with a deflated tracheostomy tube cu¤. (From Hoit,
J. D., and Banzett, R. B. [1997]. Simple adjustments can mum pressure required to maintain vibration of the
improve ventilator-supported speech. American Journal of vocal folds for phonation (labeled Threshold Pressure in
Speech-Language Pathology, 6, 87–96, adapted from Tippett, the figure). From this comparison, it is clear that the
D. C., and Siebens, A. A. [1995]. Preserving oral communica- tracheal pressure associated with normal speech produc-
tion in individuals with tracheostomy and ventilator depen- tion exceeds this threshold pressure throughout the cycle
dency. American Journal of Speech-Language Pathology, 4, (expiratory phase), whereas the pressure associated with
55–61, Fig. 7. Reproduced with permission.) the ventilator-supported speech production is below
the threshold pressure for nearly half the cycle. This
latter observation largely explains why ventilator-
produced during both the inspiratory and the expiratory supported speech is characterized by short utterances
phase of the ventilator cycle (Fig. 1). During the inspir- and long pauses. The periods during which the pressure
atory phase, speech production competes with ventila- is above the voicing threshold pressure are relatively
tion because the ventilator-delivered air that flows short (compared with normal speech-related expirations)
through the larynx to produce speech is routed away and the periods during which the pressure is below that
from the pulmonary system, where gas exchange takes threshold are relatively long (compared with normal
place (i.e., oxygen is exchanged for carbon dioxide). speech-related inspirations). The reason why ventilator-
For this and other reasons, the act of speaking with supported speech is variable in loudness and voice qual-
a tracheostomy and positive-pressure ventilator is ity has to do with the fast-changing nature of the tra-
challenging, and the resultant speech often is quite cheal pressure waveform. The rapid rate at which the
abnormal. pressure rises and falls makes it impossible for the larynx
Positive-pressure ventilators can be adjusted to meet to make the adjustments necessary to produce a steady
each individual’s ventilatory needs. These adjustments voice loudness and quality.
typically are determined by the pulmonologist and exe- There are several strategies for improving ventilator-
cuted by the respiratory therapist. The most basic supported speech. One set of strategies is mechanical in
adjustments involve setting the tidal volume and breath- nature and involves modifying the tracheal pressure
ing rate, the product of which is the minute ventilation waveform. Specifically, speech can be improved if the
(the amount of air moved into or out of the pulmonary tracheal pressure stays above the voicing threshold
system per minute). These parameters are adjusted pri-
marily according to the client’s body size and breathing
comfort, and their appropriateness is confirmed by
blood gas measurements. Most ventilators allow adjust-
ment of other parameters, such as inspiratory duration,
magnitude and pattern of inspiratory flow, fraction of
inspired oxygen, and pressure at end-expiration (called
positive end-expiratory pressure, or PEEP), among
others. How these parameters are adjusted influences
ventilation and can also have a substantial influence on
speech production.
The speech produced with a tracheostomy and
positive-pressure ventilator is usually abnormal. Some of
its common features are short utterances, long pauses,
and variable loudness and voice quality (Hoit, Shea, and
Banzett, 1994). The mechanisms underlying these speech Figure 2. Schematic representation of tracheal pressure during
features are most easily explained by relating them to the normal speech production and ventilator-supported speech
tracheal pressure waveform associated with ventilator- production. The dashed line indicates the minimum pressure
supported speech. This waveform is shown schematically required to vibrate the vocal folds. (From Hoit, J. D. [1998].
in Figure 2, along with a waveform associated with nor- Speak to me. International Ventilator Users Network News, 12,
mal speech production. Whereas tracheal pressure dur- 6. Reproduced with permission.)
228 Part II: Speech
pressure for a longer portion of the ventilator cycle (to Levine, S., and Henson, D. (1994). Negative pressure ventilation.
increase utterance duration and decrease pause duration) In M. J. Tobin (Ed.), Principles and practice of mechanical
and if it changes less rapidly and does not peak as highly ventilation (pp. 393–411). New York: McGraw-Hill.
(to decrease variability of loudness and voice quality). Make, B. J., Hill, N. S., Goldberg, A. I., Bach, J. R., Criner,
G. J., Dune, P. E., et al. (1998). Mechanical ventilation be-
The tracheal pressure waveform can be modified by
yond the intensive care unit: Report of a consensus confer-
adjusting certain parameters on the ventilator (such as ence of the American College of Chest Physicians. Chest,
those mentioned earlier) or by adding external valves to 113, 289S–344S.
the ventilator system (e.g., Dikeman and Kazandjian, Spearman, C. B., and Sanders, H. G., Jr. (1990). Physical
1995; Hoit and Banzett, 1997). Ventilator-supported principles and functional designs of ventilators. In R. R.
speech can also be improved using behavioral strategies. Kirby, M. J. Banner, and J. B. Downs (Eds.), Clinical
Such strategies include the use of linguistic manipu- applications of ventilatory support (pp. 63–104). New York:
lations designed to hold the floor during conversation Churchill Livingstone.
(e.g., breaking for obligatory pauses at linguistically in-
appropriate junctures) and the incorporation of another Further Readings
sound source to supplement the laryngeal voicing source American Speech-Language-Hearing Association, Ad Hoc
(e.g., buccal or pharyngeal speech). Committee on Use of Specialized Medical Speech Devices.
Evaluation and management of the speech of a client (1993). Position statement and guidelines for the use of
with a tracheostomy and positive-pressure ventilator voice prostheses in tracheostomized persons with or without
involves a team approach, with the team usually con- ventilatory dependency. ASHA, 35(Suppl. 10), 17–20.
sisting of a speech-language pathologist working in Bach, J. R. (1992). Ventilator use by Muscular Dystrophy As-
collaboration with a pulmonologist and a respiratory sociation patients. Archives of Physical Medicine and Reha-
therapist. Such collaboration is critical because speech bilitation, 73, 179–183.
production and ventilation are highly interdependent in Bloch-Salisbury, E., Shea, S. A., Brown, R., Evans, K., and
a client who uses a ventilator. An intervention designed Banzett, R. B. (1996). Air hunger induced by acute increase
in Pco2 adapts to chronic elevation of Pco2 in ventilated
to improve speech will almost certainly influence venti- humans. Journal of Applied Physiology, 81, 949–956.
lation, and an adjustment to ventilation will most likely Gilgo¤, I. (1991). Living with a ventilator. Western Journal of
alter the quality of the speech. As an example, a speech- Medicine, 154, 619–622.
language pathologist might request that a client be Hoit, J. D., Banzett, R. B., and Brown, R. (1997). Improving
allowed to deflate his cu¤ so that he can speak. Cu¤ de- ventilator-supported speech. TELEROUNDS No. 39 Live
flation should not compromise ventilation as long as Satellite Transmission (Dec. 10, 1997). Tucson, AZ: Na-
tidal volume is increased appropriately (Bach and Alba, tional Center for Neurogenic Communication Disorders.
1990). By understanding the interactions between speech [Videotape available.]
production and ventilation, clinicians can implement in- Hoit, J. D., and Shea, S. A. (1996). Speech production and
terventions that optimize spoken communication with- speech with a phrenic nerve pacer. American Journal of
out compromising ventilation, thereby improving the Isaki, E., and Hoit, J. D. (1997). Ventilator-supported com-
overall quality of life in clients who use ventilators. munication: A survey of speech-language pathologists.
—Jeannette D. Hoit Journal of Medical Speech-Language Pathology, 5, 263–272.
Kirby, R. R., Banner, M. J., and Downs, J. B. (Eds.). (1990).
References Clinical applications of ventilatory support. New York:
Churchill Livingstone.
Bach, J. R., and Alba, A. S. (1990). Tracheostomy ventilation: Mason, M. (Ed.). (1993). Speech pathology for tracheostomized
A study of e‰cacy with deflated cu¤s and cu¿ess tubes. and ventilator dependent patients. Newport Beach, CA:
Chest, 97, 679–683. Voicing!
Banner, M. J., Blanch, P., and Desautels, D. A. (1990). Me- Reeve, C. (1998). Still me. New York: Random House.
chanical ventilators. In R. R. Kirby, M. J. Banner, and J. B. Shea, S. A., Hoit, J. D., and Banzett, R. B. (1998). Competition
Downs (Eds.), Clinical applications of ventilatory support between gas exchange and speech production in ventilated
(pp. 401–503). New York: Churchill Livingstone. subjects. Biological Psychology, 49, 9–27.
Dikeman, K. J., and Kazandjian, M. S. (1995). Communication Sparker, A. W., Robbins, K. T., Nevlud, G. N., Watkins, C.,
and swallowing management of tracheostomized and ventila- and Jahrsdoerfer, R. (1987). A prospective evaluation of
tor-dependent adults. San Diego, CA: Singular Publishing speaking tracheostomy tubes for ventilator dependent
Group. patients. Laryngoscope, 97, 89–92.
Hill, N. S. (1994). Use of the rocking bed, pneumobelt, and Sternburg, L. L. (1982). Some adaptive compensations in speech
other noninvasive aids to ventilation. In M. J. Tobin (Ed.), control achieved after respiratory paralysis: Four cases.
Principles and practice of mechanical ventilation (pp. 413– Unpublished doctoral dissertation, Brandeis University,
425). New York: McGraw-Hill. Waltham, MA.
Hoit, J. D., and Banzett, R. B. (1997). Simple adjustments can Sternburg, L. L., and Sternburg, D. (1986). View from the see-
improve ventilator-supported speech. American Journal of saw. New York: Dodd, Mead.
Speech-Language Pathology, 6, 87–96. Tippett, D. C. (Ed.). (2000). Tracheostomy and ventilator de-
Hoit, J. D., Shea, S. A., and Banzett, R. B. (1994). Speech pendency: Management of breathing, speaking, and swal-
production during mechanical ventilation in tracheostom- lowing. New York: Thieme.
ized individuals. Journal of Speech and Hearing Research, Tobin, M. J. (Ed.). (1994). Principles and practice of mechani-
37, 53–63. cal ventilation. New York: McGraw-Hill.
Part III: Language
Agrammatism sentences with embedded clauses (‘‘The man greeted by
his wife was smoking a pipe’’) are harder to comprehend
than sentences with two conjoined sentences (‘‘The man
Agrammatism is a disorder that leads to di‰culties with was greeted by his wife and he was smoking a pipe’’)
sentences. These di‰culties can relate both to the correct (Goodglass et al., 1979; Caplan and Hildebrandt, 1988).
comprehension and the correct production of sentences. Agrammatic production has attracted much less at-
That these di‰culties occur at the sentence level is evi- tention than agrammatic comprehension. Symptoms of
dent from the fact that word comprehension and pro- agrammatic production have traditionally been assessed
duction can be relatively spared. by analyzing spontaneous speech (Goodglass and
Agrammatism occurs in many clinical populations. In Kaplan, 1983; Rochon, Sa¤ran, Berndt, and Schwartz,
patients with Wernicke’s aphasia, for instance, agram- 2000). Four types of symptoms of spontaneous speech
matism has been established both for comprehension have been established. (1) Reduced variety of grammat-
(Lukatela, Schankweiler, and Crain, 1995) and for ical form. If sentences are produced at all, they have lit-
production (Haarmann and Kolk, 1992). Agrammatic tle subordination or phrasal elaboration. (2) Omission of
comprehension has been demonstrated in patients with function words (articles, pronouns, auxiliaries, preposi-
Parkinson’s disease (Grossman et al., 2000), Alzheimer’s tions, and the like) and inflections. (3) Omission of main
disease (Waters, Caplan, and Rochon, 1995), and in verbs. (4) A slow rate of speech. Whereas these symp-
children with specific language disorders (Van-der-Lely toms have been established in English-speaking subjects,
and Dewart, 1986). Agrammatic comprehension has similar symptoms occur in many other languages (Menn
even been demonstrated in normal subjects processing and Obler, 1990). A number of studies have attempted to
under stressfull conditions (Dick et al., 2001). However, elicit production of grammatical morphology and word
agrammatism has been studied most systematically in order in agrammatic patients. A complicating factor is
patients with Broca’s aphasia, and it is this group this that there are systematic di¤erences between spontane-
review will focus on. ous speech and elicited speech. In particular, function
Symptoms of agrammatic comprehension are typi- word omission is less frequent in elicited speech and
cally assessed by presenting a sentence to the subject and function word substitution is more frequent (Hofstede
asking the subject to pick from a number of pictures the and Kolk, 1994). The following symptoms have been
one depicting the proper interpretation of the sentence. observed on elicitation tests. (1) Grammatical word
Another procedure is to ask subjects to act out the order is impaired (Sa¤ran, Schwartz, and Marin, 1980).
meaning of the sentence with the help of toy figures. The (2) It is more impaired in embedded clauses than in main
main symptoms thus established are the following: (1) clauses (Kolk and van Grunsven, 1985). (3) Inflection
Sentences in which the two thematic roles can be for tense is harder than inflection for agreement (Fried-
reversed (e.g., ‘‘The cat is chasing the dog’’) are sub- mann and Grodzinsky, 1997). (4) Sentences with non-
stantially harder to understand than their nonreversible canonical ordering of thematic roles appear harder to
counterparts (‘‘The cat is drinking milk’’) (Caramazza produce than their canonical counterparts (Caplan and
and Zurif, 1976; Kolk and Friederici, 1985). Roughly Hanna, 1998; Bastiaanse and van Zonneveld, 1998; but
speaking, thematic roles specify who is doing what to see also Kolk and van Grunsven, 1985).
whom. (2) Sentences with noncanonical ordering of the- The localization of agrammatism is variable. With
matic roles around the verb are harder to comprehend respect to both production and comprehension, agram-
than ones with canonical ordering. In English, the order matism is associated with lesions across the entire left
of the active sentence is considered to be canonical: perisylvian cortex.
agent-action-patient (or subject-verb-object). Sentences Theories of agrammatism abound. Some researchers
with a word order deviating from this pattern are rela- claim that di¤erences between patients are so great that
tively di‰cult to understand. Thus, passive constructions a unitary theory will not be possible (Miceli et al., 1989).
are harder to understand than active ones (Schwartz, Extant theories pertain either to comprehension or to
Sa¤ran, and Marin, 1980; Kolk and van Grunsven production. This is justified by the fact that agrammatic
1985), and object relative sentences (‘‘The boy whom the production and comprehension can be dissociated
girl pushed was tall’’) are harder than subject relative (Miceli et al., 1983). The most important approaches are
sentences (‘‘The boy who pushed the girl was tall’’) the following. The trace deletion hypothesis about
(Lukatela, Schankweiler, and Crain, 1995; Grodzinsky, agrammatic comprehension holds that traces, or empty
1999), to mention the most frequently studied contrasts. elements resulting from movement transformations
(3) Sentences with a complex—more deeply branched— according to generative linguistic theories, are lacking
phrase structure are harder to understand than their (Grodzinsky, 2000). The mapping hypothesis maintains
simple counterparts, even if they have canonical word that it is not a defect in the structural representation that
order. For instance, a locative construction (e.g., ‘‘The is responsible for these di‰culties but a defect in the
letter is on the book’’) is harder to understand than a procedures by which these representations are employed
simple active construction (‘‘The sailor is kissing the to derive thematic roles (Linebarger, Schwartz, and Saf-
girl’’), even if subjects are able to comprehend the loca- fran, 1983). Finally, a number of hypotheses claim a
tive proposition as such (Schwartz, Sa¤ran, and Marin, processing limitation to be the bottleneck. The limita-
1980; Kolk and van Grunsven, 1985). Furthermore, tion may relate to working memory capacity (Caplan
232 Part III: Language
and Waters, 1999), altered weights or increased noise and Wernicke’s aphasics: Speed and accuracy factors. Cor-
in a distributed neural network (Dick et al., 2001), or tex, 28, 97–112.
a slowdown in syntactic processing (Kolk and van Helms-Estabrooks, N., Fitzpatrick, P. M., and Barresi, B.
Grunsven, 1985). With respect to production, the tree (1981). Response of an agrammatic patient to a syntax
stimulation program for aphasia. Journal of Speech and
truncation hypothesis maintains that damage to a par-
Hearing Disorders, 46, 422–427.
ticular node in the syntactic tree leads to the impossibil- Hofstede, B. T. M., and Kolk, H. H. J. (1994). The e¤ects of
ity of processing any structure higher than the damaged task variation on the production of grammatical morphol-
node (Friedmann and Grodzinsky, 1997). Finally, the ogy in Broca’s aphasia: A multiple case study. Brain and
adaptation theory of agrammatic speech (Kolk and van Language, 46, 278–328.
Grunsven, 1985) maintains that the underlying deficit is Kolk, H. H. J., and Friederici, A. D. (1985). Strategy and im-
a slowing down of the syntactic processor. A second pairment in sentence understanding by Broca’s and Wer-
claim is that the actual slow, telegraphic output results nicke’s aphasics. Cortex, 21, 47–67.
from the way patients adapt to this deficit. Kolk, H. H. J., and van Grunsven, M. F. (1985). Agramma-
Treatment programs for agrammatism vary from tism as a variable phenomenon. Cognitive Neuropsychology,
2, 347–384.
theoretically neutral syntax training programs (Helms-
Linebarger, M. C., Schwartz, M., and Sa¤ran, E. (1983). Sen-
Estabrooks, Fitzpatrick, and Barresi, 1981), to programs sitivity to grammatical structure in so-called agrammatic
motivated by the mapping hypothesis (Schwartz et al., aphasia. Cognition, 13, 361–392.
1994) or by the trace deletion hypothesis (Thompson Lukatela, K., Schankweiler, D., and Crain, S. (1995). Syntactic
et al., 1996). The reduced syntax therapy proposed by processing in agrammatic aphasia by speakers of a Slavic
Springer and Huber (2000) takes a compensatory language. Brain and Language, 49, 50–76.
approach to treatment and fits well with the adaptation Menn, L., and Obler, L. (1990). Agrammatic aphasia: A cross-
theory. language narrative source book. Amsterdam: Benjamins.
Miceli, G., Mazzuchi, A., Menn, L., and Goodglass, H. (1983).
—Herman Kolk Contrasting cases of Italian agrammatic aphasia without
comprehension disorder. Brain and Language, 35, 24–65.
References Miceli, G., Silveri, M. C., Romani, C., and Caramazza, A.
Bastiaanse, R., and van Zonneveld, R. (1998). On the relation (1989). Variation in the pattern of omissions and substitu-
between verb inflection and verb position in Dutch agram- tions of grammatical morphemes in the spontaneous speech
matic subjects. Brain and Language, 64, 165–181. of so-called agrammatic patients. Brain and Language, 26,
Caplan, D., and Hanna, J. E. (1998). Sentence production by 447–492.
aphasic patients in a constrained task. Brain and Language, Rochon, E., Sa¤ran, E. M., Berndt, R. S., and Schwartz, M. F.
63, 159–183. (2000). Quantitative analysis of aphasic sentence produc-
Caplan, D., and Hildebrandt, N. (1988). Disorders of syntactic tion: Further development and new data. Brain and Lan-
comprehension. Cambridge, UK: Bradford Books. guage, 72, 193–218.
Caplan, D., and Waters, G. (1999). Verbal working memory Sa¤ran, E., Schwartz, and Marin, O. (1980). The word order
and sentence comprehension. Behavioral and Brain Sciences, problem in agrammatism: II. Production. Brain and Lan-
22, 77–94. guage, 10, 263–280.
Caramazza, A., and Zurif, E. G. (1976). Dissociation of algo- Schwartz, M. F., Sa¤ran, E. M., and Marin, O. (1980). The
rithmic and heuristic processes in sentence comprehension: word order problem in agrammatism: I. Comprehension.
Evidence from aphasia. Brain and Language, 3, 572–582. Brain and Language, 10, 249–262.
Dick, F., Bates, E., Wulfeck, B., Utman, J., Dronkers, N., and Schwartz, M. F., Sa¤ran, E. M., Fink, R. B., Meyers, J. L.,
Gernsbacher, M. (2001). Language deficits, localization, and Martin, N. (1994). Mapping therapy: a treatment pro-
and grammar: Evidence for a distributive model of lan- gram for agrammatism. Aphasiology, 8, 9–54.
guage breakdown in aphasic patients and neurologically in- Springer, L., and Huber, W. (2000). Agrammatism: Deficit or
tact individuals. Psychological Review, 108, 759–788. compensation? Consequences for aphasia therapy. Neuro-
Friedmann, N., and Grodzinsky, Y. (1997). Tense and agree- psychological Rehabilitation, 10, 279–309.
ment in agrammatic production: Pruning the syntactic tree. Thompson, C. K., Shapiro, L. P., Tait, M. E., Jacobs, B. J.,
Brain and Language, 56, 397–425. and Schneider, S. L. (1996). Training Wh-question produc-
Goodglass, H., Blumstein, S. E., Gleason, J. B., Hyde, M. R., tion in agrammatic aphasia: Analysis of argument and ad-
Green, E., and Stadlender, S. (1979). The e¤ects of syntactic junct movement. Brain and Language, 52, 175–228.
encoding on sentence comprehension in aphasia. Brain and Van-der-Lely, H., and Dewart, H. (1986). Sentence compre-
Language, 7, 201–209. hension strategies in specifically language impaired children.
Goodglass, H., and Kaplan, E. (1983). The assessment of British Journal of Disorders of Communication, 21, 291–
aphasia and related disorders (2nd ed.). Philadelphia: Lea 306.
and Febiger. Waters, G. S., Caplan, D., and Rochon, E. (1995). Processing
Grodzinsky, Y. (2000). The neurology of syntax: Language capacity and sentence comprehension in patients with Alz-
use without Broca’s area. Behavioral and Brain Sciences, 23, heimer’s disease. Cognitive Neuropsychology, 12, 1–30.
1–92.
Grossmann, M., Kalmanson, J., Bernhardt, N., Morris, J., Further Readings
Stern, M. B., and Hurtig, H. I. (2000). Cognitive resource
limitations in Parkinson’s disease. Brain and Language, 73, Badecker, W., and Caramazza, A. (1985). On considerations of
1–16. method and theory governing the use of clinical categories
Haarmann, H. J., and Kolk, H. H. J. (1992). The production in neurolinguistics and cognitive neuropsychology: The case
and comprehension of grammatical morphology in Broca’s against agrammatism. Cognition, 20, 97–125.
Agraphia 233
Beretta, A. (2001). Linear and structural accounts of theta grammatical morphemes? A case study. Brain and Lan-
role assignment in agrammatic aphasia. Aphasiology, 15, guage, 33, 273–295.
515–531. Pulvermueller, F. (1995). Agrammatism: Behavioral descrip-
Berndt, R. S., and Haendiges, A. N. (2000). Grammatical class tion and neurobiological explanation. Journal of Cognitive
in word and sentence production: Evidence from an aphasic Neuroscience, 7, 165–181.
patient. Journal of Memory and Language, 43, 249–273. Sa¤ran, E. M., Schwartz, M. F., and Linebarger, M. C. (1998).
Berndt, R. S., Mitchum, C. C., and Haendiges, A. N. (1996). Semantic influences on thematic role assignment: Evidence
Comprehension of reversible sentences in ‘‘agrammatism’’: from normals and aphasics. Brain and Language, 62, 255–
A meta-analysis. Cognition, 58, 289–308. 279.
Crain, S., Ni, W., and Shankweiler, D. (2001). Grammatism. Schwartz, M., Linebarger, M. C., Sa¤ran, E., and Pate, D.
Brain and Language, 77, 294–304. (1987). Syntactic transparency and sentence interpretation
Druks, J., and Marshall, J. C. (1995). When passives are harder in aphasia. Language and Cognitive Processing, 2, 85–113.
than actives: Two case studies of agrammatic comprehen- Swaab, T. Y., Brown, C., and Hagoort, P. (1998). Under-
sion. Cognition, 55, 311–331. standing ambiguous words in sentence contexts: Electro-
Friederici, A. D., and Frazier, L. (1992). Thematic analysis in physiological evidence for delayed contextual selection in
agrammatic comprehesion: Thematic structure and task Broca’s aphasia. Neuropsychologia, 36, 737–761.
demands. Brain and Language, 42, 1–29. Swinney, D., Zurif, E., Prather, P., and Love, T. (1996). Neu-
Friederici, A. D., and Gorrell, P. (1998). Structural prominence rological distribution of processing resources underlying
and agrammatic theta-role assignment: A reconsideration of language comprehension. Journal of Cognitive Neurosci-
linear strategies. Brain and Language, 65, 253–275. ence, 8, 174–184.
Friedmann, N. (2001). Agrammatism and the psychological
reality of the syntactic tree. Journal of Psycholinguistic Re-
search, 30, 71–90. Agraphia
Haarmann, H. J., Just, M. A., and Carpenter, P. A. (1997).
Aphasic sentence comprehension as a resource deficit:
A computational approach. Brain and Language, 59, 76– Agraphia (or dysgraphia) is the term used to describe an
120.
acquired impairment of writing. The impairment may
Hagiwara, H. (1995). The breakdown of functional categories
and the economy of derivation. Brain and Language, 50, result from damage to any of the cognitive, linguistic,
92–116. or sensorimotor processes that normally support the
Hartsuiker, R. J., and Kolk, H. J. (1998). Syntactic facilitation ability to spell and write. These procedures can be con-
in agrammatic sentence production. Brain and Language, ceptualized within the framework of a cognitive model
62, 221–254. of language processing such as that shown in Figure 1
Hickok, G., Zurif, E., and Canseco-Gonzalez, E. (1993). (Ellis, 1988; Shallice, 1988; Rapcsak and Beeson, 2000).
Structural description of agrammatic comprehension. Brain According to the model, the writing process can be
and Language, 45, 371–395. divided into central and peripheral components. The
Hillis, A. E., and Caramazza, A. (1995). Representation of central components are linguistic in nature and are re-
grammatical categories of words in the brain. Journal of
Cognitive Neuroscience, 7, 396–407.
sponsible for the retrieval of appropriate words and
Kolk, H. H. J. (1995). A time-based approach to agrammatic provision of information about their correct spelling.
production. Brain and Language, 50, 282–303. Peripheral procedures serve to translate spelling knowl-
Linebarger, M. C., Schwartz, M. F., Romania, J. R., Kohn, edge into handwriting, and to guide the motor control
S. E., and Stephens, D. L. (2000). Grammatical encoding in for appropriate movements of the hand.
aphasia: Evidence from a ‘‘processing prosthesis.’’ Brain When the system is working normally and an indi-
and Language, 75, 416–427. vidual wants to write a familiar word, the relevant con-
Luzatti, C., Toraldo, A., Guasti, M., Ghirardi, G., Lorenzi, L., cepts in the semantic system activate representations in
and Guarnaschelli, C. (2001). Comprehension of reversible the memory store for learned spellings (i.e., the ortho-
active and passive sentences in agrammatism. Aphasiology, graphic output lexicon). Access to this lexicon via the
15, 419–442.
Marslen-Wilson, W. D., and Tyler, L. K. (1997). Dissociating
semantic system is referred to as the lexical-semantic
types of mental computation. Nature, 387, 592–594. spelling route. In contrast, when the individual attempts
Martin, R. C., and Romani, C. (1994). Verbal working mem- to spell unfamiliar words or pronounceable nonwords
ory and sentence comprehension: A multiple-components (such as flig), reliance on knowledge of sound-to-letter
view. Neuropsychology, 8, 506–523. correspondences allows the assembly of plausible spell-
Mauner, G., Fromkin, V. A., and Cornell, T. L. (1993). Com- ings by a process referred to as phoneme-grapheme con-
prehension and acceptability judgments in agrammatism: version. This alternative means of spelling is depicted in
Disruptions in the syntax of referential dependency. Brain Figure 1 by the arrow from the phonological bu¤er
and Language, 45, 340–370. (where phonological information is held) to the graphe-
Miyake, A., Carpenter, P. A., and Just, M. A. (1995). Reduced mic bu¤er (where the assembled spelling is held). Spell-
resources and specific impairments in normal and aphasic
sentence comprehension. Cognitive Neuropsychology, 12,
ing in this manner is considered a nonlexical process,
651–679. because spellings are not retrieved as whole words from
Nespoulous, J. L., Dordain, M., Perron, C., Ska, B., Bub, D., the lexicon. Spellings generated by the lexical-semantic
Caplan, D., Mehler, J., and Lecours, A. R. (1988). Agram- and nonlexical spelling routes are subsequently pro-
matism in sentence production without comprehension def- cessed in the graphemic bu¤er. This bu¤er serves as
icits: Reduced availability of syntactic structures and/or of an interface between central spelling processes and the
peripheral procedures that support the production of

handwriting. Peripheral writing procedures are accom-
plished through a series of hierarchically organized
stages that include letter selection (referred to as allo-
graphic conversion), motor programming, and the gen-
eration of graphic innervatory patterns.
Clinical assessment of spelling and writing provides
an understanding of the nature and degree of impair-
ment to specific processes as well as the availability of
residual abilities (Kay, Lesser, and Coltheart, 1992;
Beeson and Hillis, 2001; Beeson and Rapcsak, 2002).
Damage to specific components of the spelling process
may result in identifiable agraphia syndromes with rela-
tively predictable lesion sites (Roeltgen, 1993, 1994;
Rapcsak and Beeson, 2000, 2002). Central agraphia syn-
dromes reflect damage to the lexical-semantic or non-
lexical spelling routes, or the graphemic bu¤er, and
result in similar impairments across di¤erent modalities
of output (e.g., written spelling, oral spelling, typing).
Central agraphia syndromes include lexical agraphia,
phonological agraphia, deep agraphia, and graphemic
bu¤er agraphia. Peripheral agraphia syndromes reflect
damage to writing processes that are distal to the gra-
phemic bu¤er. Dysfunction primarily a¤ects the selec-
tion or production of letters in handwriting. These
syndromes include allographic disorders, apraxic
agraphia, and nonapraxic disorders of neuromuscular
execution (Roeltgen, 1993; Rapcsak, 1997; Rapcsak and
Beeson, 2000). An individual may have impairment to
multiple components of the writing process so that
the agraphia profile does not conform to a recognized
syndrome.
Figure 1. A simplified model of single-word writing. Semantic Lexical agraphia (also called surface agraphia) is a
system refers to knowledge of word meanings. Orthographic central agraphia syndrome that results from damage
output lexicon refers to memory store of learned spellings. to the lexical-semantic spelling route. It is characterized
Phoneme-grapheme conversion is the process of spelling by by the loss or unavailability of orthographic knowledge,
converting units of sound to corresponding letters. Graphemic
so that spelling is accomplished by phoneme-grapheme
bu¤er denotes a working memory system that temporarily
stores orthographic representations while they are being con- conversion and words are spelled as they sound. Spell-
verted into output for handwriting (or typing or oral spelling). ing accuracy is strongly influenced by orthographic
Allographic conversion is the process by which abstract ortho- regularity in that regular words (e.g., bake) and non-
graphic representations are converted into appropriate physical words are spelled correctly, but attempts to spell words
letter shapes. Graphic motor programs are spatiotemporal with irregular sound-to-spelling relationships result in
codes for writing movements that contain information about phonologically plausible errors (e.g., cough—co¤ ).
the sequence, position, direction, and relative size of the strokes Low-frequency, irregular words are especially vulnerable
necessary to create di¤erent letters. Graphic innervatory pat- to error. In addition, if the semantic influence on spelling
terns are motor commands to specific muscle systems specify- is impaired, there is di‰culty writing homophonic words
ing the appropriate force, speed, and amplitude of movement.
that cannot be spelled correctly without reference to
Phonological output lexicon is the memory store of sound
patterns for familiar words used in speech production. Phono- meaning (e.g., dear—deer). Lexical agraphia is typically
logical bu¤er is a working memory system for phonological seen following damage to left extrasylvian temporo-
information. parietal regions. The syndrome has also been described
in patients with Alzheimer’s disease and in semantic
dementia.
Phonological agraphia and deep agraphia are central
agraphia syndromes attributable to dysfunction of the
nonlexical spelling route. In both syndromes, spelling is
accomplished primarily via a lexical-semantic strategy,
and individuals have di‰culty using phoneme-grapheme
conversion to spell unfamiliar words or nonwords. In
phonological agraphia, the spelling of familiar words,
both regular and irregular, may be relatively spared;
Agraphia 235
however, in deep agraphia, there is concomitant im- weakness, dea¤erentation) or damage to the basal gan-
pairment of the lexical-semantic spelling route, so glia (i.e., tremor, rigidity) or cerebellum (i.e., ataxia,
that semantic errors are prevalent (e.g., boy—girl ). In dysmetria). Typical errors of letter morphology include
both syndromes, spelling accuracy is better for highly spatial distortions and stroke omissions or additions,
frequent, concrete words (e.g., house) than for low- which may result in illegible handwriting. Spelling by
frequency, abstract words (e.g., honor). There is also an other modalities (e.g., oral spelling) is typically spared.
influence of grammatical word class in that nouns are In right-handers, apraxic agraphia is associated with
easier to spell than function words such as prepositions, damage to a left hemisphere cortical network dedicated
pronouns, and articles. Other spelling errors may include to the motor programming of handwriting movements.
morphological errors (talked—talking) and functor sub- The major functional components of this neural network
stitutions (as—with). As in any of the central agraphia include posterior-superior parietal cortex (including the
syndromes, patients may recall only some of the letters region of the intraparietal sulcus), dorsolateral premotor
of the target word, reflecting partial orthographic cortex, and the supplementary motor area (SMA). Cal-
knowledge. Phonological and deep agraphia are asso- losal lesions in right-handers may be accompanied by
ciated with damage to the perisylvian language areas, unilateral apraxic agraphia of the left hand.
including Broca’s area, Wernicke’s area, and the supra- Writing disorders attributable to impaired neuromus-
marginal gyrus. Deep agraphia in patients with extensive cular execution are caused by damage to motor systems
left hemisphere lesions may reflect reliance on the right involved in generating graphic innervatory patterns.
hemisphere for writing (Rapcsak, Beeson, and Rubens, Poor motor control results in defective control of writing
1991). force, speed, and amplitude. Such writing disorders re-
Graphemic bu¤er agraphia reflects impairment of the flect the specific underlying disease or locus of damage.
ability to retain orthographic representations in short- In the case of Parkinson’s disease, micrographia results
term memory as the appropriate graphic motor pro- from reduced force and amplitude of movements of the
grams are selected and implemented. Damage to the hand. In patients with cerebellar dysfunction, move-
graphemic bu¤er leads to abnormally rapid decay of in- ments of the pen may be disjointed and erratic. Break-
formation relevant to the order and identity of stored down of graphomotor control in these neurological
graphemes. Spelling accuracy is notably a¤ected by conditions suggests that the basal ganglia and the cere-
word length because each additional grapheme increases bellum, working in concert with dorsolateral premotor
the demand on limited storage capacity. In contrast to cortex and the SMA, are critically involved in the se-
other central agraphia syndromes, spelling in graphemic lection and implementation of kinematic parameters
bu¤er agraphia is not significantly influenced by lexical for writing movements. Obviously, patients with hemi-
status (words versus nonwords), lexical-semantic fea- paresis often have weakness and spasticity of the hand
tures (frequency, concreteness, grammatical class), or and limb that markedly impairs their ability to write
orthographic regularity. Characteristic spelling errors with the preferred hand.
include letter substitutions, additions, deletions, and Behavioral treatments for agraphia may target central
transpositions (e.g., garden—garned ). These errors are or peripheral components of the writing process (Behr-
observed in all spelling tasks and across all modalities of mann and Byng, 1992; Carlomagno, Iavarone, and
output (writing, oral spelling, typing). Lesion sites in Colombo, 1994; Hillis and Caramazza, 1994; Patterson,
patients with graphemic bu¤er agraphia have been vari- 1994; Beeson and Hillis, 2001; Beeson and Rapcsak,
able, but left parietal and frontal cortical involvement is 2002). Treatments for central agraphias may be directed
common. toward the lexical-semantic or nonlexical spelling proce-
Allographic disorders are peripheral writing im- dures. In contrast, treatments for peripheral agraphias
pairments that reflect the breakdown of procedures by are designed to improve the selection and implementa-
which orthographic representations are mapped to letter- tion of graphic motor programs for writing. In general,
specific graphic motor programs. Allographic disorders agraphia treatments are designed to strengthen damaged
are characterized by an inability to activate or select ap- processes and take advantage of residual abilities.
propriate letter shapes, whereas oral spelling is pre- See also alexia; phonological analysis of lan-
served. Patients may have di‰culty that is specific to guage disorders in aphasia; phonology and adult
writing upper- or lowercase letters, or they may produce aphasia.
case-mixing errors (e.g., pApeR). Other patients produce
well-formed letter substitution errors that bear physical —Pelagie M. Beeson and Steven Z. Rapcsak
similarity to the target. Allographic disorders are usually
associated with damage to the left parieto-occipital References
region.
Beeson, P. M., and Hillis, A. E. (2001). Comprehension and
Apraxia agraphia is a peripheral writing impairment production of written words. In R. Chapey (Ed.), Language
caused by damage to graphic motor programs, or it may intervention strategies in adult aphasia (4th ed., pp. 572–
reflect an inability to translate information contained in 604). Baltimore: Lippincott, Williams and Wilkins.
these programs into specific motor commands. Apraxic Beeson, P. M., and Rapcsak, S. Z. (2002). Clinical diagnosis
agraphia is characterized by poor letter formation that and treatment of spelling disorders. In A. E. Hillis (Ed.),
cannot be attributed to sensorimotor impairment (i.e., Handbook on adult language disorders: Integrating cognitive
neuropsychology, neurology, and rehabilitation (pp. 101–

120). Philadelphia: Psychology Press.
Behrmann, M., and Byng, S. (1992). A cognitive approach to
the neurorehabilitation of acquired language disorders. In
D. I. Margolin (Ed.), Cognitive neuropsychology in clinical
practice (pp. 327–350). New York: Oxford University
Press.
Carlomagno, S., Iavarone, A., and Colombo, A. (1994). Cog-
nitive approaches to writing rehabilitation: From single case
to group studies. In M. J. Riddoch and G. W. Humphreys
(Eds.), Cognitive neuropsychology and cognitive rehabilita-
tion (pp. 485–502). Hillsdale, NJ: Erlbaum.
Ellis, A. W. (1988). Normal writing processes and peripheral
acquired dysgraphias. Language and Cognitive Processes, 3,
99–127. Figure 1. The cognitive processes underlying reading.
Hillis, A. E., and Caramazza, A. (1994). Theories of lexical
processing and rehabilitation of lexical deficits. In M. J.
Riddoch and G. W. Humphreys (Eds.), Cognitive neuro-
psychology and cognitive rehabilitation (pp. 1–30). Hillsdale, recognizing the word as a known word (by accessing a
NJ: Erlbaum. stored representation of the learned spelling of the word,
Kay, J., Lesser, R., and Coltheart, M. (1992). Psycholinguistic in the orthographic input lexicon), accessing its meaning
assessments of language processing in aphasia (PALPA). (or semantic representation), and accessing the pronun-
East Sussex, England: Erlbaum. ciation (the stored sound of the word, in the phonologi-
Patterson, K. (1994). Reading, writing, and rehabilitation: A cal output lexicon), as well as activating motor speech
reckoning. In M. J. Riddoch and G. W. Humphreys (Eds.), mechanisms for articulating the word. Even the first
Cognitive neuropsychology and cognitive rehabilitation (pp. of these components, seeing and perceiving the entire
425–447). Hillsdale, NJ: Erlbaum. written letter string, requires complex visual-perceptual
Rapcsak, S. Z. (1997). Disorders of writing. In L. J. G. Rothi
and K. M. Heilman (Eds.), Apraxia: The neuropsychology
skills, including computation of several levels of spatial
of action (pp. 149–172). Hove, England: Psychology Press. representation, before the stored representations can be
Rapcsak, S. Z., and Beeson, P. M. (2000). Agraphia. In accessed. Furthermore, reading an unfamiliar word—
L. J. G. Rothi, B. Crosson, and S. Nadeau (Eds.), Aphasia say, an unfamiliar surname—entails access to print-to-
and language: Theory and practice (pp. 184–220). New sound conversion, or grapheme-to-phoneme conversion
York: Guilford Press. (GPC), mechanisms. Familiar words can also be read
Rapcsak, S. Z., and Beeson, P. M. (2002). Neuroanatomical via GPC mechanisms, but the accuracy of pronunciation
correlates of spelling and writing. In A. E. Hillis (Ed.), will depend on the ‘‘regularity’’ of the word—the extent
Handbook on adult language disorders: Integrating cognitive to which the word conforms to typical GPC rules. For
neuropsychology, neurology, and rehabilitation (pp. 71–99). example, sail but not yacht can be read accurately via
Philadelphia: Psychology Press.
Rapcsak, S. Z., Beeson, P. M., and Rubens, A. B. (1991).
GPC mechanisms.
Writing with the right hemisphere. Brain and Language, 41, These components underlying the reading process are
510–530. schematically depicted in Figure 1 (see Hillis and Car-
Roeltgen, D. P. (1993). Agraphia. In K. M. Heilman and E. amazza, 1992, or Hillis, 2002, for a review of the evi-
Valenstein (Eds.), Clinical neuropsychology (3rd ed., pp. dence for various components of this model). Various
63–89). New York: Oxford University Press. features of this model are controversial, such as the pre-
Roeltgen, D. P. (1994). Localization of lesions in agraphia. In cise nature and arrangement of the components, the de-
A. Kertesz (Ed.), Localization and neuroimaging in neuro- gree to which they interact, and whether the various
psychology (pp. 377–405). San Diego, CA: Academic Press. levels of representation are accessed in parallel or serially
Shallice, T. (1988). From neuropsychology to mental structure. (Shallice, 1988; Hillis and Caramazza, 1991; Plaut and
Cambridge, U.K.: Cambridge University Press.
Shallice, 1993; Hillis, 2002). Nevertheless, most models
of naming include most of the components depicted in
Figure 1. Neurological impairment can selectively im-
Alexia pair any one or more of these components of the reading
process, with di¤erent consequences in terms of the types
of errors produced and the types of stimuli that are af-
Alexia, or acquired impairment of reading, is extremely fected. In addition, because several of the components of
common after stroke, dementia, or traumatic brain in- reading are cognitive mechanisms that are also involved
jury. Reading can be a¤ected in a variety of di¤erent in other tasks, damage to one of these shared compo-
ways, leading to a number of di¤erent clinical syndromes nents will have predictable consequences for reading and
or types of alexia. To understand these alexic syndromes, other tasks. For example, impairment at the level of se-
it is necessary to appreciate the cognitive processes un- mantics, or word meaning, will a¤ect not only the read-
derlying the task of reading words. Reading aloud a fa- ing of familiar irregular words but also the naming and
miliar word, such as leopard, normally entails at the very comprehension of words. Thus, it is possible to identify
least seeing and perceiving the entire written letter string, what component of the reading system is impaired by
Alexia 237
considering the types of errors produced by the individ- distinguish familiar from unfamiliar words, or pseudo-
ual, the types of words that elicit errors, and the accu- words (e.g., glamp), in a task known as lexical decision.
racy of performance across other language tasks, such as Sometimes such an individual will read each letter in the
spoken naming and comprehension. The consequences string aloud serially, which seems to facilitate access to
of damage to each level of the reading process are dis- the orthographic input lexicon (resulting in letter-by-
cussed here. letter reading; see papers in Coltheart, 1998). If GPC
mechanisms are intact, these mechanisms may be used
Visual Attention and Perception. To read a familiar or to read even familiar words, resulting in ‘‘regulariza-
unfamiliar word, the string of letters must be accurately tion’’ of irregular words (e.g., one ! ‘‘own’’). Other
perceived in the correct order and converted to a series errors are predominantly visually similar words (e.g.,
of graphemes (abstract letter identities, without a partic- though ! ‘‘touch’’). Oral reading of all types of words
ular case or font). Perception of the printed word can may be a¤ected, although very familiar words—those
break down when there is (1) poor visual acuity or visual frequently encountered in reading—may be relatively
field cut, (2) impairment in distinguishing individual let- spared. Since the orthographic input lexicon is not
ters or symbols in a string (attentional dyslexia; Shallice, involved in other linguistic tasks, damage to this cogni-
1988), or (3) impairment in perception of more than one tive process does not cause errors in other tasks. There-
object or feature at a time (called simultagnosia; Parkin, fore, individuals with impairment of this component are
1996). An individual with simultagnosia might read the said to have pure alexia.
word chair as ‘‘h’’ or ‘‘i’’. Finally, individuals with
damage to the nondominant (usually right) hemisphere Semantic System. Disruption of semantic representa-
of the brain often fail to perceive the side of a visual tions is often incomplete, such that the meanings that
stimulus like a word that is contralateral to the brain are accessed are often impoverished, and only certain
damage. Such an impairment, known as neglect dys- categories of words are a¤ected. For example, an
lexia, results in reading errors such as chair ! ‘‘fair,’’ alexic patient may read dog as ‘‘cat’’ if an incom-
spool ! ‘‘pool,’’ and love ! ‘‘glove,’’ errors that entail plete semantic representation of dog is accessed that
substitution, deletion, or insertion of letters on the left specifies only hanimali, or hmammali, hdomesticatedi,
side (or initial letters) of words (Kinsbourne and War- hquadrapedi, etc., without information about what dif-
rington, 1962; see papers in Riddoch, 1991, for reviews). ferentiates a dog from a cat. Thus, most errors are
Depending on the level of spatial representation a¤ected, semantically related words, such as robin ! ‘‘cardinal’’
reading accuracy is sometimes improved by moving the or robin ! ‘‘bird’’ (errors called semantic paralexias).
printed word to the una¤ected side of space or by spell- However, if GPC mechanisms are available, these may
ing the word aloud to the person (see Hillis and Car- be used to read all types of words, or used to block se-
amazza, 1995, for a discussion of various types of mantic paralexias. Or GPC mechanisms may be com-
neglect dyslexia resulting from damage to distinct levels bined with partial semantic information to access the
of spatial representation that are computed prior to correct phonological representation in the output lexi-
accessing a stored graphemic representation). All types con, so that the individual can read aloud words better
of reading stimuli are likely to be a¤ected in neglect than he or she can understand words (Hillis and Car-
dyslexia, although words that have final letter strings in amazza, 1991). Often there is especially incomplete se-
common with other words often elicit the most errors. mantic information to distinguish abstract words, so that
For example, the words light, fight, might, right, tight, abstract words are read less accurately than concrete
sight, blight, bright, slight, and so on are all likely to be words. Similarly, functors are read least accurately,
read as the same word, since only the final letters are often with one functor substituted for another (e.g.,
perceived and used to access the stored graphemic rep- therefore ! ‘‘because’’); verbs are read less accurately
resentation for recognition. Pseudo-words also elicit than adjectives; and adjectives are read less accurately
comparable errors (e.g., glamp ! ‘‘lamp’’ or ‘‘damp’’). than nouns (Coltheart, Patterson, and Marshall, 1980).
An individual with impairment in computing one or If GPC mechanisms are also impaired (a commonly co-
more levels of spatial representation will usually also occurring deficit), pseudo-words and unfamiliar words
make errors in perceiving the left side of nonlinguistic cannot be read. Since the semantic system is shared by
visual stimuli (Hillis and Caramazza, 1995), although the tasks of naming and comprehension, semantic errors
exceptional cases of pure neglect dyslexia, without other are also made in oral and written naming and in com-
features of hemispatial neglect, have been reported prehension of spoken and written words (Hillis et al.,
(Costello and Warrington, 1987; Patterson and Wilson, 1990).
1990).
Phonological Output Lexicon. Impairment in accessing
Orthographic Input Lexicon. Impairment at the level of phonological representations for output results in poor
accessing learned spellings of familiar words, or stored oral reading despite accurate comprehension of printed
graphemic representations that constitute the ‘‘ortho- words. For example, gray might be read as ‘‘blue’’ but
graphic input lexicon,’’ results in impaired recognition of defined as ‘‘the color of hair when you get old’’ (from
written words despite accurate perception of the letters. Caramazza and Hillis, 1990). Again, if GPC mecha-
The individual with this impairment will often fail to nisms are available, these mechanisms may be used to
Table 1. Characteristics of Reported Individuals with Surface Alexia (Compensatory Use of GPC Mechanisms)
PS* JJ* HG*
Error types (example of Regularization (bear ! ‘‘beer’’) Regularization (were ! ‘‘we’re’’) Regularization (one ! ‘‘own’’)
errors)
Lexical decision Impaired Intact Intact
Written word comprehension Impaired (e.g., shoe understood Impaired (e.g., shoe understood Intacty
as ‘‘show’’) as sock)
Spoken word comprehension Intact Impaired (e.g., ‘‘shoe’’ Intacty
understood as sock)
Oral naming Intact Impaired (e.g., a shoe named as Impaired (e.g., shoe named as
‘‘sock’’) ‘‘glove’’)
GPC mechanisms Intact Intact Intact
Level of deficit Orthographic input lexicon Semantic system Phonological output lexicon
* PS is described in Hillis (1993); JJ is described in Hillis and Caramazza (1991); HG is described in Hillis (1991).
y
HG also had a semantic impairment for certain categories of words; this table describes her performance for categories for which
she had intact comprehension, such as numbers and clothing. PS also had a mild semantic impairment in the categories of animals
and vegetables; this table describes his performance for categories for which he had intact comprehension.
read aloud all types of words, resulting in regularization each had impairment at di¤erent levels of lexical repre-
errors on irregular words. Otherwise, errors may be sentation but had intact GPC mechanisms.
semantically related (e.g., fork ! ‘‘spoon’’), or phono- Phonological alexia or phonological dyslexia refers to
logically related to the target (e.g., choir ! ‘‘queer’’). impairment in use of GPC mechanisms, so that the
Phonological representations of words that are used reader is unable to compose a plausible pronunciation of
more frequently may be more accessible than other unfamiliar words or pseudo-words. In addition, occa-
words, so that high-frequency words are read more ac- sional semantic paralexias or functor substitutions are
curately than low-frequency words. Since the phonolog- produced, presumably because these errors are not
ical output lexicon is also essential for oral naming and blocked by GPC mechansisms (see Beauvois and Der-
spontaneous speech, the person will make similar errors ouesne, 1979; Goodglass and Budin, 1988; Shallice,
on these tasks as in reading. 1988).
Deep alexia or deep dyslexia is a pattern that arises
Types of Alexia when there is damage to both GPC mechanisms and
another component of the ‘‘semantic route’’ of reading:
A number of alexic syndromes consisting of a particular the semantic system and/or the phonological output
pattern of frequently co-occurring symptoms in reading lexicon (see Coltheart, Patterson, and Marshall, 1980).
have been described. (These types of alexia are also Semantic paralexias and functor substitutions are
known as acquired dyslexia.) Individuals with a par- invariably produced, although visually similar word
ticular alexic syndrome may have di¤erent underlying errors and derivational errors (e.g., write ! ‘‘writer’’;
deficits, however. To identify which component of the predicted ! ‘‘prediction’’) are also common. Concrete
reading process is impaired it is necessary to know words are read more accurately than abstract words, and
not only the error types and the types of words that there is the following grammatical category e¤ect:
are misread, but also the individual’s pattern of per- nouns > adjectives > verbs > functors (nouns most ac-
formance on other lexical tasks, such as naming and curate). Table 2 characterizes patterns of performance
comprehension. across tasks in patients with deep dyslexia with damage
Surface alexia or surface dyslexia refers to a pattern to di¤erent components of the semantic route. It has
of reading that reflects use of GPC mechanisms to read been argued that the pattern of reading errors reflects
both familiar and unfamiliar words, so that irregular reliance on the nondominant hemisphere’s rudimentary
words are often read as regularization errors (e.g., language capabilities (Coltheart, 1980), although direct
bear ! ‘‘beer’’; see papers in Patterson, Coltheart, and evidence for this proposal is lacking.
Marshall, 1985). Regular words, which can be read Neglect dyslexia, attentional dyslexia, and pure
accurately via GPC mechanisms, are more likely to be alexia were described under impairments of specific
read correctly than irregular words. Comprehension of components.
homophones, such as eight and ate, may be confused. Neurological disease or focal brain damage can dis-
Oral reading of pseudo-words is accurate. This pattern rupt one or more relatively distinct cognitive mecha-
can be seen with damage to any level of the reading sys- nisms that underlie the task of reading, resulting in
tem that requires reliance on GPC mechanisms to bypass di¤erent patterns of reading impairment. It is generally
the damaged component. For example, Table 1 de- possible to identify the impaired components by deter-
scribes features of three patients with surface alexia who mining the types of errors made, the types of stimuli that
Alexia 239
Table 2. Characteristics of Reported Individuals with Deep Alexia (Compensatory Use of GPC Mechanisms)
KE* RGB*
Error types (example of errors) Semantic paralexias (e.g., peach ! apple), Semantic paralexias (e.g., hope ! ‘‘faith’’),
derivational errors (e.g., walked ! ‘‘walk’’), derivational errors (e.g., crime !
functor substitutions (e.g., in ! ‘‘for’’) ‘‘criminal’’), functor substitutions (e.g.,
toward ! ‘‘shall’’)
Written word comprehension Impaired (e.g., shoe understood as ‘‘mitten’’) Intact (e.g., six read as ‘‘seven,’’ but defined as
‘‘half a dozen’’)
Spoken word comprehension Impaired (e.g., ‘‘shoe’’ understood as mitten) Intact
Oral naming Impaired (e.g., shoe named as ‘‘mitten’’) Impaired (e.g., mittens named as ‘‘socks’’)
GPC mechanisms Impaired Impaired
Levels of deficit Semantic system and GPC mechanisms Phonological output lexicon and GPC
mechanisms
* KE is described in Hillis et al. (1990); RGB is described in Caramazza and Hillis (1990).
are misread, and the accuracy of performance on related Hillis, A. E., and Caramazza, A. (1992). The reading process
tasks, such as word comprehension and naming. The and its disorders. In D. I. Margolin (Ed.), Cognitive neuro-
various components of the reading system have distinct psychology in clinical practice (pp. 229–261). New York:
neural substrates, so that damage to di¤erent parts of the Oxford University Press.
Hillis, A. E., and Caramazza, A. (1995). A framework for
brain results in di¤erent patterns of alexia (see Black and
interpreting distinct patterns of hemispatial neglect. Neuro-
Behrmann, 1994, and Hillis et al., 2002, for reviews). case, 1, 189–207.
—Argye E. Hillis Hillis, A. E., Kane, A., Barker, P., Beauchamp, N., and Wityk,
R. (2001). Neural substrates of the cognitive processes
underlying reading: Evidence from magnetic resonance
References perfusion imaging in hyperacute stroke. Aphasiology, 15,
Beauvois, M. F., and Derouesne, J. (1979). Phonological 919–931.
alexia: Three dissociations. Journal of Neurology, Neuro- Hillis, A. E., Rapp, B. C., Romani, C., and Caramazza, A.
surgery, and Psychiatry, 42, 1115–1124. (1990). Selective impairment of semantics in lexical pro-
Black, S., and Behrmann, M. (1994). Localization in alexia. In cessing. Cognitive Neuropsychology, 7, 191–244.
A. Kertesz (Ed.), Localization and neuroimaging in neuro- Kinsbourne, M., and Warrington, E. K. (1962). A variety of
psychology (pp. 152–184). San Diego, CA: Academic Press. reading disability associated with right hemisphere lesions.
Caramazza, A., and Hillis, A. E. (1990). Where do semantic Journal of Neurology, Neurosurgery, and Psychiatry, 25,
errors come from? Cortex, 26, 95–122. 334–339.
Coltheart, M. (1980). Deep dyslexia: A right hemisphere Parkin, A. J. (1996). Explorations in cognitive neuropsychology.
hypothesis. In M. Coltheart, K. E. Patterson, and J. C. Oxford, U.K.: Blackwell.
Marshall (Eds.), Deep dyslexia (pp. 326–380). London: Patterson, K., and Wilson, B. (1990). A ROSE is a ROSE or a
Routledge and Kegan Paul. NOSE: A deficit in initial letter identification. Cognitive
Coltheart, M. (Ed.) (1998). Pure alexia (letter-by-letter read- Neuropsychology, 7, 447–477.
ing) [Special issue]. Cognitive Neuropsychology, 15, 1–238. Patterson, K. E., Coltheart, M., and Marshall, J. C. (1985).
Coltheart, M., Patterson, K., and Marshall, J. C. (Eds.). Surface dyslexia. London: LEA.
(1980). Deep dyslexia. London: Routledge and Kegan Paul. Plaut, D., and Shallice, T. (1993). Deep dyslexia: A case study
Costello, A. de L., and Warrington, E. K. (1987). Dissociation of connectionist neuropsychology. Cognitive Neuropsy-
of visuo-spatial neglect and neglect dyslexia. Journal of chology, 10, 377–500.
Neurology, Neurosurgery, and Psychiatry, 50, 1110–1116. Riddoch, M. J. (Ed.). (1991). Neglect and the peripheral dys-
Goodglass, H., and Budin, C. (1988). Category and modality lexias. Hove, U.K.: Erlbaum.
specific dissociations in word comprehension and concur- Shallice, T. (1988). From neuropsychology to mental structure.
rent phonological dyslexia. Neuropsychologia, 26, 67–78. Cambridge, U.K.: Cambridge University Press.
Hillis, A. E. (1991). E¤ects of a separate treatments for distinct
impairments within the naming process. In T. Prescott
(Ed.), Clinical aphasiology (vol. 19, pp. 255–265). Austin, Further Readings
TX: Pro-Ed.
Hillis, A. E. (1993). The role of models of language processing Benson, D. F., and Geschwind, N. (1969). The alexias. In P. J.
in rehabilitation of language impairments. Aphasiology, 7, Vinken and G. W. Bruyn (Eds.), Handbook of clinical neu-
5–26. ropsychology (vol. 4). Amsterdam: North Holland.
Hillis, A. E. (2002). The cognitive processes underlying read- Coltheart, M., and Funnell, E. (1987). Reading and writing:
ing. In A. E. Hillis (Ed.), Handbook of adult language dis- One lexicon or two? In D. A. Allport, D. G. Mackay, W.
orders: Integrating cognitive neuropsychology, neurology, Prinz, and E. Scheerer (Eds.), Language perception and
and rehabilitation. Philadelphia: Psychology Press. production: Shared mechanisms in listening, reading, and
Hillis, A. E., and Caramazza, A. (1991). Mechanisms for writing. London: Academic Press.
accessing lexical representations for output: Evidence from Friedman, R. B., and Alexander, M. P. (1984). Pictures,
a category-specific semantic deficit. Brain and Language, 40, images, and pure alexia: A case study. Cognitive Neuro-
106–144. psychology, 1, 9–23.
Marshall, J. C., and Newcombe, F. (1966). Syntactic and se- integrity: AD patients have relatively preserved priming
mantic errors in paralexia. Neuropsychologia, 4, 169–176. for high-frequency lexical associates such as ‘‘cottage-
Marshall, J. C., and Newcombe, F. (1973). Patterns of para- cheese’’ that have little semantic relationship (Nebes,
lexia: A psycholinguistic approach. Journal of Psycholin- 1989; Ober et al., 1991), but they are impaired in their
guistic Research, 2, 175–199.
priming for coordinates taken from the same semantic
Rapcsak, S. Z., Gonzalez Rothi, L., and Heilman, K. M.
(1987). Phonologic alexia with optic and tactile anomia: A category, such as ‘‘peach-banana’’ (Glosser and Fried-
neuropsychological and anatomic study. Brain and Lan- man, 1991; Glosser et al., 1998). Item-by-item analyses
guage, 31, 109–121. show reduced priming for words that are di‰cult to un-
Rapp, B., Folk, J. R., Tainturier, M. (2001). Word reading. In derstand and name (Chertkow, Bub, and Seidenberg,
B. Rapp (Ed.), The handbook of cognitive neuropsychology. 1989). The unity of impairment across comprehension
Philadelphia: Psychology Press. and expression is emphasized by the observation of the
Reuter-Lorenz, P. A., and Brunn, J. L. (1990). A prelexical greatest naming di‰culty in patients with significant
basis for letter-by-letter reading: A case study. Cognitive semantic comprehension deficits (Chertkow and Bub,
Neuropsychology, 7, 1–20. 1990; Hodges, Salmon, and Butters, 1992). The basis for
this pattern of impaired semantic memory has been an
active focus of investigation. Some studies associate the
Alzheimer’s Disease semantic comprehension impairment in AD with the
degradation of knowledge about a word and its asso-
ciated concept (Gonnerman et al., 1997; Garrard et al.,
Alzheimer’s disease (AD) is a neurodegenerative condi- 1998; Conley, Burgess, and Glosser, 2001). A category-
tion that results in insidiously progressive cognitive de- specific deficit understanding or naming natural kinds
cline. According to widely recognized clinical diagnostic such as ‘‘animals’’ compared with manufactured arti-
criteria for AD (McKhann et al., 1984), these patients facts such as ‘‘implements’’ may emerge in AD (Silveri
have language processing impairments as well as di‰- et al., 1991). Other recent work suggests that di‰culty
culty with memory, visual perceptual-spatial processing, understanding words and pictures in AD is related to an
and executive functioning. The language impairment impairment in the categorization process that is so
changes as the disease progresses (Bayles et al., 2000), crucial to understanding concepts. In particular, AD
and one profound consequence of language di‰culty in patients appear to have di‰culty implementing rule-
AD is that this deficit strongly reflects clinical decline based processes for understanding the critical features of
and the need for additional skilled nursing support words that determine category membership (Grossman,
(Chan et al., 1995). This article briefly summarizes the Smith, et al., submitted) or for learning the category
studies showing that AD patients’ language deficit in- membership of new concepts (Koenig et al., 2001).
cludes di‰culty with comprehension and expression of The comprehension and expression of concepts often
the sounds/letters, words, and sentences that are used to requires appreciating the long-distance relationships
communicate in day-to-day circumstances. Work relat- among several words in a sentence. Some early work
ing these language deficits to a specific neuroanatomical attributes sentence processing di‰culty in AD to a
distribution of disease is also reviewed. grammatical deficit. Paragrammatic errors such as
We may consider first the semantic impairment in ‘‘mices’’ and ‘‘catched’’ can be observed in speech, oral
AD. This deficit limits the comprehension and ex- reading, and writing. Other studies relate impaired com-
pression of concepts represented by single words and prehension to di‰culty with the grammatical features of
sentences. In expression, for example, a significant word- phrases and a deficit in understanding grammatically
finding deficit is a prominent and early clinical feature complex sentences such as those containing a center-
of AD (Bayles, Tomoeda, and Trosset, 1990; White- embedded clause (Emery and Breslau, 1989; Kontiola
Devine et al., 1996; Cappa et al., 1998). This is seen in et al., 1990; Grober and Bang, 1995; Croot, Hodges, and
spontaneous speech as well as on measures of confron- Patterson, 1999). Essentially normal performance during
tation naming. Naming di‰culty due to a semantic im- on-line studies of sentence comprehension cast doubt
pairment often is manifested as semantic paraphasic on this claim (Kempler et al., 1998; Grossman and
errors, such as the substitution of ‘‘chair’’ for the in- Rhee, 2001). More recently, considerable evidence indi-
tended target ‘‘table.’’ As the condition progresses, AD cates that sentence processing di‰culty is related to
patients’ spontaneous speech becomes limited to the use a limitation in the working memory resources often
of overlearned phrases and ultimately becomes quite needed to support sentence processing. Although the
empty of content, while they fail to provide any re- precise nature of the limited cognitive resource(s) re-
sponses during confrontation naming. mains to be established, AD patients’ grammatical
Semantic deficits are also prominent in comprehen- comprehension deficit can be brought out by experi-
sion. More than 50% of AD patients di¤er significantly mental manipulations that stress cognitive resources
from healthy seniors in their performance on simple such as working memory, inhibitory control, and infor-
category judgment tasks. For example, many AD pa- mation processing speed. Studies demonstrate working
tients are impaired when shown a word or a picture and memory limitations through the use of verbs featuring
asked, ‘‘Is this a vegetable?’’ (Grossman et al., 1996). unusual syntactic-thematic mapping in a sentence com-
Priming is a relatively automatic measure of semantic prehension task (Grossman and White-Devine, 1998),
Alzheimer’s Disease 241
concurrent performance of a secondary task during sen- ited activation of middle and inferior frontal regions in
tence comprehension (Waters, Caplan, and Rochon, AD patients that had been recruited during a category
1995; Waters, Rochon, and Caplan, 1998), and limited membership semantic decision in healthy seniors (Saykin
inhibition of the context-inappropriate meaning of a et al., 1999). More recently, a BOLD fMRI study of
polysemous word (Faust et al., 1997). semantic judgments described limited activation of left
Semantic memory and sentence processing appear to temporoparietal cortex and frontal cortex in AD pa-
be preserved in some AD patients. Nevertheless, this tients compared to healthy seniors, and AD patients
cohort of AD patients may have a di¤erent language recruited brain regions adjacent to the activated areas
impairment profile. Many of these patients, despite pre- seen in elderly control subjects for specific categories of
served single-word and sentence comprehension, are knowledge such as ‘‘animals’’ and ‘‘implements’’ (Gross-
impaired in retrieving words from the mental lexicon. man, Koenig, et al., in press).
This kind of naming di‰culty is marked by changes in AD patients thus have prominent deficits at several
the sounds contributing to a word, such as omissions and levels of language processing. This includes impaired se-
substitutions (Biassou et al., 1995). This limitation in mantic memory, manifested in measures of comprehen-
lexical retrieval appears to be equally evident on oral sion and expression. There is also di‰culty with lexical
lexical retrieval tasks and in writing. AD patients appear retrieval in reading and writing, although perceptual
to be quite accurate at discriminating between speech judgments of speech sounds are relatively preserved. The
sounds that vary in the place of articulation or voice neural basis for these language impairments appears to
onset timing, but phonemic (single-sound) substitutions be a defect in temporoparietal association cortex of the
can also be heard in their speech. left hemisphere, although a defect in left frontal associa-
Alzheimer’s disease is a focal neurodegenerative con- tion cortex also may contribute to the language impair-
dition. Functional neuroimaging studies obtained at rest ments in AD.
with modalities such as single-photon emission com- See also dementia.
puted tomography, positron emission tomography, and
functional magnetic resonance imaging (fMRI) (Foster Acknowledgments
et al., 1983; Friedland, Brun, and Budinger, 1985;
Work was supported in part by U.S. Public Health Ser-
DeKosky et al., 1990; Johnson et al., 1993; Alsop, Detre,
vice grants AG15116, AG17586, and NS35867.
and Grossman, 2000) and histopathological studies of
autopsied brains (Brun and Gustafson, 1976; Arnold —Murray Grossman
et al., 1991; Braak and Braak, 1995) show that specific
brain regions are compromised in AD. The neuro- References
anatomical distribution of disease revealed by these
studies includes gross defects such as atrophy and mi- Alsop, D. C., Detre, J., and Grossman, M. (2000). Assessment
of cerebral blood flow in Alzheimer’s disease by arterial spin
croscopic abnormalities such as neuritic plaques and
labelling fMRI. Annals of Neurology, 47, 93–100.
neurofibrillary tangles in the temporal, parietal, and Arnold, S. E., Hyman, B. T., Flory, J., Damasio, A. R., and
frontal association cortices of the brain. The neural basis van Hoesen, G. W. (1991). The topographic and neuro-
for the language di‰culties in AD is investigated most anatomical distribution of neurofibrillary tangles and
commonly through brain-behavior correlation studies, neuritic plaques in the cerebral cortex of patients with
although occasional functional neuroimaging reports Alzheimer’s disease. Cerebral Cortex, 1, 103–116.
describe defects in regional brain activation during lan- Bayles, K. A., Tomoeda, C. K., Cruz, R. F., and Mahendra,
guage challenges. Early correlation studies relate sen- N. (2000). Communication abilities of individuals with late-
tence comprehension di‰culty to reduced resting activity stage Alzheimer disease. Alzheimer Disease and Associated
in posterior temporal and inferior parietal regions of the Disorders, 14, 176–182.
Bayles, K. A., Tomoeda, C. K., and Trosset, M. W. (1990).
left hemisphere (Haxby et al., 1985; Grady et al., 1988).
Naming and categorical knowledge in Alzheimer’s disease:
More recent work associates di‰culty understanding The process of semantic memory deterioration. Brain and
single words and impaired confrontation naming with Language, 39, 498–510.
left temporoparietal cortex (Desgranges et al., 1998). Biassou, N., Grossman, M., Onishi, K., Mickanin, J., Hughes,
Moreover, the defect in this brain region is significantly E., Robinson, K. M., et al. (1995). Phonological processing
greater in AD patients with a semantic memory im- deficits in Alzheimer’s disease. Neurology, 45, 2165–2169.
pairment than in AD patients with relatively preserved Braak, H., and Braak, E. (1995). Staging of Alzheimer’s
semantic memory (Grossman et al., 1997), and a com- disease-related neurofibrillary changes. Neurobiology of
parative study demonstrates the specificity of this cor- Aging, 16, 271–278.
relative pattern in AD relative to patients with a Brun, A., and Gustafson, L. (1976). Distribution of cerebral
degeneration in Alzheimer’s disease: A clinico-pathologic
frontotemporal form of dementia (Grossman et al.,
study. Archives für Psychiatrik Nervenkrasse, 223, 15–33.
1998). By comparison, only very modest correlations Cappa, S. F., Binetti, G., Pezzini, A., Padovani, A., Rozzini,
show a relationship between grammatical comprehen- L., and Trabucchi, M. (1998). Object and action naming in
sion and left inferior frontal cortex in AD. Alzheimer’s disease and fronto-temporal dementia. Neurol-
A handful of functional neuroimaging studies report ogy, 50, 351–355.
monitoring the regional cortical responses of AD pa- Chan, A. S., Salmon, D. P., Butters, N., and Johnson, S. A.
tients during language challenges. One study shows lim- (1995). Semantic network abnormality predicts rate of
cognitive decline in patients with probable Alzheimer’s Grossman, M., Koenig, P., Glosser, G., DeVita, C., Moore, P.,
disease. Journal of the International Neuropsychological Rhee, J., et al. (in press). Semantic memory di‰culty in
Society, 1, 297–303. Alzheimer’s disease: An fMRI study. Brain.
Chertkow, H., and Bub, D. N. (1990). Semantic memory loss Grossman, M., Payer, F., Onishi, K., D’Esposito, M., Morri-
in dementia of the Alzheimer’s type: What do the various son, D., Sadek, A., and Alavi, A. (1998). Language com-
measures measure? Brain, 113, 397–417. prehension and regional cerebral defects in frontotemporal
Chertkow, H., Bub, D. N., and Seidenberg, M. (1989). Priming degeneration and Alzheimer’s disease. Neurology, 50, 157–
and semantic memory loss in Alzheimer’s disease. Brain and 163.
Language, 36, 420–446. Grossman, M., and Rhee, J. (2001). Cognitive resources dur-
Conley, P., Burgess, C., and Glosser, G. (2001). Age vs. Alz- ing sentence processing in Alzheimer’s disease. Neuropsy-
heimer’s: A computational model of changes in representa- chologia, 39, 1419–1431.
tion. Brain and Cognition, 46, 86–90. Grossman, M., Smith, E. E., Koenig, P., Glosser, G., Rhee, J.,
Croot, K., Hodges, J. R., and Patterson, K. (1999). Evidence and Dennis, K. (submitted). Categorization of object
for imapired sentence comprehension in early Alzheimer’s descriptions in Alzheimer’s disease and frontotemporal de-
disease. Journal of the International Neuropsychological So- mentia: Limitation in rule-based processing.
ciety, 5, 393–404. Grossman, M., and White-Devine, T. (1998). Sentence com-
DeKosky, S. T., Shih, W. J., Schmitt, F. A., Coupal, J., and prehension in Alzheimer’s disease. Brain and Language, 62,
Kirkpatrick, C. (1990). Assessing utility of single photon 186–201.
emission computed tomography (SPECT) scan in Alz- Grossman, M., White-Devine, T., Payer, F., Onishi, K.,
heimer’s disease: Correlation with cognitive severity. Alz- D’Esposito, M., Robinson, K. M., et al. (1997). Constraints
heimer Disease and Associated Disorders, 4, 14–23. on the cerebral basis for semantic processing from neuro-
Desgranges, B., Baron, J.-C., de la Sayette, V., Petit-Taboue, imaging studies of Alzheimer’s disease. Journal of Neurol-
M. C., Benali, K., Landeau, B., et al. (1998). The neural ogy, Neurosurgery, and Psychiatry, 63, 152–158.
substrates of memory systems impairment in Alzheimer’s Haxby, J. V., Duara, R., Grady, C., Cutler, N., and Rapoport,
disease: A PET study of resting brain glucose utilization. S. (1985). Relations between neuropsychological and cere-
Brain, 121, 611–631. bral metabolic asymmetries in early Alzheimer’s disease.
Emery, O. B., and Breslau, L. D. (1989). Language deficits in Journal of Cerebral Blood Flow and Metabolism, 5, 193–
depression: Comparisons with SDAT and normal aging. 200.
Journal of Gerontology, 44, M85–M92. Hodges, J. R., Salmon, D. P., and Butters, N. (1992). Semantic
Faust, M., Balota, D. A., Duchek, J. M., Gernsbacher, M. A., memory impairment in Alzheimer’s disease: Failure of
and Smith, S. (1997). Inhibitory control during sentence access or degraded knowledge. Neuropsychologia, 30,
comprehension in individuals with dementia of the Alz- 301–314.
heimer type. Brain and Language, 57, 225–253. Johnson, K. A., Kijewski, M. F., Becker, J. A., Garada, B.,
Foster, N. L., Chase, T. N., Fedio, P., Patronas, N. J., Brooks, Satlin, A., and Holman, B. L. (1993). Quantitative brain
R. A., and DiChiro, G. (1983). Alzheimer’s disease: Focal SPECT in Alzheimer’s disease and normal aging. Journal of
cortical changes shown by positron emission tomography. Nuclear Medicine, 34, 2044–2048.
Neurology, 33, 961–965. Kempler, D., Almor, A., Tyler, L. K., Andersen, E. S.,
Friedland, R., Brun, A., and Budinger, T. (1985). Pathological and MacDonald, M. C. (1998). Sentence comprehension
and positron emission tomographic correlations in Alz- deficits in Alzheimer’s disease: A comparison of o¤-line vs.
heimer’s disease. Lancet, 1, 288. on-line sentence processing. Brain and Language, 64, 297–
Garrard, P., Patterson, K., Watson, P. C., and Hodges, J. R. 316.
(1998). Category specific semantic loss in dementia of Alz- Koenig, P., Smith, E. E., Rhee, J., Moore, P., Petock, L., and
heimer’s type: Functional-anatomic correlations from cross- Grossman, M. (2001). Categorization of novel animals in
sectional analyses. Brain, 121, 633–646. Alzheimer’s disease. Neurology, 56, A56–A57.
Glosser, G., and Friedman, R. (1991). Lexical but not semantic Kontiola, P., Laaksonen, R., Sulkava, R., and Erkinjuntti, T.
priming in Alzheimer’s disease. Psychology and Aging, 6, (1990). Pattern of language impairment is di¤erent in Alz-
522–527. heimer’s disease and multi-infarct dementia. Brain and
Glosser, G., Friedman, R., Grugan, P. K., Lee, J. H., and Gross- Language, 38, 364–383.
man, M. (1998). Lexical semantic and associative priming McKhann, G., Drachman, D., Folstein, M., Katzman, R.,
in Alzheimer’s disease. Neuropsychology, 12, 218–224. Price, D., and Stadian, E. M. (1984). Clinical diagnosis of
Gonnerman, L. M., Andersen, E. S., Devlin, J. T., Kempler, Alzheimer’s disease: Report on the NINCDS-ADRDA
D., and Seidenberg, M. S. (1997). Double dissociation of work group under the auspices of the Department of Health
semantic categories in Alzheimer’s disease. Brain and Lan- and Human Services Task Force on Alzheimer’s disease.
guage, 57, 254–279. Neurology, 34, 939–944.
Grady, C. L., Haxby, J. V., Horwitz, B., and Sundaram, M. Nebes, R. D. (1989). Semantic memory in Alzheimer’s disease.
(1988). Longitudinal study of the early neuropsychological Psychological Bulletin, 106, 377–394.
and cerebral metabolic changes in dementia of the Alz- Ober, B. A., Shenaut, G. K., Jagust, W. J., and Stillman, R. C.
heimer type. Journal of Clinical and Experimental Neuro- (1991). Automatic semantic priming with various category
psychology, 10, 576–596. relations in Alzheimer’s disease and normal aging. Psychol-
Grober, E., and Bang, S. (1995). Sentence comprehension in ogy and Aging, 6, 647–660.
Alzheimer’s disease. Developmental Neuropsychology, 11, Saykin, A. J., Flashman, L. A., Frutiger, S. A., Johnson, S. C.,
95–107. Mamourian, A. C., Moritz, C. H., et al. (1999). Neuro-
Grossman, M., D’Esposito, M., Hughes, E., Onishi, K., Bias- anatomic substrates of semantic memory impairment in
sou, N., White-Devine, T., et al. (1996). Language compre- Alzheimer’s disease: Patterns of functional MRI activation.
hension di‰culty in Alzheimer’s disease, vascular dementia, Journal of the International Neuropsychological Society, 5,
and fronto-temporal degeneration. Neurology, 47, 183–189. 377–392.
Aphasia, Global 243
Silveri, M. C., Daniele, A., Giustolisi, L., and Gainotti, G. reflect communicative intent. The contributions to con-
(1991). Dissociation between living and nonliving things in versation that are credited to these patients, therefore,
dementia of the Alzheimer type. Neurology, 41, 545–546. may be the result of the communicative partners’ need
Waters, G. S., Caplan, D., and Rochon, E. (1995). Processing for informative communication rather than the patients’
capacity and sentence comprehension in patients with Alz-
use of prosodic elements to convey intent.
heimer’s disease. Cognitive Neuropsychology, 12, 1–30.
Waters, G. S., Rochon, E., and Caplan, D. (1998). Task de- The e‰ciency of communication following global
mands and sentence comprehension in patients with aphasia depends on the type of question that is asked
dementia of the Alzheimer’s type. Brain and Language, (Herrmann et al., 1989). Better performance is observed
62, 361–397. for responses to yes/no questions than for responses to
White-Devine, T., Robinson, K. M., Onishi, K., Seidl, A., interrogative pronoun questions and narrative requests.
Biassou, N., D’Esposito, M., et al. (1996). Verb confronta- Patients with global aphasia mostly use gesture in their
tion naming and word-picture matching in Alzheimer’s dis- responses to yes/no questions. The other types of ques-
ease. Neuropsychology, 10, 495–503. tioning require increased verbal output and thus create
the need for more complex communicative responses.
Patients with global aphasia rarely take the initia-
Aphasia, Global tive to communicate or expand on shared topics (Herr-
mann et al., 1989). Their most frequent communication
strategies are those that enable them to secure com-
Global aphasia is an acquired language disorder char- prehension (e.g., indicating comprehension problems,
acterized by severe loss of comprehension with concom- requesting support for establishing comprehension). Al-
itant deficits in expressive abilities (Peach, 2001). Unlike though these individuals rely most heavily on nonverbal
other syndromes of aphasia, few distinctions are found strategies, the e‰ciency of their communication may
between preserved and impaired components of these approximate that of less impaired aphasic patients while
patients’ language. The outlook for recovery from global imposing nearly as low a burden on the communication
aphasia tends to be bleak (Kertesz and McCabe, 1977). partner (Marshall, Freed, and Phillips, 1997).
For this reason, the term global may be more prognostic Global aphasia results most commonly from a cere-
than descriptive. brovascular event in the middle cerebral artery at a
Several isolated areas of relatively preserved compre- level inferior to the point of branching. The majority of
hension have been identified in global aphasia. These lesions producing global aphasia are extensive and in-
include recognition of specific word categories (Wapner volve both prerolandic and postrolandic areas of the left
and Gardner, 1979) and famous personal and geo- hemisphere. These include Broca’s (posterior frontal)
graphic names (Yasuda and Ono, 1998). Globally and Wernicke’s (superior temporal) areas and may ex-
aphasic subjects also show relatively better comprehen- tend to subcortical areas, including the basal ganglia,
sion for personally relevant information (Van Lancker internal capsule, and thalamus (Murdoch et al., 1986).
and Nicklay, 1992). Occasionally, the lesion is confined to anterior, poste-
Globally aphasic patients are most severely impaired rior, or deep cortical and subcortical regions (Mazzocchi
in their expressive abilities. The verbal output of many and Vignolo, 1979). Global aphasia has also been de-
patients with global aphasia consists primarily of ster- scribed in patients with lesions restricted to subcortical
eotypic recurring utterances or speech automatisms. regions, including the basal ganglia, internal capsule,
Recurrent utterances have been described as being either periventricular white matter, temporal isthmus, and tha-
nondictionary verbal forms (unrecognizable) consisting lamus (Alexander, Naeser, and Palumbo, 1987).
of consonant-vowel (CV) syllables (for example, do-do- Ferro (1992) investigated the influence of lesion site
do or ma-ma-ma) or dictionary forms (word or sentence) on recovery from global aphasia. The lesions in his sub-
(Alajouanine, 1956). Blanken, Wallesch, and Papagno jects with global aphasia were grouped into five types
(1990) investigated the relationship between the nondic- with di¤ering outcomes. Type 1 included patients with
tionary forms of recurrent utterances and comprehen- large pre- and postrolandic middle cerebral artery
sion disturbances in global aphasia. Although recurrent infarcts. These patients had a very poor prognosis. The
utterances are frequently associated with comprehension remaining four groups were classified as follows: type 2,
disturbances, the overall variability in language compre- prerolandic; type 3, subcortical; type 4, parietal, and
hension suggests that speech stereotypes cannot be used type 5, double frontal and parietal lesion. Patients in
to infer the presence of severe comprehension deficits. these latter groups demonstrated variable outcomes,
Patients with global aphasia give the impression of improving generally to Broca’s or transcortical aphasia.
having more preserved communicative abilities than is Complete recovery was observed in some patients with
actually the case because of their use of the supraseg- type 2 and 3 infarcts. In contrast, Basso and Farabola
mental aspects of speech. To investigate this, deBlesser (1997) investigated recovery in three cases of aphasia
and Poeck (1985) analyzed the spontaneous utterances based on the patients’ lesion patterns. One patient had
of a group of globally aphasic subjects with output global aphasia from a large lesion involving both the
limited to CV recurrences. They concluded that the anterior and posterior language areas, while two other
length and pitch of these utterances were stereotypical patients had Broca’s and Wernicke’s aphasia from
and that the prosody of these patients did not seem to lesions restricted to either the anterior or posterior
language areas, respectively. The patient with global Several factors have been investigated for their prog-
aphasia recovered better than his two aphasic counter- nostic significance with regard to global aphasia. The
parts and had an outstanding outcome. Basso and Far- patient’s age appears to have an impact on recovery:
rabola concluded that group recovery patterns based on the younger the patient, the better the prognosis (Hol-
aphasia severity and site of lesion may not be able to land, Swindell, and Forbes, 1985). However, numerous
account for the improvement that is occasionally exceptions to this trend have been described. Age may
observed in individual patients. also relate to the type of aphasia at 1 year post stroke. In
Global aphasia has the lowest recovery rate of all the the study by Holland, Swindell, and Forbes (1985),
aphasias (Kertesz and McCabe, 1977). When assessing younger globally aphasic patients evolved to a nonfluent
the language recovery that does occur, comprehension is Broca’s aphasia while older patients evolved to increas-
found to improve more than expression (Prins, Snow, ingly severe fluent aphasias with advancing age. The
and Wagenaar, 1978). Di¤erences have been reported oldest patients remained globally aphasic.
in the temporal patterns of recovery depending on Absence of hemiparesis with global aphasia may be a
whether the subjects were receiving speech and language positive indicator for recovery. Tranel et al. (1987)
treatment. For globally aphasic patients not receiving described globally aphasic patients with dual discrete
treatment, improvement appears to be greatest during lesions (anterior and posterior cerebral) that spared
the first 6 months after onset (Pashek and Holland, the primary motor area. Global aphasia improved sig-
1988). nificantly in this group within the first 10 months after
Globally aphasic patients receiving treatment dem- onset. These conclusions are tempered by the results of
onstrate substantial improvements during the first 3–6 Keyserlingk et al. (1997), who found that chronic glob-
months but also continue to improve during the period ally aphasic patients with no history of hemiparesis did
between 6 and 12 months or more after onset. Sarno and not fare any better with regard to language outcome
Levita (1981) observed the most accelerated improve- than did their globally aphasic counterparts with hemi-
ment between 6 and 12 months after stroke. Nicholas paresis from the time of onset.
et al. (1993) found di¤erent patterns of recovery for The radiologic findings of patients with global apha-
language and nonlanguage skills during the first year. sia have also been studied to determine whether lesion
Substantial improvements in praxis and oral-gestural patterns found on computed tomography may provide
expression were noted only in the first 6 months after prognostic information. Although the findings have been
onset, while similar improvements in auditory and read- generally mixed, Naeser et al. (1990) were able to show
ing comprehension were observed only between 6 and 12 significantly better recovery of auditory comprehension
months after onset. for a group of globally aphasic subjects whose damage
The majority of patients with global aphasia will not did not include Wernicke’s area (i.e., the lesions were
recover to less severe forms of the disorder. Some limited to the subcortical temporal isthmus).
patients, however, will improve such that the condi- Finally, it appears that a lack of variability between
tion evolves into other aphasia syndromes, including auditory comprehension scores and other language
Broca’s, transcortical motor, mixed nonfluent, conduc- scores may be viewed as a negative indicator (Mark,
tion, anomic, and Wernicke’s aphasias. Occasionally, Thomas, and Berndt, 1992). The more performance
patients make a complete recovery to normal language. di¤ers among language tasks, the better the outlook.
One apparent explanation for the variability among Within auditory comprehension scores, globally aphasic
these patients might be the greater instability of lan- patients who produce yes/no responses to simple ques-
guage scores (and therefore aphasia classifications) tions, regardless of their accuracy, seem to have a better
obtained during the first 4 weeks after stroke versus outcome at 1 year post onset than those who cannot
those obtained after the first month post onset. McDer- grasp the yes/no format.
mott, Horner, and DeLong (1996) found greater magni-
tudes of change in scores and frequencies of aphasia type —Richard K. Peach
evolution in subjects tested during the first 30 days after
onset than in subjects tested in the second 30 days after
onset. Aphasia tends to be more severe during the acute
References
stage, giving observers an initial impression of global Alajouanine, M. S. (1956). Verbal realization in aphasia. Brain,
aphasia. However, globally aphasic patients who do 79, 1–28.
progress to some other form of aphasia may demon- Alexander, M. P., Naeser, M. A., and Palumbo, C. L. (1987).
strate changes that extend into the first months after Correlations of subcortical CT lesion sites and aphasia
onset (Pashek and Holland, 1988). In some cases, the profiles. Brain, 110, 961–991.
global aphasia may not begin to evolve until after the Basso, A., and Farrabola, M. (1997). Comparison of improve-
ment of aphasia in three patients with lesions in anterior,
first month has passed. The discrepancies in these posterior, and antero-posterior language areas. Neuropsy-
studies, therefore, do not appear to be simply the result chological Rehabilitation, 7, 215–230.
of the time at which the initial language observations Blanken, G., Wallesch, C. W., and Papagno, C. (1990).
were recorded. Apparently, evolution from global apha- Dissociations of language functions in aphasics with
sia is the result of a complex interaction among a num- speech automatisms (recurring utterances). Cortex, 26, 41–
ber of heretofore incompletely understood variables. 63.
Aphasia, Primary Progressive 245
deBlesser, R., and Poeck, K. (1985). Analysis of prosody in the Further Readings
spontaneous speech of patients with CV-recurring utter-
ances. Cortex, 21, 405–416. Alexander, M. P., and Loverso, F. L. (1993). A specific treat-
Ferro, J. M. (1992). The influence of infarct location on re- ment for global aphasia. Clinical Aphasiology, 21, 277–289.
covery from global aphasia. Aphasiology, 6, 415–430. Collins, M. J. (1997). Global aphasia. In L. L. LaPointe (Ed.),
Herrmann, M., Koch, U., Johannsen-Horbach, H., and Wal- Aphasia and related neurogenic language disorders (2nd ed.,
lesch, C. W. (1989). Communicative skills in chronic and pp. 133–150). New York: Thieme.
severe nonfluent aphasia. Brain and Language, 37, 339–352. Conlon, C. P., and McNeil, M. R. (1991). The e‰cacy of
Holland, A. L., Swindell, C. S., and Forbes, M. M. (1985). The treatment for two globally aphasic adults using visual action
evolution of initial global aphasia: Implications for prog- therapy. Clinical Aphasiology, 19, 185–195.
nosis. Clinical Aphasiology, 15, 169–175. Forde, E., and Humphreys, G. W. (1995). Refractory seman-
Kertesz, A., and McCabe, P. (1977). Recovery patterns and tics in global aphasia: On semantic organisation and the
prognosis in aphasia. Brain, 100, 1–18. access-storage distinction in neuropsychology. Memory, 3,
Keyserlingk, A. G., Naujokat, C., Niemann, K., Huber, W., 265–307.
and Thron, A. (1997). Global aphasia—with and without Gold, B. T., and Kertesz, A. (2000). Preserved visual lexico-
hemiparesis. European Neurology, 38, 259–267. semantics in global aphasia: A right-hemisphere contri-
Mark, V. W., Thomas, B. E., and Berndt, R. S. (1992). Factors bution? Brain and Language, 75, 359–375.
associated with improvement in global aphasia. Aphasiol- Hanlon, R. E., Lux, W. E., Dromerick, A. W. (1999). Global
ogy, 6, 121–134. aphasia without hemiparesis: Language profiles and lesion
Marshall, R. C., Freed, D. B., and Phillips, D. S. (1997). distribution. Journal of Neurology, Neurosurgery, and Psy-
Communicative e‰ciency in severe aphasia. Aphasiology, chiatry, 66, 365–369.
11, 373–385. Helm, N. A., and Barresi, B. (1980). Voluntary control of in-
Mazzocchi, F., and Vignolo, L. A. (1979). Localization of voluntary utterances: A treatment approach for severe
lesions in aphasia: Clinical CT scan correlations in stroke aphasia. Clinical Aphasiology, 10, 308–315.
patients. Cortex, 15, 627–654. Helm-Estabrooks, N., Fitzpatrick, P. M., and Barresi, B.
McDermott, F. B., Horner, J., and DeLong, E. R. (1996). (1982). Visual action therapy for global aphasia. Journal of
Evolution of acute aphasia as measured by the Western Speech and Hearing Disorders, 47, 385–389.
Aphasia Battery. Clinical Aphasiology, 24, 159–172. Masand, P., and Chaudhary, P. (1994). Methylphenidate
Murdoch, B. E., A¤ord, R. J., Ling, A. R., and Ganguley, B. treatment of poststroke depression in a patient with global
(1986). Acute computerized tomographic scans: Their value aphasia. Annals of Clinical Psychiatry, 6, 271–274.
in the localization of lesions and as prognostic indicators in McCall, D., Shelton, J. R., Weinrich, M., and Cox, D. (2000).
aphasia. Journal of Communication Disorders, 19, 311–345. The utility of computerized visual communication for
Naeser, M. A., Gaddie, A., Palumbo, C. L., and Stiassny-Eder, improving natural language in chronic global aphasia:
D. (1990). Late recovery of auditory comprehension in Implications for approaches to treatment in global aphasia.
global aphasia: Improved recovery observed with sub- Aphasiology, 14, 795–826.
cortical temporal isthmus lesion vs. Wernicke’s cortical area Naeser, M. A., Baker, E. H., Palumbo, C. L., Nicholas, M.,
lesion. Archives of Neurology, 47, 425–432. Alexander, M. P., Samaraweera, R., et al. (1998). Lesion
Nicholas, M. L., Helm-Estabrooks, N., Ward-Lonergan, J., site patterns in severe, nonverbal aphasia to predict out-
and Morgan, A. R. (1993). Evolution of severe aphasia in come with a computer-assisted treatment program. Archives
the first two years post onset. Archives of Physical Medicine of Neurology, 55, 1438–1448.
and Rehabilitation, 74, 830–836. Sarno, M. R., and Levita, E. (1979). Recovery in treated
Pashek, G. V., and Holland, A. L. (1988). Evolution of aphasia aphasia during the first year post-stroke. Stroke, 10, 663–
in the first year post-onset. Cortex, 24, 411–423. 670.
Peach, R. K. (2001). Clinical intervention for global aphasia. Van Lancker, D., and Klein, K. (1990). Preserved recognition
In R. Chapey (Ed.), Language intervention strategies in of familiar personal names in global aphasia. Brain and
aphasia and related neurogenic communication disorders (4th Language, 39, 511–529.
ed., pp. 487–512). Philadelphia: Lippincott Williams and Weinrich, M., Steele, R., Carlson, G. S., Kleczewska, M.,
Wilkins. Wertz, R. T., and Baker, E. H. (1989). Processing of visual
Prins, R. S., Snow, E., and Wagenaar, E. (1978). Recovery syntax in a globally aphasic patient. Brain and Language,
from aphasia: Spontaneous speech versus language com- 36, 391–405.
prehension. Brain and Language, 6, 192–211. Weinrich, M., Steele, R., Kleczewska, M., Carlson, G. S.,
Sarno, M. T., and Levita, E. (1981). Some observations on the Baker, E. H., and Wertz, R. T. (1989). Representation of
nature of recovery in global aphasia after stroke. Brain and ‘‘verbs’’ in a computerized visual communication system.
Language, 13, 1–12. Aphasiology, 3, 501–512.
Tranel, D., Biller, J., Damasio, H., Adams, H. P., and Cornell,
S. H. (1987). Global aphasia without hemiparesis. Archives
of Neurology, 44, 304–308.
Van Lancker, D., and Nicklay, C. K. H. (1992). Comprehen-
Aphasia, Primary Progressive
sion of personally relevant (PERL) versus novel language in
two globally aphasic patients. Aphasiology, 6, 37–61.
Wapner, W., and Gardner, H. (1979). A note on patterns of
The clinical syndrome of primary progressive aphasia is
comprehension and recovery in global aphasia. Journal of a diagnostic category applied to conditions in which
Speech and Hearing Research, 29, 765–772. individuals exhibit at least a 2-year history of progressive
Yasuda, K., and Ono, Y. (1998). Comprehension of famous language deterioration not accompanied by other cogni-
personal and geographical names in global aphasic subjects. tive symptoms and not attributable to any vascular,
Brain and Language, 61, 274–287. neoplastic, metabolic, or infectious diseases. The disease
is considered to be a focal cortical atrophy syndrome, as At present, the cause of PPA is unknown. It is
neuronal cell death is, at least initially, limited to cir- similarly unclear whether PPA is related to a distinct
cumscribed cortical regions and symptoms are isolated neuropathological entity. Investigations of the neuro-
to specific abilities and behaviors subserved by the af- pathology associated with PPA yield heterogeneous
fected region (Polk and Kertesz, 1993; Black, 1996). findings (Black, 1996). Most case studies that include
Primary progressive aphasia (PPA) is characterized by histological data report nonspecific neuronal loss in the
the gradual worsening of language dysfunction in the left perisylvian region accompanied by spongiform
context of preserved memory, judgment, insight, visuo- changes in the superficial cortical layers (Snowden et al.,
spatial skills, and overall comportment, at least until the 1992; Scheltens, Ravid, and Kamphorst, 1994). Other
later stages of the disease. Historically viewed as an cases have been reported of individuals who initially
atypical presentation of dementia, the isolated deterio- presented with progressive language disturbances but
ration of language in the context of degenerative disease eventually were diagnosed with AD (Pogacar and Wil-
has been reported for more than 100 years. PPA was first liams, 1984; Kempler et al., 1990), Pick’s disease (Hol-
recognized as a distinct clinical entity by Mesulam in land et al., 1985; Gra¤-Radford et al., 1990; Scheltens et
1982. Mesulam and Weintraub (1992) distinguished PPA al., 1990; Kertesz et al., 1994), and Creutzfeldt-Jakob
from other degenerative neurological conditions such as disease (Shuttleworth, Yates, and Paltan-Ortiz, 1985;
Alzheimer’s disease (AD) by its gradual progression of Mandell, Alexander, and Carpenter, 1989). However,
language dysfunction in the absence of more widespread most of these cases do not meet the diagnostic criteria
cognitive or behavioral disturbances for a period of for PPA according to Mesulam’s definition. Further-
at least 2 years. In some cases, the syndrome may later more, it is possible that the onset of neuropathophysio-
become associated with cognitive and behavioral dis- logical changes related to AD develop sometime after
turbances similar to those seen in frontal lobe or fronto- the onset of the focal cortical degeneration associated
temporal dementia or with motor speech disorders with PPA, thus explaining the initial appearance of iso-
(dysarthria, apraxia of speech), as observed in cortico- lated language symptoms, followed by the onset of more
basal degeneration or upper motor neuron disease (e.g., widespread cognitive involvement. Thus, autopsy find-
primary lateral sclerosis) (Kertesz and Munoz, 1997). ings of pathology associated with AD do not preclude
In other cases, the symptoms remain limited to the dis- the possibility that two distinct disease processes may co-
solution of language production and comprehension occur within the same individual. Although PPA is rec-
abilities throughout the duration of the a¤ected individ- ognized as a distinct clinical entity, the issue of whether
ual’s life. it is a distinct pathological entity remains unresolved.
The diagnosis of PPA is typically made based on a The course of the disease is quite varied. After a
2-year history of progressive language deterioration that 2-year history of isolated language symptoms, some
emerges in the absence of any marked disturbance of proportion of individuals diagnosed with PPA eventu-
other cognitive function and is not associated with any ally exhibit more widespread cognitive involvement
vascular, neoplastic, metabolic, or infectious disease consistent with a diagnosis of dementia (i.e., deteriora-
(Du¤y and Petersen, 1992; Mesulam and Weintraub, tion in two or more cognitive areas such as memory,
1992). In addition to neurological examination, medical personality changes, and the ability to independently
assessment typically includes neuroimaging and neuro- carry out activities of daily living due to cognitive as
psychological testing. During the first few years, com- opposed to physical impairments). Mesulam (1982)
puted tomographic and magnetic resonance imaging suggested waiting 5 years after the onset of symptoms
results tyipically are negative or reveal mild to moderate before trying to predict the course of cognitive involve-
atrophy of the left perisylvian region. Metabolic neuro- ment. However, there is no indication from the literature
imaging (e.g., positron emission tomography) typically that after 5 years, individuals are less likely to develop
reveals left perisylvian hypometabolism and is sensi- dementia (Rogers and Alarcon, 1999). Estimates vary
tive to abnormalities earlier than structural neuro- concerning the percentage of individuals who, after
imaging methods (e.g., Kempler et al., 1990; McDaniel, initially presenting with a 2-year history of isolated lan-
Wagner, and Greenspan, 1991). Neuropsychological as- guage dissolution, eventually exhibit widespread cogni-
sessment typically reveals relative preservation of non- tive involvement. These estimates range from 30% to
verbal cognitive function (e.g., abstract reasoning, visual 50% (Du¤y and Petersen, 1992; Mesulam and Wein-
short-term memory, visuoperceptual organization) in traub, 1992; Rogers and Alarcon, 1999). Thus it is likely
conjunction with below-normal performance on tests that between 50% and 70% of individuals diagnosed
requiring verbal processing, such as immediate verbal with PPA experience only the consequences of declining
recall, novel verbal learning, and verbal fluency (e.g., speech, language, and communication for many years.
Sapin, Anderson, and Pulaski, 1989; Weintraub, Rubin, These individuals continue to drive, manage their own
and Mesulam, 1990). Additionally, many studies have finances, and in all respects other than speech, language,
reported the presence of nonlinguistic sequelae known to and communication maintain baseline levels of perfor-
frequently co-occur in nonprogressive forms of aphasia, mance on repeated testing over many years. Thus, the
such as acalculia, dysphagia, depression, limb apraxia, course is variable, with some patients progressing rap-
and apraxia of speech (Rogers and Alarcon, 1999). idly, but for others, the course can be quite prolonged,
Aphasia, Primary Progressive 247
typically taking 6 or 7 years before they develop severe the fluent or nonfluent classes has not been established,
aphasia or mutism. For these individuals, the global the hypothesis that individuals with a fluent profile are
cognitive deterioration does not occur as early in the more likely to develop generalized cognitive involvement
disease process or to the same extent as seen in AD. than those with a nonfluent profile has received much
Researchers have attempted to identify clinical symp- attention (e.g., Weintraub, Rubin, and Mesulam, 1990;
toms that could serve as reliable predictors of eventual Du¤y and Peterson, 1992; Snowden et al., 1992; Rogers
cognitive status. The profile of speech and language and Alarcon, 1999).
dysfunction has been investigated as a prognostic indi- The hypothesis that the profile of language impair-
cator of whether an individual is likely to develop gen- ment may predict the course of generalized cognitive
eralized dementia. The hypothesis that a language profile involvement has been investigated primarily through
consistent with fluent aphasia as opposed to nonfluent systematic review of the literature. Du¤y and Peterson
aphasia predicts a course of earlier cognitive decline has (1992) reviewed 28 reports, published between 1977 and
been investigated through case study (e.g., Snowden et 1990, describing 54 individuals with PPA. Approxi-
al., 1992) and systematic review of the literature (Du¤y mately half of the 54 individuals developed generalized
and Petersen, 1992; Mesulam and Weintraub, 1992; dementia, but none of the 12 patients identified with
Rogers and Alarcon, 1999). The fluency dimension in nonfluent profiles evinced generalized cognitive involve-
aphasia refers to a dichotomous classification based on ment. This finding was interpreted as supporting the hy-
the nature of the spoken language disturbance. Individ- pothesis that a nonfluent profile may predict a longer
uals with fluent aphasia produce speech at normal to fast duration of isolated language symptoms, or perhaps a
rates with normal to long phrase length and few, if any, lower probability of developing widespread cognitive
phonological speech errors. Verbal output in fluent involvement. Mesulam and Weintraub (1992) reviewed
aphasia is characterized as logorrheic (running on and 63 cases of PPA. The average duration of isolated lan-
on) and neologistic (novel words), and it tends to be guage symptoms was 5.2 years, and six individuals
empty (devoid of meaningful content). Disturbances of exhibited isolated language symptoms for more than 10
auditory comprehension and anomia (i.e., word retrieval years. They reported that, compared to either probable
di‰culties) were the primary symptoms in most cases of or pathologically confirmed AD, the PPA group con-
fluent PPA reviewed by Rogers and Alarcon (1999). tained more males, a higher incidence of onset before
Hodges and Patterson (1996) found these to be the pri- age 65, and a greater incidence of nonfluent aphasia. A
mary presenting complaints in the individuals with fluent nonfluent profile was never observed in the AD group,
PPA that they labeled semantic dementia. Unlike indi- whereas the distribution of fluent and nonfluent profiles
viduals with AD, individuals with semantic dementia or in the PPA group was balanced (48% fluent, 44% non-
fluent PPA have relatively spared episodic (both recent fluent). According to Mesulam and Weintraub (1992),
and old autobiographical) memory but exhibit loss of not all individuals with probable AD exhibit aphasic
semantic memory, especially as concerns mapping con- disturbances, but those who do, exhibit only the fluent
cepts to their spoken form (Kertesz and Munoz, 1997). aphasia subtype.
A nonfluent profile is characterized by e¤ortful Rogers and Alarcon (1999) reviewed 57 articles pub-
speech, sparse output with decreased phrase length, lished between 1982 and 1998 describing 147 individuals
impaired access to phonological word-form information, with relatively isolated deterioration of speech and lan-
infrequent use of grammatical markers, and disturbed guage for at least 2 years. Thirty-seven patients had
prosody, and is frequently associated with apraxia of fluent PPA, 88 had nonfluent PPA, and in 22 cases the
speech. Auditory comprehension, while a¤ected, gener- type of aphasia was indeterminate. Among the individ-
ally deteriorates later than expressive language skills. uals with fluent PPA, 27% exhibited dementia at the time
Nonfluent spontaneous speech and the production of of the published report. Among the nonfluent PPA
phonemic paraphasias in naming have been proposed as group, 37% were reported to have developed generalized
important characteristics distinguishing PPA from the dementia. Of the 22 individuals with an undetermined
aphasia-like symptoms in AD. However, the language type of aphasia, 73% exhibited clinical symptoms of
disorder evinced by individuals with PPA rarely fits dementia. The average duration of isolated language
neatly and unambiguously into the fluency typology. symptoms among the 77 individuals who developed
Snowden et al. (1992) described a group of individuals generalized dementia was 5 years (6.6 years in fluent
with PPA who exhibited expressive and receptive dis- PPA, 4.3 years in nonfluent PPA, and 3.7 years among
ruptions of phonology and semantics. Be’land and Ska those with an undetermined type of aphasia). The ag-
(1992) described an individual with PPA who presented gregate data in this review did not support the hypothe-
with ‘‘a syntactic deficit as in Broca’s . . . auditory com- sis that individuals with a nonfluent profile are less likely
prehension deficits of the Wernicke’s aphasia type . . . to develop generalized cognitive involvement than those
and phonemic approximations as found in conduction with a fluent profile. Furthermore, the data did not sup-
aphasia’’ (p. 358). Although there is no accepted classi- port the hypothesis that a nonfluent profile predicts a
fication for individuals exhibiting this profile, the term longer duration of isolated language symptoms. Despite
‘‘mixed aphasia’’ has been applied (e.g., Snowden et al., the unequal number of patients in each of the fluency
1992). Although reliable sorting of aphasia in PPA into groups, the lack of control regarding the time post onset
across cases, and the possibility that there may be con- References
siderable impetus to report PPA in individuals who do
not exhibit generalized dementia, it does not appear that Be’land, R., and Ska, B. (1992). Interaction between verbal and
the fluency profile is a reliable predictor of eventual gestural language in progressive aphasia: A longitudinal
case study. Brain and Language, 43, 355–385.
cognitive status. Black, S. E. (1996). Focal cortical atrophy syndromes. Brain
The initial symptoms of PPA vary from individual to and Cognition, 31, 188–229.
individual, but anomia is the most commonly reported Broussolle, E., Serge, B., Tommasi, M., Laurent, B., Bazin, B.,
presenting complaint in patients with both fluent and Cinotti, L., et al. (1996). Slowly progressive anarthria with
nonfluent PPA (Mesulam, 1987; Rogers and Alarcon, late opercular syndrome: A variant form of frontal cortical
1999). Another early symptom, particularly in nonfluent atrophy syndromes. Journal of the Neurological Sciences,
PPA, is slow, hesitant speech, frequently punctuated by 144, 44–58.
long pauses and filler words (‘‘um,’’ ‘‘uh’’). Although Du¤y, J. R., and Peterson, R. C. (1992). Primary progressive
this may represent simply one of many manifestations of aphasia. Aphasiology, 6, 1–13.
anomia, it also portends the language formulation di‰- Gra¤-Radford, N. R., Damasio, A. R., Hyman, B. T., Hart,
M. N., Tranel, D., Damasio, H., et al. (1990). Progressive
culties that later render the speech of these individuals aphasia in a patient with Pick’s disease: A neuropsycho-
telegraphic (reduced mean length of utterance consisting logical, radiologic, and anatomic study. Neurology, 40,
primarily of content words). Impaired access to phono- 620–626.
logic form is frequently associated with later-emerging Hart, R. P., Beach, W. A., and Taylor, J. R. (1997). A case
spelling di‰culties, although partial access to initial let- of progressive apraxia of speech and non-fluent aphasia.
ters and syllable structure may be retained for many Aphasiology, 11, 73–82.
years (Rogers and Alarcon, 1998, 1999). Di‰culties with Hodges, J. R., and Patterson, K. (1996). Nonfluent progressive
phonologic encoding have also been reported in cases of aphasia and semantic dementia: A comparative neuropsy-
fluent PPA. Tyler et al. (1997) described the anomic dif- chological study. Journal of the International Neuropsycho-
ficulties of one individual with fluent PPA as impaired logical Society, 2, 511–524.
Holland, A. L., McBurney, D. H., Moossy, J., and Reinmuth,
mapping between the semantic lexicon and output pho- O. M. (1985). The dissolution of language in Pick’s disease
nology. More typically, individuals who eventually with neurofibrillary tangles: A case study. Brain and Lan-
exhibit fluent PPA initially complain of di‰culties un- guage, 24, 36–58.
derstanding spoken language (Hodges and Patterson, Karbe, H., Kertesz, A., and Polk, M. (1993). Profiles of lan-
1996), whereas individuals with nonfluent PPA typically guage impairment in primary progressive aphasia. Archives
exhibit preserved language comprehension in the early of Neurology, 50, 193–200.
stages (Karbe, Kertesz, and Polk, 1993). Some individ- Kempler, D., Metter, E. J., Riege, W. H., Jackson, C. A.,
uals with a nonfluent presentation of PPA initially ex- Benson, D. F., and Hanson, W. R. (1990). Slowly progres-
hibit motor symptoms consistent with a diagnosis of sive aphasia: Three cases with language, memory, CT and
dysarthria or apraxia of speech. The initial symptom of PET data. Journal of Neurology, Neurosurgery, and Psychi-
atry, 53, 987–993.
progressive speech apraxia has been reported by Hart, Kertesz, A., Hudson, L., Mackenzie, I. R. A., and Munoz,
Beach, and Taylor (1997). Dysarthria and orofacial D. G. (1994). The pathology and nosology of primary pro-
apraxia have been reported as initial symptoms in a gressive aphasia. Neurology, 44, 2065–2072.
variation of PPA labeled slowly progressive anarthria Kertesz, A., and Munoz, D. G. (1997). Primary progressive
(Broussolle et al., 1996). The relationships between and aphasia. Clinical Neuroscience, 4(2), 95–102.
among nonfluent PPA, primary progressive apraxia, Mandell, A. M., Alexander, M. P., and Carpenter, S. (1989).
slowly progressive anarthria, corticobasal degeneration, Creutzfeld-Jacob disease presenting as isolated aphasia.
primary lateral sclerosis, and Parkinson’s disease is of Neurology, 39, 55–58.
interest, because these conditions exhibit considerable McDaniel, K. D., Wagner, M. T., and Greenspan, B. S.
clinical overlap and in some cases share similar patho- (1991). The role of brain single photon emission computed
tomography in the diagnosis of primary progressive apha-
physiology. In the later stages of all of these syndromes, sia. Archives of Neurology, 48, 1257–1260.
individuals lose the ability to communicate by speech Mesulam, M. M. (1982). Slowly progressive aphasia without
and are uniformly described as ‘‘mute,’’ despite apparent generalized dementia. Annals of Neurology, 11, 592–598.
di¤erences regarding the underlying nature of the spe- Mesulam, M. M. (1987). Primary progressive aphasia: Di¤er-
cific impairment precluding the production of spoken entiation from Alzheimer’s disease. Annals of Neurology,
language. 22, 533–534. Editorial.
Regardless of the subtype of PPA, the progressive loss Mesulam, M. M., and Weintraub, S. (1992). Spectrum of pri-
of language need not result in the total cessation of all mary progressive aphasia. In M. N. Rossor (Ed.), Unusual
communication, as there are augmentative and alterna- dementias. Bailliere’s Clinical Neurology, 1(3), 583–609.
tive communication tools and strategies that can be Pogacar, S., and Williams, R. S. (1984). Alzheimer’s disease
presenting as slowly progressive aphasia. Rhode Island
proactively established so that the individual with PPA Medical Journal, 67, 181–185.
can maximize communication competency at every Polk, M., and Kertesz, A. (1993). Music and language in dege-
stage, despite the relentless deterioration in speech and nerative disease of the brain. Brain and Cognition, 22, 98–117.
language (Rogers, King, and Alarcon, 2000). Rogers, M. A., and Alarcon, N. B. (1998). Dissolution of
spoken language in primary progressive aphasia. Aphasiol-
—Margaret A. Rogers ogy, 12, 635–650.
Aphasia: The Classical Syndromes 249
Rogers, M. A., and Alarcon, N. B. (1999). Characteristics or by administering a standardized test, for example, the
and management of primary progressive aphasia. Neuro- Western Aphasia Battery (WAB) (Kertesz, 1982) or
physiology and Neurogenic Speech and Language Disorders the Boston Diagnostic Aphasia Examination (BDAE)
Newsletter, 9(4), 12–26. (Goodglass and Kaplan, 1983). The following describes
Rogers, M. A., King, J. M., and Alarcon, N. B. (2000). Pro-
each syndrome and the assumed site of lesion associated
active management of primary progressive aphasia. In D.
Beukelman, K. Yorkston, and J. Reichle (Eds.), Augmen- with each.
tative Communication for Adults with Neurogenic and Neu-
romuscular Disabilities (pp. 305–337). Baltimore: Paul Global Aphasia. This nonfluent syndrome is associated
H. Brookes. with a large left hemisphere lesion that may involve the
Sapin, L. R., Anderson, F. H., and Pulaski, P. D. (1989). Pro- frontal, temporal, and parietal lobes, insula, and under-
gressive aphasia without dementia: Further documentation. lying white matter, including the arcuate fasciculus
Annals of Neurology, 25, 411–413. (Dronkers and Larsen, 2001). It is the most severe of
Scheltens, P., Hazenberg, G. J., Lindeboom, J., Valk, J., and all of the syndromes. Auditory comprehension is
Wolters, E. C. (1990). A case of progressive aphasia without markedly reduced and may be limited to inconsistent
dementia: ‘‘temporal’’ Pick’s disease? Journal of Neurology,
comprehension of single words. Oral-expressive lan-
Neurosurgery, and Psychiatry, 53, 79–80.
Scheltens, P., Ravid, R., and Kamphorst, W. (1994). Patho- guage is sparse, often limited to a recurring intelligible—
logic finding in a case of primary progressive aphasia. Neu- ‘‘bees, bees, bees’’—or unintelligible—‘‘doobe, doobe,
rology, 44, 279–282. doobe’’—stereotype. Other automatic expressions, in-
Shuttleworth, E. C., Yates, A. J., and Paltan-Ortiz, J. D. cluding profanity and counting, may also be preserved.
(1985). Creutzfeld-Jacob disease presenting as progressive Globally aphasic patients are unable to repeat words,
aphasia. Journal of the National Medical Association, 77, and no naming ability is present. Reading and writing
649–655. abilities are essentially absent.
Snowden, J. S., Neary, D., Mann, D. M. A., Goulding, P. J.,
and Testa, H. J. (1992). Progressive language disorder due Broca’s Aphasia. This nonfluent syndrome receives its
to lobar atrophy. Annals of Neurology, 31, 174–183.
name from the early reports by Paul Broca (1861a,
Tyler, L. K., Moss, H. F., Patterson, K., and Hodges, J.
(1997). The gradual deterioration of syntax and semantics 1861b). Classical localization of the lesion resulting in
in a patient with progressive aphasia. Brain and Language, Broca’s aphasia is damage in the left, inferior frontal
56, 426–476. gyrus—Broca’s area (Brodmann’s areas 44 and 45)
Weintraub, S., Rubin, N., and Mesulam, M. (1990). Primary (Damasio, 1992). However, both historical (Marie,
progressive aphasia: Longitudinal course, neuropsychologi- 1906) and contemporary (Mohr, 1976; Dronkers et al.,
cal profile, and language features. Archives of Neurology, 1992) reports question the classical lesion localization.
47, 1329–1335. Patients have been described who have lesions in Broca’s
area without Broca’s aphasia, and other patients have
Broca’s aphasia but their lesion does not involve Broca’s
Aphasia: The Classical Syndromes area. Auditory comprehension is relatively good for
single words and short sentences. However, comprehen-
sion of grammatically complex sentences is impaired.
Aphasia is an acquired disorder of language subsequent Their phrase length is short, and they produce halting,
to brain damage that a¤ects auditory comprehension, telegraphic, agrammatic speech that contains, primarily,
reading, oral-expressive language, and writing. Early content words. For example, describing how he spent
observations by Broca (1861a, 1861b) and Wernicke the weekend, a patient with Broca’s aphasia related,
(1874) suggested that aphasia might be classified into a ‘‘Ah, frat, no Saturday, ah, frisk, no, fishing, son.’’
variety of syndromes, or types, based on di¤erences in Repetition of words and sentences is poor. Naming
auditory comprehension and oral-expressive language ability is disrupted, and reading and writing show a
behaviors. Moreover, di¤erent syndromes were believed range of impairment.
to result from di¤erent sites of brain damage. Revisions
of early classification systems yield a contemporary tax- Transcortical Motor Aphasia. Lichtheim (1885) pro-
onomy that comprises seven syndromes: global, Broca’s, vided an early description of this nonfluent syndrome,
transcortical motor, Wernicke’s, transcortical sensory, and he observed that the site of lesion spared the peri-
conduction, and anomic (Benson, 1988; Kertesz, 1979). sylvian language region. Currently, it is believed that the
Classification is based on the aphasic person’s auditory lesion resulting in transcortical motor aphasia is smaller
comprehension, oral-expressive fluency (phrase length than that causing Broca’s aphasia and is in the left
and syntax), spoken repetition, and naming abilities. The anterior-superior frontal lobe (Alexander, Benson, and
seven syndromes can be divided into nonfluent, those Stuss, 1989). With one exception, language behaviors are
with short phrase length and impaired morphosyntax similar to those in Broca’s aphasia: good auditory com-
(global, Broca’s, and transcortical motor), and fluent, prehension for short, noncomplex sentences; short, halt-
those with longer phrase length and apparent preserva- ing, agrammatic phrase production; disrupted naming
tion of syntactic structures (Wernicke’s, transcortical ability; and impaired reading and writing. The exception
sensory, conduction, and anomic). An aphasic person’s is relatively preserved ability to repeat phrases and sen-
syndrome may be determined by informal examination tences. Essentially, patients with transcortical motor
aphasia repeat much better than would be predicted comprehension and oral-expressive abilities. Although
from their disrupted, volitional productions. auditory comprehension is relatively good, it is not
perfect. And, while oral-expressive language is fluent
Wernicke’s Aphasia. This fluent syndrome received its (longer phrase length and a semblance of syntax), pa-
name from the early report by Carl Wernicke (1874). tients with conduction aphasia make numerous phono-
The traditional belief is that Wernicke’s aphasia results logical errors and replace intended words with words
from a lesion in Wernicke’s area (posterior Brodmann’s that sound similar. Naming, reading, and writing abili-
area 22) in the left hemisphere auditory-association cor- ties are disrupted to some extent.
tex (Damasio, 1992), with extension into Brodmann’s
areas 37, 39, and 40. However, Basso et al. (1985) have Anomic Aphasia. This fluent syndrome is the least se-
reported cases of Wernicke’s aphasia resulting from vere. Anomia—word-finding di‰culty—is present in all
exclusively anterior lesions, and Dronkers, Redfern, and aphasic syndromes; thus, localization of the lesion that
Ludy (1995) have found Wernicke’s aphasia in patients results in anomic aphasia is not precise. It can be found
whose lesions also spared Wernicke’s area. Spoken subsequent to anterior or posterior lesions (Dronkers
phrase length averages six or more words, and a sem- and Larsen, 2001), and Kreisler et al. (2000) report
blance of syntax is present. However, the oral-expressive anomic aphasia resulting from a lesion in the thalamus;
behavior includes phonological errors and jargon. One medial temporal area; or frontal cortex, insula, and an-
patient with Wernicke’s aphasia described where he went terior part of the temporal gyri. Patients with anomic
to college, Washington and Lee University, by relating, aphasia display longer phrase length and preserved syn-
‘‘There was the old one, ah Frulich, and the young one, tax; mild, if any, auditory comprehension deficits; good
young hunter, ah, Frulich and young hunter or Brulan.’’ repetition ability; and mild reading and writing impair-
A salient sign in Wernicke’s aphasia is impaired auditory ment. Frequently, the anomic patient will substitute
comprehension. These patients understand little of what synonyms for the intended words or replace the desired
is said to them, and the deficit cannot be explained by word with a generalization, for example, ‘‘thing’’ or
reduced auditory acuity. In addition, verbal repetition ‘‘stu¤.’’
and naming abilities are impaired, and there is a range of
reading and writing deficits.
Cautions
Transcortical Sensory Aphasia. This fluent syndrome The classification of aphasia into the classical syndromes
may result from lesions surrounding Wernicke’s is not exempt from controversy. Some (Caramazza,
area, posteriorly or inferiorly (Damasio, 1992). Oral- 1984; Caplan, 1987) have challenged its validity. Darley
expressive language is similar to that seen in Wernicke’s (1982) suggested that aphasic people di¤er on the basis
aphasia: longer phrase length and relatively good syntax. of severity or the presence of a coexisting communica-
Auditory comprehension is impaired, similar to that in tion disorder, frequently apraxia of speech. He advo-
Wernicke’s aphasia, and naming, reading, and writing cated viewing aphasia unmodified by adjectives. The
deficits are present. The salient sign in transcortical sen- relationship between the site of lesion and the corre-
sory aphasia is preserved verbal repetition ability for sponding syndrome is also controversial. The classical
words and, frequently, long and complex sentences. Es- sites of lesion for most aphasic syndromes are chal-
sentially, transcortical sensory aphasia patients repeat lenged by exceptions (Basso et al., 1985; Murdoch, 1988;
better than one would predict based on their impaired Dronkers and Larsen, 2001). Some of the inconsistency
auditory comprehension. may result from the time post onset when behavioral
observations are made. Improvement in aphasia over
Conduction Aphasia. Wernicke (1874) described this time results in approximately 50% of aphasic patients
fluent syndrome. Lesion localization has been contro- changing from one syndrome to another (Kertesz and
versial. Geschwind (1965) proposed that conduction McCabe, 1977). Thus, an acutely aphasic patient with an
aphasia results from a lesion in the arcuate fasciculus inferior left frontal gyrus lesion may display the expected
that disrupts connections between the posterior language Broca’s aphasia; however, at 6 months after onset, the
comprehension area and the anterior motor speech area. same patient’s language characteristics may resemble
Damasio (1992) suggested that conduction aphasia re- anomic aphasia. Confusion may also result from the
sults from damage in the left hemisphere supramarginal methods employed to classify the aphasias. For example,
gyrus (Brodmann’s area 40), with or without extension classifications made with the WAB do not always agree
to the white matter beneath the insula, or damage in with those made with the BDAE (Wertz, Deal, and
the left primary auditory cortices (Brodmann’s areas 41 Robinson, 1984). Finally, controversy and confusion
and 42), the insula, and the underlying white matter. may result from misuse of the term syndrome (Benson
Dronkers et al. (1998) reported that all of their patients and Ardila, 1996). The behavioral profile that constitutes
with conduction aphasia had a lesion that involved the a specific aphasic ‘‘syndrome’’ is characterized by a
posterior-superior temporal gyrus, often extending into range of impairment and not by identical performance
the inferior parietal lobule. The salient sign in conduc- among all individuals within a specific syndrome. In
tion aphasia is impaired ability to repeat phrases and many, certainly not all, aphasic people, impaired behav-
sentences in the presence of relatively good auditory ioral features—fluency, auditory comprehension, verbal
Aphasia: The Classical Syndromes 251
repetition, naming—tend to result in di¤erent clusters Lichtheim, L. (1885). On aphasia. Brain, 7, 433–484.
that represent di¤erent profiles. These have led to the Marie, P. (1906). Revision de la question de l’aphasie: La troi-
development and use of the classical syndromes in sieme circonvolution frontale gauche ne joue aucun role
aphasia. special dans la fonction du langage. Semaine Medicale, 26,
241–247.
—Robert T. Wertz, Nina F. Dronkers, and Jennifer Mohr, J. P. (1976). Broca’s area and Broca’s aphasia. In
Ogar H. Whitaker and H. Whitaker (Eds.), Studies in neuro-
linguistics (vol. 1, pp. 201–233). New York: Academic
References Press.
Murdoch, B. E. (1988). Computerized tomographic scan-
Alexander, M. P., Benson, D. F., and Stuss, D. T. (1989). ning: Its contributions to the understanding of the neuro-
Frontal lobes and language. Brain and Language, 37, anatomical basis of aphasia. Aphasiology, 2, 437–462.
656–691. Wernicke, C. (1874). Der aphasische Symptomencomplex. Bres-
Basso, A., Lecours, A. R., Moraschini, S., and Vanier, M. lau, Poland: Kohn and Weigert.
(1985). Anatomical correlations of the aphasias as defined Wertz, R. T., Deal, J. L., and Robinson, A. J. (1984). Classi-
through computerized tomography: Exceptions. Brain and fying the aphasias: A comparison of the Boston Diagnostic
Language, 26, 201–229. Aphasia Examination and the Western Aphasia Battery. In
Benson, D. F. (1988). Classical syndromes of aphasia. In R. H. Brookshire (Ed.), Clinical Aphasiology Conference
F. Boller and J. Grafman (Eds.), Handbook of neuro- proceedings (pp. 40–47). Minneapolis: BRK.
psychology (vol. 1, pp. 269–280). Amsterdam: Elsevier.
Benson, D. F., and Ardila, A. (1996). Aphasia: A clinical per-
spective. New York: Oxford University Press. Further Readings
Broca, P. (1861a). Remarques sur le siège de la faculté du lan-
gage articulé, suivies d’une observation d’aphémie (perte de Alexander, M. P. (1997). Aphasia: Clinical and anatomic
la parole). Bulletins de la Société d’Anatomie (Paris), 2e aspects. In T. E. Feinberg and M. J. Farah (Eds.), Behav-
série, 6, 330–357. ioral neurology and neuropsychology (pp. 133–149). New
Broca, P. (1861b). Nouvelle observation d’aphémie produite York: McGraw-Hill.
par une lésion de la troisième circonvolution frontale. Bul- Bogen, J. E., and Bogen, G. M. (1976). Wernicke’s region:
letins de la Société d’Anatomie (Paris), 2e série, 6, 398–407. Where is it? Annals of the New York Academy of Sciences,
Caplan, D. (1987). Neurolinguistics and linguistic aphasiology. 280, 834–843.
Cambridge, U.K.: Cambridge University Press. Caplan, D., Hildebrandt, N., and Marlies, N. (1996). Loca-
Caramazza, A. (1984). The logic of neuropsychological re- tion of lesions in stroke patients with deficits in syntactic
search and the problem of patient classification in aphasia. processing in sentence comprehension. Brain, 119, 933–
Brain and Language, 21, 9–20. 949.
Damasio, A. R. (1992). Aphasia. New England Journal of Damasio, H., and Damasio, A. R. (1989). Lesion analysis in
Medicine, 326, 531–539. neuropsychology. New York: Oxford University Press.
Darley, F. L. (1982). Aphasia. Philadelphia: Saunders. Dronkers, N. F. (1996). A new brain region for coordinating
Dronkers, N. F., and Larsen, J. (2001). Neuroanatomy of the speech articulation. Nature, 384, 159–161.
classical syndromes of aphasia. In R. S. Berndt (Ed.), Freedman, M., Alexander, M. P., and Naeser, M. A. (1984).
Handbook of neuropsychology (2nd ed., vol. 3). Amsterdam: Anatomic basis of transcortical motor aphasia. Neurology,
Elsevier. 34, 409–417.
Dronkers, N. F., Redfern, B. B., and Ludy, C. A. (1995). Le- Geschwind, N. (1972). Language and the brain. Scientific
sion localization in chronic Wernicke’s aphasia. Brain and American, 226, 76–83.
Language, 51, 62–65. Godefroy, O., Duhamel, A., Leclerc, X., Saint Michel, T.,
Dronkers, N. F., Redfern, B. B., Ludy, C., and Baldo, J. Henon, H., and Leys, D. (1998). Brain-behavior relation-
(1998). Brain regions associated with conduction aphasia ships: Models for the study of brain damaged patients.
and echoic rehearsal. Journal of the International Neuro- Brain, 121, 1545–1556.
psychological Society, 4, 23–24. Goodglass, H. (1993). Understanding aphasia. San Diego, CA:
Dronkers, N. F., Shapiro, J. K., Redfern, B., and Knight, Academic Press.
R. T. (1992). The role of Broca’s area in Broca’s aphasia. Kertesz, A., Sheppard, A., and MacKenzie, R. (1982). Local-
Journal of Clinical and Experimental Neuropsychology, 14, ization in transcortical sensory aphasia. Archives of Neurol-
52–53. ogy, 39, 475–478.
Geschwind, N. (1965). Disconnection syndromes in animals Kirshner, H. S., Alexander, M., Lorch, M. P., and Wertz,
and man. Brain, 88, 237–294. R. T. (1999). Disorders of speech and language. Continuum:
Goodglass, H., and Kaplan, E. (1983). The Boston Diagnostic Lifelong Learning in Neurology, 5, 1–237.
Aphasia Examination. Philadelphia: Lea and Febiger. Naeser, M. A., and Hayward, R. W. (1978). Lesion localiza-
Kertesz, A. (1979). Aphasia and associated disorders: Taxon- tion in aphasia with computed tomography and the Boston
omy, localization, and recovery. New York: Grune and Diagnostic Aphasia Exam. Neurology, 28, 545–551.
Stratton. Palumbo, C. L., Alexander, M. P., and Naeser, M. A. (1992).
Kertesz, A. (1982). The Western Aphasia Battery. New York: CT scan lesion sites associated with conduction aphasia. In
Grune and Stratton. K. Se (Ed.), Conduction aphasia (pp. 51–75). Hillsdale, NJ:
Kertesz, A., and McCabe, P. (1977). Recovery patterns and Erlbaum.
prognosis in aphasia. Brain, 100, 1–18. Sarno, M. E. (Ed.). (1998). Acquired aphasia (3rd ed.). San
Kreisler, A., Godefroy, O., Dalmaire, C., Debachy, B., Diego, CA: Academic Press.
Leclercq, M., Pruvo, J.-P., et al. (2000). The anatomy of Stemmer, B., and Whitaker, H. A. (Eds.). (1998). Handbook of
aphasia revisited. Neurology, 54, 1117–1122. neurolinguistics. San Diego, CA: Academic Press.
Vignolo, L. A., Boccardi, E., and Caverni, L. (1986). Un- tinguishes Wernicke’s aphasia from transcortical sensory
expected CT-scan findings in global aphasia. Cortex, 22, aphasia. Writing mirrors the speech output. Hand-
55–69. writing is usually well formed, but the text is without
Wertz, R. T. (1983). Classifying the aphasias: Commodious or content, and jargonagraphia may occur. Because of
chimerical? In R. H. Brookshire (Ed.), Clinical Aphasiology
posterior lesions, hemiparesis is present in rare cases, but
Conference proceedings (pp. 296–303). Minneapolis: BRK.
Willmes, K., and Poeck, K. (1998). To what extent can aphasic visual field defects are more common. Many patients
syndromes be localized? Brain, 116, 1527–1540. also show signs of anosognosia, especially during the
acute stage of the illness. In most cases, the use of ges-
tural communication or pantomime is a¤ected as well.
Aphasia, Wernicke’s Patients not traditionally classified as having aphasia
may also show language disturbances resembling Wer-
nicke’s aphasia, such as patients with schizophrenia, de-
A new concept in aphasiology was created when Wer- mentia, or semantic dementia, a fluent form of primary
nicke (1874/1977) described ten patients with di¤erent progressive aphasia.
forms of aphasia, and showed that two of the patients Some authors suggest that Wernicke’s aphasia is not a
had fluent but paraphasic speech with poor comprehen- uniform entity but includes many variants. Forms of
sion (i.e., sensory aphasia). At autopsy of another pa- neologistic, semantic, and phonemic jargon and pure
tient, a lesion was found in the left posterior temporal word deafness may all be grouped under Wernicke’s
lobe. This type of aphasia has been called by many aphasia. Pure word deafness is a rare disorder charac-
names, including receptive, impressive, sensory, or more terized by severe di‰culties in speech comprehension
generally fluent aphasia. In most of the current classifi- and repetition with preservation of other language func-
cation systems, this type of syndrome is called Wer- tions, including the comprehension of nonverbal sounds
nicke’s aphasia. It a¤ects 15%–25% of all patients with and music (Kirshner, Webb, and Duncan, 1981). How-
aphasia (Laska et al., 2001). ever, when Buchman et al. (1986) reviewed 34 published
Although the exact boundaries of Wernicke’s area are cases, they were unable to find any really pure cases—
controversial, the typical lesion associated with Wer- that is, cases without any other more generalized per-
nicke’s aphasia is most often located in the posterior ceptual disorders that could be classified as acoustic
temporal area. The middle and superior temporal lobe agnosia or mild language disorders such as paraphasia,
posterior to the primary auditory cortex are a¤ected in naming di‰culties, and reading and writing disorders.
almost all cases. The primary auditory cortex is also Most of the patients with ‘‘pure’’ word deafness have
often a¤ected, as are the white matter subjacent to the had bilateral temporal lesions, but some patients with
posterior temporal lobe, the angular gyrus, and the unilateral left hemisphere lesions have been described
supramarginal gyrus. In rare cases, restricted subcortical (Takahashi et al., 1992).
lesions may result in Wernicke’s aphasia and hemiplegia, Personality factors may play a role in the clinical ex-
the latter being uncommon in cases with cortical lesions. pression of aphasia. In some views, jargon aphasia is not
Recent studies have not changed these classical views of solely a linguistic deficit. Rochford (1974) suggested that
the clinico-anatomical relations of initial aphasia. a pathological arousal mechanism and lack of control
Patients with Wernicke’s aphasia are usually older were crucial to jargon aphasia. Weinstein and Lyerly
than patients with Broca’s aphasia. However, some rare (1976) suggested that jargon aphasia could emanate
cases of children with acquired fluent aphasia and a from abnormal adaptation to the aphasic speech dis-
posterior temporal lesion have been described (Paquier order. They found a significant di¤erence in premorbid
and Van Dongen, 1991). Ferro and Madureira (1997) personality between patients with jargon aphasia and
have attributed the age di¤erence between patients with those without jargon aphasia. Most of their patients with
fluent aphasia and those with nonfluent aphasia to the jargon aphasia had a strong premorbid tendency to deny
higher prevalence of posterior infarcts in older patients. illness or openly expressed fear of illness, indicating the
The most common etiological factor in vascular Wer- importance of anosognosic features in jargon aphasia.
nicke’s aphasia is cardiac embolus, which more often Linguistically, patients with Wernicke’s aphasia speak
a¤ects the temporal area, whereas carotid atherosclerotic with normal fluency and prosody without articulatory
infarctions are in most cases located in the frontoparietal distortions. They often provide long and fluent answers
area (Harrison and Marshall, 1987; Knepper et al., (logorrhoea) to simple questions. In fact, patients with
1989). Coppens (1991), however, points to a higher Wernicke’s aphasia produce an equal number of words
mortality rate in older patients with stroke, which might as persons without aphasia in spontaneous speech.
cause a selection bias in studies showing a relationship However, they show less lexical variety, a high propor-
between age and type of aphasia. tion of repetitions, and empty speech (Bates et al., 2001).
The typical clinical signs of Wernicke’s aphasia This may give an impression of grammatically correct
include poor comprehension of spoken and written lan- speech, but the meaning of the utterances is lost because
guage and fluent but paraphasic (phonemic and seman- of a high proportion of paraphasias and neologisms
tic) speech. In some cases, neologistic jargon may occur. (Lecours and Lhermitte, 1983). This type of speech error
Naming is also severely a¤ected, and phonemic or se- is called paragrammatism. Patients with Wernicke’s
mantic prompting is of no help. Poor repetition dis- aphasia show morphological errors, but less so than
Aphasia, Wernicke’s 253
patients with Broca’s aphasia (Bates et al., 2001). How- on the sample studied, more than half of patients with
ever, there is some evidence that in highly inflected lan- aphasia will show evolution to another type of aphasia
guages such as Finnish, the number of errors is higher during the first year after the onset of illness (Ross and
on inflected words than on the lexical stems (Niemi, Wertz, 2001). Patients with initial Wernicke’s aphasia
Koivuselkä-Sallinen, and Laine, 1987). At least in spon- will usually evolve to have a conduction or transcortical
taneous speech, distorted sentence structure in utterances type of aphasia, and may evolve further to have anomic
of patients with Wernicke’s aphasia is related to the aphasia (Pashek and Holland, 1988). On the other hand,
lexical-semantic di‰culties rather than to morphosyn- the condition of elderly patients with initial global
tactic problems (Helasvuo, Klippi, and Laakso, 2001). aphasia tends to evolve to Wernicke’s aphasia during
The same has been found in sentence comprehension. the recovery period, and the condition of younger
Patients with Wernicke’s aphasia performed correctly patients evolves to Broca’s aphasia. This could explain
only on sentences that did not require semantic oper- why only one-third of patients with fluent aphasia and
ations (Pinango and Zurif, 2001). According to these lesions in Wernicke’s area have a persisting aphasia, and
findings, the deficit in phonemic hearing does not explain only slightly more than half of patients with chronic
the nature of comprehension problems in patients with Wernicke’s aphasia have lesions in Wernicke’s area
Wernicke’s aphasia. (Dronkers, 2000).
Most patients show skill in pragmatic abilities, such
as using gaze direction and other nonverbal actions in —Matti Lehtihalmes
conversation. Unawareness of one’s own speech errors
usually occurs initially in Wernicke’s aphasia, but some References
degree of auditory self-monitoring develops after onset,
and patients then begin to use various self-repair strat- Bates, E., Reilly, J., Wulfeck, B., Dronkers, N., Opie, M.,
egies to manage conversation (Laakso, 1997). In con- Fenson, J., et al. (2001). Di¤erential e¤ects of unilateral
trast to self-repair sequences in nonaphasic speakers, lesions on language production in children and adults. Brain
and Language, 79, 223–265.
these sequences are very lengthy and often unsuccessful.
Buchman, A. S., Garron, D. C., Trost-Cardamone, J. E.,
The initial severity of the aphasia is considered the Wichter, M. D., and Schwartz, M. (1986). Word deafness:
most important single factor in predicting recovery from One hundred years later. Journal of Neurology, Neurosur-
aphasia. Wernicke’s aphasia is usually tantamount to gery, and Psychiatry, 49, 489–499.
severe aphasia. In a study by Ross and Wertz (2001), of Coppens, P. (1991). Why are Wernicke’s aphasia patients older
all patients with aphasia, those with Wernicke’s aphasia than Broca’s? A critical view of the hypotheses. Aphasiol-
and global aphasia showed the most severe impairment ogy, 5, 279–290.
in language functions and communication. These Dronkers, N. F. (2000). The pursuit of brain–language rela-
patients showed only limited recovery when measured tionships. Brain and Language, 71, 59–61.
at the impairment level by the Boston Diagnostic Apha- Ferro, J. M., and Madureira, S. (1997). Aphasia type, age and
cerebral infarct localisation. Journal of Neurology, 244,
sia Examination (BDAE) and at the disability level by 505–509.
CADL. In addition to initial severity of aphasia, supra- Harrison, M. J. G., and Marshall, J. (1987). Wernicke aphasia
marginal and angular gyri involvements seem to relate to and cardiac embolism. Journal of Neurology, Neurosurgery,
poor recovery in comparison with cases without exten- and Psychiatry, 50, 938–939.
sion to the posterior superior temporal gyrus (Kertesz, Helasvuo, M.-L., Klippi, A., and Laakso, M. (2001). Gram-
Lau, and Polk, 1993). matical structuring in Broca’s and Wernicke’s aphasia in
Patients who have recovered from Wernicke’s aphasia Finnish. Journal of Neurolinguistics, 14, 231–254.
have shown a clear increase in activation in the right Karbe, H., Thiel, A., Weber-Luxenburger, G., Herholz, K.,
perisylvian area, suggesting a functional reorganization Kessler, J., and Heiss, W. D. (1998). Brain plasticity in
of the language with the help of the right hemisphere poststroke aphasia: What is the contribution of the right
hemisphere? Brain and Language, 64, 215–230.
(Weiller et al., 1995). However, Karbe et al. (1998) Kertesz, A., Lau, W. K., and Polk, M. (1993). The structural
reported that increased activity in the right hemisphere determinants of recovery in Wernicke’s aphasia. Brain and
was present in patients with poor recovery and reflected Language, 44, 153–164.
the large lesions in the left hemisphere. Patients with Kirshner, H. S., Webb, W. G., and Duncan, G. W. (1981).
good recovery showed increased activation in the left Word deafness in Wernicke’s aphasia. Journal of Neurology,
hemisphere surrounding the damaged area. Neurosurgery, and Psychiatry, 44, 197–201.
The classification of aphasia depends strongly on the Knepper, L. E., Biller, J., Tranel, D., Adams, H. P., and
methods used in the assessment. The major diagnostic Marsh, E. E., III (1989). Etiology of stroke in patients with
tests, such as the BDAE, the Western Aphasia Battery Wernicke’s aphasia. Stroke, 20, 1730–1732.
(WAB), or the Aachener Aphasie Test (AAT), have Laakso, M. (1997). Self-initiated repair by fluent aphasic
speakers in conversation (Studia Fennica Linguistica 8).
slightly di¤erent criteria for classification. For example, Helsinki: Finnish Literature Society.
whereas the WAB assigns all patients to some aphasia Laska, A. C., Hellbom, A., Murray, V., Kahan, T., and von
classification, up to 70% of patients examined with the Arbin, M. (2001). Aphasia in acute stroke and relation to
BDAE might be designated as having unclassified apha- outcome. Journal of Internal Medicine, 249, 413–422.
sia. Another issue that confuses classification is the time Lecours, A. R., and Lhermitte, F. (1983). Clinical forms
after onset at which the evaluation is done. Depending of aphasia. In A. R. Lecours, F. Lhermitte, and B.
Bryans (Eds.), Aphasiology (pp. 76–108). London: Baillière Mackenzie, C. (2000). The relevance of education and age in
Tindall. the assessment of discourse comprehension. Clinical Lin-
Niemi, J., Koivuselkä-Sallinen, P., and Laine, M. (1987). Lex- guistics and Phonetics, 14, 151–161.
ical deformations are sensitive to morphosyntactic factors in Maneta, A., Marshall, J., and Lindsay, J. (2001). Direct and
posterior aphasia. Aphasiology, 1, 53–57. indirect therapy for sound deafness. International Journal of
Paquier, P., and Van Dongen, H. R. (1991). Two contrasting Language and Communication Disorders, 36, 91–106.
cases of fluent aphasia in children. Aphasiology, 5, 235–245. Marshall, J., Pring, T., Chiat, S., and Robson, J. (1995/6).
Pashek, G. V., and Holland, A. L. (1988). Evolution of aphasia Calling a salad a federation: An investigation of se-
in the first year post-onset. Cortex, 24, 411–424. mantic jargon. Part 1. Nouns. Journal of Neurolinguistics, 9,
Pinango, M. M., and Zurif, E. B. (2001). Semantic opera- 237–250.
tions in aphasic comprehension: Implications for the corti- Marshall, J., Chiat, S., Robson, J., and Pring, T. (1995/6).
cal organization of language. Brain and Language, 79, 297– Calling a salad a federation: An investigation of se-
308. mantic jargon. Part 2. Verbs. Journal of Neurolinguistics, 9,
Rochford, G. (1974). Are jargon dysphasics dysphasic? British 251–260.
Journal of Disorders of Communication, 9, 35–44. Niemi, J., and Laine, M. (1989). The English language bias in
Ross, K. B., and Wertz, R. T. (2001). Type and severity of neurolinguistics: New languages give new perspectives.
aphasia during the first seven months poststroke. Journal of Aphasiology, 3, 155–159.
Medical Speech-Language Pathology, 9, 31–53. Obler, L. K., Albert, M. L., Goodglass, H., and Benson, D. F.
Takahashi, N., Kawamura, M., Shonotou, H., Hirayama, K., (1978). Aphasia type and aging. Brain and Language, 6,
Kaga, K., and Shindo, M. (1992). Pure word deafness due 318–322.
to left hemisphere damage. Cortex, 28, 295–303. Peterson, L. N., and Kirshner, H. S. (1981). Gestural impair-
Weiller, C., Isensee, C., Rijntjes, M., Huber, W., Müller, S., ment and gestural ability in aphasia. Brain and Language,
Bier, D., et al. (1995). Recovery from Wernicke’s aphasia: 14, 333–348.
A positron emission tomographic study. Annals of Neurol- Portno¤, L. A. (1982). Schizophrenia and semantic aphasia: A
ogy, 37, 723–732. clinical comparison. International Journal of Neuroscience,
Weinstein, E. A., and Lyerly, O. G. (1976). Personality factors 16, 189–197.
in jargon aphasia. Cortex, 12, 122–133. Simmons, N. N., and Buckingham, H. W., Jr. (1992). Recov-
Wernicke, C. (1874). Der Aphatische Symptomencomplex: ery in jargonaphasia. Aphasiology, 6, 403–414.
Eine psychologische Studie auf anatomischer Basis. Breslau: Van Dongen, H. R., Loonen, M. C. B., and Van Dongen, K. J.
Cohn and Weigert. (English translation by Gertrude H. (1985). Anatomical basis for acquired fluent aphasia in
Eggert, in G. H. Eggert [1977]. Wernicke’s works on apha- children. Annals of Neurology, 17, 306–309.
sia: A sourcebook and review [pp. 91–145]. The Hague:
Mouton.)
Further Readings Aphasia Treatment: Computer-Aided

Best, W., and Howard, D. (1994). Word sound deafness Rehabilitation
resolved. Aphasiology, 8, 223–256.
Blanken, G., Dittmann, J., and Sinn, H. (1994). Old solutions
to new problems: A contribution to today’s relevance The role of technology in treating clinical aphasiology
of Carl Wernicke’s theory of aphasia. Aphasiology, 8, has been evolving since studies first demonstrated the
207–221. feasibility of using computers in the treatment of aphasic
Cooper, W. E., and Zurif, E. B. (1983). Aphasia: Information adults. This journey began with remote access to treat-
processing in language production and reception. In B. ment in rural settings using large computer systems
Butterworth (Ed.), Language production Vol. 2: Develop- over the telephone. There followed the introduction and
ment, writing and other language processes (pp. 225–256). widespread use of personal computers and portable
London: Academic Press. computers, with the subsequent development of com-
Garrard, P., and Hodges, J. R. (1999). Semantic dementia: plex software and multimedia programs. This changing
Implications for the neural basis of language and meaning.
course is not simply the result of technological progress
Aphasiology, 13, 609–623.
Godefroy, O., Dubois, C., Debachy, B., Leclerc, M., and but represents greater understanding by clinicians and
Kreisler, A. (2002). Vascular aphasias: Main characteristics researchers of the strengths and limitations of computer-
of patients hospitalised in acute stroke units. Stroke, 33, aided treatment for aphasia and related disorders.
702–705. Four common types of treatment activities are ap-
Graham-Keegan, L., and Caspari, I. (1997). Wernicke’s apha- propriate for presentation on a computer: stimulation,
sia. In L. L. LaPointe (Ed.), Aphasia and related neurogenic drill and practice, simulations, and tutorials. Stimulation
language disorders (2nd ed., pp. 42–62). New York: Thieme. activities o¤er the participant numerous opportunities to
Hickok, G., and Poeppel, D. (2000). Towards a functional respond quickly and usually correctly over a relatively
neuroanatomy of speech perception. Trends in Cognitive long period of time for the purpose of maintaining and
Sciences, 4, 131–138.
Kertesz, A. (1983). Localization of lesions in Wernicke’s
stabilizing the underlying processes or skills, rather than
aphasia. In A. Kertesz (Ed.), Localization in neuro- simply learning a new set of responses. It is easy to de-
psychology (pp. 209–230). New York: Academic Press. sign computer programs that contain a large database of
Kertesz, A., Appell, J., and Fisman, M. (1986). The dissolution stimuli, and then to control variables (e.g., word length)
of language in Alzheimer’s disease. Canadian Journal of as a function of the participant’s response accuracy. Drill
Neurological Sciences, 13, 415–418. and practice exercises teach specific information so that
Aphasia Treatment: Computer-Aided Rehabilitation 255
the participant can function more independently. Stimuli sponse to the patient’s particular needs. This relation
are selected for a particular participant and goal, and between clinician and computer permits considerable
therefore authoring or editing options are required to flexibility, thus compensating for limitations inherent
modify stimuli and target responses. A limited number in the COT approach. In addition to treatment pro-
of stimuli are presented and are replaced with new items grams written specifically for use with clinicians
when a criterion is reached. Because response accuracy is (e.g., Loverso, Prescott, and Selinger, 1992; Van de
the focus of the task, the program should present an in- Sandt-Koenderman, 1994), other software, such as
tervention or cues to help shape the participant’s re- COT word processing or a variety of video game pro-
sponse toward the target response. Drill and practice grams, and even some web-based activities, can be
programs are convergent tasks. Simulations (‘‘micro- used in this manner as long as clinicians provide patients
worlds’’) create a structured environment in which a with the additional information needed to perform the
problem is presented and possible solutions are o¤ered. task.
Simulations may be simple, such as presenting a series of Augmentative communication devices (ACDs) in
text describing a problem, followed by a list of possible aphasia treatment usually refer to small computers
solutions. Complex programs more closely simulate real- functioning as sophisticated ‘‘electronic pointing
life situations by using pictures and sound. Simulations boards.’’ Unlike devices used by patients with severe
provide the opportunity to design divergent treatment dysarthria or other speech problems, patients with
tasks that more fully recruit real-life problem-solving aphasia and other disorders a¤ecting language cannot
strategies than those addressed by more traditional, type the words they are unable to speak. ACDs designed
convergent computer tasks, for example, by including for these individuals may incorporate digitized speech,
several alternative but equally correct solutions. Wheth- pictures, animation, and a minimum of text. To facilitate
er computer simulations can improve communicative both expression and comprehension, some devices are
behavior in real-life settings remains to be tested. Tuto- designed to permit both communication partners to ex-
rials o¤er valuable information regarding communica- change messages. Although ACDs vary in design and
tion and quality of life to the family, friends, and others organization, some devices allow modification of the
who can influence the aphasic patient’s world for the organization and semantic content in response to the
better. Computer tutorials present information com- particular needs and abilities of each patient. Re-
monly found in patient information pamphlets but in an searchers such as Aftonomos, Steele, and Wertz (1997)
interactive, self-paced, format. Tutorials can incorporate claim that for some patients with aphasia, treatment
features of an expert system, in which information is utilizing ACDs results in improved performance on
provided in response to a patient/family profile. standardized tests and in ‘‘natural language’’ (speaking,
Computers can be incorporated into treatment in listening, etc.).
three fundamentally di¤erent ways. Computer-only A speech-language pathologist educated in commu-
treatment (COT) software is designed to allow patients, nication theory and su‰ciently experienced in the clinic
as part of clinician-provided treatment programs, to and in real life can create an infinite number of novel
practice alone at the computer, without the simultaneous and relevant treatment activities and evaluate and mod-
supervision or direct assistance from clinicians. The op- ify these activities in response to unique and idiosyn-
eration of COT programs should be familiar and intu- cratic patient behavior, even when those behaviors are
itive for the patients, particularly those who cannot read unanticipated, for example, resulting from previously
lengthy or complex text. The program may alter in a unacknowledged associations. In contrast, computer-
limited way elements of the task in response to patient provided treatment is based on a finite set of rules that
performance, such as reducing the number of stimuli or are stated explicitly to evoke specific response that are
presenting predetermined cues in response to errors (e.g., (at best) likely to occur at particular points during a fu-
Seron et al., 1980; Katz and Nagy, 1984). As all posture treatment session, as in a game of chess. However,
sible cues and therapeutic strategies that may be helpful unlike chess, many elements of language, communica-
to every patient cannot be anticipated, intervention is tion, and rehabilitation are not well delineated or uni-
commonly simplistic, inflexible, or nonexistent. Conse- versally recognized.
quentally, COT programs are usually convergent tasks In describing four interrelated properties of com-
(e.g., drills) with simple, obvious goals and, if e¤ective, puters and programming, Bolter (1984) helped aphasiol-
increase treatment e‰ciency as supplementary tasks ogists better understand the relation between computers
designed to reinforce or help generalize recently learned and treatment. (1) Computers deal with discrete (or dig-
skills. ital ) units of data, typically unambiguous numbers or
Computer-assisted treatment (CAT) software is pre- other values, but many fundamental and recurrent
sented on a computer as the patient and clinician work aspects of communication are not clearly defined or un-
together on the program. The role for the computer is derstood. Whether during treatment or real-life, pur-
limited to supportive functions (e.g., presenting stimuli, poseful interactions, language and communication units
storing responses, summarizing performance). The clini- are often incomplete, emanate (simultaneously and
cian retains responsibility for the most therapeutically sequentially) from various modalities, and depend on
critical components, particularly designing, administer- context and past experiences. (2) Computers are conven-
ing, monitoring, and modifying the intervention in re- tional, that is, they apply predetermined rules to symbols
that have no e¤ect on the rules. Regardless of the value References

of the symbols, the sophistication of the program
(‘‘complex branching algorithms’’) or the outcome of the Aftonomos, L. B., Steele, R. D., and Wertz, R. T. (1997).
Promoting recovery in chronic aphasia with an interactive
program, the rules never change (e.g., Katz and Nagy,
technology. Archives of Physical Medicine, 78, 841–846.
1984). In aphasia treatment, all the rules of treatment are Bolter, J. D. (1984). Turing’s man: Western culture in the com-
not known and those that are may not be correct for all puter age. Chapel Hill: University of North Carolina Press.
conditions, requiring clinicians to monitor and some- Crerar, M. A., Ellis, A. W., and Dean, E. C. (1996). Re-
times modify rules for each patient. Unlike clinician- mediation of sentence processing deficits in aphasia using
provided treatment, computer-provided treatment does a computer-based microworld. Brain and Language, 52,
not modify the rules of the treatment it applies; there- 229–275.
fore, computers do not respond adequately to the Darley, F. L. (1972). The e‰cacy of language rehabilitation in
dynamics of patient performance. (3) Computers are aphasia. Journal of Speech and Hearing Research, 37, 3–21.
finite. Their rules and symbols are defined within the Guyard, H., Masson, V., and Quiniou, R. (1990). Computer-
based aphasia treatment meets artificial intelligence.
program. Except in a limited way for artificial intelli-
gence software, unanticipated responses do not result in Katz, R. C., and Nagy, V. T. (1984). An intelligent computer-
the creation of new rules and symbols. Therapy demands based task for chronic aphasic patients. In R. H. Brookshire
a di¤erent approach. Not all therapeutically relevant (Ed.), Clinical aphasiology: 1984 conference proceedings
behaviors have been identified, and those that have often (pp. 159–165). Minneapolis: BRK.
vary in relevance among patients and situations. Treat- Katz, R. C., and Wertz, R. T. (1992). Computerized hierar-
ment software that incorporates artificial intelligence chical reading treatment in aphasia. Aphasiology, 6, 165–
(e.g., Guyard, Masson, and Quiniou, 1990) only roughly 177.
approximates this approach, usually by reducing the Katz, R. C., and Wertz, R. T. (1997). The e‰cacy of
scope of the task. (4) Computers are isolated from real- computer-provided reading treatment for chronic aphasic
adults. Journal of Speech, Language, and Hearing Research,
world experience. Problems and their solutions exist 40, 493–507.
within the boundaries of the program and frequently Loverso, F. L., Prescott, T. E., and Selinger, M. (1992).
have little to do with reality. Problems are created with Microcomputer treatment applications in aphasiology.
the intention that they can be solved by manipulating Aphasiology, 6, 155–163.
symbols in a predetermined, finite series of steps. This Mills, R. H. (1982). Microcomputerized auditory comprehen-
lack of ‘‘world knowledge’’ is perhaps the most sig- sion training. In R. H. Brookshire (Ed.), Clinical aphasiol-
nificant obstacle to comprehensive computer-provided ogy: 1982 conference proceedings (pp. 147–152).
treatment, as it limits the ability of programs to present Minneapolis: BRK.
real-world problems with multiple options and practical, Robinson, I. (1990). Does computerized cognitive rehabilita-
flexible solutions. tion work? A review. Aphasiology, 4, 381–405.
Seron, X., Deloche, G., Moulard, G., and Rouselle, M. (1980).
In an extensive review of the literature, Robinson A computer-based therapy for the treatment of aphasic
(1990) reported that the e‰cacy of computer-aided subjects with writing disorders. Journal of Speech and
treatment for aphasia and for other cognitive dis- Hearing Disorders, 45, 45–58.
orders had not been demonstrated. The research studies Van de Sandt-Koenderman, M. (1994). Multicue, a computer
reviewed su¤ered from inappropriate experimental de- program for word finding in aphasia, 1. International Con-
signs, insu‰cient statistical analyses, and other defi- gress Language-Therapy-Computers, Graz, Austria: Uni-
ciencies. Robinson stated that some researchers obscured versity of Graz.
the basic question by asking what works with whom
under what conditions (see Darley, 1972). Further Readings
There is no substitute for carefully controlled,
randomized studies, the documentation of which has Bengston, V. L. (1973). Self-determination: A social psycho-
become the scientific foundation of aphasiology. Re- logic perspective on helping the aged. Geriatrics, 28, 1118–
1130.
search reported over the last 15 years has assessed the Colby, K. M., and Kraemer, H. C. (1975). An objective mea-
e¤ect of particular computerized interventions (e.g., surement of nonspeaking children’s performance with a
Crerar, Ellis, and Dean, 1996) and incorporated in- computer-controlled program for the stimulation of lan-
creasingly sophisticated designs and greater numbers of guage behavior. Journal of Autism and Childhood Schizo-
subjects to assess the e‰cacy of computer-aided aphasia phrenia, 5, 139–146.
treatment, from simple A-B-A designs and comparisons Crerar, M. A., and Ellis, A. W. (1995). Computer-based ther-
of pre- and posttreatment testing (Mills, 1982) to large, apy for aphasia: Towards second generation clinical tools.
randomly assigned single-subject studies (Loverso, Pres- In C. Code and D. Müller (Eds.), Treatment of aphasia:
cott, and Selinger, 1992) and group studies incorporat- From theory to practice (pp. 223–250). London: Whurr.
ing several conditions (Katz and Wertz, 1992, 1997). Dean, E. C. (1987). Microcomputers and aphasia. Aphasiology,
1, 267–270.
The e‰cacy of computerized aphasia treatment is being Delouche, G., Dordain, M., and Kremin, H. (1993). Rehabil-
addressed one study at a time. itation of confrontational naming in aphasia: Relations
See also speech and language disorders in chil- between oral and written modalities. Aphasiology, 7, 201–
dren: computer-based approaches. 216.
Enderby, P. (1987). Microcomputers in assessment, rehabilita-
—Richard C. Katz tion and recreation. Aphasiology, 1, 151–166.
Aphasia Treatment: Pharmacological Approaches 257
Glisky, E. L., Schlacter, D. L., and Tuving, E. (1986). Learn- Aphasia Treatment: Pharmacological
ing and retention of computer-related vocabulary in
memory-impaired patients: Method of vanishing cues. Approaches
Journal of Clinical and Experimental Neuropsychology, 8,
292–312.
Katz, R. C. (1986). Aphasia treatment and microcomputers. San
For over a century, clinicians have sought to use phar-
Diego, CA: College-Hill Press.
Katz, R. C. (1987). E‰cacy of aphasia treatment using micro- macological agents to remediate aphasia and/or to help
computers. Aphasiology, 1, 141–150. compensate for it, but without much success (Small,
Katz, R. C. (1990). Intelligent computerized treatment or arti- 1994). However, in several limited areas, the use of drug
ficial aphasia therapy. Aphasiology, 4, 621–624. treatment as an adjunct to traditional (behavioral)
Katz, R. C., and Hallowell, B. (1999). Computer applica- speech therapy has shown some promise. Furthermore,
tions in treatment of acquired neurogenic communication the future for pharmacological and other biological
disorders in adults. Seminars in Speech and Language, 20, treatments is bright (Small, 2000).
251–269. In this brief article, we restrict our attention to the
Loverso, F. L. (1987). Unfounded expectations: Computers in subacute and chronic phases of aphasia, rather than the
rehabilitation. Aphasiology, 1, 157–160.
treatments for acute neurological injury. Much of this
Loverso, F. L., Prescott, T. E., Selinger, M., and Riley, L.
(1988). Comparison of two modes of aphasia treatment: work has focused on a class of neurotransmitters, the
Clinician and computer-clinician assisted. Clinical Aphasi- catecholamines, which occur throughout the brain. Two
ology, 18, 297–319. important catecholamines are dopamine, produced by
Lucas, R. W. (1977). A study of patients’ attitudes to com- the substantia nigra, and norepinephrine, produced by
puter interrogation. International Journal of Man-Machine the locus coeruleus. Since the catecholamines do not
Studies, 9, 69–86. cross the blood-brain barrier, typical therapy involves
Odor, J. P. (1988). Student models in machine-mediated learn- agents that increase catecholamine concentrations.
ing. Journal of Mental Deficiency Research, 32, 247–256. Dextro-amphetamine is the most popular agent of this
Petheram, B. (1988). Enabling stroke victims to interact with a sort, acting nonspecifically to increase the concentrations
minicomputer—comparison of input devices. International
Disabilities Studies, 10, 73–80.
of all the catecholamines at synaptic junctions. In the
Rushako¤, G. E. (1984). Clinical applications in communica- early studies, a single dose of dextro-amphetamine led to
tion disorders. In A. H. Schwartz (Ed.), Handbook of accelerated recovery in a beam-walking task in rats with
microcomputer applications in communication disorders (pp. unilateral motor cortex ablation (Feeney, Gonzalez, and
148–171). San Diego, CA: College-Hill Press. Law, 1982). By contrast, a single dose of haloperidol, a
Scott, C., and Byng, S. (1989). Computer-assisted remediation dopamine antagonist, blocked the amphetamine e¤ect.
of a homophone comprehension disorder in surface dys- When given alone, haloperidol delays spontaneous re-
lexia. Aphasiology, 3, 301–320. covery, whereas phenoxybenzamine, an a1 -adrenergic
Steele, R. D., Weinrich, M., Kleczewska, M. K., Wertz, R. T., antagonist, reproduces the deficits in animals that have
and Carlson, G. S. (1987). Evaluating performance of recovered. Similar results have now been obtained in
severely aphasic patients on a computer-aided visual com-
munication system. In R. H. Brookshire (Ed.), Clinical
several species and in motor and visual systems (Feeney
aphasiology: 1987 conference proceedings (pp. 46–54). Min- and Sutton, 1987; Feeney, 1997).
neapolis: BRK. The role of antidepressant medications in stroke re-
Vaughn, G. R., Amster, W. W., Bess, J. C., Gilbert, D. J., covery, including selective serotonin reuptake inhibitors
Kearrns, K. P., Rudd, A. K., et al. (1987). E‰cacy of (SSRIs) and the less selective tricyclics, is not straight-
remote treatment of aphasia by TEL-communicology. forward. Neither fluoxetine (an SSRI) nor direct admin-
Journal of Rehabilitative Research and Development, 25, istration of serotonin seems e¤ective in improving motor
446–447. function in a rat model (Boyeson, Harmon, and Jones,
Weinrich, M., McCall, D., Weber, C., Thomas, K., and 1994), whereas the tricyclics have produced mixed e¤ects
Thornburg, L. (1995). Training on an iconic communica- (Boyeson and Harmon, 1993; Boyeson, Harmon, and
tion system for severe aphasia can improve natural lan-
guage production. Aphasiology, 9, 343–364.
Jones, 1994).
Wertz, R. T., Dronkers, N. F., Knight, R. T., Shenaut, G. K., The role of the inhibitory transmitter g-aminobutyric
and Deal, J. L. (1987). Rehabilitation of neurogenic com- acid (GABA) has been investigated in several studies.
munication disorders in remote settings. Journal of Reha- Intracortical infusion of GABA exacerbates the hemi-
bilitative Research and Development, 25, 432–433. paresis produced by a small motor cortex lesion in rats
(Schallert et al., 1992). The short-term administration of
diazepam, a benzodiazepine and indirect GABA agonist,
can permanently impede sensory cortical recovery. Fur-
thermore, phenobarbital, which may have some GABA
agonist e¤ects, also impedes recovery (Hernandez and
Holling, 1994).
A number of early studies were conducted with lim-
ited success and are summarized in two recent re-
views (Small, 1994; Small, 2001). Modern studies of
pharmacological treatment of aphasia have focused on
neurotransmitter systems, particularly catecholaminergic
systems. A number of studies have been conducted, not Piracetam is a GABA derivative that acts as a noo-
all well designed. At present, no drug has been ade- tropic agent on the central nervous system (CNS) and
quately shown to help aphasia recovery to the degree facilitates cholinergic and excitatory amine neuro-
that would be necessary to recommend its general use. transmission (Giurgea, Greindl, and Preat, 1983; Vernon
Several biological approaches that have been tested and Sorkin, 1991). A large multicenter trial (De Deyn
for aphasia recovery have been shown to be ine¤ec- et al., 1997) showed no e¤ect on the primary outcome
tive (e.g., meprobamate, hyperbaric oxygen) or very measure of neurological status at 4 weeks. Another study
poorly supported by published results (e.g., amobarbital, showed improvement at 12 weeks that was no longer
selegiline). present at 24 weeks (Enderby et al., 1994). A later
Several studies have examined the role of dopamine. study (Huber et al., 1997) showed that improvement
Albert et al. (1988) described a case suggesting that the occurred on only one subtest (written language) of a
dopamine agonist bromocriptine helped restore speech large battery.
fluency in a patient with transcortical motor aphasia A crucial issue that must be addressed as part of
resulting from stroke. Another case report failed to find aphasia rehabilitation is depression, since it can ad-
a similar benefit in a similar patient (MacLennan et al., versely a¤ect language recovery. Following stroke,
1991). Two additional patients improved in speech flu- patients with depression have more cognitive impair-
ency but not in other aspects of language function ment than patients with comparable lesions but no de-
(Gupta and Mlcoch, 1992). Another open-label study pression (Downhill and Robinson, 1994). Furthermore,
suggested some e¤ect in moderate but not severe aphasia in stroke patients matched for severity and lesion local-
(Sabe, Leiguarda, and Starkstein, 1992). ization, patients with depression experience a poorer
Dextro-amphetamine is perhaps the most widely recovery than their nondepressed counterparts in func-
studied biological treatment for the chronic e¤ects of tional status and cognitive performance (Morris,
stroke, including aphasia (Walker-Batson, 2000), yet Raphael, and Robinson, 1992).
both its clinical e‰cacy and mode of action remain Growth factors have been advocated for a variety of
unclear (Goldstein, 2000). Nonetheless, evidence from purposes in the treatment of stroke, particularly in the
both animal model systems and humans make this a acute phase of ischemic brain injury (Zhang et al., 1999),
somewhat promising drug for the treatment of aphasia. but also as neuroprotective agents useful in the chronic
In a study of motor rehabilitation from stroke, phase of recovery from brain injury (Olson et al., 1994).
more than half of a group of 88 elderly patients who had Gene transfer into the CNS might ultimately play a role
been classified as rehabilitation failures because of poor in delivering trophins or other agents to damaged brain
progress in physical therapy benefited from dextro- areas and thus to help stimulate recovery or increased
amphetamine as an adjunct to physical therapy synaptic connectivity.
(Clark and Mankikar, 1979). A double-blind placebo- Neural stem cells are multipotential precursors to
controlled study replicated this finding (Crisostomo et neurons and glia. Attempts have been made to induce
al., 1988) in eight patients with ischemic stroke. di¤erentiation into neurons and glial cells, and further
An early study of aphasia pharmacotherapy with into specific types of such cells. Specifically with regard
methylphenidate (similar to amphetamine) and chlor- to stroke and the treatment of cortical lesions, fetal neo-
diazepoxide (a benzodiazepine) revealed no e¤ects cortical cells have been successfully transplanted into the
(Darley, Keith, and Sasanuma, 1977). A recent pro- site of cortical lesions (Johansson, 2000), and have even
spective double-blind study of motor recovery with been shown to migrate selectively into areas of experi-
methylphenidate found a significant di¤erence in motor mental cell death (Macklis, 1993; Snyder et al., 1997).
and depression scores on some measures but not others One important consequence of this research into the
(Grade et al., 1998). Methylphenidate may play a role in pharmacology of aphasia is the realization that drugs are
the treatment of post-stroke depression (Lazarus et al., not only potential therapeutic adjuncts but can also
1992). serve as inhibitors of successful recovery. The first study
Walker-Batson et al. (1991) have reported a study of of this type, by Porch and colleagues (1985), showed that
six aphasic patients with ischemic cerebral infarction. patients taking certain medicines performed more poorly
Each patient took dextro-amphetamine every 4 days, on an aphasia battery than those who were not taking
about an hour prior to a session of speech and language medicines.
therapy, for a total of ten sessions. When evaluated after In a formal retrospective (chart review) study,
this period, the patients performed at significantly above patients with motor deficits after stroke were divided
expected levels. into one group taking a number of specific drugs at the
Of potential significance, the studies showing benefi- time of stroke (clonidine, prazosin, any dopamine re-
cial e¤ects of dextro-amphetamine, that is, the study by ceptor antagonist [e.g., neuroleptics], benzodiazepines,
Walker-Batson et al. (1991), a motor study by the same phenytoin, or phenobarbital) and another group that
group (Walker-Batson et al., 1995), and the other study was not (Goldstein, 1995). Statistical analysis revealed
of motor rehabilitation (Crisostomo et al., 1988), share that whereas patient demographics and stroke severity
the common feature of evaluating the drug as an en- were similar between groups, motor recovery time was
hancement to behavioral or physical therapy rather than significantly shorter in the patients who were not taking
as a monotherapeutic panacea. one of these drugs.
Aphasia Treatment: Pharmacological Approaches 259
This work has profound relevance to aphasia reha- with piracetam. Members of the Piracetam in Acute Stroke
bilitation. To maximize functional recovery, it is impor- Study (PASS) Group. Stroke, 28, 2347–2352.
tant not only to ensure adequate behavioral treatment, Downhill, J. R., Jr., and Robinson, R. G. (1994). Longitudinal
but also to ensure the appropriate neurobiological sub- assessment of depression and cognitive impairment follow-
ing stroke. Journal of Nervous and Mental Disease, 182,
strate for this treatment (or, more concretely, to ensure
425–431.
that this substrate is not pharmacologically inhibited Enderby, P., Broeckx, J., Hospers, W., Schildermans, F.,
from responding to the therapy). It is thus advisable for and Deberdt, W. (1994). E¤ect of piracetam on recovery
patients in aphasia therapy to avoid drugs that might and rehabilitation after stroke: A double-blind, placebo-
interfere with catecholaminergic or GABAergic function controlled study. Clinical Neuropharmacology, 17, 320–
or that are thought to delay recovery by empirical study. 331.
Current knowledge suggests a potential beneficial ef- Feeney, D. M. (1997). From laboratory to clinic: nora-
fect of increased CNS catecholamines on human motor drenergic enhancement of physical therapy for stroke or
recovery and aphasia rehabilitation. Although pharma- trauma patients. Advances in Neurology, 73, 383–394.
cotherapy cannot be used as a replacement for speech Feeney, D. M., Gonzalez, A., and Law, W. A. (1982). Am-
phetamine, haloperidol, and experience interact to a¤ect
and language therapy, it might play a role as an adjunct,
rate of recovery after motor cortex injury. Science, 217,
and other biological therapies, such as cell transplanta- 855–857.
tion, might play a role in concert with carefully designed, Feeney, D. M., and Sutton, R. L. (1987). Pharmacotherapy for
adaptive learning approaches. In the published cases recovery of function after brain injury. Critical Reviews in
where pharmacotherapy improved language functioning Neurobiology, 3, 135–197.
in people with aphasia, it was used adjunctively, not Giurgea, C. E., Greindl, M. G., and Preat, S. (1983). Noo-
alone. It is very likely that pharmacotherapy has a valu- tropic drugs and aging. Acta Psychiatrica Belgium, 83,
able role to play as an adjunct to behavioral rehabilita- 349–358.
tion to decrease performance variability and to improve Goldstein, L. B. (1995). Common drugs may influence motor
mean performance in patients with mild to moderate recovery after stroke. The Sygen in Acute Stroke Study
Investigators [see comments]. Neurology, 45, 865–871.
language dysfunction from cerebral infarctions.
Goldstein, L. B. (2000). E¤ects of amphetamines and small re-
lated molecules on recovery after stroke in animals and
man. Neuropharmacology, 39, 852–859.
Acknowledgments Grade, C., Redford, B., Chrostowski, J., Toussaint, L., and
Research was supported by the National Institute Blackwell, B. (1998). Methylphenidate in early post-
of Deafness and Other Communication Disorders, stroke recovery: A double-blind, placebo-controlled study.
National Institutes of Health, under grant NIH DC Archives of Physical Medicine and Rehabilitation, 79, 1047–
1050.
R01-3378.
Gupta, S. R., and Mlcoch, A. G. (1992). Bromocriptine treat-
—Steven L. Small ment of nonfluent aphasia. Archives of Physical Medicine
and Rehabilitation, 73, 373–376.
Hernandez, T. D., and Holling, L. C. (1994). Disruption of
References behavioral recovery by the anti-convulsant phenobarbital.
Brain Research, 635, 300–306.
Albert, M. L., Bachman, D. L., Morgan, A., and Helm- Huber, W., Willmes, K., Poeck, K., Van Vleymen, B., and
Estabrooks, N. (1988). Pharmacotherapy for aphasia. Neu- Deberdt, W. (1997). Piracetam as an adjuvant to language
rology, 38, 877–879. therapy for aphasia: A randomized double-blind placebo-
Boyeson, M. G., and Harmon, R. L. (1993). E¤ects of trazo- controlled pilot study. Archives of Physical Medicine and
done and desipramine on motor recovery in brain-injured Rehabilitation, 78, 245–250.
rats. American Journal of Physical Medicine and Rehabili- Johansson, B. B. (2000). Brain plasticity and stroke rehabilita-
tation, 72, 286–293. tion. The Willis Lecture. Stroke, 31, 223–230.
Boyeson, M. G., Harmon, R. L., and Jones, J. L. (1994). Lazarus, L. W., Winemiller, D. R., Lingam, V. R., Neyman,
Comparative e¤ects of fluoxetine, amitriptyline and seroto- I., Hartman, C., Abassian, M., et al. (1992). E‰cacy and
nin on functional motor recovery after sensorimotor cortex side e¤ects of methylphenidate for poststroke depression
injury. American Journal of Physical Medicine and Rehabil- [see comments]. Journal of Clinical Psychiatry, 53, 447–449.
itation, 73, 76–83. Macklis, J. D. (1993). Transplanted neocortical neurons mi-
Clark, A. N. G., and Mankikar, G. D. (1979). d-Amphetamine grate selectively into regions of neuronal degeneration pro-
in elderly patients refractory to rehabilitation procedures. duced by chromophore-targeted laser photolysis. Journal of
Journal of the American Geriatrics Society, 27, 174–177. Neuroscience, 13, 3848–3863.
Crisostomo, E. A., Duncan, P. W., Propst, M., Dawson, D. V., MacLennan, D. L., Nicholas, L. E., Morley, G. K., and
and Davis, J. N. (1988). Evidence that amphetamine with Brookshire, R. H. (1991). The e¤ects of bromocriptine on
physical therapy promotes recovery of motor function in speech and language function in a man with transcortical
stroke patients. Annals of Neurology, 23, 94–97. motor aphasia. Clinical Aphasiology, 21, 145–155.
Darley, F. L., Keith, R. L., and Sasanuma, S. (1977). The Morris, P. L., Raphael, B., and Robinson, R. G. (1992). Clin-
e¤ect of alerting and tranquilizing drugs upon the per- ical depression is associated with impaired recovery from
formance of aphasic patients. Clinical Aphasiology, 7, 91– stroke. Medical Journal of Australia, 157, 239–242.
96. Olson, L., Backman, L., Ebendal, T., Eriksdotter-Jonhagen,
De Deyn, P. P., Reuck, J. D., Deberdt, W., Vlietinck, R., and M., Ho¤er, B., Humpel, C., et al. (1994). Role of growth
Orgogozo, J. M. (1997). Treatment of acute ischemic stroke factors in degeneration and regeneration in the central
nervous system: Clinical experiences with NGF in Parkin- diate family. The value dimensions in our lives, such
son’s and Alzheimer’s diseases. Journal of Neurology, 242 as health, sexuality, career, creativity, marriage, intelli-
(Suppl. 1), S12–S15. gence, money, and family relations, contribute to quality
Porch, B., Wyckes, J., and Feeney, D. M. (1985). Haloperidol, of life and are markedly a¤ected for the aphasic person
thiazides, and some antihypertensives slow recovery from
(Hinckley, 1998) and that person’s relatives. All may
aphasia. Paper presented at the Annual Meeting of the So-
ciety for Neuroscience. expect considerable disruption of professional, social,
Sabe, L., Leiguarda, R., and Starkstein, S. E. (1992). An open- and family life, reduced social contact, depression, lone-
label trial of bromocriptine in nonfluent aphasia. Neurol- liness, frustration, and aggression (Herrmann and Wal-
ogy, 42, 1637–1638. lesch, 1989).
Schallert, T., Jones, T., Weaver, M., Shapiro, L., Crippens, D., Recovery and response to rehabilitation in aphasia
and Fulton, R. (1992). Pharmacologic and anatomic con- are also significantly influenced by emotional and psy-
siderations in recovery of function. Physical Medicine and chosocial factors. The aphasic person’s family and
Rehabilitation, 6, 375–393. other caregivers need to be involved as much as possible
Small, S. L. (1994). Pharmacotherapy of aphasia: A critical in intervention, and this involvement extends beyond
review. Stroke, 25, 1282–1289.
discharge. The experienced disability rather than the
Small, S. L. (2000). The future of aphasia treatment. Brain and
Language, 71, 227–232. impairment itself is the focus of rehabilitation. Rehabili-
Small, S. L. (2001). Biological approaches to the treatment of tation increasingly includes community-based work and
aphasia. In A. Hillis (Ed.), Handbook on adult language support from not-for-profit organizations and self-help
disorders: Integrating cognitive neuropsychology, neurology, groups.
and rehabilitation (in press). Philadelphia: Psychology Whereas intervention during the acute stages of
Press. aphasia is largely based on the medical model, adjust-
Snyder, E. Y., Yoon, C., Flax, J. D., and Macklis, J. D. (1997). ment to aphasia is set more broadly within a social
Multipotent neural precursors can di¤erentiate toward approach. Several broad psychosocial and quality-of-life
replacement of neurons undergoing targeted apoptotic areas have been incorporated into rehabilitation: dealing
degeneration in adult mouse neocortex. Proeedings of Na-
with depression and other emotions, social reintegration,
tional Academy of Sciences of the United States of America,
94, 11663–11668. and the development of autonomy and self-worth. Au-
Vernon, M. W., and Sorkin, E. M. (1991). Piracetam. An tonomy involves cooperating with others to achieve
overview of its pharmacological properties and a review of ends, whereas independence implies acting alone and
its therapeutic use in senile cognitive disorders. Drugs and may not be an achievable goal. These areas provide a
Aging, 1, 17–35. basis for developing broad-ranging programs.
Walker-Batson, D. (2000). Use of pharmacotherapy in the
treatment of aphasia. Brain and Language, 71, 252–254.
Walker-Batson, D., Devous, M. D., Curtis, S., Unwin, D. H., Emotion
and Greenlee, R. G. (1991). Response to amphetamine to We need to distinguish the direct e¤ects of damage on
facilitate recovery from aphasia subsequent to stroke. Clin-
the neurophysical substrate of emotion and the indirect
ical Aphasiology, 21, 137–143.
Walker-Batson, D., Smith, P., Curtis, S., Unwin, H., and e¤ects, which are natural reactions to catastrophic per-
Greenlee, R. (1995). Amphetamine paired with physical sonal circumstances (Code, Hemsley, and Herrmann,
therapy accelerates motor recovery after stroke. Further 1999). Our understanding of these factors is improving.
evidence. Stroke, 26, 2254–2259. Three di¤erent forms of depression can follow damage:
Zhang, W. R., Kitagawa, H., Hayashi, T., Sasaki, C., Sakai, catastrophic reaction, major post-stroke depression, and
K., Warita, H., et al. (1999). Topical application of minor post-stroke depression. There is little research
neurotrophin-3 attenuates ischemic brain injury after tran- separating reactive from direct e¤ects but Herrmann,
sient middle cerebral artery occlusion in rats. Brain Re- Bartels, and Wallesch (1993) found significantly higher
search, 842, 211–214. ratings for physical signs of depression, generally con-
sidered direct e¤ects, during the acute stage.
Some view depression accompanying aphasia within
Aphasia Treatment: Psychosocial Issues the framework of the grief model (Tanner and Ger-
stenberger, 1988). In this view, individuals grieve for the
loss of communication, moving through the stages of
Although researchers are developing a useful under- denial, anger, bargaining, depression, and acceptance.
standing of aphasia as a neurocognitive condition, the Whether people do in fact go through these stages has
world we experience is a social and interactive one. The not been investigated, but it has served as a framework
social aspects of life contribute to quality of life, al- for counseling. Determining denial, bargaining, accep-
though quality of life has been di‰cult to characterize tance, and so on is problematic but has been investigated
scientifically and in ways that have clinical utility (Hilari interpretively with personal construct therapy techniques
et al., in press). by Brumfitt (1985), who argues that aphasia a¤ects a
Psychosocial refers to the social context of emotional person’s core role constructs, with grief for the essential
experience. Most emotions are closely associated with element of self as a speaker.
social interactions. Aphasia has implications for the A further problem is that the symptoms of depression,
individual’s whole social network, especially the imme- such as changes in sleep and eating, restlessness, and
Aphasia Treatment: Psychosocial Issues 261
crying, can also be caused by physical illness, anxiety, gration, providing a valuable resource and helping to
hospitalization, and factors unrelated to mood state. establish, facilitate, and maintain groups.
Language impairment plays a special role in the problem The e‰cacy of self-help groups in which the aphasic
of identifying and measuring mood, and one approach members decide on the group’s purpose, take responsi-
has been to use the nonverbal Visual Analogue Mood bility for running the group, and serve as o‰cers of the
Scale (VAMS), substituting schematic faces for words group (e.g., chair, secretary) is being evaluated, particu-
(Stern, 1999). larly in relation to the development of independence and
Åström et al. (1993) found major depression in 25% autonomy. Structured support groups are of benefit. The
of stroke survivors, which rose to 31% at 3 months post self-help groups in the United Kingdom attract mainly
onset, fell to 16% at 12 months, and increased again over younger and less severely impaired individuals and use
the next 2 years to 29%. Some workers take the position little in the way of statutory resources (Code et al.,
that drugs should be avoided, but others suggest incor- 2001).
porating drugs and psychotherapy, with drugs perhaps Wahrborg et al. (1997) have reported benefits from
being more appropriate at early stages to counter direct integrating aphasic people into educational programs
e¤ects and psychotherapy and counseling more appro- and organizations, and Elman (1998) has introduced
priate later, when the individual is more ready to deal adult education instructors into an aphasia center.
with the future. Individual and group counseling are Work and other purposeful activity is an important
e¤ective, and aphasic people themselves have recently value dimension central in the development and mainte-
become involved as counselors. nance of self-worth and autonomy. Returning to work
remains a constant concern of many aphasic people.
Social Reintegration and Self-Esteem Ramsing, Blomstrand, and Sullivan (1991) explored
prognostic factors for return to work, but there has been
Self-esteem and self-worth are complex constructs, tied little follow-up to this research. Parr et al. (1997) report
to social activity, that workers suggest should be central that only one person in their study who was working at
to psychosocial rehabilitation. The importance of self the time of the stroke returned to the same employment.
has been examined by Brumfitt (1993) using personal A few found part-time work, and the rest became un-
construct techniques. Facilitating participation in the employed or retired. Garcia, Barrette, and Laroche
community entails passing responsibility to the individ- (2000) studied perceived barriers to work and found that
ual gradually so that the individual can develop auton- therapists focused on personal and social barriers, em-
omy, develop greater self-esteem, and take greater ployers focused on organizational ones, and aphasic
ownership of the issues that they face. The importance people focused on barriers of all types. The groups also
of involving the aphasic individual fully has been suggested strategies for reducing barriers to work.
addressed by Parr et al. (1997). Family therapy, to include people close to the aphasic
Hersh (1998) argues that particularly at discharge person, is generally beneficial and can lead to positive
from formal therapy, an account of the ongoing changes (Nichols, Varchevker, and Pring, 1996).
management of psychosocial adjustment is needed.
Simmons-Mackie (1998) argues that the traditional con-
cept of a plateau being reached, where linguistic progress
Conclusions
slows down for many, is not relevant when considering A psychosocial approach to improving quality of life
the social consequences of aphasia. The aim should be to aims to aid social reintegration in such a way that the
prepare and assist clients to integrate into a community. individual is able to maintain identity, develop self-
Social a‰liation is stressed as a means of maintaining esteem and purpose, and become socially rea‰liated.
and developing self-identity. Significant others, professionals, and volunteers are in-
Not-for-profit organizations and centers such as the volved. This approach o¤ers a challenge to clinicians,
Pat Orato Aphasia Center in Ontario and the Aphasia as it extends their role and requires a more comprehen-
Center in Oakland, California, provide community- sive approach to management. There remains a lack of
based programs for aphasic people and their families, evidence-based approaches to managing psychosocial
including the training of relatives and professionals as adjustment, but it is clear that volunteers, organizations,
better conversation partners. Through such programs, charities, and community-based centers are contributing.
the psychosocial well-being of both aphasic participants The challenge facing clinical aphasiology is to evaluate
and their families is improved (Hoen, Thelander, and the benefits of psychosocial support for aphasic people
Worsley, 1997). Training volunteers as communication and its impact on quality of life.
partners results in gains in psychological well-being and
communication among aphasic participants, caregivers,
and the communication partners themselves. In phi- Acknowledgments
losophy and approach, these centers resemble United The first author was a Fellow at the Hanse Institute of
Kingdom charities such as Speakability and Connect. Advanced Study, Delmenhorst, Germany, when this
Organizations like these are increasingly o¤ering more chapter was completed.
long-term and psychosocially oriented programs. Many
use volunteers, who figure increasingly in social reinte- —Chris Code and Dave J. Muller
References aphasic participants in a 34-week course at a Folk High

School. Aphasiology, 11, 709–715.
Åström, M., Adolfsson, R., and Asplund, K. (1993). Major
depression in stroke patients: A 3-year longitudinal study. Further Readings
Stroke, 24, 976–982.
Brumfitt, S. (1985). The use of repertory grids with aphasic Borenstein, P., Linell, S., and Wahrborg, P. (1987). An inno-
people. In N. Beail (Ed.), Repertory grid techniques and vative therapeutic programme for aphasic patients and their
personal constructs. London: Croom Helm. relatives. Scandinavian Journal of Rehabilitation Medicine,
Brumfitt, S. (1993). Losing your sense of self: What aphasia 19, 51–56.
can do. Aphasiology, 7, 569–574. Carnworth, T. C. M., and Johnson, D. A. W. (1987). Psychi-
Code, C., Hemsley, G., and Herrmann, M. (1999). The emo- atric morbidity among spouses of patients with stroke.
tional impact of aphasia. Seminars in Speech and Language, British Medical Journal, 294, 409–411.
20, 19–31. Christensen, A. (1997). Communication in relation to self-
Code, C., Eales, C., Pearl, G., Conan, M., Cowin, K., and esteem. Aphasiology, 11, 727–734.
Hickin, J. (2001). Profiling the membership of self-help Code, C. (Ed.). (1996). The impact of neurogenic communica-
groups for aphasic people. International Journal of Lan- tion disorders: Beyond the impairment. Disability and Reha-
guage and Communication Disorders, Supplement, 36, 41– bilitation, 18. [Special issue]
45. Code, C. (Ed.). (1999). Management of psychosocial issues in
Elman, R. (1998). Memories of the ‘‘plateau’’: Health care aphasia. Seminars in Speech and Language, 20. [Special
changes provide an opportunity to redefine aphasia treat- issue]
ment and discharge. Aphasiology, 12, 227–231. Elman, R., and Bernstein-Ellis, E. (1999). Psychosocial aspects
Garcia, L. J., Barrette, J., and Laroche, C. (2000). Perceptions of group communication treatment: Preliminary findings.
of the obstacles to work reintegration for persons with Seminars in Speech and Language, 20, 65–72.
aphasia. Aphasiology, 14, 269–290. Gainotti, G. (Ed.). (1997). Emotional, psychological and psy-
Herrmann, M., Bartels, C., and Wallesch, C. W. (1993). chosocial problems of aphasic patients. Aphasiology, 11.
Depression in acute and chronic aphasia: Symptoms, [Special issue]
pathoanatomo-clinical correlations, and functional implica- Lapointe, L. L. (1999). Quality of life in aphasia. Seminars in
tions. Journal of Neurology, Neurosurgery, and Psychiatry, Speech and Language, 20, 5–17.
56, 672–678. Müller, D. J. (1992). Psychosocial aspects of aphasia. In
Herrmann, M., and Wallesch, C.-W. (1989). Psychosocial G. Blanken, J. Dittmann, H. Grimm, J. C. Marshall, and
changes and adjustment with chronic and severe nonfluent C.-W. Wallesch (Eds.), Linguistic disorders and pathologies:
aphasia. Aphasiology, 3, 513–526. An international handbook. Berlin: Walter de Gruyter.
Hersh, D. (1998). Beyond the ‘‘plateau’’: Discharge dilemmas Parr, S. (2001). Psychosocial aspects of aphasia: Whose per-
in chronic aphasia. Aphasiology, 12, 207–218. spectives? Folia Phoniatrica et Logopaedica, 53, 266–288.
Hilari, K., Wiggins, R. D., Roy, P., Byng, S., and Smith, S. C. Parr, S., Pound, C., Syng, S., and Long, B. (1999). The aphasia
(in press). Predictors of health-related quality of life handbook. Woodhouse Eves, U.K.: Ecodistribution.
(HRQOL) in people with chronic aphasia. Aphasiology. Rice, B., Paull, A., and Müller, D. J. (1987). An evaluation of
Hinckley, J. J. (1998). Investigating the predictors of lifestyle a social support group for spouses and aphasic adults.
satisfaction among younger adults with chronic aphasia. Aphasiology, 1, 247–256.
Aphasiology, 12, 509–518. Sarno, M. T. (Ed.). (1995). Aphasia recovery: Family-consumer
Hoen, B., Thelander, M., and Worsley, J. (1997). Improvement issues. Topics in Stroke Rehabilitation, 2. [Special issue]
in psychosocial well-being of people with aphasia and their Sarno, M. T. (1997). Quality of life in aphasia in the first post-
families: Evaluation of a community-based programme. stroke year. Aphasiology, 11, 665–679.
Aphasiology, 11, 681–691. Wahrborg, P. (1991). Assessment and management of emotional
Kagan, A. (1998). Supported conversation for adults with and psychosocial reactions to brain damage and aphasia.
aphasia: Methods and resources for training conversation London: Whurr.
partners. Aphasiology, 9, 816–831.
Nichols, F., Varchevker, A., and Pring, T. (1996). Working
with people with aphasia and their families: An exploration
of the use of family therapy techniques. Aphasiology, 10,
767–781.
Aphasic Syndromes: Connectionist
Parr, S., Byng, S., Gilpin, S., and Ireland, C. (1997). Talk- Models
ing about aphasia. Buckingham, U.K.: Open University
Press.
Ramsing, S., Blomstrand, C., and Sullivan, M. (1991). Prog- Connectionist models of aphasic syndromes first
nostic factors for return to work in stroke patients with emerged in the late nineteenth century. Broca (1861)
aphasia. Aphasiology, 5, 583–588. described a patient, Leborgne, whose speech was limited
Simmons-Mackie, N. (1998). A solution to the discharge di- to the monosyllable tan but whose ability to understand
lemma in aphasia: Social approaches to aphasia manage- spoken language and nonverbal cues and ability to ex-
ment. Aphasiology, 12, 231–239. press himself through gestures and facial expressions
Stern, R. (1999). Assessment of mood states in aphasia. Semi-
nars in Speech and Language, 20, 33–50.
were normal. Leborgne’s brain contained a lesion whose
Tanner, D. C., and Gerstenberger, D. L. (1988). The grief center was in the posterior portion of the inferior fron-
response in neuropathologies of speech and language. tal convolution of the left hemisphere, now known as
Aphasiology, 2, 79–84. Broca’s area. Broca claimed that Leborgne had lost
Wahrborg, P., Borenstein, P., Linell, S., Hedberg-Borenstein, ‘‘the faculty of articulate speech’’ and that this brain
E., and Asking, A. (1997). Ten year follow-up of young region was the neural site of the mechanism involved in
Aphasic Syndromes: Connectionist Models 263
speech production. In 1874, Karl Wernicke described a anatomical grounds (Marie, 1906; Moutier, 1908), dis-
patient with a speech disturbance that was very di¤er- missed as simplifications of reality that were of help only
ent from that seen in Leborgne. Wernicke’s patient was to schoolboys (Head, 1926), and ignored in favor of
fluent, but his speech contained words with sound di¤erent approaches to language (Jackson, 1878; Gold-
errors, other errors of word forms, and words that were stein, 1948). Nonetheless, they endured. Benson and
semantically inappropriate. Unlike Leborgne, Wernicke’s Geschwind (1971) reviewed the major approaches to
patient did not understand spoken language. Wernicke aphasia as they saw them and concluded that all
related the two impairments—one of speech production researchers recognized the same basic patterns of aphasic
and one of comprehension—by arguing that the patient impairments, despite using di¤erent nomenclature.
had sustained damage to ‘‘the storehouse of auditory Three more syndromes have been added by theorists
word forms,’’ leading to speech containing the types of such as Benson (1979), and Lichtheim’s model has been
errors that were seen and impaired comprehension. By rounded out with specific hypotheses about the neuro-
extrapolation from a similar case he had not personally anatomical bases for several functions that he could only
examined, Wernicke concluded that the patient’s lesion guess at.
was in the posterior portion of the left superior temporal Additional neuroanatomical foundation was first
gyrus, now known as Wernicke’s area, and that this re- suggested in a very influential paper by Geschwind
gion was the locus of the ‘‘storehouse of auditory word (1965). Geschwind argued that the inferior parietal lobe
forms.’’ Wernicke argued that, in speaking, word sounds was a tertiary association cortical area that received
were conveyed from Wernicke’s area to Broca’s area, projections from the association cortex immediately ad-
where the motor programs for speech were developed. jacent to the primary visual, auditory, and somesthetic
This connection gave this type of model its name. cortices in the occipital, temporal, and parietal lobes.
Lichtheim (1885) deveoped a more general model of Because of these anatomical connections, the inferior
this type. Lichtheim recognized seven syndromes, listed parietal lobe served as a cross-modal association region,
in Table 1. Lichtheim argued that these syndromes fol- associating word sounds with the sensory qualities of
lowed lesions in the regions of the brain depicted in objects. This underlay word meaning, in Geschwind’s
Figure 1. These syndromes were criticized on neuro- view. Damasio and Tranel (1993) extended this model to
Table 1. Aphasic Syndromes Described by Lichtheim (1885)

Syndrome Clinical Manifestations Hypothetical Deficit Classical Lesion Location
Broca’s aphasia Major disturbance in speech Disturbances in the speech Posterior aspects of the 3rd
production with sparse, halting planning and production frontal convolution (Broca’s
speech, often misarticulated, mechanisms area)
frequently missing function
words and bound morphemes
Wernicke’s aphasia Major disturbance in auditory Disturbances in the permanent Posterior half of the first
comprehension; fluent speech representations of the sound temporal gyrus and possibly
with disturbances of the sounds structures of words adjacent cortex (Wernicke’s
and structures of words area)
(phonemic, morphological, and
semantic paraphasias); poor
repetition and naming
Pure motor speech Disturbance of articulation; Disturbance of articulatory Outflow tracts from motor cortex
disorder apraxia of speech, dysarthria, mechanisms
anarthria, aphemia
Pure word deafness Disturbance of spoken word Failure to access spoken words Input tracts from auditory
comprehension, repetition often system to Wernicke’s area
impaired
Transcortical motor Disturbance of spontaneous Disconnection between White matter tracts deep to
aphasia speech similar to Broca’s conceptual representations of Broca’s area connecting it to
aphasia with relatively words and sentences and the parietal lobe
preserved repetition; motor speech production
comprehension relatively system
preserved
Transcortical sensory Disturbance in single-word Disturbance in activation of word White matter tracts connecting
aphasia comprehension with relatively meanings despite normal parietal lobe to temporal lobe
intact repetition recognition of auditorily or portions of inferior parietal
presented words lobe
Conduction aphasia Disturbance of repetition and Disconnection between the sound Lesion in the arcuate fasciculus
spontaneous speech (phonemic patterns of words and the and/or cortico-cortical
paraphasias) speech production mechanism connections between
Wernicke’s and Broca’s areas
syndromes are incomplete and unsystematic. For in-

stance, the speech production problem seen in Broca’s
aphasia can consist of one or more of a large number
of impairments: dysprosodic speech, poorly articulated
speech, agrammatism, an unusual number of short
phrases. If all we know about a patient is that she or he
has Broca’s aphasia, we cannot tell which of these prob-
lems (or other) that person has.
A second problem is that identical deficits occur in
di¤erent syndromes. For instance, certain types of nam-
ing problems can occur in any aphasic syndrome (Ben-
son, 1979). Because of this, most applications of the
clinical taxonomy result in widespread disagreements as
to a patient’s classification (Holland, Fromm, and
Swindell, 1986) and to a large number of ‘‘mixed’’ or
‘‘unclassifiable’’ cases (Lecours, Lhermitte, and Bryans,
1983). The criteria for inclusion in a syndrome are often
somewhat arbitrary: How bad does a patient’s compre-
hension have to be for the patient to be identified as
having Wernicke’s aphasia instead of conduction apha-
sia, or global aphasia instead of Broca’s aphasia? There
have been many e¤orts to answer this question (see, e.g.,
Figure 1. The classical connectionist model (modified from
Lichtheim, 1885). W indicates Wernicke’s area, the site of long-
Goodglass and Kaplan, 1972, 1982; Kertesz, 1979), but
term storage of word sounds. B indicates Broca’s area, the site none is satisfactory.
for speech planning. C represents the concept center, which A third problem with the classical aphasic syndromes
Wernicke thought was di¤usely located in parietal lobe. Infor- is that they are not as well correlated with lesion sites as
mation flows along the pathways indicated by lines. The pres- the theory claims they should be. These syndromes are
ence of these pathways (‘‘connections’’) gives this type of related to lesion sites reasonably well only in cases of
model its name. rapidly developing lesions, such as stroke. Even in these
types of lesions, the syndromes are never applied to
acute and subacute phases of the illness. Even in the
actions, arguing that associations between word sounds chronic phase of diseases such as stroke, at least 15% of
and memories of actions were created in the association patients have lesions that are not predictable from their
cortex in the inferior frontal lobe. Geschwind (1965) and syndromes (Basso et al., 1985), and some researchers
Damasio and Damasio (1980) also argued that the ana- think this figure is much higher—as much as 40% or
tomical link between Wernicke’s area and Broca’s area more, depending on what counts as an exception to the
(in which a lesion caused conduction aphasia) was the rule (de Bleser, 1988). We now know that the relation-
white matter tract known as the arcuate fasciculus. ship between lesion location and syndrome is more
The three and a half decades that have passed since complex than we had thought, even in cases in which the
publication of Geschwind’s paper have brought new classical localization captures part of the picture. Broca’s
evidence for these syndromes and their relationships aphasia, for instance, does not usually occur in the
to brain lesions. Aphasic syndromes have been related chronic state after lesions restricted to Broca’s area but
to the brain using a series of neuroimaging techniques, requires much larger lesions (Mohr et al., 1978). Some
first radionuclide scintigraphy with technetium99 , then theorists have argued that the localizing value of the
computed tomography, magnetic resonance imaging, classical syndromes reflects the co-occurrence of variable
and positron emission tomography. All have confirmed combinations of language processing deficits with motor
the relationship of the major syndromes to lesion loca- impairments that a¤ect the fluency of speech (Caplan,
tions. These aphasic syndromes and their relationships 1987; McNeil and Kent, 1991). From this point of view,
to the brain figure prominently in recent reviews of the localizing value of the classical syndromes is due to
aphasia in leading medical journals (e.g., Damasio, the invariant location of the motor system.
1992). Finally, the classical syndromes o¤er very limited
Despite this revival, the connectionist approach to help to the clinician planning therapy, because the syn-
aphasic syndromes is under renewed attack. dromes give insu‰cient information about what is
A major limitation of the classical syndromes is that wrong with a patient. For example, knowing that a pa-
they stay at arm’s length from the linguistic details of tient has Broca’s aphasia does not tell the therapist what
language impairments. The classical aphasic syndromes aspects of speech need remediation—articulation of
basically reflect the relative ability of patients to perform sound segments, prosody, production of grammatical
entire language tasks (speaking, comprehension, etc.), elements, formulation of syntactic structures, and so on.
not the integrity of specific operations within the lan- Nor does it guarantee that the patient does not need
guage processing system. Linguistic descriptions in these therapy for a comprehension problem; it only implies
Aphasiology, Comparative 265
that any comprehension problem is mild relative to the Goldstein, K. (1948). Language and language disturbances.
problems of other aphasics or to the patient’s speech New York: Grune and Stratton.
problem. Finally, it does not guarantee that the patient Goodglass, H., and Kaplan, E. (1972). The assessment of
does not have other problems, such as anomia, di‰culty aphasia and related disorders. Philadelphia: Lea and
Febiger.
reading, and the like. In practice, most clinicians do not
Goodglass, H., and Kaplan, E. (1982). The assessment of
believe that they have adequately described a patient’s aphasia and related disorders (2nd ed.). Philadelphia: Lea
language problems when they have identified that pa- and Febiger.
tient as having one of the classic aphasic syndromes. Head, H. (1926). Aphasia and kindred disorders of speech. New
Rather, they specify the nature of the disturbance found York: Macmillan.
in the patient within each language-related task; for ex- Holland, A. L., Fromm, D., and Swindell, C. S. (1986). The
ample, they indicate that a patient with Broca’s aphasia labeling problem in aphasia: An illustrative case. Journal of
is agrammatic, has a mild anomia, and so on. Detailed Speech and Hearing Disorders, 51, 176–180.
psycholinguistic and linguistic descriptions of aphasic Jackson, H. H. (1878). On a¤ections of speech from disease of
impairments are slowly replacing the disconnection the brain. Reprinted in J. Taylor (Ed.), Selected writings of
John Hughlings Jackson. New York: Basic Books, 1958.
approach to syndromes.
Kertesz, A. (1979). Aphasia and associated disorders: Taxon-
It is a feature of the history of science, and some think omy, localization and recovery. New York: Grune and
a tenet of the philosophy of science, that people do Stratton.
not abandon a theory because it has inadequacies. Lecours, A. R., Lhermitte, F., and Bryans, B. (1983). Aphasi-
Some philosophers of science think that no theory is ever ology. Paris: Baillard.
proven wrong. According to this view, theories are Lichtheim, L. (1885). On aphasia. Brain, 7, 433–484.
abandoned because people get tired of them, and people Marie, P. (1906). Revision de la question de l’aphasie: La troi-
get tired of theories because they have others that they sieme circonvolution frontale gauche ne joue aucun role
think are better. This perspective on science applies to special dans la fonction du langage. Semaine Medicale, 26,
the connectionist approach to aphasic syndromes. The 241–247.
McNeil, M. R., and Kent, R. D. (1991). Motoric character-
classic syndromes have not been abandoned, but their
istics of adult aphasic and apraxic speakers. In G. R.
acceptance is waning, and there are new developments Hammond (Ed.), Advances in psychology: Cerebral control
that address some of their inadequacies. of speech and limb movements (pp. 317–354). New York:
Elsevier/North Holland.
Acknowledgment Mohr, J. P., Pessin, M. S., Finkelstein, S., Funkenstein, H.,
Duncan, G. W., and Davis, K. R. (1978). Broca aphasia:
Work was supported in part by a grant from NIDCD
Pathologic and clinical. Neurology, 28, 311–324.
(DC00942). Moutier, F. (1908). L’Aphasie de Broca. Paris: Steinheil.
—David Caplan Wernicke, C. (1874). Der Aphasische Symptomenkomplex.
Breslau: Cohn and Weigart. Reprinted in translation in
References Boston Studies in Philosophy of Science, 4, 34–97.
Basso, A., Lecours, A. R., et al. (1985). Anatomoclinical cor-

relations of the aphasias as defined through computerized
tomography: Exceptions. Brain and Language, 26, 201–229. Aphasiology, Comparative
Benson, D. F. (1979). Aphasia, alexia and agraphia. London,
Churchill Livingstone.
Benson, D. F., and Geschwind, N. (1971). Aphasia and related Comparative aphasiology is the systematic comparison
cortical disturbances. In A. B. Baker and L. H. Baker of aphasia symptoms across languages, including the
(Eds.), Clinical neurology. New York: Harper and Row. comparison of acquired reading problems across lan-
Broca, P. (1861). Remarques sur le siege de la faculte de la guages and writing systems. The goal of study is to
parole articulee, suives d’une observation d’aphemie (perte
ensure that theories that claim to account for the var-
de parole). Bulletin de la Societe d’Anatomie, 36, 330–357.
Caplan, D. (1987). Discrimination of normal and aphasic sub- ious constellations of aphasic symptoms can handle
jects on a test of syntactic comprehension. Neuropsy- the similarities and di¤erences seen in aphasic speakers
chologia, 25, 173–184. of languages of di¤erent types (see Menn, 2001, for a
Damasio, A. R. (1992). Aphasia. New England Journal of discussion of language typology in the context of
Medicine, 326, 531–539. comparative aphasiology). Serious experimental and
Damasio, A., and Tranel, D. (1993). Nouns and verbs are clinical comparative work began in the 1980s; however,
retrieved with di¤erently distributed neural systems. Pro- researchers long before that time understood that com-
ceedings of the National Academy of Science of the United parative data are essential. Otherwise, general theories of
States of America, 90, 4957–4960. aphasia would depend on data from the few, closely re-
Damasio, H., and Damasio, A. R. (1980). The anatomical
lated languages of the countries in Europe and North
basis of conduction aphasia. Brain, 103, 337–350.
de Bleser, R. (1988). Localization of aphasia: Science or fic- America where research in neurolinguistics was being
tion? In D. Denes, C. Semenza, and P. Bisiacchi (Eds.), undertaken: English, French, German, and, for a time,
Perspectives on cognitive neurology. Hove, U.K.: Erlbaum. Russian.
Geschwind, N. (1965). Disconnection syndromes in animals The clearest example of such a premature class of
and man. Brain, 88, 237–294, 585–644. theories was the ‘‘least pronunciation e¤ort’’ approach
to agrammatism. This approach focused on the preva- deployed using di¤erent mechanisms? (Jaeger et al.,
lence of bare-stem forms in agrammatic output in En- 1996; Pinker, 1999).
glish, French, and German and suggested that such
forms were used to avoid extra articulatory e¤ort. A
Recent History
more phonologically sophisticated theory (Kean, 1979),
relying on the same small database, proposed that By 1980, comparative language acquisition studies (e.g.,
agrammatic speakers were constrained to produce only the work published in Slobin, 1985–1995) were well
the minimum phonological word. However, the lan- established and provided both intellectual and logis-
guages then available for study have bare-stem forms tical models for comparative aphasiology. Bellugi’s team
that are not only short but also grammatically unmarked at the Salk Institute began to compare aphasic syn-
(mostly first- or third-person singular present tense of dromes in hearing/speaking and deaf/signing individ-
verbs and the singular of nouns) and very frequent. uals (Bellugi, Poizner, and Klima, 1989), and an
Therefore, the data could also support a markedness- international group coordinated by Bates at the Uni-
based or a frequency-based account. Or the underlying versity of California–San Diego began using psycho-
problem could be morphological or morphosyntactic in- linguistic techniques to do cross-linguistic studies of
stead of articulatory or phonological. English and Italian, eventually expanding to Chinese,
A reason to think that the problem is actually mor- Russian, Spanish, and other languages. At the Aphasia
phological (problems with retrieving inflections) or Research Center of the Boston University School of
morphosyntactic (problems with computing which Medicine, a team focused on morphosyntax in agram-
inflections the syntax demands) is that endings of par- matic narratives (Cross Language Aphasia Study I,
ticiples and infinitive forms are better preserved than Menn and Obler, 1988, 1990a) created a standard elic-
other verb forms in English, Italian, French, and Ger- itation protocol and began to collect data from speakers
man. Is this because they have no tense marking or no with agrammatic aphasia and matched controls. Data
person marking? Or is there another reason? were collected on 14 languages: the non-Indo-European
To test the hypothesis that a problem with inflections languages Mandarin Chinese, Finnish, Hebrew, and
is a specific type of problem with morphosyntax, we look Japanese and the Indo-European languages Dutch, En-
for languages where inflections are controlled in di¤erent glish, French, German, Hindi, Icelandic, Polish, Serbo-
ways. For example, if problems with verbs are blamed Croatian, and Swedish. Many of these languages (plus
on di‰culty with person or number agreement, we Hungarian and Turkish) are also represented in spe-
should see whether verb problems also exist in languages cial issues of Brain and Language (1991, 41:2) and
without agreement, such as Japanese. Or the problem Aphasiology (1996, 10:6). Michel Paradis has led inter-
may be deeper than morphosyntax: the verb problem national work on bilingual aphasia, including the devel-
could be due to a semantic di¤erence between nominal opment of the extensive set of Bilingual Aphasia Tests;
and verbal types of elements, as suggested by experi- Paradis (2001), which was also published as a special
mental work on Chinese, which has no agreement or in- issue of Journal of Neurolinguistics (14:2–4), includes
flection (Bates, Chen, et al., 1991; Chen and Bates, contributions on non-Indo-European Basque and Hun-
1998). Or perhaps multiple factors interact—a more dif- garian and Indo-European Afrikaans, Catalan, Czech,
ficult claim to test. Farsi (Persian), Friulian, Greek, and Spanish, as well as
Another type of problem that demands a comparative material on African-American English and more data on
approach is the question of why there are so few adjec- Finnish, Polish, Hebrew, and Swedish.
tives in aphasic language. From, say, a German-centered
point of view, we might ask, Is this a conceptual prob- Methods of Study
lem, an agreement problem (nonexistent in English, so
that could not be the sole problem), or a problem in Comparative studies raise special methodological issues
inserting elements between article and noun (in which because of the need to ensure that all materials pose
case, Romance languages, where most NPs have article- comparable levels of di‰culty across languages and
noun-adjective order, should not show the e¤ect)? cultures. Drawings acceptable in one country may be
Issue after issue requiring a comparative approach anomalous in another; words that are (apparently)
can be listed in the same way. Would relative clauses translation equivalents may not be comparably frequent,
that do not require movement (as in Chinese or Japa- and so on (Menn et al., 1996). Several chapters in Para-
nese) be deployed better than ones that do? Is the dis (2001) point out the importance of allowing for the
observed problem with the passive voice in English to be e¤ects of bilingualism and multiple dialect use, which are
explained in terms of movement rules and traces, in present in most of the world’s population. The presenta-
terms of its morphological complexity, in terms of its tion of comparative production data requires an elabo-
low frequency, or in terms of its pragmatic unnatural- rated interlinear translation format, so that any reader,
ness in a single-sentence test paradigm? Are irregular familiar with the language or not, can see what the sub-
verbs preserved better than regular ones because of their ject actually said, what he or she should have said, and
generally greater frequency or because, as some theorists what the errors were. A widely used version is derived
claim, they are stored in di¤erent places in the brain or from the interlinear morphemic translation style codified
Aphasiology, Comparative 267
by Lehmann (1982). The example below is taken from etc.). (Stimuli that convey this complex syntactic struc-
Farsi data (Nilipour, 2000). ture without varying the word order cannot be con-
structed in most European languages.)
1. sar[-am] While these results showed that agrammatic aphasics
2. man [be] dast [va] sar xombâre [xor-d] could use morphological information, Bates’s group has
3. I [to] hand [and] head[-my] shrapnel [hit:PAST:3SG] shown that morphological cues are also the ones most
4. I hand, head shrapnel [hit]. likely to be underutilized by speakers with all forms of
5. pro [postposition] Noun [conjunction] Noun [possessive aphasia, as well as by control subjects who are loaded
clitic] Noun [Verb] with competing experimental task demands (Blackwell
and Bates, 1995).
Line 2 is the original as spoken, but edited to remove Note that not all comparative studies make direct
hesitations and phonetic or phonemic errors. Omitted contrasts across two or more languages. A study is also
words are supplied in square brackets if they can be comparative if it selects a language to work in specifi-
reconstructed from the context. Line 1 gives the target cally because that language enables us to tease apart
forms for each substitution error, placed on the first line variables of interest. The elaborate morphology and free
directly above the incorrect form. The third line is a word order of Serbo-Croatian allowed Lukatela and
morph-by-morph translation into the language of publi- colleagues to examine comprehension of morphology
cation, with standard abbreviations indicating case, with word order held constant. Similarly, gender agree-
tense, aspect, and so on, with abbreviations for a‰xes. ment in French was used by Jakubowicz and Goldblum
Line 4 contains a ‘‘smooth’’ translation into the lan- (1995, p. 242) to construct an experiment contrasting
guage of the publication. Lines 1 and 4 are as similar as the preservation of grammatical morphemes in non-
possible in their degree and types of errors—i.e., equally fluent aphasia. They found that ‘‘local’’ (within noun
agrammatic, equally paraphasic. Line 5, not used in phrase) markings were better preserved than ones re-
all publications, identifies part of speech, and codes quiring computation across major syntactic boundaries.
for functor (all lowercase) versus content word (first Luzzatti and De Bleser (1996) reported a similar result
letter uppercase), making it easier to count members of for production.
these categories. This is done because so much psycho-
linguistic theorizing hinges on form class and on the
content/functor distinction. Production Studies and Findings
A variety of production studies have supported the fol-
lowing general claims. (1) The greater the semantic im-
Comprehension Studies and Recent Findings
portance of a morpheme, the more likely that it will be
Bates’s group (e.g., Bates, Friederici, and Wulfeck, produced. For example, although grammatical mor-
1987a) has used a task in which hearers are presented phemes are in general prone to errors, and free gram-
with a string of two nouns and a transitive verb (e.g., matical morphemes tend to be omitted by speakers with
‘‘The cow the pencils kick’’) and must decide which of nonfluent aphasias, negation is almost never omitted
two nouns (or noun phrases) is the agent. The string is (Menn and Obler, 1990b; see also Friederici, Weissen-
presented in all orders, whether grammatical or not in born, and Kail, 1991). (2) The larger the paradigm of
the language in question, and often with conflicting cues choices for a given form, the more likely that errors will
from word order, animacy, and number agreement. be made (Bates, Wulfeck, and MacWhinney, 1991;
Their key finding has been that people with aphasia Paradis, 2001b). Aphasic production errors in morpho-
show the same language-specific preferences for inter- syntax tend to be only one semantic feature away from
preting these strings—the same tendency to place more the target (gender or number or case or tense); the di-
reliance on word order, animacy, or agreement in rection of errors is probabilistic, but more frequent
making their interpretations—as do speakers without forms are more likely to be produced correctly (Dressler,
neurological impairment. Thus, agrammatic aphasia, in 1991). However, an individual may have a preferred
particular, cannot be an eradication of grammar. Fur- ‘‘default’’ form not shared by other aphasic speakers
ther confirmation of this claim comes from Serbo- of the same language (Magnúsdóttir and Thráinsson,
Croatian: Lukatela, Shankweiler, and Crain (1995) used 1991). (3) In paradigms with multiple stem forms, errors
a slightly more natural picture-choice comprehension tend to keep the same stem as the target (Mimouni and
task to show that people with agrammatic aphasia were Jarema, 1997). Semantically appropriate case forms may
able to distinguish between subject-gap sentences (‘‘The be chosen even when the verb or preposition that would
lady is kissing the man who is holding an umbrella’’) and control that case is not produced. Most errors, especially
object-gap sentences (‘‘The lady is kissing the man that those of nonfluent aphasics, are misselections from
the umbrella is covering’’), even when these sentences existing paradigms, but a few, notably some instances in
were constructed with the same word order, so that the Basque, involve the creation of nonexistent forms from
hearers had to rely only on case markers and agreement existing morphemes (Laka and Erriondo Korostola,
markers (subject-verb agreement, modifier-noun agree- 2001) or the production of nonexistent stem forms
ment, agreement between pronouns and their referents, (Swedish: Månsson and Ahlsén, 2001).
(4) Classifier languages show classifier errors (Tzeng Friederici, A., Weissenborn, J., and Kail, M. (1991). Pronoun
et al., 1991), with nonfluent aphasic speakers tending comprehension in aphasia: A comparison of three lan-
to omit them or to fall back on a ‘‘neutral’’ classifier; guages. Brain and Language, 41, 289–310.
speakers with fluent aphasia tend to commit more sub- Jaeger, J. J., Lockwood, A. H., Kemmerer, D. L., Van Valin,
R. D., Murphy, B. W., and Khalak, H. G. (1996). A posi-
stitution errors, as is characteristic of their production
tron emission tomography study of regular and irregular
patterns across morpheme categories and languages. verb morphology in English. Language, 72, 451–497.
Of course, widening the database also broadens the Jakubowicz, C., and Goldblum, M.-C. (1995). Processing of
questions: Why are utterance-final particles preserved in number and gender inflections by French-speaking apha-
Japanese aphasia but not so well in Chinese (Paradis, sics. Brain and Language, 51, 242–268.
2001b)? Why does canonical word order make compre- Kean, M.-L. (1979). Agrammatism: A phonological deficit?
hension and elicited production easier across languages, Cognition, 7, 69–83.
even those that have ‘‘free’’ word order, and why is Laka, I., and Erriondo Korostola, L. (2001). Aphasia mani-
some noncanonical order occasionally used as a pro- festations in Basque. In M. Paradis (Ed.), Manifestations of
duction default? Cross-linguistic psycholinguistic and aphasia symptoms in di¤erent languages (pp. 49–73).
Amsterdam: Pergamon.
computational-linguistic approaches seem to be the most
Lehmann, C. (1982). Directions for interlinear morphemic
promising avenues to further understanding of aphasia, translations. Folia Linguistica, 16, 199–224.
and, more generally, to the question of how language is Lukatela, K., Shankweiler, D., and Crain, S. (1995). Syntactic
represented in the human brain. processing in agrammatic aphasia by speakers of a Slavic
Note. Given the state of the art and the extent of language. Brain and Language, 49, 50–76.
individual variation within aphasia syndromes, I use Luzzatti, C., and De Bleser, R. (1996). Morphological pro-
the terminology of various authors cited, without at- cessing in Italian agrammatic speakers: Eight experiments
tempting to di¤erentiate between the overlapping diag- in lexical morphology. Brain and Language, 54, 26–74.
nostic categories of Broca’s aphasia and agrammatic Magnúsdóttir, S., and Thráinsson, H. (1991). Subject-verb
aphasia (agrammatism); ‘‘non-fluent’’ aphasia includes agreement in aphasia. In H. A. Sigurdsson, T. G. Indri-
dason, and E. Rögnvaldson (Eds.), Papers from the 12th
both of these categories, plus several others from the
Scandinavian Conference on Linguistics (pp. 256–266).
traditional clinical categories. ‘‘Fluent’’ aphasia in- Reykjavı́k, Iceland: Linguistic Institute, University of Ice-
cludes anomic aphasia (anomia) and Wernicke’s apha- land.
sia. Textbook anomic aphasia has fluent articulation, Månsson, A.-C., and Ahlsén, E. (2001). Grammatical features
disfluencies due only to word-finding di‰culties, and no of aphasia in Swedish. In M. Paradis (Ed.), Manifestations
comprehension problems; textbook Wernicke’s aphasia of aphasia symptoms in di¤erent languages (pp. 281–296).
has word-finding problems, comprehension problems, Amsterdam: Pergamon.
fluent articulation, and possible use of empty speech or Menn, L. (2001). Comparative aphasiology. In F. Boller and
nonsense ‘‘words’’ to compensate for impaired lexical J. Grafman (Eds.), Handbook of neuropsychology (2nd ed.),
retrieval. vol. 3: Language and aphasia (R. S. Berndt, vol. ed.,
pp. 51–68). Amsterdam: Elsevier.
—Lise Menn Menn, L., Niemi, J., and Ahlsén, E. (1996). Cross-linguistic
studies of aphasia: Why and how. Aphasiology, 10, 523–532.
References Menn, L., and Obler, L. K. (1988). Findings of the Cross-
Language Aphasia Study: Phase I. Agrammatic narrative.
Bates, E., Chen, S., Tzeng, O., Li, P., and Opie, M. (1991). The Aphasiology, 2, 347–350.
noun-verb problem in Chinese aphasia. Brain and Lan- Menn, L., and Obler, L. K. (1990a). Agrammatic aphasia: A
guage, 41, 203–233. cross-language narrative sourcebook. Amsterdam: John
Bates, E., Friederici, A., and Wulfeck, B. (1987a). Compre- Benjamins.
hension in aphasia: A cross-linguistic study. Brain and Lan- Menn, L., and Obler, L. K. (1990b). Conclusion: Cross-
guage, 32, 19–67. language data and theories of agrammatism. In L. Menn
Bates, E., Wulfeck, B., and MacWhinney, B. (1991). Cross- and L. K. Obler (Eds.), Agrammatic aphasia (vol. II, pp.
linguistic studies of aphasia: An overview. Brain and Lan- 1369–1389). Amsterdam: John Benjamins.
guage, 41, 123–148. [Special issue] Mimouni, Z., and Jarema, G. (1997). Agrammatic aphasia in
Bellugi, U., Poizner, H., and Klima, E. S. (1989). Language, Arabic. Aphasiology, 11, 125–144, 1997.
modality, and the brain. Trends in Neurosciences, 10, 380– Nilipour, R. (2000). Agrammatic language: Two cases from
388. Farsi. Aphasiology, 14, 1205–1242.
Blackwell, A., and Bates, E. (1995). Inducing agrammatic Paradis, M. (Ed.). (2001a). Manifestations of aphasia symptoms
profiles in normals: Evidence for the selective vulnerability in di¤erent languages. Amsterdam: Pergamon.
of morphology under cognitive resource limitation. Journal Paradis, M. (2001b). By way of a preface: The need for
of Cognitive Neuroscience, 7, 228–257. awareness of aphasia syndromes in di¤erent languages. In
Chen, S., and Bates, E. (1998). The dissociation between nouns M. Paradis (Ed.), Manifestations of aphasia symptoms in
and verbs in Broca’s and Wernicke’s aphasia: Findings from di¤erent languages. Amsterdam: Pergamon.
Chinese. Aphasiology, 12, 5–36. Pinker, S. (1999). Words and rules. New York: Basic Books.
Dressler, W. U. (1991). The sociolinguistic and patholinguistic Slobin, D. (Ed.). (1985–1995). The cross-linguistic study of
attrition of Breton phonology, morphology, and morpho- language acquisition (vols. 1–5). Hillsdale, NJ: Erlbaum.
nology. In H. W. Seliger and R. M. Vago (Eds.), First lan- Tzeng, O. J. L., Chen, S., and Hung, D. L. (1991). The classi-
guage attrition (pp. 99–112). Cambridge, U.K.: Cambridge fier problem in Chinese aphasia. Brain and Language, 41,
University Press. 184–202.
Argument Structure: Representation and Processing 269
Further Readings Niemi, J., and Laine, M. (1997). Syntax and inflectional mor-
phology in aphasia: Quantitative aspects of Wernicke
Ahlsén, E., Nespoulous, J.-L., Dordain, M., Stark, J., Jarema, speakers’ narratives. Journal of Quantitative Linguistics, 4,
G., Kadzielawa, D., et al. (1996). Noun-phrase production 181–189.
by agrammatic patients: A cross-linguistic approach. Sasanuma, S., and Fujimura, O. (1971). Selective impairment
Aphasiology, 10, 543–560. of phonetic and nonphonetic transcription of words in Jap-
Bastiaanse, R., Edwards, S., and Kiss, K. (1996). Fluent anese aphasic patients: Kana vs. kanji in visual recognition
aphasia in three languages: Aspects of spontaneous speech. and writing. Cortex, 7, 1–18.
Aphasiology, 10, 561–576. Slobin, D. (1991). Aphasia in Turkish: Speech production in
Bastiaanse, R., and van Zonneveld, R. (1998). On the relation Broca’s and Wernicke’s patients. Brain and Language, 41,
between verb inflection and verb position in Dutch agram- 149–164.
matic aphasics. Brain and Language, 64, 165–181. Tzeng, O. J. L. (1992). Reading and lateralization. In
Bates, E., Friederici, A., and Wulfeck, B. (1987b). Grammati- W. Bright (Ed.), International Encyclopedia of Linguistics
cal morphology in aphasia: Evidence from three languages. (vol. 3). New York: Oxford University Press.
Cortex, 23, 545–574. Vakareliyska, C. (1993). Implications from aphasia for the
Bates, E., and Wulfeck, B. (1989). Comparative aphasiology: syntax of null-subject sentences: Underlying subject slot in
A cross-linguistic approach to language breakdown. Bulgarian. Cortex, 29, 409–430.
Aphasiology, 3, 111–142. Wulfeck, B., Bates, E., and Capasso, R. (1991). A cross-
Eng, N., Obler, L. K., Harris, K. S., and Abramson, A. S. linguistic study of grammaticality judgments in Broca’s
(1996). Tone perception deficits in Chinese-speaking Broca’s aphasia. Brain and Language, 41, 311–336.
aphasics. Aphasiology, 10, 649–656. Yiu, E., and Worrall, L. E. (1996). Agrammatic production: A
Grodzinsky, Y. (1984). The syntactic characterization of cross-linguistic comparison of English and Cantonese.
agrammatism. Cognition, 16, 99–120. Aphasiology, 10, 623–648.
Grodzinsky, Y. (1990). Theoretical perspectives on language
deficits. Cambridge, MA: MIT Press.
Halliwell, J. F. (2000). Korean agrammatic production. Argument Structure: Representation
Hickok, G., Wilson, M., Clark, L., Klima, E. S., Kritchevsky, and Processing
M., and Bellugi, U. (1999). Discourse deficits following
right hemisphere damage in deaf signers. Brain and Lan-
guage, 66, 233–248. In a Principles and Parameters syntax framework, sen-
Jarema, G. (1998). The breakdown of morphology in aphasia: tences are derived by two operations, merger and move-
A cross-linguistic perspective. In B. Stemmer and W. ment. Merger takes two categories as input (e.g., V and
Whitaker (Eds.), Handbook of neurolinguistics (pp. 221– NP) and merges them into a single, higher-order cate-
234). Orlando, FL: Academic Press. gory (e.g., VP). There are, however, constraints on the
Jarema, G., and Kehayia, E. (1992). Impairment of inflectional categories that can be merged successfully. Consider the
morphology and lexical storage. Brain and Language, 43, following pairs:
541–564.
Kegl, J., and Poizner, H. (1997). Cross-linguistic/cross-modal 1a. [NP The girl] sneezed
syntactic consequences of left-hemisphere damage: Evidence 1b. *[NP The girl] sneezed [NP the boy]
from an aphasic signer and his identical twin. Aphasiology,
11, 1–38. 2a. [NP The girl] defeated [NP the boy]
MacWhinney, B., and Osmán-Sági, J. (1991). Inflectional 2b. *[NP The girl] defeated
marking in Hungarian aphasics. Brain and Language, 41,
165–183.
3a. [NP The girl] gave [NP the prize] [PP to [NP the
Menn, L. (1989). Comparing approaches to comparative boy]]
aphasiology. Aphasiology, 3, 143–150. 3b. *[NP The girl] gave [NP the prize]
Menn, L., O’Connor, M. P., Obler, L. K., and Holland, A. L. The (a) examples above are well-formed sentences;
(1995). Non-fluent aphasia in a multi-lingual world. Amster-
dam: John Benjamins.
the (b) versions, containing the same verbs but di¤erent
Menn, L., Reilly, K. F., Hayashi, M., Kamio, A., Fujita, structures following the verbs, are ill-formed. Thus, not
I., and Sasanuma, S. (1998). The interaction of pre- all verbs can fit into all sentence structures. How, then,
served pragmatics and impaired syntax in Japanese and does a theory of syntax account for these facts? Bor-
English aphasic speech. Brain and Language, 61, 183– rowing from logic, we can say that sentences are com-
225. posed of a verb (i.e., predicate) and a set of arguments.
Miceli, G., Mazzucchi, A., Menn, L., and Goodglass, H. A verb denotes an activity or event and an argument
(1983). Contrasting cases of Italian agrammatic aphasia denotes a participant in the activity or event. So, in the
without comprehension disorder. Brain and Language, 19, grammatical (a) versions above, the sentences contain
65–97. the appropriate number of arguments the verb entails;
Miceli, G., Silveri, M. C., Villa, G., and Caramazza, A. (1984).
On the basis of agrammatics’ di‰culty in producing main
in the ungrammatical (b) versions there is either an extra
verbs. Brain and Language, 36, 447–492. argument (as in (1b)) or a required argument is missing
Nicol, J. L., Jakubowicz, C., and Goldblum, M. C. (1996). (as in (2b) and (3b)), hence violating the argument struc-
Sensitivity to grammatical marking in English-speaking ture of the verb.
and French-speaking non-fluent aphasics. Aphasiology, 10, Not all the phrases in a sentence function as argu-
593–622. ments of a verb. Consider:
4. [The girl] defeated [the boy] on the beach/this after- those arguments can serve the role of subject, object, and
noon/with great finesse. oblique object, respectively. The verb donate, for exam-
ple, requires three arguments—a subject NP, a direct
The bracketed NPs are clearly participants in the event
object NP, and an indirect (oblique) object NP—as in:
denoted by defeated, and thus are arguments of the verb.
However, the italicized phrases do not represent partic- 7. [The girl/AGENT] donated [the present/THEME] to
ipants in the event. Instead, they carry additional infor- [the boy/GOAL].
mation (i.e., where the event took place, when, and the Unlike (7), where the properties of the verb donate
manner in which it took place). These expressions are entail canonical linking, there are verbs with properties
considered to be adjuncts; that is, they are adjunctive that entail noncanonical linking. Consider, for example,
or in addition to the information specified by the verb. receive, which entails a reversal of the canonical assign-
Simplifying a bit, adjuncts are typically optional while ment of Agent and Goal arguments:
arguments are often required.
8. [The boy/GOAL] received [the present/THEME]
Thematic Roles. The NPs that are participants/argu- from [the girl/AGENT].
ments of the verb in (4) play di¤erent semantic roles
Sentences (7) and (8) reflect a well-known bias that
in relation to the verb defeated. A more comprehensive
suggests the ‘‘sender’’ seems more volitional and is a
account of argument structure, then, needs to consider a
more plausible candidate for the Agent role than a
description of these roles. For example, in (4) the NP the
‘‘receiver’’ (Dowty, 1991). Importantly, there are no
boy is the Agent of defeat, while the girl is the a¤ected
positional or configurational distinctions between the
object and hence the Patient (or Theme). Some common arguments to signal the di¤erence in thematic order
thematic roles, then, are Agent, Experiencer, Theme/
in the above examples; that is, the underlying syntax
Patient, Goal, and Location.
between the constructions appears to be the same. Thus,
A verb (that is, a lexical category) assigns its thematic
the distinction is based on the inherent properties of the
roles to its arguments through theta marking. For ex-
verb.
ample, the verb defeat is said to theta-mark the subject
Unlike the sentences above, where thematic roles can
argument with the Agent role and the object argument
be directly assigned from inherent lexical information
with the Theme (Patient) role:
and linking relationships, sentences with so-called dis-
5. defeat V placed arguments require indirect thematic role assign-
ment. For example, consider the following noncanonical
(Agent Theme) cleft-object sentence: ‘‘It was the boy who the girl
[NP The girl] defeated [NP the boy] *The girl defeated kissed yesterday.’’ The direct object NP the boy
! !

has been displaced from its canonical, post-verb argu-
ment position in the sentence, leaving a gap. Such
constructions are often referred to as filler-gaps. Verb-
Thus, each lexical category (e.g., verb) has a set of argument structure properties influence such construc-
argument structure features that must be satisfied in the tions rather directly; for example, in the cleft-object
sentence in which the word appears. If those features are case, a verb must license a direct object argument posi-
not satisfied, the sentence will be ungrammatical. tion in order to form a filler-gap dependency.
Verbs can also have clauses as arguments; these too Given its syntactic and semantic importance, then,
need to be theta-marked: argument structure (in various forms) has played a priv-
6. know V ileged role in accounts of language processing. One of
the earliest attempts to show that such lexically based
(Experiencer Proposition) information has repercussions for normal adult sentence
[NP The coach] knew [CP that the girl defeated the boy] processing was that of Fodor, Garrett, and Bever (1968).
!
! They inserted verbs di¤ering in grammatical complexity

(defined, in current terms, by the types of arguments

each allowed) into matched sentences and found that
The verb know assigns Experiencer to the subject NP o¤-line performance on those sentences decreased when
argument and Proposition to the CP argument. verbs were more complex. Similar e¤ects were later
found on-line (e.g., Shapiro, Zurif, and Grimshaw,
Processing. Canonical linking rules have been hy- 1987).
pothesized to play a key role in the acquisition (van der What is important here is not just the fact that there
Lely, 1994) and the processing of such verb-argument are observed ‘‘e¤ects’’ of argument structure, but what
structures (see, e.g., McRae, Spivey-Knowlton, and Ta- those e¤ects suggest about the architecture of the sen-
nenhaus, 1998). This linking or mapping refers to the tence processing system. Briefly, most current accounts
regular, most frequent relation found between thematic claim that when a verb (or any theta-assigning head of a
roles and syntactic functions (Pesetsky, 1995). For ex- phrase, including prepositions) is encountered in a sen-
ample, if an individual knows that a verb involves an tence, its various argument structure configurations are
Agent, Patient/Theme, and Goal, she can infer that momentarily activated (see, e.g., Pritchett, 1992; Mac-
Argument Structure: Representation and Processing 271
Donald, Pearlmutter, and Seidenberg, 1994). On some Frazier, L., and Clifton, C. (1996). Construal. Cambridge, MA:
accounts this information is ordered in terms of ‘‘pref- MIT Press.
erence,’’ which then helps determine which of a set of Friederici, A. D., Hahne, A., and Mecklinger, A. (1966).
parses the system initially attempts (Shapiro, Nagel, and Temporal structure of syntactic parsing: Early and late
event-related brain potential e¤ects. Journal of Experi-
Levine, 1993; Trueswell, Tanenhaus, and Kello, 1993).
mental Psychology: Learning, Memory, and Cognition, 22,
Such preference e¤ects suggest that argument structure 1219–1248.
information may be used immediately to help analyze Grodzinsky, Y. (2000). The neurology of syntax: Language use
sentence input. Indeed, an influential set of theories (e.g., without Broca’s area. Behavioral and Brain Sciences, 23,
MacDonald, Pearlmutter, and Seidenberg, 1994) sug- 47–117.
gests just that. MacDonald, M. C., Pearlmutter, N. J., and Seidenberg, M. S.
However, there remains an equally influential alter- (1994). Lexical nature of syntactic ambiguity resolution.
native which suggests that there are (at least) two passes Psychological Review, 101, 676–703.
through a sentence (Frazier and Clifton, 1996). The first McRae, K., Spivey-Knowlton, M. J., and Tanenhaus, M. K.
pass considers only categorical information (e.g., DET, (1998). Modeling the influence of thematic fit (and other
constraints) in on-line sentence comprehension. Journal of
N, NP, V, VP, etc.) and perhaps the number of argu-
Memory and Language, 38, 283–312.
ments a verb entails, and essentially builds a skeletal Pesetsky, D. (1995). Zero syntax. Cambridge, MA: MIT Press.
phrase structure representation of the input; the second Pritchett, B. (1992). Grammatical competence and parsing per-
pass considers lexical-semantic and contextual informa- formance. Chicago: University of Chicago Press.
tion. In some of these accounts, detailed thematic infor- Russo, K. D., Peach, R. K., and Shapiro, L. P. (1998). Verb
mation is explicitly claimed to be part of the second-pass preference e¤ects in the sentence comprehension of fluent
analysis (Friederici, Hahne, and Mecklinger, 1996). aphasic individuals. Aphasiology, 12, 537–545.
Finally, the representation and processing of argu- Schwartz, M. F., Linebarger, M. C., Sa¤ran, E. M., and Pate,
ment structure has important implications for language D. C. (1987). Syntactic transparency and sentence inter-
disorders underlying aphasia. Briefly, the ‘‘mapping def- pretation in aphasia. Language and Cognitive Processes, 2,
55–113.
icit’’ account (e.g., Schwartz et al., 1987) has suggested
Shapiro, L. P., Gordon, B., Hack, N., and Killackey, J. (1993).
that the sentence comprehension patterns evinced by Verb-argument structure processing in complex sentences in
some agrammatic Broca’s aphasic individuals may be Broca’s and Wernicke’s aphasia. Brain and Language, 45,
explained by their inability to ‘‘map’’ thematic roles onto 423–447.
grammatical (i.e., subject, object) positions, particularly Shapiro, L. P., Nagel, N., and Levine, B. A. (1993). Prefer-
in sentences that have noncanonical mapping. A more ences for a verb’s complements and their use in sentence
detailed and circumscribed account of the deficit is processing. Journal of Memory and Language, 32, 96–114.
o¤ered by the trace deletion hypothesis (e.g., Grod- Shapiro, L. P., Zurif, E., and Grimshaw, J. (1987). Sentence
zinsky, 2000). Here, the claim is that knowledge of processing and the mental representation of verbs. Cogni-
argument structure is intact for these individuals (for tion, 27, 219–246.
Trueswell, J. C., Tanenhaus, M. K., and Kello, C. (1993).
on-line evidence of this fact, see Shapiro et al., 1993).
Verb-specific constraints in sentence processing: Separating
However, traces of moved referential NPs or arguments e¤ects of lexical preference from garden-paths. Journal of
are deleted, and hence indirect thematic role assignment Experimental Psychology: Learning, Memory, and Cogni-
is blocked. Instead, these individuals appear to use an tion, 19, 528–553.
‘‘agent-first’’ strategy for arguments that cannot receive van der Lely, H. K. J. (1994). Canonical linking rules: Forward
a grammatically computed thematic role, explaining versus reverse linking in normally developing and specifi-
performance on a wide range of sentence types. cally language-impaired children. Cognition, 51, 29–72.
Unlike Broca’s aphasic individuals, those individuals Zurif, E. B., Swinney, D., Prather, P., Solomon, J., and
most likely characterized as Wernicke’s syndrome type Bushell, C. (1993). An on-line analysis of syntactic process-
appear to be insensitive to the argument structure prop- ing in Broca’s and Wernicke’s aphasia. Brain and Language,
45, 448–464.
erties of verbs, even where on-line comprehension is
at issue (Shapiro et al., 1993; Russo, Peach, and Sha-
piro, 1998). Yet, their deficit does not seem to a¤ect on- Further Readings
line comprehension of sentences with moved arguments Canseco-Gonzalez, E., Shapiro, L. P., Zurif, E. B., and Baker,
(Zurif et al., 1993). These patterns therefore suggest a E. (1990). Predicate-argument structure as a link between
double dissociation between the activation of argument linguistic and nonlinguistic representations. Brain and Lan-
structures and the syntactic parsing routines underlying guage, 39, 391–404.
the comprehension of sentences with moved arguments. Friederici, A. D., and Frisch, S. (2000). Verb argument struc-
ture processing: The role of verb-specific and argument-
—Lewis P. Shapiro specific information. Journal of Memory and Language, 43,
476–507.
References Grodzinsky, Y., Shapiro, L. P., and Swinney, D. A. (Eds.).
Dowty, D. (1991). Thematic proto-roles and argument selec- (2000). Language and the brain: Representation and pro-
tion. Language, 3, 547–619. cessing. San Diego, CA: Academic Press.
Fodor, J. A., Garrett, M., and Bever, T. G. (1968). Some syn- Kemmerer, D. (2000). Grammatically relevant and grammati-
tactic determinants of sentential complexity: II. Verb struc- cally irrelevant features of verb meaning can be indepen-
ture. Perception and Psychophysics, 6, 453–461. dently impaired. Aphasiology, 14, 997–1020.
McRae, K., Ferretti, T. R., and Amyote, L. (1997). Thematic In his important treatise on attention and e¤ort,
roles as verb-specific concepts. Language and cognitive pro- Kahneman (1973) specified some defining attributes of
cesses: Special issue on lexical representations in sentence attention. Among his attributes, he suggested that at-
processing. tention is a limited capacity commodity (whether viewed
Rayner, K., Carlson, M., and Frazier, L. (1983). The inter-
as a single or multiple pool system). Attention is mobile
action of syntax and semantics during sentence process-
ing. Journal of Verbal Learning and Verbal Behavior, 22, and can be shifted either through mechanisms of ori-
358–374. enting, enduring dispositions, or through the executive
Shapiro, L. P., and Levine, B. A. (1990). Verb processing control system. The distributor of processing resources
during sentence comprehension in aphasia. Brain and Lan- allots attention according to a policy that (1) is biased
guage, 38, 21–47. toward novel stimuli, (2) has the ability to allocate
Shapiro, L. P., Zurif, E., and Grimshaw, J. (1989). Verb rep- attention to a particular domain or message, and (3)
resentation and sentence processing: contextual impenetra- operates as a function of externally generated arousal
bility. Journal of Psycholinguistic Research, 18, 223–243. levels. That attention is limited in capacity has been a
Tanenhaus, M. K., Boland, J. E., Mauner, G. A., and Carlson, central organizing principle for much of the research in
G. N. (1993). More on combinatory lexical information:
attention and has given rise to the ‘‘dual task’’ paradigm,
Thematic structure in parsing and interpretation. In
G. T. M. Altmann and R. Shillcock (Eds.), Cognitive mod- a widely used research method for investigating atten-
els of speech processing: The Second Sperlonga Meeting (pp. tion. The dual task is an experimental procedure where-
297–319). Hove, U.K.: LEA. by two tasks are performed concurrently and some
aspect of each task is manipulated independently. The
tasks are frequently manipulated by having the subject
Attention and Language voluntarily allocate di¤erent percentages of attention or
e¤ort to each task (e.g., 50%/50%, 25%/75%, 75%/25%,
100%/0%, 0%/100%). If attention is shared between the
The construct of attention has a long and occasionally two tasks, a trading of performance levels is expected
tortuous history. Though regarded as central to psy- and is typically expressed as a performance operating
chology and as fundamental to human experience by curve (POC). While the validity of the voluntary alloca-
James (1890), Wundt (1973), and other founders of tion part of the design has been challenged (Gopher,
psychology, its inability to be characterized unitarily Brickner, and Navon, 1982), an alternative method is
has led many to regard it as theoretically incoherent frequently used in which the inherent di‰culty of the
(Cohen, 1993). Fischler (2001) has discussed the multi- two tasks is manipulated parametrically. Again, a trad-
componential nature of attention and the information- ing of performance levels is expected if the two tasks
processing, factor-analytic, and brain systems contexts share a common pool or source of attention, and a POC
in which these processes are studied. The processes or is plotted and measured in order to test this hypothesis.
components of attention are frequently characterized The dual task paradigm, however, is not the only or
as (1) overall arousal, (2) orienting to novel stimuli, (3) even the most widely used approach to study attention.
selectivity to endogenous or exogenous stimuli, (4) divi- Without doubt, the Stroop test (Stroop, 1935) is the most
sion among concurrent tasks, (5) executive control of widely researched attention task. In this task, unwanted
attention (resource allocation), and (6) vigilance or sus- intrusion of information is assessed through the rapid
tained attention. Though conceptualized as independent identification of colors or words of stimuli that are either
processes or components, the demonstration of this congruent (e.g., written word ‘‘red’’ in red print) or in-
modularity has been di‰cult to instantiate, and studies congruent (e.g., written word ‘‘red’’ in blue print). In this
designed to do so often remain confounded. For exam- task, the subject is required to identify either the word or
ple, selective focus of one’s conceptual and perceptual the color, and accuracy and response times are mea-
systems to external stimuli requires a mechanism for sured. Naming the color of a written word in the incon-
inhibiting some stimuli while allowing passage and acti- gruent condition produces poorer accuracy and longer
vation of the intended stimuli. This notion invokes the response times than in congruent conditions, indicating
distinctions between top-down and bottom-up process- a competition for activation and inhibition of linguistic
ing, resource- versus data-driven processing, and con- and nonlinguistic intentions and stimuli. N400 evoked
trolled versus automatic processing. It also invokes the potentials (Kutas and Hillyard, 1980; Bentin, 1987;
notions of selective versus divided attention, as well as Holcomb, 1988) and functional imaging (e.g., Just et al.,
an executive system that is capable of directing or allo- 1996) are also common methods used to assess the role
cating mental e¤ort toward specific stimuli or actions. of attention in language processing.
These attentional processes accomplish this in finitely Experimental paradigms are not the only source of
timed intervals and in controlled amounts. Indeed, the evidence that attention is a construct worthy of study.
models of attention are complex, and the study of at- Introspection also has provided a motivation for enter-
tention is untidy. However, the struggle has produced a taining the notion of attention and its relationship to
large and continuous flow of theoretical and experimen- language. Indeed, most adults have had the experience
tal evidence supporting its validity as a field of study. of having read several pages of written material only to
The importance of attention in theories of consciousness, discover that nothing of what was read was remembered
cognition, and brain dysfunction justifies the pursuit. because the mind had wandered and focused on review-
Attention and Language 273
ing yesterday’s particularly puzzling diagnostic or a nonlinguistic information. For example, the attentional
previous argument with a colleague or the dean, or had demands placed on language processing have been illus-
focused on planning an upcoming holiday, course lec- trated for lexical/semantic processing through priming
ture, or treatment plan. Likewise, most have discovered paradigms (Neely, 1977), through evoked potentials
the need to turn the radio o¤ when encountering a di‰- (Kutas and Hillyard, 1980), and through dual-task
cult driving condition or courteously requesting the studies (Arvedson and McNeil, 1987; Murray, 2000).
children to refrain from their backseat banter while for- Attentional demands for language have also been dem-
mulating a response to the patrolman approaching the onstrated in dual-task studies for syntactic processing
car following an apparent tra‰c violation. Although it by Blackwell and Bates (1995), for phonemic processing
is quite intuitive that attention to both internal and ex- by Tseng, McNeil, and Milenkovic (1993), for auditory
ternal stimuli plays an important role in many (perhaps prosody processing by Slansky and McNeil (1997), and
all) language tasks, major syntheses and analyses of between language and nonlinguistic tasks by LaPointe
the general attention literature (e.g., Lang, Simons, and and Erickson (1991).
Balaban, 1997; Pashler, 1998) and the neuropsycholog- Disorders of language are common and account for
ical deficits of attention (e.g., Cohen, 1993; van Zomeren a sizable proportion of all communication disorders.
and Brouwer, 1994) have failed to address the role of Within the various classification systems for language
attention in language processing. Indeed, not one of disorders, it is widely recognized that there are multiple
these major texts devoted to attention have addressed causes. Most systems acknowledge deficits at the repre-
the role of attention in developmental disorders of sentational level, including the rules used to govern these
language such as specific language impairment (SLI), representations. Deficits at this level are often referred
or in acquired disorders a¤ecting language processing, to as deficits of linguistic competence. A variety of per-
such as those of aphasia or traumatic brain injury. Only formance factors are also recognized that can cause an
very recently has this subject received space in edited otherwise competent or intact linguistic system to mal-
books on language and aphasia (e.g., Nadeau, Rothi, function. Examples of performance deficits include dis-
and Crosson, 2001) and only relatively recently have orders of linguistic-specific memory processes (Baddeley,
theoretical formulations (McNeil, Odell, and Tseng, 1993; Crosson, 2001b) and slowed perceptual or cogni-
1991) of how attention might account for language tive mechanisms (Tallal, Stark, and Mellits, 1985). Dis-
impairments, and summary reviews (Murray, 2000; orders of various aspects of the attentional system
Crosson, 2001a; Fischler, 2001) o¤ered explanations or include orienting of attention (Robin and Rizzo, 1989),
hypotheses of how attention might interact with lan- selective attention (Petry et al., 1994; Murray, Holland,
guage impairments. and Beeson, 1998), inability to engage or disengage at-
Language knowledge is characterized by the infor- tention (Posner, Snyder, and Davidson, 1980), and re-
mation that is represented in the brain along with the source allocation (McNeil, Odell, and Tseng, 1991). The
rules that govern it. Linguistic theory attempts to ac- construct of attentional deficits underlying language
count for the structure of the information (rules and deficits is neither new (e.g., Kreindler and Fradis, 1968)
representations) that is stored in memory. Psycholin- nor restricted to aphasia. Campbell and McNeil (1985),
guistic theory attempts to account for conditions under for example, illustrated attentional deficits in an
which the rules and representations are stored or acquired pediatric language disorder population, and
accessed and the various ways in which the di¤erent Barkley (1996) has applied the construct to attention
components are combined to produce or comprehend deficit-hyperactivity disorder. However, restricting the
sounds, morphophonemes, words, phrases, sentences, discussion to the language impairment in aphasia, the
and discourse. Informing and directing the linguistic past decade has seen a renewed interest in various
system requires each individual language user to engage aspects of attention, but primarily in the allocation of
in a finitely tuned interplay between internally generated processing resources.
intentions, linguistic knowledge, and a massive amount While skepticism remains apparent in some circles,
of sensory information that is continuously available, in it is widely recognized that explanations of language
addition to the selection (planning), programming, and and other domains of cognition (e.g., memory, learning,
execution of appropriate responses. This temporally executive function) that fail to account for attentional
demanding interplay creates an astonishing array of phenomena will remain incomplete. This is especially
factors that have to be sorted and managed at all true for those areas of cognitive dysfunction resulting
instances in time and on a continuous basis. It is the from brain damage (congenital or acquired, regardless
domain and role of attention and resource allocation of the time or cause of the injury) and developmental
to account for the gating (inhibition) and activation of disabilities.
endogenous intentions and exogenous stimuli involved
in the formulation, comprehension, and production of —Malcolm R. McNeil
language. Indeed, the role of attention in normal lan-
guage processing has a long history, and evidence
supports the conclusion that all levels of language pro-
References
cessing require and compete for attentional resources Arvedson, J. C., and McNeil, M. R. (1987). Accuracy and
with other language processes and with the processing of response times for semantic judgments and lexical decisions
with left and right hemisphere lesions. Clinical Aphasiology, Murray, L. L. (2000). The e¤ects of varying attentional
17, 188–201. demands on the word retrieval skills of adults with aphasia,
Baddeley, A. D. (1993). Working memory or working atten- right hemisphere brain damage, or no brain damage. Brain
tion? In A. D. Baddeley and L. Weiskrantz (Eds.), Atten- and Language, 72, 40–72.
tion: Selection, awareness and control. A tribute to Donald Murray, L. L., Holland, A. L., and Beeson, P. M. (1998).
Broadbent (pp. 152–170). Oxford: Oxford University Press. Spoken language of individuals with mild fluent aphasia
Barkley, R. A. (1996). Critical issues in research on atten- under focused and divided-attention conditions. Journal of
tion. In G. R. Lyon and N. A. Krasnegor (Eds.), Attention, Speech, Language, and Hearing Research, 41, 213–227.
memory, and executive function (pp. 450–456). Baltimore: Nadeau, S. E., Gonzalez Rothi, L. J., and Crosson, B. (Eds.).
Paul H. Brookes. (2001). Aphasia and language: Theory to practice. New
Bentin, S. (1987). Event-related potentials, semantic processes, York: Guilford Press.
and expectancy factors in word recognition. Brain and Neely, J. H. (1977). Semantic priming and retrieval and lexical
Language, 31, 308–327. memory: Roles of inhibitionless spreading activation and
Blackwell, A., and Bates, E. (1995). Inducing agrammatic limited-capacity attention. Journal of Experimental Psy-
profiles in normals: Evidence for the selective vulnerability chology: General, 106, 226–254.
of morphology under cognitive resource limitation. Journal Pashler, H. E. (1998). The psychology of attention. Cambridge,
of Cognitive Neuroscience, 7, 228–257. MA: MIT Press.
Campbell, T. F., and McNeil, M. R. (1985). E¤ects of presen- Petry, M., Crosson, B., Rothi, L. J., Bauer, R. M., and
tation rate and divided attention on auditory comprehen- Schauer, C. A. (1994). Selective attention and aphasia in
sion in children with an acquired language disorder. Journal adults: Preliminary findings. Neuropsychologia, 32, 1397–
of Speech and Hearing Research, 28, 513–520. 1408.
Cohen, R. A. (1993). The neuropsychology of attention. New Posner, M. I., Snyder, C. R. R., and Davidson, B. J. (1980).
York: Plenum Press. Attention and the detection of signals. Journal of Experi-
Crosson, B. (2001a). Systems that support language processes: mental Psychology: General, 109, 160–174.
Attention. In S. E. Nadeau, L. J. Gonzalez Rothi, and B. Robin, D. A., and Rizzo, M. (1989). The e¤ects of focal cere-
Crosson (Eds.), Aphasia and language: Theory to practice bral lesions on intramodal and cross-modal orienting of
(pp. 372–398). New York: Guilford Press. attention. Clinical Aphasiology, 18, 61–74.
Crosson, B. (2001b). Systems that support language processes: Slansky, B. L., and McNeil, M. R. (1997). Resource allocation
Verbal working memory. In S. E. Nadeau, L. J. Gonzalez in auditory processing of emphatically stressed stimuli in
Rothi, and B. Crosson (Eds.), Aphasia and language: aphasia. Aphasiology, 4–5, 461–472.
Theory to practice (pp. 399–418). New York: Guilford Stroop, J. R. (1935). Studies of interference in serial verbal
Press. reactions. Journal of Experimental Psychology, 18, 643–
Fischler, I. (2001). Attention, resource allocation, and lan- 662.
guage. In S. E. Nadeau, L. J. Gonzalez Rothi, and B. Tallal, P., Stark, R. E., and Mellits, D. (1985). The relationship
Crosson (Eds.), Aphasia and language: Theory to practice between auditory temporal analysis and receptive language
(pp. 348–371). New York: Guilford Press. development: Evidence from studies of developmental lan-
Gopher, D., Brickner, M., and Navon, D. (1982). Di¤erent guage disorder. Neuropsychologia, 4, 527–534.
di‰culty manipulations interact di¤erently with task em- Tseng, C.-H., McNeil, M. R., and Milenkovic, P. (1993).
phasis: Evidence for multiple resources. Journal of Experi- An investigation of attention allocation deficits in aphasia.
mental Psychology: Human Perception and Performance, 8, Brain and Language, 45, 276–296.
146–158. van Zomeren, A. H., and Brouwer, W. H. (1994). Clinical
Holcomb, P. J. (1988). Automatic and attentional processing: neuropsychology of attention. New York: Oxford University
An event-related brain potential analysis of semantic pri- Press.
ming. Brain and Language, 35, 371–395. Wundt, W. M. (1973). An introduction to psychology. New
James, H. (1890). The principles of psychology (2 vols.). New York: Arno.
York: Holt.
Just, M. A., Carpenter, P. A., Keller, T. A., Eddy, W. F., and Further Readings
Thulborn, K. R. (1996). Brain activation modulated by
sentence comprehension. Science, 274, 114–116. Ballesteros, S., Manga, D., and Coello, T. (1989). Attentional
Kahneman, D. (1973). Attention and e¤ort. Englewood Cli¤s, resources in dual-task performance. Bulletin of the Psycho-
NJ: Prentice Hall. nomic Society, 27, 425–428.
Kreindler, A., and Fradis, A. (1968). Performances in aphasia: Erickson, R. J., Goldinger, S. D., and LaPointe, L. L. (1996).
A neurodynamical diagnostic and psychological study. Paris: Auditory vigilance in aphasic individuals: Detecting non-
Gauthier-Villars. linguistic stimuli with full or divided attention. Brain and
Kutas, M., and Hillyard, S. A. (1980). Reading senseless Cognition, 30, 244–253.
sentences: Brain potentials reflect semantic incongruity. Friedman, A., Polson, M. C., and Dafoe, C. G. (1988). Divid-
Science, 207, 203–205. ing attention between the hands and the head: Performance
Lang, P. J., Simons, R. F., and Balaban, M. T. (1997). Atten- trade-o¤s between rapid finger tapping and verbal memory.
tion and orienting: Sensory and motivational processes. Journal of Experimental Psychology: Human Perception and
Mahwah, NJ: Erlbaum. Performance, 14, 60–68.
LaPointe, L. L., and Erickson, R. J. (1991). Auditory vigilance Gopher, D., and Sanders, A. F. (1984). S-Oh-R: Oh stages! Oh
during divided task attention in aphasic individuals. resources. In W. Prinz and A. F. Sanders (Eds.), Cognition
Aphasiology, 5, 511–520. and motor processes. Berlin: Springer-Verlag.
McNeil, M. R., Odell, K. H., and Tseng, C.-H. (1991). Toward Hirst, W., and Kalmar, D. (1987). Characterizing attentional
the integration of resource allocation into a general theory resources. Journal of Experimental Psychology: General,
of aphasia. Clinical Aphasiology, 20, 21–39. 116, 68–81.
Auditory-Motor Interaction in Speech and Language 275
Maxfield, L. (1997). Attention and semantic priming: A review words accurately and by the e¤ects of late-onset deafness
of prime task e¤ects. Consciousness and Cognition: An In- on speech output (Waldstein, 1989).
ternational Journal, 6, 204–218. Clinical evidence supports the view that ‘‘sensory’’
Murray, L. L. (1999). Attention and aphasia: Theory, research cortex participates in speech production. The classical
and clinical implications. Aphasiology, 13, 91–112.
fluent aphasias—Wernicke’s aphasia, conduction apha-
Murray, L. L., Holland, A. L., and Beeson, P. M. (1997a).
Auditory processing in individuals with mild aphasia: A sia, transcortical sensory aphasia, and anomic aphasia—
study of resource allocation. Journal of Speech, Language, are all associated with left posterior cerebral lesions,
and Hearing Research, 40, 792–808. that is, with regions that are commonly thought to be
Murray, L. L., Holland, A. L., and Beeson, P. M. (1997b). sensory in nature. Yet each of these fluent aphasias
Grammaticality judgments of mildly aphasic individuals has prominent speech output symptoms: semantic and/
under dual-task conditions. Aphasiology, 11, 993–1016. or phonemic paraphasias (speech errors), paragram-
Navon, D. (1984). Resources: A theoretical soup stone? Psy- matism (inappropriate use of grammatical markers),
chological Review, 91, 2216–2234. and anomia (naming di‰culties) (Damasio, 1992). This
Navon, D., and Gopher, D. (1979). On the economy of observation demonstrates the general point that poste-
the human processing system. Psychological Review, 86,
rior ‘‘sensory’’ systems play an important role in speech
214–255.
Navon, D., and Gopher, D. (1979). Task di‰culty, resources production.
and dual-task performance. Attention and Performance, 15, Evidence relevant to the more specific issue of audi-
297–315. tory-motor integration comes from conduction aphasia
Norman, D., and Bobrow, D. (1975). On data limited and re- (Hickok, 2001). A hallmark of conduction aphasia is
source limited processing. Journal of Cognitive Psychology, the predominance of phonemic paraphasias, which can
7, 44–60. be evident across a large range of production tasks,
Schneider, W., and Shi¤rin, R. M. (1977). Controlled and including spontaneous speech, naming, reading aloud,
automatic human information processing: I. Detection, and repetition (Goodglass, 1992). The preponderance of
search, and attention. Psychological Review, 84, 1–66. phonemic errors has led some authors to characterize
Shi¤rin, R. M., and Schneider, W. (1977). Controlled and
conduction aphasia as a selective impairment in phono-
automatic human information processing: II. Perceptual
learning, automatic attending and general theory. Psycho- logical encoding for production (Wilshire and Mc-
logical Review, 84, 127–190. Carthy, 1996). Although the classical model holds that
Strum, W., and Willmes, K. (1991). E‰cacy of a reaction conduction aphasia is a disconnection syndrome involv-
training on various attentional and cognitive functions ing damage to the arcuate fasciculus (Geschwind, 1965),
in stroke patients. Neuropsychological Rehabilitation, 1, recent evidence has shown that the syndrome can be
259–280. caused by damage to, or electrical stimulation of,
auditory-related cortical fields in the left superior tem-
poral gyrus (Damasio and Damasio, 1980; Anderson et
al., 1999). This region has been strongly implicated in
Auditory-Motor Interaction in Speech speech perception, based on neuroimaging data (Zatorre
and Language et al., 1996; Norris and Wise, 2000), suggesting some
degree of overlap in the systems supporting sensory and
motor aspects of speech. This argument raises an appar-
Carl Wernicke argued that cortical areas involved in the ent paradox, namely, that damage to systems strongly
sensory representation of speech played an important implicated in speech perception (i.e., left superior tem-
role in speech production (Wernicke, 1874/1969). His poral gyrus) leads to a syndrome, conduction aphasia,
argument was based on the clinical observation that the characterized predominantly by a production deficit.
speech output of aphasic patients with posterior lesions This paradox can be resolved, however, on the assump-
in the left hemisphere was fluent but error prone. Mod- tion that speech perception is largely bilaterally organ-
ern evidence from both lesion and neuroimaging studies ized, and that residual abilities of right hemisphere
strongly supports Wernicke’s claim, and recent work has auditory systems function su‰ciently well to support
made progress in identifying an auditory-motor interface auditory comprehension (Hickok, 2000; Hickok and
circuit for speech. Poeppel, 2000).
Developmental considerations make a strong case Recent neuroimaging studies have supported and
for the existence of an auditory-motor integration net- extended findings from the clinical literature. The left
work for speech (Doupe and Kuhl, 1999). In learning superior temporal gyrus has been shown to activate
to articulate the speech sounds in the local linguistic during a variety of speech production tasks (where
environment, there must be a mechanism by which (1) speech is produced covertly, so that there is no external
sensory representations of speech uttered by others can auditory input) including picture naming (Levelt et al.,
be stored, (2) articulatory attempts can be compared 1998; Hickok et al., 2000), repetition (Buchsbaum,
against these stored representations, and (3) the degree Hickok, and Humphries, 2001), and word generation
of mismatch revealed by this comparison can be used to (Wise et al., 1991). Importantly, evidence from an MEG
shape future articulatory attempts. This auditory-motor study of picture naming (Levelt et al., 1998) has shown
integration network is still functional in adults, as that this left superior temporal activation occurs during
revealed by the fact that it is possible to repeat pseudo- a time frame prior to articulatory processes, suggesting
that this region is involved in phonological code retrieval Buchsbaum, B., Hickok, G., and Humphries, C. (2001). Role
in preparation for speaking and is not merely a form of left posterior superior temporal gyrus in phonological
of motor-to-sensory feedback mechanism, although the processing for speech perception and production. Cognitive
latter mechanism may also exist. Science, 25, 663–678.
Damasio, A. R. (1992). Aphasia. New England Journal of
Two studies have looked explicitly for overlap in
Medicine, 326, 531–539.
activation associated with speech perception and speech Damasio, H., and Damasio, A. R. (1980). The anatomical
production. The first used positron emission tomography basis of conduction aphasia. Brain, 103, 337–350.
to map areas of overlap when participants listened to Doupe, A. J., and Kuhl, P. K. (1999). Birdsong and human
stories versus performed a verb generation task (Papa- speech: Common themes and mechanisms. Annual Review
thanassiou et al., 2000). A region of overlap was found of Neuroscience, 22, 567–631.
in the superior temporal gyrus, predominantly on the Geschwind, N. (1965). Disconnexion syndromes in animals
left, as expected, based on results reviewed earlier. Ad- and man. Brain, 88, 237–294, 585–644.
ditional areas of overlap included inferior temporal Goodglass, H. (1992). Diagnosis of conduction aphasia. In
regions and portions of the left inferior frontal gyrus. S. E. Kohn (Ed.), Conduction aphasia (pp. 39–49). Hills-
dale, NJ: Erlbaum.
The second study (Buchsbaum et al., 2001) used func-
Hickok, G. (2000). Speech perception, conduction aphasia, and
tional magnetic resonance imaging to map activated the functional neuroanatomy of language. In Y. Grodzin-
regions when subjects first listened to and then covertly sky, L. Shapiro, and D. Swinney (Eds.), Language and the
rehearsed a set of three multisyllabic pseudo-words. Two brain (pp. 87–104). San Diego, CA: Academic Press.
left posterior sites responded both to the auditory and Hickok, G. (2001). Functional anatomy of speech perception
motor phases of the trial: a site in the sylvian fissure at and speech production: Psycholinguistic implications. Jour-
the parietal-temporal boundary (area Spt) and a more nal of Psycholinguistic Research, 30, 225–234.
ventral site in the superior temporal sulcus. Brodmann’s Hickok, G., Erhard, P., Kassubek, J., Helms-Tillery, A. K.,
area 44 (posterior Broca’s area) and a more dorsal pre- Naeve-Velguth, S., Strupp, J. P., et al. (2000). A func-
motor site also responded to both the auditory and tional magnetic resonance imaging study of the role of
left posterior superior temporal gyrus in speech produc-
motor phases of the trial. The activation time course of
tion: Implications for the explanation of conduction apha-
area Spt and of area 44 in that study were particularly sia. Neuroscience Letters, 287, 156–160.
strongly correlated, suggesting a tight functional relation. Hickok, G., and Poeppel, D. (2000). Towards a functional
A viable hypothesis is that the STS site supports audi- neuroanatomy of speech perception. Trends in Cognitive
tory representations of speech and that the Spt site serves Sciences, 4, 131–138.
as an interface system translating between auditory and Levelt, W. J. M., Praamstra, P., Meyer, A. S., Helenius, P.,
motor representations of speech. This hypothesis is con- and Salmelin, R. (1998). An MEG study of picture naming.
sistent with recent work in vision demonstrating the ex- Journal of Cognitive Neuroscience, 10, 553–567.
istence of visuomotor systems in the dorsal parietal lobe Norris, D., and Wise, R. (2000). The study of prelexical and
that compute coordinate transformations, such as trans- lexical processes in comprehension: Psycholinguistics and
functional neuroimaging. In M. S. Gazzaniga (Ed.), The
formations of retinocentric to head- and body-centered
new cognitive neurosciences (pp. 867–880). Cambridge, MA:
coordinates, which allows visual information to interface MIT Press.
with various motor-e¤ector systems that act on that Papathanassiou, D., Etard, O., Mellet, E., Zago, L., Mazoyer,
visual input (Andersen, 1997; Rizzolatti, Fogassi, and B., and Tzourio-Mazoyer, N. (2000). A common language
Gallese, 1997). network for comprehension and production: A contribution
Sensorimotor interaction is pervasive across many to the definition of language epicenters with PET. Neuro-
hierarchical levels in the central nervous system. The image, 11, 347–357.
empirical record supports conceptual arguments for sen- Rizzolatti, G., Fogassi, L., and Gallese, V. (1997). Parietal
sorimotor interaction in speech and language and has cortex: From sight to action. Current Opinion in Neuro-
begun to elucidate sensorimotor cortical circuits for biology, 7, 562–567.
Waldstein, R. S. (1989). E¤ects of postlingual deafness on
speech. This work helps bridge the gap between func-
speech production: Implications for the role of auditory
tional anatomical models of speech and language and feedback. Journal of the Acoustical Society of America, 88,
models of the functional organization of cortex more 2099–2144.
generally (Hickok and Poeppel, 2000). Wernicke, C. (1874/1969). The symptom complex of aphasia:
A psychological study on an anatomical basis. In R. S.
—Gregory Hickok Cohen and M. W. Wartofsky (Eds.), Boston studies in the
philosophy of science (pp. 34–97). Dordrecht: Reidel.
References Wilshire, C. E., and McCarthy, R. A. (1996). Experimental
investigations of an impairment in phonological encoding.
Anderson, J. M., Gilmore, R., Roper, S., Crosson, B., Bauer, Cognitive Neuropsychology, 13, 1059–1098.
R. M., Nadeau, S., et al. (1999). Conduction aphasia and Wise, R., Chollet, F., Hadar, U., Friston, K., Ho¤ner, E., and
the arcuate fasciculus: A reexamination of the Wernicke- Frackowiak, R. (1991). Distribution of cortical neural net-
Geschwind model. Brain and Language, 70, 1–12. works involved in word comprehension and word retrieval.
Andersen, R. (1997). Multimodal integration for the represen- Brain, 114(Pt. 4), 1803–1817.
tation of space in the posterior parietal cortex. Philosophical Zatorre, R. J., Meyer, E., Gjedde, A., and Evans, A. C. (1996).
Transactions of the Royal Society of London. Series B. Bio- PET studies of phonetic processing of speech: Review, rep-
logical Sciences, 352, 1421–1428. lication, and reanalysis. Cerebral Cortex, 6, 21–30.
Augmentative and Alternative Communication: General Issues 277
Further Readings scanning, row-column scanning, or encoding. Displays

may be either fixed (i.e., the symbol remains the same
Indefrey, P., and Levelt, W. J. M. (2000). The neural correlates before and after activation) or dynamic (i.e., the symbol
of language production. In M. S. Gazzaniga (Ed.), The new
cognitive neurosciences (pp. 845–865). Cambridge, MA:
visually changes with its selection). Finally, strategies,
MIT Press. the fourth component, are the specific intervention
Liberman, A. M., and Mattingly, I. G. (1985). The motor approaches in which AAC symbols, aids, and techniques
theory of speech perception revised. Cognition, 21, 1–36. are used to facilitate or develop language and com-
MacKay, D. G. (1987). The organization of perception and munication skills via AAC (see ASHA, 1991; ASHA, in
action: A theory for language and other cognitive skills. New preparation, for complete definitions).
York: Springer-Verlag. The role an AAC system plays in a particular child’s
Milner, A. D., and Goodale, M. A. (1995). The visual brain in life will vary depending on the type and severity of
action. Oxford: Oxford University Press. the child’s language disorder. Children who use AAC
Poeppel, D. (2001). Pure word deafness and the bilateral pro- include those individuals who present with congenital
cessing of the speech code. Cognitive Science, 21, 679–693.
disorders as well as those individuals with an acquired
language disorder. Children with congenital language
disorders include children with cerebral palsy, dual sen-
Augmentative and Alternative sory impairments, developmental apraxia of speech,
Communication: General Issues language learning disabilities, mental retardation, au-
tism, and pervasive developmental disorders. Acquired
language disorders may include traumatic brain injury
It is estimated that from 8 to 12 out of every 1000 indi- (TBI) and a range of other etiologies (e.g., sickle cell
viduals have a communication disorder severe enough anemia) that a¤ect language skills.
to require the use of augmentative and alternative com- Children with language disorders who can employ
munication (AAC) intervention (Beukelman and Ansel, AAC systems may range in age from toddlers to adoles-
1995). A large percentage of these individuals are chil- cents (Romski, Sevcik, and Forrest, 2001). The role
dren with spoken language disorders and a range of AAC plays in language intervention depends on the
etiologies. Manual signs, communication boards, and child’s individual communication needs. It is not
computers with voice output have been developed to restricted to use with children who do not speak at all
provide a means by which children with severe spoken and may benefit children with limited or unintelligible
language disorders can acquire language and communi- speech as well as those young children who may be at
cation skills. AAC is a language intervention approach. significant risk for failure to develop spoken commu-
The American Speech-Language-Hearing Associa- nication. There are no exclusionary criteria or pre-
tion defines AAC as an area of research, clinical, and requisites for learning to use an AAC system (National
educational practice that attempts to compensate, either Joint Committee, in preparation). Every child can com-
permanently or temporarily, for the impairment and municate! Communication is defined in the broadest
disability patterns of individuals with severe expres- sense as ‘‘any act by which one person gives to or
sive and receptive communication disorders that a¤ect receives from another person information about that
spoken, gestural, and/or written modes of communica- person’s needs, desires, perceptions, knowledge, or
tion. AAC is comprised of a system of four integrated a¤ective states’’ (National Joint Committee, 1992). The
components: symbols, aids, techniques, and strategies. modes by which children can communicate range along
The first component, symbols, are visual, auditory, and/ a representational continuum from symbolic (e.g., spo-
or tactile, used to represent vocabulary and described as ken words, manual signs, arbitrary visual-graphic
either aided or unaided. An aided AAC symbol employs symbols, printed words) to iconic (e.g., actual objects,
the use of an external medium (e.g., photographs, pic- photographs, line drawings, pictographic visual-graphic
tures, line drawings, objects, Braille, or written words), symbols) to nonsymbolic (e.g., signals such as crying
while an unaided AAC symbol utilizes the AAC user’s or physical movement) (Sevcik, Romski, and Wilkin-
body (e.g., sign language, eye pointing, vocalizations). son, 1991). AAC interventions incorporate a child’s
Aids are the second component; an aid is an object used full communication abilities, including vocalizations,
to transmit or receive messages and includes, for exam- gestures, manual signs, communication boards, and
ple, communication boards, speech-generating devices, speech-generating devices. Even if a child uses some
or computers. A technique, the third component, is the vocalizations and gestures, AAC systems can augment
approach or method used for generating or selecting communication with familiar and unfamiliar partners
messages as well as the types of displays used to view across multiple environments. Some children with severe
messages. Messages may be generated or selected via spoken language disorders who have no conventional
direct selection or scanning. Direct selection permits a way to communicate may express their communicative
child to communicate messages from a large set of wants and needs in socially unacceptable ways, such as
options using, for example, manual signing or pointing through aggressive or destructive, self-stimulatory, and/
with a finger or headstick to a symbol. Scanning is used or perseverative means. AAC systems can replace these
when message choices are presented to the child in a se- unacceptable means with conventional communication
quence and the child makes his or her selection by linear (Mirenda, 1997). AAC is truly multimodal, permitting a
child to use every mode possible to communicate mes- systems can be embedded e¤ectively within ongoing
sages and ideas. events of everyday life (Beukelman and Mirenda, 1998).
While AAC is an intervention approach, a team as- Using AAC systems in inclusive settings requires that
sessment is needed to describe, within a functional con- the team work together to ensure that the child has ac-
text, the child’s language and communicative strengths cess to his or her AAC device throughout the day and
and weaknesses and to determine what type of AAC that all adults and children who may interact with the
system will permit the child to develop language and child serve to support the child’s communications as
communication skills in order to participate in daily needed.
activities. AAC assessment is an ongoing process and One frequently asked question is whether the use
includes a characterization of the child’s current com- of AAC systems hinders speech development. Develop-
munication development (i.e., speech comprehension ing natural speech and literacy abilities are extremely
skills, communication modes), an inventory of the important goals of AAC intervention. The empirical
child’s environments including partners and oppor- evidence suggests that AAC system use may result in
tunities for communication, and a description of the increases in vocalizations and in some cases the devel-
child’s physical abilities to access communication, opment of intelligible speech (Beukelman and Mirenda,
including vision, hearing, and fine and gross motor 1998). There is no evidence to suggest that AAC hinders
skills. Fine and gross motor access includes physical or halts speech development. The use of AAC systems
access to an AAC system and in some cases seating may also facilitate the development of early literacy
and positioning options for optimal communication. A skills and later reading.
collaborative team approach to AAC service delivery In summary, for children with severe spoken com-
incorporates families and a range of professional dis- munication disabilities, the AAC assessment is an ongo-
ciplines including, though not limited to, speech- ing process that includes information about the child’s
language pathologists, general and special educators, communication development, the child’s environments,
and physical and occupational therapists. AAC abilities and the child’s physical abilities. Children with severe
may change over time, although sometimes very slowly, language disorders who use AAC systems can demon-
and thus the AAC system selected for the present may strate communication achievements far beyond tradi-
need to be modified as a child grows and develops. tional expectations. Recommended assessment and
Not surprisingly, standardized psychological and intervention practices are continuing to develop. The use
speech and language assessment batteries are often of appropriate AAC systems enables the child to com-
di‰cult to employ with children with severe spoken municate e¤ectively at home, school, play, and work.
language disorders because of the severity of their oral In addition to the development of communication skills,
communication impairments. These assessments may AAC increases social interactions with family and
not reveal an accurate picture of a child’s abilities since friends and participation in life activities.
many of these assessments are language-based and may See also augmentative and alternative communi-
be biased against a child who cannot speak. Often, the cation approaches in children.
children are unable to obtain basal scores on such tests
or their scores are so far below those of their chrono- —Mary Ann Romski, Rose A. Sevcik, and Melissa
logical age peers that converting a raw score into a Cheslock
standard score is not possible. Systematic behavioral References
observation within everyday environments and informal
measures that inventory and describe communication American Speech-Language-Hearing Association. (1991). Re-
demands in these settings are employed to measure the port: Augmentative and alternative communication. ASHA,
communication skills of children who will employ AAC 33(Suppl. 5), 9–12.
systems rather than standardized tests within isolated American Speech-Language-Hearing Association. (in prepara-
settings. tion). Service delivery by speech-language pathologists to
individuals using augmentative and alternative communi-
For most children who use AAC systems, language cation: Knowledge and skillls.
and communication development is the most important Beukelman, D. R., and Ansel, B. (1995). Research priorities
goal. Like all language and communication interven- in augmentative and alternative communication. Augmen-
tions, the long-term goal is to facilitate meaningful tative and Alternative Communication, 11, 131–134.
communication interactions during daily activities and Beukelman, D., and Mirenda, P. (1998). Augmentative and al-
routines. Goals should not only focus on the technolog- ternative communication: Management of severe communi-
ical means of access the child uses, but on the develop- cation impairments (2nd ed.). Baltimore: Paul H. Brookes.
ment of language and e¤ective communication skills. Mirenda, P. (1997). Supporting individuals with challenging
Depending on the child’s current language and commu- behavior through functional communication training and
nication skills, goals may range from developing a basic AAC: Research review. Augmentative and Alternative
Communication, 13, 207–225.
vocabulary of single symbols or signs to express basic National Joint Committee for the Communicative Needs
wants and needs to using sentences of symbols and signs of Persons with Severe Disabilities. (1992). Guidelines for
to convey complex communicative messages (Reichle, meeting the communication needs of persons with severe
York, and Sigafoss, 1991; Romski and Sevcik, 1996). It disabilities. ASHA, 34(Suppl. 7), 1–8.
is essential that AAC system use take place in inclusive National Joint Committee for the Communicative Needs of
environments. The literature strongly suggests that AAC Persons with Severe Disabilities. (in preparation). Position
statement and report on the eligibility of persons with severe treatment of bilingual impairment is to occur. Research
disabilities to benefit from communication services and on impaired bilingual acquisition and its treatment are
supports. then discussed.
Reichle, J., York, J., and Sigafoos, J. (1991). Implementing
augmentative and alternative communication: Strategies for
learners with severe disabilities. Baltimore: Paul H. Brookes. What Do We Know About Normal Bilingual
Romski, M. A., and Sevcik, R. A. (1996). Breaking the speech Development?
barrier: Language development through augmented means.
Baltimore: Paul H. Brookes. Contrary to the widespread view that simultaneous ac-
Romski, M. A., Sevcik, R. A., and Forrest, S. (2001). Assistive quisition of two languages is beyond a child’s normal
technology and augmentative communication in inclusive capacity, research on prenatal and newborn infants
early childhood programs. In M. J. Guralnick (Ed.), Early indicates that there are no neurocognitive limitations on
childhood inclusion: Focus on change (pp. 465–479). Balti- infants’ innate capacity to acquire two languages simul-
more: Paul H. Brookes. taneously (Genesee, 2001). Indeed, a growing body of
Sevcik, R. A., Romski, M. A., and Wilkinson, K. (1991). Roles research on children acquiring two languages simulta-
of graphic symbols in the language acquisition process for
neously indicates that key milestones in phonological,
persons with severe cognitive disabilities. Augmentative and
Alternative Communication, 7, 161–170. lexical, syntactic, and pragmatic development occur
within the same age range for bilingual children as
for monolingual children (Paradis, 2000; Deuchar and
Further Readings Quay, 2000; Comeau and Genesee, 2001; Genesee,
American Speech-Language-Hearing Association. http://www 2002a). Of course, there is considerable individual vari-
.asha.org/. ation in the rate and pattern of normal language devel-
Communication Aids Manufacturer’s Association. http://www opment among bilingual children as among monolingual
.aacproducts.org/ children, and this should be taken into account when
Glennen, S., and DeCoste, D. (1997). The handbook of aug- identifying possible cases of bilingual impairment. Delay
mentative and alternative communication. San Diego, CA: in the emergence of key milestones and variations in
Singular Publishing Group. pattern of development are not necessarily symptomatic
International Society for Augmentative and Alternative Com- of underlying impairment, although they might warrant
munication. http://www.isaac.org/
careful monitoring.
Lloyd, L., Fuller, D., and Arvidson, H. (1997). Augmentative
and alternative communication: A handbook of principles and
practices. Needham Heights, MA: Allyn and Bacon. Phonology. Preverbal bilingual children progress from
McEwen, I., and Lloyd, L. (1990). Positioning children with a stage in which there appears to be no system in ei-
cerebral palsy to use augmentative and alternative com- ther language to distinct phonological patterns in each
munication. Language, Speech, and Hearing Services in (Deuchar and Quay, 2000; Paradis, 2000). En route
Schools, 21, 15–21. to acquiring the target system, young bilingual children
Reichle, R., Beukelman, D., and Light, J. (2002). Implementing may demonstrate phonological patterns that deviate
an augmentative communication system: Exemplary strat- from those exhibited by monolingual children. The
egies for beginning communicators. Baltimore: Paul H. deviations that occur are not necessarily symptomatic of
Brookes.
impairment but may simply reflect the bilingual child’s
transitional mastery of the complex dual phonological
input that the child is exposed to. In the long run, most
children exposed to two languages simultaneously and
consistently exhibit no phonological di‰culties as they
Bilingualism and Language Impairment mature, as demonstrated by young children’s remark-
able ability to acquire native-like accents, in compari-
son to the notorious phonological disadvantage of older
This article is concerned with children who grow up second-language learners.
learning two languages simultaneously. These are chil-
dren who are exposed to two languages on a regular and Vocabulary. Bilingual children generally utter their
consistent basis beginning within the first year of birth. first words around the same time as monolingual chil-
(Children who begin learning a second language after 1 dren, and the lexical repertoire of bilingual children is
year of age are not considered because their pattern of generally of the same magnitude and scope as that of
development may be quite di¤erent from that of simul- same-age monolingual children when both languages are
taneous bilinguals.) Understanding the nature of and combined (Genesee and Nicoladis, 1995). When their
developing appropriate treatment for impairment in si- vocabulary in each language is considered separately,
multaneous bilingual acquisition (i.e., bilingual impair- it may be smaller and more restricted in scope than that
ment) is of the utmost importance because of the large of same-age monolingual children. Such di¤erences are
number of such children worldwide. This article dis- most likely attributable to the distinct environments in
cusses key features of normal bilingual acquisition and which they acquire each language and usually disappear
factors that can influence bilingual acquisition that do with age as the child’s experiences in each language
not implicate impairment. It is imperative to understand expand. It is not uncommon for domain-specific dif-
normal bilingual development if valid diagnosis and ferences in lexical proficiency to persist into adulthood,
however, if the bilingual individual’s contexts for ac- may not stick to the language of their conversational
quiring and using both languages are distinct. partner if their vocabulary, syntactic, or pragmatic skills
Many bilingual children exhibit ‘‘dominance’’ in one in that language are not well-developed. In such situa-
language; this can be reflected in syntactic and pragmatic tions, the child may call up the resources of the other
as well as lexical domains. Dominance can express itself language and code mix. Code mixing is the use of
as di¤erential proficiency, preference, or accuracy of use sounds, words, syntax, or pragmatic patterns from both
in one language in comparison to the other. In the case languages in the same utterance or stretch of conversa-
of vocabulary, for example, this can result in the child tion. Some bilingual children may even prefer to code
overusing words from the dominant language even in mix because they are accustomed to using and hearing
contexts where the nondominant language is appropri- others use two languages in the same conversation. In-
ate. Dominance is a normal feature of bilingual acquisi- deed, some bilingual children may never have encoun-
tion and can continue into adulthood, often as a result tered a monolingual person and thus are not used to
of greater exposure to one language. Such imbalances communicating with adults whose skills are limited to
do not usually imply impairment. Dominance should one language. Indeed, code mixing is a normal part of
be considered carefully when understanding bilingual interpersonal communication among fully proficient bi-
children’s language development since it can explain lingual adults, and thus young bilingual people are often
their reliance on or more advanced proficiency in one exposed to proficient adult bilinguals who mix.
language in comparison to the other. Contrary to earlier views, it is now clear that bilin-
gual code mixing is not a sign of linguistic confusion
Syntax. Contrary to earlier views (Volterra and or incompetence (Genesee, Nicoladis, and Paradis, 1995;
Taeschner, 1978, for example), it is now clear that bilin- Meisel, 1989, 2001). To the contrary, child bilingual
gual children develop separate grammatical systems for code mixing, like adult code mixing, is not random but
each language (Genesee, 2000; Meisel, 2001). This is is constrained according to the grammars of the two
evident as soon as they begin producing language that participating languages (Allen et al., 1999; Paradis,
is clearly organized according to grammatical princi- Nicoladis, and Genesee, 2000). In other words, children
ples (in English, from the two-word stage onward). For do not usually violate the grammatical rules of either
the most part, bilingual children demonstrate the same language when they code mix. Bilingual code mixing in
stages and patterns of syntactic development in each children is also situationally constrained, and bilingual
language as children who acquire the same languages children can adjust their rates of code mixing according
monolingually (e.g., Deuchar and Quay, 2000; Juan- to the rates of mixing of their interlocutors (Comeau,
Garau and Pérez-Vidal, 2000). Some bilingual children Genesee, and Lapagarette, in press). In short, code mix-
may show transfer (or so-called interference) e¤ects such ing is a communicative resource that bilingual children
that a grammatical pattern (rule) from one language use to extend their communicative competence.
appears inappropriately when the child uses the other Bilingual children’s language usage, including their
language (Döpke, 2000; Yip and Matthews, 2000). Such code mixing, is shaped by the sociocultural context
transfer e¤ects are usually limited in scope and generally in which they acquire their languages, leading in some
reflect grammatical overlap in the two languages. When cases to patterns that could be misinterpreted. For ex-
transfer occurs, it is often, although not always, asso- ample, a bilingual child may speak a language variety
ciated with much greater proficiency in or exposure to with phonological, lexical, or grammatical features that
one of the languages. Transfer e¤ects are usually short- would be considered deviant from the point of view of
term, provided the child continues to receive consistent the standard language but are normal in the child’s
and rich exposure to both languages. Some bilingual variety (Crago, 1992). They may exhibit conversational
children also exhibit more advanced levels of syntactic patterns, such as silence, that could be interpreted as
development in one language than the other (Paradis lack of pragmatic competence or even language disabil-
and Genesee, 1996). This can be due to greater exposure ity from the perspective of mainstream norms but are
to that language, inherent di¤erences in the acquisition normal and appropriate in the child’s cultural commu-
of specific syntactic patterns in the two languages, or nity (see Crago, 1992, for an example among the Inuit).
simply a preference for one language. These patterns are It is important to consider sociocultural factors as pos-
normal and are not due to impairment. sible explanations of patterns of bilingual usage that
might otherwise be attributed to impairment.
Pragmatics. There is an extensive body of research on
the development of pragmatic (or conversational) skills What Do We Know About Impairment in
in bilingual children. Bilingual children are able to use Bilingual Acquisition?
their two languages di¤erentially and appropriately with
others, even strangers with whom they have had no It is often thought that children exposed to two lan-
or limited contact; this is evident even in children in the guages early in development will experience a higher in-
one and early two-word stage (Genesee, Nicoladis, and cidence of impairment than monolingual children and
Paradis, 1995; Lanza, 1997; Comeau and Genesee, 2001; that their impairment is likely to be unique and more
Genesee, 2002b). Bilingual children’s pragmatic abilities severe than that of monolingual children. These expec-
can be limited by their proficiency. In particular, they tations are based on the assumption, noted earlier, that
acquisition of two languages simultaneously (or consec- should not be used normatively, nor should they be
utively) during the preschool years exceeds the child’s the sole basis for the diagnosis of impairment in bilin-
innate endowment to learn language. In e¤ect, it is gual children, since the latter may demonstrate normal
thought that bilingualism causes the impairment. How- patterns of performance that could be construed as
ever, such an assumption is misguided. While we cur- impaired when compared with monolingual norms
rently lack adequate normative studies of the incidence (Dodd, So, and Wei, 1996). Standardized tests that
of impairment in bilingual children, the extant evidence are normed exclusively on monolingual children are not
provides no reason to believe that exposure to two likely to make allowances for the sociocultural and
languages causes more delayed or impaired develop- exposure di¤erences that bilingual children experience
ment than one would find in a monolingual population. learning two languages, and as a result, they are likely
Moreover, given the overwhelming evidence that bilin- to misrepresent performance di¤erences that bilingual
gual children demonstrate the same milestones and pat- children present as underlying impairment. Decision cri-
terns of development as monolingual children, there teria that recognize alternative paths to normal language
is no reason to expect unique patterns of impairment development must be used in the diagnosis of impair-
among bilingual children. Indeed, Paradis et al. (2003) ment in children acquiring two (or more) languages
found that the pattern and severity of impairment in a simultaneously.
group of French-English children with a clinical diagno- There is no evidence that language impairment is due
sis of impairment did not di¤er from that of age- and to or exacerbated by the simultaneous acquisition of two
language-matched monolingual and bilingual controls languages. It is likely that impairment is due to a fun-
also with impairment. In contrast, Crutchley and her damental problem in the child’s innate capacity to ac-
colleagues found that bilingual children referred to spe- quire language that manifests itself in whatever language
cial language units for children with language impair- or languages the child is learning. Thus, children in the
ment in Britain exhibited more severe and unique process of acquiring two languages who are suspected
patterns of di‰culty on a variety of standardized mea- of impairment should not be limited to one language
sures in comparison to their monolingual counterparts on the assumption that this will benefit their language
(Crutchley, Conti-Ramsden, and Botting, 1997). How- development. Nor should treatment be restricted to one
ever, these findings must be interpreted with caution, language only. To the contrary, children who have been
since, as noted by the authors, there may have been a exposed to two languages from birth may experience
bias toward inclusion of more severely impaired bilin- significant personal trauma and sociocultural disadvan-
gual children in this sample. As well, the use of norm- tages if they are deprived of the benefits of knowing two
referenced tests standardized on monolingual children languages. Moreover, restricting children suspected of
may bias interpretation toward impairment, since nor- impairment to one language may entail significant long-
mal bilingual-specific patterns of development were not term economic, professional, personal, and social dis-
taken into account. In support of the acquisition results, advantages in the case of individuals living in bilingual
treatment studies with impaired bilingual children (both communities. While we still have much to learn about
simultaneous and consecutive) have demonstrated that typical and impaired bilingual acquisition, there is no
outcomes following bilingual treatment are just as posi- evidence at present that would recommend or justify a
tive or even more positive than those following mono- decision that bilingual children with impairment learn
lingual treatment (Gutierrez-Clellen, 1999; Perozzi and only one language.
Sanchez, 1992; Thordardottir, Weismer, and Smith, See also bilingualism, speech issues in.
1997). In sum, and contrary to the bilingualism-as-risk
notion, there is no evidence that bilingual impairment is Acknowledgments
more severe than or di¤erent in kind from monolingual
impairment, and there is no evidence to support mono- I would like to thank Martha Crago and Elin Thordar-
dottir for helpful suggestions on earlier versions of this
lingual treatment over bilingual treatment.
chapter. I would also like to thank the Social Sciences
On the basis of this evidence, it is recommended that
impairment in children acquiring two languages simul- and Humanities Research Council, Ottawa, Canada, for
support of my research cited in this chapter.
taneously be assessed in the same manner as in mono-
lingual children, taking into account what we know —Fred Genesee
about normal bilingual acquisition and the factors that
can influence it: exposure, dominance, and sociocultural References
context. In addition to current best practices in assess-
ment of monolingual children, the following principles Allen, S., Genesee, F., Fish, S., and Crago, M. (1999, Novem-
should be observed when assessing bilingual children ber). Grammatical constraints on early bilingual code-
mixing: Evidence from children learning Inuktitut and
in order to ensure a valid diagnosis of impairment. (1) English. Paper presented at the Boston University Confer-
Evidence for impairment should be attested in both lan- ence on Language Development.
guages. (2) The pattern of impairment in each language Comeau, L., and Genesee, F. (2001). Bilingual children’s repair
should resemble that of monolingual children with im- strategies during dyadic communication. In J. Cenoz and
pairment acquiring the same languages. (3) Standardized F. Genesee (Eds.), Trends in bilingual acquisition (pp.
language tests that are normed on monolingual children 231–256). Amsterdam: John Benjamins.
Comeau, L., Genesee, F., Lapagarette, L. (in press). The Paradis, J., Crago, M., Genesee, F., and Rice, M. (2003). Bi-
modeling hypothesis and child bilingual code-mixing. Inter- lingual children with specific language impairment: How
national Journal of Bilingualism. do they compare with their monolingual peers? Journal of
Crago, M. (1992). Communicative interaction and second lan- Speech, Language, and Hearing Research, 46, 113–127.
guage acquisition: The Inuit example. TESOL Quarterly, Paradis, J., and Genesee, F. (1996). Syntactic acquisition in
26, 487–505. bilingual children: Autonomous or interdependent? Studies
Crutchley, A., Conti-Ramsden, G., and Botting, N. (1997). in Second Language Acquisition, 18, 1–25.
Bilingual children with specific language impairment and Paradis, J., Nicoladis, E., and Genesee, F. (2000). Early emer-
standardized assessments: Preliminary findings from a study gence of structural constraints on code-mixing: Evidence
of children in language units. International Journal of Bilin- from French-English bilingual children. Bilingualism: Lan-
gualism, 1, 117–134. guage and Cognition, 3, 245–261.
Deuchar, M., and Quay, S. (2000). Bilingual acquisition: Theo- Perozzi, J. A., and Sanchez, M. L. C. (1992). The e¤ect of in-
retical implications of a case study. Oxford, U.K.: Oxford struction in L1 on receptive acquisition of L2 for bilingual
University Press. children with language delay. Language, Speech, and Hear-
Dodd, B. J., So, L. K. H., and Wei, L. (1996). Symptoms ing Services in Schools, 23, 348–352.
of disorder without impairment: The written and spoken Thordardottir, E. T., Weismer, S. E., and Smith, M. E. (1997).
errors of bilinguals. In B. Dodd, R. Campbell, and L. Vocabulary learning in bilingual and monolingual clinical
Worrall (Eds.), Evaluating theories of language: Evidence intervention. Child Language Teaching and Therapy, 13,
from disordered communication (pp. 119–136). San Diego, 215–227.
CA: Singular Publishing Group. Volterra, V., and Taeschner, T. (1978). The acquisition and
Döpke, S. (2000). Generation of and retraction from cross- development of language by bilingual children. Journal of
linguistically motivated structures in bilingual first lan- Child Language, 5, 311–326.
guage acquisition. Bilingualism: Language and Cognition, 3, Yip, V., and Matthews, S. (2000). Syntactic transfer in a
209–226. Cantonese-English bilingual child. Bilingualism: Language
Genesee, F. (2001). Bilingual first language acquisition: Ex- and Cognition, 3, 193–208.
ploring the limits of the language faculty. In M. McGroarty
(Ed.), 21st annual review of applied linguistics (pp. 153– Further Readings
168). Cambridge, U.K.: Cambridge University Press.
Genesee, F. (2002a). Rethinking bilingual acquisition. In J. M. Cummins, J. (1981). The role of primary language develop-
Dewaele (Ed.), Bilingualism: Challenges and directions for ment in promoting educational success for language minor-
future research (pp. 158–182). Clevedon, U.K.: Multi- ity students. In Schooling and language minority students: A
lingual Matters. theoretical framework (pp. 3–49). Los Angeles: California
Genesee, F. (2002b). Portrait of the bilingual child. In V. Cook State University, Evaluation, Dissemination and Assess-
(Ed.), Portraits of the second language user (pp. 170–196). ment Center.
Clevedon, U.K.: Multilingual Matters. Doyle, A. B., Champagne, M., and Segalowitz, N. (1978).
Genesee, F., and Nicoladis, E. (1995). Language development Some issues in the assessment of linguistic consequences
in bilingual preschool children. In E. E. Garcia and B. of early bilingualism. In M. Paradis (Ed.), Aspects of bilin-
McLaughlin (Eds.), Meeting the challenge of linguistic and gualism (pp. 13–20). Columbia, SC: Hornbeam Press.
cultural diversity in early childhood education (pp. 18–33). Genesee, F. (1989). Early bilingual development: One language
New York: Teachers College Press. or two? Journal of Child Language, 16, 161–179.
Genesee, F., Nicoladis, E., and Paradis, J. (1995). Language Goodz, N. S. (1989). Parental language mixing in bilingual
di¤erentiation in early bilingual development. Journal of families. Journal of Infant Mental Health, 10, 25–44.
Child Language, 22, 611–631. Grosjean, F. (1982). Life with two languages: An introduction to
Gutierrez-Clellen, V. F. (1999). Language choice in interven- bilingualism. Cambridge, MA: Harvard University Press.
tion with bilingual children. American Journal of Speech- Ho¤mann, C. (1991). An introduction to bilingualism. London:
Language Pathology, 8, 291–302. Longman.
Juan-Garau, M., and Pérez-Vidal, C. (2000). Subject realiza- Köppe, R., and Meisel, J. M. (1995). Code-switching in
tion in the syntactic development of a bilingual child. Bilin- bilingual first language acquisition. In L. Milroy and P.
gualism: Language and Cognition, 3, 173–192. Muysken (Eds.), One speaker, two languages: Cross-
Lanza, E. (1997). Language mixing in infant bilingualism: A disciplinary perspectives on code-switching (pp. 276–301).
sociolinguistic perspective. Oxford, U.K.: Clarendon Press. Cambridge, MA: Cambridge University Press.
Meisel, J. M. (1989). Early di¤erentiation of languages in Meisel, J. M. (1990). Two first languages: Early grammatical
bilingual children. In K. Hyltenstam and L. Obler (Eds.), development in bilingual children. Dordrecht: Foris.
Bilingualism across the lifespan: Aspects of acquisition, ma- Pearson, B. Z., Fernández, S. C., and Oller, D. K. (1993).
turity and loss (pp. 13–40). Cambridge, U.K.: Cambridge Lexical development in bilingual infants and toddlers:
University Press. Comparison to monolingual norms. Language Learning, 43,
Meisel, J. M. (2001). The simultaneous acquisition of two 93–120.
first languages: Early di¤erentiation and subsequent devel- Pearson, B. Z., Fernández, S., and Oller, D. K. (1995). Cross-
opment of grammars. In J. Cenoz and F. Genesee (Eds.), language synonyms in the lexicons of bilingual infants:
Trends in bilingual acquisition research (pp. 11–42). One language or two? Journal of Child Language, 22, 345–
Amsterdam: John Benjamins. 368.
Paradis, J. (2000). Beyond one system or two: Degrees of sep- Petitto, L. A., Katerelos, M., Levy, B. G., Gauna, K.,
aration between the languages of French-English bilingual Tetreault, K., and Ferraro, V. (2001). Bilingual signed and
children. In S. Döpke (Ed.), Cross-linguistic structures in spoken language acquisition from birth: Implications for the
simultaneous bilingualism (pp. 175–200). Amsterdam: John mechanism underlying early bilingual language acquisition.
Benjamins. Journal of Child Language, 28, 453–496.
Communication Disorders in Adults: Functional Approaches to Aphasia 283
Communication Disorders in Adults: ment and treatment of aphasia. Although functional

approaches can be applied to disorders such as traumatic
Functional Approaches to Aphasia brain injury and dementia, the bulk of the literature
concerns aphasia, and it will be featured here.
Functional communication has been a clinical theme
since Martha Taylor Sarno first used the term as a
label for her Functional Communication Profile (1968).
Functional Assessment
Since then, the concept of functional communication has A recent report from the National Committee on Vital
broadened in scope, with the result that there are now and Health Statistics (2001) foreshadows the emerging
within this field two pertinent connotations for the word emphasis on the need to lay ‘‘the groundwork for greater
functional. Both are applicable to functional approaches use of functional status information in and beyond clin-
to assessment and treatment of communication disorders ical care.’’ This report also supports the World Heath
in adults. Elman and Bernstein-Ellis (1995) suggest that Organization’s revised International Classification of
the first connotation invokes a sense of the basics: for Functioning, Disability and Health (ICF, 1999). The
example, having the language necessary for signaling ICF makes it clear that in addition to measuring im-
survival needs or rudimentary wants, for getting help, or pairment such as aphasia, assessment must also consider
for using ‘‘yes’’ and ‘‘no’’ reliably and accurately. Func- how that impairment limits an individual’s ability to
tional in the second sense connotes smooth running, go about the activities of daily living. The ICF takes one
getting through the worst of one’s communication prob- more step: It also requires the assessment of the e¤ects of
lem, or learning satisfactory compensatory skills that activity limitations on the ability to resume one’s previ-
permit individuals only occasionally to have to remind ous level of participation in society. Measuring activities
themselves that they still have residual problems in and limitations brought about by aphasia, as well as how
communicating. one’s participation is a¤ected, is precisely the domain
The term communication as used in the assessment of functional assessment. Such assessment does not sub-
and treatment of adult language disorders also has stitute for tests that inventory the nature and extent of
two connotations. For some clinicians, communication aphasic impairment. However, it dictates that such pro-
is almost synonymous with language, and their work cedures must be supplemented by other measures.
emphasizes recovery or restitution of language skills. But Functional communication assessment measures are
clinicians whose interest is on functional communica- far-ranging. They include observing aphasic persons’
tion utilize a more comprehensive definition. For them, communicative interactions, interviewing aphasic indi-
communication typically encompasses not only lan- viduals and their families about communication needs,
guage, but also other behaviors that permit individuals and analyzing their discourse and conversation. A few
to exchange information and socialize even when they formal tests, such as Communicative Activities of Daily
speak di¤erent languages. Most pertinent to adult lan- Living (CADL-2; Holland, Frattali, and Fromm, 1999)
guage disorders are gesturing, drawing, and other ways and rating scales such as the ASHA FACS (Frattali
of getting messages across, or learning how to guide et al., 1995), the Functional Communication Profile
others to provide the support and sca¤olding that facili- (FCP; Sarno, 1969), and the Communication E¤ective-
tates interpersonal interchange. ness Inventory (CETI; Lomas et al., 1989) are used to
These expanded definitions are crucial to understand- measure activities and activity limitations in ICF terms.
ing the di¤erences between functional and more tradi- The natural alignment of functional and pragmatic
tional approaches to assessment and treatment of the approaches also extends to ICF’s next level, address-
language disorders that are acquired in adulthood, typi- ing restrictions in societal participation. Being able to
cally the result of insults to the brain and occurring resume activities and to participate in society clearly
to individuals who previously had normal language and relates to quality of life. There are many quality-of-life
communication. For such individuals, understanding measures available, but few at present focus specifically
the way that language functions in communication re- on the e¤ects of communication problems. Nonetheless,
mains relatively spared, in contrast to their deficits of e¤ective functional assessment at the level of participa-
impaired lexicon, grammar, and phonology. As a result, tion should be considered through interview and obser-
functional approaches tend to stress communication vation until more formal measures are available. This is
strengths rather than linguistic deficits. because the ultimate goal of good therapy (functionally
Because they emphasize everyday language and com- oriented or not) is to improve the quality of an aphasic
munication use, functional approaches rely heavily on person’s life in this broadest sense.
the context in which such activities occur. They focus on We now turn our attention to functional approaches
authentic interpersonal exchange and interaction across to the treatment of adult communication disorders.
a variety of settings, as well as communicative activities Specific clinical techniques for functional treatment
that occur in everyday life. Functional approaches also are methodologically similar to those that are used in
include usual conversational partners and emphasize more traditional, impairment-focused treatments. That
their new role in facilitating as normal communication is, principles of learning and counseling are also used
as is possible. With this background in mind, the fol- in functional treatment approaches. However, there is a
lowing summarizes functional approaches to assess- great di¤erence in the tasks that comprise traditional
and functional treatments. In the latter, both stimuli and less formal training, accomplished both through didactic
treatment tasks are geared to everyday events and inter- means and by counseling. Partner-centered approaches
actions, or to communication strategies that can be used have been successfully used with family members, vol-
when language skills break down. unteers and even clinicians. The approaches share the
An individual’s own pattern, style, and opportunities rationale that communication can be improved not
for communication are emphasized. As a result, the only by improving communication skills in aphasic
process is less clinician-driven and more the result of individuals, but also when others in the communica-
collaboration between aphasic individuals, their families, tive environment learn how to listen more e¤ectively,
and their clinicians than are most other approaches. how to encourage multimodal communicative attempts,
Functional approaches encourage the participation of and how to ask questions of aphasic people more
aphasic individuals as well as their families in choosing appropriately.
treatment goals that are almost always cast in every- In summary, functional approaches rely on a real-
day terms. The clinician’s role in the goal-setting phase world perspective. Clinicians typically ask, How will this
treatment is to guide and counsel about how realistic the a¤ect this person’s daily life, and that of his or her fam-
goals are, or to propose modifications that might be ac- ily? Finally, functions of language, such as being able
ceptable. For example, instead of the clinician’s unilat- to invite, deny, or request, are essential to daily social
eral decision to work on general impairments in word interactions, and therefore to functional approaches.
retrieval (which might target words chosen on the basis In the last 35 years, researchers and clinicians have
of their frequency of occurrence or imageability), a col- refined their abilities to assess everyday communication
laborative approach might result in work that features and to measure the functional outcomes of treatment
retrieving the names of family, friends, and pets. Thus, e¤orts. Treatment methods that focus on the functional
even if traditional wisdom suggests that treatment have also been expanded, and data now support the
should begin with easy targets and proceed from them observation that functional changes can be achieved as
hierarchically, the functionally motivated clinician might a result. Qualitative as well as quantitative research on
conclude that personal relevance is more important. The functional approaches can be expected to expand our
Functional Communication Planner (Worrall, 1999) knowledge over the next corresponding time period.
provides a systematic format for determining the focus
of such treatment. —Audrey L. Holland and Jacqueline J. Hinckley
There are many variations to the functional treatment
theme when it is directed specifically toward the aphasic References
person in individual treatment. Perhaps the best known
functional approaches involve training aphasic individu- Davis, G. A., and Wilcox, M. J. (1985). Adult aphasia rehabil-
als to use strategies that facilitate their speech or aid in itation: Applied pragmatics. San Diego, CA: College-Hill
improving their comprehension. Some examples include Press.
Elman, R., and Bernstein-Ellis, E. (1995). What is func-
supplementing or even substituting speech attempts with tional? American Journal of Speech-Language Pathology, 4,
drawing or writing, or, for auditory comprehension, 115–117.
teaching aphasic persons to request repetition of a mes- Elman, R. J. (1999). Group treatment for neurogenic communi-
sage that he or she does not understand. Often these cation disorders: The expert clinician’s approach. Woburn,
alternative strategies are practiced in activities that pro- MA: Butterworth-Heinemann.
mote the exchange of unknown information in a manner Frattali, C., Thompson, C., Holland, A., Ferketic, M., and
that approximates the normal interchange of everyday Wohl, C. (1995). Functional Assessment of Communication
communication (PACE treatment; Davis and Wilcox, Skills for Adults (ASHA FACS). Rockville, MD: American
1985). Speech-Language-Hearing Association.
Other approaches involve developing and practicing Holland, A. L. (1991). Pragmatic aspects of intervention in
aphasia. Journal of Neurolinguistics, 6, 197–211.
scenarios and scripts that can be used to recount impor- Holland, A., Frattali, C., and Fromm, D. (1991). Commmuni-
tant aspects of the person’s life, such as how an aphasic cative Activities of Daily Living (CADL-2). Austin, TX:
man met his wife. A related approach might be to work Pro-Ed.
on specific situations of personal relevance. These situa- Kagan, A. (1998). Supported conversation for adults with
tion-specific scripts can be as diverse as a script that aids aphasia: Methods and resources for training conversation
an individual in getting help in an emergency to con- partners. Aphasiology, 12, 816–831.
sulting with a travel agent to plan a trip or even telling a Lomas, J., Pickard, L., Bester, S., Elbard, H., Finlayson, A.,
few jokes. Finally, group treatment focused on conver- and Zoghabib, C. (1989). The Communicative E¤ective-
sational skills is becoming increasingly more frequent, as ness Index: Development and psychometric evaluation of a
illustrated by the rich examples provided in Elman’s functional communication measure for adult aphasia. Jour-
nal of Speech and Hearing Disorders, 54, 113–124.
book (1999). National Committee on Vital Health Statistics. (2001). Report
There are also a number of ways to train others in the on the use of the ICF for recording functional status infor-
aphasic person’s environment to take a disproportionate mation. Available: www.ncvhs.hhs.gov/010716rp.htm.
share of the burden of communication. Supported con- Sarno, M. T. (1969). Functional Communication Profile: A
versation (Kagan, 1998) is one such approach. Others manual of directions (Rehabilitation Monograph No. 42).
include conversational coaching (Holland, 1991), and New York: NYU Medical Center.
Communication Disorders in Infants and Toddlers 285
World Health Organization. (1999). International Classifica- Parr, S. (1995). Everyday reading and writing in aphasia: Role
tion of Function and Disability (ICF). Available: www change and the influence of pre-morbid literacy practice.
.who.int/inf-pr-1999/en/note99-19.html. Aphasiology, 9, 223–238.
Worrall, L. (1999). Functional Communication Planner. Oxon, Penn, C. (1998). The profiling of syntax and pragmatics in
U.K.: Winslow. aphasia. Clinical Linguistics and Phonetics, 2, 179–208.
Penn, C. (1999). Pragmatic assessment and therapy for persons
with bain damage: What have clinicians gleaned in two
Further Readings decades? Brain and Language, 68, 535–552.
Pound, C., Parr, S., Lindsay, J., and Woolf, C. (2000). Beyond
Aten, J., Caliguiri, M., and Holland, A. (1982). The e‰cacy aphasia: Therapies for living with communication disability.
of functional communication therapy for chronic aphasic Oxon, U.K.: Winslow.
patients. Journal of Speech and Hearing Disorders, 47, Prutting, C. A., and Kirchner, D. (1983). Applied pragmatics.
93–96. In T. Gallagher and C. A. Prutting (Eds.), Pragmatic
Bates, E. (1976). Language in context. New York: Academic assessment and intervention issues in language. San Diego,
Press. CA: College-Hill Press.
Brumfitt, S. (1993). Losing your sense of self: What aphasia Sarno, M. T. (1993). Aphasia rehabilitation: Psychosocial and
can do. Aphasiology, 7, 569–574. ethical considerations. Aphasiology, 7, 321–324.
Burns, M., Dong, K., and Oehring, A. (1995). Family Simmons-Mackie, N., and Damico, J. (1996a). The contribu-
involvement in the treatment of aphasia. Topics in Stroke tion of discourse markers to communicative competence in
Rehabilitation, 2, 68–77. aphasia. American Journal of Speech-Language Pathology,
Doyle, P. J., Goldstein, H., Bourgeois, M., and Nakles, K. 5, 37–43.
(1989). Facilitating generalized requesting behavior in Simmons-Mackie, N., and Damico, J. (1996b). Accounting for
Broca’s aphasia: An experimental analysis of a general- handicaps in aphasia: Assessment from an authentic social
ization training procedure. Journal of Applied Behavior perspective. Disability and Rehabilitation, 18, 540–549.
Analysis, 22, 157–170. Wilcox, M., Davis, G., and Leonard, L. (1978). Aphasics’
Elman, R., and Bernstein-Ellis, E. (1999). The e‰cacy of group comprehension of contextually conveyed meaning. Brain
communication treatment in adults with chronic aphasia. and Language, 6, 362–377.
Journal of Speech, Language, and Hearing Research, 42, Worrall, L., and Frattali, C. (2000). Functional communication
411–419. in aphasia. New York: Thieme.
Hirsch, F., and Holland, A. (1999). Assessing quality of life
after aphasia. In L. Worrall and C. Frattali (Eds.), Func-
tional communication in aphasia. New York: Thieme. Communication Disorders in Infants
Holland, A. (1982). Observing functional communication of
aphasic adults. Journal of Speech and Hearing Disorders, and Toddlers
47, 50–56.
Holland, A., and Hinckley, J. (2002). Assessment and treat- Prior to 1986, many speech-language pathologists serv-
ment of pragmatic aspects on communication in apahasia. ing infants and toddlers with communication disorders
In A. Hillis (Ed.), Handbook of adult language disorders:
used an expert service delivery model. In the expert
Integrating cognitive neuropsychology, neurology and re-
habilitation. New York: Psychology Press. model, the speech-language pathologist is viewed as the
Hopper, T., and Holland, A. (1998). Situation-specific treat- professional who provides solutions by way of direct in-
ment for aphasia. Aphasiology, 12, 933–944. tervention to a child. Service delivery is often direct, and
Jordan, L., and Kaiser, W. (1996). Aphasia: A Social families have little control over the focus of and method
Approach. London: Chapman and Hall. of intervention. However, with passage of Public Law
LaPointe, L. (2002). Functional and pragmatic directions in 99-457, a shift in service delivery philosophies occurred,
aphasia treatment. In R. de Bleser and I. Papathanasiou with a new emphasis on a family-centered model
(Eds.), The sciences of aphasia, Vol. 1, From theory to ther- (Donahue-Kilburg, 1992).
apy. Oxford: Elsevier. PL 99-457, Part H, mandated comprehensive, coor-
Lyon, J. G., Cariski, D., Keisler, L., Rosenbek, J., Levine, R.,
Kumpula, J., et al. (1997). Communication partners:
dinated, community-based and family-centered services
Enhancing participation in life and communication for for infants and toddlers exhibiting disabilities in physi-
adults with aphasia in natural settings. Aphasiology, 11, cal, cognitive, communication, social or emotional, and/
693–708. or adaptive development. A range of services, including
Marshall, R. C. (1993). Problem-focused group treatment speech-language services and audiology, are available at
for clients with mild aphasia. American Journal of Speech- no cost to the parents except where federal or state law
Language Pathology, 2, 31–37. provides for a system of payment by families. Reautho-
Newho¤, M., and Apel, K. (1990). Impairments in pragmatics. rization of IDEA in 1997 led to a change in name (now
In L. L. LaPointe (Ed.), Aphasia and related neurogenic referred to as Part C). The reauthorization emphasizes
language disorders. New York: Thieme. services to children in natural environments (i.e., loca-
Oelschlaeger, M., and Damico, J. (1999). Participation of a
conversation partner in the word searches of a person with
tions where the child would be served if he or she did not
aphasia. American Journal of Speech-Language Pathology, have a disability), using family-directed service delivery
8, 62–71. to identify family needs, and a‰rms families as members
Parr, S. (1992). Everyday reading and writing practices of nor- of the evaluation team.
mal adults: Implications for aphasia assessment. Aphasiol- Interagency coordinating councils (ICCs) exist at the
ogy, 6, 273–283. federal, state, and local levels. Eligibility criteria for
early intervention services under Part C is decided by with families of infants and toddlers with communica-
each state’s lead ICC, and as a result, varies from state tion disorders. The speech-language pathologist should
to state. Eligibility is often determined by the presence understand each family’s cultural belief system as it
of a developmental delay in physical, cognitive, speech applies to views about disabilities, communication
and language, social or emotional, and adaptive (i.e., behaviors, and childrearing. Adopting a family-guided
self-help) skills; or eligibility may be based on the degree approach to early intervention requires a nonjudgmental
of risk that the child has for developing a delay. There respect for the family’s views. The first step to under-
are three types of risks: established risk, biological risk, standing and respecting family and cultural views is to
and environmental risk. In the case of established risk, understand one’s own family and culture. This places the
a child displays a diagnosed medical condition, such as speech-language pathologist in a position to understand
Down syndrome, fragile X syndrome, or Turner’s syn- di¤erences of opinion and reduce potential misunder-
drome, that is known to influence development nega- standings. The next step is to learn about the family’s
tively. Children with an established risk qualify for early belief system by observing, listening, and sharing infor-
intervention services. In contrast, a child who is biologi- mation. Skills in interviewing and counseling are crucial
cally at risk exhibits characteristics (e.g., very low birth for obtaining and clarifying information in a non-
weight, otitis media, prematurity) that may result in threatening and respectful manner.
developmental di‰culties. A child with an environmen- Unlike the expert model, which focuses only on the
tal risk is exposed to conditions that may interfere with client’s needs, the family-guided model focuses on both
normal development, such as poor nutrition, poor envi- the family and the child. Child assessment and family
ronmental stimulation, or caregivers with substance assessment may both take place. A family assessment is
abuse problems. Children with biological or environ- voluntary and often conducted through interviews and
mental risks are considered to be at risk rather than to surveys designed to collect information about the fam-
have an established risk. Some states include children ily’s resources, priorities, and concerns. Child assess-
who are at risk in their eligibility criteria for services; ments must be multidisciplinary and comprehensive,
others do not. conducted by trained personnel, and include both
One group of toddlers seen by speech-language pa- strengths and weaknesses of the child. The assessment
thologists who have been studied extensively display should also be nondiscriminatory and confidential. In-
slow expressive language development (SELD). SELD is formed consent must be obtained. Professionals have 45
characterized by an expressive vocabulary of less than days from screening to complete the evaluation process
50 words at 24 months of age and no word combina- that may determine the presence of a disorder and de-
tions, with no known hearing, cognitive, emotional/ termine eligibility.
behavioral, gross neurological, oral-motor, or environ- Assessment of infants and toddlers includes a vari-
mental deprivation (Rescorla, 1989; Paul, 1993). Paul ety of tasks and a range of informational questions
(1996) concluded that children with SELD should not be asked of family members. Areas assessed include infant
regarded as having a disorder; rather, they should be state behaviors, respiration, oral-motor skills (including
considered at risk for further language impairment. Her sucking and swallowing evaluations), nonverbal com-
longitudinal study of children identified as having SELD munication behaviors, play behaviors, caregiver-child
indicated that approximately 74% of the children identi- interaction, receptive and expressive vocabulary skills,
fied with SELD as toddlers were no longer classified as phonological skills, word combinations, syntactic devel-
having an expressive language delay by kindergarten opment, functions for language use, and cognitive skills
age. Paul recommended a ‘‘watch-and-see’’ policy for (Donahue-Kilburg, 1992; Dickson, Linder, and Hudson,
children with SELD who do not display other risk fac- 1993; Paul, 2001). Assessment in the neonatal intensive
tors. The watch-and-see policy would monitor children care unit may involve working with nurses and parents
on a regular basis for their linguistic progress. However, and revolves around making the environment as posi-
other researchers have opposed such a policy, for a vative as possible for the infant (Ziev, 1999). Assessment
riety of reasons (van Kleeck, Gillam, and Davis, 1997). includes formal, standardized tests, informal methods
Children with specific language impairment (SLI), which such as play-based assessment and temptation tasks, and
can include receptive and expressive language skills, may parental report.
also be seen for early intervention services. Risk factors Often assessment is conducted in an arena format,
for SLI include heredity, long periods of untreated otitis where one member of the team serves as the primary
media, and parental characteristics such as low socio- facilitator. Other team members observe and make notes
economic status, directive interaction style, and extreme about the child’s behavior in their specialty area. In a
parental concern (Olswang, Rodriquez, and Timler, transdisciplinary arena assessment, the primary facili-
1998). tator will adopt the roles of the other professionals.
The family plays a vital role in providing services Families can be involved in assessment by providing
to infants and toddlers with communication disorders. information, observing and collecting information, and
They are part of the team from the referral stage through interacting with their child. The level of involvement
assessment, intervention, and dismissal. It is critical to is the family’s choice. It is important to remember that
consider the cultural variables that contribute to each families are dynamic systems and that roles and func-
family’s unique system of functioning when working tions of family members can change. A family initially
Communication Disorders in Infants and Toddlers 287
reluctant to participate may want to participate in the References

next session. Frequent communication is necessary to
ensure that services remain family-guided and family- Dickson, K., Linder, T. W., and Hudson, P. (1993). Observa-
tion of communication and language development. In
friendly.
T. Linder (Ed.), Transdisciplinary play-based assessment
Families also assist the speech-language pathologist (pp. 163–215). Baltimore: Paul H. Brookes.
in determining outcomes rather than goals. These out- Donahue-Kilburg, G. (1992). Family-centered early interven-
comes can be child-oriented, family-oriented, or both. tion for communication disorders: Prevention and treatment.
The outcomes are written in family-friendly language, Gaithersburg, MD: Aspen.
often in the family’s own words. These outcomes are Girolametto, L., Pearce, P. S., and Weitzman, E. (1996).
written in the form of an Individual Family Service Plan Interactive focused stimulation for toddlers with expressive
(IFSP). Importantly, the IFSP is a process as well as language delays. Journal of Speech, Language, and Hearing
document. The process begins at referral and involves Research, 39, 1274–1283.
getting to know the family and child. It is an informal Girolametto, L., Pearce, P. S., and Weitzman, E. (1997).
E¤ects of lexical intervention on the phonology of late
transferal of information that builds the foundation
talkers. Journal of Speech, Language, and Hearing Research,
of trust and respect underlying future interactions. The 40, 338–348.
written IFSP will contain information about the person Girolametto, L., Weizman, E., Wiigs, M., and Pearce, P. S.
who serves as the family service coordinator, the child’s (1999). The relationship between maternal language mea-
status in the areas of physical, cognitive, communica- sures and language development in toddlers with expressive
tion, social-emotional, and adaptive development, a plan vocabulary delays. American Journal of Speech-Language
that indicates the frequency and length of services, who Pathology, 8, 364–374.
will pay for services, the method of providing services, Loeb, D. F., and Armstrong, N. (2001). Case studies on the
and how service will be provided in natural environ- e‰cacy of expansions and subject-verb-object models in
ments. The family service provider is selected by the early language intervention. Child Language Teaching and
Therapy, 17, 35–54.
family and is the main coordinator of services. The IFSP Olswang, L., Rodriquez, B., and Timler, G. (1998). Recom-
also includes a transition plan for children leaving Part C mending intervention for toddlers with specific language
services and entering preschool (or Part B) services. It is learning di‰culties: We may not have all the answers, but
optional on the part of the family whether or not they we know a lot. American Journal of Speech-Language Pa-
want the IFSP to contain information on the family’s thology, 7, 23–32.
resources, concerns, and priorities. IFSPs can be used for Paul, R. (1993). Patterns of development in late talkers: Pre-
children ages 0–6 years. Thus, on transitioning to Part school years. Journal of Childhood Communication Dis-
B services, families may choose an IFSP instead of an orders, 15, 7–14.
Individual Education Plan. Paul, R. (1996). Clinical implications of the natural history of
Once outcomes have been selected, the type of inter- slow expressive language development. American Journal of
vention approach is determined. It is important to Paul, R. (2001). Language disorders from infancy through ado-
understand the family’s view of speech and language lescence (2nd ed.). St. Louis: Mosby.
development and the role they believe they play in their Rescorla, L. (1989). The Language Development Survey: A
child’s development of speech and language. Family screening tool for delayed language in toddlers. Journal of
members may choose to play an integral role in the in- Speech and Hearing Disorders, 54, 587–599.
tervention; others may prefer to have the intervention Robertson, S. B., and Ellis Weismer, S. (1999). E¤ects of
conducted by the speech-language pathologist. Interven- treatment on linguistic and social skills in toddlers with
tion studies thus far have provided evidence that early delayed language development. Journal of Speech, Lan-
intervention with infants and toddlers can facilitate pre- guage, and Hearing Research, 42, 1234–1248.
linguistic behaviors, expressive vocabulary, phonology, van Kleeck, A., Gillam, R. B., and Davis, B. (1997). When is
‘‘Watch and see’’ warranted? A response to Paul’s 1996
social skills, and early word combinations (Girolametto articles, ‘‘Clinical implications of the natural history of slow
et al., 1996, 1997, 1999; Robertson and Ellis Weismer, expressive language development.’’ American Journal of
1999; Yoder and Warren, 1998; Loeb and Armstrong, Speech-Language Pathology, 6, 34–39.
2001). Some child-oriented techniques include parallel Yoder, P., and Warren, S. (1998). Maternal responsivity pre-
talk, recasting, and expansions. Some interventions fo- dicts the prelinguistic communication intervention that
cus modeling language e¤ectively in everyday routines facilitates generalized intentional communication. Journal
and scripts. The ultimate goal of this early intervention of Speech, Language, and Hearing Research, 41, 1207–
for children with established risk and those who are at 1219.
risk is to reduce the likelihood that they will require fu- Ziev, M. S. R. (1999). Earliest intervention: Speech-language
ture intervention and special education services and will pathology services in the neonatal intensive care unit.
ASHA, May/June, 32–36.
gain the skills needed to participate socially and aca-
demically as they grow.
See also speech disorders in children: birth-
related risk factors. Further Readings
American Speech-Language-Hearing Association. (1989).
Communication-based services for infants, toddlers, and
—Diane Frome Loeb their families, ASHA, 31, 32–34.
Billeaud, F. P. (2000). Communication disorders in infants Communication Skills of People with

and toddlers (2nd ed.). Needham Heights, MA: Allyn and
Bacon. Down Syndrome
Bliele, K. M. (1997). Language intervention with infants and
toddlers. In L. McCormick, D. F. Loeb, and R. L. Schie-
felbusch (Eds.), Supporting children with communication
di‰culties in inclusive settings (pp. 307–333). Needham Down syndrome is the most common genetic disorder
Heights, MA: Allyn and Bacon. in children. The genotype involves an extra copy of the
Calandrella, A. M., and Wilcox, J. M. (2000). Predicting short arm of chromosome 21, either as trisomy (95%
language outcomes for young prelinguistic children with of cases), a translocation, or expressed mosaicly. This
developmental delay. Journal of Speech, Language, and condition is not inherited and occurrs on average in
Hearing Research, 43, 1061–1071. about 1 in 800 live births in the United States. Incidence
Gallagher, T. M., and Watkin, K. L. (1998). Prematurity and increases as maternal and paternal age increase. Down
language developmental risk: Too young or too small? syndrome a¤ects almost every system in the body. For
Topics in Language Disorders, 3, 15–25. example, brain size is smaller in adults though the same
Guralnick, M. J. (2001). Early childhood inclusion. Baltimore:
size at birth, 50% of these children have significant heart
Paul H. Brookes.
Hammer, C. H., and Weiss, A. L. (2000). African American defects requiring surgery, neuronal density in the brain is
mothers’ views of their infants’ language development significantly reduced, middle ear infection persists into
and language-learning environment. American Journal of adulthood, hypotonia ranges from mild to severe, and
Speech-Language Pathology, 9, 126–140. cognitive performance ranges from normal performance
Ore, G. (1997). A silver lining. Infant-Toddler Intervention, 7, to severe mental retardation. The remainder of this
79–92. article will summarize the specific speech, language and
Rescorla, L., and Alley, A. (2001). Validation of the Language communication features associated with this syndrome.
Development Survey (LDS): A parent report tool for iden- The unique features of speech, language, and hearing
tifying language delay in toddlers. Journal of Speech, Lan- ability of children with Down syndrome have been
guage, and Hearing Research, 44, 556–566.
detailed by Miller, Leddy, and Leavitt (1999). First is the
Rescorla, L., Roberts, J., and Dahlsgaard, K. (1997). Late
talkers at 2: Outcome at age 3. Journal of Speech, Lan- frequent hearing loss in infants and children, with more
guage, and Hearing Research, 44, 434–445. than 75% of young children found to have at least a
Rescorla, L., and Schwartz, E. (1990). Outcome of toddlers mild hearing problem at sometime in childhood. These
with specific expressive language delay. Applied Psycho- hearing problems can fluctuate, but about one-third
linguistics, 11, 393–407. of children have recurring problems throughout early
Scherer, T. (1999). The speech and language status of toddlers childhood that can lead to greater language and speech
with cleft lip and/or palate following early vocabulary in- delay. These results suggest particular attention be
tervention. American Journal of Speech-Language Pathol- directed to monitoring responsiveness to everyday
ogy, 8, 81–93. speech and frequent hearing testing through childhood.
Tannock, R., and Girolametto, L. (1992). Reassessing parent-
Second, there are unique verbal language characteristics
focused language intervention programs. In S. Warren and
J. Reichle (Eds.), Causes and e¤ects in communication and of persons with Down syndrome. Children experience
language intervention (pp. 49–79). Baltimore: Paul H. slower development of language relative to other cogni-
Brookes. tive skills. Communication performance is characterized
Thal, D. (2001). Language and gesture in early identified late- by better language comprehension than production.
talkers. Paper presented at the Symposium for Research on Vocabulary use is better than the mastery of the gram-
Child Language Disorders, Madison, WI. mar of the language. Progress in speech and language
Tomblin, J. B. (1996). Genetic and environmental contribu- performance is linked to several related factors, includ-
tions to the risk for specific language impairment. In M. ing hearing status, speech-motor function status, and
Rice (Ed.), Toward a genetics of language. Mahwah, NJ: advancing cognitive skills associated with a stimulating
Erlbaum.
verbal and nonverbal environment. Progress in speech,
Wetherby, A., Warren, S., and Reichle, J. (1998). Transitions in
prelinguistic communication. Baltimore: Paul H. Brookes. language, and communication should be expected be-
Yoder, P. J., and Warren, S. F. (2001). Relative treatment yond early childhood through adolescents (Chapman,
e¤ects of two prelinguistic communication interventions Hesketh, and Kistler, 2002). A third unique feature is a
on language development in toddlers with developmental protracted period of unintelligible speech. Speech intelli-
delays vary by maternal characteristics. Journal of Speech, gibility is a persistent problem of persons with Down
Language, and Hearing Research, 44, 224–237. syndrome through late childhood. Most family members
Ziev, M. S. R. (1999). Earliest intervention: Speech-language have some di‰culty understanding the speech of their
pathology services in the neonatal intensive care unit. children in everyday communication. Treatment pro-
ASHA, May/June, 32–36. tocols can improve speech intelligibility, leading to
improved communication of children and adults. Fi-
nally, the development of writing and literacy skills in
persons with Down syndrome should be expected. Chil-
dren participating in early reading and writing experi-
ences experience better communication and academic
skills than their peers with less literacy experience.
Communication Skills of People with Down Syndrome 289
Early reading programs have been successful at teaching Clearly, these constructs are interrelated and it would be
sight word vocabulary to children as young as 3 years of impossible to determine which is causal for any specific
age. behavior. We have found that motivation is central to
maintaining consistent response patterns. If we can pro-
Assessment Principles vide a task that is su‰ciently motivating, attention is
maintained and responses are more consistent. Our work
The following principles have evolved from our research suggests that successful assessments can be conducted
over the past 10 years. Children with Down syndrome by careful preparation and understanding the interests
are very challenging to evaluate. The first and perhaps of each child, what activities they like, what holds their
the only principle is to not make any assumptions about attention at home and school, and then selecting as-
perceptual, motor, and cognitive skills or the child’s ex- sessment materials that can be imbedded into these
perience with oral and written language. We suggest the activities.
following as guidelines for developing an assessment
protocol. Access all information sources about current Memory. The work of Michael Marcell (Marcell and
communication abilities across contexts—school, home, Weeks, 1988) documents verbal short-term memory def-
day care, and community. Review all data available on icits. This has significant implications for assessment of
motor and cognitive development as well as percep- language comprehension and production, particularly
tual (hearing and vision) status to direct the assessment when using standardized procedures that require pro-
decision-making process. Use flexible communication cessing specific stimuli and remembering it long enough
assessment protocols that can meet the specific attention to provide to appropriate response. Clearly, memory
shifts and motivational challenges of people with Down deficits may also be contributing to behaviors that may
syndrome. Make sure the context of the assessment be labeled as inattention or that result in inconsistent
matches the child’s performance level (i.e., play-based, response patterns. In our experience, providing visual
observation, and standardized measures). Contrast support enhances performance when verbal abilities are
measures conducted within familiar contexts, child- or tested. This may involve pictures, graphic material, or
family-centered approaches to assessment, with those printed words.
taken in the absence of relevant context, i.e., stan-
dardized tests. Assess the child’s communication envi- Motor Limitations. It has been widely reported that
ronments as well as the child’s independence in activities children with Down syndrome have motor deficits.
of daily living. Keep in mind that many persons with Hypotonia is frequently cited as a cause, but there are
Down syndrome have a history of failing tests, ‘‘escap- little data to support this claim. Motor deficits are quite
ing’’ boring formal assessment procedures. Cover all variable, with some children performing at age level
bases evaluating cognition performance, hearing, verbal and others show significant motor limitations delaying
and printed language comprehension and production, the onset of ambulation and other motor milestones.
oral nonspeech function, and speech behaviors. Re- Testing protocols must take into consideration the mo-
member that a child’s performance in the o‰ce or clinic tor demands on the child relative to the child’s motor
may represent a very thin slice of the full range of his or abilities. Make sure that the assessment tasks require
her capability. motor responses within the child’s capabilities.
Limitations of Standardized Tests Relevant for Vision. France (1992) provides a detailed account of
Children with Down Syndrome the visual deficits of children with Down syndrome. He
followed a group of 90 children and reported that 49%
Coggins (1998) identifies a number of limitations of had visual acuity deficits, with myopia being the most
standardized measures when testing children with devel- common. He also documented oculomotor imbalance
opmental disabilities. Most tests are examiner directed, in over 40% of the children, convergent strabismus ac-
limiting the child’s initiations and spontaneous language counting for the majority of these cases. In the majority
to single words or phrases. It is often di‰cult to trans- of these cases only glasses were required to achieve nor-
late test results into clinically useful outcomes. Rigid test mal vision. The message here is to make sure the chil-
administration guidelines make it di‰cult to use most dren can see the stimuli during testing.
measures with atypical children. Parents are typically
excluded from active participation in the assessment Hearing. Hearing remains an issue for children with
process. Down syndrome because of frequent episodes of otitis
media. Monitoring hearing should be done every 6
Challenges for Accurate Assessment months for the first 10 years of life. In our work we have
found that 33% of our children always had a hearing
Consistency of Responding. One of the most frustrating loss, 33% had a loss sometimes, and only 33% never had
characteristics of these children is the lack of consistency a loss. This was after screening out all of those children
of responding in assessment tasks. This variability is with significant hearing loss due to other causes. When
associated with two things in our experience. The first is oral language is tested, it is important to know the
rapid shifts in attention and the second is motivation. child’s hearing status on the day of testing.
Summary. Designing a testing protocol requires atten- ities, and providing new challenging opportunities. As
tion to the skills and abilities the child is expected to we consider increasing the frequency of communication
bring to the task. These include attention and motivation with children with Down syndrome, we find that family
di¤erences, memory deficits, hearing and visual deficits, styles are similar to those of typical children. Parents
and motor limitations. Each of these can compromise adjust their language and encourage their child’s per-
the outcome of the assessment if accommodations are formance through attention and support for task com-
not made. It is also clear that in order to optimize the pletion. Increasing the frequency of talk should be
consistency of responding, alternative testing formats encouraged in the context of family communication
will have to be implemented. These testing formats will about the child’s daily activities.
need to be less rigid, be context-based, and be child- Guidelines for developing optimal environments for
centered rather than examiner-centered. A skilled clini- talking with children (Miller, 1981) include six rules: (1)
cian will have to follow the child’s lead to implement Be enthusiastic. No one wants to talk with someone
functional, criterion reference, play-based assessments. who does not appear to be interested in what they are
Observational methods will also provide important saying. (2) Be patient. Allow children time and space to
information. perform. Don’t be afraid of pauses. Don’t overpower
the child with requests or directions. (3) Listen and fol-
Who Is Responsible for the Development of low the child’s lead. Help maintain the child’s focus
Communication Skills? (topic and meaning) with your responses, comments, and
questions. (4) Value the child. Recognize the child’s
The answer is that everyone is responsible—parents, comments as important and worth your individual at-
teachers, and speech-language pathologists—but parents tention. (5) Don’t play the fool. A valued conversational
have the most pervasive role in the process. A recent partner has something to say worth listening to, so pay
book by Betty Hart and Todd Risley documents the attention. (6) Learn to think like a child. Consider that
contribution parents make to their children’s language the child’s perspective of the world is di¤erent at di¤er-
development in the first 3 years of life (Hart and Risley, ent levels of cognitive development.
1995). Their results support what we have known for Research on language learning in children with Down
some time about language development: (1) parents are syndrome has documented that language learning is
the first teachers of speech, language, and communica- occurring and continues through adolescence (Chapman
tion skills; (2) children will follow their parent’s model of et al., 2002). The recent research on family communica-
communication action and style; and (3) social relations tion style and frequency of communication may account
are central for the development of communication skills. for why some children with Down syndrome learn lan-
guage more rapidly than other children. In our experi-
Promoting Family Communication Supporting ence, families with children making good progress with
Language Development their language and communication skills share common
features. They select language levels relative to their
The language and cognitive development of typical child’s ability to understand the message and not at their
children can be improved by simply talking to them, by ability to produce messages. They have realistic com-
producing more and longer utterances with more communication goals. These families expect that their child
plex vocabulary. This strategy can be used if we keep in will learn to read. They focus on understanding the con-
mind children’s ability to comprehend the language tent of their child’s message and are not as concerned
addressed to them and how parents adjust their own with the form of message. They make sure hearing test-
language to optimize the child’s chances of understanding is scheduled every six months through the devel-
ing the message. Parents automatically adjust their lan- opmental period. And they plan frequent outings to
guage to children on almost every linguistic dimension, provide their children with varied experiences outside the
phonological, syntactic, and semantic, including slowing home.
their rate of speech. The advice to talk more to children
must be understood to mean talk more while adjusting Who Is Running the Show, Parents Versus
language input to meet the child’s level of language
comprehension. This will facilitate processing the mes-
Professionals?
sage. Communication is the product of this game. More Diane Crutcher (1993) has powerfully presented the key
talk in the absence of the rest of the features necessary issues underlying tensions between parents and pro-
for successful communication cannot improve language fessionals in speech and language. She articulates three
development. If this were not true, children’s language issues parents perceive as limitations of speech-language
would improve as a function of the amount of time they intervention. The first is the lack of professional time,
spend listening to the radio or watching television. awareness, or unwillingness to explore intervention
Families that are successful communicators perhaps techniques specifically for their individual child. Second
talk more to their children, but the increased talking is in is the failure to modify intervention techniques into
the context of exchanging messages. Families are gener- strategies that fit a family’s natural lifestyle. The time
ally encouraging as a communication style, urging their constraints that most clinicians work under promote a
children to try new experiences, discussing their activ- ‘‘one size fits all’’ mentality. A lack of sensitivity to in-
Dementia 291
dividual family styles and needs renders many family Dementia

intervention programs ine¤ective, with families judged
as uninterested when in fact it is the therapists that
have failed. The third limitation is the failure of speech- Dementia is a syndrome characterized by deficits in
language professionals to realize that families have other multiple cognitive domains, including short- and long-
aspects of their lives that need attention, i.e., the activ- term memory and at least one of the following: aphasia,
ities of daily living, financial challenges, health concerns, apraxia, agnosia, and impaired executive function
other educational issues, and other family members. She (American Psychiatric Association, 1994). The deficits
also points out that most school and clinic settings allow must be su‰ciently potent to a¤ect social and occupa-
limited time for family interaction, perhaps once-a-year tional functioning and apparent in the absence of delir-
visits. While most of these limitations can be attributed ium. Dementia is associated with many disorders, but
to job settings, we must ensure that the family context most commonly with Alzheimer’s disease (AD) and
not be overlooked when designing e¤ective intervention Lewy body disease (LBD). Other common dementia-
sequences for children with Down syndrome. producing diseases are vascular disease and Parkinson’s
See also mental retardation. disease (PD).
Because communicative functioning is a manifesta-
—Jon F. Miller
tion of cognition, it is necessarily a¤ected when an indi-
vidual experiences the multiple cognitive deficits that
References define the syndrome of dementia. However, di¤erences
Chapman, R. S., Hesketh, L. J., and Kistler, D. (2002). Pre- in the nature and distribution of neuropathology in the
dicting longitudinal change in language production and common dementia-associated diseases produce unique
comprehension in individuals with Down syndrome: Hier- patterns of communication deficits.
archical linear modeling. Journal of Speech, Language, and
Coggins, T. (1998). Clinical assessment of emerging language: E¤ect of Alzheimer’s Neuropathology on
How to gather evidence and make informed decisions. In A. Communicative Function
Wetherby, S. Warren, and J. Reichle (Eds.), Transitions in
Prelinguistic Communication. Baltimore: Paul H. Brookes. The neurochemical alterations, neurofibrillary tangles,
Crutcher, D. (1993). Parent perspectives: Best practice and and neuritic plaques that characterize AD begin in the
recommendations for research. In A. Kaiser and D. Gray entorhinal cortex, perforant pathway, and hippocampal
(Eds.), Enhancing children’s communication: Research foun- formation. Gradually, cells throughout the neocortex are
dations for intervention. Baltimore: Paul H. Brookes. a¤ected, especially those in temporoparietal cortex. The
France, T. (1992). Ocular disorders in Down syndrome. In I. formation of tangles and plaques eventuates in cell death
Lott and E. McCoy (Eds.), Down syndrome: Advances in and interferes with intercellular transmission. Subcorti-
medical care (pp. 147–156). New York: Wiley-Liss. cal structures and the motor strip are relatively free of
Hart, B., and Risley, T. (1995). Meaningful di¤erences. Balti- neuropathology throughout much of the disease, which
accounts for the fact that the speech of individuals with
Marcell, M., and Weeks, S. (1988). Short-term memory di‰-
culties and Down syndrome. Journal of Mental Deficiency AD is spared.
Research, 32, 153–162. Because AD begins in the hippocampal complex,
Miller, J. (1981). Assessing language production in children. an area important for the formation of recent or epi-
Boston: Allyn and Bacon. sodic memory, the typical initial manifestation is loss
Miller, J., Leddy, M., and Leavitt, L. (1999). Improving the of memory for recent events. With disease progression
communication of people with Down syndrome. Baltimore: to frontal cortex and temporoparietal association areas,
Paul H. Brookes. other declarative memory systems are a¤ected, specifi-
cally semantic and lexical memory. Because the basal
Further Readings ganglia and motor cortex are spared throughout most of
Chapman, R. (1997). Language development in children and the disease course, procedural memory also is spared.
adolescents with Down syndrome. Mental Retardation In the early stages of AD, communicative functions
and Developmental Disabilities Research Reviews, 3, 307– dependent on recent memory, such as holding a con-
312. versation, are a¤ected. A¤ected individuals forget what
Cole, K., Dale, P., and Thal, D. (Eds.). (1996). Assessment of they have just heard, seen, or said. Many sentences are
communication and language. Baltimore: Paul H. Brookes. left unfinished, with forgotten communicative intentions,
Miller, J., and Paul, R. (1995). The clinical assessment of lan- and repetitiousness is common. Comprehension of writ-
guage comprehension. Baltimore: Paul H. Brookes. ten materials, particularly long passages, diminishes be-
Paul, R. (1995). Language disorders from infancy through ado- cause of memory impairment. The mechanics of reading
lescence. New York: Mosby.
Warren, S., and Yoder, P. (1997). Communication, language
are spared, however, and individuals with mild AD can
and mental retardation. In W. E. MacLean (Ed.), Ellis’ still write, though they make frequent spelling errors.
handbook of mental deficiency, psychological theory and re- The expression of grammar and syntax is remarkably
search (3rd ed., pp. 379–403). Mahwah, NJ: Erlbaum. intact, although occasional errors may be made. In-
Wetherby, A., Warren, S., and Reichle, J. (1998). Transitions in dividuals with mild AD can usually follow three-
prelinguistic communication. Baltimore: Paul H. Brookes. stage commands, answer comparative questions, name
familiar items on confrontation, generate exemplars in a of cases of dementia (Cummings and Benson, 1992;
category, define familiar words, and describe pictures, McKeith et al., 1992) have this cause. Patients with
although their descriptions do not contain as much fac- LBD often have concomitant Alzheimer’s pathology,
tual information as those of age-matched healthy elders and some have proposed that LBD is a variant of AD.
(Bayles and Tomoeda, 1993; Hopper, Bayles, and Kim, However, there are cases of pure LBD, which argues for
2001). the theory that LBD is neuropathologically distinct (see
By the middle stages of AD, when a¤ected individuals Cercy and Bylsma, 1997, for a review).
have become disoriented for time and place and memory Symptoms of LBD include fluctuating cognition in
problems are more florid, communication is more dis- 80%–90% of patients (Byrne et al., 1989; McKeith et al.,
rupted (Bayles and Tomoeda, 1995). Meaningful verbal 1992), visual or auditory hallucinations, paranoid delu-
output diminishes because individuals have increasing sions, extrapyramidal features, and confusion (McKeith
trouble generating a series of meaningful ideas. Writing and O’Brien, 1999). Intellectual deterioration is more
words to dictation may remain, but writing letters or rapid than that observed in AD patients and disease du-
pieces of any length is problematic. The ability to read is ration is shorter.
retained, although a¤ected individuals rapidly forget Like the dementia of AD, LB dementia has an insid-
what they have read. Grammar and syntax continue to ious onset and progressive course (Hansen, Salmon, and
resist prominent disease e¤ects. Galasko, 1990), producing the temporoparietal features
Persons with mid-stage AD can greet, name, and of aphasia, apraxia, and agnosia (Byrne et al., 1989).
express many needs. Most can participate in short con- Because the extrapyramidal features may be later occur-
versations, especially if those conversations involve ring, LBD in its early stages may be misdiagnosed as
only two people; however, they frequently have trouble AD. Prominent memory deficits may not be the pre-
retrieving desired names. They can answer questions senting feature but may appear as the disease progresses.
and understand common gestures. Two-stage commands The literature on the e¤ects of LBD on language and
are comprehensible by most persons with mid-stage AD, communicative functioning is scant. However, it is rea-
and some can follow three-stage commands. Reading sonable to expect that the associated confusion, memory
comprehension for single words remains good. Most and attentional deficits will disrupt communication in
individuals with mid-stage disease can still name on much the same way that they do in AD. Because fluctu-
confrontation and produce exemplars in a category, but ating cognitive status is a prominent feature of LBD,
not as e‰ciently or accurately as normal elders. Verbal clinicians can expect wide fluctuation in communication
output continues to diminish in terms of meaningfulness, skills.
and sentence fragments are more common. Heyman and colleagues (1999) compared the neuro-
By the end stage of the disease, memory loss is ex- psychological test performance of individuals with AD
tensive, disorientation may extend to self as well as with that of individuals with AD plus LBD. Both groups
time and place, and problem-solving skills are minimal. of patients were severely impaired on measures of mental
Urinary and then fecal incontinence develop, and ulti- status, verbal fluency, confrontation naming, concentra-
mately ambulatory ability is severely compromised or tion, visuospatial/constructional abilities, constructional
lost. Nonetheless, a few linguistic skills are intact (Bayles praxis, and word list learning and recognition. The sole
et al., 2000). Most individuals retain some functional significant di¤erence between the two groups appeared
vocabulary, although a small percentage are mute. on the delayed recall of a word list, with AD-only
Much of the language produced by those who still speak patients being more impaired.
is nonsensical. Nonetheless, many patients can follow a
one-stage command demonstrating comprehension of Vascular Dementia
language. The majority can read a simple word. Many
retain common social phrases such as ‘‘I don’t care’’ and Dementia can be produced by disease of the brain’s
‘‘I don’t know,’’ and can contribute to a conversation. vascular system, and the distribution of disease defines
the nature of the neuropsychological deficits. However,
E¤ects of Lewy Body Pathology on the many possible variations in the nature and distribu-
tion of vascular disease make it impossible to specify a
Communicative Function typical profile of neuropsychological deficits. For exam-
Lewy bodies are spherical, intracytoplasmic neuronal ple, individuals with an infarct in the territory of the left
inclusions that have a dense hyaline core and a halo posterior cerebral artery involving the temporo-occipital
of radiating filaments composed of proteins contain- region might exhibit amnesia, whereas an individual
ing ubiquitin and associated enzymes (McKeith and with an infarcts in the left anterior cerebral artery
O’Brien, 1999). They were first described in the literature involving the medial frontal region may have prominent
by the German neuropathologist Friederich Lewy in executive dysfunction, with both nonetheless meeting the
1912. Lewy bodies are classically associated with Par- criteria for dementia. Similarly, individuals with cortical
kinson’s disease, particularly in the basal ganglia, brain- infarct di¤er from those with subcortical infarcts.
stem, and diencephalic nuclei. They may also be Individuals at greatest risk for developing vascular
widespread in the cerebral cortex. Di¤use distribution of dementia are those who have experienced one or more
Lewy bodies is associated with dementia, and 10%–15% clinically evident ischemic strokes (Desmond et al.,
Dementia 293
1999). In fact, one-fourth to one-third develop dementia Bayles, K. A., and Tomoeda, C. K. (1983). Confrontation
within 3 months (Pohjasvaara et al., 1998; Tatemichi naming impairment in dementia. Brain and Cognition, 19,
et al., 1992). Examination of neuropsychological abilities 98–114.
has revealed inconsistent patterns of strengths and Bayles, K. A., and Tomoeda, C. K. (1993). The Arizona Bat-
tery for Communication Disorders of Dementia. Austin, TX:
weaknesses (Reichman et al., 1991), but executive func-
Pro-Ed.
tions often are disproportionately impaired, as are motor Bayles, K. A., and Tomoeda, C. K. (1994). The Functional
aspects of language production (Powell et al., 1988). Linguistic Communication Inventory. Austin, TX: Pro-Ed.
Bayles, K. A., and Tomoeda, C. K. (1995). The ABC’s of
Parkinson’s Dementia dementia. Austin, TX: Pro-Ed.
Parkinson’s disease is associated with a loss of striatal Bayles, K. A., Tomoeda, C. K., Cruz, R. F., and Mahendra,
N. (2000). Communication abilities of individuals with late-
dopaminergic neurons, particularly in the pars compacta
stage Alzheimer disease. Alzheimer Disease and Associated
region of the substantia nigra. Tremor is the best recog- Disorders, 14, 176–181.
nized symptom and is present in approximately half of Brown, R. G., and Marsden, C. D. (1988). Internal versus
individuals with PD (Martin et al., 1983). Often tremor external cues and the control of attention in Parkinson’s
begins unilaterally, increasing with stress and disappear- disease. Brain, 111, 323–345.
ing in sleep. Other early symptoms include aching, Byrne, E. J., Lennox, G., Lowe, J., and Godwin-Austen, R. B.
paresthesias, and numbness and tingling on one side of (1989). Di¤use Lewy body disease: Clinical features in 15
the body that ultimately spread to the other side. Other cases. Journal of Neurology, Neurosurgery, and Psychiatry,
classic motor symptoms are rigidity, slowness of move- 52, 709–717.
ment, and alterations in posture. Cercy, S. P., and Bylsma, F. W. (1997). Lewy bodies and pro-
gressive dementia: A critical review and meta-analysis.
Not all individuals with PD develop dementia, and
Journal of the International Neuropsychological Society, 3,
prevalence estimates vary. Marttila and Rinne (1976), in 179–194.
one of the most comprehensive studies of prevalence, Cummings, J. L., and Benson, D. F. (1992). Dementia: Defi-
reported it to be 29%. Other investigators have reported nition, prevalence, classification and approach to diagnosis.
similar estimates of dementia prevalence (Rajput and In J. L. Cummings and D. F. Benson (Eds.), Dementia: A
Rozdilsky, 1975; Mindham, Ahmed, and Clough, 1982; clinical approach. Boston: Butterworth-Heinemann.
Huber, Shuttleworth, and Christy, 1989). Widely de- Desmond, D. W., Erkinjuntti, T., Sano, M., Cummings, J. L.,
bated is the cause of the dementia, with some attrib- Bowler, J. V., Pasquier, F., et al. (1999). The cognitive syn-
uting it to cortical degeneration and others to subcortical drome of vascular dementia: Implications for clinical trials.
damage that impairs neurological control of attention Alzheimer Disease and Associated Disorders, 13, S21–S29.
Hansen, L., Salmon, D., and Galasko, D. (1990). The Lewy
(Brown and Marsden, 1988). Rinne and colleagues
body variant of Alzheimer’s disease: A clinical and patho-
(2000) argue that reduced fluorodopa uptake in the logic entity. Neurology, 40, 1–8.
caudate nucleus and frontal cortex produces impaired Heyman, A., Fillenbaum, G. G., Gearing, M., Mirra, S. S.,
performance on neuropsychological tests that require Welsh-Bohmer, K. A., Peterson, B., et al. (1999). Compari-
executive function. son of Lewy body variant of Alzheimer’s disease with pure
Individuals with PD, regardless of whether they de- Alzheimer’s disease. Neurology, 52, 1839–1844.
velop dementia, have speech motor deficits because the Hopper, T., Bayles, K. A., and Kim, E. (2001). Retained neu-
disease damages the basal ganglia and striatal-cortical ropsychological abilities of individuals with Alzheimer’s
circuitry, which are involved in motor function. Those disease. Seminars in Speech and Language, 22, 261–273.
who develop dementia have problems communicating Huber, S. J., Shuttleworth, E. C., and Christy, J. A. (1989).
Magnetic resonance imaging in dementia of Parkinson’s
for other reasons, namely deficits in memory, attention,
disease. Journal of Neurology, Neurosurgery, and Psychia-
and executive functions. However, considerable evidence try, 52, 1221–1227.
exists that language knowledge generally is preserved Huber, S. J., Shuttleworth, E. C., Paulson, G. W., et al. (1986).
(Pirozzolo et al., 1982; Bayles and Tomoeda, 1983; Cortical vs subcortical dementia: Neuropathological di¤er-
Huber et al., 1986). Bayles (1997) argued that impaired ences. Archives of Neurolology, 30, 1326–1330.
performance on tests that manipulate language, such as Lewy, F. H. (1912). Paralysis agitans: I. Pathologische Anato-
confrontation naming and sentence comprehension, re- mie. In Lewandowsky (Ed.), Handbuch der Neurologie.
sult more from nonlinguistic cognitive deficits than a loss New York: Springer.
of linguistic knowledge. Martin, W. E., Loewenson, R. B., Resch, J. A., and Baker,
See also alzheimer’s disease. A. B. (1983). Parkinson’s disease: A clinical analysis of 100
patients. Neurology, 23, 783–390.
—Kathryn A. Bayles Marttila, R. J., and Rinne, U. K. (1976). Dementia in Par-
kinson’s disease. Acta Neurologica Scandinavica, 54, 431–
References 441.
McKeith, I. G., and O’Brien, J. (1999). Dementia with Lewy
American Psychiatric Association. (1994). Diagnostic and sta- bodies. Australian and New Zealand Journal of Psychiatry,
tistical manual of mental disorders—Fourth edition. Wash- 33, 800–808.
ington, DC: Author. McKeith, I. G., Perry, R. H., Fairbairn, A. F., Jabeen, S., and
Bayles, K. A. (1997). The e¤ect of Parkinson’s disease on lan- Perry, E. K. (1992). Operational criteria for senile dementia
guage. Journal of Medical Speech-Language Pathology, 5, of Lewy body type (SDLT). Psychological Medicine, 22,
157–166. 911–922.
Mindham, R. H. S., Ahmed, S. W., and Clough, C. G. (1982). in cognitive function in idiopathic Parkinson’s disease.
A controlled study of dementia in Parkinson’s disease. Archives of Neurology, 53, 1140–1146.
Journal of Neurology, Neurosurgery, and Psychiatry, 45, Cheri, H. C. (1989). Dementia. Archives of Neurology, 46,
969–974. 806–814.
Pirozzolo, F. J., Hansch, E. C., Mortimer, J. A., et al. (1982). Ehrlich, J. S., Oler, L. K., and Clark, L. (1997). Ideational and
Dementia in Parkinson’s disease: A neuropsychological semantic contributions to narrative production in adults
analysis. Brain and Cognition, 1, 71–83. with dementia of the Alzheimer’s type. Journal of Commu-
Pohjasvaara, T., Erkinjuntti, T., Ylikoski, R., Hietanen, M., nication Disorders, 30, 79–99.
Vataja, R., and Kaste, M. (1998). Clinical determinants of Grossman, M., D’Esposito, M., Hughes, E., Onishi, K., Bias-
poststroke dementia. Stroke, 29, 75–81. sou, N., White-Devine, T., et al. (1996). Language com-
Powell, A. L., Cummings, J. L., Hill, M. A., and Benson, D. F. prehension profiles in Alzheimer’s disease, multi-infarct
(1988). Speech and language alterations in multi-infarct de- dementia, and frontotemporal degeneration. Neurology, 47,
mentia. Neurology, 38, 717–719. 183–189.
Rajput, A. H., and Rozdilsky, B. (1975). Parkinsonism and Nebes, R. (1992). Semantic memory dysfunction in Alzhei-
dementia: E¤ects of l-dopa. Lancet, 1, 1084. mer’s disease: Disruption of semantic knowledge or informa-
Reichman, W., Cummings, J., McDanniel, K., Flynn, F., and tion processing limitation. In L. R. Squire and N. Butters
Gornbein, J. (1991). Visuoconstructional impairment in de- (Eds.), Neuropsychology of memory (2nd ed., pp. 233–240).
mentia syndromes. Behavioral Neurology, 4, 153–162. New York: Guilford Press.
Rinne, J. O., Portin, R., Ruottinen, H., Nurmi, E., Bergman, Orange, J. B., and Purves, B. (1996). Conversational discourse
J., Haaparanta, M., et al. (2000). Cognitive impairment and and cognitive impairment: Implications for Alzheimer’s
the brain dopaminergic system in Parkinson disease. disease. Journal of Speech-Language Pathology and Audiol-
Archives of Neurology, 57, 470–475. ogy, 20, 139–150.
Tatemichi, T. K., Desmond, D. W., Mayeux, R., et al. (1992). Sabat, S. R. (1994). Language function in Alzheimer’s disease:
Dementia after stroke: Baseline frequency, risks, and A critical review of selected literature. Language and Com-
clinical features in a hospitalized cohort. Neurology, 42, munication, 14, 331–351.
1185–1193. Snowdon, D. A., Kemper, S. J., Mortimer, J. A., Greiner,
L. H., Wekstein, D. R., and Markesbery, W. R. (1996).
Further Readings Linguistic ability in early life and cognitive function and
Alzheimer’s disease in late life. Journal of the American
Azuma, T., and Bayles, K. A. (1997). Memory impairments Medical Association, 275, 528–532.
underlying language di‰culties in dementia. Topics in Lan- Tomoeda, C. K., and Bayles, K. A. (1993). Longitudinal
guage Disorders, 18, 58–71. e¤ects of Alzheimer disease on discourse production. Alz-
Azuma, T., Cruz, R., Bayles, K. A., Tomoeda, C. K., Wood, heimer Disease and Associated Disorders, 7, 223–236.
J. A., and Montgomery, E. B. (2000). Incidental learning
and verbal memory in individuals with Parkinson’s dis-
ease. Journal of Medical Speech-Language Pathology, 3,
163–174. Dialect Speakers
Bayles, K. A. (1991). Age at onset of Alzheimer’s disease.
Archives of Neurology, 48, 155–159.
Bayles, K. A. (1993). Pathology of language behavior in de- A dialect refers to any variety of language that is shared
mentia. In G. Blanken, J. Dittman, H. Grimm, J. C. Mar- by a group of speakers. It is not possible to speak a
shall, and C. W. Wallesch (Eds.), Linguistic disorders and language without also speaking a dialect (Wolfram and
pathologies: An international handbook (pp. 388–409). Ber- Schilling-Estes, 1998). Although all dialects of a lan-
lin: Walter de Gruyter. guage are equally systematic and complex, on a social
Bayles, K. A., Azuma, T., Cruz, R. F., Tomoeda, C. K., level, dialects are often described as falling on a contin-
Wood, J. A., and Montgomery, E. B. (1999). Gender dif- uum of standardness. The most standard dialect of a
ferences in language of Alzheimer disease patients revisited. language generally reflects an idealized prestige form
Alzheimer Disease and Associated Disorders, 13, 138–146. that is rarely spoken by anyone in practice. Rules for
Bayles, K. A., and Kaszniak, A. W., with Tomoeda, C. K.
producing this standard, however, can be found in for-
(1987). Communication and cognition in normal aging and
dementia. Austin, TX: Pro-Ed. mal grammar guides and dictionaries. Versions of the
Bayles, K. A., and Tomoeda, C. K. (1995). The ABCs of de- standard can also be found in formal texts that have
mentia (2nd ed.). Austin, TX: Pro-Ed. been written by established writers. Next in standardness
Bayles, K. A., Tomoeda, C. K., Kaszniak, A. W., and Trosset, are a number of formal and informal oral dialects. These
M. W. (1991). Alzheimer’s disease e¤ects on semantic dialects reflect the language patterns of actual speakers.
memory: Loss of structure or impaired processing? Journal Norms of acceptability for these dialects vary as a func-
of Cognitive Neuroscience, 3, 166–182. tion of the regional and social characteristics of di¤erent
Bayles, K. A., Tomoeda, C. K., and Trosset, M. W. (1993). communities and of di¤erent speakers within these
Alzheimer’s disease: E¤ects on language. Developmental communities. Nonstandard dialects represent the other
Neuropsychology, 9, 131–160.
Bayles, K. A., Tomoeda, C. K., Wood, J. A., Cruz, R., and
end of the continuum. These dialects also reflect spoken
McGeagh, A. (1996). Comparison of sensitivity to Alz- language, but they include socially stigmatized linguistic
heimer’s dementia of the ABCD and other measures. Jour- structures. Other terms used to describe nonstandard
nal of Medical Speech Language Pathology, 4, 183–194. dialects are nonmainstream and vernacular.
Bayles, K. A., Tomoeda, C. K., Wood, J. A., Montgomery, At the linguistic level, scholars repeatedly highlight
E. B., Cruz, R. F., McGeagh, A., et al. (1996). Change the arbitrary nature of a dialect’s social acceptability
Dialect Speakers 295
(i.e., standardness). In fact, Milroy and Milroy (2000) Besides contrastive forms, there are three other ways
argue that contradictory and changing attitudes to the in which dialects di¤er from one another. One way is in
same linguistic phenomenon can emerge at di¤erent times the frequency with which particular language forms are
in the history of a language. For example, as these authors produced. Wolfram’s (1986) analysis of consonant clus-
note, before World War II, absence of postvocalic /r/ in ter reduction in 11 di¤erent English dialects is useful for
words such as car and park was not stigmatized in New illustrating this finding. His data showed cluster reduc-
York City. By 1966, however, r-lessness had become a tion occurring 3% of the time for standard English and
stigmatized marker of casual style and lower social class. northern white working-class English, 4% for north-
English dialects containing r-lessness continue to be ern African-American working-class English, 10% for
stigmatized in the United States, but in England, English southern white working-class English, 36% for southern
dialects with this same linguistic pattern have high status. African-American working-class English, 5% for Appa-
Most linguistic patterns that occur in the standard lachian working-class English, 10% for Italian-American
dialects of a language also occur in those that are non- working-class English, 22%–23% for Chicano working-
standard. Seymour, Bland-Stewart, and Green (1998) class and Puerto Rican working-class (New York City)
refer to these language patterns as noncontrastive, English, 60% for Vietnamese English, and 81% for Na-
because all dialects of a language are thought to share tive American Puebloan English. For each dialect listed,
these forms. Despite the similarity that exists among the percentage reflects the degree to which consonant
dialects, nonstandard versions are typically described by clusters were reduced in regular past tense contexts that
listing only those language patterns that do not appear were followed by a nonconsonant (e.g., ‘‘Tom live in’’).
in the standard varieties. Seymour, Bland-Stewart, and As can be seen, all 11 of the dialects showed cluster
Green refer to these patterns as the contrastive features. reduction, but the frequency with which this pattern
Descriptions of nonstandard dialects become even nar- occurred in each dialect greatly varied.
rower when they are generated by the media and gen- A second way in which dialects di¤er from one an-
eral public, because these descriptions tend to highlight other has to do with the linguistic environments in which
only the contrastive patterns that are highly stigmatized particular language forms occur. The e¤ects of linguistic
(Rickford and Rickford, 2000). Zero marking of the context on language use are often described as linguistic
copula be (i.e., he walking) is an example of a pattern constraints or linguistic conditions (Chambers, 1995).
that is frequently showcased for African-American En- Studies of linguistic constraints typically occur with lan-
glish (AAE) (Rickford, 1999). Other stereotypic patterns guage forms that show systematic variability in their
include the use of ya’ll and fixing to to describe versions surface structure. An example of a variable form is one
of Southern White English (SWE), a- prefixing (e.g., he that can be overtly expressed (i.e., present in the surface
was a-walking . . .) to characterize Appalachian English, grammar; he is walking) in some contexts but is zero-
and pronouncing think and that as tink and dat to depict marked (i.e., absent from the surface grammar; he walk-
Cajun English. ing) in others.
Although it is relatively easy to identify the language One of the most widely studied variable forms is the
forms that di¤erentiate a nonstandard dialect from one copula be, and research on this structure has typically
that is viewed as standard, it is much more di‰cult to involved nonstandard versions of AAE. At least six dif-
identify patterns that distinguish one nonstandard dia- ferent linguistic constraints have been found to influence
lect of a language from another. One reason for this is AAE speakers’ use of the copula be. These constraints
that many nonstandard dialects of a language share the include the type of preceding noun phrase (noun phrase
same contrastive patterns. Unique contrastive patterns versus personal pronoun versus other pronoun), the
for di¤erent nonstandard dialects are particularly rare phonological characteristics of the preceding environ-
when the dialects being compared are produced in the ment (vowel versus consonant; voiced versus unvoiced
same community and by speakers of the same social consonant), the person, number, and tense of the verb
class. Oetting and McDonald (2001, 2002) illustrated context (first, second, third; present versus past), the
this finding by comparing the contrastive patterns of two grammatical function of the be form (copula versus
nonstandard dialects spoken in southeastern Louisiana. auxiliary), the nature of the following predicate clause
The data for this comparison were language samples (locative versus adjective versus noun), and the phono-
of children who lived in the same rural community and logical characteristics of the following environment
attended the same schools. Forty of the children were (vowel versus consonant) (Rickford, 1999). The person,
African American and spoke a southern rural version of number, and tense of the verb context, the grammatical
AAE, and 53 were white and spoke a rural version of function of the be form, and the nature of the following
SWE. The AAE and SWE dialects spoken by the chil- predicate clause also have been shown to a¤ect copula
dren were deemed distinct through the use of a listener be marking in various nonstandard SWE dialects
judgment task and a discriminant function analysis of (Wolfram, 1974; Wynn, Eyles, and Oetting, 2000). Other
35 di¤erent nonstandard (i.e., contrastive) language pat- morphosyntactic structures of English that have been
terns found in the transcripts. Nevertheless, of the 35 found to be influenced by linguistic constraints include,
contrastive patterns examined, 31 of them were found in but are not limited to, negation, do support, verb agree-
the conversational speech of both the AAE and SWE ment, relative pronouns, plural marking, and question
child speakers. inversion (Poplack, 2000).
A third way in which dialects di¤er from one another factors on dialect use interact in complex ways with
is in the semantic meanings or grammatical entailments those that are internal to a speaker. The dynamic inter-
of some forms (Labov, 1998). These particular cases in- actions that occur between and among these variables
volve language patterns that occur in most dialects of a help explain why dialect use is often described as fluid,
language, but their meanings or use in the grammar are flexible, and constantly changing.
unique to a particular dialect. These patterns are often The challenge for scientists in communication dis-
described as camouflaged forms because, on the surface, orders is to learn how a speech or language disorder
the contrastive nature of these forms can be di‰cult a¤ects a person’s use of language, regardless of the dia-
to notice (Wolfram and Schilling-Estes, 1998). Use of lect spoken. Thus far, most descriptions of childhood
had þ Ved is an example of a camouflaged pattern. In language impairment have been made within the context
most dialects of English, this structure carries past per- of standard dialect varieties only. Extending the study of
fect meaning (e.g., ‘‘I already had eaten the ice cream childhood language impairment to di¤erent nonstandard
when she o¤ered the pie’’). In some English dialects such dialects is a topic of current scholarly work and debate
as AAE, however, had þ Ved also can express preterite (see dialect versus disorder).
(i.e., simple past) meaning (Rickford and Rafal, 1996).
Ross, Oetting, and Stapleton (in press) provide the fol- —Janna B. Oetting
lowing sample from a 6-year-old AAE speaker to illus-
trate the preterite meaning of this form: ‘‘Then my
References
mama said, ‘It’s your mama. Let me talk to your daddy.’ Chambers, J. (1995). Sociolinguistic Theory. Malden, MA:
Then she had told my daddy to come with us and bring a Blackwell.
big rope so they could pull the car home.’’ Christian, D. (2000). Reflections of language heritage: Choice
Dialect use is a¤ected by factors that are both internal and chance in vernacular English dialects. In J. Peyton, P.
and external to a speaker (Milroy, 1987; Chambers, Gri‰n, W. Wolfram, and R. Fasold (Eds.), Language in
1995; Wolfram and Schilling-Estes, 1998). Internal fac- action (pp. 213–229). Cresskill, NJ: Hampton Press.
Dubois, S., and Horvath, B. (1998). Let’s tink about dat:
tors that have been shown to influence the type and
Interdental fricatives in Cajun English. Language Variation
density of one’s dialect include age, sex, race, region of and Change, 10, 245–261.
the country, socioeconomic status, type of community, Farr-Whitman, M. (1981). Dialect influence in writing. In M.
and type of social network. Interestingly, regardless Farr-Whiteman (Ed.), Writing: The nature, development,
of race, region, community, and network, members of and teaching of written communication (pp. 153–166). Hills-
lower social classes produce a greater frequency of con- dale, NJ: Erlbaum.
trastive dialect forms than members of higher classes. Labov, W. (1982). The social stratification of English in
Greater frequencies of contrastive patterns also have New York City. Washington, DC: Center for Applied
been found for younger adults than for older adults, and Linguistics.
for males than for females. Exceptions to these general- Labov, W. (1998). Co-existent systems in African-American
vernacular English. In S. Mufwene, J. Rickford, G. Bailey,
ities do exist, however. For example, Dubois and Hor- and J. Baugh (Eds.), African-American English (pp. 110–
vath (1998) documented a V-shaped age pattern rather 153). New York: Routledge.
than a linear one in their study of Cajun English. They Lucas, C., and Borders, D. (1994). Language diversity and
also found that the type and degree of the age pattern classroom discourse. Norwood, NJ: Ablex.
(V-shaped versus linear) depended on the speaker’s sex Milroy, J., and Milroy, L. (2000). Authority in language:
and type of social network. In particular, the V-shaped Investigating standard English (3rd ed.). New York:
pattern was more pronounced for men than for women, Routledge.
and only women from closed social networks showed the Milroy, L. (1987). Language and social networks (2nd ed.).
V-shaped pattern. Women in open networks showed a New York: Basil Blackwell.
linear pattern. Interestingly, though, the linear trend Oetting, J., and McDonald, J. (2001). Nonmainstream dialect
use and specific language impairment. Journal of Speech,
reflected higher frequencies of nonstandard dialect use Language, and Hearing Research, 44, 207–223.
by the older women than younger women. This finding Oetting, J., and McDonald, J. (2002). Methods for charac-
contrasts with what is typically reported in other non- terizing study participants’ nonmainstream dialect use in
standard dialect work, namely, that older adults present studies of specific language impairment. Journal of Speech,
fewer instances of nonstandard forms than younger Language, and Hearing Research, 45, 505–518.
adults. Poplack, S. (Ed.). (2000). The English history of African
Some of the external factors that have been shown American English. Maldon, MA: Blackwell.
to a¤ect dialect use include the type of speaking style Rickford, J. (1999). African American vernacular English.
(casual versus formal; interview versus conversation), Malden, MA: Blackwell.
speaking partner (familiar versus unfamiliar; with au- Rickford, J., and Rafal, C. (1996). Preterite had þ V-ed in the
narratives of African American preadolescents. American
thority versus without), modality of expression (speaking Speech, 71, 227–254.
versus writing versus reading), genre (persuasive versus Rickford, J., and Rickford, R. (2000). Spoken souls. New
informative versus imaginative), and type of speech act York: Wiley.
(comment versus request for information) (for data, see Ross, S., Oetting, J., and Stapleton, B. (in press). Preterite had-
Farr-Whitman, 1981; Labov, 1982; Smitherman, 1992; Ved: A narrative discourse structure of AAE. American
Lucas and Borders, 1994). Influences of these external Speech.
Dialect Versus Disorder 297
Seymour, H., Bland-Stewart, L., and Green, L. (1998). Dif- also formally rebuked the practice of diagnosing and
ference versus deficit in child African American English. treating any dialect of a language as an impairment. The
Language, Speech, and Hearing Services in Schools, 29, 96– data used to support this stance were sociolinguistic
108. findings regarding the systematic and complex nature of
Smitherman, G. (1992). Black English, diverging or converg-
all dialects, including those that are socially stigmatized
ing? The view from the national assessment of educational
progress. Language and Education, 6, 47–64. (see dialect speakers).
Wolfram, W. (1974). The relationship of white southern speech ASHA’s position statement remains relevant today.
to Vernacular Black English. Language, 50, 498–527. Research on children’s acquisition and use of di¤erent
Wolfram, W. (1986). Language variation in the United States. dialects is still a relatively new area of scientific en-
In O. Taylor (Ed.), Nature of communication disorders in deavor, but small strides have been made. For example,
culturally and linguistically diverse populations (pp. 73–115). there now exist numerous articles describing di¤erent
San Diego, CA: College-Hill Press. dialects of English, and current publications about
Wolfram, W., and Schilling-Estes, N. (1998). American En- childhood language development, assessment, and treat-
glish. Malden, MA: Blackwell. ment now routinely include discussions of dialect diver-
Wynn, C., Eyles, L., and Oetting, J. (2000). Nonmainstream
sity. The di¤erent types of testing biases that may
dialect use and copula be: Analysis of linguistic constraints.
Paper presented at the annual convention of the American surface when the language tester and testee come from
Speech, Language, Hearing Association, Washington, DC. di¤erent linguistic or cultural backgrounds have also
been described (Fagundes et al., 1998; Wilson, Wilson,
Further Readings and Coleman, 2000). A few traditional speech and lan-
guage tests have added alternative scoring procedures
Bernstein, C., Nunnally, T., and Sabino, R. (Eds.). (1997). for some nonstandard dialects (Garn-Nunn and Perkins,
Language variety in the South revisited. Tuscaloosa: Uni- 1999; Ryner, Kelly, and Krueger, 1999). Alternative or
versity of Alabama Press. new dialect scoring methods also have been created for
Feagin, C. (1979). Variation and change in Alabama English: A
the analysis of children’s conversational language sam-
sociolinguistic study of the white community. Washington,
DC: Georgetown University Press. ples (Nelson, 1991; Stockman, 1996; McGregor et al.,
Labov, W. (1994). Principles of linguistic change: Internal fac- 1997; Seymour, Bland-Stewart, and Green, 1998).
tors. Maldon, MA: Blackwell. Most of the advances listed started with methods and
Labov, W. (2001). Principles of linguistic change: Social fac- materials that were designed for speakers of standard
tors. Maldon, MA: Blackwell. English. Two di¤erent types of changes were then made
Montgomery, M., and Bailey, G. (Eds.). (1986). Language va- to these methods. One change involved broadening the
riety in the South. Tuscaloosa: University of Alabama Press. range of language forms that are considered normal by
Mufwene, J., Rickford, J., Bailey, G., and Baugh, J. (Eds.). including the contrastive patterns of di¤erent dialects.
(1998). African-American English (pp. 85–109). New York: The other type of change was to restrict the analysis to
Routledge.
only the noncontrastive patterns. Contrastive patterns
Peyton, J., Gri‰n, P., Wolfram, W., and Fasold, R. (Eds.).
(2001). Language in action: New studies of language in soci- are those that show variation in surface structure
ety essays in honor of Roger W. Shuy (pp. 213–229). Cres- across di¤erent dialects of a language (Seymour, Bland-
skill, NJ: Hampton Press. Stewart, and Green, 1998). In English, one contrastive
Washington, J., and Craig, H. (1994). Dialectal forms during pattern is the copula/auxiliary be. In standard dialects of
discourse of poor, urban, African American preschoolers. English, overt marking of be is obligatory in utterances
Journal of Speech and Hearing Research, 37, 816–823. such as ‘‘you are walking.’’ In other dialects, such as
Washington, J., and Craig, H. (1998). Socioeconomic status African-American English (AAE) and Southern White
and gender influences on children’s dialectal variations. English (SWE), overt marking of be in this context is
Journal of Speech and Hearing Research, 41, 618–626. optional. As a result, ‘‘you walking’’ and ‘‘you are
Wyatt, T. (1991). Linguistic constraints on copula be production
walking’’ are both felicitous in these dialects. The use of
in Black English child speech. Doctoral dissertation, Uni-
versity of Massachusetts, Amherst. contrast analysis when working with language sample
data is an example of an alternative assessment method
that treats the contrastive patterns of di¤erent dialects as
normal (McGregor et al., 1997). With this method, the
only language patterns that can be viewed as errors are
Dialect Versus Disorder those that cannot occur in the child’s dialect.
Noncontrastive patterns are those that do not show
surface variation across di¤erent dialects of a language
In 1983, the American Speech-Language-Hearing Asso- (Seymour, Bland-Stewart, and Green, 1998). One non-
ciation (ASHA) published a position statement on the contrastive pattern of English is S-V-O word order
topic of social dialects. A major point of the publication (Martin and Wolfram, 1998). This pattern is thought to
was to formally recognize the di¤erence between lan- be noncontrastive because all dialects of English thus far
guage variation that is caused by normal linguistic pro- have been shown to present this word order. Other pat-
cesses (i.e., dialects) and variation that is caused by an terns thought to be noncontrastive in English include
atypical or disordered language system (i.e., language various forms of complex syntax that make sentential
impairment). Through the position statement, ASHA coordination and subordination possible. An example of
a language assessment method that restricts the analysis trastive patterns of AAE has been shown to be relatively
to the noncontrastive patterns of dialects is Stockman’s unrelated to the length and complexity indices. For ex-
(1996) Minimal Competency Core (MCC) analysis. The ample, Jackson and Roberts (2001) report correlations
goal of MCC analysis is to identify and then evaluate a between children’s use of contrastive AAE patterns and
common core of language that can be found in multiple their utterance length and complexity scores to be at or
dialects of a language. As a criterion-referenced proce- below .11.
dure, MCC specifies a minimum level of competency for Oetting, Cantrell, and Horohov (1999) also examined
each language pattern and each age level examined. One the e¤ect of contrastive dialect forms on standard cal-
of the language items included in MCC is a child’s mean culations of utterance length and utterance complexity.
length of utterance (MLU). For English-speaking 3- The participants in their study were children who spoke
year-olds, Stockman (1996) sets the minimum MLU at a rural Louisiana version of SWE, and they ranged in
3.27 morphemes. She also lists 15 consonants in the age from 4 to 6 years. Approximately one-third of the
initial position and a set of semantic expressions and children were classified as specifically language impaired
pragmatic functions that all 3-year-olds should be able (SLI); the others were classified as normal. Three
to demonstrate, regardless of the English dialect they language indices, MLU, developmental sentence score
use. Another example of a relatively new assessment (DSS), and Index of Productive Syntax (IPSyn), were
tool that targets the noncontrastive features of dialects is evaluated. To examine the e¤ect of the contrastive
that formulated by Craig, Washington, and Thompson- forms, scores for MLU, DSS, and IPSyn were calculated
Porter (1998b), which uses Wh- questions and passive twice for each child, once using samples that contained
probes. utterances with contrastive forms and once using the
All of these advances treat the contrastive patterns same samples with the contrastive utterances removed.
of dialects as problematic for diagnostic purposes. The Results indicated that the diagnostic classification of
problem, as articulated by Seymour, Bland-Stewart, and each child as either normal or SLI was the same, re-
Green (1998), is that some contrastive patterns of some gardless of whether utterances with the contrastive forms
nonstandard English dialects can look very similar to were included or excluded.
those that are produced by standard English-speaking Recent studies also suggest that grammatical weak-
children who have a language impairment. Some of the nesses of children with SLI can be identified within the
surface patterns that are generated by both language contrastive forms. In addition to evaluating indices of
learning conditions include zero marking of be (e.g., utterance length and complexity, Oetting, Cantrell, and
‘‘you walking’’), zero marking of past tense (e.g., ‘‘yes- Horohov (1999) examined the grammatical profile of
terday she fall’’), and zero marking of third person SLI within the context of nine contrastive forms of SWE.
(e.g., ‘‘today he walk’’). Seymour, Bland-Stewart, and For five of the forms (i.e., third person regular, con-
Green refer to these patterns and other contrastive tractible copula, contractible auxiliary, uncontractible
forms as presenting a diagnostic conundrum because auxiliary, and auxiliary does), the children with SLI were
interpretations of their use as markers of either a normal found to present rates of overt marking that were lower
dialect or a grammatical impairment are di‰cult. These than those of their SWE-speaking age-matched and lan-
authors also state that the exclusion of the contrastive guage-matched peers. In a second study, Oetting and
patterns within assessment is necessary only until more is McDonald (2001) examined the grammatical weak-
known about children’s acquisition and use of these nesses of SLI in the context of two nonstandard dialects,
patterns. As more research is completed, new methods SWE and a rural Louisiana version of AAE. In this
that include the contrastive patterns should be made study, 35 di¤erent contrastive patterns were coded. Dif-
possible. ferences between the normal children and those with SLI
Recently, research has begun to focus on children’s were identified for 14 of the contrastive patterns. A full
acquisition and use of the contrastive patterns of dia- model discriminant function that involved counts of all
lects. Findings from some of these studies suggest that 35 patterns resulted in 90% of the children being cor-
these particular language forms may not be as prob- rectly classified as either normal or impaired. Stepwise
lematic as they first seemed. For example, at least four analyses yielded slightly di¤erent discriminant functions
studies have examined the e¤ect of these patterns on for identifying children with SLI in the SWE dialect as
standard calculations of children’s average utterance compared to the AAE dialect, but both models included
length and utterance complexity. Each of these studies language forms needed to formulate questions and
has shown children’s use of the contrastive patterns to mark tense. The finding that both dialect groups with
play a minimal role within these calculations. For three SLI were shown to have trouble with these two areas of
of the studies, the focus has been on the contrastive grammar is consistent with SLI studies that have been
patterns of AAE (Craig, Washington, and Thompson- completed with standard English speakers, nonstandard
Porter, 1998a; Jackson and Roberts, 2001; Smith, Lee, English speakers, and speakers of languages other than
and McDade, 2001). The participants in these studies English (e.g., Rice, Wexler, and Hershberger, 1998; Sey-
have ranged in age from 3 to 9 years. Measures of length mour, Bland-Stewart, and Green, 1998; Craig and
have been calculated on utterances, C-units, and T-units, Washington, 2000; Paradis and Crago, 2000).
and measures of complexity have involved counts of A few recent studies also have examined the devel-
complex syntax. In every case, children’s use of the con- opmental tragectories of particular contrastive patterns
Dialect Versus Disorder 299
in isolation (Wyatt, 1996; Henry et al., 1997; Jackson, glish. In E. Hughes, M. Hughes, and A. Greenhill (Eds.),
1998; Burns et al., 1999; Wynn, Eyles, and Oetting, Proceedings of the 21st Annual Boston University Conference
2000; Ross, Oetting, and Stapleton, in press). The con- on Language Development (pp. 269–280). Sommerville,
trastive patterns examined in these studies have included MA: Cascadilla Press.
Jackson, J. (1998). Linguistic aspect in African-American En-
aspectual be and preterite had þ Ved in AAE, copula
glish speaking children: An investigation of aspectual ‘be.’
be in AAE and SWE, and negative concord in Bristol Doctoral dissertation, University of Massachusetts,
English and Belfast English. Each of these studies has Amherst.
shown normally developing children to be remarkably Jackson, S., and Roberts, J. (2001). Complex syntax produc-
capable of learning the distributional properties of their tion of African American preschoolers. Journal of Speech,
native dialect. This finding occurs even when children Language, and Hearing Research, 44, 1083–1096.
who speak di¤erent dialects of a language live in the Martin, S., and Wolfram, W. (1998). The sentence in African
same community and attend the same schools (Wynn, American Vernacular English. In S. Salikoko, J. Rickford,
Eyles, and Oetting, 2000; Ross, Oetting, and Stapleton, G. Bailey, and J. Baugh (Eds.), African American Vernacu-
in press). Of the studies listed that also included chil- lar English: Structure, history, and use (pp. 11–37). New
York: Routledge.
dren with SLI, some group di¤erences (normal versus
McGregor, K., Williams, D., Hearst, S., and Johnson, A.
impaired) have been identified, but the nature of these (1997). The use of contrastive analysis in distinguishing
di¤erences warrants further study. di¤erence from disorder: A tutorial. American Journal of
Understanding the ways in which a childhood lan- Speech Language Pathology, 6, 45–56.
guage impairment manifests within the contrastive and Nelson, N. (1991). Black English sentence scoring. Unpublished
noncontrastive forms of di¤erent dialects is a topic of manuscript, Western Michigan University, Kalamazoo.
ongoing study. Additional work also is needed to extend Oetting, J., Cantrell, J., and Horohov, J. (1999). A study
the study of childhood language impairment to other of specific language impairment (SLI) in the context of
language-learning situations. Two such situations are nonstandard dialect. Clinical Linguistics and Phonetics, 13,
bilingual language acquisition and second language 25–44.
Oetting, J., and McDonald, J. (2001). Nonmainstream dialect
learning. Until this research is completed, our tools for
use and specific language impairment. Journal of Speech,
identifying children with language weaknesses and our Language, and Hearing Research, 44, 207–223.
understanding of language impairment as a construct Paradis, J., and Crago, M. (2000). Tense and temporality: A
will remain limited. comparison between children learning a second language
and children with SLI. Journal of Speech, Language, and
—Janna B. Oetting
References Rice, M., Wexler, K., and Hershberger, S. (1998). Tense over
time: The longitudinal course of tense acquisition in chil-
ASHA Committee on the Status of Racial Minorities. (1983). dren with specific language impairment. Journal of Speech,
Position paper: Social dialects and implications of the posi- Language, and Hearing Research, 41, 1412–1431.
tion on social dialects. ASHA, 25, 23–27. Ross, S., Oetting, J., and Stapleton, B. (in press). Preterite had-
Burns, F., Paulk, C., Johnson, V., and Seymour, H. (1999). Ved: A narrative discourse structure of AAE. American
Constraint analysis of typical and impaired African American Speech.
English-speaking children. Paper presented at the annual Ryner, P., Kelly, D., and Krueger, D. (1999). Screening low-
convention of the American Speech, Language, and Hear- income African American children using the BLT-2S and
ing Association, San Francisco. the SPELT-P. American Journal of Speech Language Pa-
Craig, H., and Washington, J. (2000). An assessment battery thology, 8, 44–52.
for identifying language impairments in African American Seymour, H., Bland-Stewart, L., and Green, L. (1998). Di¤er-
children. Journal of Speech, Language, and Hearing Re- ence versus deficit in child African American English. Lan-
search, 43, 366–379. guage, Speech, and Hearing Services in Schools, 29, 96–
Craig, H., Washington, J., and Thompson-Porter, C. (1998a). 108.
Average C-unit lengths in the discourse of African Ameri- Smith, T., Lee, E., and McDade, H. (2001). An investigation of
can children from low-income, urban homes. Journal of T-units in African American English-speaking and standard
Speech, Language, and Hearing Research, 41, 433–444. American English-speaking fourth-grade children. Commu-
Craig, H., Washington, J., and Thompson-Porter, C. (1998b). nication Disorders Quarterly, 22, 148–157.
Performances of young African American children on two Stockman, I. (1996). The promises and pitfalls of language
comprehension tasks. Journal of Speech, Language, and sample analysis as an assessment tool for linguistic mi-
Hearing Research, 41, 445–457. nority children. Language, Speech, and Hearing Services in
Fagundes, D., Haynes, W., Haak, N., and Moran, M. (1998). Schools, 27, 355–366.
Task variability e¤ects of the language test performance of Wilson, W., Wilson, J., and Coleman, T. (2000). Culturally
southern lower socioeconomic class African American and appropriate assessment: Issues and strategies. In T. Cole-
caucasian five-year-olds. Language, Speech, and Hearing man (Ed.), Clinical management of communication disorders
Services in Schools, 29, 148–157. in culturally diverse children (pp. 101–127). Boston: Allyn
Garn-Nunn, P., and Perkins, L. (1999). Appalachian English and Bacon.
and standardized language testing: Rational and recom- Wyatt, T. (1996). The acquisition of the African American
mendations for test adaption. Contemporary Issues in Com- English copula. In A. Kamhi, K. Pollock, and J. Harris
munication Sciences and Disorders, 26, 150–159. (Eds.), Communication development and disorders in Afri-
Henry, A., Maclaren, R., Wilson, J., and Finlay, C. (1997). can American children (pp. 95–116). Baltimore: Paul H.
The acquisition of negative concord in non-standard En- Brooks.
Wynn, C., Eyles, L., and Oetting, J. (2000). Nonmainstream mon everyday routines, such as going to a restaurant
dialect use and copula be: Analysis of linguistic constraints. or making a sandwich. Knowledge structures also in-
Paper presented at the annual convention of the American clude knowledge of common patterns of conversational
Speech, Language, and Hearing Association, Washington, exchange, such as question-answer sequences (Scheglo¤,
DC.
1980).
Discourse analyses di¤er primarily in the degree to
Further Reading which they focus on the relative contributions of text,
Wolfram, W., Adger, C., and Christian, D. (1999). Dialects in shared knowledge, and context. One influential model
schools and communities. Mahwah, NJ: Erlbaum. (van Dijk and Kintsch, 1983) is quite detailed in its ac-
count of the transformation of semantic content into
cognitive information content. It represents the seman-
Discourse tics of a text as a set of propositions. These propositions
display coherence through inference at a local level, as
microstructures, and at a global level, as macrostructures
The term discourse is applied to language considerations that represent the topic or gist of the text. The micro-
beyond the boundaries of isolated sentences, although a and macrostructures constitute the text base, which is
discourse in its simplest form may be manifested as a integrated with shared knowledge. The product of that
single utterance in context, such as ‘‘Children at play.’’ integration is a representation of the events depicted in
Discourse studies emerge from a variety of disciplines, the text, conceptualized as the situation model. Thus, this
with major contributions from linguistics and psychol- is a model of discourse as a process involving transfor-
ogy. Linguists have been motivated primarily by the mation of information.
desire to explain phenomena that cannot be accounted Linguists have also contributed greatly to our under-
for at the word and sentence levels, such as reference standing of the various means by which discourse is
or given/new information, while psychologists have coherently organized above the level of the sentence, and
emphasized strategic processes and the role of cognitive how the surface features of a text contribute to these
factors, such as memory, in the production and compre- higher levels of organization (Halliday, 1985). The re-
hension of discourse. This entry focuses on seminal lin- search has centered largely on narrative, in part because
guistic research areas that have had a strong influence on of its universality as a genre and the fundamental nature
the field. of the temporal organization on which it is based. The
A construct that defines the nature of discourse study, extensive research on narrative has historical roots in
regardless of discipline or perspective, is coherence. A Propp’s (1928/1968) analysis of folk tales (from the
discourse is coherent when it ‘‘hangs together,’’ or makes field of rhetoric) and Bartlett’s (1932) study of narrative
sense. This notion of coherence pervades the approaches remembering (from the field of psychology).
of a variety of discourse analysts. With some, it is used as In linguistics, an early and influential framework for
a technical term in its own right. With others, it forms an analyzing narrative structure was provided by Labov
underlying attribute of other constructs. Despite di¤er- and Waletzky (1967) and extended in Labov’s later
ences in terminology or focus, discourse approaches are work (1972). The model is organized around the role of
similar in their analysis of an organization that super- sentential grammar in discourse-level structure. In this
sedes any single sentence or utterance. Thus, the main framework, the verbal sequence of clauses is matched to
goal of discourse analysis is to di¤erentiate discourse the sequence of events that occurred, as the means by
from random sequences of sentences or utterances. which past experience is recapitulated in the narrative.
Discourse models generally assume that discourse co- The overall structure of the narrative progresses from
herence is realized through the integration of a variety of orientation to complicating action to resolution. An
resources: the information contained in the text, shared additional component of the overall structure is evalua-
knowledge, and the relevant features of the situation in tion, which is expressed through a wide range of lexico-
which the text is embedded. Thus, linguistic form and grammatical devices. Thus, this work established a
meaning alone are insu‰cient for discourse comprehen- fundamental approach to analyzing how an event
sion and production. For that reason, discourse para- sequence is realized in linguistic form. Contemporary
digms represent a dramatic shift from more traditional adaptations of this seminal approach to narrative analy-
linguistic pursuits that focus exclusively on linguistic sis are numerous (Bamberg, 1997).
forms in isolation. Work on textual cohesion (Halliday and Hasan, 1976)
The knowledge structures that contribute to the provides another view of the relationship between dis-
formation of coherence are thought to be varied, con- course organization and its component linguistic units.
ventional, and di¤erentiated from each other by the Halliday and Hasan define cohesion as ‘‘the set of pos-
kind of information they contain. For instance, story sibilities that exist in the language for making text hang
schema (Mandler, 1984) or superstructure (van Dijk and together [as a larger unit]’’ (p. 18). Cohesive ties, which
Kintsch, 1983) represent knowledge of the way in which are surface features of text, provide the relevant semantic
events unfold in a story (narrative), and how a story relation between pieces of the text. These ties can be
typically begins and ends. Script knowledge (Schank and lexical or grammatical and include devices such as ref-
Abelson, 1977) specifies the sequence of steps in com- erence, conjunction, and ellipsis. Thus, cohesion is simi-
Discourse 301
lar to coherence in that it is a relational concept; text of a macro-speech act, which consists of sequences of
is cohesive when it is coherent with respect to itself. The speech acts that function socially as one unit. The use
notion of cohesion has been widely applied, especially in of these speech acts during turn-taking in conversation
the area of reference. is also rule-governed (Sacks, Scheglo¤, and Je¤erson,
Linguistic accounts of discourse genre are another 1974; Scheglo¤, 1982). Another important pragmatic
means of analyzing how discourse can be coherent. Al- framework that guides the overall coherence of infor-
though narrative has been the most extensively studied mation exchange is that of a cooperative principle (Grice,
genre, Longacre (1996) provides a typology of various 1975). It subsumes the maxims of quantity, quality, rela-
discourse genres (e.g., procedural, expository, narrative) tion, and manner, which guide the amount, clarity, and
in terms of both underlying knowledge structure and relevance of information required in conversations
their linguistic realization. In his framework, discourse is between interlocutors. Finally, some of the most recent
classified by the nature of the relationship between the and socially relevant work in conversational discourse
events and doings in the discourse, the nature of refer- extends the notion of context by focusing strongly on
ence to agents in the discourse, and whether the events context in a variety of social settings. In this research,
happened in the past or are not yet realized. As is typical context is specified broadly to include participants and
of linguistic approaches, Longacre further specifies the their relative roles in particular societal settings in a
surface linguistic characteristics of each discourse type, given culture (e.g., Tannen, 1994).
such as the types of tense and aspect markings on the The advantage of discourse analysis lies in its poten-
verb, the typical forms of personal reference, and the tial to address both linguistic and cognitive factors
nature of the linkage between sentences for each dis- underlying a range of normal and disordered communi-
course genre. For instance, he specifies how the setting of cation performance.
narrative usually contains stative verbs, while the com- See also discourse impairments.
plicating action contains action verbs. He further ana-
lyzes how the peak or climax of the story can be marked —Hanna K. Ulatowska and Gloria Streit Olness
through devices such as shifts in tense or changes in
length and syntactic complexity (Longacre, 1981). References
An especially detailed account of how thinking is
transformed into language is found in cognitive- Austin, J. L. (1965). How to do things with words. New York:
linguistic work using narratives (Chafe, 1980). Chafe’s Oxford University Press.
framework addresses ways in which the flow of thought Bamberg, M. G. W. (Ed.). (1997). Oral versions of personal
is matched to units of language during the process of experience: Three decades of narrative analysis. Journal of
verbalization. Not only does he address lexicosyntactic Narrative and Life History, 7. [Special issue]
contributions to discourse formation, he also considers Bartlett, F. (1932). Remembering. Cambridge, U.K.: Cam-
the way in which syntax and intonation interact in dis- bridge University Press.
Chafe, W. L. (1980). The deployment of consciousness in the
course production. production of a narrative. In Advances in discourse pro-
Several of the discourse theorists explore the com- cesses: Vol. 4. The Pear stories: Cognitive, cultural, and
monality and variability of narrative features across linguistic aspects of narrative production (pp. 9–50) (R. O.
languages and cultures. These pursuits reflect an interest Freedle, series ed., and W. L. Chafe, vol. ed.). Norwood,
in universality that is pervasive in the field of linguistics NJ: Ablex.
in general. Contributions in this area include studies of Grice, H. P. (1975). Logic and conversation. In P. Cole and J.
cross-linguistic di¤erences in expression of the basic fea- Morgan (Eds.), Syntax and semantics: Vol. 3. Speech acts
tures of narrative, such as verbs or the marking of refer- (pp. 41–58). New York: Academic Press.
ence (Longacre, 1996); ethnic linguistic devices used Halliday, M. A. K. (1985). An introduction to functional gram-
in the various narrative components, such as evaluation mar. London: Edward Arnold.
Halliday, M. A. K., and Hasan, R. (1976). Cohesion in English.
(Labov, 1972); variations in linguistic expression both London: Longman.
within individuals and across individuals from di¤er- Labov, W. (1972). Language in the inner city: Studies in the
ent cultures (Chafe, 1980); and cultural presuppositions Black English Vernacular. Philadelphia: University of
reflected in the content of narratives (Polanyi, 1989). Pennsylvania Press.
Conversational discourse represents a discourse type Labov, W., and Waletzky, J. (1967). Narrative analysis: Oral
very di¤erent from the other discourse genres. Research versions of personal experience. In J. Helm (Ed.), Essays of
on conversational discourse emphasizes the role of con- the verbal and visual arts: Proceedings of the 1996 Annual
text and social interaction (e.g., Schi¤rin, 1994). A basic Spring Meeting of the American Ethnological Society (pp.
unit of analysis for this genre is the speech act, a con- 12–44). Seattle: University of Washington Press.
struct derived from early work in the philosophy of Longacre, R. E. (1981). A spectrum and profile approach to
discourse analysis. Text, 1, 337–359.
language (Austin, 1965; Searle, 1969). Speech acts are Longacre, R. E. (1996). The grammar of discourse (2nd ed.).
utterances defined by their pragmatic functions, such as New York: Plenum Press.
making statements, asking questions, making promises, Mandler, J. M. (1984). Stories, scripts and scenes: Aspects of
and giving orders. The sequence of speech acts can dis- schema theory. Hillsdale, NJ: Erlbaum.
play a coherence that extends beyond any one speech Polanyi, L. (1989). Telling the American story. Cambridge,
act. On this basis, van Dijk (1981) proposes his notion MA: MIT Press.
Propp, V. (1968/1928). Morphology of the folktale (L. Scott, Scheglo¤, E. A. (1990). On the organization of sequences as a
trans.). Austin: University of Texas Press. source of ‘‘coherence’’ in talk-in-interaction. In B. Dorval
Sacks, H., Scheglo¤, E. A., and Je¤erson, G. (1974). A sim- (Ed.), Conversational organization and its development
plest systematics for the organization of turn-taking in con- (pp. 51–77). Norwood, NJ: Ablex.
versation. Language, 50, 696–735. Schi¤rin, D. (1994). Approaches to discourse. Oxford, U.K.:
Schank, R. C., and Abelson, R. P. (1977). Scripts, plans, goals, Blackwell.
and understanding. Hillsdale, NJ: Erlbaum. Tannen, D. (Ed.). (1984). Coherence in spoken and written dis-
Scheglo¤, E. A. (1980). Preliminaries to preliminaries: ‘‘Can course. Norwood, NJ: Ablex.
I ask you a question?’’ Sociological Inquiry, 50, 104–152. Tannen, D. (1984). Conversational style. Norwood, NJ: Ablex.
Scheglo¤, E. A. (1982). Discourse as an interactional achieve- van Dijk, T. A. (1972). Some aspects of text grammars. The
ment: Some uses of ‘‘uh huh’’ and other things that come Hague: Mouton.
between sentences. In D. Tannen (Ed.), Georgetown Uni- van Dijk, T. A. (1977). Text and context: Explorations in
versity Round Table on Languages and Linguistics. Analyz- the semantics and pragmatics of discourse. New York:
ing discourse: Text and talk (pp. 71–93). Washington, DC: Longman.
Georgetown University Press. van Dijk, T. A. (1980). Macrostructures. The Hague: Mouton.
Schi¤rin, D. (1994). Approaches to discourse. Cambridge, MA: van Dijk, T. A. (1981). Studies in the pragmatics of discourse.
Blackwell. The Hague: Mouton.
Searle, J. (1969). Speech acts. Cambridge, U.K.: Cambridge van Dijk, T. A. (Ed.). (1997). Discourse as structure and pro-
University Press. cess. Thousand Oaks, CA: Sage.
Tannen, D. (1994). Gender and discourse. New York: Oxford Wierzbicka, A. (1985). Di¤erent cultures, di¤erent languages,
University Press. di¤erent speech acts. Journal of Pragmatics, 9, 145–163.
van Dijk, T. A. (1981). Studies in the pragmatics of discourse.
The Hague: Mouton.
van Dijk, T. A., and Kintsch, W. (1983). Strategies of discourse
comprehension. New York: Academic Press. Discourse Impairments
Further Readings
The history of aphasiology dates back more than a cen-
Brown, G., and Yule, G. (1983). Discourse analysis. Cam- tury, but impairments of discourse abilities have only
bridge, U.K.: Cambridge University Press. recently been described. Whereas changes in discourse
Bruner, J. (1991). The narrative construction of reality. Critical abilities have always been part of the qualitative
Inquiry, 18, 1–21. description of language following a brain lesion, the
Chafe, W. L. (1979). The flow of thought and the flow of lan-
conceptual frameworks needed to identify impaired
guage. In T. Givón (Ed.), Syntax and semantics: Vol. 12.
Discourse and syntax (pp. 159–182). New York: Academic components of discourse in brain-damaged individuals
Press. have been available only since the late 1970s (Joanette
Clark, H., and Haviland, J. (1977). Comprehension and the and Brownell, 1990). Initial descriptions of discourse
given-new contract. In R. O. Freedle (Ed.), Discourse pro- impairments essentially referred to traditional linguistic
cesses: Advances in research and theory. Vol. 1. Discourse indicators, such as the noun-verb ratio or the percentage
production and comprehension (pp. 1–40). Norwood, NJ: of subordinate clauses (e.g., Berko-Gleason et al., 1980;
Ablex. Obler and Albert, 1984). With increasing knowledge
Gee, J. P. (1986). Units in the production of narrative dis- about the organization of the meaning conveyed in dis-
course. Discourse Processes, 9, 391–422. course, more specific descriptors have been introduced,
Gee, J. P. (1991). A linguistic approach to narrative. Journal of
such as coherence (e.g., Irigaray, 1973) or T-units (e.g.,
Narrative and Life History, 1, 15–39.
Grice, H. P. (1957). Meaning. Philosophical Review, 67, 377–388. Ulatowska et al., 1983). However, those concepts and
Grimes, J. E. (1975). The thread of discourse. The Hague: descriptors were not connected with a broader concep-
Mouton. tual framework of discourse. Only recently have general
Gumperz, J. (1982). Discourse strategies. Cambridge, U.K.: integrative discourse models made it possible to link
Cambridge University Press. these various discourse components and to capture the
Kintsch, W. (1988). The role of knowledge in discourse com- di¤erent levels of cognitive processing needed in order to
prehension: A constructive-integration model. Psychological convey or understand verbal communication. This arti-
Review, 85, 363–394. cle summarizes discourse impairments associated with
Linde, C. (1993). Life stories: The creation of coherence. Ox- di¤erent pathological conditions with reference to these
ford, U.K.: Oxford University Press.
Martin, J. R. (1992). English text: System and structure.
integrated frameworks.
Amsterdam: Benjamins.
Ochs, E. (1979). Planned and unplanned discourse. In T. Givón Levels of Discourse Processing
(Ed.), Syntax and semantics: Vol. 12. Discourse and syntax Discourse is a set of utterances aimed at conveying a
(pp. 51–80). New York: Academic Press.
message among interlocutors. It can take many forms,
Ochs, E. (1997). Narrative. In T. A. van Dijk (Ed.), Discourse
studies: A multidisciplinary introduction. Vol. 1. Discourse as such as narrative, argument, or conversation. Because
structure and process (pp. 185–207). Thousand Oaks, CA: it combines language components in a communicative
Sage. context, discourse may be the most elaborate linguistic
Sacks, H. (1992). Lectures on conversation. Cambridge, MA: activity. The complexity of this activity can be captured
Blackwell. through multilevel models, such as that proposed by
Discourse Impairments 303
Carl Frederiksen (Frederiksen et al., 1990), in which with right hemisphere damage, traumatic brain injury,
each level of processing can be analyzed separately. or early-stage Alzheimer’s disease.
Selective impairments of these levels, leading to distinct Few studies have looked for impairments at the
discourse patterns, may be used to di¤erentiate among microstructural level. In one such study, Joanette et al.
various adult populations with neurological disorders. (1986) showed that both patients with right hemisphere
Seminal work by Kintsch and van Dijk (1978) largely damage and normal controls produced discourse with
inspired current integrated discourse models (for a similar microstructure. Stemmer and Joanette (1998)
review, see Mross, 1990). According to these models, confirmed this observation but found that individuals
discourse processing—production or comprehension— with left hemisphere damage tended to produce more
results from a number of cognitive operations that take fragmented micropropositions, lacking in arguments.
place on four levels of representation: This resulted in a disruption of the connective structure
of discourse, which requires predicates and arguments
Surface level—Traditional linguistic units such as
to be connected in order to form a semantic network of
phonemes or graphemes, morphemes, words, and their
propositions. Numerous other studies have looked at
combination into sentences constitute the surface level.
cohesion, which can be considered representative of the
Impairments at this level are described elsewhere in
microstructural level. Cohesion refers to the quality of
this book.
local relationships between the elements of discourse and
Semantic level—Concepts expressed in discourse,
is frequently expressed through linguistic markers such
along with the links between them, constitute the se-
as pronouns and conjunctions. Patients with traumatic
mantic level of processing. The smallest semantic unit
brain injury, dementia, and in some cases right hemi-
is the microproposition, which is made up of a predi- sphere damage have been reported to produce incohe-
cate (typically expressed by verbs or prepositions) and
sive discourse typically characterized by the use of vague
one or more arguments (typically expressed by nouns).
words or pronouns without clear referents. The lack
Discourse meaning can thus be represented as a se-
of cohesion prevents the interlocutor from knowing
mantic network made up of a list of hierarchically
what the speaker is talking about. In an incohesive
related micropropositions. The main ideas of a dis-
discourse, micropropositions are also incomplete, since
course can be represented by macropropositions. On
arguments are neither present nor identifiable, thus
the receptive side, these macropropositions are con-
leading to a break in local coherence. Because local
structed by applying rules in order to condense, elimi-
coherence is not established, discourse with cohesive
nate, or generalize micropropositions. The latter are
problems can be viewed as a disconnected or incomplete
related through logical, inferential, or pragmatic links semantic network.
to the world depicted by the discourse. Mirror pro-
The relationships among the elements of discourse
cessing stages allow one to go from the main ideas of
form the macrostructure of a discourse. At this level, a
communicative intent to micropropositional discourse. logical progression of ideas ensures the global coherence
The semantic level of discourse largely depends on the
of the discourse. Several authors have reported impair-
individual’s semantic memory (general knowledge) and
ments at this level in the narrative discourse of patients
is independent of the linguistic (surface) level.
with right brain damage, dementia, and traumatic brain
Situational level—The processing of micropropositions
injury (Nicholas et al., 1985; Glosser, 1993; Coelho, Liles,
and the relations among them leads to the construc-
and Du¤y, 1994; Ehrlich, 1994). Frequently the problem
tion of a situation model based on the subject’s world
lies in the absence of principal ideas. Such discourse is
knowledge. The situational level corresponds to the
incomplete and di‰cult to interpret. Another problem
representation of the situation or events depicted in the
occurs when discourse contains unexpected information,
discourse constructed by the interlocutor.
in which case it is referred to as tangential. In such cases
Structural level—Finally, the structural level corre- the links among ideas are not explicit and do not seem
sponds to the sequential and temporal organization of
logical; this may happen with right-hemisphere-damaged
meaning units in a discourse. This level is known as the
individuals. Often, individuals who produce tangential
structure of a discourse and is identified as the dis- discourse do not stay with the topic and jump from one
course schema, script, or frame. It is at this level that
subject to another. Such behavior may be attributed to
distinctions among narrative, argumentative, proce-
some pragmatic inability to take the interlocutor’s point
dural, or conversational discourse can be made.
of view into account and establish a common reference
Impairments of Discourse by Level of (Chantraine, Joanette, and Ska, 1998).
Processing Discourse processing also requires the elaboration of
a situation model. The presence of such a model testifies
Surface-level impairments constitute the essence of to the fact that the individual is able to make inferences
aphasia as traditionally defined: phoneme, morpheme, by bridging several pieces of information. This ability
word, and sentence impairments. The presence of is frequently impaired in patients with di¤use lesions
impairments at this level usually makes it di‰cult to (traumatic brain injury and dementia) or right brain
appreciate other levels of discourse. This explains why damage. In some cases, patients’ understanding is partial
discourse impairments are most likely to be noticed in and remains at a superficial level. If such patients per-
individuals without surface-level deficits, such as those ceive contradictions, they often try to resolve them by
invoking a plausible explanation from their own experi- De Santi, and J. S. Ehrlich (Eds.), Discourse analysis and
ence rather than from the information included or pre- applications: Studies in adult clinical populations (pp. 81–
supposed in the discourse. The di‰culty in integrating 94). Hillsdale, NJ: Erlbaum.
information into a situation model has been proposed Ehrlich, J. S. (1994). Studies of discourse production in adults
with Alzheimer’s disease. In R. L. Bloom, L. K. Obler, S.
as an explanation for the inability to understand jokes,
De Santi, and J. S. Ehrlich (Eds.), Discourse analysis
metaphors, or indirect requests that has been reported and applications: Studies in adult clinical populations (pp.
in patients with right brain damage and traumatic brain 149–160). Hillsdale, NJ: Erlbaum.
injury. In the case of dementia, it is thought to result Frederiksen, C. H., Bracewell, R. J., Breuleux, A., and
from degradation of the semantic system itself. Renaud, A. (1990). The cognitive representation and pro-
The structural level of discourse is the level at which cessing of discourse function and dysfunction. In Y. Joa-
information is organized with respect to a given script, nette and H. H. Brownell (Eds.), Discourse ability and brain
such as a narrative, which has to contain minimally a damage: Theoretical and empirical perspectives (pp. 69–
setting, a complication, and a resolution. Preservation of 110). New York: Springer-Verlag.
the text structure can guide the production or compre- Glosser, G. (1993). Discourse production patterns in neuro-
logically impaired and aged population. In H. H. Brownell
hension of discourse. Although text structure is consid-
and Y. Joanette (Eds.), Narrative discourse in neurologically
ered to be robust, patients with traumatic brain injury impaired and normal aging adults (pp. 191–212). San Diego,
and dementia have shown impairments at this level. For CA: Singular Publishing Group.
example, individuals with Alzheimer’s disease may omit Grafman, J. (1989). Plans, actions and mental sets: Managerial
some components of the narrative schema, even when knowledge units in the frontal lobes. In E. Perecman (Ed.),
pictures are provided to support their production (Ska Integrating theory and practice in clinical neuropsychology
and Guénard, 1993). Script deficits in discourse are (pp. 93–138). Hillsdale, NJ: Erlbaum.
thought to result from impairments similar to those af- Irigaray, L. (1973). Le langage des déments. The Hague: Mouton.
fecting routine activities of daily life through planning, Joanette, Y., and Brownell, H. H. (Eds.). (1990). Discourse
organizing, selecting, and inhibiting information (Graf- ability and brain damage: Theoretical and empirical per-
spectives. New York: Springer-Verlag.
man, 1989). Patients with frontal lesions, for example,
Joanette, Y., Goulet, P., Ska, B., and Nespoulous, J. L. (1986).
may exhibit di‰culties in the sequential ordering of and Informative content of narrative discourse in right brain-
hierarchical relations among actions belonging to a damaged right-handers. Brain and Language, 29, 81–105.
given script (Sirigu et al., 1995a, 1995b). Kintsch, W., and van Dijk, T. A. (1978). Toward a model of
Discourse abilities and their impairments constitute a text comprehension and production. Psychological Review,
privileged component of communication and language 85, 363–394.
that researchers can study in order to appreciate the Mross, E. F. (1990). Text-analysis: Macro- and microstructural
interaction between so-called linguistic and other com- aspects of discourse processing. In Y. Joanette and H. H.
ponents of cognition. Although long addressed in purely Brownell (Eds.), Discourse ability and brain damage: Theo-
descriptive terms, discourse impairments can now be retical and empirical perspectives (pp. 50–68). New York:
Springer-Verlag.
understood with reference to comprehensive models of
Nicholas, M., Obler, L. K., Albert, M. L., and Helm-
normal discourse processing. A description of discourse Estabrooks, N. (1985). Empty speech in Alzheimer’s disease
according to these models allows researchers to identify and fluent aphasia. Journal of Speech and Hearing Research,
the levels characteristically impaired in individuals with 28, 405–410.
particular brain lesions. The relationship between the Obler, L. K., and Albert, M. L. (1984). Language in aging. In
various levels of discourse impairment and the di¤erent M. L. Albert (Ed.), Neurology of aging (pp. 245–253). New
brain diseases or lesions is gradually becoming clearer. York: Oxford University Press.
The availability of such specific descriptions will lead to Sirigu, A., Zalla, T., Pillon, B., Grafman, J., Agid, Y., and
a better understanding of the communicative disability Dubois, B. (1995a). Selective impairments in managerial
a¤ecting individuals with discourse impairments and knowledge following pre-frontal cortex damage. Cortex, 31,
301–316.
should help clinicians develop strategies in order to help
Sirigu, A., Zalla, T., Pillon, B., Grafman, J., Agid, Y., and
a¤ected individuals overcome this disability. Dubois, B. (1995b). Encoding of sequence and boundaries
See also discourse. of script following prefrontal lesions. Cortex, 32, 297–310.
—Bernadette Ska, Anh Duong, and Yves Joanette Ska, B., and Guénard, D. (1993). Narrative schema in demen-
tia of the Alzheimer’s type. In H. H. Brownell and Y. Joa-
References nette (Eds.), Narrative discourse in neurologically impaired
and normal aging adults (pp. 299–316). San Diego, CA:
Berko-Gleason, G., Goodglass, H., Obler, L., Green, E., Hyde, Singular Publishing Group.
M., and Weintraub, S. (1980). Narrative strategies of apha- Stemmer, B., and Joanette, Y. (1998). The interpretation of
sic and normal speaking subjects. Journal of Speech and narrative discourse of brain-damaged individuals within
Hearing Research, 23, 370–382. the framework of a multilevel discourse model. In M. Bee-
Chantraine, Y., Joanette, Y., and Ska, B. (1998). Conversa- man and C. Chiarello (Eds.), Right hemisphere language
tional abilities in patients with right hemisphere damage. comprehension: Perspectives from cognitive neuroscience
Journal of Neurolinguistics, 11, 21–32. (pp. 329–348). Mahwah, NJ: Erlbaum.
Coelho, C. A., Liles, B. Z., and Du¤y, R. J. (1994). Cognitive Ulatowska, H. K., Freedman-Stern, R., Weiss Doyal, A., and
framework: A description of discourse abilities in traumati- Maccaluso-Haynes, S. (1983). Production of narrative dis-
cally brain-injured adults. In R. L. Bloom, L. K. Obler, S. course in aphasia. Brain and Language, 19, 317–334.
Functional Brain Imaging 305
Functional Brain Imaging distance imposed between MEG sensors and the brain.
Source localization in MEG is still limited by the non-
uniqueness of the inverse problem, which becomes
Several techniques are now available to study the func- increasingly troublesome as the number of signal gen-
tional anatomy of speech and language processing erators increases.
by measuring neurophysiological activity noninvasively. Most EEG and MEG studies in speech and language
This entry reviews the four dominant methods, electro- use an event-related potential (ERP) design. In such a
encephalography (EEG) and magnetoencephalography design, the onset of EEG recording is time-locked to the
(MEG), which measure the extracranial electromagnetic onset of an event—say, the presentation of a stimulus—
field, and positron emission tomography (PET) and and the resulting EEG response is recorded. Because the
functional magnetic resonance imaging (fMRI), which ERP signal is a small component of the overall EEG
measure local changes in blood flow associated with signal, the event of interest must be repeated several
active neurons. Each of these techniques has inherent times (up to 100), and the responses averaged. Another
strengths and weaknesses that must be taken into ac- increasingly popular use of electromagnetic responses
count when designing and interpreting experiments. involves mapping regional correlations (synchrony) in
EEG and MEG respectively measure the electrical oscillatory activity during cognitive and perceptual pro-
and magnetic field generated by large populations of cesses (Singer, 1999), which has been suggested to reflect
synchronously active neurons with millisecond tempo- cross-region binding.
ral resolution (Hamalainen et al., 1993; Nunez, 1995). Unlike the electromagnetic recording techniques,
Asynchronous activity cannot be easily detected because hemodynamic techniques such as PET and fMRI mea-
the signals produced by individual cells tend to cancel sure neural activity only indirectly (Villringer, 2000).
each other out rather than summing to produce a mea- The basic phenomenon underlying these methods is that
surable signal at sensors or electrodes outside the head. an increase in neural activity leads to an increase in
The bulk of EEG and MEG signals appear to be gen- metabolic demand for glucose and oxygen, which in turn
erated not by action potentials but by postsynaptic appears to be fed by a localized increase in cerebral
potentials in the dendritic trees of pyramidal cells. blood flow (CBF) to the active region. It is these hemo-
Although EEG has excellent temporal resolution, on dynamic reflections of the underlying neural activity that
the order of milliseconds, it is limited by poor spatial PET and fMRI measure, although in di¤erent ways.
resolution because of the smearing of the potentials by PET measures regional CBF (rCBF) in a fairly
the skull (Nunez, 1981). As a consequence, it is very straightforward manner (Cherry and Phelps, 1996):
di‰cult to identify the source of a signal from the distri- water (typically) is labeled with a radioactive tracer,
bution of electric potentials on the scalp. For any given oxygen 15; the radiolabeled material is introduced into
surface distribution, there are many possible source the bloodstream, typically intravenously; metabolically
distributions that might have produced the surface active regions in the brain have an increased rate of
pattern—thus, the inverse problem has no unique solu- blood delivery, and therefore receive a greater concen-
tion. This complication is particularly significant where tration of the radioactive tracer; the regional concentra-
there are multiple generators, as is often the case in tions of the tracer in the brain can then be measured
speech and language studies. The signals from di¤erent using a PET scanner, which detects the decay of the ra-
neural generators are mixed together in the potentials dioactive tracer. As the tracer material decays, positrons
recorded at scalp electrodes. are emitted from the radioactive nucleus and collide with
EEG measures the electrical field produced by syn- electrons. Such a collision results in annihilation of the
chronous neural activity; MEG measures the mag- positron and electron and the generation of two gamma
netic fields associated with these electric current sources. rays that travel away from the site of the collision in
There are important di¤erences, however, between opposite directions and exit the head. The PET scanner,
MEG and EEG signals. First, magnetic fields are unaf- which is composed of a ring of gamma ray sensors,
fected by the tissue they pass through, so there is far less detects the simultaneous arrival of two gamma rays on
distortion of the signal between the source and the sensor opposite sides of the sensor array, and from this infor-
in comparison to EEG (Hamalainen et al., 1993). Sec- mation the location of the collision site can be deter-
ond, because most MEG is a measure of only the radial mined. PET also measures other aspects of local energy
component of the magnetic field, MEG is e¤ectively metabolism using di¤erent labeled compounds, based on
blind to activity that occurs in cortical areas that are the same principle, namely, that the amount of the agent
oriented roughly parallel to the sensor (i.e., mostly gyral taken up is proportional to the local metabolic rate.
convexities). Conveniently for speech scientists, most Oxygen metabolism is measured with oxygen labeled
of human auditory cortex is buried inside the sylvian with oxygen 15, and glucose metabolism is measured
fissure, making MEG ideal for recording auditory or with a molecule similar to glucose called deoxyglucose
speech-evoked fields. MEG has a temporal resolution labeled with fluorine 18. The spatial resolution of PET
comparable to that of EEG. Theoretically, MEG has is ultimately limited by the average distance a positron
somewhat better spatial resolution than EEG because travels before it collides with an electron, which is in the
magnetic fields pass una¤ected through the tissues of the range of a few millimeters. In practice, however, a typi-
head, but this benefit is partly cancelled by the greater cal PET study has a spatial resolution of about 1 cm.
The temporal resolution of PET is poor, ranging from A related drawback is that it is often hard to distinguish
approximately 1 minute for oxygen-based experiments between a di¤erence in activation level and a di¤erence
to 30 minutes for glucose-based studies. in spatial distribution.
Typical PET experiments contrast rCBF maps gen- fMRI also is sensitive to hemodynamic changes, but
erated in two or more experimental conditions. For ex- not in the same way as PET. fMRI is based on a rather
ample, one might contrast the rCBF map produced by surprising physiological fact: when a region of brain is
listening to speech sounds with that produced in a rest- activated, both CBF and the metabolic rate of oxygen
ing baseline scan with no auditory input. Subtracting the increase, but the CBF increase is much larger. This
resting baseline map from the speech-sound activation means that the local venous blood is more oxygenated
map would yield a di¤erent map highlighting just those during activation, even though the metabolic rate of
brain regions that show a relative increase in metabolic oxygen has increased. The physiological significance of
activity during speech perception. Many studies attempt this is still not understood, but one possibility is that the
to isolate subcomponents of a complex process by using increased level of tissue oxygen is necessary to drive a
a variety of clever control conditions rather than a rest- higher oxygen flux into the mitochondria. The most
ing baseline. Whereas this general approach has yielded commonly used fMRI technique is sensitive to these
important insights, it must be used cautiously because it changes in the oxygen concentration of blood; this is the
makes several assumptions that may not hold true. One BOLD, or blood oxygenation level dependent, signal
of these, the ‘‘pure insertion’’ assumption, is that cogni- (Chen and Ogawa, 2000). The BOLD signal is intrinsic
tive operations are built largely of noninteracting stages, to the blood response, and so, unlike in PET, no radio-
such that manipulating one stage will not a¤ect proactive tracers are needed. A typical fMRI experiment
cesses occurring at another stage. This assumption has involves imaging the brain repeatedly, collecting a vol-
been seriously questioned, however (Sartori and Umilta, ume of images every few seconds for a period of several
2000). Another assumption of the subtraction method is minutes, during which time the participant is presented
that the component processes of interest have neural with alternating blocks of two (or more) stimulus or task
correlates that are to some extent modularly organized, conditions. Brain areas that are di¤erentially active in
and further that the modules are su‰ciently spatially one condition versus another will show a modulation of
distinct to be detected using current methods. In some the MR image intensity over time that correlates with
cases this assumption may be valid, but in others it may the stimulus (task) cycles. Under ideal conditions the
not be, so again, caution is warranted in interpreting spatial resolution of fMRI comes close to the size of a
results of subtraction-based designs. These issues arise in cortical column, although in most applications the reso-
fMRI designs as well. lution is closer to 3–6 mm (Kim et al., 2000). The tem-
Experimental designs that do not rely on subtraction poral resolution of fMRI is limited by the variability
logic are becoming increasingly popular. Correlational of the hemodynamic response. Under ideal conditions,
studies, for example, typically scan the participants fMRI appears to be capable of resolving stimulus onset
under several parametrically varied levels of a variable asynchronies in the range of a few hundreds of milli-
and look for rCBF patterns across scans that correlate seconds, and there is some indication that even better
with the manipulated variable. For example, one might temporal resolution (tens of milliseconds) is possible
look for brain regions that show systematic increases in (Bandettini, 2000). However, in most applications the
rCBF as a function of increasing memory load or of in- temporal resolution ranges from about 1 s to tens of
creasing rate of stimulus presentation. Alternatively, it is seconds, depending on the design.
possible in a between-subject design to look for correla- Because fMRI measures intrinsic signals, it is possible
tions between rCBF and performance on a behavioral to present the stimulus (task) conditions in short alter-
measure. nating blocks, or even as a series of individual events,
In order to increase signal-to-noise ratios in PET within a single scan, unlike a PET study (Aguirre, 2000).
studies, data from several participants are averaged. To Typical block design experiments present four to eight
account for individual di¤erences in brain anatomy, cycles of alternating blocks of di¤erent stimulus (task)
each participant’s PET scans are normalized to a stan- conditions per scan. Event-related fMRI designs, which
dard stereotaxic space and spatially smoothed prior to are modeled on ERP experiments, present stimuli indi-
averaging (Evans, Collins, and Holmes, 1996). Group vidually rather than in blocks. This a¤ords greater flexi-
averaged CBF maps are then overlaid onto normalized bility in experimental design: items consisting of di¤erent
anatomical MR images for spatial localization. Group conditions can be randomly intermixed to decrease
averaging does improve the signal-to-noise ratio, but it predictability of the upcoming items, and the blood
also has drawbacks. First, there is some loss of spatial responses to items can be sorted and averaged in a vari-
resolution. This is important, not just in terms of local- ety of ways, for example by accuracy or reaction time of
izing the precise site of an activation, but also in terms of simultaneously collected behavioral responses. The dis-
the ability to detect activations in the first place: spatially advantages of event-related designs include decreased
smaller activations are less likely to be detected than amplitude of the response due to shorter stimulus dura-
larger ones, even if they are equally robust, simply be- tions, and increased sensitivity to regional di¤erences in
cause there is a reduced likelihood of small activations response onset, which can provide better information
overlapping precisely in spatial location across subjects. in the temporal domain but also can make it di‰cult to
Inclusion Models for Children with Developmental Disabilities 307
model the hemodynamic response equally well across all Singer, W. (1999). Neurobiology: Striving for coherence.
activated brain regions. Nature, 397, 391–393.
fMRI is more sensitive than PET, allowing the detec- Villringer, A. (2000). Physiological changes during brain acti-
tion of reliable signals in individual subjects. Despite this vation. In C. T. W. Moonen and P. A. Bandettini (Eds.),
Functional MRI (pp. 3–13). New York: Springer.
distinct advantage, most fMRI analyses are modeled on
PET procedures, with spatial normalization of individ-
ual data sets, group averaging, and overlaying of acti-
vation maps onto normalized anatomical images. Also, Inclusion Models for Children with
as in PET experiments, most fMRI experiments utilize Developmental Disabilities
subtraction-based designs or correlational methods. Al-
though fMRI has several advantages over PET, it also
has several drawbacks related primarily to artifacts During 1998–99, 5,541,166 students with disabilities,
introduced into the signal from head motion, physiolog- or 8.75% of the school-age population ages 6–21 years,
ical noise (respiration, cardiac pulsations), and inhomo- received special education and related services under
geneities in the magnetic field coming from a variety Part B of the federal Individuals with Disabilities Edu-
of sources. Another drawback, particularly relevant for cation Act (IDEA) (U.S. Department of Education,
speech/language studies, is the greater than 100 dB noise 2000). IDEA specifies 13 disability categories based on
generated by the magnet during image acquisition. A etiological groupings. The largest single category of dis-
potentially promising solution to this latter problem ability served is specific learning disabilities (50.8%),
involves presenting auditory stimuli during silent periods with speech and language impairments the second
between image acquisition (Hall et al., 1999). largest category (19.4%). Children with mental retarda-
—Gregory Hickok tion account for 11.0%, while children with autism,
considered a low incidence disability, constitute 1% of
References those receiving special education and related services.
Since the original passage in 1975 of IDEA’s forerunner,
Aguirre, G. K. (2000). Experimental design for brain fMRI. In
C. T. W. Moonen and P. A. Bandettini (Eds.), Functional the Education for All Handicapped Children Act, the
MRI (pp. 369–380). New York: Springer. categorical model has served as the basis for determin-
Bandettini, P. A. (2000). The temporal resolution of functional ing who qualifies for special education in accord with
MRI. In C. T. W. Moonen and P. A. Bandettini (Eds.), two premises. First, each disability category represents
Functional MRI (pp. 205–220). New York: Springer. a separate and distinct condition that may co-occur with
Chen, W., and Ogawa, S. (2000). Principles of BOLD func- others but is not identical (Lyon, 1996), and second,
tional MRI. In C. T. W. Moonen and P. A. Bandettini dissimilar disability conditions, or deficits, require edu-
(Eds.), Functional MRI (pp. 103–113). New York: cational programs that di¤er from regular education
Springer. programs.
Cherry, S. R., and Phelps, M. E. (1996). Imaging brain func-
The concept of inclusive schooling emerged during
tion with positron emission tomography. In A. W. Toga
and J. C. Mazziotta (Eds.), Brain mapping: The methods the 1980s and gained momentum in the 1990s in re-
(pp. 191–221). San Diego, CA: Academic Press. sponse to the two categorical premises of IDEA and its
Evans, A. C., Collins, D. L., and Holmes, C. J. (1996). Com- predecessor. Special education was viewed as a form of
putational approaches to quantifying human neuro- de facto segregation for children with disabilities. Fur-
anatomical variability. In A. W. Toga and J. C. Mazziotta thermore, special education had assumed the mantle of
(Eds.), Brain mapping: The methods (pp. 343–361). San a placement setting rather than seeing its primary func-
Diego, CA: Academic Press. tion under federal regulations as the source of specially
Hall, D. A., Haggard, M. P., Akeroyd, M. A., Palmer, A. R., designed instruction and support services intended to
Summerfield, A. Q., Elliott, M. R., et al. (1999). ‘‘Sparse’’ meet children’s unique learning needs (Giangreco, 2001).
temporal sampling in auditory fMRI. Human Brain Map-
ping, 7, 213–223.
Because of these reasons, major educational reforms
Hamalainen, M., Hari, R., Ilmonieli, R., Knuutila, J., and were necessary to transform schools and instruction
Lounasmaa, O. V. (1993). Magnetoencephalography: (Skrtic, 1992). Educational equity and excellence for all
Theory, instrumentation, and applications to noninvasive students required ‘‘a restructured system of education,
studies of the working human brain. Reviews of Modern one that eliminates categorical special needs programs
Physics, 65, 413–497. by eliminating the historical distinction between general
Kim, S.-G., Lee, S. P., Goodyear, B., and Silva, A. C. (2000). and special education’’ (Skrtic, Sailor, and Gee, 1996,
Spatial resolution of BOLD and other fMRI techniques. In p. 146). Inclusive schooling became the democratic
C. T. W. Moonen and P. A. Bandettini (Eds.), Functional mechanism proposed to accomplish this restructuring.
MRI (pp. 195–203). New York: Springer. From the perspectives of curriculum and instruction,
Nunez, P. L. (1981). Electric fields of the brain: The neuro-
physics of EEG. New York: Oxford University Press.
inclusive practices in a single system of education should
Nunez, P. L. (1995). Neocortical dynamics and human EEG foster engaged learning. All classrooms should be learner-
rhythms. New York: Oxford University Press. centered, children should be supported to become active
Sartori, G., and Umilta, C. (2000). How to avoid the fallacies and self-regulated learners, and instructional practices
of cognitive subtraction in brain imaging. Brain and Lan- should be grounded to theme-based units, cooperative
guage, 74, 191–212. learning, team teaching, and student-teacher dialogue
that sca¤olds critical inquiry in both intellectual Depending on state education laws, speech-language
and social realms (National Research Council, 1999, pathologists may provide either special education
2000). (instructional) services or, in conjunction with special
IDEA specifies that children with disabilities must be education, related (support) services. The provision of
educated in the least restrictive environment. This means related services must meet standards of educational ne-
that, to the greatest possible extent, a child should be cessity and educational relevance (Giangreco, 2001). In
educated with children who do not have disabilities and full and partial inclusion, special education and related
an explanation must be provided in the Individualized services, as specified in an IEP, may be delivered through
Educational Plan (IEP) of the degree, if any, to which multiple models, all of which are premised on collabo-
a child will not participate in regular class activities ration. In this sense, collaboration means that team par-
(O‰ce of Special Education and Rehabilitation Services ticipants have particular beliefs and possess certain skills
[OSERS], 2000). In the practical implementation of this (Silliman et al., 1999; Giangreco, 2000). These include
requirement in the past 10 years, the terms least re- (1) a shared belief in the philosophy of inclusion, (2)
strictive environment, mainstreaming, and inclusion often empowerment as decision makers combined with respect
have become confused (Osborne, 2002). Least restrictive for varying decision-making values, (3) flexibility in
environment is a legal requirement. It specifies that chil- problem solving about how best to meet the language
dren can be removed from the regular education class- and literacy needs of individual students, (4) the shared
room for special education placement only if the nature expertise made possible through coteaching strategies,
and severity of the child’s disability are such that educa- and (5) high expectations for all students, regardless of
tion in regular classes with the use of supplementary aids their educational and disability status.
and services cannot be satisfactorily achieved (OSERS, Based on the collaboration concept, the American
2000). In other words, maintenance in the regular edu- Speech-Language-Hearing Association (ASHA, 1996)
cation classroom means that children receive special supported ‘‘inclusive practices’’ as an option for opti-
education and related services for less than 21% of the mally meeting the educational needs of children and
school day. Mainstreaming is an educational practice youth. A flexible array of service delivery models for
that refers to the placement of students in regular edu- implementing inclusive practices was also specified in
cation for part of the school day, such as physical accord with children’s needs at di¤erent points in time.
education or science classes. These four general models are not viewed as mutually
Inclusion is neither specified in law nor regulations as a exclusive or exhaustive. One model is direct service
placement. Similar to resource rooms and self-contained delivery in the form of ‘‘pull-out’’ services, considered
classrooms as optional placement settings, inclusion is appropriate only when there is a short-term objective for
also an option along the educational continuum of the a child to achieve, such as acquisition of a new commu-
least restrictive environment. In its broadest sense, nication skill through direct teaching. A second model
inclusion is an educational philosophy about schooling. is classroom-based service delivery, in which the speech-
Inclusive schools are communities of learners ‘‘where language pathologist and teachers collaborate, for ex-
everyone belongs, is accepted, supports, and is supported ample through team teaching, to incorporate children’s
by his or her peers and other members of the school IEP goals across the curriculum. The composition of
community while his or her educational needs are met’’ team teaching models varies, but the teams may include
(Stainback, Stainback, and Jackson, 1992, p. 5). Full in- a regular education teacher, a teacher of specific learn-
clusion is the complete integration of the regular and ing disabilities, and a speech-language pathologist who
special education systems where all children with dis- works full-time in the classroom (e.g., Silliman et al.,
abilities receive their education, including special educa- 1999). A third model is community-based service delivery,
tion and related services, as an integral part of the in which communication goals are specifically addressed
regular education curriculum. A major criticism of full in community settings, such as a vocational education
inclusion models has been that placement decisions be- program that is a focus of a transition service plan. This
gin to take precedence over decisions about children’s plan is designed to bridge between the secondary educa-
individual educational needs (Bateman, 1995). In con- tion curriculum and adulthood and must be included in
trast, partial inclusion, which is more typical of current an IEP for students beginning at age 14 years (OSERS,
inclusive schooling, pertains to those situations in which 2000). The consultative model of service delivery is a
one or more classrooms within a school or school district fourth model; its main characteristic is indirect assis-
are inclusive. Most advocates of full inclusion would tance. This model may be most appropriate when a
consider partial inclusion as inconsistent with the phi- child’s communication needs are so specific that they
losophy of inclusive schooling. Moreover, judicial tests do not apply to other children in the classroom (ASHA,
of inclusion have primarily involved students with mod- 1996).
erate to severe cognitive disabilities. In general, in case For any inclusion model to be e¤ective in meeting
law situations, courts have ruled that ‘‘inclusionary children’s diverse needs, two foundations must be func-
placement’’ should be the placement of choice, with a tional. One concerns the change process that serves to
segregated placement occurring only when the evidence translate conceptually sound research into everyday
is overwhelming, despite the school district’s best e¤orts, instructional practices. The process of changing beliefs
that inclusion is not feasible (Osborne, 2002). and practices must be explicitly understood, supported,
Inclusion Models for Children with Developmental Disabilities 309
and crafted to the particular educational situation A third predicament is the broad variation in instruc-
(Skrtic, 1992; Gersten and Brengelman, 1996). The sectional practices in inclusive classrooms. The fact that
ond fundamental support involves the building and sus- disruptions in language and communication develop-
taining of educational partnerships. A team’s capacity to ment are implicated in a wide range of disabilities but
sustain innovative and educationally relevant practices not acknowledged as central to children’s literacy learn-
requires the successful integration of collaborative prin- ing (Catts et al., 1999) also has significant ramifications
ciples and practices (Giangreco et al., 2000). for research on academic outcomes of inclusion. For
Few studies have evaluated the outcomes of inclusion example, the e¤ects of inclusion on emerging reading
programs for students with language impairments. At skill have been reported in a limited manner, primarily
least three predicaments have contributed to this situa- for children classified with learning (severe reading) dis-
tion. The first is the co-occurring disabilities dilemma. abilities (Klingner et al., 1998), less so for children with
Few inclusion programs have been reported in the liter- language impairment (Silliman et al., 2000). In general,
ature that specifically focus on children classified as the reading instruction for these children remained
having language impairment as the primary condition. undi¤erentiated from the reading practices used with
The absence of data can be attributed in part to the cate- other students in the classroom, resulting in minimal
gorical model, which fails to account adequately for co- gains. Thus, shifting a child to inclusion from a special
occurring disabilities, much less the existence of category education placement, or even maintaining a child in the
overlap. For example, the U.S. Department of Educa- regular education classroom, may mean that little has
tion (2000) continues to report that, for identification changed for that child in reality.
and assessment purposes, ‘‘learning disabilities and lan- One implication for instructional practices is that
guage disorders may be particularly hard to distinguish educational team members, including speech-language
. . . because these two disabilities present in similar ways’’ pathologists, should be skilled in designing multilevel
(p. II-32). Procedural requirements contribute one part instruction that takes into account the individual child’s
of this dilemma. State education agencies typically re- needs for the integration of oral language dimen-
port unduplicated counts of children with disabilities. sions with evidence-based practices for learning to read,
Only the primary disability category is provided. For write, and spell (National Institute of Child Health
example, specific learning disability, mental retardation, and Human Development, 2000). A second implication,
or autism is often reported as the primary condition, supported by ASHA (2001), is that speech-language
with language impairment classified as the secondary pathologists are critical stakeholders in children’s liter-
condition. However, when duplicated counts are avail- acy learning. They have the professional responsibility to
able, such as the count of all disabilities for each child bring their knowledge of language to the planning and
that the Florida Department of Education provides implementation of prevention, identification, assessment,
(U.S. Department of Education, 2000), language im- and intervention programs in order to facilitate chil-
pairment emerges as the most frequent disability asso- dren’s literacy development.
ciated with another disability. —Elaine R. Silliman
Second is the broad variations in research purposes
and methods. Most outcome studies have primarily References
addressed the social benefits of inclusion for children American Speech-Language-Hearing Association. (1996,
with severe development disabilities, such as autism Spring). Inclusive practices for children and youth with
or Down syndrome. The results are complicated to communication disorders: Position statement and technical
evaluate because of significant disparities among studies report. ASHA, 38(Suppl. 16), 35–44.
in their definitions of inclusion, sample characteristics, American Speech-Language-Hearing Association. (2001).
such as ages, grades, gender, and type of disabilities, and Roles and responsibilities of speech-language pathologists
the instructional or intervention focus (McGregor and with respect to reading and writing in children and adoles-
cents (position statement, executive summary of guidelines,
Vogelsberg, 1999; Murawski and Swanson, 2001). technical report). ASHA, Suppl. 21, 17–27.
Two major issues confront the design of future re- Bateman, B. D. (1995). Who, how, and where: Special educa-
search on the e‰cacy of inclusive practices (ASHA, tion’s issues in perpetuity. In J. M. Kaufman and D. P.
1996). First, in studying the cognitive and social com- Hallahan (Eds.), The illusion of full inclusion: A compre-
plexities of teaching and learning, multiple research hensive critique of a current special education bandwagon
methodologies pursued in a systematic manner are nec- (pp. 75–90). Austin, TX: Pro-Ed.
essary for exploring, developing, and testing hypotheses Catts, H. W., Fey, M. E., Zhang, X., and Tomblin, J. B.
(Friel-Patti, Loeb, and Gillam, 2001) about the e‰cacy (1999). Language basis of reading and reading dis-
of the four inclusion models. Second, in moving past the abilities: Evidence from a longitudinal investigation. Scien-
social outcome focus, research strategies should examine tific Studies of Reading, 3, 331–361.
Friel-Patti, S., Loeb, D. F., and Gillam, R. B. (2001). Looking
individual di¤erences in the ability to benefit academi- ahead: An introduction to five exploratory studies of Fast
cally and linguistically from inclusive education as a ForWord. American Journal of Speech-Language Pathol-
product of variations in instructional practices, expand- ogy, 10(3), 195–202.
ing the scope of investigation beyond children’s per- Gersten, R., and Brengelman, S. U. (1996). The quest to
formance on standardized measures of language and translate research into classroom practice. Remedial and
academic performance. Special Education, 17, 67–74.
Giangreco, M. F. (2000). Related services research for students Stainback, S., Stainback, W., and Jackson, H. J. (1992).
with low-incidence disabilities: Implications for speech- Toward inclusive classrooms. In S. Stainback and W.
language pathologists in inclusive classrooms. Language, Stainback (Eds.), Curriculum considerations in inclusive
Speech, and Hearing Services in Schools, 31, 230–239. classrooms: Facilitating learning for all students (pp. 3–17).
Giangreco, M. F. (2001). Guidelines for making decisions about Baltimore: Paul H. Brookes.
IEP services. Montpelier, VT: Vermont Department of U.S. Department of Education, O‰ce of Special Education
Education [on-line]. Available: http://www.state.vt.us/educ/ Programs. (2000). 22nd Annual Report to Congress on the
Cses/sped/main.htm. Implementation of the Individuals with Disabilities Act
Giangreco, M. F., Prelock, P. A., Reid, R. R., Dennis, R. E., [on-line]. Available: http://www.ed.gov/o‰ces/OSERS/.
and Edelman, S. W. (2000). Roles of related service per-
sonnel in inclusive schools. In R. A. Villa and J. S. Thou- Further Readings
sand (Eds.), Restructuring for caring and e¤ective education:
Piecing the puzzle together (pp. 360–388). Baltimore: Paul Catts, H. W., and Kamhi, A. G. (1999). Causes of reading
H. Brookes. disabilities. In H. W. Catts and A. G. Kamhi (Eds.), Lan-
Klingner, J. K., Vaughn, S., Hughes, M. T., Schumm, J. S. S., guage and reading disabilities (pp. 95–127). Boston, MA:
and Elbaum, B. (1998). Outcomes for students with and Allyn and Bacon.
without learning disabilities in inclusive classrooms. Learn- Ehren, B. J. (2000). Maintaining a therapeutic focus and shar-
ing Disabilities: Research and Practice, 13, 153–161. ing responsibility for student success: Keys to in-classroom
Lyon, G. R. (1996, Spring). Learning disabilities. The Future of speech-language services. Language, Speech, and Hearing
Children, 6(1), 54–76. Services in Schools, 31, 219–229.
McGregor, G., and Vogelsberg, T. T. (1999). Inclusive school- Farber, J., and Klein, E. (1999). Classroom-based assessment
ing practices and research foundations: A synthesis of the lit- of a collaborative intervention program with kindergarten
erature that informs best practices about inclusive schooling. and first grade children. Language, Speech, and Hearing
Baltimore: Paul H. Brookes. Services in Schools, 30, 83–91.
Murawski, W. W., and Swanson, H. L. (2001). A meta- Hadley, P. A., Simmerman, A., Long, M., and Luna, M.
analysis of co-teaching research: Where are the data? Re- (2000). Facilitating language development for inner-city
medial and Special Education, 22, 258–267. children: Experimental evaluation of a collaborative class-
National Institute of Child Health and Human Development. room-based intervention. Language, Speech, and Hearing
(2000). Report of the National Reading Panel. Teaching Services in Schools, 31, 280–295.
children to read: An evidence-based assessment of the scien- Idol, L. (1997). Key questions related to building collaborative
tific research literature on reading and its implications for and inclusive schools. Journal of Learning Disabilities, 30,
reading instruction: Reports of the subgroups (NIH Publica- 384–394.
tion No. 00-4754). Washington, DC: U.S. Government National Research Council. (2000). From neurons to neighbor-
Printing O‰ce. hoods: The science of early childhood development. Wash-
National Research Council. (1999). How people learn: Bridging ington, DC: National Academy Press.
research and practice. Washington, DC: National Academy Prelock, P. A. (2000a). Prologue: Multiple perspectives for
Press. determining the roles of speech-language pathologists in
National Research Council. (2000). How people learn: Brain, inclusionary classrooms. Language, Speech, and Hearing
mind, experience, and school (expanded ed.). Washington, Services in Schools, 31, 213–218.
DC: National Academy Press. Prelock, P. A. (2000b). Epilogue: An intervention focus for
Osborne, A. G., Jr. (2002). Legal, administrative and policy inclusionary practice. Language, Speech, and Hearing Ser-
issues in special education. In K. G. Butler and E. R. Silli- vices in Schools, 31, 296–298.
man (Eds.), Speaking, reading, and writing in children with Prelock, P. A., Beatson. J., Contompasis, S. H., and Bishop,
language learning disabilities: New paradigms for research K. K. (1999). A model for family-centered interdisciplinary
and practice (pp. 297–314). Mahwah, NJ: Erlbaum. practice in the community. Topics in Language Disorders,
O‰ce of Special Education and Rehabilitation Services, U.S. 19(3), 36–51.
Department of Education. (2000, July). A guide to the indi- Pressley, M., and El-Dinary, P. B. (1997). What we know
vidualized education program [on-line]. Available: http:// about translating comprehension-strategies instruction re-
www.ed.gov/o‰ces/OSERS. search into practice. Journal of Learning Disabilities, 30,
Silliman, E. R., Bahr, R., Beasman, J., and Wilkinson, L. C. 486–488.
(2000). Sca¤olds for learning to read in an inclusion class- Rankin-Erickson, J. L., and Pressley, M. (2000). A survey of
room. Language, Speech, and Hearing Services in Schools, instructional practices of special education teachers nomi-
31, 265–279. nated as e¤ective teachers of literacy. Learning Disabilities:
Silliman, E. R., Ford, C. S., Beasman, J., and Evans, D. Research and Practice, 15, 206–225.
(1999). An inclusion model for children with language Salisbury, C. L., and Dunst, C. J. (1997). Home, school, and
learning disabilities: Building classroom partnerships. Top- community partnerships: Building inclusive teams. In B.
ics in Language Disorders, 19(3), 1–18. Rainforth and J. York-Barr (Eds.), Collaborative teams for
Skrtic, T. M. (1992). The special education paradox: Equity as students with severe disabilities: Integrating therapy and edu-
the way to excellence. In T. Hehir and T. Latus (Eds.), cational services (pp. 57–87). Baltimore: Paul H. Brookes.
Special education at the century’s end: Evolution of theory Snow, C. E., Burns, M. S., and Gri‰n, P. (1998). Preventing
and practice since 1970 (pp. 203–272). Cambridge, MA: reading di‰culties in young children. Washington, DC: Na-
Harvard University Press. tional Academy Press.
Skrtic, T. M., Sailor, W., and Gee, K. (1996). Voice, collabo- Speece, D. L., MacDonald, V., Kilsheimer, L., and Krist, J.
ration, and inclusion: Democratic themes in educational (1997). Research to practice: Preservice teachers reflect
and social reform initiatives. Remedial and Special Educa- on reciprocal teaching. Learning Disabilities: Research and
tion, 17, 142–157. Practice, 12, 177–187.
Language Development in Children with Focal Lesions 311
Stone, C. A. (1998). The metaphor of sca¤olding: Its utility for early specialization. Such findings would suggest that the
the field of learning disabilities. Journal of Learning Dis- neural architecture for language is determined by innate
abilities, 31, 344–364. and probably genetic mechanisms. If, by contrast, chil-
Tabors, P. O., and Snow, C. E. (2001). Young bilingual children with left hemisphere damage successfully master
dren and early literacy development. In S. B. Neuman and
language skills, the implication is that, at least under
D. K. Dickinson (Eds.), Handbook of early literacy research
(pp. 159–178). New York: Guilford Press. extreme circumstances, alternative organizations can be
Tager-Flusberg, H. (Ed.). (1999). Neurodevelopmental dis- established. Such findings would suggest that the neural
orders. Cambridge, MA: MIT Press. architecture of language is an outcome of language
Thousand, J., Diaz-Greenberg, R., Nevin, A., Cardelle- learning. In its strongest formulation, this second posi-
Elawar, M., Beckett, C., and Reese, R. (1999). Perspective: tion asserts equipotentiality, that is, that either hemi-
Perspectives on a Freirean dialectic to promote inclu- sphere can serve language functions as long as the neural
sive education. Remedial and Special Education, 20, 323– commitment occurs before language learning forces left
328. hemisphere specialization (Lenneberg, 1967). If children
Westby, C. E. (1994). The vision of full inclusion: Don’t with left hemisphere damage show only minor delays,
exclude kids by including them. Journal of Childhood Com-
the implication is that alternative neural organizations
munication Disorders, 16, 13–22.
Zigmond, N. (1995). An exploration of the meaning and prac- are less favorable to language development or process-
tice of the context of full inclusion of students with learning ing than the classical language areas (Satz, Strauss, and
disabilities. Journal of Special Education, 29, 109–115. Whitaker, 1990), an intermediate position called con-
strained plasticity or ontogenetic specialization. This last
position would suggest that some aspects of brain struc-
ture may be determined by early and possibly genetic
Language Development in Children factors, but that the full development of left hemisphere
with Focal Lesions specialization emerges through development.
Challenges in the study of children with early focal
brain injury complicate obtaining and interpreting find-
Many lines of evidence support the concept that the left ings. Before modern neural imaging modalities became
hemisphere plays an essential and specialized role in available, the localization of damage was often uncer-
language processing in adults. Studies of adults with tain; this review considers empirical studies that used
focal brain injury find that approximately 95% of cases computed tomography or magnetic resonance imaging
of aphasia are associated with left hemisphere damage for lesion location. Because early focal brain injuries
(Goodglass, 1993). In the traditional view, damage to are rare, published studies usually consist of small,
the third frontal convolution, Broca’s area, is associated heterogeneous samples. Age at onset, extent of lesion,
with problems in language production, whereas damage time since injury, and associated problems vary within
to the first temporal convolution, Wernicke’s area, is and across these groups, making meta-analyses impos-
associated with problems in language comprehension. sible. The language capabilities of the infants, had
However, the picture is more complex. The ability to they not sustained injury, remain uncertain. Other vari-
predict the location of injury from the aphasic syndrome ables, including presence of seizure disorder or use of
(or vice versa) is limited, and the right hemisphere re- anticonvulsant medications, mediate outcomes (Vargha-
mains involved in aspects of language processing, such Khadem et al., 1992). Given all the sources of potential
as interpretation of prosody and metaphor and compre- variability, the use of group means to describe brain-
hension of complex syntax (Just et al., 1996). Nonethe- injured populations may combine disparate groups and
less, the left hemisphere contribution to language seems mask important distinctions. A preferable approach is
to be necessary. individual profiling in the setting of a contrastive group
How, when, and why does the left hemisphere be- of age- and developmentally matched children (Bishop,
come specialized for language functions? The study of 1983).
children who sustain focal left hemisphere damage prior An initial question to be addressed in considering the
to or during language development provides one ex- language development of children with focal lesions is
perimental approach to address these issues. Rarely, their overall intelligence and cognitive profiles. If these
children who have not yet learned to speak or who are children have severe intellectual impairments, then any
still developing language skills sustain brain injury to language deficits should be evaluated in relation to cog-
areas of the left hemisphere that typically serve language nitive measures such as intelligence quotient (IQ) tests.
function in adults. These children a¤ord a naturalis- Most studies concur that children with focal injury to
tic experimental opportunity to address the theoreti- either hemisphere, even those with total surgical removal
cal questions about the neural substrate of language of a hemisphere, score at or near the population mean
learning. (Bates, Vicari, and Trauner, 1999). In children, unlike in
If children with left hemisphere damage prior to lan- adults, di¤erential hemispheric mediation of verbal and
guage learning subsequently demonstrate serious delays performance or nonverbal functions is not typically
in language development, the implication is that the found (Vargha-Khadem et al., 1992; Vargha-Khadem,
mechanisms for language development reside within the Isaacs, and Muter, 1994; Bates, Vicari, and Trauner,
damaged regions of the left hemisphere, a position called 1999). The near-normal intellectual performance of
children with focal damage is a testament to the plastic- intact system. The system is capable of high-level skills,
ity of the human brain. such as narrative discourse and reading, but damage
anywhere in the system may reduce the levels of func-
Development of Functional Communication tioning in these areas, presumably because these skills,
Skills even in maturity, require an extensive neural network.
Children with left hemisphere damage to the classical Performance on Specific and Formal Measures
speech areas are not aphasic. Individual di¤erences
Despite adequacy of conversational language, children
occur in fluency, intelligibility, frequency of initiation,
with left hemisphere damage show subtle to moderate
and volume of output. However, in light of the chronic
impairments in selective aspects of language. In children
sensory and motor deficits that follow injury in these
with injuries acquired during childhood, expressive
children and the severe disruption of language that fol-
and receptive complex syntax is particularly vulnerable
lows similar injuries in adults, it is remarkable that their
(Aram, Ekelman, and Whitaker, 1986, 1987; Aram and
conversational language is normal or near normal (Bates
Ekelman, 1987). At school age, an on-line sentence
et al., 1997).
comprehension task suggested that their strategies for
Despite the favorable prognosis, children with focal
interpreting syntactic structures were developmentally
injury to either hemisphere may experience devel-
delayed compared with those of normal learners (Feld-
opmental delays in the onset of babbling and communi-
man, MacWhinney, and Sacco, 2002). However, chil-
cative gestures (Marchman, Miller, and Bates, 1991),
dren with right hemisphere damage also showed
vocabulary development, and use of word combinations
developmental delays on this task. An alternative expla-
in parent-child conversations (Thal et al., 1991; Feld-
nation to the interpretation that syntax skills may be
man et al., 1992). Once these children begin to acquire
particularly vulnerable to left hemisphere injuries is that
functional skills, they are comparable in their rate of
subjects with left hemisphere damage have greatest di‰-
developmental progress to each other and to children
culty with the most developmentally advanced areas
developing typically (Feldman et al., 1992). By age 4,
on an experimental assessment (Bishop, 1983). In this
children with focal left hemisphere damage can master
regard, children who sustain left hemisphere damage in
even the complex morphosyntactic structures of Hebrew,
the perinatal period have also been shown to have
at least using a criterion that multiple complex struc-
more di‰culties in lexical retrieval (Aram, Ekelman, and
tures are present in the spoken output (Levy, Amir, and
Whitaker, 1987), comprehension of di‰cult verbal ma-
Shalev, 1994).
terial, and formulating sentences than children with right
Location of injury is not related to rate of develop-
hemisphere injury (MacWhinney et al., 2000). They also
ment in the manner predicted from studies of adults with
show relatively more di‰culties with reading, writing,
brain injury. Children with right hemisphere damage
and spelling than children with right hemisphere damage
initially show greater delays in initial word comprehen-
(Woods and Carey, 1978). The procedures that demon-
sion and production than children with left hemisphere
strate the selective weaknesses of children with left
damage, and children with posterior left hemisphere
hemisphere damage require precise and constrained
damage, presumably including Wernicke’s area, develop
performance, as opposed to the relatively free form of
more slowly than children with damage to other areas
conversation.
of the left hemisphere, including Broca’s area (Thal et
On-line reaction time methodology has been used
al., 1991; Bates et al., 1997). These findings implicate the
in school-age children with focal lesions to determine
centrality of pattern recognition and subsequently of
whether particular information processing abilities are
language comprehension in the development of language
selectively compromised in children with left hemisphere
production.
damage (MacWhinney et al., 2000). Children with both
Children with focal damage achieve high-level lan-
left and right hemisphere damage had slower reaction
guage skills. By school age, they can produce narrative
times than age-matched normal peers on all of the audi-
discourse. Their productions are shorter and syntacti-
tory and visual reaction time tasks studied. The two
cally less sophisticated than those of age-matched peers,
tasks that best distinguished children with left hemi-
but narrative skills are comparable in children with left
sphere damage from children with other lesions and
hemisphere and right hemisphere damage (Reilly, Bates,
normal children were verbally repeating numbers pre-
and Marchman, 1998). Most children with early focal
sented in the auditory mode and naming numbers pre-
damage also learn to read, write, and spell, although the
sented in the visual mode, both tasks that require rapid
strategies they use may vary as a function of the side of
verbal output.
the lesion (Dennis, Lovett, and Wiegel-Crump, 1981).
These findings suggest that a wide neural network Age at Injury
involving both cerebral hemispheres is necessary to
launch language development, so that damage to a neu- In most studies, the younger the child at the time of focal
ral substrate in either hemisphere may delay language injury, the higher is the subsequent level of functioning
development. Once language skills begin to develop and in IQ and language. In children who sustain injuries
an initial neural network is beginning to be established, after age 5, verbal IQ is more likely to be a¤ected with
neural organization progresses at a similar rate as in an left hemisphere damage and performance IQ with right
Language Development in Children with Focal Lesions 313
hemisphere damage. Children who undergo total hemi- the cortex is pluripotential, if not equipotential. Synaptic
spherectomy for intractable seizures have better lan- connections seem to be formed and preserved on the
guage if the operation is done before 1 year of age than basis of experience. Experience-dependent commitment
after 1 (Dennis and Whitaker, 1976). Nonetheless, of neural substrate may explain some of the variability in
Vargha-Khadem and colleagues (1997) reported the case the neural organization of basic functions seen across
of a previously nonverbal child who began to speak at individuals. Many studies also demonstrate experience-
age 9 years, after he had undergone a hemispherectomy dependent progressive specialization of neural tissue.
of the left hemisphere for seizures and a reduction in In language development, studies of normal infants and
anticonvulsant medications. The ability to develop lan- toddlers using event-related potentials have shown that
guage appears to be preserved under some circumstances in the initial development of a language skill, such as
into middle childhood. word learning or recognition of syntactic markers, a
bilateral network becomes activated. As skill level in-
Brain Reorganization After Early Injury creases, the function lateralizes to the left hemisphere
(Mills, Co¤ey-Corina, and Neville, 1993, 1997).
Language functioning can be relocated to the right The language of children with focal injuries leads to
hemisphere after early damage to the left hemisphere. similar views on the neural basis of language learning.
Previously, the method used to determine the eloquent The development of language initially seems to use a
hemisphere was a sodium amytal carotid infusion, used wide bilateral neural network, such that damage to
in the presurgical evaluation of individuals with intrac- either hemisphere delays the onset. For reasons that re-
table seizures. If this anesthetic disrupts language when main unclear, a slight advantage to the left hemisphere
infused into the carotid artery on one side but not the for language function results in progressive specializa-
other, then that side is considered the ‘‘eloquent’’ hemi- tion of the left hemisphere as learning proceeds. If the
sphere. Rasmussen and Milner (1977) found evidence left hemisphere is damaged, then the other cortical areas
that individuals with early left hemisphere injury were specialize to serve language function.
far more likely to have language in the right hemi- Language functions may be preserved in adjacent
sphere than individuals with no previous left hemisphere regions of the left hemisphere or in homologous regions
injury, although some individuals with early left hemi- of the right hemisphere, depending on multiple factors.
sphere injury retained language functioning in the left Given the pluripotential nature of cortex, children with
hemisphere. left hemisphere damage perform well in conversational
Functional imaging o¤ers a noninvasive method to language. However, they have selective di‰culties when
reexplore this issue. The methods are appropriate for tasks require syntactic skills and other rapid, precise,
normal individuals as well as clinical populations and or constrained linguistic processing. However, children
can be used to determine the areas of activation for with right hemisphere injury may also have subtle to
many di¤erent tasks. In adults, functional magnetic res- moderate language disturbances. The plasticity of the
onance imaging (fMRI) has shown that a wide network brain for language function may come about at the
of areas is involved in sentence interpretation (Just et al., expense of other neuropsychological functions, includ-
1996); activation was more likely to include right hemi- ing visual-spatial processing. Functional imaging holds
sphere locations as the sentence di‰culty increased. enormous promise for investigating the organization of
Booth and colleagues (2000) used a similar fMRI para- language and other skills in intact individuals developing
digm to compare six children with perinatal injuries, typically as well as in children with focal injuries.
four with damage to left hemisphere areas, to normal —Heidi M. Feldman
adults and normal children. In adults and normal chil-
dren, a sentence comprehension task produced more References
activation in the left hemisphere than in the right hemi-
Aram, D. M., and Ekelman, B. L. (1987). Unilateral brain
sphere; greatest activation in the superior temporal,
lesions in childhood: Performance on the Revised Token
middle temporal, inferior frontal, and prefrontal areas; Test. Brain and Language, 32, 137–158.
and right hemisphere recruitment for di‰cult sentences, Aram, D. M., Ekelman, B. L., and Whitaker, H. A. (1986).
particularly in the adults. The children with left hemi- Spoken syntax in children with acquired unilateral hemi-
sphere damage performed less accurately than normal sphere lesions. Brain and Language, 27, 75–100.
children on the task. These children activated primarily a Aram, D. M., Ekelman, B. L., and Whitaker, H. A. (1987).
right hemisphere network and did not show an increase Lexical retrieval in left and right brain-lesioned chidlren.
in activation as a function of sentence di‰culty. The Brain and Language, 38, 105–121.
children with left hemisphere damage also had very poor Bates, E., Thal, D., Trauner, D., Fenson, J., Aram, D., Eisele,
performance on a mental rotation task that typically J., and Nass, R. (1997). From first words to grammar in
children with focal brain injury. Developmental Neuro-
activates right hemisphere areas. This finding suggests psychology, 13, 447–476.
that reorganization of language to the right hemisphere Bates, E., Vicari, S., and Trauner, D. (1999). Neural mediation
may compromise skills typically served by the right of language development: Perspectives from lesion studies
hemisphere. of infants and children. In H. Tager-Flusberg (Ed.), Neuro-
A variety of basic science methods show that cortical developmental disorders (pp. 533–581). Cambridge, MA:
tissue can take on a variety of functions, suggesting that MIT Press.
Bishop, D. (1983). Linguistic impairment after left hemi- Vargha-Khadem, F., Isaacs, E., and Muter, V. (1994). A re-
decortication for infantile hemiplegia? A reappraisal. Quar- view of cognitive outcome after unilateral lesions sustained
terly Journal of Experimental Psychology, 35(A), 199–207. during childhood. Journal of Child Neurology, 9(Suppl. 2),
Booth, J. R., MacWhinney, B., Thulborn, K. R., Sacco, K., 67–73.
Voyvodic, J., and Feldman, H. M. (2000). Patterns of Vargha-Khadem, F., Isaacs, E., van der Werf, S., Robb, S.,
brain activation in children with strokes engaged in three and Wilson, J. (1992). Development of intelligence and
cognitive tasks. Developmental Neuropsychology, 18, 139– memory in children with hemiplegic cerebral palsy: The
169. deleterious consequences of early seizures. Brain, 115(Pt. 1),
Dennis, M., Lovett, M., and Wiegel-Crump, C. A. (1981). 315–329.
Written language acquisition after left or right hemidecor- Woods, B., and Carey, S. (1978). Language deficits after ap-
tication in infancy. Brain and Language, 12, 54–91. parent clinical recovery from aphasia. Annals of Neurology,
Dennis, M., and Whitaker, H. A. (1976). Language acquisi- 6, 405–409.
tion following hemidecortication: Linguistic superiority of
the left over the right hemisphere. Brain and Language, 3, Further Readings
404–433.
Feldman, H. M., Holland, A. L., Kemp, S. S., and Janosky, Bates, E. (1999). Language and the infant brain. Journal of
J. E. (1992). Language development after unilateral brain Communication Disorders, 32, 195–205.
injury. Brain and Language, 42, 89–102. Bishop, D. V. (1997). Cognitive neuropsychology and devel-
Feldman, H. M., MacWhinney, B., and Sacco, K. (2002). opmental disorders: Uncomfortable bedfellows. Quarterly
Sentence processing in children with early left hemisphere Journal of Experimental Psychology. A. Human Experimen-
brain injury. Brain and Language, 83, 335–352. tal Psychology, 50, 899–923.
Goodglass, D. (1993). Understanding aphasia. San Diego, CA: Dennis, M., Spiegler, B. J., and Hetherington, R. (2000). New
Academic Press. survivors for the new millennium: Cognitive risk and re-
Just, M., Carpenter, P., Keller, T., Eddy, W., and Thulborn, serve in adults with childhood brain insults. Brain and Cog-
K. (1996). Brain activation modulated by sentence compre- nition, 42, 102–105.
hension. Science, 274, 114–116. Eisele, J. A., and Aram, D. M. (1995). Lexical and grammati-
Lenneberg, E. H. (1967). Biological foundations of language. cal development in children with early hemisphere damage:
New York: Wiley. A cross-sectional view from birth to adolescence. In P.
Levy, Y., Amir, N., and Shalev, R. (1994). Morphology in a Fletcher and B. MacWhinney (Eds.), Handbook of child
child with a congenital, left-hemisphere brain lesion: Impli- language (pp. 664–689). Oxford, U.K.: Blackwell.
cations for normal acquisition. In Constraints on language Rivkin, M. J. (2000). Developmental neuroimaging of children
acquisition: Studies of atypical children. Hillsdale, NJ: using magnetic resonance techniques. Mental Retardation
Erlbaum. and Developmental Disabilities Research Reviews, 6, 68–80.
MacWhinney, B., Feldman, H., Sacco, K., and Valdes-Perez, Semrud-Clikeman, M. (1997). Evidence from imaging on the
R. (2000). Online measures of basic language skills in chil- relationship between brain structure and developmental
dren with early focal brain lesions. Brain and Language, 71, language disorders. Seminars in Pediatric Neurology, 4,
400–431. 117–124.
Marchman, V. A., Miller, R., and Bates, E. (1991). Babble and Vargha-Khadem, F. (2001). Generalized versus selective cog-
first words in children with focal brain injury. Applied Psy- nitive impairments resulting from brain damage sustained in
cholinguistics, 12, 1–22. childhood. Epilepsia, 42(Suppl. 1), 37–40.
Mills, D. L., Co¤ey-Corina, S. A., and Neville, H. J. (1993). Vargha-Khadem, F., Isaacs, E., Watkins, K., and Mishkin, M.
Language acquisition and cerebral specialization in 20- (2000). Ontogenetic specialization of hemispheric function.
month-old infants. Journal of Cognitive Neuroscience, 53, In C. E. Polkey and M. Duchowney (Eds.), Intractable focal
317–334. epilepsy: Medical and surgical treatment (pp. 405–418).
Mills, D. L., Co¤ey-Corina, S. A., and Neville, H. J. (1997). London: Harcourt.
Language comprehension and cerebral specialization from Volpe, J. J. (2001). Perinatal brain injury: From pathogenesis
13 to 20 months. Developmental Neuropsychology, 13, to neuroprotection. Mental Retardation and Developmental
397–445. Disabilities Research Reviews, 7, 56–64.
Rasmussen, T., and Milner, B. (1977). The role of early left
brain injury in determining implications for models of
language development. Annals of the New York Academy of
Sciences, 299, 355–369. Language Disorders in Adults:
Reilly, J. S., Bates, E. A., and Marchman, V. A. (1998). Nar- Subcortical Involvement
rative discourse in children with early focal brain injury.
Brain and Language, 61, 335–375.
Satz, P., Strauss, E., and Whitaker, H. (1990). The ontogeny of The first suggestion of a link between subcortical struc-
hemispheric specialization: Some old hypotheses revisted. tures and language was made by Broadbent (1872), who
Brain and Language, 38, 596–614. proposed that words were generated as motor acts in the
Thal, D., Marchman, V., Stiles, J., Aram, D., Trauner, D., basal ganglia. Despite this suggestion, according to the
Nass, R., and Bates, E. (1991). Early lexical development in
children with focal brain injury. Brain and Language, 40(4),
classical anatomo-functional models of language orga-
491–527. nization proposed by Wernicke (1874) and Lichtheim
Vargha-Khadem, F., Carr, L. J., Isaacs, E., Brett, E., Adams, (1885), subcortical brain lesions could only produce lan-
C., and Mishkin, M. (1997). Onset of speech after left guage deficits if they disrupted the white matter fibers
hemispherectomy in a nine-year-old boy. Brain, 120(Pt. 1), that connect the various cortical language centres. Con-
159–182. sequently, aphasia has traditionally been regarded as
Language Disorders in Adults: Subcortical Involvement 315
a language disorder resulting from damage to the lan- lesions compared to posterior striatocapsular lesions.
guage areas of the dominant cerebral cortex. Since For example, Naeser et al. (1982) noted that patients
the late 1970s, however, this traditional view has been with capsular-putaminal lesions extending into the ante-
challenged by the findings of an increasing number rior-superior white matter typically had good compre-
of cliniconeuroradiological correlation studies that have hension and slow but grammatical speech. In contrast,
documented the occurrence of adult language disorders those with capsular-putaminal lesions including poste-
in association with apparently subcortical vascular rior white matter extension showed poor comprehen-
lesions. In particular, the introduction in recent decades sion and fluent Wernicke’s-type speech, while those with
of new neuroradiological methods for lesion localization anterior-superior and posterior white matter involve-
in vivo, including computed tomography in the 1970s ment were globally aphasic. Further support for this
and more recently magnetic resonance imaging, has anterior-posterior distinction was provided by Cappa
led to an increasing number of reports in the literature et al. (1983) and Murdoch et al. (1986). Despite this
of aphasia following apparently purely subcortical apparent consensus, several other studies have ques-
lesions. (For reviews of in vivo correlation studies, see tionned the accuracy and utility of the anterior-posterior
Alexander, 1989; Cappa and Vallar, 1992, and Murdoch, dichotomy by describing a number of cases in which the
1996.) Therefore, although the concept of subcortical patterns of language impairment could not be accounted
aphasia remains controversial, recent years have seen for in terms of this anatomical distinction (Kennedy and
a growing acceptance of a role for subcortical struc- Murdoch, 1993; Wallesch, 1985).
tures in language. Despite an abundance of theoretical In contrast to the striatocapsular lesions, language
models, however, the precise nature of that role remains disturbances following thalamic lesions present a more
elusive. uniform clinical picture, and it is generally accepted
Subcortical structures most commonly purported to that a typical thalamic aphasia can be characterized by
have a linguistic role include the basal ganglia, the tha- the clinical presentation. Most commonly the aphasia
lamus, and the subcortical white matter pathways. Some resulting from thalamic injury is of a mixed transcortical
evidence for a role for the cerebellum in language has presentation, sharing some features with both trans-
also been reported (Leiner, Leiner, and Dow, 1993). The cortical motor and transcortical sensory aphasia (Cappa
basal ganglia comprise the corpus striatum (including and Vignolo, 1979; Murdoch, 1996). The features of
the caudate nucleus and the putamen and internal cap- thalamic aphasia most commonly reported include
sule), the globus pallidus, the subthalamic nucleus, and preserved repetition, variable but often relatively good
the substantia nigra. Although these nuclei are primarily auditory comprehension, a reduction in spontaneous
involved in motor functions, the corpus striatum and speech output, a predominance of semantic paraphasic
globus pallidus have frequently been included in models errors, and anomia. Lesions of the dominant antero-
of subcortical participation in language. In addition, lateral thalamus (including the ventral anterior, ventral
several thalamic nuclei have also been implicated in lan- lateral, and anterior nuclei) have been highlighted as the
guage, in particular the ventral anterior nucleus, which loci of aphasic deficits, given that infarctions in this re-
has direct connections to the premotor cortex and indi- gion more consistently lead to aphasic disturbances than
rect connections to the temporoparietal cortex via the lesions involving the posterior parts of the thalamus
pulvinar. The basal ganglia and the thalamus are linked (Cappa et al., 1986).
to the cerebral cortex by way of a series of circuits re- Attempts to explain the clinical manifestations of
ferred to as the cortico-striato-pallido-thalamo-cortical subcortical aphasia have culminated in the formulation
loops. The majority of contemporary theories specify of several theories of subcortical participation in lan-
these loops as the neuroanatomical basis of subcortical guage. These theories, largely developed on the basis of
participation in language. speech and language data collected from subjects who
Although there is general agreement that critical have sustained cerebrovascular accidents involving the
white matter pathways and the thalamus are involved thalamus or striatocapsular region, have been expressed
in language, controversy and uncertainty surround the as neuroanatomically based models. Two models of
possible linguistic role of striatocapsular structures. subcortical participation in language have been quite in-
Although in vivo correlation studies have documented fluential. The first of these, the response/release/semantic
beyond reasonable doubt that language impairments can feedback model (Crosson, 1985), proposes a role for
occur in association with lesions confined to the striato- subcortical structures in regulating the release of pre-
capsular region of the dominant hemisphere, consider- formulated language segments from the cerebral cortex.
able variability has been reported in the nature and According to this model, the conceptual, word-finding,
degree of these language impairments, with no unitary and syntactic processes that fall under the rubric of lan-
striatocapuslar aphasia being identified (Kennedy and guage formulation occur in the anterior cerebral cor-
Murdoch, 1993; Nadeau and Crosson, 1997). Varied tex. The monitoring of anteriorly formulated language
impairments have been noted in spontaneous speech, segments, as well as the semantic and phonological
confrontation naming, repetition, auditory comprehen- decoding of incoming language, occurs in the posterior
sion, and reading comprehension. A number of authors temporoparietal cortex. Language segments are con-
have suggested that a di¤erence exists between the type veyed from the anterior language formulation center to
of aphasia associated with anterior striatocapsular the posterior language center via the thalamus prior to
release for motor programming. This operation allows integrative and decision-making capabilities rather than
the posterior semantic decoding centers to monitor the the simple neuroregulatory function proposed in Cross-
language segment for semantic accuracy. If an inaccu- on’s (1985) model. Specifically, the basal ganglia system
racy is detected, then the information required for and thalamus were hypothesized to process situational as
correction is conveyed via the thalamus back to the an- well as goal-directed constraints and lexical information
terior cortex. If the language segment is found to be from the frontal cortex and posterior language area, and
accurate during monitoring, then it is released from a to subsequently participate in the process of determining
bu¤er in the anterior cortex for subsequent motor pro- the appropriate lexical item, from a range of cortically
gramming. In addition to subcortical structures par- generated lexical alternatives, for verbal production. The
ticipating in the preverbal semantic monitoring process, most appropriate lexical alternative is then released by
the model also specifies that the striatocapsular struc- the thalamus for processing by the frontal cortex and
tures are involved in the release of the formulated lan- programming for speech. Cortical processing of selected
guage segment for motor programming. Specifically, lexical alternatives is made possible by inhibitory influ-
it is suggested that this release occurs through the ences of the globus pallidus on a thalamic gating mech-
cortico-striato-pallido-thalamo-cortical loop in the fol- anism. This most appropriate lexical alternative has an
lowing way. Once the language segments have been inhibitory e¤ect on the thalamus, promoting closure of
verified for semantic accuracy, the temporoparietal cor- the thalamic gate, resulting in activation of the cerebral
tex releases the caudate nucleus from inhibition. The cortex and production of the desired response. Cortical
caudate nucleus then serves to weaken inhibitory pallidal processing of subordinate alternatives is suppressed as a
regulation of thalamic excitatory outputs in the anterior consequence of pallidal disinhibition of the thalamus,
language center, which in turn arouses the cortex to and the inhibition of cortical activity.
enable the generation of motor programs for semanti- Despite the apparent di¤erences in the two models,
cally verified language segments. According to this model, they both ascribe an important role to the subcortical
Crosson (1985) hypothesized that subcortical lesions nuclei in language processing, especially at the lexical
within the cortico-striato-pallido-thalamo-cortical loop level of language organization. It is equally apparent
would produce language deficits confined to the lexical- that each of these models has a number of limitations
semantic level. and that no one model has achieved uniform accep-
Crosson’s (1985) original conception of the response- tance. A major limitation of these models is that neither
release mechanism has since been revised and elaborated explains the considerable variability in clinical presenta-
in terms of the neural substrates involved (Crosson, tion of subcortical aphasia. According to Cappa (1997),
1992a, 1992b). Although the actual response-release a further problem is that the models suggest such exten-
mechanism in the modified version resembles that in the sive and widely distributed systems subserving lexical
original conception, the route for this release is altered. processing that specific predictions appear to be di‰cult
The formulation of a language segment causes frontal to disprove on the basis of pathological evidence. Put
excitation of the caudate, which increases inhibition of more simply, these models do not lend themselves
specific fields within the globus pallidus; however, this readily to empirical testing. Yet another limitation arises
level of inhibition alone is not su‰cient to alter pallidal from the nature of the research on which these models
output to the thalamus. An increase in posterior lan- are based. The available models of subcortical partici-
guage cortex excitation to the caudate, which occurs pation in language are largely based on the observation
once a language segment has been semantically verified that certain contrasting deficits of language produc-
posteriorly, provides a boost to the inhibition of the tion arise in subjects with particular subcortical vascular
pallidum. The pallidal summation of this anterior and lesions when tested on traditional tests of language
posterior inhibitory input allows the release of the ven- function. These language measures were typically
tral anterior thalamus from inhibition by the globus designed for taxonomic purposes regarding traditional
pallidus, causing the thalamic excitation of the frontal cortical-based aphasia syndromes and may be inade-
language cortex required to trigger the release of the quate for developing models of brain functioning (Car-
language segment for motor programming. Overall, the amazza, 1984). It has also been argued that language
revised model provides an integrated account of how deficits associated with subcortical vascular lesions may
subcortical structures might influence language output actually be related to concomitant cortical dysfunction
through a neuroregulatory mechanism that is consistent via various pathophysiological mechanisms. For in-
with knowledge of cortical-subcortical neurotransmitter stance, cortical infarction may not have been detected by
systems and structural features. neuroimaging. Also, subcortical lesions may result in
A second model of subcortical participation in lan- diaschisis or the functional deactivation of distant re-
guage was proposed by Wallesch and Papagno (1988). lated cortical structures (Metter et al., 1983). Further,
This model, referred to as the lexical selection model, language dysfunction following subcortical lesions may
also proposes that subcortical structures participate in be related to decreases in cortical perfusion, causing
language processes via a cortico-striato-pallido-thalamo- widespread cortical damage that may or may not be
cortical loop. Wallesch and Papagno (1988) postulated detected by neuroimaging (Nadeau and Crosson, 1997).
that the subcortical components of the loop constitute a As yet, however, the relationship between the structural
frontal lobe system comprised of parallel modules with site and etiology of subcortical lesions, the extent of
Language Disorders in Adults: Subcortical Involvement 317
cortical hypometabolism and hypoperfusion, and asso- orders associated with subcortical lesions (pp. 7–41). New
ciated language function remains to be fully elucidated. York: Oxford University Press.
Further clarification of the role of subcortical struc- Cappa, S. F., and Vignolo, L. A. (1979). Transcortical features
tures in language is likely to come through the use of aphasia following left thalamic hemorrhage. Cortex, 15,
121–130.
of functional imaging techniques and neurophysiologi-
Caramazza, A. (1984). The logic of neuropsychological re-
cal methods such as electrical and magnetic evoked search and the problem of patient classification in aphasia.
responses, as well as from the study of the language Brain and Language, 21, 9–20.
abilities of patients with circumscribed neurosurgical Crosson, B. (1985). Subcortical functions in language: A
lesions involving subcortical structures (e.g., thalamot- working model. Brain and Language, 25, 257–292.
omy and pallidotomy). Functional imaging techniques Crosson, B. (1992a). Subcortical functions in language and
such as positron emission tomography (PET) and func- memory. New York: Guilford Press.
tional magnetic resonance imaging (fMRI) enable brain Crosson, B. (1992b). Is the striatum involved in language? In
images to be collected while the subject is performing G. Vallar, S. F. Cappa, and C. W. Wallesch (Eds.), Neuro-
various language production tasks (e.g., picture naming, psychological disorders associated with subcortical lesions
(pp. 268–293). Oxford: Oxford University Press.
generating nouns) or during language comprehension
Kennedy, M., and Murdoch, B. E. (1993). Chronic aphasia
(e.g., listening to stories). These techniques therefore subsequent to striatocapsular and thalamic lesions in the left
enable visualization of the brain regions involved in a hemisphere. Brain and Language, 44, 284–295.
language task, with a spatial resolution as low as a few Leiner, H. C., Leiner, A. L., and Dow, R. S. (1993). Cognitive
millimeters. The use of fMRI in the future is therefore language functions of the human cerebellum. Trends in
likely to further inform the debate as to the role of sub- Neurosciences, 16, 444–447.
cortical structures in language. A review of the exten- Lichtheim, L. (1885). On aphasia. Brain, 7, 433–484.
sive literature on PET studies indicates that some studies Metter, E. J., Riege, W. H., Hanson, W. R., Kuhl, D. E.,
published since 1994 have demonstrated activation of Phelps, M. E., Squire, L. R., et al. (1983). Comparison of
the thalamus and basal ganglia during completion of metabolic rates, language and memory in subcortical apha-
sias. Brain and Language, 19, 33–47.
language tasks such as picture naming (Price, Moore, et
Murdoch, B. E. (1996). The role of subcortical structures in
al., 1996) and word repetition (Price, Wise, et al., 1996). language: Illuminations from clinico-neuroradiological
In summary, although a role for the thalamus in lan- studies of brain damaged subjects. In B. Dodd, R. Camp-
guage is generally accepted, some controversy still exists bell, and L. Worrall (Eds.), Evaluating theories of language:
as to whether the structures of the striatocapsular region Evidence from disordered communication (pp. 137–160).
participate directly in language processing or play a role London: Whurr.
as supporting structures for language. Contemporary Murdoch, B. E., Thompson, D., Fraser, S., and Harrison, L.
theories suggest that the role of subcortical structures (1986). Aphasia following non-haemorrhagic lesions in the
in language is essentially neuroregulatory, relying on left striatocapsular region. Australian Journal of Human
quantitative neuronal activity. Although these theories Communication Disorders, 14, 5–21.
Nadeau, S. E., and Crosson, B. (1997). Subcortical aphasia.
have a number of limitations, for the present they
do serve as frameworks for generating experimental Naeser, M. A., Alexander, M. P., Estabrooks, N., Levine,
hypotheses which can then be tested in order to advance H. L., Laughlin, S. A., and Geschwind, N. (1982). Aphasia
our understanding of subcortical brain mechanisms in with predominantly subcortical lesion sites: Description of
language. three capsular/putaminal aphasia syndromes. Archives of
Neurology, 39, 2–14.
—Bruce E. Murdoch Price, C. J., Moore, C., Humphreys, G. W., Frackowiak, R. S.,
and Friston, K. J. (1996). The neural signs sustaining object
References recognition and naming. Proceedings of the Royal Society of
London, B, 263, 1501–1507.
Alexander, M. P. (1989). Clinico-anatomical correlations of Price, C. J., Wise, R. J., Warburton, E. A., Moore, C. J.,
aphasia following predominantly subcortical lesions. In Howard, D., Patterson, K., et al. (1996). Hearing and say-
F. Boller and J. Grafman (Eds.), Handbook of neuro- ing: The functional neuroanatomy of auditory word pro-
psychology (pp. 47–66). Amsterdam: Elsevier. cessing. Brain, 119, 919–931.
Broadbent, G. (1872). On the cerebral mechanism of speech and Wallesch, C. W. (1985). Two syndromes of aphasia occurring
thought. London. with ischemic lesions involving the left basal ganglia. Brain
Cappa, S. F. (1997). Subcortical aphasia: Still a useful concept? and Language, 25, 357–361.
Brain and Language, 58, 424–426. Wallesch, C. W., and Papagno, C. (1988). Subcortical aphasia.
Cappa, S. F., Cavallotti, G., Guidotti, M., Papagno, C., and In F. C. Rose, R. Whurr, and M. A. Wyke (Eds.), Aphasia
Vignolo, L. A. (1983). Subcortical aphasia: Two clinical-CT (pp. 256–287). London: Whurr.
scan correlation studies. Cortex, 19, 227–241. Wernicke, C. (1874). Der aphasische Symtomencomplex. Bres-
Cappa, S. F., Papagno, C., Vallar, G., and Vignolo, L. A. (1986). lau: Cohn and Weigert.
Aphasia does not always follow left thalamic hemorrhage:
A study of five negative cases. Cortex, 22, 639–647. Further Readings
Cappa, S. F., and Vallar, G. (1992). Neuropsychological dis-
order after subcortical lesions: Implications for neural Alexander, M. P., Naeser, M. A., and Palumbo, C. L. (1987).
models of language and spatial attention. In G. Vallar, S. F. Correlations of subcortical CT lesion sites and aphasia
Cappa, and C. W. Wallesch (Eds.), Neuropsychological dis- profiles. Brain, 110, 961–991.
Basso, A., Della-Sala, S., and Farabola, M. (1987). Aphasia AAE are overrepresented in special education classes,
arising from purely deep lesions. Cortex, 23, 29–44. in part because of their linguistic background (Kuelen,
Crosson, B. (1999). Subcortical mechanisms in language: Weddington, and Debose, 1998).
Lexical-semantic mechanisms and the thalamus. Brain and An important factor contributing to this over-
Cognition, 40, 414–438.
representation is clinicians’ failure to di¤erentiate legiti-
Crosson, B., Zawacki, T., Brinson, G., Lu, L., and Sadek, J. R.
(1997). Models of subcortical functions in language: Cur- mate patterns of AAE from symptoms of a language
rent status. Journal of Neurolinguistics, 10, 277–300. disorder. This failure results from an assessment process
Fabbro, F., Clarici, A., and Bava, A. (1996). E¤ects of left that relies heavily on identifying deviations from SAE as
basal ganglia lesions on language production. Perceptual signs of impairment. Moreover, when these deviations
and Motor Skills, 82, 1291–1298. are the sole symptoms and no confirming evidence exists
Friederici, A. D., von Cramon, Y., and Kotz, S. (1999). Lan- of concomitant disorders such as hearing impairment,
guage related brain potentials in patients with cortical and cognitive-intellectual deficits, neurological impairment,
subcortical left hemisphere lesions. Brain, 122, 1033–1047. or psycho-emotional problems, the reliance on deviant
Kennedy, M., and Murdoch, B. E. (1991). Patterns of speech SAE patterns for diagnosis is even greater.
and language recovery following left striatocapsular haem-
A case in point is specific language impairment (SLI),
orrhage. Aphasiology, 5, 489–510.
Kirk, A., and Kertesz, A. (1994). Cortical and subcortical a disorder presumably restricted to aberrant language
aphasias compared. Aphasiology, 8, 65–82. symptoms without a known cause (Watkins and Rice,
Metter, E. J. (1992). Role of subcortical structures in aphasia: 1994). Because the language symptoms of SLI can
Evidence from studies of resting cerebral glucose metabo- appear similar to legitimate language patterns of AAE
lism. In G. Vallar, S. F. Cappa, and C. W. Wallesch (Eds.), (Seymour, Bland-Stewart, and Green, 1998), African-
Neuropsychological disorders associated with subcortical American children are at risk for SLI misdiagnosis. This
lesions (pp. 478–500). Oxford: Oxford University Press. kind of misdiagnosis epitomizes linguistic and cultural
Parent, A., and Hazrati, L. N. (1995). Functional anatomy of bias in assessment, which has been a major issue of con-
the basal ganglia: 1. The cortico-basal ganglia-thalamo- cern to clinical professionals committed to equity and
cortical loop. Brain Research Reviews, 20, 91–127.
fairness in testing. Although far from resolved, linguistic
Vignolo, L. A., Macario, M., and Cappa, S. F. (1992). Clini-
cal-CT scan correlations in a prospective series of patients bias in testing has been addressed by focusing on three
with acute left-hemispheric subcortical stroke. In G. Vallar, related areas: language acquisition milestones for AAE,
S. F. Cappa, and C. W. Wallesch (Eds.), Neuropsychologi- reduction of bias in assessment methods, and reduction
cal disorders associated with subcortical lesions (pp. 335– of bias in intervention strategies.
343). Oxford: Oxford University Press.
Wallesch, C. W. (1997). Symptomatology of subcortical apha- Language Acquisition Milestones for AAE. Much of
sia. Journal of Neurolinguistics, 10, 267–275. what is known about language acquisition in SAE
Wallesch, C. W., Johannsen-Harbach, H., Bartels, C., and undoubtedly also applies to the AAE-speaking child.
Herrmann, M. (1997). Mechanisms of and misconcep- However, it is not altogether clear whether speakers of
tions about subcortical aphasia. Brain and Language, 58,
AAE and SAE follow parallel tracks in mastering their
403–409.
respective adult systems. Acquisition data on AAE sug-
gest that the two are quite similar until approximately
the age of 3, at which point they diverge (Cole, 1980).
Language Disorders in African- This claim rests largely on evidence that young children
American Children from both language groups produce similar kinds of
developmental ‘‘errors.’’ However, these similarities
may not occur for the same reasons, since several early
Interest in language disorders among African-American developmental patterns also appear to match the adult
children arises from the recognition that a significant AAE system. For example, absent morphological inflec-
number of these children speak a form of English vari- tions are common in the emerging language of AAE and
ously referred to as Black English, African-American SAE as well as in adult AAE.
English, African-American Vernacular English and Whether these early patterns are a function of devel-
Ebonics (see dialect speakers). African-American En- opment or are manifestations of the AAE system is an
glish (AAE), the term preferred here, di¤ers su‰ciently important question. It may be that AAE development
from Standard American English (SAE) to adversely and maturation are uniquely influenced by adult AAE
a¤ect the educational and clinical treatment of African- in ways unlikely for SAE. Some preliminary evidence
American children. In addressing this issue, the Ameri- to support this position comes from the work of Wyatt
can Speech, Language, and Hearing Association (1995), who showed that African-American preschoolers
(ASHA) has taken the o‰cial position that children followed the same adult AAE constraint conditions in
should not be viewed as having a speech and language their optional use of zero copulas. No comparable anal-
problem because they speak AAE (ASHA, 1983). ysis has been done on zero copulas at earlier periods or
ASHA’s position is consistent with that of the Linguistic for developmental SAE patterns, however.
Society of America (LSA), which asserts AAE to be Evidence of di¤erences in acquisition becomes clearer
legitimate, systematic, and rule-governed (LSA, 1997). as children’s language systems mature and AAE patterns
Despite proclamations of this kind, child speakers of become more evident (Washington and Craig, 1994).
Language Disorders in African-American Children 319
Features that once appeared similar between AAE did not contrast between AAE and SAE were better
and SAE begin to disappear in SAE and may even in- predictors of language disorders than those that were
crease in frequency in AAE, as with the zero copula contrastive.
after the age of 3 (Kovac, 1980). Between the preschool Each of the above recommendations can be useful
period and age 5, archetypical AAE features are ob- in identifying possible language disorders. However, to
served in children across socioeconomic levels, but their determine the nature of the problem requires a more in-
density is greater among low socioeconomic classes and depth and complete analysis of the child’s language,
among males (Wyatt, 1995; Washington and Craig, since language disorders are likely to extend beyond only
1998). language behaviors shared between AAE and SAE, or
Although the descriptive accounts of early AAE have only in complex sentences. To ignore AAE features or
provided important information about the character- any aspect of the child’s language in determining the
istics and pervasiveness of child AAE, still limited mile- nature of a problem could yield an incomplete and dis-
stone data exist about age ranges at which language torted profile. Therefore, it is necessary to examine the
structures are mastered and the appropriate form those child’s productive capacity for a variety of targeted lan-
structures should take. In contrast, a rich source of ac- guage structures that have been identified in a represen-
quisition data in SAE establishes when children of vari- tative sample of language and that can be probed further
ous ages acquire language milestones in ways consistent under various linguistic and situational contexts (Sey-
with their SAE peers. This disparity in milestone data mour, 1986). With su‰cient evidence about the nature
between AAE and SAE requires a somewhat di¤erent of a child’s language problems, the foundation then
assessment strategy in order to reduce bias. exists for intervention.
Reduction of Bias in Assessment Methods. Of the sev- Reduction of Bias in Intervention Strategies. Decisions
eral kinds of possible bias in language disorders (Taylor about intervention strategies depend directly on evi-
and Payne, 1983), perhaps the most intractable is lin- dence obtained about the nature of the child’s problem.
guistic and cultural bias associated with existing stand- For reasons stated earlier, this evidence can be more
ardized tests. These tests are biased because they have valid and less biased when alternative or no stan-
not been specifically designed for and standardized dardized testing methods are used. However, because
on AAE. As a consequence, alternative and ‘‘non- these methods are time-consuming and require a mul-
standardized’’ assessment methods have been recom- tiple phase process, Seymour (1986) advocated a
mended (Seymour and Miller-Jones, 1981; Leonard and diagnostic-intervention model in which intervention is
Weiss, 1983; Stockman, 1996). These methods include part of ongoing assessment. In this model, intervention is
language sampling analysis and criterion-referenced lan- based on diagnostic hypotheses formulated from an ini-
guage probes, which are both common methods in tial and tentative diagnosis, and then tested by language
the clinical process and typically complement norm- probes. The process is one of repeatedly formulating
referenced testing. Their specific use with AAE-speaking hypotheses, testing them, and reformulating them, again
children is important because they o¤er a less biased, and again, as needed.
richer, more dynamic and naturalistic source for analysis The test-retest approach is recommended for AAE-
than is found in the more linguistically biased, relatively speaking children largely because of the uncertainty
restrictive, and artificial context of standardized tests. about the nature of AAE. This uncertainty is less a fac-
However, there are disadvantages with language tor in identifying a language disorder, since identification
sampling and language probes as well. They are time- can be made without focusing on AAE features. How-
intensive and possibly less reliable, and they too are ever, when determining the nature of the child’s prob-
limited by the inadequate normative descriptions of lem and treating those problems, AAE features should
AAE. In an attempt to minimize the importance of spe- not be avoided if a complete and accurate account of
cific AAE norms in the assessment process, several the child’s language is the objective. Consider the kind
authors have proposed focusing alternative assessment of diagnostic information needed for an AAE-speaking
methods on those language elements that are not specific child who fails to produce any copulas, unlike optional
to AAE features. Such a focus circumvents di‰cult copula use by his AAE-speaking peers. At least two in-
questions about the status of patterns such as absent tervention strategies are possible: (1) to apply an SAE
language elements. Stockman (1996) proposed the Min- model by targeting copulas wherever they are obligatory
imal Competency Core (MCC), which is a criterion- in SAE, or (2) to follow an AAE model and target cop-
referenced measure that represents the lowest end of a ulas in a manner consistent with optional use. Unfortu-
competency scale of obligatory language patterns that nately, no matter how desirable the latter course of
typically developing children should demonstrate, irre- action might be, it is unlikely without greater knowl-
spective of their language backgrounds. Similarly, Craig edge of the linguistic conditions that determine optional
and Washington (1994) advocated the avoidance of sev- use.
eral AAE-specific features dominated by morphosyntax Consequently, a default to an SAE model in situa-
by focusing on complex sentence constructions common tions where clinical solutions for AAE are not readily
to both AAE and SAE. Also, Seymour, Bland-Stewart, apparent may be inevitable until AAE is more fully
and Green (1998) showed that language features that described and viewed as a complete grammar comprised
of systems (Green, 1995), rather than as simply a list of Further Readings

structures defined by their contrast with SAE. A system’s
account requires answers to some complex linguistic Adler, S., and Birdsong, S. (1983). Reliability and validity of
and social questions about African-American children’s standardized testing tools used with poor children. Topics in
development and use of language in a context charac- Language Disorders, 3, 76–87.
Battle, D. E. (Ed.). (1993). Communication disorders in multi-
terized by linguistic duality.
cultural populations. Boston: Andover.
—Harry N. Seymour Baugh, J. (1983). Black street speech: Its history, structure and
survival. Austin: University of Texas Press.
References Burling, R. (1973). English in black and white. New York: Holt,
Rinehart and Winston.
American Speech-Language-Hearing Association. (1983). Po- Dillard, J. L. (1972). Black English: It’s history and usage in the
sition paper on social dialects. ASHA, 25, 23–25. United States. New York: Random House.
Cole, L. (1980). A developmental analysis of social dialect Edwards, J. R. (1979). Language and disadvantage. Amster-
features in the spontaneous language of preschool Black dam: Elsevier.
children. Dissertation Abstracts International, 41(06), 2132B Green, L. (2002). African American English: A linguistic intro-
(University Microfilms No. AAC8026783). duction. New York: Cambridge University Press.
Craig, H. K., and Washington, J. A. (1994). The complex Kamhi, A. G., Pollock, K. E., and Harris, J. L. (1996). Com-
syntax skills of poor, urban, African American preschoolers munication development and disorders in African American
at school entry. Language, Speech, and Hearing Services in children: Research, assessment, and intervention. Baltimore:
Schools, 25, 181–190. Paul H. Brookes.
Green, L. (1995). Study of verb classes in African American Labov, W. (1970). The logic of nonstandard English. In F.
English. Linguistics and Education, 7, 65–81. Williams (Ed.), Language and poverty. Chicago: Markham.
Kovac, C. (1980). Children’s acquisition of variable features. Labov, W. (1972). Language in the inner city: Studies in the
Dissertation Abstracts International, 42(02), 687A (Univer- Black English Vernacular. Philadelphia: University of
sity Microfilms No. AAC8116548). Pennsylvania Press.
Kuelen, J. E., Weddington, G. T., and Debose, C. E. (1998). Leonard, L. (1998). Children with specific language impairment.
Speech, language, learning, and the African American child. Cambridge, MA: MIT Press.
Boston: Allyn and Bacon. Mufwene, S. S., Rickford, J. R., Bailey, G., and Baugh, J.
Leonard, L. B., and Weiss, A. L. (1983). Application of non- (1998). African-American English. London: Rutledge.
standardized assessment procedures to diverse linguistic Rickford, J. R. (1999). African American Vernacular English.
populations. Topics in Language Disorders, 3, 35–45. Oxford: Blackwell.
Linguistic Society of America. (1997). Resolution on the Oak- Seymour, H. N., and Roeper, T. (1999). Grammatical acquisi-
land ‘‘Ebonics’’ issue. Adopted at the annual meeting, Chi- tion of African American English. In O. Taylor and L.
cago, IL. Leonard (Eds.), Language acquisition across North Amer-
Seymour, H. N. (1986). Clinical intervention for language dis- ica: Cross-cultural and cross-linguistic perspectives (pp.
orders among nonstandard speakers of English. In O. L. 109–153). San Diego, CA: Singular Press Publishing Group.
Taylor (Ed.), Communication disorders in culturally and lin- Seymour, H. N., Abdulkarim, L., and Johnson, V. (1999). The
guistically diverse populations. San Diego, CA: College-Hill Ebonics controversy: An educational and clinical dilemma.
Press. Topics in Language Disorders, 19, 66–77.
Seymour, H. N., Bland-Stewart, L., and Green, L. J. (1998). Smith, E. (1998). What is Black English? What is Ebonics?
Di¤erence versus deficit in child African-American English. In L. Delpit and T. Perry (Eds.), The real Ebonics debate:
Language, Speech, and Hearing Services in Schools, 29, Power, language, and the education of African American
96–108. children. Boston: Beacon Press.
Seymour, H. N., and Miller-Jones, D. (1981). Language and Smitherman, G. (1977). Talkin’ and testifyin’: The language of
cognitive assessment of Black children. In N. Lass (Ed.), Black America. Boston: Houghton-Mi¿in.
Speech and language: Advances in basic research and prac- Stockman, I., and Vaughn-Cooke, F. (1982). Semantic cate-
tice (pp. 203–263). New York: Academic Press. gories in the language of working class black children. In
Stockman, I. (1996). The promise and pitfalls of language C. E. Johnson and C. L. Thew (Eds.), Proceedings of the
sample analysis as an assessment tool for linguistic mi- Second International Child Language Conference, 1, 312–327.
nority children. Language, Speech, and Hearing Services in Stockman, I., and Vaughn-Cooke, F. (1989). Addressing new
Schools, 27, 355–366. questions about black children’s language. In R. Fasold
Taylor, O., and Payne, K. (1983). Culturally valid testing: and D. Shi¤rin (Eds.), Language change and variation (pp.
A proactive approach. Topics in Language Disorders, 3, 274–300). Amsterdam: John Benjamins.
8–20. Taylor, O. (1986). Treatment of communication disorders in
Washington, J., and Craig, H. (1994). Dialectal forms during culturally and linguistically diverse populations. San Diego,
discourse of poor, urban, African American preschoolers. CA: College-Hill Press.
Journal of Speech and Hearing Research, 37, 816–823. Terrell, S., and Terrell, F. (1983). Distinguishing linguistic
Washington, J. A., and Craig, H. K. (1998). Socioeconomic di¤erence from disorders. Topics in Language Disorders, 3,
status and gender influences on children’s dialectal varia- 1–7.
tions. Journal of Speech, Language, and Hearing Research, Vaughn-Cooke, F. B. (1983). Improving language assessment
41, 618–626. in minority children. ASHA, 25, 29–34.
Watkins, R. V., and Rice, M. L. (1994). Specific language Williams, R. L. (Ed.). (1975). Ebonics: The true language of
impairments in children. Baltimore: Paul H. Brookes. Black folks. St. Louis: Institute of Black Studies.
Wyatt, T. (1995). Language development in African Ameri- Wolfram, W., and Fasold, R. W. (1974). The study of social
can English child speech. Linguistics and Education, 7, 7– dialects in American English. Englewood Cli¤s, NJ: Prentice
22. Hall.
Language Disorders in Latino Children 321
Wolfram, W. (1976). Levels of sociolinguistic bias in testing. In population, coupled with greater acceptability of lin-
D. S. Harrison and T. Trabasso (Eds.), Black English: A guistic diversity in the United States, might reverse the
seminar. Hillsdale, NJ: Erlbaum. previous trend, in which immigrants lost their native
language by the third generation (Veltman, 1988). The
more likely trend is for a continuous growth of a Latino
Language Disorders in Latino Children population that is bilingual.
Bilingualism is not a one-dimensional concept.
Rather, bilingualism may be viewed as existing on
The Latino population encompasses a diverse group of several continua representing di¤erent language com-
people who self-identify as descendants of individuals petencies in form, content, and use of the language
who came to the United States from a predominantly (Valdes and Figueroa, 1994). Collectively, these individ-
Spanish-speaking country. Over the past decade, the ual competencies will dictate the child’s linguistic profi-
Latino population in the United States has increased ciency in a language. The degree to which proficiency is
four times faster than the general population (Guzmán, exhibited in any one language at a particular point in
2001). It is estimated that the size of the Latino popula- time is influenced by the situation, the topic, individuals,
tion will represent one-quarter of the U.S. population, or and context (Romaine, 1995; Zentella, 1997). A shift in
approximately 81 million Latinos, by the year 2050. The topic or a shift in participants may result in a switching
large growth of the Latino population is largely attrib- of the code. This type of code switching is a verbal skill
utable to its high fertility rate (National Center for that requires a large degree of linguistic competence.
Health Statistics, 1999); a large proportion of the popu- Code switches are also made by less proficient speakers
lation is in the childbearing years, and families tend to be as a way to compensate for insu‰cient knowledge of one
larger. Children under the age of 15 account for 30.5% language.
of the Latino population. Latino children exhibit varying degrees of proficiency
The Latino population is linguistically diverse with in both English and Spanish. Given the pervasiveness of
respect to dialects and languages spoken. The dialects English language media, the use of Spanish by the ma-
spoken in the country of origin and subsequently jority of Latino families, and the communities in which
brought to the United States evolved from the di¤erent Latinos are raised, it is doubtful that many Latino chil-
regional dialects spoken by the original settlers, the lan- dren reach school age as true monolingual Spanish
guages spoken by the native peoples of the Americas, speakers. Some of the children may be considered to be
and the languages spoken by later immigrants. There are sequential bilinguals because their major exposure prior
two major groups of Spanish dialects, radical and con- to entering school was to Spanish and their linguistic
servative (Guitart, 1996). The radical dialects are spoken skills in English are minimal. These children’s main
primarily in the coastal areas of Spanish-speaking coun- exposure to English will come when they enter school.
tries and the Caribbean, while the conservative dialects Impressionistically, many of these children are indis-
are spoken in the interior parts of the countries. The tinguishable from monolingual Spanish speakers (e.g.,
dialects vary in phonology, morphosyntax, semantics, Spanish-speaking children in Mexico). However, di¤er-
and pragmatics, with the most drastic qualitative di¤erences become apparent when their Spanish is compared
ences seen in phonology and lexicon. The di¤erences in with the language spoken by true monolingual Spanish
morphosyntax are more quantitative than qualitative. speakers (Merino, 1992). Children who have been
The specific dialects spoken by Latino children will be exposed to both languages at home and who tend to
influenced by the dialects spoken in their community. communicate in both languages, the so-called simulta-
Other factors influencing the dialect spoken include the neous bilinguals, show a wide range of linguistic skills in
degree of contact with Spanish and English speakers, English and Spanish by the time they reach the school-
whether the speaker is learning both languages simulta- age years. However, their exposure to and use of Spanish
neously or sequentially, and the prestige attached to the makes even the most English-fluent members of this
various dialects with which the individual comes in con- group di¤erent from their monolingual peers. In envi-
tact (Poplack, 1978; Wolfram, Adger, and Christian, ronments that do not foster the development of the
1999). child’s first language, language attrition occurs. Some
Speaking Spanish is one of the major ties that bind language patterns attributable to language attrition are
the Latino population, and approximately 80% of the similar to patterns seen in children with language dis-
population reportedly speaks it. The vast majority of the orders (e.g., gender errors) (Restrepo, 1998; Anderson,
Latino population consider themselves bilingual; a small 1999).
percentage is monolingual in either English or Spanish. Given the large linguistic variability in the popula-
Twenty-eight percent of the Latino population report tion, di¤erentiating between a language di¤erence (ex-
that they ‘‘do not speak English well’’ or speak it ‘‘not at pected community variation) and a language disorder
all’’ (U.S. Census Bureau, 2000). The number of mono- (communication that deviates significantly from the
lingual Spanish speakers and bilingual English-Spanish norms of the community; Taylor and Payne, 1994) is not
speakers reflects the fact that 35% of the Latino popula- simple. Our existing literature on language development
tion is foreign born and that the majority of foreign-born in Latino children focuses primarily on a limited number
Latinos entered the United States in the last three de- of grammatical structures used by monolingual Spanish-
cades. Continuous immigration and growth of the Latino speaking children (Gutierrez-Clellen, 1998; Goldstein,
2000; Bedore and Leonard, 2001). However, most La- linguistically congruent with those of the children’s fam-
tino children are either bilingual or in the process of ilies (Lynch and Hanson, 1992; Maestas and Erickson,
becoming bilingual, and therefore existing normative 1992; van Kleeck, 1994; Iglesias and Quinn, 1997). The
data on language acquisition by monolingual children language of intervention should be based on the chil-
(e.g., Miller and Leadholm, 1992; Sebastı́an and Slobin, dren’s linguistic competencies, parents’ preference, and
1994) do not accurately represent the language develop- functionality, not on the clinician’s lack of proficiency in
ment of the majority of Latino children. speaking the child’s language. The bilingual literature on
Assessments are further complicated by the fact that typical and atypical language learners strongly supports
most of the available assessment protocols assume a high the notion that intervention should be conducted in the
degree of homogeneity of exposure to the content of test child’s strongest and most environmentally functional
items and to the sociolinguistic aspects of the testing sit- language (Gutierrez-Clellen, 1999). In many cases, this
uation. The cross-cultural child socialization literature will mean using Spanish as the language of intervention.
suggests that Latino children’s home routines are not The skills gained in the acquisition of the first language
always compatible with the content or the routines typi- will facilitate acquisition of the second.
cally required in a language assessment (Iglesias and
Quinn, 1997). Thus, poor performance on a particular —Aquiles Iglesias
assessment may reflect lack of experience rather than a
child’s inability to learn (Peña and Quinn, 1997).
The growing number of Latinos and their over- References
representation in statistical categories that place children
at higher risk for disabilities or developmental delays American Speech-Language-Hearing Association. (1985).
(Arcia et al., 1993; Annie E. Casey Foundation, 2000; Clinical management of communicatively handicapped
Iglesias, 2002) make it imperative that our assessment minority language populations. ASHA, 27, 29–32.
protocols not only accommodate linguistic di¤erences Anderson, R. (1999). Loss of gender agreement in L1 attri-
tion: Preliminary results. Bilingual Research Journal, 23,
across groups but also takes into account children’s
389–407.
experiences. A variety of assessment protocols that take Annie E. Casey Foundation. (2000). Kids count data book.
into consideration the languages and dialects spoken and Baltimore: Author.
the child’s experiences have been suggested (Erickson Arcia, E., Keyes, L., Galalgher, J. J., and Chabhar, M. (1993).
and Iglesias, 1986; Peña, Iglesias, and Lidz, 2001; Wyatt, Status of young Mexican-American children: Implications
2002). These protocols suggest the judicious use of for early intervention systems. Chapel Hill, NC: Carolina
standardized tests, taking into consideration norming Institute for Child and Family Policy.
samples and possible situational test biases, and con- Bedore, L., and Leonard, L. (2001). Grammatical morphology
sideration of alternative nonstandardized assessment deficits in Spanish-speaking children with specific language
procedures such as ethnographic analyses, criterion- impairment. Journal of Speech and Hearing Research, 44,
904–924.
referenced assessments, and dynamic assessments. Fur- Erickson, J., and Iglesias, A. (1986). Assessment of communi-
ther, consistent with IDEA regulations (Individuals with cation disorders in non-English proficient children. In O.
Disabilities Act Amendments of 1997) and ASHA’s po- Taylor (Ed.), Nature of communication disorders in cultur-
sition statement on the assessment of cultural-linguistic ally and linguistically diverse populations (pp. 181–217). San
minority populations (American Speech-Language- Diego, CA: College-Hill Press.
Hearing Association, 1985), the assessments need to be Goldstein, B. (2000). Cultural and linguistic diversity resource
provided and administered in the child’s native lan- guide for speech-language pathologists. San Diego, CA: Sin-
guage. In many cases this will require the examiner to be gular Publishing Group.
bilingual in Spanish and English or will require the use Guitart, J. (1996). Spanish in contact with itself and the pho-
of qualified interpreters (Kayser, 1998). nological characterization of conservative and radical styles.
In J. Jensen (Ed.), Spanish in contact: Issues in bilingualism
The interpretation of the assessment results must take (pp. 151–157). Somerville, MA: Cascadilla Press.
into consideration the growing literature on language Gutierrez-Clellen, V. F. (1998). Syntactic skills of Spanish-
development in Latino populations (Goldstein, 2000); speaking children with low school achievement. Lan-
with recognition that performance may di¤er from the guage, Speech, and Hearing Services in Schools, 29, 207–
expected norm because of a language di¤erence rather 215.
than a language disorder. Although most children will Gutierrez-Clellen, V. F. (1999). Language choice in interven-
show normal development in one or both languages, tion with bilingual children. American Journal of Speech-
some will demonstrate weaker than expected perfor- Language Pathology, 8, 291–302.
mance in one or both languages. The data obtained must Guzmán, B. (2001). The Hispanic population, Census 2000
be carefully examined in the context of the languages briefs. Washington, DC: U.S. Census Bureau.
Iglesias, A. (2002). Latino culture. In D. Battle (Ed.), Commu-
and the dialects to which the child has been exposed and nication disorders in multicultural populations (pp. 179–202).
the experiences the child brings to the testing situation. Woburn, MA: Butterworth-Heinemann.
Intervention, if warranted, will require a culturally Iglesias, A., and Quinn, R. (1997). Culture as a context for
competent approach to services delivery in which the early intervention. In K. Thurman, J. Cornwell, and S. R.
families’ belief systems, including views on disability, are Gottwald (Eds.), Contexts of early intervention: Systems
respected and intervention approaches are culturally and and settings (pp. 55–72). Baltimore: Paul H. Brookes.
Language Disorders in Latino Children 323
Individuals with Disabilities Act Amendments of 1997, USC Further Readings

1400 et seq. (1997).
Kayser, H. (1998). Assessment and intervention resource Anderson, R. (1996). Assessing the grammar of Spanish-
for Hispanic children. San Diego, CA: Singular Publishing speaking children: A comparison of two procedures.
Group. Language, Speech, and Hearing Services in the Schools, 27,
Lynch, E., and Hanson, M. J. (1992). Developing cross-cultural 333–344.
competence: A guide for working with young children and Bedore, L. (1999). The acquisition of Spanish. In O. Taylor
their families. Baltimore: Paul H. Brookes. and L. Leonard (Eds.), Language acquisition in North
Maestas, A. G., and Erickson, J. G. (1992). Mexican immi- America (pp. 167–207). San Diego, CA: Singular Publish-
grant mothers’ beliefs about disabilities. American Journal ing Group.
of Speech-Language Pathology, 1, 5–10. Bedore, L. (2001). Assessing morphosyntax in Spanish-
Merino, B. (1992). Acquisition of syntactic and phonological speaking children. Seminars in Speech and Language, 22,
features in Spanish. In H. W. Langdon and L. R. Cheng 65–77.
(Eds.), Hispanic children and adults with communication Goldstein, B., and Iglesias, A. (2002). Issues of cultural and
disorders (pp. 57–98). Gaithersburg, MD: Aspen. linguistic diversity. In R. Paul (Ed.), Introduction to clinical
Miller, J., and Leadholm, B. (1992). Language sample analysis methods in communication disorders (pp. 261–279). Balti-
guide: The Wisconsin guide for the identification and de- more: Paul H. Brookes.
scription of language impairment in children. Madison: Wis- Gutierrez-Clellen, V. F., and Heinrich-Ramos, L. (1993). Ref-
consin Department of Education. erential cohesion in the narratives of Spanish-speaking
National Center for Health Statistics. (1999). National vital children: A developmental study. Journal of Speech and
statistics (No. 47-18). Washington, DC: National Center for Hearing Research, 36, 559–568.
Health Statistics. Gutierrez-Clellen, V. F., and Ho¤stetter, R. (1994). Syntactic
Peña, E., Iglesias, A., and Lidz, C. S. (2001). Reducing test bias complexity in Spanish narratives: A developmental study.
through dynamic assessment of children’s word learning Journal of Speech and Hearing Research, 35, 363–372.
ability. American Journal of Speech-Language Pathology, Gutierrez-Clellen, V. F., and Quinn, R. (1993). Assessing nar-
10, 138–154. ratives of children from diverse cultural/linguistic groups.
Peña, E., and Quinn, R. (1997). Task familiarity: E¤ects on Language, Speech, and Hearing Services in Schools, 24, 2–9.
the test performance of Puerto Rican and African Ameri- Gutierrez-Clellen, V. F., Restrepo, M. A., Bedore, L., Peña,
can children. Language, Speech, and Hearing Services in E., and Anderson, R. (2000). Language sample analysis in
Schools, 28, 323–332. Spanish-speaking children: Methodological considerations.
Poplack, S. (1978). Dialect acquisition among Puerto Rican Language, Speech, and Hearing Services in Schools, 31, 88–
bilinguals. Language in Society, 7, 89–103. 98.
Restrepo, M. A. (1998). Identifiers of predominantly Spanish- Iglesias, A. (2001). What test should I use? Seminars in Speech
speaking children with language impairment. Journal of and Language, 22, 3–15.
Speech and Hearing Research, 41, 1398–1411. Jackson-Maldonado, D., Thal, D., Marchman, V., Bates, E.,
Romaine, S. (1995). Bilingualism. Cambridge, U.K.: Blackwell. and Gutierrez-Clellen, V. (1993). Early lexical development
Sebastı́an, E., and Slobin, D. I. (1994). Development of of Spanish-speaking infants and toddlers. Journal of Child
linguistic forms: Spanish. In D. I. Slobin (Ed.), Relating Language, 20, 523–549.
events in narrative. A crosslinguistic developmental study Kohnert, K. J., Bates, E., and Hernandez, A. E. (1999). Bal-
(pp. 239–284). Hillsdale, NJ: Erlbaum. ancing bilinguals: Lexical-semantic production and cogni-
Taylor, O., and Payne, K. (1994). Language and communicative processing in children learning Spanish and English.
tion di¤erences. In W. Secord (Ed.), Human communication Journal of Speech and Hearing Research, 41, 1103–1114.
disorders: An introduction (pp. 136–173). Upper Saddle Kvaal, J. T., Shipstead-Cox, N., Nevitt, S. G., Hodson, B., and
River, NJ: Prentice Hall. Launer, P. B. (1988). The acquisition of 10 Spanish mor-
U.S. Census Bureau. (2000). Age by language spoken at home phemes by Spanish-speaking children. Language, Speech,
by ability to speak English for the population 5 years and and Hearing Services in Schools, 19, 384–394.
over. Available: http://www.census.gov. [Accessed Aug. 6, Langdon, H. W. (1994). Meeting the needs of the non-English-
2001.] speaking parents of a communicatively disabled child.
Valdes, G., and Figueroa, R. A. (1994). Bilingualism and test- Clinics in Communication Disorders, 4, 227–236.
ing: A special case of bias. Norwood: Ablex. Oller, D. K., and Eilers, E. R. (2002). Language and literacy in
van Kleeck, A. (1994). Potential bias in training parents as bilingual children. Bu¤alo, NY: Multilingual Matters.
conversational partners with their children who have delays Ortiz, S. A. (2001). Assessment of cognitive abilities in
in language development. American Journal of Speech- Hispanic children. Seminars in Speech and Language, 22,
Language Pathology, 3, 67–78. 17–37.
Veltman, C. J. (1988). The future of the Spanish language in the Perez-Leroux, A. T., and Glass, W. R. (1997). Contemporary
United States. Washington, DC: Hispanic Policy Develop- perspectives on the acquisition of Spanish: Developing gram-
ment Project. mars. Somerville, MA: Cascadilla Press.
Wolfram, W., Adger, C. T., and Christian, D. (1999). Dialects Seliger, H. W., and Vago, R. M. (1991). First language attri-
in schools and communities. Mahwah, NJ: Erlbaum. tion. New York: Cambridge University Press.
Wyatt, T. A. (2002). Assessing the communicative abilities Silva-Corvalan, C. (1995). Spanish in four continents. Wash-
of clients from diverse cultural and language backgrounds. ington, DC: Georgetown University Press.
In D. Battle (Ed.), Communication disorders in multi-
cultural populations (pp. 415–450). Boston: Butterworth-
Heinemann.
Zentella, A. C. (1997). Growing up bilingual. Malden, MA:
Blackwell.
Language Disorders in School-Age you hear in elephant?’’). Standardized tests of phonolog-

ical awareness, such as the Comprehensive Test of Pho-
Children: Aspects of Assessment nological Processing, can also be used to identify deficits
in school-age children.
Approximately 7% of kindergarten children have a pri-
mary language disorder characterized by a significant Syntax and Morphology
delay in the comprehension or production of spoken
language. Although these children have normal intelli- The conversational speech of school-age children with
gence and hearing and are free of obvious neurological language disorders is often characterized by utterances
deficits such as cerebral palsy and severe emotional that are shorter and simpler than those of their peers
disturbances such as autism, their limitations in spoken with typical language development. For example, by age
language often persist throughout childhood, adoles- 10 or 12 years, a typical child can produce discourse
cence, and well into adulthood. such as the following: ‘‘The other day, while I was wait-
In addition, children with this disorder, often called ing for bus 28, which goes to the mall, my friend Harry,
specific language impairment, frequently demonstrate who lives in Brighton, stopped by and wanted to play a
limitations in written language development during the game of checkers. Finally, after we had played about an
school-age years, including problems in decoding print, hour, the bus arrived and we had to stop, but I was glad
comprehending text, spelling, and producing essays for because he had already beaten me four games.’’ These
school assignments. Problems in the social use of lan- two sentences, containing 32 and 28 words, respectively,
guage, particularly during peer interactions, frequently and six verbs each (both finite and nonfinite), would
a¤ect these children as well. be well beyond the syntactic competence of school-age
children with language disorders.
To express the same content, the child with a lan-
Phonology guage disorder might need to employ a series of eight or
Most typically developing children have mastered the ten shorter utterances. Although those utterances might
sound system of their native language by 7 or 8 years of be free of grammatical errors, it is unlikely that they
age. In contrast, children with language disorders may would contain the advanced, low-frequency syntactic
demonstrate errors in the production of speech sounds structures used by the typical child, such as the adverbial
during adolescence. Common errors include distortions clause containing an adjective clause (while I was waiting
of sibilants (s; z) and liquids (l; r), reduction of conso- for bus 28, which goes to the mall ), the elaborated subject
nant clusters (kl; sp), imprecise articulation during rapid (my friend Harry, who lives in Brighton), the perfect as-
speech (Johnson et al., 1999), and di‰culty articulating pect (had played, had beaten), or the adverbial conjunct
polysyllabic words (thermometer, rhinoceros; Catts, finally to link ideas across sentences. In addition, the
1986). Phonological errors can impair intelligibility and child with a language disorder might exhibit numerous
make the child overly self-conscious. For these reasons, false starts, hesitations, and revisions—‘‘maze behavior’’
they should be addressed during language interven- (Loban, 1976)—and may struggle to call up the details
tion. Standardized tests that can be used to identify of the situation, producing a laborious and confusing
phonological disorders include the Arizona Articulation message.
Proficiency Scale and the Goldman-Fristoe Test of Similar problems can be observed in written language
Articulation. when children are asked to produce narrative, persua-
Phonological disorders in young children are some- sive, or expository texts for school assignments. School-
times predictive of later problems in learning to age children with language disorders characteristically
read, particularly when additional deficits in phonologi- produce shorter texts with fewer details, poorer organi-
cal awareness are present. Problems in phonological zation, and a greater number of grammatical and spell-
awareness—the ability to analyze and manipulate the ing errors than their age-matched peers (e.g., Gillam and
sounds of the language—commonly occur in school-age Johnston, 1992; Snowling, Bishop, and Stothard, 2000).
children with language disorders and underlie their di‰- They may also show evidence of morphological di‰-
culties in learning to decode and to spell words (Catts, culties in writing. For example, they may omit the plural
1993; Lombardino et al., 1997). Unfortunately, di‰- and past tense markers (‘‘They use their skate yester-
culties in decoding text can seriously hamper a child’s day’’), or they may fail to use past irregular verbs cor-
reading comprehension, just as di‰culties in spelling can rectly in obligatory contexts (‘‘He teached them to read ’’)
hamper the writing process. It is therefore very impor- even as late as 12 years of age (Windsor, Scott, and
tant that deficits in phonological awareness be addressed Street, 2000). Problems in the use of derivational
during language intervention. morphemes—prefixes and su‰xes such as non-, -tion,
Phonological awareness can be evaluated with a vari- and -ment—also occur in the speech and writing of chil-
ety of tasks. For example, sound deletion requires the dren with language disorders (Moats and Smith, 1992).
child to delete a particular sound or syllable in a word The best way to evaluate a child’s syntax and mor-
(‘‘Say tall without the t,’’ ‘‘Say haircut without the phology is to analyze spoken and written language
hair’’), and sound segmenting requires that the number samples with the aid of computer programs such as
of sounds or syllables in a word be counted (‘‘How many Systematic Analysis of Language Transcripts (Miller
sounds do you hear in crab?’’ ‘‘How many syllables do and Chapman, 2000). Standardized language tests can
Language Disorders in School-Age Children: Aspects of Assessment 325
also be administered, such as the Clinical Evaluation disorders may receive social penalties from their typi-
of Language Fundamentals–3, the Oral and Written cally developing peers in the form of teasing, hurtful
Language Scales, the Test of Language Development– comments, and personal rejection. This type of negative
Primary, and the Test of Language Development– feedback can cause a child to avoid social situations and
Intermediate. the opportunities they present for language develop-
ment. For example, through regular interactions with
peers, children are able to observe others using complex
Semantics language and can themselves practice using appropriate
Compared to their age-matched peers, school-age chil- phonological patterns, syntactic structures, morphemes,
dren with language disorders frequently demonstrate words, and figurative expressions in varied contexts such
limitations in lexical knowledge, particularly in relation as greeting others, having conversations, exchanging
to words that express abstract ( pride, courage), poly- information, agreeing and disagreeing, and persuading
semous (deep, absorbing), or technical (equation, parab- others to do things. A variety of pragmatic di‰culties
ola) meanings (Wiig and Secord, 1998). Figurative have been reported in school-age children with language
expressions such as metaphors (The lawyer was a bull- disorders. In relation to peer interactions, these di‰-
dozer questioning the witness), idioms (throw in the towel, culties include limitations in the ability to access ongoing
read between the lines), and proverbs (Every cloud has a play groups; to collaborate, persuade, and negotiate; to
silver lining) also pose comprehension di‰culties, along engage in extended conversations; and to deliver bad
with slang expressions (grandma lane), sarcasm (‘‘Your news tactfully (Bliss, 1992; Brinton et al., 1997; Brinton,
room is SO clean now!’’), and humor (Q: ‘‘Which sport is Fujiki, and Higbee, 1998; Brinton, Fujiki, and McKee,
the quietest?’’ A: ‘‘Bowling. You can hear a pin drop’’) 1998; Fujiki et al., 2001).
(Nippold and Fey, 1983; Lutzer, 1988; Spector, 1990; Pragmatic development can be evaluated through
Milosky, 1994). Word retrieval, the ability to call up role-playing tasks and social skills rating scales. Al-
words with speed and accuracy, is often impaired as though the specific sources of pragmatic deficits are
well, particularly in relation to low-frequency (tambou- often unclear, in some cases they stem from limitations in
rine) or abstract (religions) words (German, 1994). the child’s social cognition—knowledge of the thoughts,
Deficiencies in semantic development can seriously feelings, and beliefs of others—and from deficits in the
limit a child’s spoken and written communication, af- ability to use words, morphemes, and syntactic struc-
fecting social development and academic progress. For tures that mark politeness and empathy (e.g., ‘‘I don’t
example, di‰culties in word retrieval and humor com- know why you weren’t chosen for the play, but I’m sure
prehension can prevent a child from freely engaging you’ll get the lead next time’’) (Bliss, 1992). These prob-
in telling jokes and riddles, a popular pastime among lems, which can limit a child’s ability to maintain
school-age children. Because teachers’ classroom talk friendships, should be addressed in concert with other
and the textbooks used in schools frequently contain language goals.
di‰cult words and expressions, children with semantic See also language disorders in school-age chil-
deficits often fail to understand much of what they hear dren: overview.
and read. These e¤ects are cyclical, because listening
—Marilyn A. Nippold
and reading themselves are major sources of language-
learning input during the school-age years. As a result,
children who are deficient in listening and reading will References
continue to fall farther behind their peers in language
development as they grow older. Bliss, L. S. (1992). A comparison of tactful messages by chil-
Semantic development in school-age children can be dren with and without language impairment. Language,
evaluated through the use of standardized tests such as Speech, and Hearing Services in Schools, 23, 343–347.
Brinton, B., Fujiki, M., and Higbee, L. M. (1998). Participa-
the Peabody Picture Vocabulary Test–III, the Test of
tion in cooperative learning activities by children with spe-
Word Knowledge, and the Test of Word Finding–2. In- cific language impairment. Journal of Speech, Language,
formal observation of a child’s use of words, phrases, and Hearing Research, 41, 1193–1206.
and expressions in social and academic contexts can also Brinton, B., Fujiki, M., and McKee, L. (1998). Negotiation
be informative. Deficiencies in listening and reading skills of children with specific language impairment. Jour-
comprehension can be identified using standardized nal of Speech, Language, and Hearing Research, 41, 927–
tests such as the Clinical Evaluation of Language 940.
Fundamentals–3 and the Woodcock Reading Mastery Brinton, B., Fujiki, M., Spencer, J. C., and Robinson, L. A.
Tests–Revised, and by inspecting the child’s scores on (1997). The ability of children with specific language im-
academic achievement tests. pairment to access and participate in an ongoing interac-
tion. Journal of Speech, Language, and Hearing Research,
40, 1011–1025.
Pragmatics Catts, H. W. (1986). Speech production/phonological deficits
in reading-disordered children. Journal of Learning Dis-
Problems in pragmatics—the social use of language— abilities, 19, 504–508.
commonly occur in school-age children with language Catts, H. W. (1993). The relationship between speech-language
disorders. Because of their phonological, syntactic, mor- impairments and reading disabilities. Journal of Speech,
phological, and semantic deficits, children with language Language, and Hearing Research, 36, 948–958.
Fujiki, M., Brinton, B., Isaacson, T., and Summers, C. (2001). Fey, M. E., Windsor, J., and Warren, S. E. (Eds.). (1995).
Social behaviors of children with language impairment Language intervention: Preschool through the elementary
on the playground: A pilot study. Language, Speech, and years. Baltimore: Paul H. Brookes.
Hearing Services in Schools, 32, 101–113. Moats, L. C. (2000). Speech to print: Language essentials for
German, D. J. (1994). Word finding di‰culties in children teachers. Baltimore: Paul H. Brookes.
and adolescents. In G. P. Wallach and K. G. Butler (Eds.), Nippold, M. A. (1998). Later language development: The
Language learning disabilities in school-age children and school-age and adolescent years. Austin, TX: Pro-Ed.
adolescents: Some principles and applications (pp. 323–347). Paul, R. (2001). Language disorders from infancy through ado-
New York: Macmillan. lescence: Assessment and intervention (2nd ed.). St. Louis:
Gillam, R. B., and Johnston, J. R. (1992). Spoken and written Mosby.
language relationships in language/learning-impaired and Wallach, G. P., and Butler, K. G. (Eds.). (1994). Language
normally achieving school-age children. Journal of Speech learning disabilities in school-age children and adolescents:
and Hearing Research, 35, 1303–1315. Some principles and applications. New York: Macmillan.
Johnson, C. J., Beitchman, J. H., Young, A., Escobar, M.,
Atkinson, L., Wilson, B., et al. (1999). Fourteen-year follow-
up of children with and without speech/language impair-
ments: Speech/language stability and outcomes. Journal of
Speech, Language, and Hearing Research, 42, 744–760.
Language Disorders in School-Age
Loban, W. (1976). Language development: Kindergarten Children: Overview
through grade twelve. Urbana, IL: National Council of
Teachers of English.
Lombardino, L. J., Riccio, C. A., Hynd, G. W., and Pinheiro, Even though the symptoms and severity of a child’s
S. B. (1997). Linguistic deficits in children with reading dis- language disorder may change over time, language dis-
abilities. American Journal of Speech-Language Pathology, orders tend to be chronic. Preschoolers who are identi-
6, 71–78. fied with language disorders are at substantial risk for
Lutzer, V. D. (1988). Comprehension of proverbs by average experiencing language disorders during the school years
children and children with learning disorders. Journal of
and are also at risk for the academic, social, and voca-
Learning Disabilities, 21, 104–108.
Miller, J., and Chapman, R. (2000). SALT: Systematic Analy- tional di‰culties often associated with language dis-
sis of Language Transcripts [Computer program to ana- orders. Like younger children with language disorders,
lyze language samples]. Madison, WI: Language Analysis school-age children with language disorders are charac-
Laboratory, Waisman Research Center, University of terized by their heterogeneity. This heterogeneity mani-
Wisconsin–Madison. fests itself in the severity of the disorder, with some
Milosky, L. M. (1994). Nonliteral language abilities: Seeing children showing mild grammatical di‰culties, others
the forest for the trees. In G. P. Wallach and K. G. Butler showing no syntactic knowledge, and still others having
(Eds.), Language learning disabilities in school-age children no expressive language. For children with severe lan-
and adolescents: Some principles and applications (pp. 275– guage disorders, spoken language may present inordi-
303). New York: Macmillan.
nate di‰culties. In these instances, children may use
Moats, L. C., and Smith, C. (1992). Derivational morphology:
Why it should be included in language assessment and augmentative forms of communication, such as graphic
instruction. Language, Speech, and Hearing Services in systems, manual signs, and electronic speech output
Schools, 23, 312–319. devices, to facilitate language development or to serve as
Nippold, M. A., and Fey, S. H. (1983). Metaphoric under- alternate forms of communication.
standing in preadolescents having a history of language The heterogeneity of language disorders in school-
acquisition di‰culties. Language, Speech, and Hearing age children is also evident in the particular aspects
Services in Schools, 14, 171–180. of language that are disordered, with some children,
Snowling, M., Bishop, D. V. M., and Stothard, S. E. (2000). Is for example, showing word-finding deficits, others hav-
preschool language impairment a risk factor for dyslexia in ing di‰culty understanding complex directions, and yet
adolescence? Journal of Child Psychology and Psychiatry,
others exhibiting global language deficits. Convention-
41, 587–600.
Spector, C. C. (1990). Linguistic humor comprehension of ally, a distinction is drawn between language disorders
normal and language-impaired adolescents. Journal of that a¤ect only the production of language (expressive
Speech and Hearing Disorders, 55, 533–541. language disorders) and those that a¤ect language com-
Wiig, E., and Secord, W. (1998). Language disabilities in prehension in addition to production (mixed receptive-
school-age children and youth. In G. Shames, E. Wiig, and expressive language disorders). Children with either
W. Secord (Eds.), Human communication disorders: An expressive or mixed language disorders may have a
introduction (pp. 185–226). Boston: Allyn and Bacon. concomitant speech disorder, reflecting di‰culty with
Windsor, J., Scott, C. M., and Street, C. K. (2000). Verb and speech sound representation and/or production. Dis-
noun morphology in the spoken and written language orders of reading and writing also may accompany
of children with language learning disabilities. Journal of
language disorders (see language impairment and
Speech, Language, and Hearing Research, 43, 1322–1336.
reading disability).
For some children, language is the only developmen-
Further Readings tal area in which they experience obvious di‰culty; these
Catts, H., and Kamhi, A. (1999). Language and reading dis- children are often identified as having specific language
abilities. Boston: Allyn and Bacon. impairment (SLI) (see specific language impairment in
Language Disorders in School-Age Children: Overview 327
children). Omission of grammatical markers may be Across populations of children, di‰culties in all do-
the most salient language characteristic in SLI, but it is mains of language, semantic, syntactic, and pragmatic,
not the only language deficit that may hinder a child’s have been found. Current thinking in speech-language
academic performance. In other children the language pathology, however, is not to address individual skills in
disorder is secondary to other cognitive, motor, or sen- isolation but to focus on broader aspects of the child’s
sory disorders. language and the learning environment that will best
Several populations of school-age children are at promote the child’s current and future communicative
risk for language disorders. These populations include success (Fey, Catts, and Larrivee, 1995). This includes
children with developmental disabilities, such as chil- recognizing the link between language, especially pho-
dren with mental retardation, autism, or a pervasive nological awareness (awareness of the sound structure of
developmental disorder, and also children in whom only words), and literacy skills (Catts and Kamhi, 1999). Oral
subtle cognitive deficits are implicated. Among the latter narrative production is another area that has received
are children with learning disabilities or disorders as well attention, in part because the ability to tell a cohesive
as children with attention deficit disorder, characterized story rests on other language and cognitive skills and in
by frequent instances of inattention and impulsiveness, part because good narrative skills seem to be associated
and children with disruptive behavior disorder, marked with good academic performance (Hughes, McGillivray,
by aggressive behavior or the violation of social norms. and Schmidek, 1997). Also, children with language dis-
Children with hearing impairments are also at risk for orders are at risk for fewer and less e¤ective social
language disorders. Although most school-age language interactions than other children of the same age. Thus,
disorders are developmental, children may have acquired the language foundations for social interaction, particu-
language disorders resulting from closed head injuries, larly conversational skills, constitute a major area to be
seizure disorders, or focal lesions such as stroke or addressed.
tumors. Taken together, children with language dis- The school years cover a broad developmental range,
orders constitute a large group of students for whom and language disorders during adolescence are as im-
language poses substantial di‰culties. portant to identify as disorders occurring at earlier ages.
About 5% of students in the United States show However, language development is more gradual and
a learning disorder (American Psychiatric Association, individual in adolescence than it is in younger chil-
1994). Learning disorders are identified as disorders of dren, and identification of a disorder may be particu-
reading, written expression, and mathematics. However, larly challenging. Later language developments, such as
many children with learning disorders appear to have the acquisition of figurative language (e.g., metaphors
an associated di‰culty with spoken language that sub- and idioms), advanced lexical and syntactic skills (e.g.,
stantially a¤ects their ability to meet classroom lan- defining abstract words, using complex sentences), ana-
guage demands. The comorbidity of language disorders, logical reasoning, and e¤ective conversational skills,
learning disorders, and also attention deficit and dis- such as negotiation and persuasion, each develop over
ruptive behavior disorders is well-established. The over- an extended period. At the same time, competence with
lap between disorders is at some level intuitive. For these language skills is fundamental for dealing e¤ec-
instance, children with attention deficit disorder often tively with the academic and social curricula of high
show deficits in executive functions, such as di‰culties in school. Adolescents with language disorders are at risk
goal setting, monitoring behavior, and self-awareness for dropping out of school or in other ways not making a
(Ylvisaker and DeBonis, 2000). These characteristics successful transition to employment or university after
may have deleterious e¤ects on the child’s ability to deal high school. Thus, emphasizing adolescents’ functional
with the complex language tasks encountered in the competence in social communication has been increas-
classroom. ingly advocated.
Traditionally, a distinction was made between lan- Both standardized and nonstandardized measures are
guage delay and language deviance. For example, chil- used to assess school-age language disorders in children
dren with mental retardation were considered to show a and adolescents. Although below-average performance
delayed profile of language development, consistent with on standardized tests compared with chronological or
delay in other cognitive abilities. Children with autism developmental norms remains the primary way of
were considered to show deviant language characterized identifying the presence of a language disorder, criterion-
by patterns not found for typically achieving children. referenced assessments provide a more direct guide
Current research suggests that this global distinction to intervention. In criterion-referenced assessments, the
does not fully capture the language profiles of children emphasis is on how well the child reaches certain levels
with language disorders. Contrary to the idea of simple of achievement rather than on how the child’s language
delay, for example, children and adolescents with Down performance compares with that of other children of the
syndrome show greater deficits in expressive than in re- same age. For instance, criterion-referenced assessments
ceptive language (Chapman et al., 1998). And contrary can be used to determine how well a child understands
to the notion of overall language deviance, children and vocabulary used in classroom textbooks or how e¤ec-
adolescents with autism have been found to produce tively a child initiates conversations with other children.
narratives similar to those of children with mental retar- (Paul, 2001, describes many standardized and criterion-
dation (Tager-Flusberg and Sullivan, 1995). referenced language assessments.)
Another form of nonstandardized assessment focuses Olswang, L. B., Bain, B., and Johnson, G. (1992). Using dy-
on the underlying cognitive processing skills that poten- namic assessment with children with language disorders.
tially are linked to some language disorders. This evalu- In S. F. Warren and J. Reichle (Eds.), Causes and e¤ects
ation includes tasks assessing verbal working memory in communication and language intervention (pp. 187–215).
Baltimore: Paul H. Brookes.
(e.g., recalling an increasing number of real words),
Paul, R. (2001). Language Disorders from Infancy through
phonological working memory (e.g., imitating nonsense Adolescence: Assessment and Intervention (2nd ed). St.
words), and auditory perception (e.g., discriminating Louis: Mosby.
speech and nonspeech sounds). School-age children with Restrepo, M. A. (1998). Identifiers of predominantly Spanish-
language disorders have been distinguished from their speaking children with language impairment. Journal of
age peers by lower accuracy on verbal working memory Speech, Language, and Hearing Research, 41, 1398–1411.
and nonword repetition tasks (Ellis Weismer, Evans, Tager-Flusberg, H., and Sullivan, K. (1995). Attributing men-
and Hesketh, 1999; Ellis Weismer et al., 2000). Dynamic tal states to story characters: A comparison of narratives
assessment also has been advocated as an e¤ective non- produced by autistic and mentally retarded individuals.
standardized assessment strategy (Olswang, Bain, and Applied Psycholinguistics, 16, 241–256.
Ylvisaker, M., and DeBonis, D. (2000). Executive function
Johnson, 1992). In dynamic assessment, aspects of a
impairment in adolescence: TBI and ADHD. Topics in
language task are altered systematically to examine the Language Disorders, 20, 29–57.
conditions under which a child can achieve optimal suc-
cess. Thus, dynamic assessment can be used to deter-
mine a child’s potential for benefiting from intervention,
Further Readings
and also what guidance or structure will be most Bishop, D. V. M. (1994). Grammatical errors in specific lan-
helpful in intervention. Criterion-referenced, processing- guage impairment: Competence or performance limitations?
dependent, and dynamic assessments may be especially Applied Psycholinguistics, 15, 507–550.
important for children from culturally and linguistically Bishop, D. V. M., Bishop, S., Bright, P., James, C., Delaney,
diverse backgrounds, for whom many current stan- T., and Tallal, P. (1999). Di¤erent origin of auditory and
phonological processing problems in children with language
dardized language tests may be inadequate or inappro-
impairment: Evidence from a twin study. Journal of Speech,
priate (Restrepo, 1998; Craig and Washington, 2000). Language, and Hearing Research, 42, 155–168.
See also speech disorders in children: speech- Boudreau, D., and Chapman, R. (2000). The relationship be-
language approaches; specific language impairment tween event representation and linguistic skill in narratives
in children. of children and adolescents with Down syndrome. Journal
—Jennifer Windsor Campbell, T., Dollaghan, C., Needleman, H., and Janosky, J.
(1997). Reducing bias in language assessment: Processing-
References dependent measures. Journal of Speech, Language, and
American Psychiatric Association. (1994). Diagnostic and sta- Chapman, S. B., Watkins, R., Gustafson, C., Moore, S., Levin,
tistical manual of mental disorders—Fourth edition. Wash- H., and Kufera, J. (1997). Narrative discourse in children
ington, DC: American Psychiatric Press. with closed head injury, children with language impairment,
Catts, H. W., and Kamhi, A. (Eds.). (1999). Language and and typically developing children. American Journal of
reading disabilities. Boston: Allyn and Bacon. Speech-Language Pathology, 6, 66–75.
Chapman, R. S., Seung, H., Schwartz, S., and Kay-Raining Fey, M. E., Windsor, J., and Warren, S. (Eds.). (1995). Lan-
Bird, E. (1998). Language skills of children and adolescents guage intervention: Preschool through the elementary years.
with Down syndrome: II. Production deficits. Journal of Baltimore: Paul H. Brookes.
Speech, Language, and Hearing Research, 41, 861–873. Gallagher, T. M. (1999). Interrelationships among children’s
Craig, H., and Washington, J. (2000). An assessment battery language, behavior, and emotional problems. Topics in
for identifying language impairments in African American Language Disorders, 19, 1–15.
children. Journal of Speech, Language, and Hearing Re- Gillam, R., and Johnston, J. (1992). Spoken and written lan-
search, 43, 366–379. guage relationships in language/learning-impaired and nor-
Ellis Weismer, S., Evans, J., and Hesketh, L. (1999). An ex- mally achieving school-age children. Journal of Speech and
amination of verbal working memory capacity in children Hearing Research, 35, 1303–1315.
with specific language impairment. Journal of Speech, Lan- Kay-Raining Bird, E., Cleave, P., and McConnell, L. (2000).
guage, and Hearing Research, 42, 1249–1260. Reading and phonological awareness in children with
Ellis Weismer, S., Tomblin, B., Zhang, X., Buckwalter, P., Down syndrome: A longitudinal study. American Journal of
Chynoweth, J., and Jones, M. (2000). Nonword repetition Speech-Language Pathology, 9, 319–330.
performance in school-age children with and without lan- Leadholm, B. J., and Miller, J. (1992). Language Sample
guage impairment. Journal of Speech, Language, and Hear- Analysis: The Wisconsin guide. Madison: Wisconsin De-
ing Research, 43, 865–878. partment of Public Instruction.
Fey, M. E., Catts, H., and Larrivee, L. (1995). Preparing pre- Leonard, L. B. (1998). Children with specific language impair-
schoolers for the academic and social challenges of school. ment. Cambridge, MA: MIT Press.
In M. E. Fey, J. Windsor, and S. Warren (Eds.), Language Marchman, V. A., Wulfeck, B., and Ellis Weismer, S. (1999).
intervention: Preschool through the elementary years (pp. 3– Morphological productivity in children with normal lan-
37). Baltimore: Paul H. Brookes. guage and SLI: A study of the English past tense. Journal of
Hughes, D., McGillivray, L., and Schmidek, M. (1997). Guide Speech, Language, and Hearing Research, 42, 206–219.
to narrative language: Procedures for assessment. Eau McCauley, R. J. (1996). Familiar strangers: Criterion-
Claire, WI: Thinking Publications. referenced measures in communication disorders. Lan-
Language Impairment and Reading Disability 329
guage, Speech, and Hearing Services in Schools, 27, 122– Poor readers may have di‰culties in vocabulary, gram-
131. mar, or text-level processing (Vogel, 1974; Catts et al.,
McCormick, L., Frome Loeb, D., and Schiefelbusch, R. 1999; McArthur et al., 2000). In at least some cases,
(1997). Supporting children with communication di‰culties in these deficits are severe enough for children to have
inclusive settings: School-based language intervention. Bos-
been identified as language impaired (Catts et al., 1999;
ton: Allyn and Bacon.
Nippold, M. A. (2000). Language development during the McArthur et al., 2000).
adolescent years: Aspects of pragmatics, syntax, and se- In addition to these language deficits, children with
mantics. Topics in Language Disorders, 20, 15–28. reading disabilities have di‰culties in other areas of
Scott, C. M., and Stokes, S. (1995). Measures of syntax in language processing, specifically phonological processing
school-age children and adolescents. Language, Speech, and (Bradley and Bryant, 1983; Fletcher et al., 1994; Catts
Hearing Services in Schools, 26, 309–319. et al., 1999). The most noteworthy of these deficits
Rice, M. L. (1993). ‘‘Don’t talk to him; he’s weird.’’ A social are problems in phonological awareness. Phonological
consequences account of language and social interaction. In awareness is the explicit awareness of, or sensitivity, to
A. P. Kaiser and D. Gray (Eds.), Enhancing Children’s the sounds of speech. It is one’s ability to attend to, re-
Communication: Research Foundations for Intervention
flect on, or manipulate phonemes. Children with reading
(pp. 139–158). Baltimore: Paul H. Brookes.
Romski, M. A., and Sevcik, R. (1992). Developing augmented disabilities are consistently more impaired in phonologi-
language in children with severe mental retardation. In S. F. cal awareness than in any other single ability (Torgesen,
Warren and J. Reichle (Eds.), Causes and e¤ects in commu- 1996). Poor readers often have di‰culties making judg-
nication and language intervention (pp. 113–130). Baltimore: ments about the sounds in words or in their ability to
Paul H. Brookes. segment or blend phonemes. Such problems make it dif-
Scott, C. M., and Windsor, J. (2000). General language per- ficult for children to learn how the alphabet represents
formance measures in spoken and written discourse pro- speech and how this knowledge can be used to decode
duced by school-age children with and without language printed words.
learning disabilities. Journal of Speech, Language, and Children with reading disabilities have also been
reported to have other deficits in phonological process-
Wallach, G. P., and Butler, K. (1994). Language learning dis-
abilities in school-age children and adolescents: Some princi- ing (Wagner and Torgesen, 1987). Poor readers have
ples and applications. New York: Merrill. problems in phonological retrieval (i.e., rapid naming),
Windsor, J., Doyle, S., and Siegel, G. (1994). Language acqui- phonological memory, and phonological production
sition after mutism: A longitudinal case study of autism. (reviewed in Catts and Kamhi, 1999). Although these
Journal of Speech and Hearing Research, 37, 96–105. di‰culties in phonological processing often co-occur
with those in phonological awareness, there are notable
exceptions. For example, one current theory proposes
Language Impairment and Reading that phonological awareness and rapid naming are
somewhat independent, so that poor readers may have
Disability deficits in either area alone or in combination (Wolf and
Bowers, 1999). However, it is proposed that children
Reading is a language-based activity. As such, there is with deficits in both areas, or what is termed a double
frequently an overlap between developmental language deficit, are at greatest risk for reading di‰culties.
impairments and reading disabilities. Evidence of a rela- Although most children with reading disabilities have
tionship between these developmental disabilities has a history of language problems, the overlap between
come from two perspectives. One has been the study of language and reading disabilities is not complete. In
the language development of children with reading dis- each group of poor readers who have been studied, at
abilities, and the other has been the investigation of least some participants do not appear to have a history
the reading outcomes of children with spoken language of language problems. When language problems are
impairments. defined on the basis of di‰culties in vocabulary, gram-
mar, or text-level processing, about half of poor readers
Language Problems in Children with Reading show no evidence of a language impairment (Mattis,
Disabilities 1978; Catts et al., 1999; McArthur et al., 2000). How-
ever, when phonological processing deficits are also
Numerous studies have documented that children with included, the percentage of una¤ected poor readers is
reading disabilities have problems in language develop- about 25%–30% (Catts et al., 1999). These results are
ment. Most investigations have involved concurrent ex- due, at least in part, to the fact that other nonlinguistic
amination of language problems in children with existing factors likely contribute to reading disabilities. Current
reading disabilities (Vogel, 1974; Bradley and Bryant, theories include visual deficits and speed of processing
1983; McArthur et al., 2000), while a few have studied problems as alternative or additional causes of reading
the early language abilities of children who later became problems (Eden et al., 1995; Nicholson and Fawcett,
reading disabled (Scarborough, 1990; Catts et al., 1999). 1999). However, the lack of an apparent association
The latter approach is critical to determining the direc- between reading disabilities and language impairments
tion of causality. may also be the result of discontinuities in the growth of
Children who become poor readers often have lan- various aspects of language and reading abilities. These
guage problems, or at least a history of these problems. discontinuities may obscure the relationship between
language and reading disabilities at certain points in Adams (1990), for example, reported that children with
time and highlight the association at others (Scar- language impairments at age 4 who were no longer lan-
borough, 2001). guage impaired at age 5½ had normal reading achieve-
ment at age 8. Catts et al. (2002) further found that
Reading Outcomes in Children with Language children with language impairments in kindergarten
Impairments who did not show language impairments in second grade
had significantly better reading outcomes than those who
The overlap between language impairments and reading continued to have language problems. Finally, the ‘‘per-
disabilities has also been established by studies of the sistence hypothesis’’ is also supported by evidence that
reading outcomes of children with language impair- change in language impairment status is related to sever-
ments. In the earliest of these studies, children with a ity of language di‰culties, type of language impairment,
clinical history of language impairments were located and nonverbal IQ, each of which has been associated with
later in childhood or adulthood and their academic reading outcome in children with language impairment
achievement was compared with their earlier speech- (Bishop and Adams, 1990, Catts et al., 2002).
language abilities (Aram and Nation, 1980; Hall and These conclusions, however, are compromised some-
Tomblin, 1978). More recently, studies have identified what by long-term follow-up data. Specifically, Stothard
children with language impairment in preschool or et al. (1998) showed that the children studied by Bishop
kindergarten and followed them into the school grades and Adams (1990) who had improved in language
(Bishop and Adams, 1990; Catts, 1993; Stothard et al., abilities by age 5½ and who did not have reading prob-
1998; Rescorla, 2000; Catts et al., 2002). Both lines of lems at age 8½ subsequently did have language and
research indicate that children with a history of language reading problems when tested at age 15. In fact, 52%
impairments are at high risk for reading problems. In were found to read significantly below grade level. These
almost every instance, the reading outcomes of children results are consistent with Scarborough’s proposal of
with language impairment have been found to di¤er sig- ‘‘illusory recovery’’ (Scarborough, 2001). According
nificantly from those of children with typical language to this proposal, children who appear to have resolved
development. In addition, many children with a history their language problems early on, later show language
of language impairments could be classified as reading impairments and demonstrate significant disabilities.
disabled. Across studies, the percentage of children with Scarborough (2001) has argued that illusory recovery
language impairment who have been found to have and apparent relapse may be the result of nonlinear
subsequent reading problems has varied from approxi- growth in language development. She suggests that dif-
mately 40% to 90%, with a median value of about 50%. ferent aspects of language are characterized by spurts
Despite a strong tendency for children with language and plateaus in growth. Thus, individual di¤erences in
impairment to develop reading problems, not all of language development may be more apparent at some
these children become poor readers. Research indicates stages than others.
that the type and severity of the language disorder are Nonlinear growth in di¤erent aspects of reading
related to reading outcome. Children with more severe development may also influence the observation of a re-
or broader-based language impairments are at greater lationship between language and reading disabilities.
risk for reading disabilities than those with less severe For example, in the early stages, reading development
problems or problems confined to a single dimension is characterized by rapid improvement in word recogni-
of language (i.e., expressive language) (Rescorla, 2000; tion skills, which rest heavily on phonological processing
Catts et al., 2002). Also, children with language impair- abilities. At later stages, individual di¤erences become
ments who have concomitant nonverbal cognitive deficits more related to language comprehension abilities (Hoo-
(i.e., low nonverbal IQ) have poorer reading achievement ver and Gough, 1990). Thus, children with deficits in
than those with normal nonverbal cognitive abilities phonological processing will most likely have problems
(Bishop and Adams, 1990; Catts et al., 2002). Phonolog- in the early stages of learning to read, while those with
ical processing abilities are also related to reading out- vocabulary and grammar deficits will be especially at
come in children with language impairments. For risk in the later stages of reading development.
example, Catts (1993) found that measures of phonologi- In summary, the two lines of research reviewed here
cal awareness and rapid naming were predictive of read- converge in support of a close relationship between lan-
ing achievement, especially word recognition abilities, in guage impairments and reading disabilities. However,
children with language or articulation impairments. this relationship is not complete and may be compli-
The persistence of a language impairment may be an cated by nonlinearities in both language and reading
important predictor of reading outcome in many chil- development.
dren. Some children with language impairment appear
to resolve their language di‰culties prior to school en- —Hugh W. Catts
try, while others continue to manifest language impair-
ments into the school years. Those who continue to have References
significant language problems are at a much greater risk Aram, D., and Nation, J. (1980). Preschool language disorders
for reading disabilities in the early school grades than and subsequent language and academic di‰culties. Journal
those with improved language abilities. Bishop and of Communication Disorders, 13, 159–179.
Language Impairment in Children: Cross-Linguistic Studies 331
Bishop, D. V. M., and Adams, C. (1990). A prospective study Language Impairment in Children:
of the relationship between specific language impairment,
phonological disorders and reading retardation. Journal Cross-Linguistic Studies
Child Psychology and Psychiatry, 31, 1027–1050.
Bradley, L., and Bryant, P. (1983). Categorizing sounds
and learning to read: A causal connection. Nature, 301,
419–421. For many years the study of language impairment in
Catts, H. (1993). The relationship between speech-language children focused almost exclusively on children learning
impairments and reading disabilities. Journal of Speech and English. The past decade has seen a decided and salutary
Hearing Research, 36, 948–958. broadening of this exclusive focus as researchers from
Catts, H. W., Fey, M. E., Tomblin, J. B., and Zhang, X. many nations have become interested in understanding
(2002). A longitudinal investigation of reading outcomes the impaired language acquisition of children speaking
of children with language impairments. Journal of Speech, diverse languages. The concern for understanding the
Language, and Hearing Research, 45, 1142–1157. cross-linguistic nature of developmental language im-
Catts, H. W., Fey, M. E., Zhang, X., and Tomblin, J. B. pairment has followed in the path of investigations into
(1999). Language basis of reading and reading dis-
the cross-linguistic nature of typical language develop-
abilities: Evidence from a longitudinal investigation. Scien-
tific Studies in Reading, 3, 331–361. ment that were initiated by Slobin and his colleagues
Catts, H. W., and Kamhi, A. G. (1999). Language and reading (e.g., Slobin, 1985).
disabilities. Needham Heights, MA: Allyn and Bacon. Cross-linguistic studies of language impairment in
Eden, G. F., Stein, J. F., Wood, M. H., and Wood, F. B. children have both a theoretical and a practical impetus.
(1995). Verbal and visual problems in reading disability. Theoretically, the study of typical and impaired lan-
Journal of Learning Disabilities, 28, 272–290. guage development across a number of structurally dif-
Fletcher, J. M., Shaywitz, S. E., Shankweiler, D. P., Katz, L., ferent languages should reveal commonalities as well as
Liberman, I. Y., Stuebing, K. K., et al. (1994). Cognitive di¤erences in how children learn languages. Such studies
profiles of reading disability: Comparisons of discrepancy should help researchers determine what is universal and
and low achievement definitions. Journal of Educational
Psychology, 86, 6–23.
what is variable in the ways that children learn lan-
Hall, P., and Tomblin, J. B. (1978). A follow-up study of chil- guages. By sorting the variable from the universal, such
dren with articulation and language disorders. Journal of research enhances knowledge about the properties of
Speech and Hearing Disorders, 43, 227–241. language development that are general to the process of
Hoover, W. A., and Gough, P. B. (1990). The simple view of language learning and those that are determined by the
reading. Reading and Writing: An Interdisciplinary Journal, structure of the language that the child is exposed to.
2, 127–160. Practically, understanding what form impairment takes
Mattis, S. (1978). Dyslexia syndromes: A working hypothesis in various languages improves the possibilities for as-
that works. In A. L. Benton and D. Pearl (Eds.), Dyslexia: sessment and treatment in those languages. Understand-
An appraisal of current knowledge (pp. 45–58). New York: ing the contribution of input to the nature and timing
Oxford University Press.
McArthur, G. M., Hogben, J. H., Edwards, V. T., Heath,
of acquisition in a specific language has important con-
S. M., and Mengler, E. D. (2000). On the ‘‘specifics’’ of sequences for developing intervention approaches. On
specific reading disability and specific language impairment. the other hand, understanding the common properties of
Journal of Child Psychology and Psychiatry, 41, 869–874. language learning helps to clarify in what ways impair-
Nicholson, R. I., and Fawcett, A. J. (1999). Developmental ment results from possible disruptions of basic human
dyslexia: The role of the cerebellum. Dyslexia, 5, 155–177. biological and cognitive mechanisms.
Rescorla, L. (2000). Do late-talkers turn out to have reading Most cross-linguistic studies of children with language
di‰culties a decade later? Annals of Dyslexia, 50, 87–102. impairment have been concerned with the morphological
Scarborough, H. S. (1990). Very early language deficits in dys- deficits experienced by children with specific language
lexic children. Child Development, 61, 1728–1743. impairment (SLI) (see specific language impairment in
Scarborough, H. S. (2001). Connecting early language and lit-
eracy to later reading (dis)abilities: Evidence, theory, and
children). Some such grammatical deficits are found to
practice. In S. Neuman and D. Dickison (Eds.), Handbook occur in almost all languages that have been studied so
for early literacy research. New York: Guilford Press. far, regardless of their structure. Other deficits are par-
Stothard, S. E., Snowling, M. J., Bishop, D. V. M., Chip- ticular to individual language families. One deficit that
chase, B., and Kaplan, C. (1998). Language impaired pre- appears to exist in many languages has to do with the
schoolers: A follow-up in adolescence. Journal of Speech, acquisition of verbal inflection. Finite verbal inflections
Language, and Hearing Research, 41, 407–418. and auxiliary verbs produced by children with SLI are
Torgesen, J. (1996). Phonological awareness: A critical factor in optionally missing or substituted for in the following
dyslexia. Baltimore: Orton Dyslexia Society. languages: German (Clahsen, Bartke, and Gollner, 1997;
Vogel, S. A. (1974). Syntactic abilities in normal and dyslexic Rice, Ru¤ Noll, and Grimm, 1997), Dutch (de Jong,
children. Journal of Learning Disabilities, 7, 47–53.
Wagner, R. K., and Torgesen, J. K. (1987). The nature of
1999), Swedish (Hansson, 1997), Norwegian (Meyer
phonological processing and its causal role in the acquisi- Bjerkan, 1999), English (Rice and Wexler, 1996), French
tion of reading skills. Psychological Bulletin, 101, 1–21. (Jakubowicz and Nash, 2001; Paradis and Crago, 2001),
Wolf, M., and Bowers, P. G. (1999). The double-deficit hy- Italian (Bottari, Cipriani, and Chilosi, 1996; Bortolini,
pothesis for the developmental dyslexias. Journal of Learn- Caselli, and Leonard, 1997; Bottari et al., 2001), Japa-
ing Disabilities, 91, 415–438. nese (Fukuda and Fukuda, 2001), Greek (Clahsen and
Dalalakis, 1999), Inuktitut (Crago and Allen, 2001), and morphology. However, succeeding studies of other lan-
Arabic (Abdalla, 2002). guages with very phonologically salient verbal inflec-
Various language families show interesting patterns tions, such as Inuktitut and Japanese, have shown that
that relate to the structure or typology of those particu- children with SLI did, in fact, have di‰culty with the
lar groups of languages. For example, studies of Ger- acquisition of verbal inflections. This demonstrates how
manic languages such as Dutch, Swedish, and German a series of cross-linguistic studies can be useful in estab-
have shown that there are word order consequences re- lishing whether a certain theoretical explanation of SLI
lated to the omission of finite verb inflections and auxil- is meaningful across languages.
iaries in the speech of children with SLI. Even though the number of languages that are being
Studies of Romance languages such as Italian and studied is expanding, there are only a few studies that
French have demonstrated that children with SLI have can be considered truly cross-linguistic in design. These
greater trouble acquiring the past tense than the present few genuine cross-linguistic studies involve either a for-
tense, while English-speaking children with SLI have mat that compares in one study two groups of subjects
di‰culty with both tenses. Impaired speakers of French who speak two di¤erent languages or a specific language
and Italian also have di‰culty with the production of in which the grammatical variables tested are as identical
object clitics. Surprisingly, however, Italian and French as possible to a previously studied language. In addition,
children with SLI di¤er in their acquisition of deter- studies in which investigators have examined the gram-
miners. Italian-speaking children have more di‰culty matical properties of children speaking a single specific
with this aspect of their grammar than the French- language also display some methodological short-
speaking children. It is unfortunate that in the family of comings impeding well-founded conclusions. They have
Romance languages, there are no comparable acquisi- varied in the age and criteria of selection of the children
tion studies of children with SLI learning Spanish, either being studied. They have also varied in the particular
in the Americas or in Europe. properties of the children’s grammar that were assessed.
Speakers of non-Indo-European languages such as Regrettably, such design issues have made the kinds of
Inuktitut and Arabic have di¤erent patterns of impair- comparisons that are essential to establishing the uni-
ment than other groups. For instance, Arabic speakers versal and variable properties of acquisition and impair-
replace incorrect verbal inflections with a default form ment often inconclusive.
that is typically tense-bearing in the adult language. This In summary, despite certain limitations, the expan-
is di¤erent from the Germanic and Romance languages, sion of studies of language impairment in children to
where tense-bearing morphemes, such as verbal inflec- include a wider variety of languages has enlivened
tions or auxiliary verbs, tend to be dropped, resulting in theoretical debate, led to new, linguistically based
a nonfinite verb form such as an infinitive, a participle, understandings, and enriched perspectives on this com-
or the verb stem appearing as the main verb in the sen- municative disorder. It is important that studies of lan-
tence. Specific language impairment in Inuktitut has guage impairment in childhood continue to encompass
been shown to present yet another dimension. Here, the more and di¤erent languages. In fact, languages spoken
trouble a child with impairment had with verbal inflec- by the vast majority of children in the world’s popula-
tion did not resemble younger normally developing chil- tion remain virtually unexplored. It is equally important
dren. This is a di¤erent pattern than has been found in that more studies be designed to be truly cross-linguistic
many other languages where children with SLI show a in nature, with variables and criteria for impairment
pattern of optional verbal inflection that resembles that established as uniformly as possible. The study of bilin-
of younger normally developing children. gual children with impairment provides a unique oppor-
Cross-linguistic studies can also play a particularly tunity for the cross-linguistic observation of language
useful role in verifying hypotheses about the nature of impairment. These children are perfectly matched to
the deficits experienced by children with SLI. They allow themselves as the learners of two languages (see bilin-
researchers to check out explanations based on particu- gualism and language impairment). Finally, just as
lar languages with results from typologically di¤erent important as understanding how children with language
languages. For instance, Leonard, one of the pioneers impairment learn di¤erent languages is recognizing that
of cross-linguistic studies of child language impairment, they are also members of di¤erent cultures. Language
hypothesized from his study of English-speaking chil- and culture are inextricably linked and as such they in-
dren that a plausible explanation for the grammatical fluence each other as well as the manifestations of and
deficit in SLI was that these children had di‰culty beliefs about childhood language impairment.
establishing a learning paradigm for morphology in lan-
guages where the morphemes had low phonological —Martha Crago and Johanne Paradis
saliency. He and his colleagues sought systematic cross-
linguistic verification for this explanation by studying References
children with SLI who were learning languages with Abdalla, F. (2002). Specific language impairment in Arabic-
more salient morphology, such as Italian (Leonard et al., speaking children: Deficits in morphosyntax. Unpublished
1992) or Hebrew (Dromi, Leonard, and Shteiman, 1993; dissertation, McGill University, Montreal.
Dromi et al., 1999). Indeed, such children appeared to Bortolini, U., Caselli, M. C., and Leonard, L. (1997). Gram-
have less di‰culty with the acquisition of their verbal matical deficits in Italian-speaking children with specific
Language in Children Who Stutter 333
language impairment. Journal of Speech, Language, and Crago, M., and Paradis, J. (2003). Two of a kind: Common-
Hearing Research, 40, 809–820. alities and variation in across learners and languages. In Y.
Bottari, P., Cipriani, P., and Chilosi, A. (1996). Root infinitives Levy and J. Schae¤er (Eds.), Linguistic competence across
in Italian SLI children. In A. Stringfellow, D. Cahana- populations: Towards a definition of Specific Language Im-
Amitay, E. Hughes, and A. Zukowski (Eds.), BUCLD 20 pairment (pp. 97–110). Mahwah, NJ: Erlbaum.
(pp. 75–86). Somerville, MA: Cascadilla Press. Hakansson, G. (1998). Language impairment and the realiza-
Bottari, P., Cipriani, P., Chilosi, A., and Pfanner, L. (2001). tion of finiteness. In A. Greenhill, M. Hughes, H. Littlefield,
The Italian determiner system in normal acquisition, spe- and H. Walsh (Eds.), BUCLD 22 (pp. 314–324). Somer-
cific language impairment, and childhood aphasia. Brain ville, MA: Cascadilla Press.
and Language, 77, 283–293. Jakubowicz, C., Nash, L., Rigaut, C., and Gérard, C. (1998).
Clahsen, H., Bartke, S., and Göllner, S. (1997). Formal features Determiners and clitic pronouns in French-speaking chil-
in impaired grammars: A comparison of English and Ger- dren with SLI. Language Acquisition, 7, 113–160.
man SLI children. Journal of Neurolinguistics, 10, 151–171. Jakubowicz, C., Nash, L., and van der Velde, M. (1999). In-
Clahsen, H., and Dalalakis, J. (1999). Tense and agreement in flection and past tense morphology in French SLI. In A.
Greek SLI: A case study. Essex Research Reports in Lin- Greenhill, H. Littlefield, and C. Tano (Eds.), BUCLD 23
guistics, 1–25. (pp. 289–300). Somerville, MA: Cascadilla Press.
Crago, M., and Allen, S. (2001). Early finiteness in Inuktitut: Paradis, J., and Crago, M. (2000). Tense and temporality: A
The role of language structure and input. Language Acqui- comparison between children learning a second language
sition, 9(1), 59–111. and children with SLI. Journal of Speech, Language, and
de Jong, J. (1999). Specific language impairment in Dutch: Hearing Research, 43, 834–847.
Inflectional morphology and argument structure. Enschede, Paradis, J., Crago, M., Genesee, F., and Rice, M. (2000). Dual
The Netherlands: Print Partners Ipskamp. language disorders: Extended optional infinitive in bilingual
Dromi, E., Leonard, L., Adam, G., and Zadunaisky-Ehrlich, children with specific language impairment. Paper presented
S. (1999). Verb agreement morphology in Hebrew-speaking at the Boston University Conference on Language Devel-
children with specific language impairment. Journal of opment, Boston.
Speech, Language, and Hearing Research, 42, 1414–1431. Roberts, S., and Leonard, L. (1997). Grammatical deficits
Dromi, E., Leonard, L., and Shteiman, M. (1993). The gram- in German and English: A crosslinguistic study of chil-
matical morphology of Hebrew-speaking children with spe- dren with specific language impairment. First Language,
cific language impairment: Some competing hypotheses. 17, 131–150.
Fukuda, S., and Fukuda, S. (2001). The acquisition of complex
predicates in Japanese specifically language-impaired and
normally developing children. Brain and Language, 77, Language in Children Who Stutter
305–320.
Hansson, K. (1997). Patterns of verb usage in Swedish children
with SLI: An application of recent theories. First Language, A connection between language and stuttering in young
17, 195–217. children is intuitive. As noted by Yairi (1983) and others
Jakubowicz, C., and Nash, L. (2001). Functional categories
(e.g., Ratner, 1997), stuttering first appears in children
and syntactic operations in (ab)normal language acquisi-
tion. Brain and Language, 77, 321–339. between ages 2 and 4 years, during a time of rapid
Leonard, L., Bortolini, U., Caselli, M., McGregor, K., and expansion in expressive and receptive language ability.
Sabbadini, L. (1992). Morphological deficits in children Moreover, the repetitions and prolongations that char-
with specific language impairment: The status of features in acterize stuttering are observed as the child uses sounds
the underlying grammar. Language Acquisition, 2, 151–179. to form words and words to form phrases and sentences.
Meyer Bjerkan, K. (1999). Do SLI children have an optional The apparent link between domains has given rise to
infinitive stage? San Sebastian, Spain: International Associ- theoretical accounts of stuttering that emphasize lin-
ation for the Study of Child Language. guistic variables. For example, one working account of
Paradis, J., and Crago, M. (2001). The morphosyntax of spe- stuttering suggests that underlying di‰culties with pho-
cific language impairment in French: An extended optional
nological encoding, di‰culties that self-correct prior
default account. Language Acquisition, 9(4), 269–300.
Rice, M., Ru¤ Noll, K., and Grimm, H. (1997). An extended to actual language production but that slow language
optional infinitive stage in German-speaking children with processing, yield disfluencies (Postma and Kolk, 1993).
specific language impairment. Language Acquisition, 6, Linguistic factors are implicated in several other theo-
255–296. retical accounts of stuttering as well (Wingate, 1988;
Rice, M., and Wexler, K. (1996). Toward tense as a clinical Perkins, Kent, and Curlee, 1991).
marker of specific language impairment. Journal of Speech, Despite the intuitive appeal of connections between
Language, and Hearing Research, 39, 1236–1257. language and stuttering, many of the most fundamental
Slobin, D. (1985). The cross-linguistic study of language acqui- questions in this area of inquiry continue to be debated.
sition. Mahwah, NJ: Erlbaum. The language abilities of young children who stutter
have been the focus of research and controversy for
Further Readings many years (see Yairi et al., 2001, and Wingate, 2001,
Bastiaanse, R., and Bol, G. (2001). Verb inflection and verb for examples of the ongoing dialogue on this topic). In
diversity in three populations: Agrammatic speakers, nor- addition to examinations of the language development
mally developing children, and children with specific lan- status of young children who stutter, the connection be-
guage impairment. Brain and Language, 77, 274–282. tween language and stuttering in young children has
been studied in other ways, namely, through evaluation

of linguistic variables that appear to exert an influence
on stuttering behavior. Relevant linguistic variables in-
clude grammatical complexity and location of stuttering
events in the language planning or production process.
In both areas, the developmental status of children who
stutter and the study of linguistic influences on stutter-
ing, a growing body of knowledge speaks to the associ-
ations of language and stuttering in young children. This
article highlights key research findings in these areas and
summarizes what is known about the interface of lan-
guage and stuttering in young children.
Figure 1. Mean length of utterance, compared with normative
Language Ability and Stuttering in Young expectations, for children whose stuttering persists or recovers.
(From Watkins, R. V., Yairi, E., and Ambrose, N. G. [1999].
Children Early childhood stuttering: III. Initial status of expressive lan-
The scholarly literature reveals a relatively longstand- guage abilities. Journal of Speech, Language, and Hearing Re-
ing view of the child who stutters as more likely to search, 42, 1125–1135. Reproduced with permission.)
have language learning di‰culties or impairments than
typically developing peers. Through analysis of sponta-
neous language sample data, a group of scholars has in young children who stutter. More than a decade ago,
empirically evaluated the expressive language abilities of Nippold (1990) reported that there was no compelling
a large cohort of young children who stutter (Watkins evidence that children who stuttered had a higher rate of
and Yairi, 1997; Watkins, Yairi, and Ambrose, 1999). language learning di‰culties than the general popula-
The Illinois Stuttering Research Project has prospec- tion. In 2001, Miles and Ratner reported the perfor-
tively tracked a group of young children who stutter, mance of a group of young children who stuttered on a
beginning as near stuttering onset as possible and con- range of expressive and receptive language measures; the
tinuing longitudinally for a number of years to monitor children in their sample scored at or slightly above the
persistence in versus recovery from stuttering. This average level of performance on every reported measure
work has focused on expressive language abilities, com- (i.e., the group scored at or above a percentile rank of
paring the performance of young children who stutter 50 or at or above a standard score of 100 on the mea-
with normative expectations on a range of language sures used). Rommel and colleages (1999) in Germany
sample measures, such as mean length of utterance reported that preschool-age participants who stuttered
(MLU, a general index of grammatical ability), number had language skills at or above age expectations.
of di¤erent words (NDW, a general measure of vocabu- Anderson and Conture (2000) also reported language
lary skills), and Developmental Sentence Score (DSS, an scores at or above average for a group of children who
index of grammatical skills). The researchers found that, stuttered.
as a group, children who stutter perform at or above In light of these findings, there is little empirical sup-
normative expectations in their expressive language port for the hypothesis that language development and
skills. More specifically, Watkins, Yairi, and Ambrose stuttering are linked to a common, underlying commu-
(1999) reported data based on analysis of 83 pre- nication di‰culty, at least in any significant number of
schoolers who stuttered. Children who entered the study young children. On closer examination of the research
between the ages of 2 and 3 years (i.e., exhibited stutter- literature, methodological issues appear that may ac-
ing onset between ages 2 and 3 years) scored about 1 SD count for the view that children who stutter frequently
above normative expectations on several expressive lan- have concomitant language di‰culties. Several early
guage measures calculated from spontaneous samples. studies of language ability in young children who stut-
Children who entered the study between the ages of 3 tered did not consider socioeconomic status, potentially
and 4 years or 4 and 5 years performed at or near nor- comparing young children who stuttered and were
mative expectations. Interestingly, the children whose from lower or middle-income backgrounds with typi-
stuttering would ultimately persist (roughly 25% of the cally developing youngsters from university-based fami-
total sample) did not di¤er in expressive language skills lies (see Ratner, 1997). Furthermore, several past studies
from the children who would later recover from stutter- of language ability in young children who stuttered
ing, when their language skills were compared near did not evaluate the children’s skills in light of nor-
the time of stuttering onset. Figure 1 provides a sample mative expectations. Other studies reported higher than
of the findings of Watkins, Yairi, and Ambrose (1999), expected rates of concomitant language disabilities in
showing the MLU for children who entered the longitu- young children who stuttered but evaluated children
dinal study at three di¤erent age groupings relative to long after stuttering onset, or included children of very
normative expectations. di¤erent ages. When relations between language ability
The findings of several other investigators lend sup- and stuttering are examined long after stuttering onset,
port to these results regarding expressive language skills children may well have learned to adapt their expressive
Language in Children Who Stutter 335
language in various ways in order to limit or reduce linguistic factors appear to influence disfluencies in the
stuttering events. Such studies may be interesting, same way for stutterers and nonstutterers alike. That is,
but they ask very di¤erent questions from those ad- linguistic variables such as grammatical complexity and
dressed by investigations of language skills near the on- the loci of stuttering tend to influence individuals whose
set of stuttering. Any or all of these methodological disfluencies occur at typical rates in language production
choices could have considerable impact on findings per- as well as individuals whose disfluencies are frequent
taining to patterns and pathways of language acquisi- enough to yield identification as a ‘‘stutterer.’’
tion in youngsters who stutter, and all could be in the The domain of language is relevant in the study of
direction that predict a less favorable performance for early childhood stuttering. The majority of young chil-
children who stutter in comparison with typically devel- dren who stutter display expressive language abilities at
oping children. or above normative expectations. In addition, a number
In general, there is a growing consensus that lan- of linguistic variables, such as the grammatical com-
guage development is not particularly vulnerable in plexity of an utterance, exert an influence on the likeli-
young children who stutter. Continued study of lan- hood of a stuttering event. It may be informative for
guage strengths or challenges in conjunction with stut- future investigation in this area of inquiry to move
tering may reveal developmental asynchronies (e.g., toward detailed and specific analyses of profiles of lan-
perhaps early precocious language skill is a particular guage strength and evaluation of synchrony versus
risk factor for stuttering, or perhaps accelerated lan- asynchrony within and across developmental domains.
guage development in one domain, such as syntax or See also speech disfluency and stuttering in
semantics, creates di‰culties with fluency when profi- children.
ciencies in other domains, such as motoric abilities, are
less sophisticated). These possibilities await empirical —Ruth V. Watkins
study and will require detailed linguistic analyses to
evaluate. References
Anderson, J., and Conture, E. (2000). Language abilities of
Linguistic Influences on Stuttering children who stutter: A preliminary study. Journal of Flu-
ency Disorders, 25, 283–304.
There is ample evidence that stuttering events are influ- Au-Yeung, J., Howell, P., and Pilgrim, L. (1998). Phonological
enced by linguistic variables. Brown (1945) was perhaps words and stuttering on function words. Journal of Speech,
the first researcher to suggest linguistic influences on Language, and Hearing Research, 41, 1019–1030.
stuttering events with his groundbreaking report of ap- Brown, S. F. (1945). The loci of stutterings in the speech se-
parent influences of a word’s grammatical form class quence. Journal of Speech Disorders, 10, 181–192.
(i.e., content versus function word) on stuttering loci Logan, K. J., and Conture, E. G. (1995). Length, grammatical
in adults who stuttered. In brief, Brown reported that complexity, and rate di¤erences in stuttered and fluent con-
versational utterances of children who stutter. Journal of
adults who stuttered were significantly more likely to be
Fluency Disorders, 20, 35–61.
disfluent on content words (e.g., nouns and verbs) than Miles, S., and Ratner, N. B. (2001). Parental language input to
on function words (e.g., prepositions and pronouns). children at stuttering onset. Journal of Speech, Language,
Since Brown’s seminal work, researchers have con- and Hearing Research, 44, 1116–1130.
tinually refined analyses in the study of linguistic influ- Nippold, M. (1990). Concomitant speech and language dis-
ences on stuttering. We now know, for example, that orders in stuttering children: A critique of the literature.
young children are generally more likely to stutter on Journal of Speech and Hearing Disorders, 55, 51–60.
sentences of greater grammatical complexity than on Perkins, W. H., Kent, R. D., and Curlee, R. F. (1991). A theory
sentences of less grammatical complexity (Logan and of neuropsycholinguistic function in stuttering. Journal of
Conture, 1995; Yaruss, 1999). Furthermore, the content- Speech and Hearing Research, 34, 734–752.
Postma, A., and Kolk, H. (1993). The covert repair hypothesis:
function variable appears not to be the most relevant Prearticulatory repair processes in normal and stuttered
influence on stuttering loci; instead, stuttering events disfluencies. Journal of Speech and Hearing Research, 36,
are significantly more likely on either a content word or 472–487.
on a phrase-initial function word that precedes a con- Ratner, N. B. (1997). Stuttering: A psycholinguistic perspec-
tent word than in other phrasal locations (Au-Yeung, tive. In R. Curlee and G. Siegel (Eds.), Nature and treat-
Howell, and Pilgrim, 1998). The underlying influence ment of stuttering: New directions (2nd ed., pp. 99–127).
here is thought to be the planning unit in language for- Boston: Allyn and Bacon.
mulation, such that disfluencies are significantly more Rommel, D., Hage, A., Kalehne, P., and Johannsen, H. S.
likely to occur at the beginning of a language planning (1999). Developmental, maintenance, and recovery of
unit, when remaining components of the unit continue to childhood stuttering: Prospective longitudinal data 3 years
after first contact. In K. Baker, L. Rustin, and K. Baker
be refined for production. (Eds.), Proceedings of the Fifth Oxford Disfluency Confer-
These findings reveal that aspects of language plan- ence. Berkshire, U.K.: Chappel Gardner.
ning, formulation, and production exert an influence on Watkins, R. V., and Yairi, E. (1997). Language production
stuttering for children and adults. These findings support abilities of children whose stuttering persisted or recov-
the view that linguistic variables are relevant in charac- ered. Journal of Speech and Hearing Research, 40, 385–
terizing stuttering events. It is noteworthy, however, that 399.
Watkins, R. V., Yairi, E., and Ambrose, N. G. (1999). Early and there is an abrupt decline in non-speech-like vocal-
childhood stuttering: III. Initial status of expressive lan- izations. Canonical babbling increasingly uses the pho-
guage abilities. Journal of Speech, Language, and Hearing nemes found in the adult input language, yet it makes no
Research, 42, 1125–1135. reference to any real-world objects or actions. However,
Wingate, M. (1988). The structure of stuttering: A psycholin-
the parents of these hearing infants treat their infants’
guistic perspective. New York: Springer-Verlag.
Wingate, M. (2001). SLD is not stuttering. Journal of Speech, babbles as if they were meaningful, engaging in recipro-
Language, and Hearing Research, 44, 381–383. cal ‘‘dialogues’’ and commenting on the infants’ utter-
Yairi, E. (1983). The onset of stuttering in two- and three-year- ances. For deaf infants, early vocalization and marginal
old children. Journal of Speech and Hearing Disorders, 48, babbling is not delayed, and they produce a similar
171–177. range of speech-like sounds, which suggests that this
Yairi, E., Watkins, R. V., Ambrose, N. G., and Paden, E. stage is biologically driven. But canonical babbling is
(2001). What is stuttering? Journal of Speech, Language, and considerably delayed and appears to be deviant in both
Hearing Research, 44, 384–390. vocal quantity and quality (Mogford, 1993). So, hearing
Yaruss, S. (1999). Utterance length, syntactic complexity, parents of deaf infants do not respond to their infant’s
and childhood stuttering. Journal of Speech, Language, and
vocalizations as proto-communications, being more
likely to ignore or talk through them. Thus, most deaf
infants are already at a disadvantage toward the end of
the first year of life, both in their ability to extract the
Language of the Deaf: Acquisition of phonemes of their spoken language from the input and
English reproduce them, and in the initial structuring of conver-
sational dialogues in parent-infant interaction (Paul,
2001).
E¤ective English language skills are essential for the
education of deaf children and for the integration of deaf
Phonology
individuals into the wider, hearing society. However, the
average deaf student’s English language competence Speech intelligibility is a persistent problem for deaf
over the course of his or her schooling is limited. Al- children with moderate to profound hearing loss
though periodic reviews of nationwide achievement test- (>60 dB loss in the better ear), particularly if the major
ing reveal some improvement in the average reading hearing loss is in the higher frequencies of sound
levels of deaf students over the past 30 years, the increase (>1500 Hz). Studies report from 50% to 80% of moder-
is small, and in English literacy skills most deaf students ate to profoundly deaf children having either ‘‘very hard
fall farther and farther behind their hearing peers over to understand’’ or ‘‘totally unintelligible’’ speech (Car-
the course of their schooling. The average reading com- ney, 1986). Omission or distortion of consonant sounds
prehension scores for deaf students in the United States is common and has a major impact on intelligibility
rises from the second-grade level at age 9 to around the (Osberger and McGarr, 1982). Many of the speech
fourth-grade level at age 17 (de Villiers, 1992). Among errors of deaf children reflect the phonological processes
white deaf high school graduates, only about 15% read and constraints operating in normal speech development
above the sixth-grade level, and the percentage is only in young hearing children (e.g., consonant cluster reduc-
about 5% for those who are from African-American or tion, fronting of place of articulation, voicing errors, or
Hispanic backgrounds (Allen, 1994). deletion of final consonants), but they persist in the
The hearing child and the typical deaf child are in speech of deaf children well into later childhood (Mur-
very di¤erent stages of language development when they phy and Dodd, 1995). Control of voice quality and in-
reach the point of formal schooling and reading instruc- tonation is also di‰cult for children with substantial
tion. Normally hearing 5- or 6-year-old children have a hearing loss. Consequently, instances of consistently
speaking vocabulary of several thousand words and have high or low pitch, nasalized speech, and rhythmical
mastered most of the complex syntax of English. Thus, errors such as unusual breath groups and either mis-
for hearing children, the acquisition of reading is pri- placed syllabic stress or added syllables are common and
marily learning to map printed English onto an existing produce a characteristic set of deaf ‘‘accents’’ in spoken
knowledge of spoken English (Adams, 1990). English (Osberger and McGarr, 1982; Paterson, 1994;
For the deaf child, the situation is quite di¤erent. Murphy and Dodd, 1995).
Impairment of hearing has a negative e¤ect on the ac-
quisition of a spoken language from early in the child’s
life, and all of the major milestones in normal language
Vocabulary
acquisition are considerably delayed. For example, As measured by parental reports on the MacArthur
around 6 months of age, normal-hearing infants begin Communicative Development Inventory (CDI), the av-
producing the first approximations to consonant-vowel erage hearing child progresses rapidly from an expressive
combinations, or syllables, over quite a wide range of vocabulary of approximately 100 words at 18 months to
speech-like sounds (so-called ‘‘marginal’’ babbling). A 300 words at 2 years and 550 words at 3 years (Fenson
few months later ‘‘canonical’’ babbling emerges, in et al., 1993). In contrast, the average deaf toddler with
which the child produces a more restricted set of pho- hearing parents produces about 30 words at 2 years and
netic units in repetitive, rhythmic syllabic organization, 200 words at 3 years, whether those words are spoken or
Language of the Deaf: Acquisition of English 337
signed (Mayne, Yoshinago-Itano, Sedey, and Carey, language at the same time. The better the child’s English
2000). Deaf children whose hearing loss was identified language skills are before formal schooling, the easier
by 6 months of age and who have above-average cogni- the task of reading acquisition becomes.
tive skills fare best, showing a vocabulary spurt in the
third year of life similar to that found in hearing chil- New Developments
dren, though about 6 months later (Mayne et al., 2000).
However, even these successful deaf children fall below This pattern of delay and di‰culty with language and
the 25th percentile for hearing children on norms for literacy in deaf youngsters is mitigated by two tech-
the CDI at 30–36 months of age (see also Mayne, nological advances that are transforming language
Yoshinago-Itano, and Sedey, 2000). Thus, the average 5- intervention with deaf toddlers. The first is the imple-
to 6-year-old deaf child is some 2 years behind hearing mentation of universal newborn hearing screening pro-
peers in vocabulary size when the child begins the task of grams, with the potential for identifying almost all
learning to read. infants with a significant degree of hearing loss within
the first few months of life. By 2000, some two dozen
Syntax states in the United States had mandated universal
screening of hearing for infants prior to hospital dis-
The English of deaf children also exhibits characteristic charge. Researchers in the state of Colorado demon-
syntactic problems. For example, tense markers on the strated that identification of hearing loss prior to 6
verb and many other grammatical morphemes and months of age followed by e¤ective ongoing intervention
function words (e.g., articles ‘‘a’’ and ‘‘the,’’ or copula services and parent training programs was a major con-
and auxiliary verbs) are inconsistently provided or miss- tributor to successful language outcomes for deaf chil-
ing from most deaf children’s spoken or signed English dren (Yoshinaga-Itano and Appuzzo, 1998a, 1998b;
when they reach the early grades of formal schooling. Yoshinago-Itano, Sedey, et al., 1998).
These aspects of English grammar continue to provide The second important advance is the development
great di‰culty for deaf children during the school years of multichannel cochlear implant technology. Electrodes
(Quigley and King, 1980; de Villiers, 1988). In the terms implanted into the cochlear now bypass the hair cells
of generative grammar (Radford, 1990; Leonard, 1995), and stimulate the auditory nerve directly. An external
the functional categories Inflectional Phrase (IP) and receiver and processor analyzes the incoming speech
Determiner Phrase (DP), which host the marking of according to a predetermined strategy and transforms
tense for verbs and specification for nouns, may be the complex pattern of sound frequencies and ampli-
incompletely specified in the grammar of deaf children. tudes into a corresponding pattern of electrical stimula-
If so, one might expect problems also with a final function across a number of electrodes in the cochlear. Over
tional category that structures the embedding of clauses, the past 15 years, major developments in the complexity
the Complementizer Phrase (CP) (de Villiers, de Villiers, of the analysis that can be carried out in the externally
and Hoban, 1994). Hearing children at the age of 5 or 6 worn speech processor (increasingly miniaturized by
have mastered a variety of multiclause embedded sen- developing computer technology) first led to remark-
tence forms—especially temporal and causal adverbial able improvements in speech perception in postlingually
clauses, complement structures, and relative clauses— deafened adults with implants. Now younger and youn-
that are essential for creating cohesion in narrative dis- ger profoundly deaf children who lost their hearing at
course and other extended conversation (de Villiers, or soon after birth and who would not benefit much
1988, 1991; Engen, 1994). However, in deaf children, from conventional hearing aids are receiving cochlear
clauses and sentences tend to be strung together with implants, some as young as 18 months of age (Niparko
‘‘and’’ or ‘‘then,’’ and complex embedded structures are et al., 2000). Children who receive an implant early in
usually missing or malformed in their English produc- life, followed by intensive auditory and speech training,
tion and also are poorly comprehended (Engen and can achieve speech intelligibility and conversational
Engen, 1983; de Villiers, de Villiers, and Hoban, 1994; fluency that exceed the levels typically observed in
Engen, 1994; Berent, 1996). Thus the spoken, signed, or profoundly deaf children who use hearing aids (Fryauf-
written English narratives of deaf students can often Bertschy et al., 1997; Spencer, Tye-Murray, and Tom-
be characterized as a list of sentences each describing blin, 1998; Tomblin et al., 1999; Svirsky et al., 2000).
an event, but with little cohesion or coherence because However, there remains substantial individual variation
the characters and events are not linguistically linked in the degree of success of these implants with prelin-
together by referential, causal, and temporal cohesion gually deaf toddlers (Pisoni et al., 2000). Although
markers (de Villiers, 1991; Engen, 1994). some children exhibit dramatic improvement in their
In summary, when deaf children reach the point speech perception and production, and may even acquire
of acquiring English literacy skills, they usually have a grammatical and vocabulary skills at a rate matching
severely limited vocabulary and lack knowledge of the that of hearing children, others show only limited spoken
complex syntax of English that is critical for combining language gains even after 3 or 4 years of implant use.
sentences together into cohesive text. Indeed, much of The sources of this variability are still rather poorly
the deaf child’s English language learning comes from understood but may include such factors as age at im-
printed English, and the task of learning to read becomes plantation, preimplant residual hearing, type of speech
one of both cracking the print code and learning the processor used, and success at tuning or ‘‘mapping’’ the
implant, as well as the educational and family environ- opmental Inventories. San Diego, CA: Singular Publishing
ment (Fryauf-Bertschy et al., 1997; Niparko et al., 2000; Group.
Osberger and Fisher, 2000; Pisoni et al., 2000). Fryauf-Bertschy, H., Tyler, R., Kelsay, D., Gantz, B., and
Woodworth, G. (1997). Cochlear implant use by prelin-
gually deafened children: The influences of age at implant
Conclusion and length of device use. Journal of Speech, Language, and
In summary, most children who are born with a moder- Hearing Research, 40, 183–199.
ate to profound hearing loss or who become deaf in the Leonard, L. (1995). Functional categories in the grammars
of children with specific language impairment. Journal of
first few years of life su¤er a pervasive disruption in
their acquisition of all aspects of spoken English (and Mayne, A., Yoshinago-Itano, C., and Sedey, A. (2000). Re-
its signed forms). This leads to major di‰culties in the ceptive vocabulary development of infants and toddlers who
children’s acquisition of literacy skills. Early identifica- are deaf and hard of hearing. Volta Review, 100, 29–52.
tion of hearing loss and the immediate implementation Mayne, A., Yoshinago-Itano, C., Sedey, A., and Carey, A.
of intervention strategies involving the best available (2000). Expressive vocabulary development of infants and
technologies for amplification, and parental training in toddlers who are deaf and hard of hearing. Volta Review,
early language intervention in a rich interactional con- 100, 1–28.
text, seem to o¤er the best outcome for these children in Mogford, K. (1993). Oral language acquisition in the pre-
their acquisition of English. linguistically deaf. In D. Bishop and K. Mogford (Eds.),
Language development in exceptional circumstances. Hills-
—Peter A. de Villiers dale, NJ: Erlbaum.
Murphy, J., and Dodd, B. (1995). Hearing impairment. In B.
References Dodd (Ed.), Di¤erential diagnosis and treatment of children
with speech disorder. San Diego, CA: Singular Publishing
Adams, M. (1990). Beginning to read: Thinking and learning Group.
about print. Cambridge, MA: MIT Press. Niparko, J., Kirk, K., Mellon, N., Robbins, L., Tucci, D., and
Allen, T. (1994). Who are the deaf and hard-of-hearing students Wilson, B. (Eds.). (2000). Cochlear implants: Principles and
leaving high school and entering postsecondary education? practices. Philadelphia: Lippincott, Williams, and Wilkins.
Washington, DC: Gallaudet University. Osberger, M., and Fisher, L. (2000). Preoperative predictors
Berent, G. (1996). The acquisition of English syntax by deaf of postoperative implant performance in children. Annals
learners. In W. Ritchie and T. Bhatia (Eds.), Handbook of of Otology, Rhinology and Laryngology, 109(Suppl. 185),
second language acquisition. San Diego, CA: Academic 44–46.
Press. Osberger, M., and McGarr, N. (1982). Speech production
Carney, A. (1986). Understanding speech intelligibility in the characteristics of the hearing-impaired. In N. Lass (Ed.),
hearing impaired. In K. Butler (Ed.), Hearing impairment Speech and language: Advances in basic research and prac-
and language disorders: Assessment and intervention. Gai- tice. New York: Academic Press.
thersburg, MD: Aspen. Paterson, M. (1994). Articulation and phonological disorders
de Villiers, J. G., de Villiers, P. A., and Hoban, E. (1994). The in hearing-impaired school aged children with severe and
central problem of functional categories in the English profound sensorineural hearing losses. In J. Bernthal and N.
syntax of deaf children. In H. Tager-Flusberg (Ed.), Con- Bankson (Eds.), Child phonology: Characteristics, assess-
straints on language acquisition: Studies of atypical children. ments and intervention with special populations. New York:
Hillsdale, NJ: Erlbaum. Thieme.
de Villiers, P. A. (1988). Assessing English syntax in hearing- Paul, P. (2001). Language and deafness (3rd ed.). San Diego,
impaired children: Elicited production in pragmatically CA: Singular Publishing Group.
motivated situations. In R. Kretschmer and L. Kretschmer Pisoni, D., Cleary, M., Geers, A., and Tobey, E. (2000). Indi-
(Eds.), Communication assessment of hearing-impaired chil- vidual di¤erences in e¤ectiveness of cochlear implants in
dren: From conversation to classroom. Journal of the Acad- children who are prelingually deaf: New process measures
emy of Rehabilitative Audiology: Monograph Supplement, of performance. Volta Review, 100, 111–164.
21, 41–71. Quigley, S., and King, C. (1981). An invited article: Syntactic
de Villiers, P. A. (1991). English literacy development in deaf performance of hearing-impaired and normal individuals.
children: Directions for research and intervention. In J. Applied Psycholinguistics, 1, 329–356.
Miller (Ed.), Research on child language disorders: A decade Radford, A. (1990). Syntactic theory and the acquisition of En-
of progress. Austin, TX: Pro-Ed. glish syntax. Oxford, U.K.: Blackwell.
de Villiers, P. A. (1992). Educational implications of deafness: Spencer, L., Tye-Murray, N., and Tomblin, J. (1998). The
Language and literacy. In R. Eavey and J. Klein (Eds.), production of English inflectional morphology, speech pro-
Hearing loss in childhood: A primer. Columbus, OH: Ross duction and listening performance in children with cochlear
Laboratories. implants. Ear and Hearing, 19, 310–318.
Engen, E. (1994). English language acquisition in deaf children Svirsky, M., Robbins, A., Kirk, K., Pisoni, D., and Miyamoto,
in programs using manually-coded English. In A. Vonen, R. (2000). Language development in profoundly deaf chil-
K. Arnesen, R. Enerstvedt, and A. Nafstad (Eds.), Bilin- dren with cochlear implants. Psychological Science, 11,
gualism and literacy: Proceedings of an international work- 153–158.
shop. Oslo, Norway: Skadalan Publications. Tomblin, J. B., Spencer, L., Flock, S., Tyler, R., and Gantz, B.
Engen, E., and Engen, T. (1983). Rhode Island Test of Lan- (1999). A comparison of language achievement in children
guage Structure. Austin, TX: Pro-Ed. with cochlear implants and children using hearing aids.
Fenson, L., Dale, P., Reznick, D., Thal, E., Bates, E., Har- Journal of Speech, Language, and Hearing Research, 42,
tung, J., et al. (1993). MacArthur Communication Devel- 497–511.
Language of the Deaf: Sign Language 339
Yoshinaga-Itano, C., and Appuzzo, M. (1998a). Identification dren with deaf parents who are exposed to sign language
of hearing loss after age 18 months is not early enough. from birth. But some deaf children with hearing parents
American Annals of the Deaf, 143, 380–387. are also exposed to a natural sign language through
Yoshinaga-Itano, C., and Appuzzo, M. (1998b). The develop- contact with native-signing deaf children and adults in
ment of deaf and hard-of-hearing children identified early
educational settings. Thus, deaf children may be speech-
through the high risk registry. American Annals of the Deaf,
143, 416–424. delayed, but they are not necessarily language-delayed
Yoshinago-Itano, C., Sedey, A., Coulter, D., and Mehl, A. or disordered in any sense.
(1998). Language of early- and late-identified children with
hearing loss. Pediatrics, 102, 1161–1171. Characteristics of American Sign Language
and Other Natural Sign Languages
Further Readings
This entry focuses on American Sign Language (ASL)
Annals of Otology, Rhinology, and Laryngology. (2000). Sup- because it is the natural sign language used in the United
plement 185, 109(12). [Special issue] States and it has been the most extensively studied.
Bench, R. (1992). Communication skills in hearing impaired
children. London, U.K.: Whurr. However, many of the issues raised here about the
Geers, A., and Moog, J. (1989). Factors predictive of the unique properties of natural sign languages and the nor-
development of literacy in profoundly hearing-impaired mal pattern of acquisition of those languages by deaf
adolescents. Volta Review, 91, 69–86. children exposed to complete and early input apply to all
Geers, A., and Moog, J. (Eds.). (1994). E¤ectiveness of coch- natural sign languages.
lear implants and tactile aids for deaf children. Volta Review, ASL and other natural sign languages are formally
96(5). [Special issue] structured at di¤erent levels and follow the same uni-
Jeanes, R., Nienhuys, T., and Rickards, F. (2000). The prag- versal constraints and organizational principles of all
matic skills of profoundly deaf children. Journal of Deaf natural languages. Like the distinctive features of spoken
Studies and Deaf Education, 5, 237–247.
phonology, a limited set of handshapes, movements, and
Lederberg, A., and Spencer, P. (2001). Vocabulary develop-
ment of deaf and hard of hearing children. In M. D. Clark, places of articulation on the face and body distinguish
M. Marschark, and M. Karchmer (Eds.), Context, cognition, di¤erent lexical signs. For example, in ASL the signs for
and deafness. Washington, DC: Gallaudet University Press. SUMMER, UGLY, and DRY are produced with the
Levitt, H., McGarr, N., and Ge¤ner, D. (1987). Development same handshape and movement, but in di¤erent loca-
of language and communication skills in hearing-impaired tions on the face (Bellugi et al., 1993). Just as in spoken
children (ASHA Monograph No. 26). Rockville, MD: languages, the syntactic rules of ASL operate on under-
American Speech-Hearing-Language Association. lying abstract categories defined by their linguistic func-
Novelli-Olmstead, T., and Ling, D. (1984). Speech production tion, such as subjects and objects, or noun phrases and
and speech discrimination by hearing-impaired children. verb phrases. Furthermore, grammatical processes are
Volta Review, 86, 72–80.
recursive, embedding one phrase or clause within an-
Osberger, M. (Ed.). (1986). Language and learning skills of
hearing-impaired students (ASHA Monograph No. 23). other (Liddell, 1980).
Rockville, MD: American Speech-Language-Hearing On the other hand, the visual-spatial modality of
Association. natural sign languages leads to several distinctive prop-
Schirmer, B. (1994). Language and literacy development in erties. Spoken languages are mostly sequential, in that
children who are deaf. New York: Maxwell Macmillan the order of speech elements determines meaning. For
International. example, temporal or adverbial modulations in meaning
Stoker, R., and Ling, D. (Eds.). (1992). Speech production in are expressed in spoken English by inflectional su‰xes
hearing-impaired children and youth: Theory and practice. and prefixes that are added to the verb root. In contrast,
Volta Review, 94(5). [Special issue] multiple features of meaning are communicated simul-
Wood, D., Wood, H., Gri‰ths, A., and Howarth, I. (1986).
taneously in sign languages: the place, direction, and
Teaching and talking with deaf children. London, U.K.:
Wiley. manner in which signs are produced frequently add to
Yoshinago-Itano, C., and Sedey, A. (Eds.). (2000). Language, or modulate the meaning of a sign. For example, ASL
speech, and social-emotional development of children who has evolved a system of simultaneous inflectional mor-
are deaf or hard of hearing: The early years. Volta Review, phology on the verb that indicates person, number, dis-
100(5). [Special issue] tributional aspect, and such temporal aspects as the
repetition, habituality, and duration of the action. The
single sign GIVE, for example, can be inflected to com-
Language of the Deaf: Sign Language municate the meanings ‘‘give to me,’’ ‘‘give regularly,’’
‘‘give to them,’’ ‘‘give to a number of people at di¤erent
times,’’ ‘‘give over time,’’ ‘‘give to each,’’ and ‘‘give to
Hearing loss limits deaf children’s access to spoken lan- each over time’’ (Bellugi et al., 1993).
guages, but deaf communities around the world acquire Second, sign languages use space as a grammatical
natural sign languages that create complete communica- and semantic device. For example, in ASL the noun re-
tion systems with the same subtlety and level of syntactic ferring to a particular person or object can be assigned
and semantic complexity as any spoken language. At the (or indexed ) to a location in space, typically to one or
core of these communities are the 8%–10% of deaf chil- other side of the signer. Referring back to that place
Figure 1. (From Bellugi, U., et al. [1993]. The acquisition of syntax and space in young deaf signers. In D. Bishop and K. Mogford
[Eds.], Language development in exceptional circumstances. Hillsdale, NJ: Erlbaum. Reproduced with permission.)
in space by pointing to it then acts as an anaphoric space, but it employs di¤erent pronoun usage and facial
pronoun. Similarly, for verbs such as GO, GIVE, markers to indicate that the actor’s perspective is being
INFORM, or TEACH that involve directionality or taken (Emmory and Reilly, 1995).
movement in their meaning, the starting and end points Finally, like some spoken languages, ASL incorpo-
of the sign and its direction of movement between points rates classifiers, linguistic markers that identify such fea-
in space are used as an agreement marker on the verb to tures as the size, shape, animacy, and function of objects.
indicate the subject and recipient of the action (Wilbur, In ASL di¤erent handshapes encode these properties of
1987; Bellugi et al., 1993). the objects and are incorporated into movement verbs
Third, ASL makes extensive use of simultaneous (Wilbur, 1987; Schick, 1990). The most iconic of the
nonmanual facial expressions and body movements as classifier handshapes are those that reflect the size and
adverbial, grammatical, and semantic devices. Some of shape of the object referred to. So, a single handshape
the facial expressions accompanying signed sentences depicts all medium-sized cylindrical objects, such as a
seem to be expressions of intensity or attitude, such as cup, a small tube, and a vase. Other classifiers are more
pursing the lips or pu‰ng out the cheeks, but others abstract, representing classes of objects that do not re-
have obligatory syntactic roles. Raising the eyebrows semble each other (or the classifier sign) in size or shape.
and tilting the head forward slightly changes a declara- Thus, one classifier is used to represent all vehicles,
tive sentence into a yes/no question. Topicalized clauses including boats and bicycles. Some classifier signs can
such as relative clauses specifying information about a serve as pronouns in sentences (Humphries, Padden, and
referent are marked by raised eyebrows and the head O’Rourke, 1980).
tilted back (Liddell, 1980). Finally, ASL has several
ways to negate an utterance, and a nonmanual marker— Acquisition of ASL by Native-Signing Children
a headshake with the eyebrows squeezed together—is
used with or without the negative signs for NO and Deaf children exposed at an early age to a su‰ciently
NOT to negate a clause (Humphries, Padden, and rich input acquire a full natural sign language as e¤ort-
O’Rourke, 1980; Wilbur, 1987). lessly and rapidly as hearing children acquire their native
In reporting speech or action, the person whose per- spoken language. Deaf babies ‘‘babble’’ in sign at about
spective is to be taken is assigned to a location in space the same age as their hearing peers babble in speech,
(Emmory and Reilly, 1995). Then the signer turns his repeating the handshape or movement components of
body as if signing from the perspective of that location signs in a rhythmic fashion (Pettito and Marentette,
and signs what the person did or said. This ‘‘role shift’’ 1991). Just as in canonical babbling the child’s phonetic
is analogous to direct speech in spoken languages. It repertoire comes to be restricted to that of the child’s
is accompanied by a break in eye gaze away from the native language, so too the sign-babbling deaf child
conversational partner and the use of ‘‘first-person’’ shifts to incorporating only the restricted set of hand-
pronouns in the role of the character involved. Report- shapes and movements found in the input sign language.
ing action also uses a role shift of the signer’s body in Notably, the phonetic units found in ‘‘canonical’’ man-
ual babbling are the same ones later used in the first root forms of these verbs and use more fixed word orders
meaningful signs (Pettito, 2000). in their early sentences to express grammatical and se-
Although there is considerable variation among chil- mantic relationships, typically the most frequent sign
dren, the first signs with a clear reference may emerge a order seen in the adult input (Newport and Meier,
month or two earlier in deaf children than the first spo- 1985). The use of agreement morphology on these verbs
ken words of hearing children. Motor control over the emerges over the period between age 2 and 5 years, being
hands and arms su‰cient to produce recognizable signs mastered first for referents that are present in the dis-
develops a little ahead of control over the vocal articu- course context and only later for absent referents
lators (Bonvillian, 1999). Early vocabularies in sign indexed in space (Newport and Meier, 1985). The pat-
or speech refer to the same categories of objects and tern of development and the errors are predicted not by
actions: the significant people, animals, objects, and iconicity, the fact that the path of the action is clearly
actions in the common environment of most toddlers, traced in space, but by a linguistic model of morpholog-
hearing or deaf. Some signs in ASL are iconic in that ical complexity observed in spoken languages (Newport
they resemble the referent. For example, the sign for and Meier, 1985; Slobin, 1985). Just as speaking children
CAT is made by stroking the side of the upper lip to in- overgeneralize inflectional rules like the regular -ed past
dicate whiskers. However, while iconicity may facilitate tense (e.g., ‘‘holded’’ for ‘‘held’’) or use noncausative
the comprehension of some signs, semantic domain and verbs in a causative sense (e.g., ‘‘He falled me down’’),
phonological complexity seem to be stronger determi- signing deaf children overgeneralize spatial agreement
nants of early sign productions (Bonvillian, 1999). marking to verb signs in ASL that cannot be used in that
The phonological properties of early signs are a¤ected way. So, verbs like SAY or LIKE, which cannot be
by motor control and perceptual salience, as well as by directionally marked, are sometimes extended toward
linguistic constraints (Conlin et al., 2000; Marentette the object, and verbs like DRINK and EAT may be
and Mayberry, 2000). For example, sign location seems signed as if coming from an indexed referent in space,
to be the most accurately produced feature of early signs not in their required location on the signer (Bellugi et al.,
because the place of articulation is perceptually more 1993).
salient than the handshape and manner of movement Classifiers first emerge at around age 3 but are mas-
and requires less fine motor control. Signing children tered over a long period of time, with some forms still
produce a pattern of regular substitutions of phonetic giving children trouble at age 6 or 7. Size and shape
elements in their early signs, just like the phonological classifier handshapes resemble their referents more
substitutions in hearing toddlers’ early spoken words closely and are acquired first (Schick, 1990). Mastery
(Bellugi et al., 1993). of the more abstract classifiers depends not only on
By around 2 years of age, children begin to master the mastery of the syntax and semantics of ASL, but also
pronoun system, in ASL a system of pointing gestures to on the child’s conceptual development in classifying such
the self (‘‘me’’), to one’s conversational partner (‘‘you’’), objects as cars, boats, and bicycles all together as a
or to indexed referential spaces (‘‘he,’’ ‘‘she,’’ ‘‘it’’). De- functional class.
spite the iconic transparency of first- and second-person Despite our natural attention to a¤ective markers
pronouns, signing children follow the same develop- in facial expressions, nonmanual syntactic and semantic
mental timeline and exhibit the same problems as markers can be di‰cult for both first and second lan-
speaking children do in learning pronouns that shift ref- guage learners of ASL to acquire (Reilly, 2000). The
erence, depending on who is the speaker and addressee. marker itself may appear early, but its full linguistic use
They even tend to make the same reversal error in early can take several years to master. Thus the negative
acquisition of ‘‘I/me’’ and ‘‘you,’’ using a point facing headshake is first used by native-signing deaf children in
outward toward their conversational partner when re- the second year to negate signs, but young children ex-
ferring to themselves, and pointing to themselves when perience di‰culty in timing this nonmanual marker to
referring to ‘‘you’’ (Pettito, 1987). More abstract pro- coincide with the correct manual signs, and they may
nouns that involve pointing to an indexed referent ungrammatically extend the headshake across more than
assigned to a location in signing space are the last one clause or sentence (Anderson and Reilly, 1997).
acquired, around age 5 (Ho¤meister and Wilbur, 1980). Thus the same developmental processes and con-
In two- and three-word signed utterances, semantic straints apply to the natural acquisition of ASL and
relationships between sentence elements emerge in a re- other sign languages as to the normal development of
liable order of acquisition that seems to be determined any spoken language. The pattern of acquisition is pri-
by the conceptual development of the child: reference to marily dictated by the linguistic and cognitive complex-
the existence or disappearance of objects, then action ity of forms, not by their iconicity in the visual mode of
relationships, and finally state relationships (properties, the language.
possession, or location of objects), just as in early spoken
language acquisition (Brown, 1973; Newport and Ash- Natural Sign Language Versus Artificial
brook, 1977). Signed Versions of Spoken Language
Although much of the syntax of ASL is conveyed
through spatial morphology rather than word order, In several countries, signed versions of the native spoken
young signing children begin by producing uninflected language have been created by educators. Unlike natural
sign languages, they are typically signed simultaneously together from isolated villages into a centralized school
with the spoken language as a form of sign-supported for the deaf in Nicaragua in the early 1980s. Motivated
speech. In the United States there are several versions of by the pressure to communicate among themselves, the
manually coded English (MCE). The most widely used deaf students evolved a more and more complex sign
of these are Signed English, Signed Exact English, and language, with each generation of students inheriting a
Seeing Essential English. These all use lexical signs from more complex form of the language and then elaborat-
ASL and English word order, but they vary in the degree ing it. Several researchers have identified the emergence
to which created signs encode all of the function words of many of the apparently universal features of natural
or derivational and inflectional morphology of English. sign languages in the evolution of Nicaraguan Sign
The educational value of these MCE systems over Language over the past 20 years, among them compo-
ASL and oral or written English is hotly disputed. They nentiality of lexical signs, simultaneity of meaning ex-
have two primary drawbacks. First, because signs take pression, nonmanual syntactic markers, and the use of
longer to produce than words, signing a sentence in a space for grammatical purposes (Kegl, Senghas, and
linear sequence following that of English takes much Coppola, 1999; Senghas, 2000).
longer than speaking the same sentence in English. The
simultaneous speaker-signer either has to slow down her A Critical Period for Sign Language
speech to an unnatural extent or has to leave out aspects Acquisition
of the signed portion of the message. Thus, deaf children
There seems to be a sensitive or even critical period in
receive either an incomplete and ungrammatical lan-
early childhood in which exposure to a relatively com-
guage input or a simplified one, so that their exposure to
complex English syntax is artificially reduced. plete sign language must take place. Deaf individuals
exposed to ASL for the first time in late childhood or
Second, several universal features of natural sign lan-
adolescence are less proficient ASL users in adulthood
guages that have evolved to allow e¤ective and rapid
than those who learn ASL in the first few years of life,
communication of meaning in a visual-spatial mode are
even though they may have been using ASL as their
not incorporated into MCE systems. These features in-
primary means of communication for decades (Fischer,
clude the use of space as a grammatical and semantic
1998; Mayberry and Eichen, 1991; Newport, 1991).
device, simultaneous morphology, and nonmanual lin-
guistic markers. Indeed, in several respects MCE systems
directly violate these universal principles and so can be
ASL and Finger Spelling
very confusing for deaf children who have been exposed There are some influences on ASL from the surrounding
to ASL (Johnson, Liddell, and Erting, 1989). Deaf chil- dominant English language culture, for example the im-
dren may then naturalize MCE so that it more closely portation of loan words from English that are finger-
conforms to ASL’s use of space and simultaneity (Sup- spelled using the manual alphabet to represent English
pala, 1991). letters. Finger-spelled vocabulary is highly selective, al-
most always nouns and rarely verbs. In everyday con-
versations these finger-spelled words are frequently the
Going Beyond the Input names of people and places, but in educational settings
Many deaf children are exposed to incomplete versions finger spelling is used to represent technical words and
of ASL, from deaf parents who are not native signers or concepts for which there is not a natural sign in ASL.
hearing parents still learning ASL. In this input mor- Padden (1998) has argued that this is similar to the way
phological rules in particular may be inconsistently used. in which most if not all spoken languages import foreign
However, children systematize their sign language to vocabulary words. Indeed, deaf children of deaf parents
make consistent rules out of what are only statistical who cannot yet read or write English begin to produce
regularities in their parents’ signing (Newport, 1999; finger-spelled words as ‘‘signs’’ before they learn to con-
Newport and Aslin, 2000). For example, spatial agree- nect them to English orthography.
ment markers on verbs of motion that were correct only Padden and Ramsey (1998) suggest that finger spell-
about two-thirds of the time in the parental input were ing may interact with ASL skills in providing deaf
used correctly more than 90% of the time by their chil- children with better access to English literacy learning.
dren at age 7. The children create a more regular rule out They find that deaf teachers using ASL in the classroom
of inconsistently used morphology, thus going beyond actually finger-spell more words than hearing teachers.
the input to acquire a more complete form of ASL. Most important, these deaf teachers e¤ectively chain to-
Even with no conventional signed input, deaf children gether the di¤erent expressive modes of communication,
of hearing parents in oral environments create rich ges- switching from finger spelling to printed text to finger
tural systems, although these are not complete languages spelling, or from ASL sign to finger spelling to text, in
(Goldin-Meadow, 2001). Over a longer period of time order to facilitate the connection between text and
and several generations of deaf signers, a more complete meaning and to develop better print decoding skills.
sign language may emerge. This phenomenon is seen in In Padden and Ramsey’s study, the deaf students with
the creation of a new Nicaraguan Sign Language out of better ASL and finger-spelling skills developed better
many home gestural systems by deaf children brought English-reading skills.
ASL and English Literacy Gustason, G. (1990). Signing exact English. In H. Bornstein
(Ed.), Manual communication: Implications for education.
The extent to which the development of phonological Washington, DC: Gallaudet University Press.
decoding from print to spoken phonemes is necessary for Ho¤meister, R., de Villiers, P. A., Engen, E., and Topol,
fluent reading in deaf students is still in dispute. How- D. (1997). English reading achievement and ASL skills in
ever, skill in ASL does not interfere with learning to deaf students. In E. Hughes, M. Hughes, and A. Greenhill
read printed English. Rather, it is a strong independent (Eds.), Proceedings of the 21st Annual Boston University
contributor to reading comprehension levels for deaf Conference on Language Development. Somerville, MA:
Cascadilla Press.
children in educational settings using sign language, even
Ho¤meister, R., and Wilbur, R. (1980). The acquisition of sign
when controlling for having deaf or hearing parents language. In H. Lane and F. Grosjean (Eds.), Recent per-
(Ho¤meister et al., 1997; Strong and Prinz, 1997). Hav- spectives on American Sign Language. Hillsdale, NJ: Erlbaum.
ing a fluent sign language can facilitate learning to read Humphries, T., Padden, C., and O’Rourke, T. (1980). A basic
in several ways: by increasing the children’s comprehen- course in American Sign Language. Silver Springs, MD: T.J.
sion of the instructional process (if the teacher is also Publishers.
fluent in ASL), increasing the number of semantic Johnson, R., Liddell, S., and Erting, C. (1989). Unlocking the
concepts the child understands, developing extended curriculum: Principles for achieving success in deaf education
discourse skills that are critical to early reading, and (Working Paper 89-3). Washington, DC: Gallaudet Re-
fostering metacognitive skills such as communication search Institute, Gallaudet University.
Kegl, J., Senghas, A., and Coppola, M. (1999). Creation
monitoring and planning (Nelson, 1998).
through contact: Sign language emergence and sign lan-
guage change in Nicaragua. In M. DeGra¤ (Ed.), Language
Conclusion creation and language change: Creolization, diachrony, and
The visual-spatial nature of sign languages—the fact development. Cambridge, MA: MIT Press.
that they are articulated with the hands and perceived Liddell, S. (1980). American Sign Language syntax. The
Hague, Netherlands: Mouton.
through the eyes—does not relegate them to the realm Marentette, P., and Mayberry, R. (2000). Principles for an
of pantomime and gesture. Natural sign languages are emerging phonological system: A case study of early ASL
as subtle and complex as any spoken language and are acquisition. In C. Chamberlain, J. Morford, and R. May-
structured according to universal linguistic principles. berry (Eds.), Language acquisition by eye. Mahwah, NJ:
Deaf children exposed to a complete and consistent nat- Erlbaum.
ural sign language early in childhood acquire the lan- Mayberry, R., and Eichen, E. (1991). The long-lasting advan-
guage normally, following the same stages and learning tage of learning sign language in childhood: Another look
processes as are observed in hearing children acquiring at the critical period for language acquisition. Journal of
their native spoken language. Memory and Language, 30, 486–512.
Nelson, K. (1998). Toward a di¤erentiated account of facili-
—Peter A. de Villiers and Jennie Pyers tators of literacy development and ASL in deaf children.
Topics in Language Disorders, 18, 73–88.
References Newport, E. (1991). Contrasting concepts of the critical period
for language. In S. Carey and R. Gelman (Eds.), The epi-
Anderson, D., and Reilly, J. (1997). The puzzle of negation: genesis of mind: Essays on biology and cognition. Hillsdale,
How children move from communicative to grammatical NJ: Erlbaum.
negation in ASL. Applied Psycholinguistics, 18, 411–429. Newport, E. (1999). Reduced input in the study of signed lan-
Bellugi, U., vanHoek, K., Lillo-Martin, D., and O’Grady, L. guages: Contributions to the study of creolization. In M.
(1993). The acquisition of syntax and space in young deaf DeGra¤ (Ed.), Language creation and language change:
signers. In D. Bishop and K. Mogford (Eds.), Language Creolization, synchrony, and development. Cambridge, MA:
development in exceptional circumstances. Hillsdale, NJ: MIT Press.
Erlbaum. Newport, E., and Ashbrook, E. (1977). The emergence of
Bonvillian, J. (1999). Sign language development. In M. Bar- semantic relations in American Sign Language. Papers and
rett (Ed.), The development of language. Hove, U.K.: Psy- Reports on Child Language Development, 13, 16–21.
chology Press. Newport, E., and Aslin, R. (2000). Innately constrained learn-
Brown, R. (1973). A first language. Cambridge, MA: Harvard ing: Blending of old and new approaches to language
University Press. acquisition. In S. C. Howell, S. Fish, and T. Keith-Lucas
Conlin, K., Mirus, G., Mauk, C., and Meier, R. (2000). The (Eds.), Proceedings of the 24th Annual Boston University
acquisition of first signs: Place, handshape and movement. Conference on Language Development. Somerville, MA:
In C. Chamberlain, J. Morford, and R. Mayberry (Eds.), Cascadilla Press.
Language acquisition by eye. Mahwah, NJ: Erlbaum. Newport, E., and Meier, R. (1985). The acquisition of Ameri-
Emmory, K., and Reilly, J. (1995). Sign, gesture, and space. can Sign Language. In D. Slobin (Ed.), The cross-linguistic
Hillsdale, NJ: Erlbaum. study of language acquisition. Vol. 1. The data. Hillsdale,
Fischer, S. (1998). Critical periods for language acquisition: NJ: Erlbaum.
consequences for deaf education. In A. Weisel (Ed.), Issues Padden, C. (1998). Early bilingual lives of deaf children. In I.
unresolved: New perspectives on language and deaf educa- Parasnis (Ed.), Cultural and language diversity and the deaf
tion. Washington, DC: Gallaudet University Press. experience. New York: Cambridge University Press.
Goldin-Meadow, S. (2001). The resilience of language. Phila- Padden, C., and Ramsey, C. (1998). Reading ability in signing
delphia: Psychology Press. deaf children. Topics in Language Disorders, 18, 40–46.
Pettito, L. (1987). On the autonomy of language and gesture: American Sign Language. Journal of Speech and Hearing
Evidence from the acquisition of personal pronouns in Research, 36, 1258–1270.
American Sign Language. Cognition, 27, 1–52. Mayer, C., and Akamatsu, C. (1999). Bilingual-bicultural
Petitto, L. (2000). The acquisition of natural signed languages: models of literacy education for deaf students: Considering
Lessons in the nature of human language and its biological the claims. Journal of Deaf Studies and Deaf Education, 4,
functions. In C. Chamberlain, J. Morford, and R. May- 1–8.
berry (Eds.), Language acquisition by eye. Mahwah, NJ: Meier, R., and Newport, E. (1990). Out of the hands of babes:
Erlbaum. On a possible sign advantage in language acquisition. Lan-
Pettito, L., and Marentette, P. (1991). Babbling in the manual guage, 66, 1–23.
mode: Evidence for the ontogeny of language. Science, 251, Padden, C., and Humphries, T. (1988). Deaf in America:
1493–1496. Voices from a culture. Cambridge, MA: Harvard University
Reilly, J. (2000). Bringing a¤ective expression into the service Press.
of language: Acquiring perspective marking in narratives. Parasnis, I. (Ed.). (1998). Cultural and language diversity and
In K. Emmory and H. Lane (Eds.), The Signs of Language the deaf experience. New York: Cambridge University
revisited: An anthology to honor Ursula Bellugi and Edward Press.
Klima. Mahwah, NJ: Erlbaum. Schein, J., and Stewart, D. (1995). Language in motion:
Schick, B. (1990). The e¤ects of morphosyntactic structure on Exploring the nature of sign. Washington, DC: Gallaudet
the acquisition of classifier predicates in ASL. In C. Lucas University Press.
(Ed.), Sign language research: Theoretical issues. Washing-
ton, DC: Gallaudet University Press.
Senghas, A. (2000). The development of early spatial mor- Linguistic Aspects of Child Language
phology in Nicaraguan Sign Language. In S. C. Howell, S. Impairment—Prosody
Fish, and T. Keith-Lucas (Eds.), Proceedings of the 24th
Annual Boston University Conference on Language Devel-
opment. Somerville, MA: Cascadilla Press. In linguistics, ‘‘prosody’’ refers to sound patterns in lan-
Slobin, D. (1985). Crosslinguistic evidence for the language- guage involving more than a single segment or phoneme.
making capacity. In D. Slobin (Ed.), The crosslinguistic
Since the early 1980s, the study of prosody has bloss-
study of language acquisition. Vol. 2. Theoretical issues.
Hillsdale, NJ: Erlbaum. omed, both in linguistics and in the allied areas of
Strong, M., and Prinz, P. (1997). A study of the relationship computer speech analysis and synthesis, adult sentence
between American Sign Language and English literacy. processing, infant speech perception, and language pro-
Journal of Deaf Studies and Deaf Education, 2, 37–46. duction. The study of prosody has provided insight into
Supalla, S. (1991). Manually coded English: The modality the word and sentence productions of young children
question in signed language development. In P. Siple and with normally developing language and language dis-
S. Fischer (Eds.), Theoretical issues in sign language re- orders. In particular, prosody has proven a useful tool
search. Vol. 2. Psychology. Chicago: University of Chicago for examining children’s ‘‘deviant’’ utterances; that is,
Press. utterances that deviate from what we would expect
Wilbur, R. (1987). American Sign Language: Linguistics and
from an adult speaker with normal speech and language.
applied dimensions (2nd ed.). Columbus, OH: Merrill.
For example, a child’s production of ‘‘blue’’ as ‘‘bu’’ will
be considered a deviant utterance for purposes of this
Further Readings discussion.
Akamatsu, C. T., and Stewart, D. (1998). Constructing simul- The article begins with a brief overview of two aspects
taneous communication: The contributions of natural sign of prosody, syllable shape and meter, that have been the
language. Journal of Deaf Studies and Deaf Education, 3, focus of many studies of child language production. It
302–319. then provides examples of some recent studies that
Chamberlain, C., Morford, J., and Mayberry, R. (Eds.). demonstrate e¤ects of syllable shape and meter on devi-
(2000). Language acquisition by eye. Mahwah, NJ: ant productions of children with normal and disordered
Erlbaum.
language. To foreshadow the general finding across the
Emmorey, K., and Reilly, J. (1998). The development of quo-
tation and reported action: Conveying perspective in ASL. studies presented here, we state that syllable shapes and
In E. Clark (Ed.), Proceedings of the Twenty-Ninth Annual metrical patterns that are frequent across the world’s
Child Language Research Forum. Chicago: Center for the languages or in the language the child is learning are
Study of Language and Information. most resistant to deviations.
Ho¤meister, R. (1990). American Sign Language and educa- Beginning with syllable shape, it has been noted that
tion of the deaf. In H. Bornstein (Ed.), Manual com- all languages of the world have syllables comprising a
munication: Implications for education. Washington, DC: consonant plus a vowel (CV). Only some languages al-
Gallaudet University Press. low additional syllable shapes, such as V, VC, CVC,
Klima, E., and Bellugi, U. (1979). The signs of language. CCVC, and so on. In linguistic terms, the CV syllable
Cambridge, MA: Harvard University Press.
Lane, H., Ho¤meister, R., and Bahan, B. (1996). A journey
shape is said to be ‘‘unmarked.’’ Even when languages
into the DEAF-WORLD. San Diego, CA: Dawn Sign Press. allow syllable shapes other than CV, the shapes are often
Marschark, M. (1993). Psychological development of deaf chil- restricted. For example, Japanese allows only CVC syl-
dren. New York: Oxford University Press. lables ending in /n/.
Mayberry, R. (1993). First-language acquisition after child- Furthermore, in languages like English that allow
hood di¤ers from second-language acquisition: The case of consonant clusters (syllables of the shape CCVC, CVCC,
Linguistic Aspects of Child Language Impairment—Prosody 345
etc.), these clusters generally conform to a sonority ent codas in di¤erent prosodic environments. Zamuner
sequencing principle (e.g., Hooper, 1976). Briefly, so- and Gerken (1998) reported that normally developing 2-
nority may be conceptualized as the openness of the vo- year-olds produced more codas and more coda types on
cal tract for a particular segment. Vowels are the most nonsense words when the coda occurred in a stressed
sonorous; glides are less sonorous, followed by liquids, syllable (either on a monosyllabic item or an item with
nasals, fricatives, and stops. Clements (1990) argues that a weak-strong stress pattern, e.g., /mPbIb/). Zamuner
an ideal syllable structure is one in which a sequence of (2001) discovered that children from the same popula-
segments increases from the onset to the vowel with no tion produced obstruent codas, which are more frequent
or a minimal decline from the vowel to the coda. English in English, sooner than sonorant codas on CVC non-
word-initial consonant clusters such as /pr/ and /kw/, sense words. She also found that the same coda was
which comprise a stop plus liquid or glide, are consistent produced less frequently when it occurred in nonsense
with this principle, as are English word-final clusters names exhibiting less frequent biphones (e.g., CV, VC)
such as /rp/ and /nt/. than more frequent biphones.
The foregoing discussion of syllable shapes concerns With respect to consonant cluster reduction, several
what is allowed in words of a language. It is also im- studies have shown a role for syllable shape, and in par-
portant to note that even languages that allow a variety ticular sonority sequencing, in this phenomenon (e.g.,
of syllable shapes nevertheless have strong statistical Barlow and Dinnsen, 1998; Ohala, 1999). These studies
tendencies toward particular shapes. For example, En- have revealed that children with normal language and
glish CVC words are very likely to end in /t/ and very language disorders were more likely to produce the least
unlikely to end in /dZ/. These statistical tendencies are sonorous consonant of an initial cluster and the least
the subject of a growing interest in researchers studying sonorous consonant of a final cluster. That is, they pro-
prosody and its role in child language production. duced CV sequences that were closer to the ideal syllable
Let us now turn to meter. In many languages, multi- shape suggested by Clements (1990).
syllabic words exhibit a characteristic stress pattern. For Turning now to the role of meter in language pro-
example, the majority of words in English have the pat- duction, several researchers have noted that children are
tern found in ‘‘apple’’ and ‘‘yellow,’’ that is, a stressed or more likely to omit weak syllables from the beginning of
strong syllable followed by an unstressed or weak sylla- words like ‘‘gira¤e’’ and ‘‘banana’’ and, more generally
ble. Stressed syllables are louder, longer, and higher in weak syllables that do not belong to a trochaic foot (e.g.,
pitch than unstressed syllables. The frequency of the Wijnen, Krikhaar, and Den Os, 1994). The bias to pro-
strong-weak word pattern is consistent with the obser- duce trochaic feet has also been observed at the level of
vation that the basic unit of stress in English is a trochaic sentence production, where the determiner ‘‘the’’ is more
(strong-weak) foot, with a foot defined as a grouping of likely to be preserved in a sentence like ‘‘He pats the
a single strong syllable plus adjacent weak syllables. zebra’’ (‘‘pats the’’ forms a trochaic foot) than ‘‘He
Feet not only explain the dominant stress pattern of brushes the bear’’ (the syllabic verb inflection makes the
words in a language, they also help us to understand formation of a trochaic foot containing ‘‘the’’ impossi-
how lexical words like nouns and verbs combine with ble; Gerken, 1996). Thus, the e¤ects of prosody are not
grammatical words like determiners and auxiliary verbs restricted to what have traditionally been considered
in phrases. For example, when we say phrases like phonological deviations but extend to morphosyntactic
‘‘drink of water’’ and ‘‘pick a card,’’ we tend to combine deviations as well. It is interesting to note that not all
the grammatical words (in this case ‘‘of ’’ and ‘‘a’’) with languages show as strong a bias toward trochaic feet as
the preceding strong syllable, even though these words English does. For example, Spanish has many words like
belong syntactically with the following word (in a prep- ‘‘banana,’’ which exhibit a weak-strong-weak pattern.
ositional phrase and noun phrase, respectively). That is, Spanish-learning children have been shown to produce
English speakers tend to create trochaic feet whenever determiners at an earlier age than their English-learning
they can, giving the language a characteristic metrical counterparts, again suggesting a role for prosody in
pattern. children’s morphosyntactic development (Lleó and
The discussion up to this point has revealed that lan- Demuth, 1999).
guages of the world and particular languages are biased At least some of children’s weak syllable omissions
toward specific syllable shapes (e.g., CV) and meters appear to occur during late stages of language produc-
(e.g., trochaic). Beginning with syllable shape, let us now tion rather than during utterance planning, as evidenced
consider how these prosodic biases a¤ect the productions by work by Carter (1999). Normally developing 2-year-
of children with normally developing and disordered olds and older children with language impairment pro-
language. Two of the most frequent syllable shape devi- duced sentences like ‘‘He kissed Cassandra’’ and ‘‘He
ations from a standard target produced by children are kissed Sandy.’’ Note that the former sentence type was
final consonant or coda deletion and consonant cluster frequently produced with the first syllable of the name
reduction. With respect to coda deletion, this phenome- omitted (Cassandra ! Sandra). Acoustic measurements
non has been viewed as one in which the speaker is revealed that, even though the two types of sentences
resorting to the most common syllable shape, CV. contained the same number of overtly produced sylla-
However, recent studies indicate that there are signifi- bles (four), children produced the first sentence type
cant di¤erences in the rate at which children omit di¤er- with a longer duration, suggesting that they reserved a
timing slot for the syllable they eventually omitted. Lleó, C., and Demuth, K. (1999). Prosodic constraints on the
One possible source of weak syllable omissions is a lack emergence of grammatical morphemes: Crosslinguistic evi-
of complete control over the motor sequences involved dence from Germanic and Romance languages. In A.
in producing trochaic vs. weak-strong feet (Go¤man, Greenhill, H. Littlelfield, and C. Tano (Eds.), Proceedings
of the 23rd Annual Boston University Conference on Lan-
1999).
guage Development (pp. 407–418). Somerville, MA: Casca-
Finally, several studies have revealed joint e¤ects of dilla Press.
syllable shape and meter on deviant utterances. In the Ohala, D. (1998). Medial cluster reduction in early child
Zamuner and Gerken study discussed above, children speech. In E. Clark (Ed.), Proceedings of the Twenty-ninth
showed di¤erent rates of coda preservation for strong Annual Child Language Research Forum (pp. 128–135).
and weak syllables. Ohala (1998) found that young chil- Palo Alto, CA: Stanford University Press.
dren with normal language were less likely to reduce Ohala, D. (1999). The influence of sonority on children’s clus-
word-medial consonant clusters in words with a strong- ter reductions. Journal of Communication Disorders, 32,
strong stress pattern. In a study of weak syllable omis- 397–422.
sion in young children with normal language, Kehoe Wijnen, F., Krikhaar, E., and Den Os, E. (1994). The (non)-
realization of unstressed elements in children’s utterances:
and Stoel-Gammon (1997) noted more omissions of the
Evidence for a rhythmic constraint. Journal of Child Lan-
middle syllable of words like ‘‘elephant,’’ which exhibit guage, 21, 59–83.
a strong-weak-weak pattern, if the syllable began with Zamuner, T. (2001). Input-based phonological acquisition:
a sonorant consonant. Carter (1999) found that adults Codas, frequency and universal grammar. Unpublished
with a variety of types of aphasia were more likely to doctoral dissertation, University of Arizona.
omit word-initial weak syllables with V and VC syllable Zamuner, T., and Gerken, L. A. (1998). Young children’s
shapes than CV shapes. It seems likely that such results production of coda consonants in di¤erent prosodic
would be found in children with normal and disordered environments. In E. Clark (Ed.), Proceedings of the Twenty-
language as well. ninth Annual Child Language Research Forum (pp. 121–
In summary, linguistic studies of canonical syllable 128). Palo Alto, CA: Stanford University Press.
shapes and metrical patterns across languages and with-
in particular languages provide the tools for fine-grained Further Readings
analyses of deviant forms produced by children with Allen, G., and Hawkins, S. (1980). Phonological rhythm: Def-
normal and disordered language. The results of these inition and development. In G. Yeni-Komshian, J. Kava-
analyses feeds a growing consensus that those forms that nagh, and G. Ferguson (Eds.), Child phonology. Vol. I.
are very frequent in languages of the world or in the Production. New York: Academic Press.
child’s target language are generally more robust and Barlow, J., and Gierut, J. (1999). Optimality theory in phono-
less susceptible to deviations from the accepted standard. logical acquisition. Journal of Speech, Language, and Hear-
Further research is needed to reveal the mechanism un- ing Research, 42, 1482–1498.
derlying prosody’s clear e¤ect on language production Bortolini, U., and Leonard, L. (1996). Phonology and gram-
matical morphology in specific language impairment: Ac-
(see also prosodic deficits). counting for individual variation in English and Italian.
—LouAnn Gerken Applied Psycholinguistics, 17, 85–104.
Chiat, S., and Hirson, A. (1987). From conceptual intention to
utterance: A study of impaired language output in a child
References with developmental dysphasia. British Journal of Disorders
of Communication, 22, 37–64.
Barlow, J., and Dinnsen, D. (1998). Asymmetrical cluster Demuth, K. (1996). The prosodic structure of early words. In
reduction in a disordered system. Language Acquisition, 7, J. Morgan and K. Demuth (Eds.), Signal to syntax. Mah-
1–49. wah, NJ: Erlbaum.
Carter, A. (1999). An integrated acoustic and phonological in- Echols, C. (1993). A perceptually-based model of children’s
vestigation of weak syllable omissions. Unpublished doc- earliest productions. Cognition, 46, 245–296.
toral dissertation, University of Arizona. Ellis Weismer, S. (1997). The role of stress in language pro-
Clements, G. (1990). The role of the sonority cycle in core syl- cessing and intervention. Topics in Language Disorders, 17,
labification. In J. Kingston and M. Beckman (Eds.), Papers 41–52.
in laboratory phonology (pp. 283–333). New York: Cam- Fee, E. J. (1997). The prosodic framework for language learn-
bridge University Press. ing. Topics in Language Disorders, 17, 53–62.
Gerken, L. A. (1996). Prosodic structure in young children’s Fikkert, P. (1994). On the acquisition of prosodic structure.
language production. Language, 72, 683–712. Dordrecht: Holland Institute of Generative Linguistics.
Go¤man, L. (1999). Prosodic influences on speech production Gerken, L. A. (1994). A metrical template account of chil-
in children with specific language impairment and speech dren’s weak syllable omissions. Journal of Child Language,
deficits: Kinematic, acoustic, and transcription evidence. 21, 565–584.
Journal of Speech, Language, and Hearing Research, 42, Gerken, L. A., and McGregor, K. (1998). An overview of
1499–1517. prosody and its role in normal and disordered child lan-
Hooper, J. (1976). An introduction to generative phonology. guage. American Journal of Speech Language Pathology, 7,
New York: Academic Press. 38–48.
Kehoe, M., and Stoel-Gammon, C. (1997). Truncation pat- Hammond, M. (1999). The phonology of English: A prosodic
terns in English-speaking children’s word productions. optimality-theoretic approach. London: Oxford University
Journal of Speech and Hearing Research, 40, 526–541. Press.
Melodic Intonation Therapy 347
Hogg, R., and McCully, C. B. (1987). Metrical phonology. buccofacial apraxia is usually observed, as well as right
Cambridge, UK: Cambridge University Press. hemiplegia that is greater in the arm than leg. The
Klein, H. B. (1981). Production strategies for the pronuncia- program therefore seems to be particularly suited for
tion of early polysyllabic lexical items. Journal of Speech patients with Broca’s or mixed nonfluent aphasia with
accompanying apraxia of speech (Tonkovich and Peach,
McGregor, K. (1994). Article use in the spontaneous samples
of children with specific language impairment: The impor- 1989; Square, Martin, and Bose, 2001). These charac-
tance of considering syntactic contexts. Clinical Linguistics teristics also generally exclude patients with Wernicke’s,
and Phonetics, 8, 153–160. transcortical, or global aphasia.
Nespor, M., and Vogel, I. (1986). Prosodic phonology. Dor- The initial computed tomographic profile for good
drecht, Holland: Foris. candidates included a large lesion in Broca’s area
Panagos, J. M., and Prelock, P. A. (1997). Prosodic analysis of extending superiorly to the left premotor and sensori-
child speech. Topics in Language Disorders, 17, 1–10. motor cortex for the face and deep to the periventricular
Selkirk, E. (1984). Phonology and syntax. Cambridge, MA: white matter, putamen, and internal capsule. The lesion
MIT Press. also typically spared Wernicke’s area and the tempo-
Selkirk, E. (1996). The prosodic structure of function words. In
ral isthmus. No lesions of the right hemisphere were
J. Morgan and K. Demuth (Eds.), Signal to syntax. Mah-
wah, NJ: Erlbaum. detected; this evidence was used to support the preser-
Shattuck-Hufnagel, S., and Turk, A. (1996). A prosody tutorial vation of melodic functions in these patients (Naeser and
for investigators of auditory sentence processing. Journal of Helm-Estabrooks, 1985). Naeser (1994) subsequently
Psycholinguistic Research, 25, 193–247. identified two important areas in the subcortical white
matter that appeared to have an important role regard-
ing recovery of spontaneous speech. Lesions of good
responders involved no more than half of the total area,
Melodic Intonation Therapy including the medial subcallosal fasciculus and the mid-
dle one-third of the periventricular white matter. The
extent of lesion in cortical language areas, including
Among the many published approaches for the treat- Broca’s area, could not be used to discriminate among
ment of aphasia, melodic intonation therapy (MIT) is individuals who responded well or poorly to MIT.
one of the few techniques whose clinical e¤ectiveness has Lesions may have involved Wernicke’s area or the sub-
been established by peer review (American Academy of cortical temporal isthmus, but when they did, they
Neurology, 1994). The e¤ectiveness of the program is involved less than half of those areas.
based on the specific guidelines for patient candidacy, its During the beginning stages of an MIT program,
formalized protocol, and a variety of reports testifying to emphasis is placed on the production of syntactically
improved communication competence following MIT. and phonologically simplified phrases and sentences that
After evaluating the available evidence, the American gradually increase in complexity throughout the course
Academy of Neurology considers the program to be of the program. Ideally, language materials are themati-
promising when administered by a qualified speech- cally related and relevant to the patient’s daily needs
language pathologist. and background. A large corpus of materials is recom-
The guiding principles and procedures associated with mended to vary the stimuli from session to session and to
MIT were set forth in the early works of Albert, Sparks, decrease practice e¤ects. It is debatable whether the use
and Helm (1973), Sparks, Helm, and Albert (1974), and of supplementary pictures or written sentences is ap-
Sparks and Holland (1976). More recent descriptions propriate (Helm-Estabrooks and Albert, 1991; Sparks,
of the program can be found in Helm-Estabrooks and 2001). Frequent treatment, perhaps twice daily, is essen-
Albert (1991) and Sparks (2001). Generally, three prin- tial, but when unattainable, family members might be
ciples form the conceptual foundation for MIT. First, in used to assist with the program (Sparks, 2001).
most of the population, the right cerebral hemisphere MIT focuses on three elements represented in the
mediates music and speech prosody. Second, the right spoken prosody of verbal utterances: the melodic line or
hemisphere is preserved in most individuals with apha- variation in pitch in the spoken phrase or sentence, the
sia, and as a result, singing abilities are generally pre- tempo and rhythm of the utterance, and the points of
served even in the most severe cases of aphasia. Third, stress for emphasis. The intoned pattern has a range of
the preserved musical and prosodic capabilities of the only three or four whole notes that is selected from sev-
right hemisphere can be exploited to rehabilitate lan- eral reasonable speech prosody patterns for the target
guage production in patients with aphasia. sentence. Tempo is slowed by syllable lengthening;
The goals of MIT are to facilitate some recovery of phrase accuracy appears to be best when syllable dura-
language production in severely nonfluent speakers with tions approximate 2.0 s per syllable. The e¤ects of
poorly articulated or severely restricted verbal output. this tempo are most pronounced when patients are
Good candidates have poor repetition but at least required to intone utterances independent of the clinician
moderately preserved to essentially normal language (Laughlin, Naeser, and Gordon, 1979). Rhythm and
comprehension. Attempts at self-correction are evident. stress are exaggerated by elevating intoned notes and
They are emotionally stable, if sometimes depressed, and increasing loudness. Clinicians tap out and further rein-
highly motivated to improve their speech. A coexisting force the rhythm and stress of the utterances using
the patient’s hand. The emphasis on slow tempo, precise MIT might have an intact left primary motor area that
rhythm, and distinct stress appears to facilitate the is deprived of input from the damaged left Broca’s
processing of the structure and the articulation of the area. Improved speech production might then result
intoned utterances. from transcallosal input to left hemisphere speech motor
The MIT program consists of four levels. In level I, centers arising from the MIT-activated right hemisphere
the clinician hums a melody pattern within the three- to homologue of Broca’s area. An alternative explanation
four-note range and aids the patient in tapping the involved input from a disconnected intact left Broca’s
rhythm and stress of the stimulus melody to establish the area to an intact left primary motor area via a trans-
process of intoning melody patterns with hand tapping. collosal pathway involving the right hemisphere homo-
Level II requires the patient to tap and repeat the clin- logues to these areas.
ician’s production of the intoned utterance and to re- Belin et al. (1996) used positron emission tomography
spond to a probe question eliciting an intoned repetition to investigate recovery from nonfluent aphasia follow-
of the intoned utterance. Hand tapping is not used ing treatment with MIT. Changes in cerebral blood flow
in response to probe questions. The clinician provides were measured while the participant listened to and re-
assistance by intoning the utterance in unison with the peated simple words, and during repetition of intoned
patient and then fading his participation so that the pa- words. Abnormal activation of right hemisphere struc-
tient subsequently intones the utterance on his own. In tures homotopic to those normally activated in the intact
level III, unison intoning of the utterance is followed left hemisphere was observed during the simple word
by immediate fading of the clinician’s participation. The tasks performed without intoning, while word repeti-
patient then produces the target utterance following an tion with intoning reactivated essential motor language
enforced delay after the clinician presents it. Finally, zones, including Broca’s area and the adjacent left pre-
the patient gives an appropriate intoned response to an frontal cortex. Belin et al. concluded that MIT is more
intoned probe question from the clinician. A backup strongly associated with exaggerated speech prosody
procedure is introduced at this level to provide the than with singing and therefore recruits language-related
patient an opportunity to correct errors. The backups brain areas of the left hemisphere rather than right
consist of repeating the previous step and attempting the hemisphere areas.
failed step again, and as such constitute an ‘‘indirect’’ Boucher et al. (2001) investigated whether the pro-
approach to correcting errors. The goal of level IV is cessing of melodic contours in music applies similarly
normal speech prosody. Latencies for delayed repetition to the processing of speech prosody. According to these
are increased, and the training sentences become more authors, melody is associated with musical tone and
complex. A technique called Sprechgesang (speech-song) rhythm. Tonal elements include pitch, timbre, and chord
is used in the transition to speech prosody. In this tech- and correspond to intonation in speech. Musical rhythm
nique, the constant pitch of the intoned words is replaced refers to the timing distribution of tonal elements and
by the variable pitch of speech while retaining the tem- is comparable to the stress points of speech. Although
po, rhythm, and stress of the intoned sentence. Unison there is support for right hemisphere processing of into-
production of the target sentence in Sprechgesang is fol- nation, Boucher et al. (2001) provide evidence that the
lowed by fading, delayed spoken repetition using normal left hemisphere is involved in the processing of rhythm,
speech prosody, and production using normal prosody and consequently question whether melody-based inter-
in response to a probe question with normal speech ventions such as MIT facilitate speech production
prosody. because of right hemisphere contributions. Following
MIT uses a scoring method where values of 2, 1, or interventions in two speakers with nonfluent aphasia us-
0 can be obtained. Full scores (i.e., 1 for items with ing stimuli emphasizing tone or rhythm in varying con-
no backups, 2 for items with backups) are assigned to ditions, equal or greater success in responding was found
successful responses, while partial scores (i.e., 1) are for conditions emphasizing rhythm than for conditions
assigned to responses that require a backup where avail- emphasizing melodic intoning. Boucher et al. concluded
able. No score is assigned to unsuccessful responses fol- that the right hemisphere explanation for the facilitating
lowing multiple attempts. The average score for three e¤ects of MIT could not be supported strongly.
sessions must be higher than the average score of the
three previous sessions for the participant to remain in —Richard K. Peach
the program. An overall score of 90% or better for five
consecutive sessions is required to advance from one References
level of MIT to the next.
The neurophysiological model o¤ered by the devel- Albert, M., Sparks, R., and Helm, N. (1973). Melodic into-
opers of MIT to account for its e¤ectiveness has been nation therapy for aphasia. Archives of Neurology, 29,
130–131.
controversial since it was first proposed. Berlin (1976) American Academy of Neurology. (1994). Assessment: Me-
stated that the evidence linking the right hemisphere lodic intonation therapy. Neurology, 44, 566–568.
to the interpretation of nonverbal acoustic processes like Belin, P., Van Eeckhout, P., Zilbovicius, M., Remy, P., Fran-
music is insu‰cient to conclude that MIT activates the cois, C., Guillaume, S., et al. (1996). Recovery from non-
right hemisphere in some way to control motor speech fluent aphasia after melodic intonation therapy: A PET
gestures. Instead, he suggested that good candidates for study. Neurology, 47, 1504–1511.
Memory and Processing Capacity 349
Berlin, C. I. (1976). On ‘‘Melodic Intonation Therapy for Yamadori, A., Osumi, Y., Masuhara, G., and Okubo, M.
aphasia’’ by R. W. Sparks and A. L. Holland. Journal of (1977). Preservation of singing in Broca’s aphasia. Journal
Speech and Hearing Disorders, 41, 298–300. of Neurology, Neurosurgery, and Psychiatry, 40, 221–224.
Boucher, V., Garcia, L. J., Fleurant, J., and Paradis, J. (2001).
Variable e‰cacy of rhythm and tone in melody-based
interventions: Implications for the assumption of a right-
hemisphere facilitation in non-fluent aphasia. Aphasiology, Memory and Processing Capacity
15, 131–149.
Helm-Estabrooks, N., and Albert, M. L. (1991). Manual of
aphasia therapy. Austin, TX: Pro-Ed. Research on the role of working memory in language
Laughlin, S. A., Naeser, M. A., and Gordon, W. P. (1979). disorders has stemmed mainly from the phonological
E¤ects of three syllable durations using the melodic intona- loop model (e.g., Baddeley, 1986; Gathercole and Bad-
tion therapy technique. Journal of Speech and Hearing Re- deley, 1993) or the capacity theory of comprehension
search, 22, 311–320. (e.g., Just and Carpenter, 1992). These models di¤er in
Naeser, M. A. (1994). Neuroimaging and recovery of auditory their conception of working memory and in the para-
comprehension and spontaneous speech in aphasia with digms typically used to assess this construct (cf. Mont-
some implications for treatment in severe aphasia. In A.
Kertesz (Ed.), Localization and neuroimaging in neuro- gomery, 2000a); however, a central premise of both
psychology (pp. 245–295). San Diego, CA: Academic frameworks is that there is a limited pool of operational
Press. resources available to perform computations, such that
Naeser, M. A., and Helm-Estabrooks, N. (1985). CT scan le- processing and storage of linguistic information is
sion localization and response to melodic intonation ther- degraded when demands exceed available resources.
apy with nonfluent aphasia cases. Cortex, 21, 203–223. Numerous investigations based on these two approaches
Sparks, R., Helm, N., and Albert, M. (1974). Aphasia rehabil- have demonstrated an association between working
itation resulting from melodic intonation therapy. Cortex, memory capacity and normal language functioning in
10, 303–316. children and adults. In young children, individual di¤er-
Sparks, R. W. (2001). Melodic intonation therapy. In R. Cha-
ences in phonological working memory predict vocabu-
pey (Ed.), Language intervention strategies in aphasia and
related neurogenic communication disorders (4th ed., pp. lary development and are related to di¤erences in word
703–717). Philadelphia: Lippincott Williams and Wilkins. repertoire, utterance length, and grammatical construc-
Sparks, R. W., and Holland, A. L. (1976). Method: Melodic tion use (e.g., Gathercole and Baddelely, 1990b; Adams
intonation therapy for aphasia. Journal of Speech and and Gathercole, 2000). School-age children’s perfor-
Hearing Disorders, 41, 287–297. mance on working memory measures is significantly
Square, P. A., Martin, R. E., and Bose, A. (2001). Nature and correlated with spoken language comprehension as well
treatment of neuromotor speech disorders in aphasia. In R. as with reading recognition and comprehension (e.g.,
Chapey (Ed.), Language intervention strategies in aphasia Gaulin and Campbell, 1994; Swanson, 1996). Working
and related neurogenic communication disorders (4th ed., pp. memory capacity predicts a number of verbal abilities in
847–884). Philadelphia: Lippincott Williams and Wilkins.
adults, including reading comprehension levels, under-
Tonkovich, J. D., and Peach, R. K. (1989). What to treat:
Apraxia of speech, aphasia, or both. In P. A. Square (Ed.), standing of ambiguous passages and syntactically com-
Acquired apraxia of speech in aphasic adults (pp. 115–144). plex sentences, and the ability to make inferences (e.g.,
London: Erlbaum. King and Just, 1991; Carpenter, Miyake, and Just,
1994).
Investigators have examined short-term or working
Further Readings memory abilities in children with varying profiles of
language and cognitive deficits, including children with
Baum, S., and Pell, M. (1999). The neural bases of prosody: Down syndrome, Williams’ syndrome, Landau-Kle¤ner
Insights from lesion studies and neuroimaging. Aphasiology, syndrome, learning disabilities, and specific language
13, 581–608.
Behrens, S. (1985). The perception of stress and lateralization
impairment (SLI). Of special interest are children with
of prosody. Brain and Language, 26, 332–348. SLI, who demonstrate significant language deficits in the
Blumstein, S., and Cooper, W. (1974). Hemispheric processing absence of any clearly identifiable cause such as mental
of intonation contours. Cortex, 10, 146–158. retardation or hearing loss. One theoretical camp views
Keith, R., and Aronson, A. (1975). Singing as a therapy SLI in terms of limited processing capacity. There are
for apraxia of speech and aphasia. Brain and Language, 2, various formulations of limited capacity accounts of
483–488. SLI, including hypotheses about specific deficits in pho-
Pell, M. (1999). Fundamental frequency encoding of linguistic nological working memory and hypotheses regarding
and emotional prosody by right-hemisphere damaged more generalized di‰culties in information processing
speakers. Brain and Language, 69, 161–192. and storage that a¤ect performance across modalities
Robinson, G., and Solomon, D. (1974). Rhythm is processed
by the speech hemisphere. Journal of Experimental Psy-
(cf. Leonard, 1998). Di‰culties discussed here are
chology, 102, 508–511. limited to poor nonword repetition, reduced listening
Van Lancker, D., and Sidtis, J. (1992). The identification of span, and poor serial recall.
a¤ective-prosodic stimuli by left- and right-hemisphere- Children with SLI exhibit deficits in nonword repeti-
damaged subjects: All errors are not created equal. Journal tion, a paradigm that has been used extensively by Bad-
of Speech and Hearing Research, 35, 963–970. deley and colleagues (and others as well) as a measure of
phonological working memory (Gathercole and Badde- scores than was a measure of auditory processing (Tal-
ley, 1990a; Montgomery, 1995; Dollaghan and Camp- lal’s Auditory Repetition Test). Tomblin and colleagues
bell, 1998; Edwards and Lahey, 1998; Ellis Weismer (2002) recently investigated candidate genes associated
et al., 2000; Briscoe, Bishop, and Norbury, 2001). Non- with developmental language disorder, testing for asso-
word repetition has proved to be useful clinically as a ciations between candidate loci in a sample of 476
culturally nonbiased measure for distinguishing between children and their parents. A two-stage approach was
children with and without language disorders. In one used to search for loci associated with language disorder,
of the initial investigations of nonword repetition in as diagnosed by standardized tests of listening and
SLI, Gathercole and Baddeley (1990a) concluded that speaking or by a measure of phonological memory
children with SLI demonstrate significantly poorer pho- (nonword repetition). Preliminary results were suggestive
nological working memory than controls matched on of an association of CFTR (a marker on chromosome 7)
nonverbal cognition or language level (however, see van with both the phonological memory and spoken lan-
der Lely and Howard, 1993; Howard and van der Lely, guage phenotypes.
1995). The findings of Gathercole and Baddeley (1990a) Another paradigm widely employed in research on
were replicated by Montgomery (1995), who similarly the association between language and working memory
interpreted his results as indicating that children with abilities uses a listening/reading span task (e.g., Dane-
SLI have reduced phonological memory capacity. man and Carpenter, 1980). The person is required to
Other studies have sought to determine whether di‰- perform two tasks concurrently (involving processing
culties with nonword repetition reflect cognitive pro- and storage), such as making true/false judgments about
cesses other than working memory deficits (Edwards and sentences and recalling the last word in each sentence
Lahey, 1998; Briscoe, Bishop, and Norbury, 2001). After following the presentation of all sentences in a set. The
a thorough investigation of possible explanations for number of sentences within a set increases throughout
nonword repetition deficits in SLI, Edwards and Lahey the task in order to assess memory span. Ellis Weismer,
(1998) concluded that neither auditory discrimination Evans, and Hesketh (1999) found that children with SLI
nor response processes could account for the di‰culties. evidenced limitations in verbal working memory com-
Instead, they attributed the deficits to problems in the pared to age-matched controls, based on their perfor-
formation or storage of phonological representations in mance on a listening span task developed by Gaulin and
working memory. Children with SLI usually do not dif- Campbell (1994). Findings primarily pointed to quanti-
fer from normal language peers in their ability to repeat tative di¤erences between the groups involving reduced
short, simple nonwords; rather, breakdowns on nonword capacity for the children with SLI; however, there were
repetition tasks typically occur on the most complex some indications of qualitative di¤erences in terms of
stimuli (Ellis Weismer et al., 2000; Briscoe, Bishop, and distinct patterns of word-recall errors and di¤erent pat-
Norbury, 2001). When children with SLI were compared terns of associations between working memory and
with children with mild to moderate hearing loss, both performance on language and nonverbal cognitive mea-
groups showed similar di‰culty with longer nonwords, sures. Montgomery (2000a, 2000b) examined the rela-
but children with SLI also displayed deficits on digit re- tion between working memory and sentence processing
call and were more negatively a¤ected by phonological in children with SLI. Using an adaptation of Daneman
complexity (Briscoe, Bishop, and Norbury, 2001). These and Carpenter’s listening span task, he demonstrated
investigators concluded that auditory perceptual deficits that children with SLI exhibit reduced capacity under
are not su‰cient to explain the range of language and dual-load conditions. Performance on the listening span
literacy di‰culties observed in children with SLI and measure was significantly correlated with performance
suggested that some kind of processing capacity limita- on an o¤-line sentence comprehension task but not with
tion underlay their language deficits. on-line sentence processing. Montgomery concluded that
Several genetic investigations of developmental lan- the slower real-time sentence processing in children with
guage disorder have examined phonological memory SLI was primarily a function of ine‰cient lexical re-
as indexed by nonword repetition. Bishop, North, and trieval operations rather than limitations in working
Donlan (1996) administered a nonword repetition task memory; however, he posited that their di‰culties with
to participants in a study of twins with language im- o¤-line sentence comprehension tasks were related to
pairment. Children with persistent language impair- di‰culties coordinating the requisite processing and
ment as well as those with resolved language impairment storage functions, revealing limitations in functional
exhibited significant deficits in nonword repetition. working memory capacity.
Comparison of nonword repetition performance in Serial memory deficits in children with SLI have been
monozygotic and dizygotic twin pairs revealed a sig- documented by Gillam and colleagues (Gillam, Cowan,
nificant heritability component. Based on these results, and Day, 1995; Gillam, Cowan, and Marler, 1998). The
Bishop et al. suggested that deficits in nonword repeti- initial study employed a su‰x e¤ect procedure in which
tion provide a phenotypic marker of heritable forms of a spoken list to be recalled was followed by a ‘‘su‰x’’
developmental language disorder. Bishop and colleagues (nonword item) that was not to be recalled. The su‰x
(1999) replicated the earlier results and found that non- had a disproportionately negative e¤ect on recency recall
word repetition gave high estimates of group heritability. for the children with SLI when strict serial position cri-
This measure was a better predictor of low language teria for scoring were imposed. The subsequent investi-
Memory and Processing Capacity 351
gation by Gillam et al. sought to determine the nature of Briscoe, J., Bishop, D. V. M., and Norbury, C. F. (2001).
working memory deficiencies in children with SLI, using Phonological processing, language, and literacy: A com-
a modality e¤ect paradigm in which input modality, rate parison of children with mild-to-moderate sensorineural
of input, and response modality were manipulated. To hearing loss and those with specific language impairment.
Journal of Child Psychology and Psychiatry, 42, 329–340.
control for di¤erences in capacity across the groups,
Carpenter, P., Miyake, A., and Just, M. (1994). Working
trials were administered at a level consistent with each memory constraints in comprehension: Evidence from
child’s working memory span. Children with SLI and individual di¤erences, aphasia, and aging. In M. A. Gerns-
controls demonstrated traditional primacy, recency, and bacher (Ed.), Handbook of psycholinguistics (pp. 1075–
modality e¤ects and similar performance when audio- 1122). San Diego, CA: Academic Press.
visual stimuli were paired with spoken responses. How- Daneman, M., and Carpenter, P. (1980). Individual di¤erences
ever, children with SLI exhibited reduced recency e¤ects in working memory and reading. Journal of Verbal Learn-
and poor recall when visually presented items were ing and Verbal Behavior, 19, 450–466.
paired with pointing responses. The investigators con- Dollaghan, C., and Campbell, T. (1998). Nonword repetition
cluded that neither output processes nor auditory tem- and child language impairment. Journal of Speech, Lan-
guage, and Hearing Research, 41, 1136–1146.
poral processing could account for the working memory
Edwards, J., and Lahey, M. (1998). Nonword repetitions of
deficits in children with SLI. They suggested instead children with specific language impairment: Exploration of
that children with SLI have problems retaining or trans- some explanations for their inaccuracies. Applied Psycho-
forming phonological codes, particularly on tasks linguistics, 19, 279–309.
requiring multiple mental operations. They further Ellis Weismer, S., Evans, J., and Hesketh, L. J. (1999). An ex-
speculated that these capacity limitations in working amination of verbal working memory capacity in children
memory may be due to rapid decay of phonological with specific language impairment. Journal of Speech, Lan-
representations or to performance limitations involving guage, and Hearing Research, 42, 1249–1260.
the use of less demanding coding and retrieval strategies. Ellis Weismer, S., Tomblin, J. B., Zhang, X., Buckwalter, P.,
In conclusion, there is considerable evidence that Chynoweth, J. G., and Jones, M. (2000). Nonword repeti-
tion performance in school-age children with and without
children with SLI have limitations in working memory,
language impairment. Journal of Speech, Language, and
yet there are a number of unresolved issues. In light Hearing Research, 43, 865–878.
of the known heterogeneity of the SLI population, it Gathercole, S., and Baddeley, A. (1990a). Phonological mem-
seems unlikely that any single factor can account for the ory deficits in language disordered children: Is there a causal
language di‰culties of all children. Additional research connection? Journal of Memory and Language, 29, 336–360.
is warranted to examine individual variation within Gathercole, S., and Baddeley, A. (1990b). The role of phono-
this population and to explore whether limitations logical memory in vocabulary acquisition: A study of young
in working memory are di¤erentially implicated in vari- children learning new words. British Journal of Psychology,
ous subtypes of SLI. Another important issue pertains 81, 439–454.
to whether deficits in working memory capacity are Gathercole, S., and Baddeley, A. (1993). Working memory and
language processing. Hove, U.K.: Erlbaum.
restricted to processing of verbal material or extend to
Gaulin, C., and Campbell, T. (1994). Procedure for assessing
nonverbal information as well. That is, it is important verbal working memory in normal school-age children:
to determine whether the evidence supports a domain- Some preliminary data. Perceptual and Motor Skills, 79,
specific model or a generalized capacity deficit model. 55–64.
Finally, future studies should determine whether mem- Gillam, R., Cowan, N., and Day, L. (1995). Sequential mem-
ory limitations are actually a causal factor in SLI, an ory in children with and without language impairment.
outgrowth of the language problems, or an independent Journal of Speech and Hearing Research, 38, 393–402.
area of di‰culty for children with language disorder. Gillam, R., Cowan, N., and Marler, J. (1998). Information
processing by school-age children with specific language
—Susan Ellis Weismer impairment: Evidence from a modality e¤ect paradigm.
Journal of Speech, Language, and Hearing Research, 41,
References 913–926.
Howard, D., and van der Lely, H. (1995). Specific language
Adams, A. M., and Gathercole, S. (2000). Limitations in impairment in children is not due to a short-term memory
working memory: Implication for language development. deficit: Response to Gathercole and Baddeley. Journal of
International Journal of Language and Communication Dis- Speech and Hearing Research, 38, 466–472.
order, 35, 95–116. Just, M., and Carpenter, P. (1992). A capacity theory of
Baddeley, A. (1986). Working memory. Oxford, U.K.: Clar- comprehension: Individual di¤erences in working memory.
endon Press. Psychological Review, 99, 122–149.
Bishop, D. V. M., North, T., and Donlan, C. (1996). Nonword King, J., and Just, M. (1991). Individual di¤erences in syntactic
repetition as a behavioral marker for inherited language processing: The role of working memory. Journal of Mem-
impairment: Evidence from a twin study. Journal of Child ory and Language, 30, 580–602.
Psychology and Psychiatry, 36, 1–13. Leonard, L. (1998). Children with specific language impairment.
Bishop, D. V. M., Bishop, S., Bright, P., James, C., Delaney, Cambridge, MA: MIT Press.
T., and Tallal, P. (1999). Di¤erent origin of auditory and Montgomery, J. (1995). Sentence comprehension in children
phonological processing problems in children with language with specific language impairment: The role of phonological
impairment: Evidence from a twin study. Journal of Speech, working memory. Journal of Speech and Hearing Research,
Language, and Hearing Research, 42, 155–168. 38, 177–189.
Montgomery, J. (2000a). Relation of working memory to problems in rehearsal? Journal of Child Psychology and
o¤-line and real-time sentence processing in children with Psychiatry, 41, 223–244.
specific language impairment. Applied Psycholinguistics, 21, Metz-Lutz, M. N., Seegmuller, C., Kleitz, C., de Saint-Martin,
117–148. A., Hirsch, E., and Marescaux, C. (1999). Landau-Kle¤ner
Montgomery, J. (2000b). Verbal working memory and sen- syndrome: A rare childhood epileptic aphasia. Journal of
tence comprehension in children with specific language Neurolinguistics, 12, 167–179.
impairment. Journal of Speech, Language, and Hearing Re- Nation, K., Adams, J., Bowyer-Crane, C., and Snowling, M.
search, 43, 293–308. (1999). Working memory deficits in poor comprehenders
Swanson, H. L. (1996). Individual and age-related di¤erences reflect underlying language impairments. Journal of Experi-
in children’s working memory. Memory and Cognition, 24, mental Child Psychology, 73, 139–158.
70–82. Oakhill, J., Cain, K., and Yuill, N. (1998). Individual di¤er-
Tomblin, J. B., Murray, J. C., Nishimura, C., Zhang, X., Ellis ences in children’s comprehension skill: Toward an inte-
Weismer, S., O’Brien, M., and Palmer, P. (2002). Posi- grated model. In C. Hulme and R. M. Joshi (Eds.), Reading
tive association and sibpair linkage results on children with and spelling: Development and disorders (pp. 343–367).
impairments in spoken language and phonological memory. Mahwah, NJ: Erlbaum.
Unpublished manuscript, Department of Speech Pathology Swanson, H. L. (1999). Reading comprehension and working
and Audiology, University of Iowa, Iowa City. memory in learning-disabled readers: Is the phonological
van der Lely, H., and Howard, D. (1993). Children with spe- loop more important than the executive system? Journal of
cific language impairment: Linguistic impairment or short- Experimental Child Psychology, 72, 1–31.
term memory deficit? Journal of Speech and Hearing Zera, D. A., and Lucian, D. (2001). Self-organization and
Research, 36, 1193–1207. learning disabilities: A theoretical perspective for the inter-
pretation and understanding of dysfunction. Learning Dis-
ability Quarterly, 24, 107–118.
Further Readings
Chapman, R., and Hesketh, L. J. (2000). Behavioral phenotype
of individuals with Down syndrome. Mental Retardation
and Developmental Disabilities Research Reviews, 6, 84–95. Mental Retardation
Cohen, N., Vallance, D., Barwick, M., Im, N., Menna, R.,
Horodezky, N., and Issacson, L. (2000). The interface be-
tween ADHD and language impairment: An examination Mental retardation is characterized by ‘‘significantly
of language, achievement, and cognitive processing. Journal subaverage intellectual functioning, existing concur-
of Child Psychology and Psychiatry, 41, 353–362. rently with related limitations in two or more of the fol-
Cowan, N. (1996). Short-term memory, working memory, and lowing adaptive skills areas: communication, self-care,
their importance in language processing. Topics in Lan- home-living, social skills, community use, self-direction,
guage Disorders, 17, 1–18. health and safety, functional academics, leisure and
de Jong, P. F. (1998). Working memory deficits of reading work’’ (Luckerson et al., 1992, p. 5). Mental retardation
disabled children. Journal of Experimental Child Psychol-
ogy, 70, 75–96.
thus applies to a broad range of children and adults,
Donlan, C., and Masters, J. (2000). Correlates of social devel- from those with mild deficits who function fairly well in
opment in children with communication disorders: The society to those with extremely severe deficits who re-
concurrent predictive value of verbal short-term memory. quire a range of support in order to function. Regardless
International Journal of Language and Communication Dis- of the extent of mental retardation, the likelihood that
orders, 35, 211–226. communication development will be delayed is high.
Ellis Weismer, S. (1996). Capacity limitations in working In fact, language delays or disorders are often an early
memory: The impact on lexical and morphological learning outward signal of mental retardation.
by children with language impairment. Topics in Language Prior to the 1960s, a child who was diagnosed with
Disorders, 17, 33–44. mental retardation received little or no attention from
Evans, J., Alibali, M., and McNeil, N. (2001). Divergence
of verbal expression and embodied knowledge: Evidence
investigators or practitioners in communication dis-
from speech and gesture in children with specific language orders because it was thought that the child could not
impairment. Language and Cognitive Processes, 16, 309–331. learn and thus would make few gains in speech devel-
Farmer, M. (2000). Language and social cognition in children opment. Following changes in policy and federal legis-
with specific language impairment. Journal of Child Psy- lation, the 1960s saw the emergence of the modern
chology and Psychiatry, 41, 627–636. scientific study of mental retardation. Since then, signifi-
Fazio, B. (1999). Arithmetic calculation, short-term memory, cant research findings about language and communica-
and language performance with specific language impair- tion development have enhanced the speech, language,
ment: A 5-year follow-up. Journal of Speech, Language, and and communication outcomes for children and adults
Hearing Research, 42, 420–431. with mental retardation.
Gillam, R. (Ed.). (1998). Memory and language impairments in
children and adults. Frederick, MD: Aspen.
With respect to communication, children and adults
Jarrold, C., and Baddeley, A. (1999). Genetically dissociated with mental retardation can be broadly divided accord-
components of working memory: Evidence from Down’s ing to whether or not the individual speaks. Most
and Williams syndrome. Neuropsychologia, 37, 637–651. children and adults with mental retardation or devel-
Jarrold, C., Baddeley, A., and Hewes, A. (2000). Verbal short- opmental disabilities do learn to communicate through
term memory deficits in Down syndrome: A consequence of speech, either spontaneously or with the aid of speech
Mental Retardation 353
and language intervention during the developmental of functional communication skills, although the areas
period (Rosenberg and Abbeduto, 1993). A substantial of concentration vary with age and experience.
body of research has addressed the language and com- One intervention approach that has been developed
munication abilities of children and adults with mental for use with individuals with severe communication dif-
retardation who speak. In particular, strong empirical ficulties is augmentative and alternative communication
findings about the communication abilities of children (AAC). AAC encompasses all forms of communica-
and adults with Down syndrome, fragile X syndrome, tion, from simple gestures, manual signs and picture
and Williams syndrome suggest a complex picture, with communication boards to American Sign Language and
di¤erent relations between language comprehension and sophisticated computer-based devices that can speak
production and between language and cognition. The in phrases and sentences for their users. Children with
development of communicative and language interven- mental retardation who can benefit from AAC are usu-
tion approaches for children with mental retardation ally identified based on communication profiles. The
who speak is an area of remarkable developments majority of children with mental retardation who use
(Kaiser, 1993). Psycholinguistic research findings and AAC have more severe forms of mental retardation.
behavioral instructional procedures have provided the These children never develop any speech, or develop
foundation for language intervention protocols for only a few words, or are echolalic. For them, AAC pro-
teaching children with mental retardation specific speech vides a means with which to develop receptive and ex-
and language skills. An early emphasis on direct in- pressive language skills (Romski and Sevcik, 1996).
struction was followed by a shift away from the formal See also communication skills of people with
aspects of language and toward the teaching of lexical down syndrome; mental retardation and speech in
and pragmatic skills, measuring generalization, and the children.
use of intervention approaches in a natural environment
to promote the child’s social competence. Techniques —Mary Ann Romski and Rose A. Sevcik
include milieu teaching, parent-implemented interven-
tion, and peer-mediated approaches. These are each References
identifiable, distinct language interventions with sup- Bricker, D. (1993). Then, now, and the path between: A brief
porting empirical evidence that they work. Perhaps the history of language intervention. In A. Kaiser and D. Gray
most important recent development is the extension of (Eds.), Enhancing children’s communication: Research foun-
intervention approaches to infants and toddlers with dations for intervention (pp. 11–31). Baltimore: Paul H.
developmental disabilities, a move reflecting examina- Brookes.
tions of interventions targeted to intentional commu- Kaiser, A. (1993). Parent-implemented language intervention:
An environmental system perspective. In A. Kaiser and D.
nication and language comprehension (Bricker, 1993).
Gray (Eds.), Enhancing children’s communication: Research
Overall, the field has developed by expanding the con- foundations for intervention (pp. 63–84). Baltimore: Paul H.
tent and focus of intervention programs and fine-tuning Brookes.
the procedures used to deliver the interventions. Greater Luckasson, R., Coulter, D. L., Polloway, E. A., Reiss, S.,
sophistication in language intervention strategies now Schalock, R. L., Snell, M. E., et al. (1992). Mental retarda-
permits an examination of the relationship between the tion: Definition, classification, and systems of supports (9th
characteristics a child brings to the intervention and the ed.). Washington, DC: AAMR.
attributes of the intervention itself. Romski, M. A., and Sevcik, R. A. (1996). Breaking the speech
Some children and adults with mental retardation, barrier: Language development through augmented means.
however, encounter significant di‰culty developing oral Baltimore: Paul H. Brookes.
Rosenberg, S., and Abbeduto, L. (1993). Language and com-
communication skills. Such di‰culty during childhood
munication in mental retardation: Development, processes,
results in inability to express oneself, to maintain social and intervention. Hillsdale, NJ: Erlbaum.
contact with family, to develop friendships, and to func-
tion successfully in school. As the child moves through
adolescence and into adulthood, inability to commu- Further Readings
nicate continues to compromise his or her ability to Chapman, R. (1997). Language development in children and
participate in society, from accessing education and em- adolescents with Down syndrome. Mental Retardation and
ployment to engaging in leisure activities and personal Developmental Disabilities Research Reviews, 3, 307–312.
relationships. For the most part, individuals who expe- Miller, J. F., Leddy, M., and Leavitt, L. (1999). Improving the
rience considerable di‰culty communicating are those communication of people with Down syndrome. Baltimore:
with the most significant degrees of mental retardation. Paul H. Brookes.
They may also exhibit other disabilities, including sei- National Joint Committee for the Communication Needs of
zure disorders, cerebral palsy, sensory impairments, or Persons with Severe Disabilities. (2001). Access to commu-
nication services and supports: Concerns regarding the
maladaptive behaviors. They range in age from very application of restrictive ‘‘eligibility’’ policies. Unpublished
young children just beginning development to adults document. Washington, DC: Author.
with a broad range of life experiences, including a his- Romski, M. A., and Sevcik, R. A. (1997). Augmentative and
tory of institutionalization. These children and adults alternative communication for children with developmental
can and now do benefit from language and communica- disabilities. Mental Retardation and Developmental Dis-
tion intervention focusing on the development and use abilities Research Reviews, 3, 363–368.
Sevcik, R. A., and Romski, M. A. (1997). Comprehension morpheme. This contrasts with the 3sg present indicative
and language acquisition: Evidence from youth with severe verbal su‰x -s, which makes no semantic contribution,
cognitive disabilities. In L. B. Adamson and M. A. Romski because what is semantically ‘‘one’’ or ‘‘more than one’’
(Eds.), Communication and language acquisition: Discoveries is the subject of the sentence—a noun phrase. Numer-
from atypical language development (pp. 184–201). Balti-
osity (and person) are properties of noun meanings, not
Warren, S., and Yoder, P. (1997). Communication, language, verb meanings. The inflection -s appears on the verb as
and mental retardation. In W. McLean (Ed.), Ellis’ hand- a consequence of the number and person of the sub-
book of mental deficiency, psychological theory and research ject, that information having been copied onto the verb.
(pp. 379–403). Mahwah, NJ: Erlbaum. Agreement is therefore an uninterpretable morpheme:
it replicates a bit of meaning represented elsewhere,
surfacing only because the morphosyntax of English
requires it. It has no impact at Logical Form. Such
Morphosyntax and Syntax morphemes are seen as a plausible locus for impairment
(cf. Clahsen, Bartke, and Göllner, 1997).
A second common kind of uninterpretable morphol-
This article discusses issues in the linguistic analysis ogy is case marking. Many case markings have nothing
of morphosyntax and syntax in children with language to do with meaning. For example, in She saw me and I
disorders. It presupposes familiarity with generative was seen by her, the di¤erence between accusative me
linguistic theory. The goal is to illustrate what kinds and nominative I does not correspond to any change in
of grammatically based explanations are available to the semantic role played by the speaker; rather, it reflects
theories of disorders. The task of such explanations is purely syntactic information. A third kind of uninter-
to unify superficially diverse markers of a disorder into pretable morphology arises in concord. For instance, in
natural classes that linguistic theory motivates or to Latin the forms of many words within a noun phrase
characterize observed errors. The discussion is divided, reflect its case and number, as well as the gender of the
as suggested by recent generative theory, into issues head noun: for example, ill-as vi-as angust-as ‘‘those-
concerning the lexicon versus issues concerning the acc.fem.pl streets-acc.fem.pl narrow-acc.fem.pl.’’
computation of larger structures from lexical atoms. Among the uninterpretable features there may be a
The lexicon can be subdivided along various dimen- further distinction to be drawn, as follows. Person and
sions, any of which might be relevant in capturing dis- number information is interpretable on noun phrases but
sociations found in nonnormal language development. A not on verbs; in a sense, there is an asymmetry between
primary split divides content words, i.e., nouns, verbs, the contentful versus the merely duplicated instantiation
adjectives, and adverbs, from all other morphemes, i.e., of those features (cf. Clahsen, 1989). Case is di¤erent:
functional or closed-class elements. (I ignore the chal- it is taken to be uninterpretable both on the recipient/
lenging issue of where adpositions belong; they likely are checkee (noun phrase) and on the source/checker (verb,
heterogeneous, a dichotomy among them being evinced preposition, or Tense); it does not exist at Logical Form
in adult disorders [Rizzi, 1985; Grodzinsky, 1990].) at all. Thus, symmetrically uninterpretable functional
In Chomskyan syntax of the 1980s (cf. Emonds, 1985; elements such as Case constitute another natural class.
Chomsky, 1986; Fukui, 1986) there was a seemingly Grammatical gender is also an example of a morpho-
arbitrary split between functional meanings that were syntactic feature that has no semantic counterpart.
treated as autonomous syntactic functional categories There are two ways in which gender marking might be
(e.g., Tense, Determiner, Agreement, Complementizer) impaired, calling for di¤erent explanations: first, chil-
and those that were not (e.g., number marking on nouns, dren might not be able to consistently recall the gender
participial a‰xes, infinitival su‰xes on verbs). Some of particular nouns; second, children might not be able
attempts were made to understand disorders of acquisi- to consistently copy gender information from the noun,
tion in terms of this heterogeneous system (cf. Leonard, which is associated with gender in the lexicon, to other
1998). However, the harder linguists have looked, the parts of a noun phrase (a problem with concord).
more functional heads for which they have found struc- Turning now from the lexicon to the syntax, under
tural evidence. On parsimony grounds we now expect Minimalism the transformational operations Merge and
all functional meanings to be represented in syntactic Move are restricted in a way that e¤ectively builds some
positions separate from those of content words (cf. van former filters into their definitions. Taking this seriously,
Gelderen, 1993; Hoekstra, 2000; Jakubowicz and certain ways of talking about language disorders in
Nash, 2001). terms of missing or dysfunctional subcomponents of
Among functional elements (morphemes or features syntax no longer make much sense. For example, in
thereof ) a further distinction is made, dubbed ‘‘inter- the theory of Chomsky (1981), it was plausible to talk
pretability’’ by Chomsky (1995). Some morphemes en- about, say, bounding theory (which encompasses the
code elements of meaning and contribute directly to the constraints on movement operations) being inoperative
interpretation of a sentence; for example, the English as a consequence of some disorder; this simply meant
noun plural su‰x -s combines with a noun whose that certain structures that were always generable by
meaning describes a kind of entity, and adds information Move a were no longer declared invalid after the fact
about number; for example, dog þ s ¼ caninehood þ by virtue of a movement that was too long. There is no
more-than-one. Plural -s is therefore an interpretable natural translation of this idea into Minimalist machin-
ery because locality is part of what defines an operation based associations were unchecked and overextended;
as a Move. A language disorder could in principle in- alternatively, it might have a subtler symptomology
volve a change in this definition, but we cannot think of whereby any verb could be correctly learned once its
this as simply excising a piece of the grammar. Similarly, past tense was heard, but in a Wug-testing situation
since movement in Minimalism is just a means to an nonce forms could not be inflected. A more moderate
end, it makes little sense to think of eliminating move- impairment might entail that at least some irregular
ment while leaving the rest of the theory intact; the sys- forms would be demonstrably learnable, but in pro-
tem is now designed such that movement is the only way duction they would not always be retrieved reliably or
to satisfy a fundamental requirement that drives the quickly enough to block a general rule (Clahsen and
computational system, namely, the need to create valid Temple, 2002), apparently violating the Elsewhere
Logical Forms by eliminating uninterpretable features. Condition.
The syntactic accounts of language disorders that We turn now to the logic governing ways in which the
have been suggested often involve impoverished struc- most commonly produced error types from the child
ture. One way to execute this is to posit that particular language disorder literature can be explained. Under
functional heads are either missing from syntactic struc- the strong separation of computational combinatoric
tures or do not contain (all) the features that they would machinery versus lexical storage pursued in Minimalist
for adults (cf. Wexler, 1998). Another way is by refer- syntax, it is possible for children with normal syntactic
ence to the position of the heads in the clausal structure structures to sound very unlike adults, because in their
rather than by reference to their content. Thus, it can be lexicon certain morphemes either are missing or have
proposed that all structure above a particular point, incorrect features associated with them. For example,
say VP, is either missing (Radford, 1990) or optionally the fact that some children learning English might never
truncated (Rizzi, 1994). The extent to which particu- produce -s in an obligatory 3sg context could be consis-
larly the tree pruning variant of this idea (Friedmann tent with them having mastered the syntax of agreement,
and Grodzinsky, 1997) can be characterized as simply if their lexicon has a missing or incorrect entry for 3sg.
lesioning one independently motivated piece of grammar Consequently, it is important to document the lexical
is open to debate. inventory, that is, whether each child does at least
The division between stored representations and sometimes produce the forms of interest or can in some
computational procedures is also relevant in (particu- way be shown to know them. Similarly, suppose that
larly inflectional) morphology, where the contrast be- children learning a given language produce agreement
tween general rules and stored probabilistic associations mismatch errors of just one type, namely, that non-3sg
has a long history (Pinker, 1999). Where precisely the subjects appear with 3sg verb forms. That is, suppose
line should be drawn di¤ers, depending on one’s theory that in Spanish we find errors like yo tiene (‘‘I has (3sg)’’)
of morphology. For instance, if there are any rules at but no errors like ella tengo (‘‘she have (1sg)’’). One
all, then surely the process that adds (a predictable allo- could postulate that 3sg forms (e.g., tiene) are unspeci-
morph of ) -d to form the past tense of a verb is a rule, fied for person and number features and are inserted
and unless all morphology is seen as executed by rules, whenever more specific finite forms (e.g., tengo) are
the relationship between am and was is stored in the unavailable, or that tiene has wrongly been learned as a
lexicon as an idiosyncratic fact about the verb be, not 1sg form. In either case, again, the syntax might be fully
encoded as a rule. But in between lie numerous sub- intact.
regularities that could be treated either way. For exam- Errors due to lexical gaps should pattern di¤erently
ple, the alternation in sing-sang, drink-drank, and so on from errors due to syntactically absent heads. Consider
could be represented as a family of memorized associa- the Tense head as an example. The meaning of tense it-
tions or as a rule (with some memorized exceptions) self (past versus present) can be expressed morphologi-
triggered by the phonological shape of the stem ([In]). cally in the Tense head position, and in addition Tense is
Most psychologically oriented research has assumed that commonly thought to house the uninterpretable feature
there is only one true rule for each inflectional feature that requires the presence of an overt subject in non-null
such as [past tense] (supplying the default), while the subject languages (the unhelpfully named EPP feature),
generative phonology tradition has used numerous rules and the uninterpretable feature that licenses nominative
to capture subpatterns within a paradigm, only the most case on its specifier (the subject). If Tense were com-
general of which corresponds to the default in having pletely missing from the grammar of some children, this
no explicit restriction on its domain. Thus, although a would predict not only that they would produce no tense
dissociation between rules and stored forms is expected morphemes, but also that they would not enforce the
under virtually any approach, precise predictions vary. overt subject requirement and would not (syntactically)
There are several ways in which (inflectional) mor- require nominative case on subjects. If, on the other
phology might be impaired. One of the two mechanisms hand, what is observed is not an absence of tense mark-
might be entirely inoperative, in which case either every ing but rather an incorrect choice of how to express
inflected form would have to be memorized by rote (no Tense morphophonologically (e.g., singed, instead of
rules; cf. Gopnik, 1994) or every word would have to be sang), this would not be compatible with absence of the
treated as regular (no associations). The latter would be Tense head and would not predict the other syntactic
evinced by overregularizations. The former might yield consequences mentioned. Only omission of an inflec-
overirregularizations if, in the absence of a rule, analogy- tional marking or perhaps supernumerary inflection
(e.g., Did he cried?) could have a syntactic cause; incor- 100% (i.e., talking like adults in this regard) and the
rect allomorph selection could not. third child never produced -s. In this circumstance we
If a feature such as Tense is expressed some of the would have no evidence of a developmental stage at
time but not always, this could have two underlying which -s is optional, and the third child’s productions
causes that lead to di¤erent syntactic expectations. One would be consistent with her simply not knowing the
possibility is that Tense is part of the syntactic represen- form of the 3sg present tense verbal inflection in En-
tation in all cases and its inconsistent expression reflects glish. At another extreme, if each of the three children
some intermittent problem in its morphophonological were producing -s in two-thirds of obligatory contexts,
spell-out (cf. Phillips, 1995; Marcus, 1995); in that case we would have to posit a grammar in which Tense or
no syntactic consequences are predicted. The other pos- Agreement are optional (or else multiple concurrent
sibility is that Tense is intermittently absent from syn- grammars). The same logic applies in the temporal di-
tactic representations. In just this scenario we expect that mension: a period of 0% production followed by a
the syntactic properties controlled by the Tense head period of 100% production calls for a di¤erent analy-
should be variable (e.g., subjects are sometimes nomina- sis from a period during which production is at 50%
tive, sometimes not), and furthermore, utterance by within single recording sessions. Therefore, data report-
utterance, syntax and morphology should correlate: the ing needs to facilitate assessment of the extent to which
Tense morpheme should be missing if and only if nomi- samples being pooled represent qualitatively comparable
native case is not assigned/checked and the overt subject grammars. In addition to measures of central tendency
requirement is not enforced. and variance, this calls for the kind of distributional in-
It is crucial to understand such predicted contingen- formation found in ‘‘box-and-whiskers’’ plots.
cies as claims concerning the distribution of contrasting Furthermore, to say that a child ‘‘has acquired X’’ by
forms. It seems clear that some children go through a a certain age, where X is some morpheme or class of
stage during which, for example, some of the English morphemes, is not strictly meaningful from the linguistic
pronoun forms are not produced at all; this is particu- perspective adopted here. We can speak of attaining
larly common for she. What behavior we should expect levels of normal or adult performance, and we can speak
at this stage depends on assumptions about the archi- of having established a lexical entry with the correct
tecture of the syntax-morphology interface. Taking an feature specifications, but there is nothing in grammar
‘‘early insertion’’ view, under which only complete that could correspond to a claim such as the following:
words from the lexicon can be inserted in syntactic deri- ‘‘A child is taken to have acquired X once her rate of
vations, a child lacking a lexical entry for the features production of X morphemes in obligatory contexts is
[pron, 3sg, fem, NOM] should be unable to generate a greater than 90%’’ (cf. Brown, 1973). This begs the
sentence requiring such an item. In contrast, on a ‘‘late question of what the grammar was like when production
insertion’’ nonblocking approach such as Distributed of X was at 85%: If X was not at that time a part of her
Morphology (Halle and Marantz, 1993), there is always grammar, how did the child manage to create the illu-
some form (perhaps phonologically null) that can be sion of using it correctly so much of the time? Also, rates
used to realize the output of a syntactically valid deriva- of production in obligatory contexts must be comple-
tion. The syntactic tree is built up using feature bundles mented by correct usage rates; one is scarcely interpret-
such as [pron, 3sg, fem, NOM], without regard to which able without the other. If X is always used correctly,
vocabulary item might fill such a position. The architec- then even very low production rates signal knowledge of
ture dictates that if there is no vocabulary entry with the properties of X and the syntactic conditions on its
exactly this set of features, then the item containing the distribution. But if X is also frequently used incorrectly,
greatest subset of these features will be inserted. Thus, neither type of knowledge can be inferred.
a child who knows the word her and knows that it is a
feminine singular pronoun could insert it in such a tree, —Carson T. Schütze
producing Her goes from a fully adultlike finite clause
structure. References
We conclude with some methodological points. Gen- Brown, R. (1973). A first language: The early stages. Cam-
erative grammar’s conception of linguistic knowledge as bridge, MA: Harvard University Press.
a cognitive (hence also neural) representation in the Chomsky, N. (1981). Lectures on government and binding.
mind of an individual at some particular time dictates Dordrecht: Foris.
that, in analyzing behavioral data collected over an Chomsky, N. (1986). Knowledge of language: Its nature, origin,
extended period or from multiple children, pooled data and use. New York: Praeger.
cannot be directly interpreted at face value. For exam- Chomsky, N. (1995). The Minimalist program. Cambridge,
ple, suppose we analyze transcripts of three children’s MA: MIT Press.
Clahsen, H. (1989). The grammatical characterization of
spontaneous productions sampled over a period of 6 developmental dysphasia. Linguistics, 27, 897–920.
months and find that in obligatory contexts for some Clahsen, H., and Temple, C. (2003). Words and rules in chil-
grammatical morpheme, say 3sg -s, the overall rate of dren with Williams syndrome. In Y. Levy and J. Schae¤er
use is 67%. Virtually nothing can be concluded from (Eds.), Language competence across populations: Toward a
this datum. It could represent (among other possibilities) definition of Specific Language Impairment (pp. 323–352).
a scenario in which two children were producing -s at Hillsdale, NJ: Erlbaum.
Clahsen, H., Bartke, S., and Göllner, S. (1997). Formal fea- Clahsen, H., and Almazan, M. (1998). Syntax and morphology
tures in impaired grammars: A comparison of English and in Williams syndrome. Cognition, 68, 167–198.
German SLI children. Journal of Neurolinguistics, 10, Clahsen, H., and Almazan, M. (2001). Compounding and in-
151–171. flection in language impairment: Evidence from Williams
Emonds, J. E. (1985). A unified theory of syntactic categories. syndrome (and SLI). Lingua, 111, 729–757.
Dordrecht: Foris. Curtiss, S., Katz, W., and Tallal, P. (1992). Delay versus devi-
Friedmann, N., and Grodzinsky, Y. (1997). Tense and agree- ance in the language acquisition of language-impaired chil-
ment in agrammatic production: Pruning the syntactic tree. dren. Journal of Speech and Hearing Research, 35, 373–
Brain and Language, 56, 397–425. 383.
Fukui, N. (1986). A theory of category projection and its Curtiss, S., and Schae¤er, J. (1997). Syntactic development in
applications. Ph.D. dissertation, MIT, Cambridge, MA. children with hemispherectomy: The Infl-system. In E.
Gelderen, E. van (1993). The rise of functional categories. Hughes, M. Hughes, and A. Greenhill (Eds.), Proceedings
Amsterdam: John Benjamins. of the 21st Annual Boston University Conference on Lan-
Gopnik, M. (1994). Impairments of syntactic tense in a familial guage Development (pp. 103–114). Somerville, MA: Casca-
language disorder. McGill Working Papers in Linguistics, dilla Press.
10, 67–80. Dalalakis, J. (1996). Developmental language impairment: Evi-
Grodzinsky, Y. (1990). Theoretical perspectives on language dence from Greek and its implications for morphological
deficits. Cambridge, MA: MIT Press. representation. Ph.D. dissertation, McGill University,
Halle, M., and Marantz, A. (1993). Distributed morphology Montreal.
and the pieces of inflection. In K. L. Hale and S. J. Keyser Fowler, A. E., Gelman, R., and Gleitman, L. R. (1994). The
(Eds.), The view from Building 20: Essays in linguistics in course of language learning in children with Down syn-
honor of Sylvain Bromberger (pp. 111–167). Cambridge, drome. In H. Tager-Flusberg (Ed.), Constraints on language
MA: MIT Press. acquisition: Studies of atypical children (pp. 91–140). Hills-
Hoekstra, T. (2000). The function of functional categories. In dale, NJ: Erlbaum.
L. Cheng and R. Sybesma (Eds.), The first Glot Interna- Gopnik, M., and Goad, H. (1997). What underlies inflectional
tional state-of-the-article book (pp. 1–25) Berlin: Mouton de error patterns in genetic dysphasia? Neurolinguistics, 10,
Gruyter. 109–137.
Jakubowicz, C., and Nash, L. (2001). Functional categories Harris, N. G. S., Bellugi, U., Bates, E., Jones, W., and Rossen,
and syntactic operations in (ab)normal language acquisi- M. (1997). Contrasting profiles of language development in
tion. Brain and Language, 77, 321–339. children with Williams and Down syndromes. Devel-
Leonard, L. B. (1998). Children with specific language impair- opmental Neuropsychology, 13, 345–370.
ment. Cambridge, MA: MIT Press. Levy, Y., Amir, N., and Shalev, R. (1992). Linguistic develop-
Marcus, G. F. (1995). Children’s overregularization of English ment of a child with a congenital, localized LH lesion.
plurals: A quantitative analysis. Journal of Child Language, Cognitive Neuropsychology, 9, 1–32.
22, 447–459. Levy, Y., and Kavé, G. (1999). Language breakdown and lin-
Phillips, C. (1995). Syntax at age two: Cross-linguistic di¤er- guistic theory: A tutorial overview. Lingua, 107, 95–143.
ences. In C. T. Schütze, J. Ganger, and K. Broihier (Eds.), McGill Working Papers in Linguistics, 10, 1994. [Special issue
Papers on language processing and acquisition (MIT Work- on the linguistic aspects of familial language impairment]
ing Papers in Linguistics 26, pp. 325–382). Cambridge, MA: Oetting, J. B., and Horohov, J. E. (1997). Past tense marking
MIT Press. by children with and without Specific Language Impair-
Pinker, S. (1999). Words and rules: The ingredients of language. ment. Journal of Speech, Language, and Hearing Research,
New York: Basic Books. 40, 62–74.
Radford, A. (1990). Syntactic theory and the acquisition of En- Oetting, J. B., and Rice, M. (1993). Plural acquisition in chil-
glish syntax: The nature of early child grammars of English. dren with Specific Language Impairment. Journal of Speech
Oxford: Blackwell. and Hearing Research, 36, 1236–1248.
Rizzi, L. (1985). Two notes on the linguistic interpretation Rice, M. L. (Ed.). (1996). Toward a genetics of language.
of Broca’s aphasia. In M.-L. Kean (Ed.), Agrammatism Mahwah, NJ: Erlbaum.
(pp. 153–164). Orlando, FL: Academic Press. Rossen, M., Klima, E. S., Bellugi, U., Bihrle, A., and Jones,
Rizzi, L. (1994). Some notes on linguistic theory and language W. (1996). Interaction between language and cognition:
development: The case of root infinitives. Language Acqui- Evidence from Williams syndrome. In J. H. Beitchman,
sition, 3, 371–393. N. J. Cohen, M. M. Konstantareas, and R. Tannock (Eds.),
Wexler, K. (1998). Very early parameter setting and the unique Language, learning, and behavior disorders: Developmental,
checking constraint: A new explanation of the optional in- biological, and clinical perspectives (pp. 367–382). New
finitive stage. Lingua, 106, 23–79. York: Cambridge University Press.
Stevens, T., and Karmilo¤-Smith, A. (1997). Word learning in a
Further Readings special population: Do individuals with Williams syndrome
obey lexical constraints? Journal of Child Language, 24,
Avrutin, S., Haverkort, M., and van Hout, A. (Eds.). (2001). 737–765.
Language acquisition and language breakdown. Brain and Stromswold, K. (1998). Genetics of spoken language disorders.
Language, 77(3). [Special issue] Human Biology, 70, 297–324.
Bishop, D. (Ed.). (2001). Language and cognitive processes in Tager-Flusberg, H. (Ed.). (1994). Constraints on language
developmental disorders. Language and Cognitive Processes, acquisition: Studies of atypical children. Hillsdale, NJ:
16(2–3). [Special issue] Erlbaum.
Capirci, O., Sabbadini, L., and Volterra, V. (1996). Language Ullman, M. T., and Gopnik, M. (1999). Inflectional morphol-
development in Williams syndrome: A case study. Cognitive ogy in a family with inherited specific language impairment.
Neuropsychology, 13, 1017–1039. Applied Psycholinguistics, 20, 51–117.
van der Lely, H. K. J. (Ed.). (1998). Language impairment in The OME-associated hearing loss, which is often
children. Language Acquisition, 7(2–4). [Special issue] variable in degree, recurrent, and at times asymmetri-
van der Lely, H. K. J., and Christian, V. (2000). Lexical cal, has been hypothesized to disrupt the rapid rate of
word formation in children with grammatical SLI: A language-processing, causing a loss of language infor-
grammar-specific versus an input-processing deficit? Cogni-
mation. This disruption has been hypothesized to a¤ect
tion, 75, 33–63.
van der Lely, H. K. J., and Stollwerck, L. (1996). A grammati- children’s language acquisition in the areas of phonology,
cal specific language impairment in children: An autosomal vocabulary, syntax, and discourse in several ways. First,
dominant inheritance? Brain and Language, 52, 484–504. the disruption and variability in auditory input due to
Zukowski, A. (2001). Uncovering grammatical competence in OME may cause children to encode information incom-
children with Williams syndrome. Ph.D. dissertation, Boston pletely and inaccurately into their phonological working
University, Boston. memory. Consequently, children’s lexical development
may be hindered if they have inaccurate representations
of words, which may then result in imprecise lexical rec-
ognition or production. Second, OME-associated hear-
ing loss may result in di‰culties acquiring inflectional
Otitis Media: E¤ects on Children’s morphology and grammar. Children may not hear or
Language may inaccurately hear certain grammatical morphemes
that are of low phonetic substance, such as inflections of
short duration and low intensity (e.g., third person /s/,
Whether recurrent or persistent otitis media during past tense /‘‘ed’’/) and unstressed function words (‘‘is,’’
the first few years of life increases a child’s risk for ‘‘the’’). Third, children’s use of language may also be
later language and learning di‰culties continues to be a¤ected because they may miss subtle nuances of lan-
debated. Otitis media is the most frequent illness of early guage (e.g., intonation marks, questions), which inter-
childhood, after the common cold. Otitis media with feres with their ability to follow conversations. Children
e¤usion (OME) denotes fluid in the middle ear accom- with prolonged or frequent OME may also learn to tune
panying the otitis media. OME generally causes mild to out, particularly in noisy situations, resulting in atten-
moderate fluctuating conductive hearing loss that per- tion di‰culties for auditory-based information. Di‰-
sists until the fluid goes away. It has been proposed that culty maintaining sustained attention could compromise
a child who experiences repeated and persistent episodes children’s ability to sustain discourse (i.e., to follow and
of OME and associated hearing loss in early childhood elaborate on the topic of the conversation) and to orga-
will have later language and academic di‰culties. Unlike nize and produce coherent narratives (both requiring
the well-established relationship between moderate or auditory memory and recall).
severe permanent hearing loss and language develop- Recent models of a potential linkage between a his-
ment, a relationship between OME and later impair- tory of OME and subsequent impaired language devel-
ment in language development is not clear. This entry opment hypothesize that not only factors inherent in the
describes the possible e¤ect of OME on language devel- child but also the child’s environment and the interac-
opment in early childhood, research studies examining tion between the child and the environment can a¤ect
the OME–language learning linkage, and the implica- this relationship (Roberts and Wallace, 1997; Vernon-
tions of this literature for clinical practice. For informa- Feagans, Emanual, and Blood, 1997; Roberts et al.,
tion about the relationship of OME to children’s speech 1998; Vernon-Feagans, 1999). These additional factors
development, see early recurrent otitis media and include both risk factors (e.g., the child has poor phone-
speech development. mic awareness skills, the mother has less than a high
More than 80% of children have had at least one epi- school education, the child care environment is noisy)
sode of otitis media before 3 years of age, and more than and protective factors (e.g., the child has an excellent
40% have had three or more episodes (Teele, Klein, and vocabulary, a literacy-rich home environment, and a re-
Rosner, 1989). The middle ear transmits sounds from the sponsive child care environment). Thus, it is proposed
outer ear to the inner ear, from which information is that the potential impact of OME on children’s lan-
carried by the acoustic nerve to the brain. In OME, the guage development depends on the number and timing
middle ear is inflamed, the tympanic membrane between of OME episodes and associated hearing loss; the child’s
the outer and middle ear is thickened, and fluid is present cognitive, linguistic, and perceptual abilities; the respon-
in the middle ear cavity. The fluid can persist for several siveness and supportiveness of the child’s environment;
weeks or even months after the onset of an episode of and interactions among these variables.
otitis media. The fluid generally results in a mild to Over the past three decades, more than 90 original
moderate conductive hearing loss. The hearing loss is studies have examined whether children who had fre-
typically around 26 dB HL, but it can range from no quent episodes of OME in early childhood score lower
hearing loss to a moderate loss (around 50 dB HL), on measures of language than children without such a
making it hard to hear conversational speech. It has history. Earlier studies examining an association be-
been suggested that frequent and persistent hearing loss tween OME and later language were retrospective in
during the first few years of life, a time that is critical for design (the children’s history of OME was documented
language learning, causes later language di‰culties. by parents reporting the frequency with which children
Otitis Media: E¤ects on Children’s Language 359
had OME or by a review of medical records collected by skills between 1 and 5 years of age (Roberts et al., 1995,
di¤erent medical providers) and were more likely to 1998, 2000). They did find that the caregiving environ-
contain measurement errors. More recent studies of the ment (responsiveness of the child’s home and child care
OME–language linkage were prospective, with chil- environments) mediated the relationship between chil-
dren’s OME histories documented longitudinally from dren’s history of OME and associated hearing loss and
early infancy and repeated at specific sampling intervals. later communication development at 1 and 2 years of
Prospective studies are more likely to have greater ob- age (Roberts et al., 1995, 1998, 2000, 2002). That is,
jectivity and accuracy over time, avoiding many of the children with more OME and associated hearing loss
methodological limitations of previous studies. tended to live in less responsive caregiving environments,
Several prospective studies have found a relationship and these environments were linked to lower perfor-
between a history of otitis media in early childhood and mance on measures of receptive and expressive language
later language skills during the preschool and early ele- skills. More recently, Roberts and colleagues reported
mentary school years. More specifically, in comparison that children with greater incidence of OME scored
with children who infrequently experienced otitis media, lower in expressive language upon entering school but
infants and preschoolers with a history of OME scored caught up with their peers in expressive language by
lower on standardized assessments of receptive and second grade. However, a child’s home environment was
expressive language (Teele et al., 1984; Wallace et al., much more strongly related to early expressive language
1988; Friel-Patti and Finitzo-Hieber, 1990) and in spe- skills than was OME. These and other ongoing pro-
cific language areas, including syntax (Teele et al., 1990), spective studies highlight the importance of examin-
vocabulary (Teele et al., 1984), and narratives (Feagans ing the multiple factors that a¤ect children’s language
et al., 1987). However, many studies failed to find asso- development.
ciations between an early history of OME and later The potential impact of frequent and persistent hear-
measures of overall receptive or expressive language, ing loss due to OME on later language skills may be
vocabulary, or syntax (Teele et al., 1990; Peters et al., particularly important to examine in children from spe-
1994; Paradise et al., 2000; Roberts et al., 2000). cial populations who are already at risk for language
Several ongoing prospective studies are providing and learning di‰culties. Children who have Down
new and important information on whether a history syndrome, fragile X syndrome, Turner’s syndrome, Wil-
of OME in early childhood causes later language di‰- liams’s syndrome, cleft palate, and other craniofacial
culties. Three recent experimental studies (Maw et al., di¤erences often experience frequent and persistent
1999; Rovers et al., 2000; Paradise et al., 2001) examined OME in early childhood (Zeisel and Roberts, 2003;
whether prompt insertion of tympanostomy tubes (to Casselbrant and Mandel, 1999). This increased risk
drain the fluid for children with frequent or persistent for OME among special populations may be due
OME) improved children’s language development, com- to craniofacial structural abnormalities, hypotonia, or
pared with delaying the insertion of tympanostomy immune system deficiencies. A few retrospective studies
tubes. Paradise and colleagues (2001) randomized 429 have reported that a history of OME further delays the
children (at mean age of 15 months) who had persistent language development of children from special popula-
or frequent OME to have tympanostomy tubes inserted tions (Whiteman, Simpson, and Compton, 1986; Loni-
either promptly or 6–9 months later and reported no gan et al., 1992).
language di¤erences between the two treatment groups The question of whether recurrent OME a¤ects the
at age 3 years of age. Rovers and colleagues (2000) also later acquisition of language is still unresolved, in part
did not find that prompt insertion of tympanostomy because of the conflicting findings of studies that have
tubes improved children’s language development. Maw examined this issue. There is increasing support from
and colleagues (1999) did find e¤ects on language devel- prospective studies that for typically developing chil-
opment 9 months after treatment; however, 18 months dren, OME may not be a substantial risk factor for later
after treatment there were no longer di¤erences between language development. Although a few studies report a
the groups. very mild association between OME and later impair-
Other prospective studies considered the impact ment of receptive and expressive language skills during
of multiple factors such as the educational level of the infancy and the preschool years, the e¤ect is generally
mother and the extent of hearing loss a child experienced very small, accounting for only about 1%–4% of the
during early childhood on children’s language devel- variance. Furthermore, it is clear that the caregiving
opment. The Pittsburgh group (Feldman et al., 1999; environment at home and in child care plays a much
Paradise et al., 2000) reported weak but significant cor- more important role than OME in children’s later lan-
relations between OME in the first 3 years of life and guage development. Future research should examine if
language development (accounting for 1%–3% of the frequent hearing loss due to OME relates to children’s
variance in language skills), after controlling for many language development. The impact of a history of OME
family background variables. Roberts and colleagues and associated hearing loss on the language development
(1995, 1998, 2000) prospectively studied the relationship of children from special populations should also be fur-
of both children’s OME and hearing history to language ther studied. Some typically developing children as well
development. They did not find a direct relationship be- as children from special populations may be at increased
tween OME or hearing history and children’s language risk for later language and learning di‰culties due to a
history of OME and associated hearing loss. Until fur- Maw, R., Wilks, J., Harvey, I., Peters, T. J., and Golding, J.
ther research can resolve whether such a relationship (1999). Early surgery compared with watchful waiting for
between a chronic history of OME and later language glue ear and e¤ect on language development in preschool
skills exists and can determine what aspects of language children: A randomised trial. Lancet, 353, 960–963.
Paradise, J. L., Dollaghan, C. A., Campbell, T. F., Feldman,
are a¤ected, hearing status and language skills need to
H. M., Bernard, B. S., Colborn, D. K., et al. (2000). Lan-
be considered in the management of young children with guage, speech sound production, and cognition in three-
histories of OME. year-old children in relation to otitis media in their first
Several strategies have been recommended for young three years of life. Pediatrics, 105, 1119–1130.
children who are experiencing chronic OME (Roberts Paradise, J. L., Feldman, H. M., Campbell, T. F., et al. (2001).
and Medley, 1995; Roberts and Wallace, 1997; Vernon- E¤ect of early or delayed insertion of tympanostomy tubes
Feagans, 1999; Roberts and Zeisel, 2000). First, a child’s on developmental outcomes at the age of three years. New
hearing, speech, and language should be tested after England Journal of Medicine, 344, 1179–1187.
3 months of bilateral OME, or after four to six episodes Peters, S. A. F., Grievink, E. H., van Bon, W. H. J., and
of otitis media in a 6-month period, or when families or Schilder, A. G. M. (1994). The e¤ects of early bilateral
otitis media with e¤usion on educational attainment: A
caregivers are concerned about a child’s development.
prospective cohort study. Journal of Learning Disabilities,
Second, families and other caregivers (e.g., child care 27, 111–121.
providers) of young children with recurrent or persistent Roberts, J. E., Burchinal, M. R., Medley, L. P., Zeisel, S. A.,
OME need clear and accurate information in order Mundy, M., Roush, J., et al. (1995). Otitis media, hear-
to make decisions about the child’s medical and edu- ing sensitivity, and maternal responsiveness in relation to
cational management. Third, children who experience language during infancy. Journal of Pediatrics, 126, 481–
recurrent or persistent OME, similar to all children, will 489.
benefit from a highly responsive language- and literacy- Roberts, J. E., Burchinal, M. R., Jackson, S. C., Hooper, S. R.,
enriched environment. Caregivers should respond to Roush, J., Mundy, M., et al. (2000). Otitis media in early
communication attempts, provide frequent opportunities childhood in relation to preschool language and school
readiness skills among African American children. Pediat-
for children to participate in conversations, and read
rics, 106, 1–11.
often to their children. Fourth, children with chronic Roberts, J. E., Burchinal, M. R., and Zeisel, S. A. (2002). Oti-
OME will benefit from an optimal listening environment tis media in early childhood in relation to children’s school-
in which the speech signal is easy to hear and back- age language and academic skills. Pediatrics, 110, 1–11.
ground noise is kept to a minimum. Fifth, some children Roberts, J. E., Burchinal, M. R., Zeisel, S. A., Neebe, E. C.,
with a history of OME may exhibit language and other Hooper, S. R., Roush, J., et al. (1998). Otitis media, the
developmental di‰culties, and benefit from early inter- caregiving environment, and language and cognitive out-
vention. Finally, the results of ongoing research studies comes at two years. Pediatrics, 102, 346–352.
combined with previous studies should help determine Roberts, J. E., and Medley, L. (1995). Otitis media and speech-
whether a history of OME in early childhood places language sequelae in young children: Current issues in
management. American Journal of Speech-Language Pa-
children at risk for later language di‰culties, and if so,
thology, 4, 15–24.
how to then target intervention strategies. Roberts, J. E., and Wallace, I. F. (1997). Language and otitis
—Joanne E. Roberts media. In J. E. Roberts, I. F. Wallace, and F. Henderson
(Eds.), Otitis media in young children: Medical, develop-
mental and educational considerations (pp. 133–161). Balti-
References Roberts, J. E., and Zeisel, S. A. (2000). Ear infections and lan-
guage development. American Speech-Language-Hearing
Casselbrant, M. L., and Mandel, E. M. (1999). Epidemiology. Association and the National Center for Early Develop-
In R. M. Rosenfeld and C. D. Bluestone (Eds.), Evidence- ment and Learning. Rockville, MD.
based otitis media (pp. 117–136). St. Louis: B. C. Decker. Rovers, M. M., Straatman, H. M., Ingels, K., van der Wilt, G.,
Feagans, L., Sanyal, M., Henderson, F., Collier, A., and van den Broek, P., and Zielhuis, G. (2000). The e¤ect of
Applebaum, M. (1987). Relationship of middle ear diseases ventilation tubes on language development in infants with
in early childhood to later narrative and attentions skills. otitis media with e¤usion: A randomized trial. Pediatrics,
Journal of Pediatric Psychology, 12, 581–594. 106, e42.
Feldman, H. M., Dollaghan, C. A., Campbell, T. F., Colborn, Teele, D. W., Klein, J. O., Rosner, B. A., and the Greater
D. K., Kurs-Lasky, M., Jaosky, J. E., et al. (1999). Parent- Boston Otitis Media Study Group. (1984). Otitis media with
reported language and communication skills of one and two e¤usion during the first three years of life and development
years of age in relation to otitis media in the first two years of speech and language. Pediatrics, 74, 282–287.
of life. Pediatrics, 104, 52. Teele, D. W., Klein, J. O., and Rosner, B. A. (1989). Epi-
Friel-Patti, S., and Finitzo-Hieber, T. (1990). Language learn- demiology of otitis media during the first seven years of life
ing in a prospective study of otitis media with e¤usion in in children in greater Boston. Journal of Infectious Diseases,
the first two years of life. Journal of Speech and Hearing 160, 83–94.
Research, 33, 188–194. Teele, D. W., Klein, J. O., Chase, C., Menyuk, P., Rosner,
Lonigan, C. J., Fischel, J. E., Whitehurst, G. J., Arnold, D. S., B. A., and the Greater Boston Otitis Media Study Group.
and Valdez-Menchaca, M. C. (1992). The role of otitis (1990). Otitis media in infancy and intellectual ability,
media in the development of expressive language disorder. school achievement, speech, and language at age 7 years.
Developmental Psychology, 28, 430–440. Journal of Infectious Diseases, 162, 685–694.
Perseveration 361
Vernon-Feagans, L. (1999). Impact of otitis media on speech, Perseveration

language, cognition, and behavior. In R. M. Rosenfeld and
C. D. Bluestone (Eds.), Evidence-based otitis media (pp.
353–398). St. Louis: B. C. Decker. Perseveration, a term introduced by Neisser in 1895,
Vernon-Feagans, L., Emanuel, D. C., and Blood, I. (1997).
refers to the inappropriate continuation or repetition of
The e¤ect of otitis media on the language and attention
skills of daycare attending toddlers. Developmental Psy- an earlier response after a change in task requirements.
chology, 30, 701–708. Although individuals without brain damage may display
Wallace, I. F., Gravel, J. S., McCarton, C. M., Stapelis, D. R., occasional perseverative behaviors (e.g., Ramage et al.,
Bernstein, R. S., and Ruben, R. J. (1988). Otitis media, 1999), as Allison (1966) pointed out, when perseveration
auditory sensitivity, and language outcomes at one year. is pronounced, ‘‘it is a reliable, if not a pathognomonic
Laryngoscope, 98, 64–70. sign of disturbed brain function’’ (p. 1029). Indeed, per-
Whiteman, B. C., Simpson, G. B., and Compton, W. C. (1986). severation has been described in association with a vari-
Relationship of otitis media and language impairment in ety of neurological and psychiatric conditions, including
adolescents with Down syndrome. Mental Retardation, 24, stroke, head injury, dementia, Parkinson’s disease, and
353–356.
schizophrenia.
Zeisel, S. A., and Roberts, J. E. (2003). Otitis media in young
children with disabilities. Infants and Young Children: An Perseveration is such a notable and fascinating clini-
Interdisciplinary Journal of Special Care Practices, 16, 106– cal phenomenon that for more than 100 years, various
119. researchers have attempted to more precisely describe its
characteristics, label its subtypes, and identify its neuro-
Further Readings pathological correlates and neuropsychological mecha-
nisms. Good agreement has emerged from these studies
Bluestone, C. D., and Klein, J. O. (2001). Otitis media in as to the characteristics of various forms of persevera-
infants and children. Philadelphia: Saunders. tion, with rather little agreement as to labels for sub-
Casby, M. W. (2001). Otitis media and language development:
types. And although most investigators agree that the
A meta-analysis. American Journal of Speech-Language
Pathology, 10, 65–80. frontal lobes and their associated white matter pathways
Gravel, J. S., and Wallace, I. F. (1992). Listening and language play a prominent role in perseveration, other areas of
at 4 years of age: E¤ects of otitis media. Journal of Speech the brain have been implicated. The neuropsychologi-
and Hearing Research, 35, 588–595. cal mechanisms responsible for perseveration also are
Gravel, J. S., and Wallace, I. F. (2000). E¤ects of otitis media uncertain and probably vary according to subtypes.
with e¤usion on hearing in the first three years of life. Among the mechanisms implicated are persistent mem-
Journal of Speech, Language, and Hearing Research, 43, ory traces, failure to inhibit prepotent responses, patho-
631–644. logical inertia, and failure to disengage attention. For a
Kavanagh, J. F. (Ed.). (1986). Otitis media and child develop- review of some of this literature, see Hotz and Helm-
ment. Parkton, MD: York Press.
Estabrooks (1995a).
National Institute of Child Health and Human Development
Early Child Care Research Network. (2001). Child care and It is important for professionals working with indi-
common communicable illnesses. Archives of Pediatrics and viduals having neurological conditions to be aware of
Adolescent Medicine, 155, 481–488. perseveration and recognize its subtypes, because this
Paradise, J. L., Feldman, H. M., Colborn, K., Campbell, T. F., behavior can contaminate experimental and clinical test
Dollagahn, C. A., Rockette, H. E., et al. (1999). Parental results and reduce communicative e¤ectiveness. Persev-
stress and parent-rated child behavior in relation to otitis eration can occur in any behavioral output modality,
media in the first three years of life. Pediatrics, 104, 1264– including speech, writing, gesturing, drawing, and other
1273. forms of construction. Three primary forms of persever-
Rach, G. H., Zielhuis, G. A., van Baarle, P. W., and van den ation have been described, with one of these forms hav-
Broek, P. (1991). The e¤ect of treatment with ventilating
ing four subtypes. The terms used here are derived from
tubes on language development in preschool children
with otitis media with e¤usion. Clinical Otolaryngology, 16, several sources (e.g., Santo-Pietro and Rigrodsky, 1986;
128–132. Sandson and Albert, 1987; Lundgren et al., 1994; Hotz
Rach, G. H., Zielhuis, G. A., and van den Broek, P. and Helm-Estabrooks, 1995b).
(1988). The influence of chronic persistent otitis media with Stuck-in-set perseveration is the inappropriate main-
e¤usion on language development of 2- to 4-year-olds. In- tenance of a category or framework of response after
ternational Journal of Pediatric Otorhinolaryngology, 15, introduction of a new task. For example, as a part of a
253–261. standardized test, an individual with traumatic brain in-
Roberts, J. E., Wallace, I. F., and Henderson, F. (Eds.). jury without aphasia was asked to list as many animals
(1997). Otitis media in young children: Medical, develop- as he could in 1 minute. He listed ten animals before he
mental and educational considerations. Baltimore: Paul H.
was given the following instructions: ‘‘Now I want you
Brookes.
Wallace, I. F., Gravel, J. S., Schwartz, R. G., and Ruben, R. J. to name as many words as you can that start with the
(1996). Otitis media, communication style of primary care- letter m. [He was shown a lowercase m.] Here are the
givers, and language skills of 2 year olds: A preliminary rules. Do not name words that begin with a capital M.
report. Journal of Developmental Behavioral Pediatrics, 17, Do not say the same words with a di¤erent ending, like
27–35. mop, then mopped or mopping. [The written letter m was
removed.] Okay, you have 1 minute to name as many
words you can think of that start with the letter m.’’ In Rigrodsky, 1982; Emery and Helm-Estabrooks, 1989;
response, the man said ‘‘monkey,’’ ‘‘mouse’’ in the first Helm-Estabrooks et al., 1998). There is good evidence
few seconds, then ‘‘man’’ after 15 seconds. He produced that perseverative behaviors are unrelated to time post
no further responses for the remaining time. Thus, al- onset of aphasia but are correlated significantly with
though he understood the concept of listing m words, aphasia severity. Thus, perseveration can be a persistent
he could not disengage from the idea of listing animals, problem for individuals with aphasia and interfere with
and his score for producing words according to a letter/ all modalities of communicative expression. As such,
sound category was contaminated by stuck-in-set perseveration is an important treatment target for speech
perseveration. and language clinicians working with aphasic individu-
Continuous perseveration is the inappropriate prolon- als, although few approaches have been described thus
gation or continuation of a behavior without an inter- far. Exceptions are the program designed by Helm-
vening response or stimulus. For example, a woman Estabrooks, Emery, and Albert (1987) to reduce verbal
with Alzheimer’s disease was given the following spoken recurrent perseveration, and the strategies described by
and written instructions: ‘‘Draw a clock. Put in all the Bryant, Emery, and Helm-Estabrooks (1994) to man-
numbers. Set the hands to 10 minutes after 11.’’ She age various forms of perseveration in severe aphasia.
wrote numbers 1 through 18 in the circle provided before See Helm-Estabrooks and Albert (2003) for updated
she ran out of space. She then drew a hand to the num- descriptions on these methods.
ber 10, but continued to draw hands to each number.
Thus, either she was unable to disengage from the idea —Nancy Helm-Estabrooks
of drawing clock hands or she was unable to inhibit that
particular graphomotor activity. References
Recurrent perseveration is the inappropriate recur-
rence of a previous response following presentation of Albert, M. L., and Sandson, J. (1986). Perseveration in apha-
sia. Cortex, 22, 103–115.
a new stimulus or after giving a di¤erent intervening
Allison, R. S. (1966). Perseveration as a sign of di¤use and fo-
response. For example, a man with fluent aphasia was cal brain damage. I. British Medical Journal, 2, 1027–1032.
asked to write the days of the week. He wrote, ‘‘Mon- Bryant, S. L., Emery, P. A., and Helm-Estabrooks, N. (1994).
day, Tuesday, Wednesday, Tuesday, Friday, Saturday, Management of di¤erent forms of perseveration in severe
Monday, Sunday.’’ aphasia. Seminars in Speech and Language, 15, 71–84.
Various subtypes of recurrent perseveration have Emery, P., and Helm-Estabrooks, N. (1989). The role of per-
been described and labeled. A primary distinction can be severation in aphasic confrontation naming performance.
made between carryover of part of the phonemic struc- Proceedings of Clinical Aphasiology Conference, 18, 64–83.
ture of a previous word and repetition of an entire word. Emery, P., and Helm-Estabrooks, N. (1995). Whole word
An example of phonemic carryover perseveration, in perseverations: Where do they come from? Brain and Lan-
guage, 51, 29–31.
which part of the phonemic makeup of a previous word Helm-Estabrooks, N., and Albert, M. L. (2003). Manual of
is inappropriately repeated, is ‘‘comb’’ for comb, then aphasia and aphasia therapy. Austin, TX: Pro-Ed.
‘‘klower’’ for flower. Helm-Estabrooks, N., Emery, P., and Albert, M. L. (1987).
Within the category of whole-word carryover, three Treatment of aphasic perseveration (T.A.P.) program: A
types of perseveration occur, semantic, lexical, and new approach to aphasia therapy. Archives of Neurology,
program-of-action perseveration. 44, 1253–1255.
Semantic perseverations are words that are semanti- Helm-Estabrooks, N., Ramage, A., Bayles, K., and Cruz,
cally related to the target (e.g., repetition of the naming R. (1998). Perseverative behavior in fluent and non-fluent
response apple when shown a lemon). aphasic adults. Aphasiology, 12, 689–698.
Lexical perseverations are words that have no obvious Hotz, G., and Helm-Estabrooks, N. (1995a). Perseveration:
Part I. A review. Brain Injury, 9, 151–159.
semantic relation to the target (e.g., repetition of the Hotz, G., and Helm-Estabrooks, N. (1995b). Perseveration:
word key when asked to name scissors). Part II. A study of perseveration in closed-head injury.
Program-of-action perseverations are repeated words Brain Injury, 9, 161–172.
that begin with the same initial sound as a previous Lundgren, K., Helm-Estabrooks, N., Magnusdottir, S., and
response (e.g., repetition of the response wristwatch for Emery, P. (1994). Exploring mechanisms underlying
subsequent objects, such as wrench, whose names begin ‘‘recurrent’’ perseveration: An aphasic naming study. Brain
with /r/). and Language, 47, 370–373.
Although, as mentioned earlier, perseveration occurs Neisser, A. (1895). Krankenvorstellung. Allgemeine Zeit-
in association with many neurological and psychiatric schrifte fur Psychitrie, 51, 1016–1021.
conditions, perseverative behavior is especially notable Ramage, A., Bayles, K., Helm-Estabrooks, N., and Cruz, R.
(1998). Frequency of perseveration in normal subjects.
in acquired aphasia. The results of their study of persev- Brain and Language, 66, 329–340.
erative behaviors in aphasic individuals prompted Albert Sandson, J., and Albert, M. L. (1984). Varieties of persevera-
and Sandson (1986) to suggest that it ‘‘may even com- tion. Neuropsychologia, 22, 715–732.
prise an integral part of [the] specific language deficits Santo-Pietro, M. J., and Rigrodsky, S. (1986). Patterns of oral-
in aphasia’’ (p. 105). This suggestion is supported by verbal perseveration in adult aphasics. Brain and Language,
the work of other investigators (e.g., Santo-Pietro and 29, 1–17.
Phonological Analysis of Language Disorders in Aphasia 363
Further Readings Wernicke’s aphasia. Her studies have focused on the

di‰culties these patients have with constructing phone-
Bayles, K. A., Tomoeda, C. K., Kaszniak, A. W., Sterns, mic strings once the full-form lexical representations
L. Z., and Eagans, K. K. (1985). Verbal perseveration of
dementia patients. Brain and Language, 25, 102–116.
have been accessed in their stored form. Through the
Brown, J. W., and Chobor, K. L. (1989). Frontal lobes and mechanisms of models (Shattuck-Hufnagel, 1979; Gar-
the problem of perseveration. Journal of Neurolinguistics, 4, rett, 1984), Kohn has successfully characterized many
65–85. paraphasic types, located the production process where
Helm-Estabrooks, N., Bayles, K., and Bryant, S. (1994). Four the error occurs, and specified which type of error is
forms of perseveration in dementia and aphasia patients diagnostic of distinct aphasic syndromes. She has incor-
and normal elders. Brain and Language, 47, 457–460. porated syllable structure constraints, phonotactic pat-
Freedman, T., and Gathercole, C. E. (1966). Perseveration: terns, and the principle of sonority. Furthermore, Kohn,
The clinical symptoms in chronic schizophrenia and organic Smith, and Alexander (1996) have charted the recovery
dementia. British Journal of Psychiatry, 112, 27–32. patterns of Wernicke’s patients who in the acute stages
Fuld, P. A., Katzman, R., Davis, P., and Terry, R. D. (1982).
Intrusions as a sign of Alzheimer’s dementia: Chemical and
produced neologistic jargon. They observed that in cer-
pathological verification. Annals of Neurology, 11, 155–159. tain of these patients the aphasia resolved to a chronic
Hudson, J. (1968). Perseveration. Brain, 91, 143–582. stage in which the patients were clearly getting closer to
Luria, A. R. (1965). Two kinds of motor perseveration in underlying lexical representations and producing less se-
massive injury of the frontal lobes. Brain, 88, 1–9. vere phonemic paraphasia, with lower number of ran-
Shindler, A. G., Caplan, L. R., and Hier, D. B. (1984). Intru- dom segments in those errors. In another set of patients
sions and perseverations. Brain and Language, 23, 148–158. the aphasia resolved to a chronic stage in which the
Yamadori, A. (1981). Verbal perseverations in aphasia. Neu- patients maintained severe lexical access disruptions,
ropsychologia, 19, 591–594. producing less neology but more lexical blocks, cir-
cumlocutions, and other errors indicative of lingering
Phonological Analysis of Language anomia.
David Caplan (1987) and colleagues (Caplan and
Disorders in Aphasia Waters, 1992) have published widely on phonological
breakdown in aphasia. He, as well as Kohn, has ana-
Various approaches have been utilized in the study lyzed the phonemic string construction di‰culties of
of phonological disorders in aphasia, but each usually the syndrome of ‘‘reproduction’’ conduction aphasia. He
shares certain assumptions of the others. This overview localizes this disruption at the point where final sound
presents these di¤erent orientations with the recognition production is called for from either a semantic input,
that there is a good deal of overlap in each. an auditory/lexical input, or a written input. That is, the
patient will produce phonemic errors in object naming,
Neurolinguistic in repetition, or in reading aloud. Any one patient may
produce paraphasias with all or certain ones of these
Renee Beland (and several colleagues) have used inputs. Cognitive neuropsychological dissociations are
‘‘underspecification theory’’ to capture the types of pho- numerous here.
nological errors in fluent aphasics and the constraints on Buckingham and Kertesz (1976) analyzed the neo-
those errors (Beland, 1990; Beland, Caplan, and Nes- logistic jargon of several patients with fluent aphasia.
poulous, 1990). The phonological model here has three Each patient revealed a good deal of phonemic errors,
levels: the minimally specified level, a lexical level, and a but many errors rendered underlying targets opaque.
surface level. Feature markedness, phonotactic patterns, Other forms, however, were opaque and otherwise
and syllable constituent slots (onset, rime, nucleus, coda) abstruse where there was no clear transparency of any
are then used to describe the nature and location of certain underlying lexical target. All patients had severe
phonemic paraphasias and the constraints on those anomia, which led to the suggestion that there could
paraphasias. be two separate productive mechanisms or processes
As a student of Roman Jakobson, Sheila Blumstein that might give rise to the production of these non-
(1973) is the intermediary between Prague School pho- recognizable word-form errors, but in that study the
nological studies of aphasic errors (e.g., Jakobson, 1968) question was left unexplored. The issue was broached
and a host of neurolinguistic studies of paraphasia pub- again in Buckingham (1977), but not until Butterworth
lished since the early 1970s. Her contributions to the (1979) was a specific idea put forth, that of a ‘‘ran-
study of the neuropsychology and neurobiology of hu- dom generator.’’ Subsequently, Buckingham (1990a)
man language sound structure are herculean (see Blum- attempted to mollify the critics of the random generator
stein, 1995, 1998, for discussion of much of her work). by showing that the idea was in a sense a metaphor
Her initial study of the typology of phonemic paraphasia for the use and appreciation of general phonological
set the stage for numerous ensuing studies. knowledge, which for all speakers underlies the ability to
Susan Kohn (1989, 1993) has provided a wealth recognize ‘‘possible words’’ in a language and to produce
of information on phonological breakdown in fluent ‘‘nonce’’ forms when necessary. The issue of a dual route
aphasics, usually patients with conduction aphasia and for the neologism has reappeared recently. Gagnon and
Schwartz (1996) found no bimodal distribution in their neologisms, where target words were not recognizable.
corpus of phonemic paraphasias and neologisms, and Recovery patterns were then analyzed and simulated,
thus argued for a single route for their productions. where it appeared to the authors that although there was
In contradistinction, Kohn, Smith, and Alexander (1996) improvement in productions, the productions themselves
observed patients who demonstrated both routes, one remained either of the decay lesion type or of the con-
for phonemic paraphasia and another that involved nection lesion type. The only problem with phonological
some sort of a ‘‘backup mechanism for ‘reconstructing’ breakdown would be that those who had produced neo-
a phonological representation when either partial or no logisms early on may resolve to a less severe phonemic
stored phonological information about a word is made paraphasia, where targets could be increasingly gleaned,
available to the production system’’ (p. 132), which thereby indicating level interaction. That scenario, of
reduces to some kind of generating device. course, would mean that a connection weight lesion
In Buckingham (1990b), the principle of sonority (see pattern would resolve to a decay rate lesion pattern.
Ohala, 1992, for some criticism of this principle) was Most connectionist-oriented accounts do not provide for
invoked to provide an account of the structural con- dual routes for neologisms (e.g., Gagnon and Schwartz,
straints on phonemic paraphasia, and shortly thereafter 1996; Gagnon et al., 1997; Goldman, Schwartz, and
Christman (1994) extended the utilization of sonority Wilshire, 2001; Nadeau, 2001), locating their production
to capture the pattern constraints on a large corpus of exclusively between the lexeme and the phoneme strata.
neologisms, where she statistically demonstrated that For instance, the model of Hillis et al. (1999, p. 1820)
neologisms abided by both sonority and phonotactic accounts for phonemic paraphasias and neologisms with
dictates. A major portion of her contribution was to di¤erent degrees of connectionist weakenings between
show how sonority could go beyond mere phonotactics these two strata, where only a ‘‘few’’ nontarget subword
to characterize neologistic word production. units would be activated for phonemic paraphasias but
‘‘many’’ nontarget subword units would be activated
Slips, Paraphasia, and the Continuity Thesis for neologisms. It is not yet clear just how connec-
Slips-of-the-tongue have always played a role in the tionist models will continue to eschew dual routes for
modeling, characterization, analysis, and explanation neologisms.
of paraphasia. There is a direct line from Hughlings-
Jackson in the second half of the nineteenth century to Apraxia of Speech Meets Phonemic
modern psycholinguistic studies of the ‘‘lapsus linguae,’’ Paraphasia: Phonology or Phonetics?
all of which have assumed at least some degree of conti-
Square, Roy, and Martin (1997) have recently dis-
nuity between the functional errors in normality and the
cussed posterior lesion apraxia of speech patients, which
paraphasias in pathology (Buckingham, 1999).
accords with ideas suggested in Buckingham and Yule
(1987). In addition, many presumed phonemic para-
Phonological Breakdown and Connectionist
phasias in Broca’s aphasia (e.g., Keller, 1978) could very
Modeling plausibly stem from apraxic asynchronies (Buckingham
Wheeler and Touretzky (1997) have combined a model and Yule, 1987). Speech perception disturbances have
of segmental ‘‘licensing’’ from phonological auto- been observed in Broca’s aphasics, and Wernicke’s
segmental theory with a connectionist simulation of aphasics often present with articulatory abnormalities
phonological disruptions in fluent aphasia with no psy- (Price, Indefrey, and van Turennout, 1999, p. 212). The
cholinguistic mechanisms at all. In an impressive and perceptual function of the inferior frontal gyrus has also
surprising fashion, they were able to simulate most error been observed by Hsieh et al. (2001). Galaburda has
types and conditions described in such work as Buck- uncovered motor regions in layer III of the left temporal
ingham (1986). cortex in the plenum region (Galaburda, 1982), and
Dell et al. (1997) have published one of the most in- Amaducci et al. (1981) have found asymmetrical con-
depth investigations of a connectionist system (of the centrations of choline-acetyltransferase (ChAT) in this
interactive activation type) that simulated normal nam- region from neurochemical assays at autopsy in human
ing and aphasic naming—and only naming. The three- subjects. Recent reviews (e.g., Blumstein, 1995) have
level (semantic, lexical, segmental) system had upward emphasized the highly distributed nature of the sound
and downward feedforward and feedback connections. system throughout the left perisylvian cortex. Little
The system was set up so that connection weight values wonder there is such a degree of indeterminacy with
and decay rate values could be varied globally through- aphasic sound system disruptions in stroke. From purely
out the system as a whole. Phonological errors (and linguistic reasoning, Ohala (1990) has challenged the
other errors) that indicated level interaction were pro- claim that phonetics and phonology are separate sys-
duced by lesioning only the decay rates: phonemic para- tems; rather, they are totally integrated. Vocal tract em-
phasias that did not render targets opaque and formal bodiment is tightly linked to most so-called phonological
verbal substitutions where error and target shared pho- properties (e.g., syllable constituency and phonotactics,
nological features. Lesioning only connection weights, sonority, feature markedness, coarticulatory processes),
however, simulated phonological errors (and other so that whatever element may be ‘‘phonologized’’ will
errors) that did not indicate interaction between levels: remain forever linked to vocal tract dynamics (see
Phonological Analysis of Language Disorders in Aphasia 365
Christman, 1992; MacNeilage, 1998). It would seem that Galaburda, A. (1982). Histology, architectonics, and asymme-
we are poised for major reevaluations in our under- try of language areas. In M. Arbib, D. Caplan, and J.
standing of how the human sound system breaks down Marshall (Eds.), Neural models of language processes (pp.
in aphasia. 435–445). New York: Academic Press.
Garrett, M. (1984). The organization of processing structure
—Hugh W. Buckingham for language production: Applications to aphasic speech.
In D. Caplan, A. Lecours, and A. Smith (Eds.), Biological
References perspectives on language (pp. 172–193). Cambridge, MA:
MIT Press.
Amaducci, L., Sorbi, S., Albanese, A., and Gianotti, G. (1981). Goldmann, R. E., Schwartz, M. F., and Wilshire, C. E. (2001).
Choline-acetyltransferase (ChAT) activity di¤ers in right The influence of phonological context on the sound errors
and left human temporal lobes. Neurology, 31, 799–805. of a speaker with Wernicke’s aphasia. Brain and Language,
Beland, R. (1990). Vowel epenthesis in aphasia. In J.-L. Nes- 78, 279–307.
poulous and P. Villiard (Eds.), Morphology, Phonology and Hillis, A., Boatman, D., Hart, J., and Gordon, B. (1999).
Aphasia (pp. 235–252). New York: Springer-Verlag. Making sense out of jargon: A neurolinguistic and
Beland, R., Caplan, D., and Nespoulous, J.-L. (1990). The computational account of jargon aphasia. Neurology, 53,
role of abstract phonological representations in word pro- 1813–1824.
duction: Evidence from phonemic paraphasias. Journal of Hsieh, L., Gandour, J., Wong, D., and Hutchins, G. (2001).
Neurolinguistics, 5, 125–164. Functional heterogeneity of inferior frontal gyrus is shaped
Blumstein, S. (1973). A phonological investigation of aphasic by linguistic experience. Brain and Language, 76, 227–252.
speech. The Hague: Mouton. Jakobson, R. (1968). Child language, aphasia, and phonological
Blumstein, S. (1995). The neurobiology of the sound structure universals (A. R. Keiler, Trans.). The Hague: Mouton.
of language. In M. S. Gazzaniga (Ed.), The cognitive neu- Keller, E. (1978). Parameters for vowel substitutions in Broca’s
rosciences (pp. 915–929). Cambridge, MA: The MIT Press. aphasia. Brain and Language, 5, 265–285.
Blumstein, S. (1998). Phonological aspects of aphasia. In M. Kohn, S. (1989). The nature of the phonemic string deficit in
Sarno (Ed.), Acquired aphasia (pp. 157–185). San Diego, conduction aphasia. Aphasiology, 3, 209–239.
CA: Academic Press. Kohn, S. (1993). Segmental disorders in aphasia. In G.
Buckingham, H. (1977). The conduction theory and neologistic Blanken, J. Dittmann, H. Grimm, J. Marshall, and C.-W.
jargon. Language and Speech, 20, 174–184. Wallesch (Eds.), Linguistic disorders and pathologies: An
Buckingham, H. (1986). The scan-copier mechanism and the international handbook (pp. 197–209). Berlin: Walter de
positional level of language production: Evidence from Gruyter.
aphasia. Cognitive Science, 10, 195–217. Kohn, S., Smith, K., and Alexander, M. (1996). Di¤erential
Buckingham, H. (1990a). Abstruse neologisms, retrieval defi- recovery from impairment to the phonological lexicon.
cits and the random generator. Journal of Neurolinguistics, Brain and Language, 52, 129–149.
5, 215–235. MacNeilage, P. (1998). The frame/content theory of evolution
Buckingham, H. (1990b). Principle of sonority, doublet cre- of speech production. Behavioral and Brain Sciences, 21,
ation, and the checko¤ monitor. In J.-L. Nespoulous and 499–546.
P. Villiard (Eds.), Morphology, phonology and aphasia (pp. Nadeau, S. E. (2001). Phonology: A review and proposals
193–205). New York: Springer-Verlag. from a connectionist perspective. Brain and Language, 79,
Buckingham, H. (1999). Freud’s continuity thesis. Brain and 511–579.
Language, 69, 76–92. Ohala, J. (1990). There is no interface between phonology
Buckingham, H., and Kertesz, A. (1976). Neologistic jargon and phonetics: A personal view. Journal of Phonetics, 18,
aphasia. Amsterdam: Swets and Zeitlinger. 153–171.
Buckingham, H., and Yule, G. (1987). Phonemic false evalua- Ohala, J. (1992). Alternatives to the sonority hierarchy for
tion: Theoretical and clinical aspects. Clinical Linguistics explaining segmental sequential constraints. In M. Ziol-
and Phonetics, 1, 113–125. kowski, M. Noske, and K. Deaton (Eds.), Papers from the
Butterworth, B. (1979). Hesitation and the production of ver- 26th Regional Meeting of the Chicago Linguistic Society.
bal paraphasias and neologisms in jargon aphasia. Brain Vol. 2. The parasession on the syllable in phonetics and pho-
and Language, 18, 133–161. nology (pp. 319–338). Chicago: Chicago Linguistic Society.
Caplan, D. (1987). Neurolinguistics and linguistic aphasiology. Price, C., Indefrey, P., and van Turennout, M. (1999). The
Cambridge, U.K.: Cambridge University Press. neural architecture underlying the processing of written and
Caplan, D., and Waters, G. (1992). Issues arising regarding the spoken word forms. In C. Brown and P. Hagoort (Eds.),
nature and consequences of reproduction conduction apha- The neurocognition of language (pp. 211–240). Oxford: Ox-
sia. In S. Kohn (Ed.), Conduction aphasia (pp. 117–149). ford University Press.
Hillsdale, NJ: Erlbaum. Shattuck-Hufnagel, S. (1979). Speech errors as evidence for
Christman, S. (1992). Abstruse neologism formation: Parallel a serial-ordering mechanism in sentence production. In W.
processing revisited. Clinical Linguistics and Phonetics, 6, Cooper and E. Walker (Eds.), Sentence processing: Psycho-
65–76. linguistic studies presented to Merrill Garrett (pp. 295–342).
Christman, S. (1994). Target-related neologism formation in Hillsdale, NJ: Erlbaum.
jargonaphasia. Brain and Language, 46, 109–128. Square, P., Roy, E., and Martin, R. (1997). Apraxia of speech:
Dell, G., Schwartz, M., Martin, N., Sa¤ran, E., and Gagnon, Another form of praxis disruption. In L. Rothi and K.
D. (1997). Lexical access in aphasic and nonaphasic speak- Heilman (Eds.), Apraxia: The neuropsychology of action
ers. Psychological Review, 104, 808–838. (pp. 173–206). East Sussex, U.K.: Psychology Press.
Gagnon, D., and Schwartz, M. (1996). The origins of neo- Wheeler, D., and Touretzky, D. (1997). A parallel licensing
logisms in picture naming by fluent aphasics. Brain and model of normal slips and phonemic paraphasias. Brain and
Cognition, 32, 118–120. Language, 59, 147–201.
Further Readings Lecours, A. (1982). On neologisms. In J. Mehler, E. Walker,

and M. Garrett (Eds.), Perspectives on mental representa-
Baum, S., and Slatkovsky, K. (1993). Phonemic false evaluation: Experimental and theoretical studies of cognitive pro-
tion? Preliminary data from a conduction aphasia patient. cesses and capacities (pp. 217–247). Hillsdale, NJ: Erlbaum.
Clinical Linguistics and Phonetics, 7, 207–218. Lecours, A., and Lhermitte, F. (1969). Phonemic paraphasias:
Beland, R., and Favreau, Y. (1991). On the special status of Linguistic structures and tentative hypotheses. Cortex, 5,
coronals in aphasia. In C. Paradis and J.-F. Prunet (Eds.), 193–228.
Phonetics and phonology. Vol. 2. The special status of coro- Lecours, A., and Rouillon, F. (1976). Neurolinguistic analysis
nals (pp. 201–221). San Diego, CA: Academic Press. of jargonaphasia and jargonagraphia. In H. Whitaker and
Blumstein, S. (1978). Segment structure and the syllable. In H. A. Whitaker (Eds.), Studies in neurolinguistics. Vol. 2
A. Bell and J. Hooper (Eds.), Syllables and segments (pp. (pp. 95–144). New York: Academic Press.
189–200). Amsterdam: North Holland. Martin, N., Dell, G., Sa¤ran, E., and Schwartz, M. (1994).
Blumstein, S. (1990). Phonological deficits in aphasia. In Origins of paraphasias in deep dysphasia: Testing the con-
A. Caramazza (Ed.), Cognitive neuropsychology and neuro- sequences of a decay impairment to an interactive activa-
linguistics: Advances in models of cognitive function and im- tion model of lexical retrieval. Brain and Language, 47,
pairment (pp. 33–53). Hillsdale, NJ: Erlbaum. 609–660.
Buckingham, H. (1980). On correlating aphasic errors with Nickels, L. (1997). Spoken word production and its breakdown
slips-of-the-tongue. Applied Psycholinguistics, 1, 199–220. in aphasia. East Sussex, U.K.: Psychology Press.
Buckingham, H. (1981). Where do neologisms come from? In J. Nickels, L., and Howard, D. (2000). When words won’t come:
Brown (Ed.), Jargonaphasia (pp. 39–62). New York: Aca- Relating impairments and models of spoken word produc-
demic Press. tion. In L. Wheeldon (Ed.), Aspects of language production
Buckingham, H. (1987). Phonemic paraphasias and psycholin- (pp. 115–142). East Sussex, U.K.: Psychology Press.
guistic production models for neologistic jargon. Aphasiol- Poeppel, D. (1996). A critical review of PET studies of phono-
ogy, 1, 381–400. logical processing. Brain and Language, 55, 317–351.
Buckingham, H. (1992). The mechanisms of phonemic para- Poeppel, D. (1996). Some remaining questions about studying
phasia. Clinical Linguistics and Phonetics, 6, 41–63. phonological processing with PET: Response to Demonet,
Buckingham, H. (1998). Explanations for the concept of Fiez, Paulesu, Petersen, and Zatorre (1996). Brain and Lan-
apraxia of speech. In M. Sarno (Ed.), Acquired aphasia guage, 55, 380–385.
(pp. 269–307). San Diego, CA: Academic Press. Schwartz, M., Sa¤ran, E., Block, D., and Dell, G. (1994).
Butterworth, B. (1992). Disorders in phonological encoding. Disordered speech production in aphasic and normal
Cognition, 42, 261–286. speakers. Brain and Language, 47, 52–88.
Caplan, D., and Waters, G. (1995). On the nature of the
phonological output planning processes involved in verbal
rehearsal: Evidence from aphasia. Brain and Language, 48,
191–220. Phonology and Adult Aphasia
Caramazza, A. (2000). Aspects of lexical access: Evidence from
aphasia. In Y. Grodzinsky, L. Shapiro, and D. Swinney
(Eds.), Language and the brain: Representation and process- The sound structure of language is the primary medium
ing (pp. 203–228). San Diego: Academic Press. for language communication. As a result, deficits af-
Cutler, A. (Ed.). (1981). Slips-of-the-tongue and language pro- fecting phonology, defined as the sound structure of
duction. Linguistics, 19, 561–847. [Special issue] language, may have a critical impact on language com-
Demonet, J. F., Fiez, J. A., Paulesu, E., Petersen, S. E., and munication in general, and specifically on the processes
Zatorre, R. J. (1996). REPLY: PET studies of phonological involved in both speaking and understanding.
processing: A critical reply to Peoppel. Brain and Language,
55, 352–379.
Ellis, A., Miller, D., and Sin, G. (1983). Wernicke’s aphasia Deficits in Speech Production
and normal language processing: A case study in cognitive A number of stages of processing underlie speech
neuropsychology. Cognition, 15, 111–144. production. The speaker must select a word candidate
Gagnon, D., Schwartz, M., Martin, N., Dell, G., and Sa¤ran,
E. (1997). The origins of formal paraphasias in aphasics’
from the lexicon and access its phonological form. Once
picture naming. Brain and Language, 59, 450–472. selected, the sound structure of the word or utterance
Garrett, M. (1982). Production of speech: Observations from must be planned—the phonological representation is
normal and pathological language use. In A. Ellis (Ed.), encoded in terms of the phonological properties of the
Normality and pathology in cognitive functions (pp. 19–76). sound segments, their order and context, and the proso-
London: Academic Press. dic structure of the word as a whole. The next stage of
Kohn, S. (1984). The nature of the phonological disorder in processing is articulatory implementation, in which the
conduction aphasia. Brain and Language, 23, 97–115. more abstract phonological representation is converted
Kohn, S. (1988). Phonological production deficits in aphasia. into a set of motor commands or motor programs for the
In H. Whitaker (Ed.), Phonological processes and brain phonetic realization of the utterance.
mechanisms (pp. 93–117). New York: Springer-Verlag.
Kohn, S., and Smith, K. (1994). Distinctions between two
There is some evidence to suggest that these stages of
phonological output disorders. Applied Psycholinguistics, production may be dissociated in the aphasias (Nadeau,
15, 75–95. 2001). However, the patterns suggest that the dissocia-
Kohn, S., Melvold, J., and Shipper, V. (1998). The preser- tions are not complete, and hence the production system
vation of sonority in the context of impaired lexical- appears to be neurally distributed in the left perisylvian
phonological output. Aphasiology, 12, 375–398. regions of the left hemisphere (Blumstein, 2000). In par-
Phonology and Adult Aphasia 367
ticular, patients with Broca’s aphasia and other patients tem, reflecting an inability to correctly encode or acti-
with anterior lesions appear to have predominantly vate the correct phonemic (i.e., phonetic feature)
articulatory implementation deficits and to a lesser ex- representation of the word. Evidence that the phonolog-
tent phonological selection and planning impairments. ical representations are intact comes from the variability
In contrast, those with Wernicke’s and conduction with which errors occur. An a¤ected individual may
aphasia and patients with posterior lesions appear to make one or a series of phonological errors on a target
have predominantly lexical selection and phonological word and also produce it correctly (Butterworth, 1992).
planning deficits and to a minor degree articulatory im- Despite these similarities, di¤erences have emerged
plementation impairments. in the patterns of production in naming and repetition
The clearest evidence for a dissociation between the tasks, suggesting a di¤erence in the locus of the under-
production stage of articulation implementation and lying impairment. Persons with conduction aphasia
the stages of phonological selection and planning comes produce many more phonological errors than persons
from investigations of the acoustic patterns of speech with Wernicke’s aphasia, and the patterns of production
productions. These studies show that persons with ante- in individuals with conduction aphasia more nearly ap-
rior aphasia, including those with Broca’s aphasia, have proximate the phonological representation of the target
a deficit in the articulatory implementation of speech. word than those of persons with Wernicke’s aphasia.
Articulatory timing and laryngeal control appear to be These results suggest that the basis of the deficit in
particularly a¤ected. The timing disorder emerges in Wernicke’s aphasia more likely resides in the processes
the production of those speech sounds requiring the co- involved in lexical selection, whereas the basis of the
ordination of two independent articulators, such as the deficit in conduction aphasia more likely lies in the pro-
production of voicing in stop consonants, as measured cesses involved in phonological planning (Kohn, 1993).
by voice-onset time (Blumstein et al., 1980), and the In sum, the evidence suggests that the neural system
complex timing relation between syllables (Kent and underlying speech production is a distributed neural
McNeil, 1987; Gandour et al., 1993). Deficits in laryn- network with functional subcomponents. Wernicke’s
geal control are evident in the production of sound area and the association cortices around it are implicated
segments as well as in the production of prosody. Indi- in the processes of lexical selection. The supramarginal
viduals with these deficits show lower and more variable gyrus and the white matter deep to it appear to be
amplitudes of glottal excitation during the production involved in phonological selection and planning. The
of voicing in fricative consonants (Code and Ball, 1982) motor areas including the frontal operculum, the pre-
as well as changes in the spectral characteristics asso- motor and motor regions posterior and superior to the
ciated with the production of stop consonants (Shinn frontal operculum, and the white matter below, includ-
and Blumstein, 1983). Prosodic disturbances are char- ing the basal ganglia and insula, appear to be involved in
acterized by a restricted fundamental frequency range the articulatory implementation of speech (Dronkers,
(Danly and Shapiro, 1982). Additionally, the produc- 1997; Damasio, 1998). Nonetheless, the evidence sug-
tion of tone in languages such as Thai and Chinese is gests that there is not a 1 : 1 relationship between these
a¤ected, although the global properties of tone, such neurological landmarks and the stages of output, since
as whether the tone is rising or falling, are maintained all stages of speech production appear to be a¤ected in
(Gandour et al., 1992). all of the patients, although to varying degrees.
Individuals with posterior aphasia, including those
with Wernicke’s and conduction aphasia, also appear to Deficits in Speech Perception
have a subtle articulatory implementation deficit that is
di¤erent from that of individuals with anterior aphasia. A number of stages of processing have been identified
Characteristics of this deficit include increased variabil- in auditory word reception. These stages include the
ity in a number of phonetic parameters, including vowel extraction of generalized auditory patterns from the
formant frequencies and vowel duration, and abnormal acoustic waveform, the conversion of this spectral
temporal patterns between syllables (Baum et al., 1990; representation to a more abstract phonetic feature/
Vijayan and Gandour, 1995). phonological representation, and the mapping of this
Deficits in selection and planning emerge across a phonological representation onto lexical form (i.e., a
wide spectrum of aphasia, including left hemisphere word in the lexicon) (Nadeau, 2001). Deficits at any one
anterior (Broca’s) and posterior (conduction and Wer- or all of these levels may potentially contribute to audi-
nicke’s) aphasia. Evidence in support of this is provided tory comprehension impairments.
by similar patterns of phonological errors produced by There is little evidence to suggest that aphasic patients
these patients in spontaneous speech (Blumstein, 1973; have deficits at the stage of extracting generalized audi-
Holloman and Drummond, 1991). Error types include tory patterns from the acoustic waveform (Polster and
phoneme substitution errors (e.g., ‘‘teams’’ ! [kimz]), Rose, 1998). However, they show impairments in pro-
simplification errors (e.g., ‘‘brown’’ ! [bawn]), addition cessing a number of auditory/phonetic parameters of
errors (e.g., ‘‘bet’’ ! [blEt]), and environment errors speech, including temporal cues such as voice-onset time,
(e.g., ‘‘degree’’ ! [gPdri] or ‘‘moon’’ ! [mum]). These a cue to the phonetic dimension of voicing, and spectral
utterances deviate phonologically from the target word cues such as formant transitions, cues to the phonetic
but are implemented correctly by the articulatory sys- dimension of place of articulation (Blumstein et al.,
1984; Shewan, Leeper, and Booth, 1984). Increasing the temporal lobe structures, is selectively involved in speech
duration of the formant transitions to allow more time receptive functions. Instead, the results are consistent
to process the rapid spectral changes associated with the with the view that phonetic/phonological processing
perception of place of articulation does not improve the has a distributed neural basis, one that involves the
performance of aphasic patients (Riedel and Studdert- perisylvian regions of the left hemisphere (Hickok and
Kenneday, 1985). In general, patients have great di‰- Poeppel, 2000; Burton, 2001). Although persons with
culty performing these tasks, particularly when the Wernicke’s aphasia make a large number of errors on
stimuli are synthetic speech stimuli as compared to nat- speech perception tasks, and although damage to the left
ural speech stimuli and when the task requires labeling supramarginal gyrus and the bordering parietal oper-
or naming the stimuli as compared to discriminating culum is correlated with speech perception deficits (Gow
them. Nonetheless, the patterns of performance suggest and Caplan, 1996), some persons with anterior aphasia
that individuals with aphasia can map the spectral rep- have shown poorer performance on these tasks than
resentations on to the phonetic features of language. those with Wernicke’s aphasia. Moreover, performance
Those who can perform the labeling and discrimination on speech perception tasks has failed to show a strong
tasks typically show categorical perception similar to correlation with severity of auditory language compre-
normal subjects. They perceive the stimuli as belonging hension. Thus, although phonetic and phonological
to discrete phonetic categories, and they show peaks in processing deficits may contribute to the auditory com-
discrimination at the phonetic boundaries between the prehension impairments of aphasics, they do not appear
phonetic categories. All of the deficits described above to be the primary cause of such impairments.
emerge across aphasic syndromes and do not correlate See also phonological analysis of language dis-
with the severity of auditory comprehension impairment orders in aphasia.
(Basso, Casati, and Vignolo, 1977).
In contrast to the di‰culty in perceiving acoustic —Sheila E. Blumstein
dimensions associated with voicing and place of articu-
lation, persons with aphasia are generally able to per- References
ceive those acoustic dimensions contributing to the
Basso, A., Casati, G., and Vignolo, L. A. (1977). Phonemic
perception of speech prosody, that is, intonation and identification defects in aphasia. Cortex, 13, 84–95.
stress. Even individuals with severe auditory compre- Baum, S. (1998). The role of fundamental frequency and du-
hension impairments are able to use intonation cues ration in the perception of linguistic stress by individuals
to recognize whether an utterance is a command, a yes- with brain damage. Journal of Speech and Hearing Re-
no question, an information question, or a statement search, 41, 31–40.
(Green and Boller, 1974). Nonetheless, some impair- Baum, S. R., Blumstein, S. E., Naeser, M. A., and Palumbo,
ments have emerged in the perception of more local C. L. (1990). Temporal dimensions of consonant and vowel
parameters of prosody, including word stress and production: An acoustic and CT scan analysis of aphasic
tone (Gandour and Dardarananda, 1983; Baum, 1998). speech. Brain and Language, 39, 33–56.
Blumstein, S. E. (1973). A phonological investigation of aphasic
However, no di¤erences have emerged in the perfor- speech. The Hague: Mouton.
mance of persons with anterior and posterior aphasia. Blumstein, S. E. (2000). Deficits of speech production and
Similar to studies of the acoustic-phonetic properties speech perception in aphasia. In R. Berndt (Ed.), Handbook
of speech, nearly all persons with aphasia show per- of neuropsychology (2nd ed., vol. 3, pp. 1–19). Amsterdam:
ceptual deficits in phonological processing (Blumstein, Elsevier.
Baker, and Goodglass, 1977; Jauhiainen and Nuutila, Blumstein, S. E., Baker, E., and Goodglass, H. (1977). Phono-
1977; Csepe et al., 2001). These deficits emerge in tasks logical factors in auditory comprehension in aphasia. Neu-
requiring subjects to discriminate words or nonsense ropsychologia, 15, 19–30.
syllables contrasting by one or more phonetic features Blumstein, S. E., Cooper, W. E., Goodglass, H., Statlender, S.,
(e.g., ‘‘dime’’—‘‘time’’ or ‘‘da’’—‘‘ta’’) or to point to an and Gottlieb, J. (1980). Production deficits in aphasia: A
voice-onset time analysis. Brain and Language, 9, 153–170.
auditorily presented target stimulus from a visual array Blumstein, S. E., Tartter, V. C., Nigro, G., and Statlender, S.
of objects or written nonsense syllables that are phono- (1984). Acoustic cues for the perception of place of articu-
logically similar. Individuals with aphasia have more lation in aphasia. Brain and Language, 22, 128–149.
di‰culty on labeling and pointing tasks than on dis- Burton, M. W. (2001). The role of the inferior frontal cortex in
crimination tasks. They also make more errors in the phonological processing. Cognitive Science, 25, 695–709.
perception of nonsense syllables than in the perception Butterworth, B. (1992). Disorders of phonological encoding.
of real words, although the patterns of performance are Cognition, 42, 261–286.
similar across these stimulus types. For all patients, the Code, C., and Ball, M. (1982). Fricative production in Broca’s
perception of consonants is worse than that of vowels; aphasia: A spectrographic analysis. Journal of Phonetics,
more errors occur when the stimuli contrast by a single 10, 325–331.
Csepe, V., Osman-Sagi, J., Molnar, M., and Gosy, M. (2001).
phonetic feature; and more errors occur in the percep- Impaired speech perception in aphasic patients: Event-
tion of place of articulation and voicing than in the per- related potential and neurophysiological assessment. Neu-
ception of other phonetic feature contrasts. ropsychologia, 39, 1194–1208.
The results of studies investigating the perception Damasio, H. (1998). Neuroanatomical correlates of the apha-
of speech challenge the view that the posterior left sias. In M. T. Sarno (Ed.), Acquired aphasia (pp. 43–70).
hemisphere, particularly Wernicke’s area and associated New York: Academic Press.
Poverty: E¤ects on Language 369
Danly, M., and Shapiro, B. (1982). Speech prosody in Broca’s Baum, S. R., Kelsch Danilo¤, J., and Danilo¤, R. (1982).
aphasia. Brain and Language, 16, 171–190. Sentence comprehension by Broca’s aphasics: E¤ects of
Dronkers, N. F. (1997). A new brain region for coordinating some suprasegmental variables. Brain and Language, 17,
speech articulation. Nature, 384, 159–161. 261–271.
Gandour, J., and Dardarananda, R. (1983). Identification of Blumstein, S. E., Cooper, W. E., Zurif, E. B., and Caramazza,
tonal contrasts in Thai aphasic patients. Brain and Lan- A. (1977). The perception and production of voice-onset
guage, 17, 24–33. time in aphasia. Neuropsychologia, 15, 371–383.
Gandour, J., Dechongkit, S., Ponglorpisit, S., Khunadorn, Boller, F. (1978). Comprehension disorders in aphasia: A his-
F., and Boongird, P. (1993). Intraword timing relations in torical overview. Brain and Language, 5, 149–165.
Thai after unilateral brain damage. Brain and Language, 45, Buckingham, H. W., Jr. (1998). Explanations for the concept
160–179. of apraxia of speech. In M. T. Sarno (Ed.), Acquired apha-
Gandour, J., Ponglorpisit, S., Khunadorn, F., Dechongkit, S., sia (pp. 269–308). New York: Academic Press.
Boongird, P., Boonklam, R., et al. (1992). Lexical tones in Dronkers, N., Pinker, S., and Damasio, A. (2000). Language
Thai after unilateral brain damage. Brain and Language, 43, and the aphasias. In E. Kandel, J. Schwartz, and T. Jessell
275–307. (Eds.), Principles of neural science (4th ed., pp. 1169–1187).
Gow, D. W., Jr., and Caplan, D. (1996). An examination of New York: McGraw-Hill.
impaired acoustic-phonetic processing in aphasia. Brain and Gandour, J. (1998). Phonetics and phonology. In B. Stemmer
Language, 52, 386–407. and H. A. Whitaker (Eds.), Handbook of neurolinguistics
Green, E., and Boller, F. (1974). Features of auditory compre- (pp. 208–221). New York: Academic Press.
hension in severely impaired aphasics. Cortex, 10, 133– Gandour, J., Dechongkit, S., Ponglorpisit, S., and Khunadorn,
145. F. (1994). Speech timing at the sentence level in Thai after
Hickok, G., and Poeppel, D. (2000). Towards a functional unilateral brain damage. Brain and Language, 46, 419–438.
neuroanatomy of speech perception. Trends in Cognitive Hickok, G. (2000). Speech perception, conduction aphasia, and
Science, 4, 131–138. the functional neuroanatomy of language. In Y. Grodzin-
Holloman, A. L., and Drummond, S. S. (1991). Perceptual and sky, L. Shapiro, and D. Swinney (Eds.), Language and the
acoustical analyses of phonemic paraphasias in nonfluent brain (pp. 87–104). New York: Academic Press.
and fluent dysphasia. Journal of Communication Disorders, Katz, W. (1988). Anticipatory coarticulation in aphasia:
24, 301–312. Acoustic and perceptual data. Brain and Language, 35,
Jauhiainen, T., and Nuutila, A. (1977). Auditory perception of 340–368.
speech and speech sounds in recent and recovered aphasia. Katz, W., Machetanz, J., Orth, U., and Schonle, P. (1990).
Brain and Language, 4, 572–579. A kinematic analysis of anticipatory coarticulation in the
Kent, R., and McNeil, M. (1987). Relative timing of sentence speech of anterior aphasic subjects using electromagnetic
repetition in apraxia of speech and conduction aphasia. articulography. Brain and Language, 38, 555–575.
In J. Ryalls (Ed.), Phonetic approaches to speech production Kent, R. D. (2000). Research on speech motor control and
in aphasia and related disorders (pp. 181–220). Boston: its disorders. Journal of Communication Disorders, 33,
College-Hill Press. 391–428.
Kohn, S. E. (1993). Segmental disorders in aphasia. In G. Kohn, S. E. (1989). The nature of the phonemic string deficit in
Blanken, J. Dittman, H. Grimm, J. C. Marshall, and C. W. conduction aphasia. Aphasiology, 3, 209–239.
Wallesch (Eds.), Linguistic disorders and pathologies: An Kohn, S. E. (Ed.). (1992). Conduction aphasia. Hillsdale, NJ:
international handbook (pp. 197–209). New York: Walter de Erlbaum.
Gruyter. Lieberman, P. (2000). Human language and our reptilian brain.
Nadeau, S. E. (2001). Phonology: A review and proposals from a Cambridge, MA: Harvard University Press.
connectionist perspective. Brain and Language, 79, 511–579. Miceli, G., Caltagirone, C., Gainotti, G., and Payer-Rigo, P.
Polster, M. R., and Rose, S. B. (1998). Disorders of auditory (1978). Discrimination of voice versus place contrasts in
processing: Evidence from modularity in audition. Cortex, aphasia. Brain and Language, 2, 434–450.
34, 47–65. Rosenbek, J. C., McNeil, M. R., and Aronson, A. E. (Eds.).
Riedel, K., and Studdert-Kennedy, M. (1985). Extending for- (1984). Apraxia of speech: Physiology, acoustics, linguistics,
mant transitions may not improve aphasics’ perception of management. San Diego, CA: College-Hill Press.
stop consonant place of articulation. Brain and Language, Ryalls, J. H. (Ed.). (1987). Phonetic approaches to speech pro-
24, 223–232. duction in aphasia and related disorders. Boston: College-
Shewan, C. M., Leeper, H., and Booth, J. (1984). An analysis Hill Press.
of voice onset time (VOT) in aphasic and normal subjects.
In J. Rosenbek, M. McNeill, and A. Aronson (Eds.),
Apraxia of speech (pp. 197–220). San Diego, CA: College-
Hill Press.
Shinn, P., and Blumstein, S. E. (1983). Phonetic disintegration
in aphasia: Acoustic analysis of spectral characteristics for
Poverty: E¤ects on Language
place of articulation. Brain and Language, 20, 90–114.
Vijayan, A., and Gandour, J. (1995). On the notion of a ‘subtle More than one in five children in the United States live
phonetic deficit’ in fluent/posterior aphasia. Brain and Lan-
in poverty, with pervasive consequences for their health
guage, 48, 106–119.
and development. These consequences include e¤ects
on language. Children who live in poverty develop lan-
Further Readings guage at a slower pace than more advantaged children,
Baum, S. R. (1992). The influence of word length on syllable and, after the age at which all children can be said to
duration in aphasia: Acoustic analysis. Aphasiology, 6, have acquired language, they di¤er from children from
501–513. higher income backgrounds in their language skills and
manner of language use. Children from low-income Grammatical development is also related to income
families are also overrepresented among children diag- or SES. Compared with higher SES children, children
nosed as language-impaired. from lower social strata produce shorter responses to
These relations between poverty and language are adult speech (McCarthy, 1930), score lower on stan-
robust, but they are complicated to describe and di‰cult dardized tests that include measures of grammatical
to interpret. Poverty itself is not a homogeneous condi- development (Morrisset et al., 1990; Dollaghan et al.,
tion but occurs on a gradient and is usually associated 1999), produce less complex utterances in spontaneous
with other variables that a¤ect language, particularly speech as toddlers (Arriaga et al., 1998) and at age 5
education and ethnicity. Thus, the source of poverty (Snow, 1999), and di¤er significantly on measures of
e¤ects is sometimes obscure. Further, the language chil- productive and receptive syntax at age 6 (Huttenlocher
dren display is always a combined function of their et al., 2002). As an indicator of the magnitude of these
language skill and language style. Thus, the nature e¤ects on grammatical development, the low-income
of poverty e¤ects is sometimes unclear. This article sample studied by Snow (1999) had an average MLU at
describes the observed associations between poverty and age 3 years 9 months that would be typical of children
language development, and then considers why those more than a year younger, according to norms based on
associations occur. Much of the relevant literature does a middle-class sample. At age 5 years 6 months they had
not use poverty per se as a variable but instead uses an average MLU typical of middle-class children age 3
correlated variables such as education or a composite years 1 month. On the other hand, the SES-related dif-
measure of socioeconomic status (SES). Thus, an im- ferences in productive syntax are not in whether children
portant task in studying poverty e¤ects is to identify can or cannot use complex structures in their speech, but
the causal factors at work when poverty, low levels in the frequency with which they do so (Tough, 1982;
of parental education, and other associated factors are Huttenlocher et al., in press). Studies of school-age chil-
correlated with low levels of language achievement in dren find SES-related di¤erences in the communicative
children. purposes to which language is put, such that children
with less educated parents less frequently use language to
Language Di¤erences Related to Income or SES. analyze and reflect, to reason and justify, or to predict
Before speech begins, infants who live in poverty pro- and consider alternative possibilities than children with
duce speech sounds and babble in much the same way more educated parents. The structural di¤erences in
and on the same developmental timetable as all normally children’s language associated with SES may be a by-
developing infants (Oller et al., 1995), and even at 3 product of these functional di¤erences (Tough, 1982).
years of age, SES is unrelated to the accuracy of chil- SES-related di¤erences in school-age children also
dren’s articulation (Dollaghan et al., 1999). In contrast, appear in the ability to communicate meaning through
virtually every other measure of language development language and to draw meaning from language, some-
reveals di¤erences between children from low-income times referred to as speaker and listener skills (Lloyd,
or low-SES families and children from higher income or Mann, and Peers, 1998). In the referential communica-
higher SES families. tion task, which requires children to describe one item in
The clearest and largest SES-related di¤erence is an array of objects so that a visually separated listener
in the area of vocabulary. Recordings of spontaneous with the same array can identify that item, lower SES
speech, maternal report measures, and standardized tests children are less able than higher SES children to pro-
all show that children from low-income and low-SES duce su‰ciently informative messages and to use infor-
families have smaller vocabularies than children from mation in messages addressed to them to make correct
higher income and higher SES families (Rescorla, 1989; choices (Lloyd et al., 1998). Children from lower socio-
Hart and Risley, 1995; Dollaghan et al., 1999). By the economic strata also perform less well than higher SES
age of 3 years, children from low-income families have children in solving mathematics word problems. This
productive vocabularies averaging around 500 words, poorer performance reflects a di¤erence in language
while children from higher income families have pro- ability, not mathematical ability, because the same chil-
ductive vocabularies averaging more than 1000 words dren show no di¤erence in performance in math calcu-
(Hart and Risley, 1995). Eighty percent of toddlers from lations (Jordan, Huttenlocher, and Levine, 1992).
low-income families score below the 50th percentile on The foregoing e¤ects are e¤ects of poverty or SES on
the MacArthur Communicative Development Inventory variation within the normal range. On average, children
(CDI) (Arriaga et al., 1998). Some findings suggest that from low-income families acquire language at a slower
low family income is more associated with measures of rate and demonstrate both di¤erences in language use
children’s productive vocabulary than with measures and poorer language skills than children from higher in-
of their receptive vocabulary (Snow, 1999). Two studies come families. Low SES is also a correlate of the diag-
have reported SES-related di¤erences in children’s nosis of specific language impairment (SLI) (Tomblin
vocabulary, with lower SES children showing larger et al., 1997), although it is not clear what this means,
vocabularies. Both used the CDI, and both attributed given that SLI is defined in terms of delay relative to
their findings to lower SES mothers’ tendencies to over- norms and that normative development is slower for
estimate their children’s abilities (Fenson et al., 1994; lower SES children (Fazio, Naremore, and Connell,
Feldman et al., 2000). 1996). For this reason, there have been calls for sensi-
Poverty: E¤ects on Language 371
tivity to SES e¤ects on standardized measures in diag- language-learning environments for children than more
nosing SLI (Arriaga et al., 1998; Feldman et al., 2000), a¿uent situations. Poverty is also associated with the
and e¤orts are underway to develop tests of language diagnosis of SLI, although most evidence suggests that
impairment that are direct tests of language learning the language environment is not the cause of SLI
ability rather than norm-referenced comparisons (Fazio, (Lederberg, 1980). In fact, studies of the heritability of
Naremore, and Connell, 1996; Campbell et al., 1997; language find a higher heritability for language impair-
Seymour, 2000). ment than for variation in language development within
the normal range (Eley et al., 1999; Dale et al., 2000).
Understanding the Relation Between Poverty and Lan- If input is the mediator of poverty e¤ects on language
guage Development. Low family income cannot di- but input does not explain SLI, then why is poverty
rectly cause the depressed language skills associated with associated with SLI? The inescapable conclusion is that
poverty but must operate via mediators that a¤ect lan- children can di¤er from the normative rate of devel-
guage. Potential mediators, or pathways, through which opment for two reasons: an impairment in the ability
poverty operates may include factors with general e¤ects to learn language or an inadequate language-learning
on health and development, such as nutrition, exposure environment. Low percentile scores on measures of lan-
to environmental hazards, and quality of schools and guage development do not, by themselves, distinguish
child care, and may also include factors with specific between these. While poverty does not cause language
e¤ects on language, such as the opportunity for one-to- impairment or a communication disorder, the impov-
one contact with an adult (McCartney, 1984) and the erished language-learning environment often associated
language use of parents and classroom teachers (Hut- with poverty can similarly impede language develop-
tenlocher et al., 2002). ment.
A variety of evidence suggests that an important me- See also specific language impairment in children.
diator of the relation between SES and language devel-
opment in children is the nature of the language-learning —Erika Ho¤
environment provided by the family (Ho¤, 2003) and
that low levels of parental education, more than low in- References
come per se, a¤ect the language-learning environment Arriaga, R. J., Fenson, L., Cronan, T., and Pethick, S. J.
in the home. Two predictors of language development, (1998). Scores on the MacArthur Communicative Develop-
the talk parents address to children and the exposure ment Inventory of children from low- and middle-income
to books that parents provide, di¤er as a function of families. Applied Psycholinguistics, 19, 209–223.
parental education. Compared with children of more Campbell, T., Dollaghan, D., Needleman, H., and Janosky, J.
educated parents, children of less educated parents hear (1997). Reducing bias in language assessment: Processing-
less speech, hear less richly informative speech, receive dependent measures. Journal of Speech, Language, and
less support for their own participation in conversa- Hearing Research, 40, 519–525.
Dale, P. S., Dionne, G., Eley, T. C., and Plomin, R. (2000).
tion, and are read to less (Hart and Risley, 1995; Ho¤- Lexical and grammatical development: A behavioral genetic
Ginsberg, 1998; U.S. Department of Education, 1998; perspective. Journal of Child Language, 27, 619–642.
Ho¤, Laursen, and Tardif, 2002). When mediators of Dollaghan, C. A., Campbell, T. F., Paradise, J. L., Feldman,
the relation between family SES and child language H. M., Janosky, J. E., Pitcairn, D. N., et al. (1999). Mater-
have been examined, properties of children’s language- nal education and measures of early speech and language.
learning environments have been found to account for Journal of Speech, Language, and Hearing Research, 42,
most of the variance attributable to SES both for syntax 1432–1443.
and for lexical development (Ho¤, 2003; Huttenlocher Eley, T. C., Bishop, D. V. M., Dale, P. S., Oliver, B., Petrill,
et al., 2002). In fact, even variation within low-income S. A., Price, T. S., et al. (1999). Genetic and environmental
samples is attributable to variation in language-learning origins of verbal and performance components of cogni-
tive delay in 2-year-olds. Developmental Psychology, 35,
environments. Among a group of 5-year-olds in Head 1122–1131.
Start programs (thus, children from low-income fami- Fazio, B. B., Naremore, R. C., and Connell, P. J. (1996).
lies), standard scores on a measure of comprehen- Tracking children from poverty at risk for specific language
sion vocabulary (the Peabody Picture Vocabulary Test) impairment: A 3-year longitudinal study. Journal of Speech
were significantly related to maternal use of sophisti- and Hearing Research, 39, 611–624.
cated vocabulary (i.e., low-frequency words) and to Feldman, H. M., Dollaghan, C. A., Campbell, T. F., Kurs-
the frequency of supportive mother-child interactions Lasky, M., Janosky, J. E., and Paradise, J. L. (2000). Mea-
(Weizman and Snow, 2001). In a di¤erent sample of 4- surement properties of the MacArthur Communicative
year-olds in Head Start, variation in productive and Development Inventory at ages one and two years. Child
comprehension vocabulary was accounted for by chil- Development, 71, 310–322.
Fenson, L., Dale, P. A., Reznick, J. S., Bates, E., Thal, D. J.,
dren’s literacy experiences at home (Payne, Whitehurst, and Pethick, S. J. (1994). Variability in early communicative
and Angell, 1994). development. Monographs of the Society for Research in
With respect to understanding the role of poverty in Child Development, 59 (Serial No. 242).
communication disorders, the literature presents a para- Hart, B., and Risley, T. (1995). Meaningful di¤erences in the
dox. Poverty is associated with slow language develop- everyday experience of young American children. Baltimore:
ment because poverty is associated with less supportive Paul H. Brookes.
Ho¤, E. (2003). Causes and consequences of SES-related Further Reading

di¤erences in parent-to-child speech. In M. H. Bornstein
(Ed.), Socioeconomic status, parenting, and child develop- Heath, S. B. (1983). Ways with words. Cambridge, U.K.:
ment. Mahwah, NJ: Erlbaum. Cambridge University Press.
Ho¤, E., Laursen, B., and Tardif, T. (2002). Socioeconomic
status and parenting. In M. H. Bornstein (Ed.), Handbook
of parenting (2nd ed.). Mahwah, NJ: Erlbaum.
Ho¤-Ginsberg, E. (1998). The relation of birth order and Pragmatics
socioeconomic status to children’s language experience
and language development. Applied Psycholinguistics, 19,
603–629. Pragmatics may be defined as ‘‘the study of the rules
Huttenlocher, J., Vasilyeva, M., Cymerman, E., and Levine, S. governing the use of language in social contexts’’
(2002). Language input at home and at school: Relation to (McTear and Conti-Ramsden, 1992, p. 19). Although
child syntax. Cognitive Psychology, 45, 337–374. there is some debate as to what should be included under
Jordan, N., Huttenlocher, J., and Levine, S. (1992). Di¤eren- the heading of pragmatics, traditionally it has been
tial calculation abilities in young children from middle-
thought to incorporate behaviors such as communica-
and low-income families. Developmental Psychology, 28,
644–653. tive intent (speech acts), conversational management
Lederberg, A. (1980). The language environment of children (turn taking, topic manipulation, etc.), presuppositional
with language delays. Journal of Pediatric Psychology, 5, knowledge, and culturally determined rules for linguistic
141–158. politeness. Some authors, working from a framework in
Lloyd, P., Mann, S., and Peers, I. (1998). The growth of which pragmatics is seen as the motivating force behind
speaker and listener skills from five to eleven years. First other components of language such as syntax and se-
Language, 18, 81–104. mantics, include an expanded list of behaviors within
McCarthy, D. (1930). The language development of the pre- this domain. For example, from this latter perspective,
school child. Minneapolis: University of Minnesota Press. behaviors such as those occurring in the interactive con-
McCartney, K. (1984). E¤ect of quality of day care environ-
ment on children’s language development. Developmental
text of early language acquisition would be considered
Psychology, 20, 244–260. pragmatic. Despite di‰culty in determining where the
Morrisset, C., Barnard, K., Greenberg, M., Booth, D., and boundaries of pragmatics should be drawn, as one con-
Spieker, S. (1990). Environmental influences on early lan- siders how language is used in real interactions with
guage development: The context of social risk. Development other people, it is impossible not to cross over into areas
and Psychopathology, 2, 127–149. more typically seen as social or cultural rather than lin-
Oller, D. K., Eilers, R. E., Basinger, D., Ste¤ens, M. L., and guistic (Ninio and Snow, 1996).
Urbano, R. (1995). Extreme poverty and the development Although the social implications of impaired com-
of precursors to the speech capacity. First Language, 15, munication skills have been considered for some time,
167–188. the pragmatic aspects of language impairment did not
Payne, A. C., Whitehurst, G. J., Angell, A. L. (1994). The role
of home literacy environment in the development of lan-
become a serious topic of study until the mid-1970s, fol-
guage ability in preschool children from low-income fami- lowing the lead of researchers studying typical language
lies. Early Childhood Research Quarterly, 9, 427–440. acquisition (see Leonard, 1998). The innovations for
Rescorla, L. (1989). The Language Development Survey: A language assessment and intervention that stemmed
screening tool for delayed language in toddlers. Journal of from this work motivated Duchan (1984) to characterize
Speech and Hearing Disorders, 54, 587–599. these e¤orts as ‘‘the pragmatics revolution.’’ Gallagher
Seymour, H. N. (2000). The development of a dialect sensitive (1991) summarized the contributions of pragmatics to
language test. Paper presented at the Symposium on Re- assessment by noting that greater awareness of the
search in Child Language Disorders, Madison, WI, June 3. pragmatic aspects of language resulted in a larger set of
Snow, C. E. (1999). Social perspectives on the emergence of behaviors on which a diagnosis of language impairment
language. In B. MacWhinney (Ed.), The emergence of lan-
guage (pp. 257–276). Mahwah, NJ: Erlbaum.
could be made. It also highlighted the importance of
Tomblin, J. B., Records, N. L., Buckwalter, P., Zhang, X., contextual variables in spontaneous language produc-
Smith, E., and O’Brien, M. (1997). Prevalence of specific tion. Clinicians gained an understanding that controlling
language impairment in kindergarten children. Journal of or standardizing these variables would fundamentally
Speech, Language, and Hearing Research, 40, 1245–1260. alter the nature of the interaction.
Tough, J. (1982). Language, poverty, and disadvantage in With respect to language intervention, goals were
school. In L. Feagans and D. C. Farran (Eds.), The lan- expanded to include a wide range of pragmatic behav-
guage of children reared in poverty (pp. 3–19). New York: iors. Further, providing intervention in more natural-
Academic Press. istic contexts, thereby allowing communication to be
U.S. Department of Education, National Center for Education motivated and reinforced by natural consequences, was
Statistics. (1998). Estimates of ‘‘read to every day’’ as pub-
lished in Federal Interagency Forum on Child and Family
highlighted. At the same time the value of using rou-
Statistics, Table ED1. Washington, DC: Author. tines, scripts, and similar procedures to provide greater
Weizman, Z. O., and Snow, C. E. (2001). Lexical input as contextual support for language usage also gained favor
related to children’s vocabulary acquisition: E¤ects of so- among clinicians (Gallagher, 1991).
phisticated exposure and support for meaning. Develop- The study of pragmatics also brought insights into
mental Psychology, 37, 265–279. how communication might be linked to other aspects of
Pragmatics 373
behavior. A prime example, stemming from work with situation is less clear for children with SLI. It is well
persons with pervasive developmental disabilities, was established that children in the latter group have di‰-
the insight that challenging behavior may have commu- culty with aspects of syntactic, morphological, and
nicative intent. Further, providing the individual with a semantic development. Studies examining pragmatic
more appropriate means of communicating the same in- behaviors have yielded more equivocal findings. For
tent would often result in a notable decrease in the un- example, children with SLI have been found to be less
desirable behavior (Carr and Durrand, 1985). capable of responding to stacked requests for clarifica-
Additionally, much of the recent work focusing on tion (Brinton, Fujiki, and Sonnenberg, 1988), less able
the social skills and peer relationships of children with to initiate utterances in conversation (Conti-Ramsden
language impairment (e.g., Hadley and Rice, 1991; and Friel-Patti, 1983), and less adept at entering ongoing
Craig and Washington, 1993; Brinton et al., 1997) is a conversations (Craig and Washington, 1993) than typi-
natural extension of earlier work studying the interac- cally developing peers at the same language level. Other
tional skills of these children. As Gallagher (1999) noted, researchers, however, have found that children with SLI
‘‘it was inevitable that the pragmatic language focus performed similarly to typically developing peers on
on communication would eventually lead to questions pragmatic variables when language level was controlled
about the interpersonal and intrapersonal roles of lan- (e.g., Fey and Leonard, 1984; Leonard, 1986). In sum-
guage’’ (p. vi). marizing this work, it appears that many (but not all)
Despite the positive contributions to assessment and children with SLI have di‰culty with many (but not
intervention procedures that have resulted from the all) aspects of pragmatic language behavior. For some
study of pragmatics, a clear sense of the role of prag- of these children, pragmatic deficits stem from problems
matic behaviors in language impairment has been di‰- with language form and content. For other children,
cult to achieve. Research with some groups of children pragmatic impairment is a central component of their
with language impairment has documented the presence language di‰culty.
of serious pragmatic problems. In other groups of chil- One way in which researchers have addressed the
dren the nature of pragmatic di‰culties has been more variability noted above has been to place children with
challenging to characterize. This variability can be seen SLI into subgroups more specifically characterizing the
by contrasting two groups for which language problems nature of their impairment. Several groups of researchers
play a major role: children with autism spectrum dis- (e.g., Bishop and Rosenbloom, 1987; Conti-Ramsden,
orders (ASD) and children with specific language im- Crutchley, and Botting, 1997) have developed taxono-
pairment (SLI). mies identifying a group of children whose language
Children with ASD have communication deficits that impairments are pragmatic in nature. Labeled as ‘‘se-
may be grouped within two categories: (1) the capacity mantic pragmatic deficit syndrome,’’ these children are
for joint attention to objects and events with other per- described as having a variety of pragmatic problems
sons, and (2) the ability to understand the symbolic in the face of relatively typical structural language skills.
function of language (Wetherby, Prizant, and Schuler, Areas of deficit include inappropriate topic manipula-
2000). Pragmatic deficits figure prominently within both tion, di‰culty assessing shared information, an overly
categories. For example, with respect to joint attention, high level of verbosity, and a lack of responsiveness
children with ASD produce a limited range of commu- to questions. Word-finding problems and di‰culty com-
nicative intentions. They may communicate to direct the prehending language are also associated with this sub-
behavior of others but not for purposes requiring joint group. In work by Conti-Ramsden, Crutchley, and
attention with another person, such as to share feelings Botting (1997), semantic pragmatic deficit syndrome
or experiences. Children with ASD also have di‰culty characterized approximately 10% of 242 participants
interpreting and responding to the emotional states of with SLI. In follow-up work, Conti-Ramsden and Bot-
others. This may be reflected in a lack of responsive- ting (1999) found that although the subcategories of im-
ness to positive a¤ect as well as in behaviors such as pairment were stable over the course of a year, many
the failure to appropriately coordinate eye gaze during individual children moved between subcategories.
interaction. It might be argued that children with pragmatic
Problematic behaviors stemming from di‰culty with problems in the face of relatively good structural lan-
symbol use include the often cited reliance on reenact- guage (and who do not meet criteria for ASD) might
ment strategies (e.g., a preschool child who used the form a separate category of impairment. Bishop (2000)
phrase ‘‘do ah’’ to mean that he was not feeling well, argued that this is not an ideal solution because prag-
stemming from appropriate use in a prior context), as matic impairment is not limited to children with good
well as developing maladaptive ways of communicating structural skills, nor is it always found in association
to compensate for a lack of more conventional means with semantic di‰culties. Rather, it may be more
(e.g., using head banging to communicate the desire productive to view structurally based SLI and ASD as
to avoid an unpleasant task) (Wetherby, Prizant, and two ends of a continuum on which many combina-
Schuler, 2000). Both of these examples of language use tions of pragmatic and structural language deficits may
have important pragmatic implications. occur.
Whereas it is accepted that pragmatic di‰culties In summary, pragmatic impairment may be found in
constitute a basic problem for children with ASD, the a wide range of children with language problems. In
some cases, it may constitute a central component of Gallagher, T. M. (1991). A retrospective look at clinical prag-
the impairment. For other children, the problem may be matics. In T. M. Gallagher (Ed.), Pragmatics of language:
secondary to other types of language problems. From Clinical practice issues (pp. 1–9). San Diego, CA: Singular
a clinical standpoint, it is important to recognize that Publishing Group.
Gallagher, T. M. (1999). Foreword (issue on children’s lan-
children with a variety of disabilities may have prag-
guage, behavior, and emotional problems). Topics in Lan-
matic problems. Given the close link between language guage Disorders, 19(2), vi–vii.
and social behavior, it is perhaps as important to recog- Hadley, P. A., and Rice, M. L. (1991). Conversational
nize that even language impairments that do not involve responsiveness of speech- and language-impaired pre-
pragmatic behaviors are likely to have implications for schoolers. Journal of Speech and Hearing Research, 34,
social interaction. To be most productive, interventions 1308–1317.
should be structured not only to improve specific lan- Leonard, L. B. (1986). Conversational replies of children with
guage skills but also to facilitate the use of language in specific language impairment. Journal of Speech and Hear-
interactions to improve relationships with peers and ing Research, 29, 114–119.
adults in the child’s social world. Leonard, L. B. (1998). Children with specific language impair-
ment. Cambridge, MA: MIT Press.
—Martin Fujiki and Bonnie Brinton McTear, M., and Conti-Ramsden, G. (1992). Pragmatic disabil-
ity in children. San Diego, CA: Singular Publishing Group.
References Ninio, A., and Snow, C. E. (1996). Pragmatic development.
Boulder, CO: Westview Press.
Bishop, D. V. M. (2000). Pragmatic language impairment: A Wetherby, A. M., Prizant, B. M., and Schuler, A. L. (2000).
correlate of SLI, a distinct subgroup, or part of the autistic Understanding the nature of communication and language
continuum? In D. V. M. Bishop and L. B. Leonard (Eds.), impairments. In A. M. Wetherby and B. M. Prizant (Eds.),
Speech and language impairments in children: Causes, char- Autism spectrum disorders: A transactional developmental
acteristics, intervention and outcome (pp. 99–113). Philadel- perspective (pp. 109–141). Baltimore: Paul H. Brookes.
phia: Taylor and Francis.
Bishop, D. V. M., and Rosenbloom, L. (1987). Classification Further Readings
of childhood language disorders. In W. Yule and M. Rutter
(Eds.), Language development and disorders: Clinics in Bishop, D. V. M., and Adams, C. (1989). Conversational
developmental medicine. London: MacKeith Press. characteristics of children with semantic-pragmatic dis-
Brinton, B., Fujiki, M., and Sonnenberg, E. A. (1988). order: II. What features lead to a judgement of inappro-
Responses to requests for clarification by linguistically nor- priacy? British Journal of Disorders of Communication, 24,
mal and language-impaired children in conversation. Jour- 241–263.
nal of Speech and Hearing Disorders, 53, 383–391. Bishop, D. V. M., and Adams, C. (1992). Comprehension
Brinton, B., Fujiki, M., Spencer, J. C., and Robinson, L. A. problems in children with specific language impairment:
(1997). The ability of children with specific language im- Literal and inferential meaning. Journal of Speech and
pairment to access and participate in an ongoing interac- Hearing Research, 35, 119–129.
tion. Journal of Speech, Language, and Hearing Research, Bishop, D. V. M., Chan, J., Hartley, J., Adams, C., and Weir,
40, 1011–1025. F. (2000). Conversational responsiveness in specific lan-
Carr, E. G., and Durrand, V. M. (1985). The social- guage impairment: Evidence of disproportionate pragmatic
communicative basis of severe behavior problems in chil- di‰culties in a subset of children. Development and Psycho-
dren. In S. Reiss and R. R. Bootzin (Eds.), Theoretical pathology, 12, 177–199.
issues in behavior therapy (pp. 219–254). New York: Aca- Blank, M., Gessner, M., and Esposito, A. (1979). Language
demic Press. without communication: A case study. Journal of Child
Conti-Ramsden, G., and Botting, N. (1999). Classification of Language, 6, 329–352.
children with specific language impairment: Longitudinal Brinton, B., and Fujiki, M. (1982). A comparison of request-
considerations. Journal of Speech, Language, and Hearing response sequences in the discourse of normal and
Research, 42, 1195–1204. language-disordered children. Journal of Speech and Hear-
Conti-Ramsden, G., Crutchley, A., and Botting, N. (1997). ing Disorders, 47, 57–62.
The extent to which psychometric tests di¤erentiate sub- Brinton, B., and Fujiki, M. (1989). Conversational management
groups of children with SLI. Journal of Speech, Language, with language-impaired children. Rockville, MD: Aspen.
and Hearing Research, 40, 765–777. Brinton, B., and Fujiki, M. (1994). Ways to teach conversa-
Conti-Ramsden, G., and Friel-Patti, S. (1983). Mothers’ tion. In J. Duchan, L. Hewitt, and R. Sonnenmeier (Eds.),
discourse adjustments to language-impaired and non- Pragmatics: From theory to practice (pp. 59–71). Engle-
language-impaired children. Journal of Speech and Hearing wood Cli¤s, NJ: Prentice Hall.
Disorders, 48, 360–367. Brinton, B., and Fujiki, M. (1995). E‰cacy of conversational
Craig, H. K., and Washington, J. A. (1993). The access intervention with children with language impairment. In M.
behaviors of children with specific language impairment. E. Fey, J. Windsor, and S. Warren (Eds.), Communication
Journal of Speech and Hearing Research, 36, 322–336. intervention for school-age children (pp. 183–212). Balti-
Duchan, J. (1984). Language assessment: The pragmatics more: Paul H. Brookes.
revolution. In R. Naremore (Ed.), Language science (pp. Brinton, B., Fujiki, M., and Powell, J. (1997). The ability of
147–180). San Diego, CA: College-Hill Press. children with language impairment to manipulate topic in a
Fey, M. E., and Leonard, L. (1984). Partner age as a variable structured task. Language, Speech, and Hearing Services in
in the conversational performance of specifically language- Schools, 28, 3–22.
impaired and normal-language children. Journal of Speech Brinton, B., Fujiki, M., Winkler, E., and Loeb, D. (1986).
and Hearing Research, 27, 413–423. Responses to requests for clarification in linguistically nor-
Prelinguistic Communication Intervention for Children with Developmental Disabilities 375
mal and language-impaired children. Journal of Speech and Westby, C. (1999). Assessment of pragmatic competence in
Hearing Disorders, 51, 370–378. children with psychiatric disorders. In D. Rogers-Adkinson
Conti-Ramsden, G., and Gunn, M. (1986). The development and P. Gri‰th (Eds.), Communication disorders and children
of conversational disability: A case study. British Journal of with psychiatric and behavioral disorders (pp. 177–258). San
Disorders of Communication, 21, 339–351. Diego, CA: Singular Publishing Group.
Craig, H. K. (1991). Pragmatic characteristics of the child with Windsor, J. (1995). Language impairment and social compe-
specific language impairment: An interactionist perspective. tence. In M. E. Fey, J. Windsor, and S. F. Warren (Eds.),
In T. M. Gallagher (Ed.), Pragmatics of language: Clinical Language intervention: Preschool through the elementary
practice issues (pp. 163–198). San Diego, CA: Singular years (pp. 213–238). Baltimore: Paul H. Brookes.
Publishing Group.
Craig, H. K., and Evans, J. (1989). Turn exchange character-
istics of SLI childrens’ simultaneous and non-simultaneous
speech. Journal of Speech and Hearing Disorders, 54, 334–
347. Prelinguistic Communication
Craig, H. K., and Evans, J. L. (1993). Pragmatics and SLI: Intervention for Children with
Within-group variations in discourse behaviors. Journal of
Speech and Hearing Research, 36, 777–789. Developmental Disabilities
Craig, H. K., and Gallagher, T. (1986). Interactive play: The
frequency of related verbal responses. Journal of Speech and
Hearing Research, 29, 375–383. The onset of intentional communication late in the first
Duchan, J. F. (1995). Supporting language learning in everyday year of life marks an infant’s active entry into his or her
life. San Diego, CA: Singular Publishing Group. culture and ignites important changes in how others re-
Duchan, J. F., Hewitt, L. E., and Sonnenmeier, R. M. (Eds.). gard and respond to the infant. A significant delay in the
(1994). Pragmatics: From theory to practice. Englewood onset of intentional communication is a strong indica-
Cli¤s, NJ: Prentice Hall. tor that the onset of productive language also will be
Fujiki, M., and Brinton, B. (1991). The verbal noncom- delayed (McCathren, Warren, and Yoder, 1996; Calan-
municator: A case study. Language, Speech and Hearing
drella and Wilcox, 2000). Such a delay may hold the in-
Services in Schools, 22, 322–333.
Gallagher, T. M. (1991). Pragmatics of language: Clinical fant in a kind of developmental limbo because the onset
practice issues. San Diego, CA: Singular Publishing of intentional communication triggers a series of trans-
Group. actional processes that support the emergence of pro-
Gallagher, T. M. (1999). Interrelationships among children’s ductive language just a few months later. In this article
language, behavior, and emotional problems. Topics in we discuss the research on the e¤ects of prelinguistic
Language Disorders, 19(2), 1–15. communication interventions aimed at teaching infants
Gallagher, T. M., and Prutting, C. A. (1983). Pragmatic as- and toddlers with developmental delays to be clear, fre-
sessment and intervention issues in language. San Diego, CA: quent prelinguistic communicators.
College-Hill Press. The onset of coordinated attention occupies a ‘‘piv-
McTear, M. F. (1985a). Pragmatic disorders: A case study
of conversational disability. British Journal of Disorders of
otal’’ juncture in prelinguistic communication develop-
Communication, 20, 129–142. ment. Before the emergence of coordinated attention,
McTear, M. F. (1985b). Pragmatic disorders: A question of an infant’s intention is very di‰cult to discern (Bates,
direction. British Journal of Disorders of Communication, Benigni, Bretherton et al., 1979). Almost simultaneously
20, 119–127. with the emergence of coordinated attention, the child
Mentis, M. (1994). Topic management in discourse: Assess- begins to move from preintentional to intentional com-
ment and intervention. Topics in Language Disorders, 14(3), munication. Requesting and commenting episodes pro-
29–54. vide the earliest contexts in which intentionality is
Nippold, M. A. (1994). Persuasive talk in social contexts: De- demonstrated (Bates, O’Connell, and Shore, 1987). Both
velopment, assessment, and intervention. Topics in Lan- functions require the infant to shift his or her attention
guage Disorders, 14(3), 1–12.
Prutting, C. A. (1982). Pragmatics as social competence. Jour-
between his or her partner and an object. Requesting
nal of Speech and Hearing Disorders, 47, 123–134. (also termed imperatives and protoimperatives in the
Prutting, C. A., and Kirchner, D. M. (1987). A clinical literature) is commonly defined as behavior that clearly
appraisal of the pragmatic aspects of language. Journal of indicates that the child wants something. Commenting
Speech and Hearing Disorders, 52, 105–119. (also termed joint attention, indicating, declarative,
Rapin, I. (1996). Developmental language disorders: A clinical and referencing in the literature) is the act of drawing
update. Journal of Child Psychology and Psychiatry and another’s attention to or showing a positive a¤ect about
Allied Disciplines, 37, 643–655. an object or interest (Bates, Benigni, Bretherton et al.,
Rapin, I., and Allen, D. (1983). Developmental language dis- 1987). Although other communicative functions also
orders: Nosologic considerations. In U. Kirk (Ed.), Neuro- emerge during this period (e.g., greeting, protesting),
psychology of language, reading and spelling (pp. 155–184).
New York: Academic Press.
requesting and commenting are considered the fun-
Rollins, P. R., Pan, B. A., Conti-Ramsden, G., and Snow, C. damental pragmatic building blocks of both prelin-
E. (1994). Communicative skills in children with specific guistic and linguistic communication (Bruner, Roy, and
language impairments: A comparison with their language- Ratner, 1980). They are also the two most frequent
matched siblings. Journal of Communication Disorders, 27, functions expressed during the prelinguistic period
189–206. (Wetherby, Cain, Yonclas et al., 1988).
The Transactional Model of Development and E¤ects of Prelinguistic Communication

Intervention Intervention
The e¤ectiveness of prelinguistic intervention in en- In their initial explorations of the e¤ects of prelinguistic
hancing later language development depends on the communication intervention, Yoder and Warren dem-
operation of a transactional model of social communi- onstrated that increases in the frequency and clarity of
cation development (Samero¤ and Chandler, 1975; prelinguistic requesting by children with developmental
McLean and Snyder-McLean, 1978). The model pre- delays as a result of the intervention covaried with sub-
sumes that early social and communication development stantial increases in linguistic mapping by teachers and
is facilitated by bidirectional, reciprocal interactions be- parents who were naive as to the specific techniques and
tween children and their environment. For example, a goals of the intervention (Warren, Yoder, Gazdag et al.,
change in the child such as the onset of intentional 1993; Yoder, Warren, Kim et al., 1994). These studies
communication may trigger a change in the social envi- also demonstrated strong generalization e¤ects in that
ronment, such as increased linguistic mapping (i.e., the intentional requesting function they taught was
naming objects and actions that the child is attending to) shown to generalize across people, setting, communica-
by their caregivers. These changes then support further tion styles, and time. Based on the promising results
development in the child (e.g., increased vocabulary), of these initial studies, Yoder and Warren (1998,
and subsequently further changes by the caregivers 1999a, 1999b, 2001a, 2001b) conducted a relatively large
(e.g., more complex language interaction with the child). (N ¼ 58) longitudinal experimental study of the e¤ects
In this way, both the child and the environment change of prelinguistic communication intervention on the
over time and a¤ect each other in reciprocal fashion communication and language development of children
as early achievements pave the way for subsequent with general delays in development. This study repre-
development. sented an experimental analysis of the transactional
A transactional model may be particularly well suited model of early social communication development.
to understanding social-communication development in Fifty-eight children between the ages of 17 and 32
young children because caregiver–child interaction can months (mean, 23; SD, 4) with developmental delays
play such an important role in this process. The period and their primary parent participated in this study.
of early development (age birth to 3 years) may repre- Fifty-two of these children had no productive words at
sent a unique time during which transactional e¤ects the outset of the study; the remaining six children had
can have a substantial impact on development. Young between one and five productive words. All children
children’s relatively restricted repertoire during this early scored below the l0th percentile on the expressive scale
period can make changes in their behavior more salient of the Communication Development Inventory (Fenson
and easily observable to caregivers. This in turn may al- et al., 1991).
low adults to be more specifically contingent with their The children were randomly assigned to one of two
responses to developing skills of the child than is possible treatment groups. Twenty-eight of the children received
later in development, when children’s behavioral reper- an intervention termed ‘‘prelinguistic milieu teaching’’
toire is far more expansive and complex. During this (PMT) for 20 minutes per day, 3 or 4 days per week, for
natural window of opportunity, the transactional model 6 months. The other 30 children received an intervention
may be employed by a clever practitioner to multiply the termed ‘‘responsive small group’’ (RSG). PMT repre-
e¤ects of relatively circumscribed interventions and per- sented an adaptation of milieu language teaching (e.g.,
haps alter the very course of the child’s development in a Warren and Bambara, 1989; Warren, 1992). RSG rep-
significant way. But the actions of the practitioner may resented an adaptation of the responsive interaction
need to be swift and intense, or they may be muted by approach (Wilcox, 1992; Wilcox and Shannon, 1998).
the child’s steadily accumulating history. These interventions are described in detail elsewhere
The generation of strong transactional e¤ects in (e.g., Warren and Yoder, 1998; Yoder and Warren,
which the growth of emotional, social, and communica- 1998). Caretakers were kept naive as to the specific
tion skills is sca¤olded by caregivers can have a multi- methods, measures, records of child progress, and child
plier e¤ect in which a relatively small dose of early goals throughout the study. This allowed Yoder and
intervention may lead to long-term e¤ects. These e¤ects Warren to investigate how change in the children’s be-
are necessary when we consider that a relatively ‘‘inten- havior as a result of the interventions might a¤ect the
sive’’ early intervention by a skilled clinician may repre- behavior of the primary caretaker, and how this in turn
sent only a few hours a week of a young child’s potential might a¤ect the child’s development later in time. Data
learning time (e.g., 5 hours per week of intensive inter- were collected at five points in time for each dyad: at
action would represent just 5 percent of the child’s pretreatment, at post-treatment, and 6, 12, and 18
available social and communication skill learning time months after the completion of the intervention.
if we assume the child is awake and learning 100 hours Both interventions had generalized e¤ects on in-
per week). Thus, unless direct intervention accounts for tentional communication development. However, the
a large portion of a child’s waking hours, transactional treatment that was most e¤ective depended on the pre-
e¤ects are mandatory for early intervention e¤orts to treatment maternal interaction style and the education
achieve their potential. level of the mother (Yoder and Warren, 1998, 2001b).
Prelinguistic Communication Intervention for Children with Developmental Disabilities 377
Specifically, Yoder and Warren found that for children cation will elicit mother’s linguistic mapping, which in
of highly responsive, relatively well-educated mothers, turn will facilitate the child’s vocabulary development.
PMT was e¤ective in fostering intentional communica-
tion development. However, for children with relatively Conclusion
unresponsive mothers, RSG was relatively more suc-
Prelinguistic communication intervention represents a
cessful in fostering generalized intentional communica-
promising approach for young children with develop-
tion development.
The two interventions di¤er along a few important mental delays. Research on this approach has been quite
limited to date. However, it is clear that the e¤ectiveness
dimensions that provide a plausible explanation for these
of specific interventions may be dependent to some
e¤ects. PMT uses a child-centered play context in which
communication prompts for more advanced forms of extent on mediating e¤ects of caretaker responsivity.
Therefore, the combining specific child-centered tech-
communication are employed, as well as social con-
niques such as PMT with parent training aimed at
sequences for target responses such as specific acknowl-
enhancing caretaker responsivity may be the most e‰-
edgment and compliance. RSG emphasizes following the
cacious approach.
child’s attentional lead and being highly responsive to
See also communication disorders in infants and
child initiation while avoiding the use of direct prompts
toddlers.
for communication. Maternal interaction style may have
influenced which intervention was most beneficial, be- —Steven F. Warren and Paul J. Yoder
cause children may develop expectations concerning
interactions with adults (including teachers and inter- References
ventionists) based on their history of interaction with
their primary caretaker(s). Thus, children with respon- Bates, E., Benigni, L., Bretherton, I., Camaioni, L., and Vol-
sive parents may learn to persist in the face of com- terra, V. (1979). The emergence of symbols, cognition and
munication breakdowns, such as might be occasioned communication in infancy. New York: Academic Press.
Bates, E., O’Connell, B., and Shore, C. (1987). Language and
by a direct prompt or time delay, because their history
communication in infancy. In J. Osofsky (Ed.), Handbook
leads them to believe that their communication attempts of infant development (pp. 149–203). New York: Wiley.
will usually be successful. On the other hand, children Bruner, J., Roy, C., and Ratner, R. (1980). The beginnings of
without this history may cease communicating when request. In K. E. Nelson (Ed.), Children’s language (Vol. 3,
their initial attempt fails. Thus, children of responsive pp. 91–138). New York: Gardner Press.
mothers in the PMT group persisted when prompted and Calandrella, A. M., and Wilcox, M. J. (2000). Predicting lan-
thus learned e¤ectively in this context, while children guage outcome for young prelinguistic children with devel-
with unresponsive parents did not. But when provided opmental delay. Journal of Speech, Language, and Hearing
with a highly responsive adult who virtually never Research, 43, 1061–1071.
prompted them over a 6-month period, children of Farren, D. C. (2000). Another decade of intervention for chil-
dren who are low income or disabled: What do we know
unresponsive mothers showed greater gains than chil- now? In J. Shonko¤ and S. Meisels (Eds.), Handbook of
dren of responsive parents receiving the same treatment. early intervention (2nd ed., pp. 510–548). Cambridge, U.K.:
The e¤ects of maternal responsivity as a mediator and Cambridge University Press.
moderator of intervention e¤ects rippled throughout the Fenson, L., Dale, P. S., Reznick, S., Thal, D., Bates, E., Har-
longitudinal follow-up period. Yoder and Warren dem- tung, J. P., Pethnick, S., and Reilly, J. S. (1991). Technical
onstrated that children in the PMT group with relatively manual for the MacArthur Communicative Development
responsive mothers received increased amounts of re- Inventories. San Diego, CA: San Diego State University.
sponsive input in direct response to their increased in- McCathren, R. B., Warren, S. F., and Yoder, P. J. (1996).
tentional communication (Yoder and Warren, 2001b). Prelinguistic predictors of later language development. In
Furthermore, the e¤ects of the intervention were found K. N. Cole, P. S. Dale, and D. J. Thal (Eds.), Communica-
tion and language intervention series: Vol. 6. Advances in
on both protoimperatives and protodeclaratives (Yoder assessment of communication and language (pp. 57–76).
and Warren, 1999a), became greater with time, and im- Baltimore: Paul H. Brookes.
pacted expressive and receptive language development 6 McLean, J., and Snyder-McLean, L. (1978). A transactional
and 12 months after intervention ceased (Yoder and approach to early language training. Columbus, OH:
Warren, 1999b, 2001a). It is important to consider this Charles E. Merrill.
finding in light of the substantial number of early inter- Samero¤, A. J., and Chandler, M. J. (1975). Reproductive risk
vention studies in which the e¤ects were reported to and the continuum of care-taking casualty. In F. D. Ho-
wash out over time (Farren, 2000). Finally, the finding rowitz, M. Hetherington, S. Scarr-Salapatck, and G. Siegel
that amount of responsive input by the primary care- (Eds.), Review of child development research (Vol. 4, pp.
giver was partly responsible for the association between 187–244). Chicago: University of Chicago Press.
Warren, S. F. (1992). Facilitating basic vocabulary acquisition
intentional communication increases and later language with milieu teaching procedures. Journal of Early Interven-
development (Yoder and Warren, 1999a), coupled with tion, 16(3), 235–251.
the longitudinal relationship between maternal respon- Warren, S. F., and Bambara, L. S. (1989). An experimental
sivity and expressive language development (Yoder and analysis of milieu language intervention: Teaching the
Warren, 2001a), supports the prediction of the transac- action-object form. Journal of Speech and Hearing Re-
tional model that children’s early intentional communi- search, 54, 448–461.
Warren, S. F., and Yoder, P. J. (1998). Facilitating the transi- e¤ect was small and, arguably, not clinically significant.
tion from preintentional to intentional communication. In Most children in both groups were still eligible for pre-
A. Wetherby, S. Warren, and J. Reichle (Eds.), Transitions school speech and language services at the end of the 12-
in prelinguistic communication (pp. 365–384). Baltimore: month study period.
Paul H. Brookes.
As disappointing as these results are, it is important
Warren, S. F., Yoder, P. J., Gazdag, G. E., Kim, K., and
Jones, H. A. (1993). Facilitating prelinguistic communica- to note that the study was designed to study the e¤ec-
tion skills in young children with developmental delay. tiveness of early communication services as typically
Journal of Speech and Hearing Research, 36, 83–97. provided in one community in the United Kingdom
Wetherby, A. M., Cain, D. H., Yonclas, D. G., and Walker, (Law and Conti-Ramsden, 2000). E¤ectiveness studies
V. G. (1988). Analysis of intentional communication of evaluate treatment e¤ects under relatively typical clinical
normal children from prelinguistic to the multiword stage. conditions. As such, the investigators learned that, on
Journal of Speech and Hearing Research, 31, 240–252. average, the children in the treatment group received
Wilcox, M. J. (1992). Enhancing initial communication skills only 6.2 hours of intervention, or 30 minutes per month.
in young children with developmental disabilities through In fact, the most intervention provided to any child was
partner programming. Seminars in Speech and Language,
only 15 hours over a 12-month period! This study dem-
13(3), 194–212.
Wilcox, M. J., and Shannon, J. S. (1998). Facilitating the onstrates most clearly that small, insignificant doses of
transition from prelinguistic to linguistic communication. In early language intervention are not e¤ective in eliminat-
A. Wetherby, S. Warren, and J. Reichle (Eds.), Transitions ing or reducing the broad range of problems associated
in prelinguistic communication (pp. 385–416). Baltimore: with preschool language impairment (see specific lan-
Paul H. Brookes. guage impairment in children; social development
Yoder, P. J., and Warren, S. F. (1998). Maternal responsivity and language impairment; preschool language
predicts the extent to which prelinguistic intervention facili- intervention).
tates generalized intentional communication. Journal of Unlike studies of e¤ectiveness, which monitor client
Speech, Language, and Hearing Research, 41, 1207–1219. change under typical clinical conditions, e‰cacy studies
Yoder, P. J., and Warren, S. F. (1999a). Maternal responsivity
are designed to determine, under more idealized, labo-
mediates the relationship between prelinguistic intentional
communication and later language. Journal of Early Inter- ratory conditions, whether an intervention is directly
vention, 22, 126–136. responsible for positive outcomes. There is ample evi-
Yoder, P. J., and Warren, S. F. (1999b). Self-initiated proto- dence that when preschool language interventions are
declaratives and proto-imperatives can be facilitated in pre- applied regularly with reasonable intensity, they are e‰-
linguistic children with developmental disabilities. Journal cacious, leading to clinically significant improvement in
of Early Intervention, 22, 337–354. the children’s language and early literacy skills. There
Yoder, P. J., and Warren, S. F. (2001a). Intentional commu- are many varieties of preschool interventions, however,
nication elicits language-facilitating maternal responses in and clinicians must carefully consider the options avail-
dyads with children who have developmental disabilities. able to them.
American Journal of Mental Retardation, 106, 327–335.
The principal di¤erences between di¤erent preschool
Yoder, P. J., and Warren, S. F. (2001b). Relative treatment
e¤ects of two prelinguistic communication interventions intervention approaches are best captured by deter-
on language development in toddlers with developmental mining where the interventions fall on a continuum of
delays vary by maternal characteristics. Journal of Speech, intrusiveness. Approaches that are highly intrusive use
Language, and Hearing Research, 44, 224–237. direct teaching methods in clinical settings, usually with
Yoder, P. J., Warren, S. F., Kim, K., and Gazdag, G. E. the clinician as the intervention agent, to address pre-
(1994). Facilitating prelinguistic communication skills in determined treatment objectives, such as specific words
young children with developmental delay: II. Systematic or grammatical structures. In contrast to the prescrip-
replication and extension. Journal of Speech and Hearing tive character of highly intrusive approaches, minimally
Research, 37, 841–851. intrusive approaches have goals that are stated more
broadly, with less focus on specific targets. Example
targets include the use of longer and more complex sen-
Preschool Language Intervention tences, personal reports, and stories, with the child using
an increasingly varied vocabulary. This gives the child
latitude to learn from the rich set of linguistic options
In 2000, the British Medical Journal published an im- available in intervention contexts. In general, the clini-
portant clinical trial of preschool speech and language cian exerts limited control over the child’s agenda.
services in 16 community clinics in Bristol, England Descriptions and examples of approaches at each end of
(Glogowska et al., 2000). This study was the largest trial the intrusiveness continuum follow.
of its kind to address the communication problems In protocols that are maximally intrusive, the child
of preschool children. Unfortunately, its findings were may examine a picture, object, or event presented by a
largely negative; after 12 months, the treatment group clinician, who then presents a linguistic model and a
showed a significant advantage on only one of the five request for the child to imitate. If the child imitates cor-
primary outcome variables, auditory comprehension, rectly, the clinician provides social or other reinforce-
over a control group that had received only ‘‘watchful ment and then presents another stimulus set. If the child
waiting’’ over the same time period. This lone treatment imitates incorrectly, the stimulus is repeated or sim-
Preschool Language Intervention 379
plified, and the child is prompted to imitate again. This awareness. In keeping with the limited clinician intru-
procedure originally stems from stimulus-response psy- siveness, however, the clinician does not directly teach
chology, but in contemporary versions, goals may be specific words, language structures, story structure, or
attacked in ways that are based on linguistic principles early literacy targets, nor are e¤orts made to get the
to encourage generalization to targets not directly child to imitate or to produce language out of context.
trained (Connell, 1986). As appealing as these child-oriented approaches are,
These types of intrusive approaches were popular in there is only limited empirical evidence that they are
the 1960s, 1970s, and 1980s, and experimental evidence e‰cacious in facilitating language use among children
indicates that they can be used to teach productive use with language impairments. Furthermore, it has not
of words, grammatical forms, and even conversational been adequately demonstrated that focusing broadly
behaviors to preschool children with language impair- on the communication of meaning leads to gains in the
ments (Cole and Dale, 1986; Yoder, Kaiser, and Alpert, specific areas of grammatical, phonological, and dis-
1991). In contrast to the interventions studied by Glo- course weakness exhibited by preschoolers with language
gowska et al. (2001), however, in these successful pro- impairment. Techniques such as following the child’s
grams, intervention is provided intensively, often for 10 lead, recasting the child’s utterances, and following the
minutes to 2 hours daily, with outcomes measured after child’s utterances with open-ended questions can be e‰-
periods of several months of intervention. ciently taught to parents or paraprofessionals, and this is
Maximally intrusive intervention options have fallen an important feature. For example, Dale et al. (1996)
out of favor because of evidence that language forms taught parents to use these procedures during shared
learned in this manner in the clinic do not transfer well to book reading with their children over two relatively brief
typical communicative contexts. Furthermore, because sessions. Parents made more changes as a result of the
teaching focuses on discrete language acts, and there are intervention than the children did, but outcomes were
no planned opportunities for the child to learn language measured after only 2 months. A longer intervention
incidentally, success depends to a large extent on the period may have resulted in greater e¤ects on the chil-
clinician’s ability to identify the most appropriate com- dren’s performance.
munication targets for each child. Contemporary language interventions typically are
The minimally intrusive approach described by hybrids that fall somewhere between the extremes in
Norris and Ho¤man (1990) is based on whole-language intrusiveness. For example, so-called milieu interven-
principles and di¤ers dramatically from maximally in- tions blend the identification of discrete intervention
trusive methods. There are three general steps to this targets and direct teaching using imitation and other
approach. The first involves selection of a theme around prompts (i.e., more intrusive components) with the prin-
which the therapy room or preschool classroom is ciples of creating natural contexts for communica-
organized. This theme typically is repeated across ses- tion, following the child’s lead, and recasting the child’s
sions, as the children engage in dramatic play, shared utterances (i.e., less intrusive components). These
book reading, art projects, and other theme-oriented approaches appear to be especially e‰cacious for chil-
activities. This thematic repetition provides greater fa- dren at the single-word or early multiword stages
miliarity, thus enabling children to become active par- (Yoder, Kaiser, and Alpert, 1991). Gibbard (1994)
ticipants in the activities, with reduced guidance from demonstrated that parents can be taught to use milieu
adults. It also provides for a natural repetition of lan- procedures in as few as 11 sessions over a 6-month
guage forms, such as words, grammatical structures, and period, yielding e¤ects commensurate with those of cli-
story structures, making it easier for children to learn nician-administered treatment. When they are applied
and use them. Second, the clinician follows the child’s with moderate intensity, milieu approaches increase
lead, waiting for the child to communicate rather than not only the length and complexity of children’s utter-
guiding the child’s attention. Third, the clinician eval- ances, but also the children’s conversational asser-
uates the child’s communicative e¤orts and provides ap- tiveness and responsiveness (Warren, McQuarter, and
propriate consequences. If the child’s e¤orts are unclear, Rogers-Warren, 1984).
the clinician may ask for clarification (e.g., ‘‘You want Another popular hybrid intervention is called focused
what?’’ ‘‘Do you want a cookie or a pencil?’’), use the stimulation. Most focused stimulation approaches create
cloze procedure by providing a model utterance for the contexts within which the interventionist produces
child to complete (e.g., ‘‘Tell Sandy you want a ’’), frequent models of the child’s social and linguistic tar-
or otherwise help the child to repair the communica- gets and creates numerous opportunities for the child to
tive attempt. If the child communicates adequately, the produce them. Interventionists follow the child’s lead,
child’s message is a‰rmed with an appropriate verbal or recasting the child’s utterances and using the child’s lan-
nonverbal act. In addition, after the child’s attempt (e.g., guage targets, but they do not prompt the child to imi-
‘‘Me eat cookie’’), the clinician can recast the child’s tate. Fey et al. (1993) used this type of approach over a
utterance by correcting its form (e.g., ‘‘Oh, you ate your 5-month period to facilitate the grammatical abilities of
cookie’’) or by altering its form in some way (‘‘Can I a group of 4- to 6-year-old children with impairments
have a cookie now?’’). In interventions such as this, it of grammatical production. They also trained parents to
is easy and appropriate to focus on early literacy skills, use these techniques over a 12-session parent interven-
such as letter knowledge, rhyming, and phonological tion. The children who received intervention exclusively
from their parents made gains that were, on average, Warren, S. F., McQuarter, R. T., and Rogers-Warren, A. K.
equivalent to the gains of the children who received 3 (1984). The e¤ects of mands and models on the speech of
hours of weekly individual and group intervention from unresponsive language-delayed preschool children. Journal
a clinician. Observed gains in the parent group were not of Speech and Hearing Disorders, 49, 43–52.
Yoder, P., Kaiser, A., and Alpert, C. (1991). An exploratory
as consistent across children as were the gains of the
study of the interaction between language teaching methods
children in the clinician group, however. and child characteristics. Journal of Speech and Hearing
In sum, preschool language intervention of several Research, 34, 155–167.
di¤erent types can be e‰cacious. Although individual
clinicians have their strong personal preferences, there Further Readings
are few experimental indications that any one approach
is dramatically superior to the others. To achieve clini- Camarata, S. M., and Nelson, K. E. (1992). Treatment e‰-
cally meaningful e¤ects, however, these interventions ciency as a function of target selection in the remediation of
must be presented rigorously over periods of at least child language disorders. Clinical Linguistics and Phonetics,
several months. Furthermore, it remains unclear whether 6, 167–178.
Camarata, S. M., Nelson, K. E., and Camarata, M. (1994).
existing approaches are su‰cient to minimize the risks Comparison of conversational-recasting and imitative pro-
for later social, behavioral, and academic problems pre- cedures for training grammatical structures in children with
schoolers with language impairments typically experi- specific language impairment. Journal of Speech and Hear-
ence once they reach school. To this end, promising ing Research, 37, 1414–1423.
hybrid preschool classroom interventions have been Cleave, P. L., and Fey, M. E. (1997). Two approaches to the
developed that aim to enhance not only the children’s facilitation of grammar in children with language impair-
spoken language, but their problems in social adaptation ments: Rationale and description. American Journal of
and early literacy as well (Rice and Wilcox, 1995; van Speech-Language Pathology, 6, 22–32.
Kleeck, Gillam, and McFadden, 1998). Culatta, B., and Horn, D. (1982). A program for achieving
generalization of grammatical rules to spontaneous dis-
—Marc E. Fey course. Journal of Speech and Hearing Disorders, 47, 174–
180.
References Fey, M. E. (1986). Language intervention with young children.
Boston: Allyn and Bacon.
Cole, K., and Dale, P. (1986). Direct language instruction and Fey, M. E., Catts, H. W., and Larrivee, L. (1995). Preparing
interactive language instruction with language-delayed pre- preschoolers with language impairment for the academic
school children: A comparison study. Journal of Speech and and social challenges of school. In M. E. Fey, J. Windsor,
Hearing Disorders, 29, 209–217. and S. F. Warren (Eds.), Language intervention: Preschool
Connell, P. (1986). Teaching subjecthood to language- through the elementary years (pp. 3–37). Baltimore: Paul H.
disordered children. Journal of Speech and Hearing Dis- Brookes.
orders, 29, 481–492. Fey, M. E., Cleave, P. L., and Long, S. H. (1997). Two
Dale, P. S., Crain-Thoreson, C., Notari-Syverson, A., and approaches to grammar facilitation in children with lan-
Cole, K. (1996). Parent-child book reading as an interven- guage impairments: Phase 2. Journal of Speech and Hearing
tion technique for young children with language delays. Research, 40, 5–19.
Topics in Early Childhood Special Education, 16, 213–235. Fey, M. E., Cleave, P. L., Ravida, A. I., Dejmal, A. R.,
Fey, M. E., Cleave, P. L., Long, S. H., and Hughes, D. L. Easton, D., and Long, S. H. (1994). E¤ects of grammar
(1993). Two approaches to the facilitation of grammar in facilitation on the phonological performance of children
language-impaired children: An experimental evaluation. with speech and language impairments. Journal of Speech
Journal of Speech and Hearing Research, 36, 141–157. and Hearing Research, 37, 594–607.
Gibbard, D. (1994). Parental-based intervention with pre- Fey, M. E., Krulik, T., Loeb, D. F., and Proctor-Williams, K.
school language-delayed children. European Journal of Dis- (1999). Sentence recast use by parents of children with typ-
orders of Communication, 29, 131–150. ical language and children with specific language impair-
Glogowska, M., Roulstone, S., Enderby, P., and Peters, T. ment. American Journal of Speech-Language Pathology, 8,
(2000). Randomised controlled trial of community based 273–286.
speech and language therapy in preschool children. British Fey, M. E., and Proctor-Williams, K. (2000). Recasting, eli-
Medical Journal, 321, 1–5. cited imitation, and modeling in grammar intervention for
Law, J., and Conti-Ramsden, G. (2000). Treating children with children with specific language impairments. In D. V. M.
speech and language impairments: Six hours of therapy is Bishop and L. B. Leonard (Eds.), Speech and language
not enough. British Medical Journal, 321, 908–909. impairments in children: Cause, characteristics, intervention,
Norris, J. A., and Ho¤man, P. R. (1990). Language interven- and outcome (pp. 177–194). London: Psychology Press.
tion within naturalistic environments. Language, Speech, Hayward, D., and Schneider, P. (2000). E¤ectiveness of teach-
and Hearing Services in Schools, 21, 72–84. ing story grammar knowledge to pre-school children with
Rice, M. L., and Wilcox, K. A. (1995). Building a language- language impairment: An exploratory study. Child Lan-
focused curriculum for the preschool classroom. Vol. 1, A guage Teaching and Therapy, 16, 255–284.
foundation for lifelong communication. Baltimore: Paul H. Kaiser, A. P., Yoder, P. J., and Keetz, A. (1992). Evaluating
Brookes. milieu teaching. In S. F. Warren and J. Reichle (Eds.),
Van Kleeck, A., Gillam, R. B., and McFadden, T. U. (1998). Causes and e¤ects in communication and language interven-
A study of classroom-based phonological awareness train- tion (pp. 9–47). Baltimore: Paul H. Brookes.
ing for preschoolers with speech and/or language disorders. Law, J., Boyle, J., Harris, F., Harkness, A., and Nye, C.
American Journal of Speech-Language Pathology, 7, 65–76. (1998). Screening for speech and language delay: A system-
Prosodic Deficits 381
atic review of the literature. Health Technology Assessment, brain lesion seems to play an important role in the
2, 858–865. nature of the ensuing deficits; however, the neural sub-
Leonard, L. B. (1975). Modeling as a clinical procedure in strates for prosody are still far from clear (see Baum and
language training. Language, Speech, and Hearing Services Pell, 1999, for a review). Historically, clinical impres-
in Schools, 6, 72–85.
sions led to the contention that subsequent to right
Leonard, L. B. (1998). The nature and e‰cacy of treatment.
In L. B. Leonard (Ed.), Children with specific language im- hemisphere damage (RHD), patients would present with
pairment (pp. 193–210). Cambridge, MA: MIT Press. flat a¤ect and monotonous speech, whereas patients with
Nelson, K. E., Camarata, S. M., Welsh, J., Butkovsky, L., and left hemisphere damage (LHD) would maintain normal
Camarata, M. (1996). E¤ects of imitative and conversa- speech prosody. As research progressed, several alterna-
tional recasting treatment on the acquisition of grammar in tive theories concerning the control of speech prosody
children with specific language impairment and younger were posited. Among these are the hypothesis that
language-normal children. Journal of Speech and Hearing a¤ective or emotional prosody is controlled within the
Research, 39, 850–859. right hemisphere, and thus RHD would yield emotional
Notari-Syverson, A., O’Connor, R. E., and Vadasy, P. F. prosodic deficits, whereas linguistic prosody is controlled
(1998). Ladders to literacy: A preschool activity book. Balti-
within the left hemisphere, yielding linguistic prosodic
Nye, C., Foster, S., and Seaman, D. (1987). E¤ectiveness of deficits when damage is confined to the left hemisphere
language intervention with the language/learning disabled. (e.g., Van Lancker, 1980; Ross, 1981). A second hy-
Journal of Speech and Hearing Disorders, 52, 348–357. pothesis proposes that prosody is principally controlled
Olswang, L. B., Bain, B. A., and Johnson, G. A. (1992). Using in subcortical regions and via cortical-subcortical con-
dynamic assessment with children with language disorders. nections (e.g., Cancelliere and Kertesz, 1990); evidence
In S. F. Warren and J. Reichle (Eds.), Causes and e¤ects in of prosodic deficits in individuals with Parkinson’s dis-
communication and language intervention (pp. 187–215). ease supports this view. A third alternative contends that
Baltimore: Paul H. Brookes. the individual acoustic cues to prosody (i.e., duration,
Paul, R. (2001). Language disorders from infancy through ado- amplitude, and fundamental frequency) are di¤erentially
lescence: Assessment and Intervention (2nd ed.). St. Louis:
lateralized to the right and left hemispheres, with tem-
Mosby.
Proctor-Williams, K., Fey, M. E., and Loeb, D. F. (2001). poral properties processed by the left hemisphere and
Parental recasts and production of copulas and articles spectral properties by the right hemisphere (e.g., Van
by children with specific language impairment and typical Lancker and Sidtis, 1992). Whereas several recent in-
language. American Journal of Speech-Language Pathology, vestigations have utilized functional neuroimaging
10, 155–168. techniques in normal individuals to address these hy-
Rice, M. L., and Hadley, P. A. (1995). Language outcomes of potheses (e.g., Gandour et al., 2000), by far the most
the language-focused curriculum. In M. L. Rice and K. A. data have been gathered in studies of individuals who
Wilcox (Eds.), Building a language-focused curriculum for have su¤ered brain damage. These investigations allow
the preschool classroom. Vol. 1. A foundation for lifelong us to characterize the nature of prosodic deficits that
communication (pp. 155–169). Baltimore: Paul H. Brookes.
may emerge in neurologically impaired populations. The
Robertson, S. B., and Ellis Weismer, S. (1997). The influence
of peer models on the play scripts of children with specific discussion is divided into a¤ective and linguistic proso-
language impairment. Journal of Speech, Language, and dic impairments.
Hearing Research, 40, 49–61. Beginning with deficits in the production and percep-
Schuele, C. M., Rice, M. L., and Wilcox, K. A. (1995). Re- tion of a¤ective prosody, one of the salient speech char-
directs: A strategy to increase peer initiations. Journal of acteristics of individuals who have su¤ered RHD is a flat
Speech and Hearing Research, 38, 1319–1333. a¤ect. That is, in conjunction with a reduction in emo-
Weiss, A. L. (2001). Preschool language disorders resource tional expression as reflected in facial expressions, clini-
guide. San Diego, CA: Singular Publishing Group. cal impressions suggest that individuals with RHD tend
Wilcox, M. J., Kouri, T. A., and Caswell, S. B. (1991). Early to produce speech that is reduced, if not devoid, of af-
language intervention: A comparison of classroom and in-
fect. In fact, based on clinical judgments of the speech of
dividual treatment. American Journal of Speech-Language
Pathology, 1, 49–62. RHD patients with varying sites of lesion, Ross (1981)
proposed a classification system for a¤ective impair-
ments, or aprosodias, that paralleled the popular aphasia
Prosodic Deficits syndrome classification system of Goodglass and Kaplan
(1983). Ross’s 1981 classification scheme sparked a good
deal of research on a¤ective prosodic deficits that ulti-
Among the sequelae of certain types of brain damage are mately resulted in its abandonment by the majority of
impairments in the production and perception of speech investigators. However, the investigations it catalyzed
prosody. Prosody serves numerous functions in lan- contributed significantly to our understanding of proso-
guage, including signaling lexical di¤erences when used dic impairments; much of the work inspired by Ross’s
phonemically in tone languages, providing cues to stress, proposal took advantage of increasingly reliable meth-
sentence type or modality, and syntactic boundaries, ods such as acoustic analysis of speech. When study-
and conveying a speaker’s emotions. Any or all of these ing RHD patients in an acute stage, results seemed to
functions of prosody may be impaired subsequent to support impairments in patients’ ability to accurately
brain damage. The hemispheric lateralization of the signal emotions such as happiness, sadness, and anger.
However, the majority of investigations of patients who guistic prosodic cues at the phrase or sentence level, both
had reached a more chronic stage (i.e., at least 3 months individuals with RHD and LHD have di‰culty identi-
post onset) reported few di¤erences between RHD fying declarative, interrogative, and imperative sentence
patients and normal controls in signaling various emo- types on the basis of prosodic cues alone. LHD but not
tions, as reflected in acoustic measures as well as per- RHD patients tend to be relatively more impaired in
ceptual judgments. Occasionally, studies have reported linguistic than a¤ective prosodic perception when direct
a¤ective prosodic impairments in speech production comparisons are made within a single study (Heilman et
subsequent to LHD (e.g., Cancelliere and Kertesz, al., 1984). Baum and colleagues (1997) have also noted
1990), although such findings are far less frequent. impairments in the perception of phrase boundaries by
With respect to the perception of a¤ective prosody, both LHD and RHD patients. Investigations of individ-
early studies again suggested deficits in the processing uals with basal ganglia disease due to Parkinson’s or
of emotions cued by vocal signals subsequent to RHD Huntington’s disease have also reported deficits in the
(see Baum and Pell, 1999). Additional investigations comprehension of prosody (e.g., Blonder, Gur, and Gur,
have also indicated that LHD patients may exhibit defi- 1989), suggesting that subcortical structures or cortical-
cits in the perception of a¤ective prosody, particularly subcortical connections are important in prosodic pro-
when the processing load is heavy (e.g., Tompkins and cessing. Due to its multiple functions in language,
Flowers, 1985). The finding that both LHD and RHD understanding prosodic deficits and the neural substrates
may yield impairments in prosodic processing led to the implicated in the processing of prosody is clearly a
proposal that the individual acoustic properties that complex task.
serve as prosodic cues (i.e., duration, F0, and amplitude) Although this article has considered a¤ective and lin-
may be processed independently in the two cerebral guistic prosody separately, this represents a somewhat
hemispheres and that patients with RHD and LHD artificial distinction, as they are integrated in natural
may rely to di¤erent degrees on multiple cues (e.g., Van speech production and perception. A handful of recent
Lancker and Sidtis, 1992). investigations have begun to address this integration,
With regard to linguistic prosody, in keeping with with mixed results appearing even in normal individuals
the hypothesized functional lateralization of prosody (e.g., Pell, 1999). Exploring the integration of a¤ective
described earlier (Van Lancker, 1980), numerous inves- and linguistic prosody in individuals who have su¤ered
tigations have demonstrated that individuals with LHD brain damage only compounds the problem and the
exhibit impairments in the production of linguistic pros- inconsistencies.
ody, particularly at the phonemic level (i.e., in tone lan- In summary, impairments in the production and per-
guages such as Mandarin, Norwegian, or Thai; e.g., ception of speech prosody may emerge subsequent to
Gandour et al., 1992). Deficits in the ability to signal focal brain damage to numerous cortical and sub-
emphatic stress contrasts, declarative versus interroga- cortical regions. The precise nature of the deficit may
tive sentence types, and syntactic clause boundaries have depend in part on the site of the lesion, but it seems
also been shown subsequent to LHD (e.g., Danly and to vary along the dimensions of the prosodic func-
Shapiro, 1982), but some studies have shown similar tional load (from a¤ective to linguistic), the size or
impairments in RHD patients (e.g., Pell, 1999) or have domain of the production or processing unit, and the
demonstrated that the production of only certain acous- specific acoustic parameters contributing to the prosodic
tic cues, primarily temporal parameters, is a¤ected in signal. Prosodic deficits clearly interact with other
LHD patients (e.g., Baum et al., 1997). The clearest evi- communicative impairments, including disorders of lin-
dence for the role of the left hemisphere in the pro- guistic and pragmatic processing, contributing to the
duction of linguistic prosody comes from studies of the symptom complexes associated with the aphasias, motor
phonemic use of tone; while this is arguably the ‘‘most speech disorders, and right hemisphere communication
linguistic’’ of the functions of prosody, it is also the deficits.
smallest unit (i.e., a single syllable) in which prosodic See also right hemisphere language and communi-
cues may be manifest. It has therefore been suggested cation functions in adults; right hemisphere lan-
that the size or domain of the production unit may play guage disorders.
a role in the brain regions implicated in prosodic pro-
cessing. As an obvious corollary, patients with LHD —Shari R. Baum
and RHD may display impairments limited to di¤erent
domains of prosodic processing. References
Impairments in the perception or comprehension of
linguistic prosody have also been found in both LHD Baum, S., and Pell, M. (1999). The neural bases of prosody:
and RHD patient groups, with varying results depending Insights from lesion studies and neuroimaging. Aphasiology,
on the nature of the stimuli or the task. Investigations 13, 581–608.
Baum, S., Pell, M., Leonard, C., and Gordon, J. (1997). The
focusing on the perception of stress cues have mainly ability of right- and left-hemisphere-damaged individuals to
reported reduced performance relative to normal by produce and interpret prosodic cues marking phrasal boun-
individuals with LHD. For instance, several studies have daries. Language and Speech, 40, 313–330.
shown that LHD patients are impaired in the ability Blonder, L. X., Gur, R. E., and Gur, R. C. (1989). The e¤ects
to identify phonemic (lexical) and emphatic stress (e.g., of right and left hemiparkinsonism on prosody. Brain and
Emmorey, 1987). With regard to the perception of lin- Language, 36, 193–207.
Reversibility/Mapping Disorders 383
Cancelliere, A., and Kertesz, A. (1990). Lesion localization in Pell, M. (1996). On the receptive prosodic loss in Parkinson’s
acquired deficits of emotional expression and comprehen- disease. Cortex, 32, 693–704.
sion. Brain and Cognition, 13, 133–147. Pell, M. (1999). The temporal organization of a¤ective and
Danly, M., and Shapiro, B. E. (1982). Speech prosody in non-a¤ective speech in patients with right-hemisphere
Broca’s aphasia. Brain and Language, 16, 171–190. infarcts. Cortex, 35, 455–477.
Emmorey, K. (1987). The neurological substrates for pro- Pell, M., and Baum, S. (1997a). The ability to perceive and
sodic aspects of speech. Brain and Language, 30(2), 305– comprehend intonation in linguistic and a¤ective contexts
320. by brain damaged adults. Brain and Language, 57, 80–99.
Gandour, J., Ponglorpisit, S., Khunadorn, F., Dechongkit, S., Pell, M., and Baum, S. (1997b). Unilateral brain damage, pro-
Boongird, P., Boonklam, R., et al. (1992). Lexical tones in sodic comprehension deficits, and the acoustic cues to pros-
Thai after unilateral brain damage. Brain and Language, 43, ody. Brain and Language, 57, 195–214.
275–307. Ross, E., Thompson, R., and Yenkosky, J. (1997). Laterali-
Gandour, J., Wong, D., Hsieh, L., Weinzapfel, B., Van zation of a¤ective prosody in brain and the callosal inte-
Lancker, D., and Hutchins, G. (2000). A crosslinguistic gration of hemispheric language functions. Brain and
PET study of tone perception. Journal of Cognitive Neuro- Language, 56, 27–54.
science, 12, 207–222. Zatorre, R. (1988). Pitch perception of complex tones and
Goodglass, H., and Kaplan, E. (1983). The assessment of human temporal-lobe function. Journal of the Acoustical
aphasia and related disorders. Philadelphia: Lea and Febiger. Society of America, 84, 566–572.
Heilman, K., Bowers, D., Speedie, L., and Coslett, H. (1984). Zatorre, R., Evans, A., and Meyer, E. (1994). Neural mecha-
Comprehension of a¤ective and nona¤ective prosody. Neu- nisms underlying melodic perception and memory for pitch.
rology, 34, 917–920. Journal of Neuroscience, 14, 1908–1919.
Pell, M. (1999). Fundamental frequency encoding of linguistic Zatorre, R., Evans, A., Meyer, E., and Gjedde, A. (1992).
and emotional prosody by right hemisphere damaged Lateralization of phonetic and pitch discrimination in
speakers. Brain and Language, 69, 161–192. speech processing. Science, 256, 846–849.
Ross, E. (1981). The aprosodias: Functional-anatomic organi-
zation of the a¤ective components of language in the right
hemisphere. Archives of Neurology, 38, 561–569. Reversibility/Mapping Disorders
Tompkins, C. A., and Flowers, C. R. (1985). Perception of
emotional intonation by brain-damaged adults: The influ-
ence of task processing levels. Journal of Speech and Hear- Impaired comprehension of reversible sentences is widely
ing Research, 28, 527–538. observed in aphasia. Reversible sentences (e.g., The cat
Van Lancker, D. (1980). Cerebral lateralization of pitch cues in chased the dog) cannot be interpreted accurately without
the linguistic signal. International Journal of Human Com-
attention to word order and other syntactic devices,
munication, 13(2), 227–277.
Van Lancker, D., and Sidtis, J. J. (1992). The identification of whereas the sole plausible interpretation of nonreversible
a¤ective-prosodic stimuli by left- and right-hemisphere- sentences (e.g., The cat drank the milk) can be derived
damaged subjects: All errors are not created equal. Journal from content words via semantic or pragmatic inferenc-
of Speech and Hearing Research, 35, 963–970. ing. Impaired comprehension of reversible sentences,
along with relatively intact comprehension of single
words and nonreversible sentences, most frequently co-
Further Readings occurs with Broca’s aphasia but is also observed in other
forms of aphasia (Caramazza and Zurif, 1976; Martin
Baum, S. (1998). The role of fundamental frequency and du- and Blossom-Stach, 1986; Caramazza and Micelli,
ration in the perception of linguistic stress by individuals
1991). This comprehension pattern, termed asyntactic
with brain damage. Journal of Speech, Language, and
Hearing Research, 41, 31–40. comprehension, has been studied intensively as evidence
Blonder, L. X., Pickering, J. E., Heath, R. L., Smith, C. D., about the syntactic abilities of aphasic listeners.
and Butler, S. M. (1995). Prosodic characteristics of speech Accounts of asyntactic comprehension di¤er in two
pre- and post-right hemisphere stroke. Brain and Language, dimensions. The first is competence versus performance:
51, 318–335. Does the failure to interpret reversible sentences cor-
Borod, J. (1993). Cerebral mechanisms underlying facial, rectly derive from a loss of linguistic knowledge or lan-
prosodic, and lexical emotional expression: A review of guage-processing ability, or does this failure stem from
neuropsychological studies and methodological issues. Neu- performance factors such as resource limitations? The
ropsychology, 7(4), 445–463. second dimension is parsing versus mapping: Does asyn-
Gandour, J., Larsen, J., Dechongkit, S., Ponglorpisit, S., and
Khunadorn, F. (1995). Speech prosody in a¤ective contexts
tactic comprehension derive from a failure to parse or
in Thai patients with right hemisphere lesions. Brain and from a failure to map an accurate parse onto a semantic
Language, 51, 422–443. representation?
Gandour, J., Wong, D., Hsieh, L., Weinzapfel, B., Van
Lancker, D., and Hutchins, G. (2000). A crosslinguistic Competence Versus Performance
PET study of tone perception. Journal of Cognitive Neuro-
science, 12, 207–222. Early competence-based interpretations of asyntactic
Gandour, J., Wong, D., and Hutchins, G. (1998). Pitch proc- comprehension pointed to loss of linguistic knowledge
essing in the human brain is influenced by language experi- or damage to the human parser as a common under-
ence. NeuroReport, 9, 2115–2119. lying source for this comprehension impairment and for
Ivry, R., and Robertson, L. (1998). The two sides of perception. agrammatism, a speech production pattern found in
Cambridge, MA: MIT Press. some Broca’s aphasics. Agrammatism is characterized by
omission or misselection of grammatical morphemes even those aphasic patients who normally perform well
and/or simplified, fragmentary grammatical structure. on simple active reversible sentences may fail when the
This hypothesis of a central syntactic disorder (Car- lexical content is manipulated so that the syntactically
amazza and Zurif, 1976; Berndt and Caramazza, 1980) correct interpretation is the opposite of the interpreta-
was motivated by the observation that asyntactic com- tion supported by lexicosemantic heuristics, as in, for
prehension and agrammatism of speech both suggest a example, a semantic/pragmatic anomaly task requiring
limited exploitation of syntactic devices. However, dou- the subject to detect the anomaly of The cheese ate the
ble dissociations between asyntactic comprehension and mouse (Sa¤ran, Schwartz, and Linebarger, 1998).
agrammatic production (Goodglass and Menn, 1985) An additional point to note is that while the linguis-
argue against a central account. Competence-based tically fine-grained accounts appeal to heuristics to ex-
explanations—using the term ‘‘competence’’ broadly to plain observed patterns of comprehension, they invoke
apply to the absolute inability to perform specific lin- these heuristics only as a response to parsing failure. The
guistic operations as a result of either loss of knowledge Sa¤ran et al. (1998) data, in contrast, suggest that
or damage to the psychological mechanisms responsible heuristics may occur in parallel with, and sometimes in
for computing linguistic representations—are under- competition with, syntactic analysis. Such a view accords
mined by the findings that (1) asyntactic comprehension with studies of sentence processing in normals, where the
can be induced in normal subjects under resource- influence of extragrammatical heuristics in normal sen-
demanding conditions (Miyake, Carpenter, and Just, tence comprehension is well documented (Slobin, 1966;
1994; Blackwell and Bates, 1995); (2) aphasic perfor- Bever, 1970; Trueswell, Tanenhaus, and Garnsey, 1994).
mance varies greatly from session to session (Kolk and
van Grunsven, 1985) and from task to task (Cupples and
Asyntactic Comprehension as Mapping Failure
Inglis, 1993); and (3) asyntactic comprehenders fre-
quently perform close to normally on grammaticality This explanation for asyntactic comprehension impli-
judgment tasks (Linebarger, Schwartz, and Sa¤ran, cates the mapping between syntactic structure and
1983), detecting grammatical ill-formedness in the same semantic interpretation (Linebarger, Schwartz, and Saf-
structures that they do not reliably comprehend. These fran, 1983). On this account, asyntactic listeners do
findings are more compatible with a performance ac- construct an adequate representation of the structure of
count than with an account that implicates loss of the the input sentence but fail to exploit this syntactic infor-
ability to perform the relevant language-processing mation for the recovery of meaning, specifically of the-
operations even under optimal conditions. matic roles such as agent and theme. The frequently
Performance accounts di¤er regarding the nature of observed di‰culty posed by passive and object-gapped
the hypothesized resource limitation. Some point to a sentences follows, on this account, from the fact that the
global resource deficit (Blackwell and Bates, 1995); order of content words in these structures conflicts with
others invoke more specific limitations (Miyake et al., extragrammatical order-based heuristics (Caplan, Baker,
1994; Caplan and Waters, 1995). Performance accounts and Dehaut, 1985). If such heuristics occur in parallel
also di¤er regarding the linguistic operations disrupted with grammatically based processing, as suggested by
by the hypothesized resource limitation: some implicate the literature on normals, then conflict between gram-
parsing (Kolk, 1995); others point to subsequent inter- matical structure and extragrammatical heuristics would
pretative processes. be predicted to lead to more errorful performance.
Evidence to choose between these two hypotheses
Asyntactic Comprehension as Parsing Failure is sparse. The grammaticality judgment data do not
undermine the performance (as opposed to the compe-
This class of performance-based explanations posits a tence) version of the parsing hypothesis, on the assump-
failure to retrieve the syntactic structure of the input tion that the grammaticality judgment task is less
sentence, a failure that may occur only when task resource-demanding than the sentence-picture matching
demands are high (Frazier and Friederici, 1991; but see task or other paradigms that require not only syntactic
Linebarger, 1995). On the basis of patterns of perfor- analysis but also semantic interpretation. Parsing, on
mance on comprehension tasks, some investigators this account, is performed in optimal circumstances, but
have attempted to pinpoint the grammatical locus of not when other task demands are high.
this failure, implicating, for example, the processing of The grammaticality judgment data do not contradict
closed-class elements (Bradley, Garrett, and Zurif, the mapping account, either. This explanation for asyn-
1980), syntactically moved elements (Grodzinsky, 1990), tactic comprehension posits a normal syntactic parse
or referential dependencies (Mauner, Fromkin, and that is not adequately mapped onto an interpretation. In
Cornell, 1993). One di‰culty for these more fine-grained fact, patterns of performance observed within the gram-
hypotheses (see also Linebarger, 1995) is the variability maticality judgment task may be seen as supporting the
observed in aphasic performance across di¤erent syn- mapping hypothesis. Errors related to constituent struc-
tactic structures (Berndt, Mitchum, and Haendiges, ture, verb subcategorization, and the legitimacy of syn-
1996). For example, these accounts predict good perfor- tactic gaps were detected more reliably by agrammatic
mance on simple active sentences, and hence fail to ac- patients than errors involving coindexation of pronouns
count for the di‰culties posed for some patients by such and other referential elements (Linebarger, 1990); such
sentences (Schwartz, Sa¤ran, and Marin, 1980). And patterns suggest an initial ‘‘first-pass’’ recovery of con-
Reversibility/Mapping Disorders 385
stituent structure that is not fully interpreted, a pattern See also attention and language; memory and
which falls naturally from the mapping hypothesis. processing capacity; trace deletion hypothesis.
The mapping hypothesis also receives support from a
—Marcia C. Linebarger
study in which aphasic subjects judged the plausibility of
simple reversible sentences and, in addition, of ‘‘padded’’
versions of these sentences (Schwartz et al., 1987; see References
also Kolk and Weijts, 1995). In the padded versions, the Berndt, R. S., and Caramazza, A. (1980). A redefinition of the
basic SVO (subject-verb-object) structure was elaborated syndrome of Broca’s aphasia: Implications for a neuro-
with extraneous material. For example, subjects were psychological model of language. Applied Psycholinguistics,
presented with The bird swallowed the worm, its padded 1, 225–278.
counterpart, As the sun rose, the bird in the cool wet grass Berndt, R. S., Mitchum, C. C., and Haendiges, A. N. (1996).
swallowed the worm quickly and went away, and with the Comprehension of reversible sentences in ‘‘agrammatism’’:
role-reversed versions of these sentences. In addition, A meta-analysis. Cognition, 58, 289–308.
these same predicate argument structures were em- Bever, T. G. (1970). The cognitive basis for linguistic struc-
bedded in noncanonical structures such as passives, ob- tures. In R. Hayes (Ed.), Cognition and language develop-
ment. New York: Wiley.
ject gaps, and other deviations from the simple SVO
Blackwell, A., and Bates, E. (1995). Inducing agrammatic
structure that typically cause di‰culties for asyntactic profiles in normals: Evidence for the selective vulnerability
comprehenders. The agrammatic and conduction apha- of morphology under cognitive resource limitation. Journal
sic subjects in this study performed well above chance on of Cognitive Neuroscience, 7, 228–257.
both the simple and padded sentences; their performance Bradley, D. C., Garrett, M. F., and Zurif, E. B. (1980). Syn-
declined to chance or near chance only on the non- tactic deficits in Broca’s aphasia. In D. Caplan (Ed.), Bio-
canonical structures. The good performance on padded logical studies of mental processes. Cambridge, MA: MIT
sentences supports the view that asyntactic compre- Press.
henders are able to construct an adequate representation Byng, S. (1988). Sentence processing deficits: Theory and ther-
of constituent structure, because the extraction of the apy. Cognitive Neuropsychology, 5, 629–676.
Caplan, D., Baker, C., and Dehaut, F. (1985). Syntactic deter-
elements critical to the plausibility judgment (bird, swal- minants of sentence comprehension in aphasia. Cognition,
low, and worm) requires an analysis of the structure of 21, 117–175.
the sentence. A nonsyntactic ‘‘nearest NP’’ strategy, for Caplan, D., and Waters, G. S. (1995). Aphasic disorders
example, would lead subjects to reject the padded sen- of syntactic comprehension and working memory capacity.
tence above, since grass immediately precedes swallowed. Cognitive Neuropsychology, 12, 637–649.
The literature on mapping therapy, an approach to Caramazza, A., and Micelli, G. (1991). Selective impairment of
remediation based on this hypothesis, contains reports thematic role assignment in sentence processing. Brain and
(Jones, 1986; Byng, 1988) of striking gains resulting from Language, 41, 402–436.
a training protocol focusing on the relationship be- Caramazza, A., and Zurif, E. B. (1976). Dissociation of algo-
tween grammatical functions such as subject/object and rithmic and heuristic processes in language comprehension:
Evidence from aphasia. Brain and Language, 3, 572–582.
thematic roles such as agent/theme. While subsequent Cupples, L., and Inglis, A. L. (1993). When task demands
studies have reported a variety of outcomes for this induce ‘‘asyntactic’’ comprehension: A study of sentence
approach to therapy, the reported successes suggest that interpretation in aphasia. Cognitive Neuropsychology, 10,
for at least a subset of patients, the breakdown in pro- 201–234.
cessing may occur in the assignment of thematic roles on Frazier, L., and Friederici, A. (1991). On deriving the proper-
the basis of grammatical function. ties of agrammatic comprehension. Brain and Language, 40,
51–66.
Goodglass, H., and Menn, L. (1985). Is agrammatism a uni-
tary phenomenon? In M.-L. Kean (Ed.), Agrammatism (pp.
1–26). New York: Academic Press.
Asyntactic Comprehension as Evidence About Grodzinsky, Y. (1990). Theoretical perspectives on language
Language Processing deficits. Cambridge, MA: MIT Press.
Jones, E. V. (1986). Building the foundations for sentence pro-
It can be argued that the fragility of linguistic processing duction in a non-fluent aphasic. British Journal of Disorders
in aphasia, whatever its cause, results in a dispropor- of Communication, 21, 63–82.
tionate influence of extralinguistic processing based on Kolk, H. H. J. (1995). A time-based approach to agrammatic
lexical content and word order rather than grammatical comprehension. Brain and Language, 50, 282–303.
structure. Therefore the patterns of misinterpretation Kolk, H. H. J., and van Grunsven, M. F. (1985). Agramma-
observed in aphasic subjects may not directly reflect tism as a variable phenomenon. Cognitive Neuropsychology,
their linguistic impairments, but rather a complex in- 2, 347–384.
teraction between ine‰cient or inaccurate linguistic Kolk, H. H. J., and Weijts, M. (1995). Judgements of semantic
anomaly in agrammatic patients: Argument movement,
analysis and extragrammatical interpretative processes. syntactic complexity, and the use of heuristics. Brain and
Furthermore, the heterogeneous patterns of interpretive Language, 54, 86–135.
errors even within specific subgroups such as agram- Linebarger, M. C. (1990). Neuropsychology of sentence pars-
matics suggest that there may be no unitary explanation ing. In A. Caramazza (Ed.), Cognitive neuropsychology and
for the impaired comprehension of reversible sentences neurolinguistics: Advances in models of cognitive function
in aphasia. (pp. 55–122). Hillsdale, NJ: Erlbaum.
Linebarger, M. C. (1995). Agrammatism as evidence about Claims related to right hemisphere contributions to
grammar. Brain and Language, 50, 52–91. language and communication can be stated in a strong
Linebarger, M. C., Schwartz, M. F., and Sa¤ran, E. M. (1983). form: that a specific function is housed in some region
Sensitivity to grammatical structure in so-called agrammatic of the right hemisphere that is necessary and su‰cient
aphasics. Cognition, 13, 361–392.
to support that function. However, most claims are more
Martin, R. C., and Blossom-Stach, C. (1986). Evidence of
syntactic deficits in a fluent aphasic. Brain and Language, general and also weaker: that normal task performance
28, 196–234. draws on intact right as well as left hemisphere struc-
Mauner, G., Fromkin, V. A., and Cornell, T. L. (1993). Com- tures. For example, understanding the point of an ironic
prehension and acceptability judgments in agrammatism: comment rests on a listener’s appreciation of phonology,
Disruption in the syntax of referential dependency. Brain word meaning, and grammatical relations, as well as
and Language, 45, 340–370. a speaker’s tone of voice, preceding context, and the
Miyake, A., Carpenter, P., and Just, M. A. (1994). A capacity speaker’s mood. In simplistic terms, the right hemi-
approach to asyntactic comprehension: Making normal sphere’s contributions to language and communication
adults perform like aphasic patients. Cognitive Neuro- are typically layered on top of the foundation provided
psychology, 11, 671–717.
by the left hemisphere. Even weak claims for the right
Sa¤ran, E. M., Schwartz, M., and Linebarger, M. C. (1998).
Semantic influences on thematic role assignment: Evi- hemisphere’s role are extremely important clinically be-
dence from normals and aphasics. Brain and Language, 62, cause injury to the right hemisphere often results in
255–297. impairments that significantly reduce a patient’s ability
Schwartz, M., Linebarger, M. C., Sa¤ran, E. M., and Pate, to communicate e¤ectively in natural settings.
D. S. (1987). Syntactic transparency and sentence inter- Another general point concerns localization of func-
pretation in aphasia. Language and Cognitive Processes, 2, tion. Often, a group of patients with right hemisphere
85–113. lesions is compared with a group of non-brain-injured
Schwartz, M. F., Sa¤ran, E. M., and Marin, O. S. M. (1980). controls. Although this type of comparison does not
The word order problem in agrammatism: I. Comprehen- allow localization of a particular function to the right
sion. Brain and Language, 10, 249–262.
hemisphere, it still supports the weaker interpretation
Slobin, D. I. (1966). Grammatical transformations and sen-
tence comprehension in childhood and adulthood. Journal mentioned earlier. In addition, some studies provide
of Verbal Learning and Verbal Behavior, 2, 219–227. strong support for right hemisphere localization by (1)
Trueswell, J. C., Tanenhaus, M., and Garnsey, S. M. (1994). directly comparing the e¤ects of unilateral lesions of the
Semantic influences on parsing: Use of thematic role infor- right and left hemispheres, (2) using lateralized presen-
mation in syntactic disambiguation. Journal of Memory and tation to intact left or right hemispheres, or (3) using
Language, 33, 285–318. functional imaging (PET, fMRI) to examine ‘‘on-line’’
brain activation in non-brain-injured adults. There is
growing support for the right hemisphere’s unique con-
tribution to language and communication.
Right Hemisphere Language and
Communication Functions in Adults Prosody. Prosody refers to variation in frequency, am-
plitude, duration, timbre, and rhythm. Prosodic contour
can be used to convey linguistic distinctions, such as
The role of the right cerebral hemisphere in language distinguishing between meanings of words or phrases
and communication represents a relatively young area (‘‘yellow jacket’’ meaning a kind of bee or a brightly
of research that has grown rapidly since the late 1970s. colored piece of clothing) and between speech acts (a
Recent interest in the role of the right hemisphere reflects question versus statement signaled by a rising pitch
an emphasis on language as a tool for communication toward the end of an utterance). Research with right
in natural contexts, and an awareness that normal lan- hemisphere-injured patients suggests that both expres-
guage use is the product of many regions of the two sive and receptive deficits can occur, although there is
hemispheres working in concert. Left hemisphere struc- disagreement across studies. In terms of production,
tures are routinely linked to the nuts and bolts of what there is some loss of control, sometimes manifested as an
might be termed basic language: phonology, lexical se- increased variability in pitch (specifically, fundamental
mantics, and syntax. In contrast, right hemisphere frequency) after temporal and rolandic area lesions (e.g.,
structures have been implicated in less tightly con- Colsher, Cooper, and Gra¤-Radford, 1987). Patients
strained domains, including some uses of prosody (the with right hemisphere lesions are also impaired on a va-
‘‘melody’’ of speech), metaphor, discourse such as con- riety of discrimination and production tasks (Behrens,
versations, stories, indirect requests, and other forms of 1989; Weintraub, Mesulam, and Kramer, 1981).
nonliteral language, and even the social-cognitive basis Prosody can also be used to convey a range of emo-
for discourse. These domains most closely associated tions, such as anger or sadness. Ross (1981) has pro-
with the right hemisphere are especially sensitive to con- posed a taxonomy of aprosodias to mirror the classical
text and are ideally suited to expressing nuance. This taxonomy of aphasias: a motor aprosodia associated
article will first present some general issues pertaining to with right frontal lesions, a receptive aprosodia asso-
research in these areas and then describe, in turn, repre- ciated with right temporal lesions, and a global aproso-
sentative findings relating to prosody, lexical processing dia associated with extensive frontal-temporal-parietal
and metaphor, and discourse. lesions. Other research has confirmed the separation
Right Hemisphere Language and Communication Functions in Adults 387
of an a¤ective deficit from a linguistic prosody compre- Waters, and Caplan, 1997). Another realm of impair-
hension deficit based on direct comparison between the ment centers on nonliteral language. Several studies of
e¤ects of left- and right-sided lesions (Heilman et al., sarcasm and irony comprehension suggest a problem
1984; Pell, 1998). using context (mood, sentence prosody) as a guide to
uncovering a speaker’s intended meaning (Tompkins
Lexical and Phrasal Metaphor. Several studies have and Flowers, 1985). Similarly, a host of studies show
used a sentence-picture matching task. Patients with that right-sided lesions alter patient’s production and
right hemisphere lesions, more so than aphasic patients comprehension of indirect requests, which also require
with left-sided lesions, tend to be overly literal, thereby consideration of the preceding context (Stemmer,
missing the conventional meaning of phrasal metaphors Giroux, and Joanette, 1994).
such as ‘‘he has a heavy heart’’ or idioms such as ‘‘turn- An overlapping body of work examines whether an
ing over a new leaf ’’ (Van Lancker and Kempler, 1987). underlying social cognitive impairment a¤ects discourse
The characteristic literalness has been extended to single- performance. The ability to explain behavior in terms of
word stimuli such as ‘‘warm,’’ ‘‘cold,’’ ‘‘deep,’’ and other people’s mental states, referred to as theory of
‘‘shallow’’ in a semantic similarity judgment task pre- mind, has been examined in several populations, includ-
sented to left- and right-lesioned patient groups (Brow- ing people with autism and stroke patients. Comprehen-
nell et al., 1984). Functional imaging in normal adults sion of stories and cartoons that rely on theory of
confirms that regions in the right hemisphere (includ- mind are relatively di‰cult for patients with right-sided
ing the dorsolateral prefrontal cortex, middle temporal lesions, but not for aphasic patients with left hemisphere
gyrus, and the precuneus in the medial parietal lobe) lesions (Happé, Brownell, and Winner, 1999). Also,
are di¤erentially activated during metaphor processing functional imaging studies in normal adults suggest
compared to literal sentence processing (Bottini et al., greater activation linked to theory of mind in a variety
1994). of regions, including the right middle frontal gyrus and
Studies of lexical semantic processing by the left and precuneus (Gallagher et al., 2000).
right hemispheres of normal adults, together with work There are, of course, unresolved issues. The range
on discourse, have been used to support a comprehensive of language and communication skills associated with
model. Beeman (1998), as well as others, suggests that right hemisphere injury is extensive. These skills seem
the left hemisphere is designed for focused processing of to represent several domains that will need to be exam-
the closest (literal) associations to a word demanded by ined separately, even though powerful unifying con-
preceding context and for actively dampening activation structs have been explored, such as coherence (Benowitz,
of alternative meanings. The right hemisphere, in con- Moya, and Levine, 1990) and working memory (Tomp-
trast, is sensitive to looser, more remote associations and kins et al., 1994). Finally, our understanding of local-
allows them to persist over time. Extrapolating Bee- ization of function involving the right hemisphere is
man’s model, one can imagine what happens when a poorly developed. Functions often associated with the
potential metaphor is presented: ‘‘Microsoft Corpora- right hemisphere may be as appropriately tied to pre-
tion is the tiger of the software industry.’’ The right frontal regions in either hemisphere (McDonald, 1993;
hemisphere maintains the various associations emanat- Stuss, Gallup, and Alexander, 2001).
ing from ‘‘Microsoft,’’ the topic of the metaphor, and See also discourse; discourse impairments.
‘‘tiger,’’ the vehicle; the overlapping or shared associa-
tions between ‘‘Microsoft’’ and ‘‘tiger’’ provide the —Hiram Brownell
ground for the metaphor. References
Discourse. Discourse processing requires that a listener Beeman, M. (1998). Coarse semantic coding and discourse
integrate meaning across sentences and from non- or comprehension. In M. Beeman and C. Chiarello (Eds.),
paralinguistic sources to achieve an understanding of an Right hemisphere language comprehension: Perspectives
from cognitive neuroscience (pp. 255–284). Mahwah, NJ:
entire story, joke, or conversation. Several studies have Erlbaum.
documented that right hemisphere lesions more than left Behrens, S. J. (1989). Characterizing sentence intonation in
hemisphere lesions result in decreased humor apprecia- a right hemisphere–damaged population. Brain and Lan-
tion (Shammi and Stuss, 1999). While patients with guage, 37, 181–200.
right-sided lesions have no trouble appreciating that Benowitz, L. I., Moya, K. L., and Levine, D. N. (1990).
short story jokes require an incongruous punch line, they Impaired verbal reasoning and constructional apraxia in
are deficient in apprehending exactly how a punch line subjects with right hemisphere damage. Neuropsychologia,
fits with the body of a joke on a deeper level, and they 28, 231–241.
have analogous problems with other types of discourse Bottini, G., Corcoran, R., Sterzi, R., Paulesu, E., Schenone, P.,
for which comprehension requires a reinterpretation Scarpa, P., et al. (1994). The role of the right hemisphere in
the interpretation of figurative aspects of language: A posi-
(Brownell et al., 2000). Patients with right hemisphere tron emission tomography activation study. Brain, 117,
injuries have trouble extracting gist from extended nar- 1241–1253.
rative even in the absence of an obvious need for rein- Brownell, H., Gri‰n, R., Winner, E., Friedman, O., and
terpretation (Hough, 1990), although there are highly Happé, F. (2000). Cerebral lateralization and theory of
constrained situations in which they are able to per- mind. In S. Baron-Cohen, H. Tager-Flusberg, and D. J.
form inferences that span sentence boundaries (Leonard, Cohen (Eds.), Understanding other minds: Perspectives from
autism and developmental cognitive neuroscience (2nd ed., Brownell, H. (2000). Right hemisphere contributions to under-
pp. 311–338). Oxford: Oxford University Press. standing lexical connotation and metaphor. In Y. Grodzin-
Brownell, H. H., Potter, H. H., Michelow, D., and Gardner, sky, L. Shapiro, and D. Swinney (Eds.), Language and the
H. (1984). Sensitivity to lexical denotation and connotation brain (pp. 185–201). San Diego, CA: Academic Press.
in brain-damaged patients: A double dissociation? Brain Brownell, H., and Friedman, O. (2001). Discourse ability
and Language, 22, 253–265. in patients with unilateral left and right hemisphere brain
Colsher, P. L., Cooper, W. E., and Gra¤-Radford, N. (1987). damage. In R. S. Berndt (Ed.), Handbook of neuro-
Intonation variability in the speech of right-hemisphere psychology (2nd ed., vol. 3, pp. 189–203). Amsterdam:
damaged patients. Brain and Language, 32, 379–383. Elsevier.
Gallagher, H. L., Happé, F., Brunswick, N., Fletcher, P. C., Deacon, T. W. (1997). The symbolic species: The co-evolution
Frith, U., and Frith, C. D. (2000). Reading the mind in of language and the brain. New York: Norton.
cartoons and stories: An fMRI study of ‘‘theory of mind’’ in Joanette, Y., Goulet, P., and Hannequin, D. (1990). Right
verbal and nonverbal tasks. Neuropsychologia, 38, 11–21. hemisphere and verbal communication. New York: Springer-
Happé, F., Brownell, H., and Winner, E. (1999). Acquired Verlag.
theory of mind impairments following right hemisphere Myers, P. S. (1999). Right hemisphere damage: Disorders
stroke. Cognition, 70, 211–240. of communication and cognition. San Diego, CA: Singular
Heilman, K. M., Bowers, D., Speedie, L., and Coslett, H. B. Publishing Group.
(1984). Comprehension of a¤ective and nona¤ective pros- Tompkins, C. A. (1995). Right hemisphere communication dis-
ody. Neurology, 34, 917–921. orders. San Diego, CA: Singular Publishing Group.
Hough, M. S. (1990). Narrative comprehension in adults with
right and left hemisphere brain-damage: Theme organiza-
tion. Brain and Language, 38, 253–277.
Leonard, C. L., Waters, G. S., and Caplan, D. (1997). The use
of contextual information by right brain-damaged individ- Right Hemisphere Language Disorders
uals in the resolution of ambiguous pronouns. Brain and
Language, 57, 309–342.
McDonald, S. (1993). Viewing the brain sideways? Frontal
For nearly 150 years, the language-dominant left cere-
versus right hemisphere explanations of non-aphasic lan-
guage disorders. Aphasiology, 7, 535–549. bral hemisphere has dominated research and clinical
Pell, M. (1998). Recognition of prosody following unilateral concern about language disorders that accompany brain
brain lesion: Influence of functional and structural attrib- damage in adults. However, it is now well established
utes of prosodic contours. Neuropsychologia, 36, 701–715. that unilateral right hemisphere brain damage can sub-
Ross, E. D. (1981). The aprosodias. Archives of Neurology, 38, stantially impair language and communication. The
561–569. language deficits associated with right hemisphere dam-
Shammi, P., and Stuss, D. T. (1999). Humour appreciation: A age, while often quite socially handicapping (Tompkins
role of the right frontal lobe. Brain, 122, 657–666. et al., 1998), are little understood. This article focuses on
Stemmer, B., Giroux, F., and Joanette, Y. (1994). Produc- disorders characterized by damage restricted to the right
tion and evaluation of requests by right hemisphere brain-
cerebral hemisphere in adults. Such individuals may
damaged individuals. Brain and Language, 47, 1–31.
Stuss, D. T., Gallup, G. G., and Alexander, M. P. (2001). The have di‰culties with some basic language tasks but are
frontal lobes are necessary for ‘‘theory of mind.’’ Brain, 124, not generally considered to have aphasia, because pho-
279–286. nology, morphology, syntax, and many aspects of se-
Tompkins, C. A., Bloise, C. G. R., Timko, M. L., and Baum- mantics are largely intact. About 50% of adults with
gaertner, A. (1994). Working memory and inference revi- right hemisphere damage have a verbal communication
sion in brain-damaged and normally aging adults. Journal disorder (Joanette et al., 1990). In one study, 93% of 123
of Speech and Hearing Research, 37, 896–912. adults with right hemisphere damage in a rehabilitation
Tompkins, C. A., and Flowers, C. (1985). Perception of emo- center had at least one cognitive deficit with the potential
tional intonation by brain-damaged adults: The influence to disrupt communication and social interaction (Blake
of task processing levels. Journal of Speech and Hearing
et al., 2002).
Research, 28, 527–538.
Van Lancker, D. R., and Kempler, D. (1987). Comprehension Heterogeneity typifies the population of adults with
of familiar phrases by left but not by right hemisphere right hemisphere damage: not all will have all charac-
damaged patients. Brain and Language, 32, 265–277. teristic communicative problems, and some will have no
Weintraub, S., Mesulam, M.-M., and Kramer, L. (1981). Dis- discernible problems. This heterogeneity often is unac-
turbances in prosody: A right hemisphere contribution to counted for in sample selection or data analysis, and
language. Archives of Neurology, 38, 742–744. its potential e¤ects are compounded by the small sam-
ples in most studies of language in patients with right
Further Readings hemisphere damage. A related di‰culty involves con-
trol group composition in research on language deficits
Baron-Cohen, S., Tager-Flusberg, H., and Cohen, D. J. (Eds.).
associated with right hemisphere damage. Non-brain-
(2000). Understanding other minds: Perspectives from autism
and developmental cognitive neuroscience (2nd ed.). Oxford: damaged samples typically comprise individuals who do
Oxford University Press. not have complications associated with being a patient.
Beeman, M., and Chiarello, C. (Eds.). (1998). Right hemisphere Individuals with left brain damage often are excluded
language comprehension: Perspectives from cognitive neuro- because they cannot perform the more complex tasks
science. Mahwah, NJ: Erlbaum. that are most revealing of language functioning after
Right Hemisphere Language Disorders 389
right hemisphere damage, and because di¤erences in Related Emotional and Nonverbal Processing Deficits.
impairment profiles make it di‰cult to equate groups Emotional processing deficits also are a hallmark of
for severity. Consequently, it is impossible to determine right hemisphere damage (Van Lancker and Pachana,
whether observed deficits are specific to right hemisphere 1998), potentially contributing to di‰culties with social
damage. Another major issue is the lack of consensus exchange. Adults with right frontal lesions may be emo-
on how to define or even what to call language deficits tionally disinhibited, while those with more posterior
associated with right hemisphere damage (cf. Myers, damage may minimize and rationalize their deficits.
1999), either in totality or as individual components of Many adults with right hemisphere damage demon-
an aggregate syndrome. Conceptual and terminological strate reduced nonverbal animation and coverbal
imprecision, and apparent overlap, are common in re- behaviors (Blonder et al., 1993). Some exhibit emotional
ferring to targets of inquiry such as nonliteral language interpretation deficits across modalities (pictures, body
processing, inferencing, integration, and reasoning from language, facial and vocal expressions, and complex
a theory of mind (Blake et al., 2002). Conclusions about discourse). Hypoarousal can occur in the presence of
language deficits after right hemisphere damage also are emotional stimuli, though not necessarily with impaired
complicated by intraindividual performance variability, emotional recognition (Zoccolotti, Scabini, and Violani,
whether due to factors such as di¤erential task process- 1982). Some work suggests problems in the way adults
ing requirements (e.g., Tompkins and Lehman, 1998), or with right hemisphere damage apply a relatively intact
to time following onset of injury (Colsher et al., 1987). appreciation of emotional material. For example, indi-
Finally, many language di‰culties ostensibly related to viduals with right hemisphere damage may do well in-
right hemisphere damage stem from or are exacerbated ferring the a¤ect conveyed by sentences that describe
by other perceptual and cognitive impairments, some emotional situations (Tompkins and Flowers, 1985) but
of which are as yet unidentified but others of which falter when required to match emotional inferences with
have not been evaluated consistently. Chief among these specific stimulus representations or settings (Cicone,
complications are hemispatial neglect, other attentional Wapner, and Gardner, 1980). Emotional misinterpreta-
di‰culties, and impairments of working memory and tion also may reflect problems appreciating the visuo-
related processing resources. spatial and acoustic/prosodic stimuli in which emotional
messages are embedded.
Overview and Potential Accounts of Symptoms Lexical-Semantic Processing. Right hemisphere dam-
in Characteristic Deficit Domains age is not usually considered to impair lexical structure,
but it has been shown to diminish performance on tasks
Prosody. Prosody is not uniquely a right hemisphere that involve lexical-semantic processing, such as picture
function (Baum and Pell, 1999). However, many adults naming, word-picture matching, word generation, and
with right hemisphere damage have di‰culty producing semantic judgment. These findings have been taken to
and comprehending prosody, whether it serves linguistic indicate a general, subtle, but specific deficit in lexical-
functions or conveys nuance and a¤ect (Joanette, Gou- semantic processing (e.g., Gainotti et al., 1983). How-
let, and Hannequin, 1990; Baum and Pell, 1999). Evi- ever, visual-perceptual problems could account for
dence is mixed on the occurrence and nature of prosodic di‰culties in many such tasks. Additionally, semantic
problems in right hemisphere damage. Speech prosody priming studies indicate no lexical-semantic processing
is most commonly described as flat, but by contrast may deficit after right hemisphere damage, under either au-
be characterized by an abnormally high fundamental tomatic or controlled activation (Joanette and Goulet,
frequency and variability (Colsher, Cooper, and Gra¤- 1998; Tompkins, Fassbinder, et al., 2002). This suggests
Radford, 1987). Dysarthrias often may be a source of that right hemisphere damage does not a¤ect represen-
spoken prosodic impairment (Wertz et al., 1998). Pro- tation and initial activation processes in the lexical-
sodic interpretation di‰culties largely may be due to semantic system (although currently it is not possible
perceptual deficits, apart from the linguistic or emotional to rule out slowed initial processing). Because clear dif-
characteristics of a message (Tompkins and Flowers, ficulties are evident only on metalinguistic tasks that
1985; Joanette, Goulet, and Hannequin, 1990). Some obscure lexical-semantic operations per se, the ‘‘lexical-
prosodic production and comprehension di‰culties also semantic deficits’’ of right hemisphere damage could re-
may stem from more general emotional processing defi- flect di‰culties with other specific task requirements or
cits (Van Lancker and Pachana, 1998). more general attentional or working memory limita-
Prosodic expression and comprehension deficits can tions (Joanette and Goulet, 1998; Tompkins, Fassbinder,
be dissociated, and Ross (1981) proposed a taxonomy et al., 2002).
for prosodic impairments, relating them to right hemi- Adults with right hemisphere damage also can have
sphere lesion site. However, the proposed functional- di‰culty making semantic judgments about words with
anatomical correlations have not been substantiated in metaphorical or emotional content. Thus, some inves-
other research (e.g., Wertz et al., 1998), and the neuro- tigators have suggested that right hemisphere damage
logical correlates of prosodic disruption are more com- impairs specific semantic domains. A prominent account
plex than Ross’s framework suggests (Baum and Pell, of di‰culties with metaphorical meanings was derived
1999; Joanette, Goulet, and Hannequin, 1990). from studies of hemispheric di¤erences in non-brain-
damaged individuals. This work suggests that as words marized by Tompkins, Fassbinder, et al. (2002), infer-
are processed for comprehension, the right hemisphere ence generation per se is not a primary interpretive
is solely responsible for maintaining activation of a roadblock. Rather, di‰culties appear to involve inte-
rather di¤use network of peripheral or secondary inter- gration processes that are needed to revise or repair
pretations and remote associates of those words. This erroneous interpretations that were activated by such
broad-based lexical-semantic activation is proposed messages. Relatedly, adults with right hemisphere dam-
to underpin figurative language interpretation, among age can have di‰culty synthesizing their mental repre-
other comprehension abilities (e.g., Beeman, 1998). By sentations of stimulus elements and stimulus contexts
extrapolation, right hemisphere damage is assumed to in order to determine nonliteral intent, as expressed in
impair the maintenance of weak associates and second- jokes, idioms, indirect requests, connotative meanings
ary meanings, making, for example, metaphorical inter- of words, and conversational irony. Again, right hemi-
pretations unavailable. Contrary to this proposal, sphere damage does not clearly a¤ect the representation
however, adults with right hemisphere lesions evince or activation of such nonliteral meanings, and adults
both initial, automatic priming and prolonged priming with right-sided lesions can represent relevant elements
of metaphorical meanings of words (i.e., 1000 ms after of stimulus contexts in nonliteral processing tasks (see
hearing the word; Tompkins, 1990). Again, di‰culties Tompkins, Fassbinder, et al., 2002). Finally, adults with
do not occur on implicit tasks with minimal strategic right hemisphere damage may perform poorly on tasks
processing demands; thus, factors related to processing that require reasoning from a ‘‘theory of mind,’’ which
capacity and processing load cannot be excluded as the involves an understanding of the ways in which knowl-
source of domain-specific deficits for metaphorical words edge, beliefs, and motivations guide behavior. Once
in adults with right hemisphere damage (Joanette and again, these di‰culties cannot be attributed to a failure
Goulet, 1998; Tompkins, Fassbinder, et al., 2002). The to understand or represent individual elements of com-
processing of emotional words has not been investigated prehension scenarios (Winner et al., 1998). Overall, for
with implicit methods, so the influences of processing these deficit areas, impaired performance by adults with
demands cannot be evaluated at present. right-sided lesions is evident in conditions of relatively
high processing demand.
Discourse and Conversation. A growing literature There are several emerging accounts of di‰culties
suggests possible impairments of building, extracting, experienced by adults with right hemisphere damage
applying, or manipulating the mental structures that in constructing coherent, integrated mental structures
guide discourse processing after right hemisphere dam- that support discourse production and comprehension.
age (Beeman, 1998; Tompkins, Fassbinder, et al., 2002). Brownell and Martino (1998) implicate problems with
Again, contrasting findings abound (Tompkins, 1995). ‘‘self-directed inference’’ (p. 325), which refers to com-
For example, although the output of adults with right prehenders’ e¤orts to discover and elaborate an in-
hemisphere damage most often is described as verbose, terpretive framework when overlearned interpretive
digressive, and lacking in informative content, some routines are inadequate. Reasoning from a theory of
produce a paucity of spoken discourse (Myers, 1999). mind also is gaining popularity as an explanatory con-
Deficits in organizing and integrating elements of dis- struct (Brownell and Martino, 1998). Another promi-
course structure may be evident as well. As with the nent hypothesis derives from characterization of the
lexical-semantic investigations, confounds introduced by normal right hemisphere’s ‘‘coarse semantic coding’’
typically metalinguistic assessment tasks (Tompkins and properties (Beeman, 1998, p. 255), or the broad-based
Baumgaertner, 1998) create di‰culties for interpreting activation of di¤use, peripheral, and secondary mean-
much of the literature on discourse in adults with right ings of words. Damage to the right hemisphere presum-
hemisphere lesions. ably creates di‰culty in activating and/or maintaining
Discourse production often is investigated in terms of the distant associates and subordinate meanings on
conversational pragmatics. In the few available studies, which connotative interpretations, various inferences,
heterogeneity again is the rule, with some participants and some discourse integration processes rely. However,
with right hemisphere damage displaying no deficits. Tompkins and colleagues (2000, 2001) demonstrated
Adults with right hemisphere damage most often are that adults with right hemisphere damage activate mul-
reported to have di‰culties with eye contact and some, tiple meanings of lexical and inferential ambiguities,
but not all, turn-taking parameters (Prutting and Kirch- and that abnormally prolonged activation of con-
ner, 1987; Kennedy et al., 1994). Idiosyncratic and am- textually incompatible interpretations predicts aspects of
biguous reference, poor re-use of common referents, and discourse comprehension after right hemisphere lesions.
excessive attention to peripheral details also may occur These authors propose that right hemisphere damage
(Chantraine, Joanette, and Ska, 1998). may impair a comprehension mechanism by which con-
Discourse comprehension problems are especially textually incompatible interpretations are suppressed,
evident when adults with right hemisphere damage must and they argue that this ‘‘suppression deficit’’ account
reconcile multiple, seemingly incongruent inferences. accommodates a variety of other existing data. The sup-
They have particular problems with messages that in- pression deficit and maintenance deficit views may be
duce conflicting interpretations, such those that contain reconcilable by considering within-hemisphere site of le-
ambiguities or violate canonical expectations. As sum- sion, a possibility currently under investigation (Tomp-
Right Hemisphere Language Disorders 391
kins, 2002). More generally, Stemmer and Joanette Colsher, P. L., Cooper, W. E., and Gra¤-Radford, N. (1987).
(1998) suggest that discourse deficits in individuals with Intonational variability in the speech of right-hemisphere
right-sided lesions reflect di‰culty with constructing and damaged patients. Brain and Language, 32, 379–383.
integrating new conceptual models. In a di¤erent vein, Gainotti, G., Caltagirone, C., Miceli, G., and Masullo, C.
(1983). Selective impairment of semantic-lexical discrimina-
Brownell and Martino (1998) maintain that many
tion in right-brain-damaged patients. In E. Perecman (Ed.),
impairments associated with right hemisphere damage Cognitive processing in the right hemisphere (pp. 149–167).
stem from a social disconnection or diminished interest New York: Academic Press.
in people. Finally, because the expression of deficits in Joanette, Y., and Goulet, P. (1998). Right hemisphere and the
various domains seems to be moderated by processing semantic processing of words: Is the contribution specific
abilities and demands, factors related to processing ca- or not? In E. G. Visch-Brink and R. Batiaanse (Eds.),
pacity and processing load need to be considered in a full Linguistic levels in aphasiology (pp. 19–34). San Diego, CA:
account of impairments and skills in individuals with Singular Publishing Group.
right hemisphere damage (Tompkins, Blake, et al., 2002; Joanette, Y., Goulet, P., and Hannequin, D. (1990). Right
Tompkins, Fassbinder, et al., 2002). hemisphere and verbal communication. New York: Springer-
Verlag.
Kennedy, M., Strand, E., Burton, W., and Peterson, C.
Impact and Management (1994). Analysis of first-encounter conversations of right-
The cognitive and behavioral problems of adults with hemisphere-damaged adults. Clinical Aphasiology, 22,
right hemisphere lesions can interfere with judgment 67–80.
Klono¤, P. S., Sheperd, J. C., O’Brien, K. P., Chiapello, D. A.,
and social skills, family relationships, functional living
and Hodak, J. A. (1990). Rehabilitation and outcome of
activities, and the potential to return to productive work right-hemisphere stroke patients: Challenges to traditional
(Klono¤ et al., 1990; Tompkins et al., 1998). Clinical diagnostic and treatment methods. Neuropsychology, 4,
management typically is symptom driven, although 147–163.
various authors emphasize the value of a theoretically Myers, P. S. (1999). Right hemisphere damage: Disorders of
oriented approach (Tompkins, 1995; Myers, 1999). communication and cognition. San Diego, CA: Singular
Treatment that focuses on the remediation of deficits Publishing Group.
may miss the bigger picture, in which therapeutic benefit Prutting, C. A., and Kirchner, D. M. (1987). A clinical ap-
is assessed in terms of its e¤ects on daily life activities praisal of the pragmatic aspects of language. Journal of
and psychosocial functioning (Tompkins et al., 1998; Speech and Hearing Disorders, 52, 105–119.
Ross, E. D. (1981). The aprosodias: Functional-anatomic or-
Tompkins, Fassbinder, et al., 2002). Treatment research
ganization of the a¤ective components of language in the
is urgently needed for this population. right hemisphere. Archives of Neurology, 38, 561–569.
See also discourse impairments; prosidic deficits. Stemmer, B., and Joanette, Y. (1998). The interpretation of
—Connie A. Tompkins and Wiltrud Fassbinder narrative discourse of brain-damaged individuals within the
framework of a multilevel discourse model. In M. Beeman
References and C. Chiarello (Eds.), Right hemisphere language com-
prehension: Perspectives from cognitive neuroscience (pp.
Baum, S. R., and Pell, M. D. (1999). The neural bases of 329–348). Mahwah, NJ: Erlbaum.
prosody: Insights from lesion studies and neuroimaging. Tompkins, C. A. (1990). Knowledge and strategies for pro-
Aphasiology, 13, 581–608. cessing lexical metaphor after right or left hemisphere
Beeman, M. (1998). Coarse semantic coding and discourse brain damage. Journal of Speech and Hearing Research, 33,
comprehension. In M. Beeman and C. Chiarello (Eds.), Right 307–316.
hemisphere language comprehension: Perspectives from cog- Tompkins, C. A. (1995). Right hemisphere communication dis-
nitive neuroscience (pp. 255–284). Mahwah, NJ: Erlbaum. orders: Theory and management. San Diego, CA: Singular
Blake, M., Du¤y, J. R., Myers, P. S., and Tompkins, C. A. Publishing Group.
(2002). Prevalence and patterns of right hemisphere cogni- Tompkins, C. A. (2002). Comprehension impairment and right
tive/communicative deficits: Retrospective data from an brain damage. Grant funded by the National Institutes of
inpatient rehabilitation unit. Aphasiology, 16, 537–547. Health: National Institute on Deafness and Other Commu-
Blonder, L. X., Burns, A. F., Bowers, D., Moore, R. W., and nicative Disorders (DC01820).
Heilman, K. M. (1993). Right hemisphere facial expressiv- Tompkins, C. A., and Baumgaertner, A. (1998). Clinical value
ity during natural conversation. Brain and Cognition, 21, of online measures for adults with right hemisphere brain
44–56. damage. American Journal of Speech-Language Pathology,
Brownell, H., and Martino, G. (1998). Deficits in inference 7, 68–74.
and social cognition: The e¤ects of right hemisphere brain Tompkins, C. A., Baumgaertner, A., Lehman, M. T., and
damage on discourse. In M. Beeman and C. Chiarello Fassbinder, W. (2000). Mechanisms of discourse compre-
(Eds.), Right hemisphere language comprehension: Per- hension impairment after right hemisphere brain damage:
spectives from cognitive neuroscience (pp. 309–328). Mah- Suppression in lexical ambiguity resolution. Journal of
wah, NJ: Erlbaum. Speech, Language, and Hearing Research, 43, 62–78.
Chantraine, Y., Joanette, Y., and Ska, B. (1998). Conversa- Tompkins, C. A., Blake, M. L., Baumgaertner, A., and Fass-
tional abilities in patients with right hemisphere damage. binder, W. (2002). Characterizing comprehension di‰culties
Journal of Neurolinguistics, 11, 21–32. after right brain damage: Attentional demands of suppres-
Cicone, M., Wapner, W., and Gardner, H. (1980). Sensitivity sion function. Aphasiology, 16, 559–572.
to emotional expressions and situations in organic patients. Tompkins, C. A., Fassbinder, W., Lehman-Blake, M. T., and
Cortex, 16, 145–158. Baumgaertner, A. (2002). The nature and implications of
right hemisphere language disorders: Issues in search of aphasia and related disorders (pp. 59–77). Boston: Little,
answers. In A. Hillis (Ed.), Handbook of adult language Brown.
disorders: Integrating cognitive neuropsychology, neurology, Du¤y, J. R., and Myers, P. S. (1991). Group comparisons
and rehabilitation (pp. 429–448). Philadelphia: Psychology across neurologic communication disorders: Some method-
Press. ological issues. In T. E. Prescott (Ed.), Clinical aphasiology
Tompkins, C. A., and Flowers, C. R. (1985). Perception of (pp. 1–14). Austin, TX: Pro-Ed.
emotional intonation by brain-damaged adults: The influ- Joanette, Y., and Ansaldo, A. I. (1999). Clinical note:
ence of task processing levels. Journal of Speech and Hear- Acquired pragmatic impairments and aphasia. Brain and
ing Research, 28, 527–538. Language, 68, 529–534.
Tompkins, C. A., and Lehman, M. T. (1998). Interpreting Joanette, Y., and Goulet, P. (1990). Narrative discourse
intended meanings after right hemisphere brain damage: An in right-brain-damaged right-handers. In Y. Joanette and
analysis of evidence, potential accounts, and clinical impli- H. H. Brownell (Eds.), Discourse ability and brain-damage:
cations. Topics in Stroke Rehabilitation, 5, 29–47. Theoretical and empirical perspectives (pp. 131–153). New
Tompkins, C. A., Lehman-Blake, M. T., Baumgaertner, A., York: Springer-Verlag.
and Fassbinder, W. (2001). Mechanisms of discourse Joanette, Y., and Goulet, P. (1994). Right hemisphere and
comprehension impairment after right hemisphere brain verbal communication: Conceptual, methodological, and
damage: Suppression in inferential ambiguity resolution. clinical issues. In M. Lemme (Ed.), Clinical aphasiology
Journal of Speech, Language, and Hearing Research, 44, (pp. 1–24). Austin, TX: Pro-Ed.
400–415. Kennedy, M. R. T. (2000). Topic scenes in conversations with
Tompkins, C. A., Lehman, M. T., Wyatt, A., and Schulz, R. adults with right-hemisphere brain damage. American Jour-
(1998). Functional outcome assessment of adults with right nal of Speech-Language Pathology, 9, 72–86.
hemisphere brain damage. Seminars in Speech and Lan- Kent, R. D., and Rosenbek, J. C. (1982). Prosodic disturbance
guage, 19, 303–321. and neurological lesion. Brain and Language, 15, 259–291.
Van Lancker, D., and Pachana, N. A. (1998). The influence of Lehman-Blake, M. T., and Tompkins, C. A. (2001). Predictive
emotion on language and communication disorders. In B. inferencing in adults with right hemisphere brain damage.
Stemmer and H. A. Whitaker (Eds.), Handbook of neuro- Journal of Speech, Language, and Hearing Research, 44,
linguistics (pp. 301–311). San Diego, CA: Academic Press. 639–654.
Wertz, R. T., Henschel, C. R., Auther, L. L., Ashford, J. R., Lojek-Osiejuk, E. (1996). Knowledge of scripts reflected in
and Kirshner, H. S. (1998). A¤ective prosodic disturbance discourse of aphasics and right-brain-damaged patients.
subsequent to right hemisphere stroke: A clinical applica- Brain and Language, 29, 68–80.
tion. Journal of Neurolinguistics, 11, 89–102. Myers, P. S. (2001). Toward a definition of RHD syndrome.
Winner, E., Brownell, H. H., Happe, F., Blum, A., and Pincus, Aphasiology, 15, 913–918.
D. (1998). Distinguishing lies from jokes: Theory of mind Myers, P. S., and Brookshire, R. H. (1996). E¤ect of visual
deficits and discourse interpretation in right hemisphere and inferential variables on scene descriptions by right-
brain-damaged patients. Brain and Language, 62, 89–106. hemisphere-damaged and non-brain-damaged adults. Jour-
Zoccolotti, P., Scabini, D., and Violani, C. (1982). Electro- nal of Speech and Hearing Research, 39, 870–880.
dermal responses in patients with unilateral brain damage. Stemmer, B., Giroux, F., and Joanette, Y. (1994). Produc-
Journal of Clinical Neuropsychology, 4, 143–150. tion and evaluation of requests by right-hemisphere brain-
damaged individuals. Brain and Language, 47, 1–31.
Tompkins, C. A., Bloise, C. G. R., Timko, M. L., and Baum-
Further Readings gaertner, A. (1994). Working memory and inference revi-
sion in brain-damaged and normally aging adults. Journal
Ardilla, A., and Rosselli, M. (1993). Spatial agraphia. Brain of Speech and Hearing Research, 37, 896–912.
and Cognition, 22, 137–147. Tompkins, C. A., Boada, R., and McGarry, K. (1992). The
Beeman, M., and Chiarello, C. (1998). Right hemisphere lan- access and processing of familiar idioms by brain-damaged
guage comprehension: Perspectives from cognitive neuro- and normally aging adults. Journal of Speech and Hearing
science. Mahwah, NJ: Erlbaum. Research, 35, 626–637.
Bihrle, A., Brownell, H. H., and Gardner, H. (1988). Humor Trupe, E. H., and Hillis, A. (1985). Paucity vs. verbosity: An-
and the right hemisphere: A narrative perspective. In H. A. other analysis of right hemisphere communication deficits.
Whitaker (Ed.), Contemporary reviews in neuropsychology In R. H. Brookshire (Ed.), Clinical aphasiology (pp. 83–96).
(pp. 109–126). New York: Springer-Verlag. Minneapolis: BRK.
Blonder, L. X., Bowers, D., and Heilman, K. M. (1991). The
role of the right hemisphere in emotional communication.
Brain, 114, 1115–1127.
Borod, J. C., Cicero, B. A., Obler, L. K., Welkowitz, J., Ehran, Segmentation of Spoken Language by
H. M., Santschi, C., et al. (1998). Right hemisphere emo- Normal Adult Listeners
tional perception: Evidence across multiple channels. Neu-
ropsychology, 12, 446–458.
Chiarello, C. (1998). On codes of meaning and the meaning Listening to spoken language usually seems e¤ortless,
of codes: Semantic access and retrieval within and between
but the processes involved are complex. A continuous
hemispheres. In M. Beeman and C. Chiarello (Eds.), Right
hemisphere language comprehension: Perspectives from cog- acoustic signal must be translated into meaning so that
nitive neuroscience (pp. 141–160). Mahwah, NJ: Erlbaum. the listener can understand the speaker’s intent. The
Cooper, W. E., and Klouda, G. V. (1987). Intonation in mapping of sound to meaning proceeds via the lexicon—
aphasic and right-hemisphere-damaged patients. In J. H. our store of known words. Any utterance we hear may
Ryalls (Ed.), Phonetic approaches to speech production in be novel to us, but the words it contains are familiar, and
Segmentation of Spoken Language by Normal Adult Listeners 393
to understand the utterance we must therefore identify consonant, respectively), then responses are slower than
the words of which it is composed. if the cues match. This shows that listeners are sensitive
We know a great many words; an educated adult’s to the coarticulatory mismatch and must have processed
vocabulary has been estimated at around 150,000 words. the consonant place cues in the vowel. However, the
Entries in the mental lexicon may include, besides stand- responses are still slower when the mismatching troo-
alone words, grammatical morphemes such as prefixes comes from troop than when it comes from trook. This
and su‰xes and multiword phrases such as idioms and suggests that the processing of consonant cues in the
cliches. Languages also di¤er widely in how they con- vowel has caused activation of the existing compatible
struct word forms, and this too will a¤ect what is stored real-word troop (Marslen-Wilson and Warren, 1994;
in the lexicon. But in any language, listening involves McQueen, Norris, and Cutler, 1999).
mapping the acoustic signal onto stored meanings. Second, multiple candidate words are simultaneously
The continuity of utterances means that boundaries activated during the listening process, including words
between individual words in speech are not overtly that are merely accidentally present in a speech signal.
marked. Speakers do not pause between words but run Thus, hearing strange-acting may activate stray, train,
them into one another. The problem of segmenting a range, jack, and so on, as well as the intended words.
speech signal into words is compounded by the fact that Evidence for multiple activation comes from cross-
words themselves are not highly distinctive. All the modal priming experiments in which a word-initial frag-
words we know are constructed of just a handful of dif- ment facilitates recognition of di¤erent words that it
ferent sounds; on average, the phonetic repertoire of a might become. Thus, lexical decision responses for visu-
language contains 30–40 contrasting sounds (Maddie- ally presented ‘‘captain’’ or ‘‘captive’’ are both facili-
son, 1984). As a consequence, words inevitably resemble tated when listeners have just heard the fragment
other words, and may have other words embedded capt- (compared with some other control fragment).
within them (thus strange contains stray, strain, train, Moreover, both are facilitated even if only one of them
rain, and range). Word recognition therefore involves matches the context (Zwitserlood, 1989).
identifying the correct form among a large number of Third, there is active competition between alternative
similar forms, in a stream in which they abut one an- candidate words. The more active a candidate word is,
other without a break (strange act contains jack and the more it may suppress its rivals, and the more com-
jacked ). petitors a word has, the more suppression it may un-
The only segmentation that is logically required is to dergo. Evidence for competition between simultaneously
find the words in speech. Whether listening also involves activated candidate words comes from experiments in
some intermediate level of coding is an issue of conten- which listeners must spot any real words occurring in
tion among speech researchers. Do listeners extract spoken nonsense strings. If the rest of the string partially
whole syllables from the speech stream and use this syl- activates a competitor word, then spotting the real
labic representation to contact the lexicon? Do they ex- embedded word is slowed. For instance, listeners spot
tract phonemes from the input, so that listening involves mess less rapidly in domess (which partially activates
an intermediate stage in which heard utterances are domestic, a competitor for the same portion of the signal
represented as strings of phonemes? Or does listening that supports mess) than in nemess (which supports no
involve matching speech input against holistic stored other word; McQueen, Norris, and Cutler, 1994; see
forms? The available evidence does not yet allow us to also Norris, McQueen, and Cutler, 1995; Vroomen and
distinguish among these positions (and other variants). de Gelder, 1995; Soto-Faraco, Sebastián-Gallés, and
There is agreement, however, on other aspects of the Cutler, 2001).
spoken-word recognition process. First, information in Because activated and competing words need not be
the signal is evaluated continuously and the results are aligned with one another, the competition process o¤ers
passed to the lexicon. Coarticulatory e¤ects that cause a potential means of segmenting the utterance. Thus, al-
cues to adjacent phonemes to overlap in time are e‰- though recognition of strange-acting may involve com-
ciently used. Thus robe, rope, wrote, road, and rogue all petition from stray, range, jack, and so on, this will
begin with ro-, but the vowel will in each case include eventually yield to joint inhibition from the two intended
anticipatory information about the place of articulation words, which receive greater support from the signal.
of the following consonant, and listeners can exploit this Adult listeners can also use information which their
(e.g., to narrow the field of candidates to only rope and linguistic experience suggests to be correlated with
robe, eliminating rogue, road, and wrote). the presence of a word boundary. For instance, in En-
Evidence for continuous evaluation comes from glish the phoneme sequence [mg] never occurs word-
experiments in which listeners perform lexical decision internally, so the occurrence of this sequence must imply
( judging whether a spoken string is indeed a real word) a word boundary (some go, tame goose); sequences such
on speech that has been cross-spliced so that the coarti- as [pf ] or [ml] or [zw] never occur syllable-internally, so
culatory e¤ects are no longer reliable. Thus, when lis- this sequence implies at least a syllable boundary (cupful,
teners hear troot they should respond ‘‘no’’—troot is seemly, beeswax). Listeners more rapidly spot embedded
not a word. If troot is cross-spliced so that a final -t is words whose edges are aligned with such a boundary-
appended to a troo- from either trook or troop (which correlated sequence (e.g., rock is spotted more easily
give coarticulatory cues to an upcoming velar or bilabial in foomrock than in foogrock; McQueen, 1998). Also,
words that begin with a common phoneme sequence are caressed; Johnson et al., 2003); that is, they are already
easier to extract from a preceding context than words sensitive to the apparently universal constraint on possi-
that begin with an infrequent sequence (e.g., in golnook ble words.
versus golnag, it will be easier to spot nag, which shares Finally, segmentation of second languages in later
its beginning with natural, navigate, narrow, nap, and life is not aided by the e‰ciency with which listeners
many other words; van der Lugt, 2001; see also Cairns exploit language-specific structure in recognizing speech.
et al., 1997). Segmentation procedures suitable for the native lan-
These latter sources of information are, of course, guage can be inappropriately applied to non-native input
necessarily language-specific. It is a characteristic of a (Cutler et al., 1986; Otake et al., 1993; Cutler and Otake,
particular vocabulary that more words begin with the 1994; Weber, 2001). This is one e¤ect making listening
na- of nag than with the noo- of nook; likewise, it is to a second language paradoxically harder than, for in-
vocabulary-specific that sequences such as [pf ] or [zw] or stance, reading the same language.
[ml] cannot occur within a syllable. Each of these three See also phonology and adult aphasia.
sequences is in fact legitimately syllable-internal in some
language ([pf ], for instance, in German: Pferd, Kopf ). —Anne Cutler
Other language-specific information is also used in
segmentation, notably rhythmic structure. In languages References
such as English and Dutch, most words begin with Cairns, P., Shillcock, R., Chater, N., and Levy, J. (1997).
stressed syllables, and listeners find it easier to segment Bootstrapping word boundaries: A bottom-up corpus-based
speech at the onset of stressed syllables (Cutler and approach to speech segmentation. Cognitive Psychology, 33,
Norris, 1988; Vroomen, van Zon, and de Gelder, 1996). 111–153.
This can be clearly seen in segmentation errors, as when Cutler, A., and Butterfield, S. (1992). Rhythmic cues to speech
a pop song line She’s a must to avoid is widely misper- segmentation: Evidence from juncture misperception. Jour-
ceived as She’s a muscular boy—the strong syllable void nal of Memory and Language, 31, 218–236.
is taken to be the onset of a new word, while the weak Cutler, A., Demuth, K., and McQueen, J. M. (2002). Univer-
sality versus language-specificity in listening to running
syllables to and a- are taken to be noninitial (Cutler and
speech. Psychological Science, 13, 258–262.
Butterfield, 1992). Cutler, A., Mehler, J., Norris, D., and Seguı́, J. (1986). The
The stress rhythm of English and Dutch is not uni- syllable’s di¤ering role in the segmentation of French and
versal; many other languages have di¤erent rhythmic English. Journal of Memory and Language, 25, 385–400.
structures. Indeed, syllabically based rhythm in French is Cutler, A., and Norris, D. (1988). The role of strong syllables
accompanied by syllabic segmentation in French listen- in segmentation for lexical access. Journal of Experi-
ing experiments (Mehler et al., 1981; Cutler et al., 1986; mental Psychology: Human Perception and Performance, 14,
Kolinsky, Morais, and Cluytens, 1995), while moraic 113–121.
rhythm in Japanese likewise accompanies moraic seg- Cutler, A., and Otake, T. (1994). Mora or phoneme? Further
mentation by Japanese listeners (Otake et al., 1993; evidence for language-specific listening. Journal of Memory
and Language, 33, 824–844.
Cutler and Otake, 1994). Johnson, E. K., Jusczyk, P. W., Cutler, A., and Norris, D.
Thus, although the type of rhythm is language- (2003). Lexical viability constraints on speech segmentation
specific, its use in speech segmentation seems universal. by infants without a lexicon. Cognitive Psychology, 46, 65–
Other universal constraints on segmentation exist, for 97.
example, to limit activation of spurious embedded com- Jusczyk, P. W., and Aslin, R. N. (1995). Infants’ detection of
petitors. It is harder to spot a word if the residual context sound patterns of words in fluent speech. Cognitive Psy-
contains only consonants (thus, apple is harder to find chology, 29, 1–23.
in fapple than in vu¤apple; Norris et al., 1997), an e¤ect Kolinksy, R., Morais, J., and Cluytens, M. (1995). Intermedi-
explained as a primitive filter selecting for possible ate representations in spoken word recognition: Evidence
words—vu¤ is not a word, but it might have been one, from word illusions. Journal of Memory and Language, 34,
19–40.
while f could never be a word. This constraint would Maddieson, I. (1984). Patterns of sounds. Cambridge, U.K.:
operate to rule out many spuriously present words in Cambridge University Press.
speech (such as tray and ray in stray). It is not a¤ected Marslen-Wilson, W. D., and Warren, P. (1994). Levels of
by what may be a word in a particular language (Norris perceptual representation and process in lexical access:
et al., 2001; Cutler, Demuth, and McQueen, 2002) and Words, phonemes, and features. Psychological Review, 101,
thus appears to be universal. 653–675.
The ability to extract words from continuous speech McQueen, J. (1998). Segmentation of continuous speech using
starts early in life, as shown by experiments in which phonotactics. Journal of Memory and Language, 39, 21–
infants listen longer to passages containing words that 46.
they had previously heard in isolation than to wholly McQueen, J. M., Norris, D., and Cutler, A. (1994). Competi-
tion in spoken word recognition: Spotting words in other
new passages (Jusczyk and Aslin, 1995); none of the words. Journal of Experimental Psychology: Learning,
passages can be comprehended by these young listeners, Memory and Cognition, 20, 621–638.
but they can recognize familiar strings embedded in the McQueen, J. M., Norris, D., and Cutler, A. (1999). Lexical
fluent speech. One-year-olds also detect familiar strings influence in phonetic decision making: Evidence from sub-
less easily if they are embedded in a context without a categorical mismatches. Journal of Experimental Psychol-
vowel (e.g., rest is found less easily in crest than in ogy: Human Perception and Performance, 25, 1363–1389.
Semantics 395
Mehler, J., Dommergues, J.-Y., Frauenfelder, U. H., and Seguı́, binations, and the nonlinguistic contexts in which they
J. (1981). The syllable’s role in speech segmentation. Journal occur.
of Verbal Learning and Verbal Behavior, 20, 298–305. In this article, semantic development in selected pop-
Norris, D., McQueen, J. M., and Cutler, A. (1995). Competi- ulations of children with language disorders is described
tion and segmentation in spoken word recognition. Journal
via intralinguistic referencing. Included are summaries
of Experimental Psychology: Learning, Memory and Cogni-
tion, 21, 1209–1228. of deficits in semantic processing that characterize each
Norris, D., McQueen, J. M., Cutler, A., and Butterfield, S. population as a whole, at early and later points in
(1997). The possible-word constraint in the segmentation of development.
continuous speech. Cognitive Psychology, 34, 191–243.
Norris, D., McQueen, J. M., Cutler, A., Butterfield, S., Early Semantic Development in Children with
and Kearns, R. (2001). Language-universal constraints on
speech segmentation. Language and Cognitive Processes, 16, Developmental Language Disorders
637–660. Children with autism or other pervasive developmental
Otake, T., Hatano, G., Cutler, A., and Mehler, J. (1993). Mora disorders (PDDs) typically demonstrate semantic sys-
or syllable? Speech segmentation in Japanese. Journal of tems that are weak relative to the formal systems of
Memory and Language, 32, 258–278.
Soto-Faraco, S., Sebastián-Gallés, N., and Cutler, A. (2001). syntax, morphology, and phonology. This weakness is
Segmental and suprasegmental mismatch in lexical access. manifested as use of words without regard to con-
Journal of Memory and Language, 45, 412–432. ventional meaning, context-bound extensions of word
van der Lugt, A. (2001). The use of sequential probabilities in meaning, and confusion regarding the mapping of per-
the segmentation of speech. Perception and Psychophysics, sonal pronouns onto their referents. Social and cogni-
63, 811–823. tive deficits are thought to contribute to this weakness.
Vroomen, J., and de Gelder, B. (1995). Metrical segmentation Whereas normally developing children readily make
and lexical inhibition in spoken word recognition. Journal inferences about word meanings by reading social and
of Experimental Psychology: Human Perception and Perfor- contextual cues, such as the speaker’s eye gaze and
mance, 21, 98–108.
intentions, children with autism do not (Baron-Cohen,
Vroomen, J., van Zon, M., and de Gelder, B. (1996). Cues to
speech segmentation: Evidence from juncture mispercep- Baldwin, and Crowson, 1997). This failure is often
tions and word spotting. Memory and Cognition, 24, viewed as part of a broader deficit in theory of mind.
744–755. The theory of mind deficit is further reflected in a
Weber, A. (2001). Language-specific listening: The case of pho- particular limitation of children with autism in the ac-
netic sequences. Doctoral dissertation, University of Nijme- quisition of words for cognitive states (Tager-Flusberg,
gen, The Netherlands. 1992).
Zwitserlood, P. (1989). The locus of the e¤ects of sentential- In specific language impairment (SLI), semantics
semantic context in spoken-word processing. Cognition, 32, is generally a strength relative to the formal domains
25–64. of morphosyntax and syntax. However, children with
SLI do demonstrate some semantic delay relative to
Further Readings their normal age-mates. For example, the appearance
Cutler, A., and Clifton, C. E. (1999). Comprehending spoken of first meaningful words is delayed an average of 11
language: A blueprint of the listener. In C. Brown and P. months (Trauner et al., 1995). This delay is maintained
Hagoort (Eds.), Neurocognition of language (pp. 123–166). throughout the early preschool years and is measurable
Oxford: Oxford University Press. with receptive vocabulary tests and experimental word-
Frauenfelder, U. H., and Floccia, C. (1998). The recognition learning paradigms, which show that children with SLI
of spoken words. In A. Friederici (Ed.), Language compre- are poor at fast mapping and long-term retention of
hension, a biological perspective (pp. 1–40). Heidelberg: words (Rice et al., 1994). Even when these children
Springer-Verlag.
succeed at word mapping, the semantic elaboration of
Grosjean, F., and Frauenfelder, U. H. (Eds.). (1997). Spoken
word recognition paradigms. Hove, U.K.: Psychology Press. those words in the lexicon is sparser than age expect-
Jusczyk, P. W. (1997). The discovery of spoken language. ations would predict. This sparseness is associated with
Cambridge, MA: MIT Press. greater numbers of naming errors during picture naming
McQueen, J. M., and Cutler, A. (Eds.). (2001). Spoken word (McGregor et al., 2002). In their spontaneous speech,
access processes. Hove, U.K.: Psychology Press. these children exhibit less lexical diversity, especially
verb diversity, than their normal age-mates (Watkins
et al., 1995). Finally, for children with SLI, processing
the subtle meanings carried by grammatical morphemes,
Semantics derivational morphemes, and word order is particularly
problematic (Bishop, 1997).
Short-term (working) memory deficits likely contrib-
The term semantics refers to linguistic meaning. Seman- ute to the delayed semantic development of children
tic development involves mapping linguistic forms onto with SLI (Gathercole and Baddely, 1990). These defi-
our mental models of the world and organizing these cits are associated with slower word learning, presum-
maps into networks of related information. Semantic ably because features of new words or their referents
processing involves the comprehension and production are not su‰ciently represented in short-term memory to
of meaning as conveyed by linguistic forms, their com- be committed to the long-term lexical store. Another
possible contributor to weak lexical semantics is the and Holland, 1989). Also, these children have di‰culty
limited ability of children with SLI to benefit from syn- building semantic networks. For example, when asked to
tactic bootstrapping. On tasks requiring the acting out select two antonyms, synonyms, or class coordinates
of unfamiliar verbs, children with SLI do not infer from lists of four words, both right- and left-lesioned
meanings from syntactic bootstrapping as well as their schoolchildren made significantly more errors than their
younger language-matched controls (van der Lely, normal age-mates (Eisele and Aram, 1995). In sponta-
1994), or they require a great deal of processing e¤ort to neous speech, their semantic deficits are manifested as
do so (O’Hara and Johnston, 1997). low numbers and diversity of words (Feldman et al.,
Down syndrome is another condition in which intra- 1992).
linguistic referencing reveals semantics as a relative Type and extent of insult, as well as age at onset,
strength. In the Down syndrome population, receptive influence semantic processing in children with acquired
single-word vocabulary is una¤ected in early childhood language disorders. In general, cognitive disorganization
and may exceed nonverbal cognitive ability by adoles- and poor long-term memory exacerbate semantic deficits
cence (Chapman, 1995). However, children with Down in this population (Levin et al., 1982).
syndrome demonstrate semantic delays relative to their
normal (mental-age-matched) peers. Soon after the onset
of first words, expressive vocabulary development begins
Later Semantic Development
to lag (Cardoso-Martins, Mervis, and Mervis, 1985). As Semantic weaknesses in older children and adolescents
children with Down syndrome learn to combine words extend to the discourse level. In order to process
into sentences, their expressive delays in lexical seman- discourse, children must integrate meanings across sen-
tics relative to their MLU-matched peers are mani- tences (linguistic context) with regard to shared infor-
fested as a lower rate of verb use per sentence (Hesketh mation, communicative goals, and the physical setting
and Chapman, 1998). As in SLI, children with Down (nonlinguistic context). Not surprisingly, most individu-
syndrome have di‰culty expressing meanings with als with language disorders, no matter their diagnostic
grammatical morphemes and noncanonical word order category, have significant di‰culties with semantics at
(Kumin, Councill, and Goodman, 1998). Limited diver- the discourse level. Children with acquired language
sity of vocabulary is characteristic of their discourse disorders have trouble recalling meaningful propositions
(Chapman, 1995). Also, similar to children with autism, from stories and, when producing stories, they have dif-
these children have di‰culty expressing internal states; ficulty conjoining meaning across sentences via referen-
use of words to express volition, ability, or cognition is tial and lexical ties (Ewing-Cobbs et al., 1998). Children
particularly compromised (Beeghly and Cicchetti, 1997). with developmental language disorders present with
The semantic delays of children with Down syndrome similar problems (Bishop and Adams, 1992). Processing
are multidetermined. Degree of mental retardation limits the implicit meanings, abstract meanings, and nonliteral
development of the semantic-conceptual system. Fluctu- meanings characteristic of sophisticated discourse is also
ating mild to moderate hearing loss, which is highly problematic for many children with either acquired or
prevalent in this population, also has some e¤ect developmental language disorders. Some argue that a
(see otitis media: effects on children’s language). subgroup of children with developmental language dis-
Finally, as in SLI, limited short-term memory is thought orders, who share certain characteristics of both SLI and
to play a role (Chapman, 1995). PDD syndromes, have semantic-pragmatic disorders as
their primary area of deficit. Such children are particu-
Early Semantic Development in Children with larly deficient in all higher level semantic attainments
(Rapin and Allen, 1983).
Acquired Language Disorders
In children who have acquired language disorders sub-
Semantic Development and Reading
sequent to unilateral focal lesions, intralinguistic pro-
files often vary according to location of the lesion in the Children with language impairments of both develop-
brain. Children with right hemisphere lesions tend to mental and acquired types have a higher incidence of
have more pronounced deficits in semantics than in reading impairment than the general population (see
formal aspects of language; children with left hemisphere language impairment and reading disability). For
lesions demonstrate the reverse pattern (Eisele and example, roughly half of a group identified as having
Aram, 1994). This generalization is gross, and semantic SLI at 5 years were reading-impaired at 15 years (Sto-
involvement will vary across individuals. Overall, com- thard et al., 1998). Part of this reading di‰culty relates
pared to their normal age-mates, children with unilateral to weaknesses in semantics (poor phonological and
lesions present with late onset of first words in both grammatical processing also plays a role). These chil-
comprehension and production, with right-lesioned children may not have su‰cient semantic representations on
dren having more significant comprehension deficits which to map orthography. Furthermore, they are less
than left-lesioned children (Thal et al., 1991). In experi- able to use semantic bootstrapping to infer the meaning
mental word-learning studies, unilateral brain-lesioned of words in context (Snowling, 2000).
children require more teaching trials than their normal The developmental relation between lexical semantics
peers to demonstrate comprehension (Keefe, Feldman, and reading is reciprocal. Poor knowledge in the seman-
Semantics 397
tic domain contributes to poor reading, and poor read- Cardoso-Martins, C., Mervis, C. B., and Mervis, C. A.
ing exacerbates lags in semantic development. Children (1985). Early vocabulary acquisition by children with
who have poor reading skills read fewer words over the Down syndrome. American Journal of Mental Deficiency,
course of a year and are less able to learn the words they 90, 177–184.
Chapman, R. S. (1995). Language development in children and
do read than those who have good reading skills (Nagy
adolescents with Down syndrome. In P. Fletcher and B.
and Anderson, 1984). MacWhinney (Eds.), Handbook of child language. Oxford:
Children with hyperlexia, often viewed as a subgroup Blackwell.
of the PDD spectrum, demonstrate a preoccupation with Eisele, J. A., and Aram, D. M. (1994). Di¤erential e¤ects of
decoding written words to the exclusion of meaning. early hemisphere damage on lexical comprehension and
Word recognition skills in these children are reported to production. Aphasiology, 7, 513–523.
be as much as 7 years in advance of grade-level expec- Ewing-Cobbs, L., Brookshire, B., Scott, M. A., and Fletcher,
tations, whereas reading comprehension is at or below J. M. (1998). Children’s narratives following traumatic
grade level (Whitehouse and Harris, 1984). Such chil- brain injury: Lingusitic structure, cohesion, and thematic
dren demonstrate that form and meaning can be sharply recall. Brain and Language, 61, 395–419.
Feldman, J. M., Holland, L., Kemp, S. S., and Janosky, J. E.
divorced in developmental disorders.
(1992). Language development after unilateral brain injury.
Future Trends Fenson, L., Dale, P., Reznick, S., Thal, D., Bates, E., Hartung,
Recent years brought improved methods for identifying J., et al. (1993). The Macarthur Communicative Development
Inventories. San Diego, CA: Singular Publishing Group.
lexical semantic deficits in children. Newly developed
Gathercole, S. E., and Baddeley, A. D. (1990). Phonological
parent-report inventories provide valid estimates of the memory deficits in language disordered children: Is there a
size of receptive lexicons in children functioning below causal connection? Journal of Memory and Language, 29,
the 16-month level and expressive lexicons in children 336–360.
functioning below the 30-month level (Fenson et al., Hesketh, L. J., and Chapman, R. S. (1998). Verb use by indi-
1993). For more skilled children, new ways of quantify- viduals with Down syndrome. American Journal on Mental
ing lexical-semantic abilities from discourse samples Retardation, 103, 288–304.
provide valid diagnostic indicators (Miller, 1996). Keefe, K. A., Feldman, H. M., and Holland, A. L. (1989).
The near future promises a burgeoning interest in se- Lexical learning and language abilities in preschoolers with
mantic development in children with language disorders. prenatal brain damage. Journal of Speech and Hearing Dis-
orders, 54, 395–402.
Increasingly, investigators, motivated by social interac-
Kumin, L., Councill, C., and Goodman, M. (1998). Expressive
tionist and dynamical systems theories, are demonstrat- vocabulary development in children with Down syndrome.
ing that semantic ability is not a static collection of Down Syndrome Quarterly, 3, 1–7.
knowledge but a system that emerges from interactions Levin, H., Eisenberg, H. M., Wigg, N. R., and Kobayashi, K.
among and between knowledge, context, and process- (1982). Memory and intellectual ability after head injury in
ing demands in real time. Aiding investigation of real- children and adolescents. Neurosurgery, 11, 668–673.
time semantic processing are new technologies and McGregor, K., Newman, R., Reilly, R., and Capone, N.
methods such as semantic priming, eye tracking, and (2002). Semantic representation and naming in children
event-related potentials. Aiding identification of seman- with specific language impairment. Journal of Speech, Lan-
tic deficits is the inclusion of dynamic word-learning guage, and Hearing Research, 45, 998–1014.
Miller, J. F. (1996). The search for the phenotype of disordered
tasks in diagnostic batteries. When these sophisticated
language performance. In M. L. Rice (Ed.), Toward a ge-
methods are widely employed, both the quality and the netics of language (pp. 297–314). Mahwah, NJ: Erlbaum.
quantity of semantic representations in children a¤ected Nagy, W. E., and Anderson, R. C. (1984). How many words
by language impairments will receive increased attention. are there in printed school English? Reading Research
Quarterly, 19, 304–330.
—Karla M. McGregor
O’Hara, M., and Johnston, J. (1997). Syntactic bootstrapping
References in children with specific language impairment. European
Journal of Disorders of Communication, 32, 147–164.
Baron-Cohen, S., Baldwin, D. A., and Crowson, M. (1997). Rapin, I., and Allen, D. (1983). Developmental language
Do children with autism use the speaker’s direction of gaze disorders: Nosological considerations. In U. Kirk (Ed.),
strategy to crack the code of language? Child Development, Neuropsychology of language, reading, and spelling (pp.
68, 48–57. 155–184). New York: Academic Press.
Beeghly, M., and Cicchetti, D. (1997). Talking about self Rice, M., Oetting, J., Marquis, J., Bode, J., and Pae, S. (1994).
and other: Emergence of an internal state lexicon in young Frequency of input e¤ects on word comprehension of chil-
children with Down syndrome. Development and Psycho- dren with specific language impairment. Journal of Speech
pathology, 9, 729–748. and Hearing Research, 37, 106–122.
Bishop, D. V. M. (1997). Uncommon understanding: Develop- Snowling, M. J. (2000). Language and literacy skills: Who is
ment and disorders of language comprehension in children. at risk and why? In D. V. M. Bishop and L. B. Leonard
East Sussex, U.K.: Psychology Press. (Eds.), Speech and language impairments in children:
Bishop, D. V. M., and Adams, C. (1992). Comprehension Causes, characteristics, intervention and outcome (pp.
problems in children with specific language impairment: 245–259). Philadelphia: Psychology Press.
Literal and inferential meaning. Journal of Speech and Stothard, S. E., Snowling, M. J., Bishop, D. V. M., Chipchase,
Hearing Research, 35, 119–129. B. B., and Kaplan, C. A. (1998). Language-impaired
preschoolers: A follow-up into adolescence. Journal of Lahey, M., and Edwards, J. (1999). Naming errors of children
Speech, Language, and Hearing Research, 41, 407–418. with specific language impairment. Journal of Speech, Lan-
Tager-Flusberg, H. (1992). Autistic children’s talk about psy- guage, and Hearing Research, 42, 195–205.
chological states: Deficits in the early acquisition of a theory Leonard, L. B. (1998). Children with specific language impair-
of mind. Child Development, 63, 161–172. ment. Cambridge, MA: MIT Press.
Thal, D. J., Marchman, V., Stiles, J., Aram, D., Trauner, D., McGregor, K. K., Friedman, R., Reilly, R., and Newman, R.
Nass, R., et al. (1991). Early lexical development in (2002). Semantic representation and naming in young chil-
children with focal brain injury. Brain and Language, 40, dren. Journal of Speech, Language, and Hearing Research,
491–527. 45, 332–346.
Trauner, D., Wulfeck, B., Tallal, P., and Hesselink, J. (1995). McGregor, K. K., and Waxman, S. R. (1998). Object naming
Neurologic and MRI profiles of language impaired children at multiple hierarchical levels: A comparison of pre-
(Technical Report, Publication No. CND-9513). San Di- schoolers with and without word-finding deficits. Journal of
ego, CA: Center for Research in Language, UCSD. Child Language, 25, 419–430.
Van der Lely, H. (1994). Canonical linking rules: Forward Mervis, C. B., and Bertrand, J. (1995). Acquisition of the novel
versus reverse linking in normally developing and specifi- name-nameless category (N3C) principle by young children
cally language-impaired children. Cognition, 51, 29–72. who have Down syndrome. American Journal of Mental
Watkins, R. V., Kelly, D. J., Harbers, H. M., and Hollis, Retardation, 100, 231–243.
W. (1995). Measuring children’s lexical diversity: Di¤er- Nation, K., and Snowling, M. J. (1999). Developmental dif-
entiating typical and impaired language learners. Journal of ferences in sensitivity to semantic relations among good
Speech and Hearing Research, 38, 1349–1355. and poor comprehenders: Evidence from semantic priming.
Whitehouse, D., and Harris, J. C. (1984). Hyperlexia in infan- Cognition, 70, B1–B13.
tile autism. Journal of Autism and Developmental Disorders, Nippold, M. A. (1988). Later language development (2nd ed.).
14, 281–289. Austin, TX: Pro-Ed.
Oetting, J. B., Rice, M. L., and Swank, L. K. (1995). Quick
incidental learning (QUIL) of words by school-age children
Further Readings with and without specific language impairment. Journal of
Bishop, D. V. M., and Adams, C. (1989). Conversational Rescorla, L. (1989). The Language Development Survey: A
characteristics of children with semantic-pragmatic dis- screening tool for delayed language in toddlers. Journal of
order: II. What features lead to a judgement of inappro- Speech and Hearing Disorders, 54, 587–599.
priacy? British Journal of Disorders of Communication, 24, Secord, W. A., and Wiig, E. H. (1993). Interpreting figurative
241–263. language expressions. Folia Phoniatrica, 45, 1–9.
Bloom, P. (2000). How children learn the meanings of words. Stanovich, K. E. (1986). Matthew e¤ects in reading: Some
Cambridge, MA: MIT Press. consequences of individual di¤erences in the acquisition of
Chapman, R. S., Seung, J. K., Schwartz, S. E., and Kay- literacy. Reading Research Quarterly, 21, 360–406.
Raining Bird, E. (2000). Predicting language production in
children and adolescents with Down syndrome: The role of
comprehension. Journal of Speech, Language, and Hearing
Research, 43, 340–350.
Dorman, C., and Katzir, B. (1994). Cognitive e¤ects of early Social Development and Language
brain injury. Baltimore: Johns Hopkins University Press. Impairment
Dunn, M., Gomes, J., and Sebastian, M. J. (1996). Proto-
typicality of responses of autistic, language disordered, and
normal children in a word fluency task. Child Neuro- Social skills determine to a large extent the success we
psychology, 2, 99–108. enjoy vocationally and avocationally and the amount of
Fletcher, P., and Peters, J. (1984). Characterizing language satisfaction we derive from our personal relationships.
impairment in children: An exploratory study. Language Social skill deficits in childhood have long been asso-
Testing, 1, 33–49. ciated with a variety of negative outcomes, including
Frith, U. (1989). Autism: Explaining the enigma. Oxford:
criminality, underemployment, and psychopathology (cf.
Blackwell.
Gathercole, S. E., and Baddeley, A. D. (1993). Working mem- Gilbert and Connolly, 1991). The potential impact of
ory and language. Hillsdale, NJ: Erlbaum. developmental language impairments on social develop-
Hollich, G. J., Hirsh-Pasek, K., and Golinko¤, R. M. (2000). ment is one of the most important issues facing families,
Breaking the language barrier: An emergentist coalition educators, and speech-language pathologists. Unfortu-
model for the origins of word learning. Monographs of the nately, there is little agreement on the definition of social
Society for Research in Child Development, 65. skills as a psychological construct, making service plan-
Jordan, F. M. (1990). Speech and language disorders following ning in this area challenging. In their review, Merrell and
childhood closed head injury. In B. E. Murdoch (Ed.), Gimpel (1998) presented no less than 16 di¤erent defi-
Acquired neurological speech/language disorders in childhood nitions that enjoy wide currency and reflect the interests
(pp. 124–147). London: Taylor and Francis.
Kail, R., and Leonard, L. B. (1986). Word-finding abilities in
of a variety of disciplines, including psychology, psy-
language-impaired children. ASHA Monographs, 25. chiatry, special education, and social work. However,
Kiernan, B., and Gray, S. (1998). Word learning in a commonalities across these di¤erent perspectives can be
supported-learning context by preschool children with spe- extracted. In a meta-analysis of 21 multivariate studies
cific language impairment. Journal of Speech, Language, that classified children’s social skills (total N ¼ 22,000),
and Hearing Research, 41, 161–171. Caldarella and Merrell (1997) identified five core
Social Development and Language Impairment 399
dimensions: peer relations, self-management, academics, a group of grade-matched typically developing con-
compliance, and assertion. trols during a role-enactment activity. Children with
What are the interrelationships between language SLI produced fewer strategies during a more verbally
impairments and these important areas of social devel- demanding hypothetical problem-solving activity. Brin-
opment? Language impairments occur with a large ton, Fujiki, and McKee (1998) examined the negotiation
variety of developmental disorders, and some, such as skills of six children with SLI (8 to 12 years old) during
mental retardation, autism, and pervasive developmen- conversational interactions with typically developing
tal delay, include social skill deficits as a primary diag- peers and found that children with SLI contributed fewer
nostic feature. In order to address the question of how and less mature negotiation strategies. The strategies
language impairment uniquely a¤ects social develop- used by children in the SLI group resembled those pro-
ment, however, we need to examine the social skills of duced by a younger group of children of equivalent lan-
children with specific language impairment (SLI) (see guage levels.
specific language impairment in children). SLI refers The third cluster of social skills identified by Caldar-
to a language deficiency that occurs in the absence of ella and Merrell’s (1997) meta-analysis was the aca-
other conditions commonly associated with language demics dimension. This dimension captures behaviors
disorders in children. Children with SLI show normal regarded by teachers as important to school adjustment
hearing, age-appropriate scores on nonverbal tests of and is represented by such skills as completing tasks
intelligence, and no obvious signs of neurological or independently, following teachers’ directions, and pro-
socioemotional impairment. Children with SLI represent ducing quality work. Indirect evidence for problems in
a heterogeneous group, and significant individual di¤er- this dimension comes from studies of SLI that have
ences exist among children diagnosed with this disorder. used rating scales to evaluate children’s socioemotional
However, a common profile in young, English-speaking characteristics. For example, Redmond and Rice (1998)
children with SLI is a mild to moderate deficit in a range compared standardized parent and teacher ratings of 17
of language areas and a more significant deficit in the use children with SLI collected at kindergarten and first
of grammatical morphology. grade to ratings collected on typically developing chil-
According to Caldarella and Merrell (1997), a large dren. The particular rating scales used included several
number of social skills contribute to the dimension items that relate to important academic skills (e.g., has
peer relations. These skills include specific discourse/ di‰culty following directions; fails to carry out tasks;
pragmatic behaviors such as complimenting others and messy work). These investigators found significant dif-
inviting others to play, as well as more general social ferences between groups on the teacher ratings of these
attributes such as peer acceptance. Several studies have problems but not on the parent ratings, suggesting
examined the peer interactions of preschool and school- that the social performance of children with SLI varies
age children with SLI and have documented the detri- significantly across situations, depending on the verbal
mental e¤ect that language impairments can have on this demands placed on them and the expectations of others.
area of social development. For example, children with Levels of academic success may also influence aca-
SLI are likely to be ignored by their typically developing demic social behaviors. In an epidemiological study of
peers, respond less often when their peers make initia- 164 second-grade children with language impairments,
tions, and rely more on adults to mediate their interac- Tomblin et al. (2000) found that levels of classroom be-
tions (Craig and Evans, 1989; Hadley and Rice, 1991; havior problems were higher among children with SLI
Rice, Sell, and Hadley, 1991; Craig and Washington, who also had reading disabilities than among children
1993; Brinton, Fujiki, and Higbee, 1998; Brinton, Fujiki, with SLI alone (see language disorders and reading
and McKee, 1998). Sociometric analyses confirm further disabilities).
the impression that children with SLI experience limited Prosocial behaviors such as cooperation and sharing
peer acceptance (Gertner, Rice, and Hadley, 1994; are captured by Caldarella and Merrell’s compliance di-
Fujiki, Brinton, Hart, et al., 1999). Some studies suggest mension. Information on the consequences of SLI in
that problems in peer group acceptance may extend this area of social skill development is limited. Farmer
to di‰culties establishing adequate friendships (Fujiki, (2000) compared the performances of 16 10-year-old
Brinton, Morgan, et al., 1999), whereas others have children with SLI with that of a group of typically
reported no di¤erences between children with SLI and developing children on a standardized teacher rating
typically developing children in the number and quality scale of prosocial behaviors and found no significant
of close friendships (Redmond and Rice, 1998). di¤erences.
Although peer relations represent the sine qua non The final social skills dimension in Caldarella and
of social development, there are other important social Merrell’s taxonomy is assertion. Several studies suggest
skills. Self-management refers to the ability to control children with SLI experience particular di‰culty in this
one’s temper, follow rules and limits, and compromise area. Craig and Washington (1993) examined the con-
with others (Caldarella and Merrell, 1997). A few studies versational skills of five 7-year-old children with SLI as
have assessed this dimension in children with SLI. Ste- they attempted to access ongoing peer interactions and
vens and Bliss (1995) examined conflict resolution abili- found that three of the five children had considerable
ties in 30 children with SLI in grades 3 through 7 and di‰culty asserting themselves in this situation. Brinton
found no significant di¤erences between this group and et al. (1997) replicated these results with older children
(8–12 years old). Children with SLI have also been con- specific areas of social skill development. The results of
sistently characterized as shy, passive, and withdrawn by this line of research will inform important areas of clini-
parents and teachers (Tallal, Dukette, and Curtiss, 1989; cal practice, such as diagnosis, prognosis, and treatment.
Fujiki, Brinton, and Todd, 1996; Redmond and Rice, See also psychosocial problems associated with
1998; Beitchman et al., 2001). Results from a recent 14- communicative disorders.
year longitudinal study of 77 children with speech and
language impairments suggest that characterizations of —Sean M. Redmond
low assertiveness may be longstanding and continue at
References
least into young adulthood for some children with SLI
(Beitchman et al., 2001). Beitchman, J. H., Wilson, B., Johnson, C. J., Atkinson, L.,
In sum, a small but growing body of research suggests Young, A., Adlaf, E., et al. (2001). Fourteen-year follow-up
that language impairments place children at risk for of speech/language impaired and control children: Psychi-
negative social consequences, specifically in the areas of atric outcome. Journal of the American Academy of Child
assertion and peer relations. Problems in these areas may and Adolescent Psychiatry, 40, 75–82.
Bishop, D. V. M. (2000). Pragmatic language impairment: A
be particularly detrimental for children with SLI because correlate of SLI, a distinct subgroup, or part of the autistic
they can contribute to what Rice (1993) has described continuum? In D. V. M. Bishop and L. B. Leonard (Eds.),
as a ‘‘negative social spiral.’’ Rice suggested that in re- Speech and language impairments in children: Causes, char-
sponse to repeated instances of communicative failure, acteristics, intervention and outcome (pp. 99–114). Philadel-
children with SLI may withdraw from peer interactions phia: Taylor and Francis.
or rely more on adults to mediate peer interactions. Brinton, B., and Fujiki, M. (1999). Social interactional behav-
However, these behavioral adjustments may turn out iors of children with specific language impairment. Topics in
to be counterproductive for both social and linguistic Language Disorders, 19, 34–48.
development because they limit children’s access to im- Brinton, B., Fujiki, M., and Higbee, L. (1998). Participation in
portant socialization experiences and opportunities to cooperative learning activities by children with specific lan-
guage impairment. Journal of Speech, Language, and Hear-
improve their limited language skills. ing Research, 41, 1193–1206.
Studies of children with SLI have consistently Brinton, B., Fujiki, M., and McKee, L. (1998). The negotia-
reported high levels of variability in social skill perfor- tion skills of children with specific language impairment.
mance, a finding that has important clinical and research Journal of Speech, Language, and Hearing Research, 41,
implications. Social skill deficits do not appear to be 927–940.
an inevitable consequence of developmental language Brinton, B., Fujiki, M., Spencer, J. C., and Robinson, L. A.
impairments, nor can social skill di¤erences between (1997). The ability of children with specific language im-
children be inferred from di¤erences in either the type or pairment to access and participate in an ongoing interac-
severity of language deficits (e.g., Brinton and Fujiki, tion. Journal of Speech, Language, and Hearing Research,
1999; Fujiki et al., 1999; Donlan and Masters, 2000). 40, 1011–1025.
Caldarella, P., and Merrell, K. W. (1997). Common dimen-
Given the heterogeneity of children with developmental sions of social skills of children and adolescents: A taxon-
language disorders, it is important that treatment teams omy of positive behaviors. School Psychology Review, 26,
supplement the assessment of language impairment with 264–278.
a separate assessment of social skills in the areas of peer Cohen, N. J., Vallance, D. D., Barwick, M., Im, N., Menna,
relations, self-management, academics, compliance, and R., Horodezky, N. B., et al. (2000). The interface between
assertion across di¤erent situational contexts (home, ADHD and language impairment: An examination of lan-
classroom, playground). guage, achievement, and cognitive processing. Journal of
Future investigations may reveal that variability in Child Psychology and Psychiatry, 41, 353–362.
children with language impairments is better accounted Craig, H. K., and Evans, J. (1989). Turn exchange character-
for by uncontrolled confounding factors. For example, istics of SLI children’s simultaneous and nonsimultaneous
speech. Journal of Speech and Hearing Disorders, 54,
although many studies suggest that SLI and attention- 334–347.
deficit/hyperactivity disorder commonly co-occur (cf. Craig, H. K., and Washington, J. A. (1993). The access
Cohen et al., 2000) the potential influence of this com- behaviors of children with specific language impairment.
orbidity on social skill development has not yet been Journal of Speech and Hearing Research, 36, 322–336.
considered. Likewise, a small portion of children diag- Donlan, C., and Masters, J. (2000). Correlates of social devel-
nosed with SLI demonstrate limitations in social cogni- opment in children with communicative disorders: The
tion commonly associated with autism and pervasive concurrent predictive value of verbal short-term memory
developmental delay. There has been a longstanding span. International Journal of Communication Disorders, 35,
controversy over the diagnostic boundaries between SLI 211–226.
and autism spectrum disorders (cf. Bishop, 2000), and Farmer, M. (2000). Language and social cognition in children
with specific language impairment. Journal of Child Psy-
social skill outcomes may be an important distinguishing chology and Psychiatry and Allied Disciplines, 41, 627–
characteristic of children who fall outside preconceived 636.
categories. Large-scale investigations comparing the so- Fujiki, M., Brinton, B., Hart, C. H., and Fitzgerald, A. (1999).
cial skills of children with SLI only with those of chil- Peer acceptance and friendship in children with specific
dren with SLI and other comorbid disorders are needed language impairment. Language, Speech, and Hearing
to delineate which language skills are associated with Services in Schools, 30, 183–195.
Social Development and Language Impairment 401
Fujiki, M., Brinton, B., Morgan, M., and Hart, C. H. (1999). Cantwell, D. P., and Baker, L. (1991). Psychiatric and devel-
Withdrawn and sociable behavior of children with language opmental disorders in children with communication disorder.
impairments. Language, Speech, and Hearing Services in Washington, DC: American Psychiatric Press.
Schools, 30, 183–195. Craig, H. K. (1991). Pragmatic characteristics of the child with
Fujiki, M., Brinton, B., and Todd, C. M. (1996). Social skills specific language impairment: An interactionist perspective.
of children with specific language impairment. Language, In T. M. Gallagher (Ed.), Pragmatics of language: Clinical
Speech, and Hearing Services in Schools, 27, 195–202. practice issues (pp. 163–198). San Diego, CA: Singular
Gertner, B. L., Rice, M. L., and Hadley, P. A. (1994). Influ- Publishing Group.
ence of communicative competence on peer preferences in Craig, H. K. (1993). Social skills of children with specific
a preschool classroom. Journal of Speech and Hearing Re- language impairment: Peer relationships. Language, Speech,
search, 37, 913–923. and Hearing Services in Schools, 24, 206–215.
Gilbert, D. G., and Connolly, J. J. (Eds.). (1991). Personality, Craig, H. K. (1995). Pragmatic impairments. In P. Fletcher
social skills, and psychopathology: An individual di¤erence and B. MacWhinney (Eds.), The handbook of child language
approach. New York: Plenum Press. (pp. 623–640). Cambridge, MA: Blackwell.
Hadley, P. A., and Rice, M. L. (1991). Conversational DeThorne, L. S., and Watkins, R. V. (2001). Listeners’ per-
responsiveness of speech- and language-impaired pre- ceptions of language use in children. Language, Speech, and
schoolers. Journal of Speech and Hearing Research, 34, Hearing Services in Schools, 32, 142–148.
1308–1317. Donahue, M. L., Hartas, D., and Cole, D. (1999). Research on
Merrell, K. W., and Gimpel, G. A. (1998). Social skills of interactions among oral language and emotional/behavioral
children and adolescents: Conceptualization, assessment, disorders. In D. Rogers-Adkinson and P. Gri‰th (Eds.),
treatment. Mahwah, NJ: Erlbaum. Communication disorders and children with psychiatric and
Redmond, S. M., and Rice, M. L. (1998). The socioemotional behavioral disorders (pp. 69–98). San Diego, CA: Singular
behaviors of children with SLI: Social adaptation or social Publishing Group.
deviance? Journal of Speech, Language, and Hearing Re- Elliot, S. N., and Gresham, F. M. (1991). Social skills inter-
search, 41, 688–700. vention guide: Practical strategies for social skills training.
Rice, M. L. (1993). ‘‘Don’t talk to him: He’s weird’’: A social Circle Pines, MN: American Guidance.
consequences account of language and social interactions. Farmer, M. (1997). Exploring the links between communi-
In A. P. Kaiser and D. B. Gray (Eds.), Enhancing children’s cation skills and social competence. Educational and Child
communication: Research foundations for intervention (pp. Psychology, 14, 38–44.
139–158). Baltimore: Paul H. Brookes. Fey, M. E., and Leonard, L. B. (1983). Pragmatic skills of
Rice, M. L., Sell, M. A., and Hadley, P. A. (1991). Social children with specific language impairment. In T. M. Gal-
interactions of speech- and language-impaired children. lagher and C. A. Prutting (Eds.), Pragmatic assessment and
Journal of Speech and Hearing Research, 34, 1299–1307. intervention issues in language (pp. 65–82). San Diego, CA:
Stevens, L. J., and Bliss, L. S. (1995). Conflict resolution abili- College-Hill Press.
ties of children with specific language impairment and chil- Gallagher, T. M. (1991). Language and social skills: Implica-
dren with normal language. Journal of Speech and Hearing tions for clinical assessment and intervention with school-
Research, 38, 599–611. age children. In T. M. Gallagher (Ed.), Pragmatics of
Tallal, P., Dukette, D., and Curtiss, S. (1989). Behavior/ language: Clinical practice issues (pp. 11–42). San Diego,
emotional profiles of preschool language-impaired children. CA: Singular Publishing Group.
Development and Psychopathology, 1, 51–67. Goldstein, H., and Gallagher, T. M. (1992). Strategies for
Tomblin, J. B., Zhang, X., Buckwalter, P., and Catts, H. promoting the social communicative competence of chil-
(2000). The association of reading disability, behavioral dren with specific language impairment. In S. L. Odom,
disorders, and language impairments among second-grade S. R. McConnell and M. A. McEvoy (Eds.), Social compe-
children. Journal of Child Psychology and Psychiatry and tence of young children with disabilities (pp. 189–213). Bal-
Allied Disciplines, 41, 473–482. timore: Paul H. Brookes.
Goodyer, I. M. (2000). Language di‰culties and psycho-
pathology. In D. V. M. Bishop and L. B. Leonard (Eds.),
Further Readings Speech and language impairments in children: Causes, char-
acteristics, intervention and outcome (pp. 227–244). Phila-
Beitchman, J. H., Cohen, N. J., Konstantareas, M. M., and delphia: Taylor and Francis.
Tannock, R. (Eds.). (1996). Language, learning, and behav- Gresham, F. M., and MacMillan, D. L. (1997). Social compe-
ior disorders: Developmental, biological, and clinical per- tence and a¤ective characteristics of students with mild dis-
spectives. New York: Cambridge University Press. abilities. Review of Educational Research, 67, 377–415.
Bishop, D. V. M. (1997). Uncommon understanding: Develop- Hummel, L. J., and Prizant, B. (1993). A socioemotional
ment and disorders of language comprehension in children. perspective for understanding social di‰culties of school-
East Sussex, U.K.: Psychology Press. age children with language disorders. Language, Speech,
Botting, N., and Conti-Ramsden, G. (2000). Social and be- and Hearing Services in Schools, 31, 216–224.
havioral di‰culties in children with language impairment. Miller, C. A. (2001). False belief understanding in children
Child Language Teaching and Therapy, 16, 105–120. with specific language impairment. Journal of Communica-
Brinton, B., and Fujiki, M. (1989). Conversational management tion Disorders, 34, 73–86.
with language-impaired children: Pragmatic assessment and Merrell, K. W. (1999). Behavioral, social, and emotional assess-
intervention. Rockville, MD: Aspen. ment of children and adolescents. Mahwah, NJ: Erlbaum.
Brinton, B., Fujiki, M., Montague, E., and Hanton, J. L. Prizant, B., Audet, L., Burke, G., Hummel, L., Maher, S.,
(2000). Children with language impairments in cooperative and Theadore, G. (1990). Communication disorders and
work groups: A pilot study. Language, Speech, and Hearing emotional/behavioral disorders in children and adolescents.
Services in Schools, 31, 252–264. Journal of Speech and Hearing Research, 55, 179–192.
Redmond, S. M., and Rice, M. L. (2002). Stability of behav- family that includes a high proportion of members with
ioral ratings of children with SLI. Journal of Speech, Lan- SLI, the evidence implicates a region on the long arm of
guage, and Hearing Research, 45, 190–200. chromosome 7 (Fisher et al., 1998). The same region
Rice, M. L., Hadley, P. A., and Alexander, A. L. (1993). Social was identified in a group of children with SLI who par-
biases toward children with speech and language impair-
ticipated in an epidemiological study (Tomblin, 1999).
ments: A correlative causal model of language limitations.
Applied Psycholinguistics, 14, 445–471. However, other recent studies of clinically referred cases
Rutter, M., Mawhood, L., and Howlin, P. (1992). Langu- of SLI have revealed prominent areas of linkage on
age delay and social development. In P. Feltcher and D. chromosomes 16 and 19, but not on chromosome 7 (SLI
Hall (Eds.), Specific Speech and Language Disorders in Consortium, 2002). Clearly, further refinement is needed
Children (pp. 63–78). San Diego, CA: Singular Publishing before the genetic basis for SLI is fully understood (see
Group. Bishop, 2002).
Windsor, J. (1995). Language impairment and social conse- In recent years, neuroanatomical evidence of di¤er-
quence. In M. E. Fey, J. Windsor, and S. F. Warren (Eds.), ences between individuals with SLI and typically devel-
Language intervention: Preschool through the elementary oping individuals has appeared in the literature (see
years (pp. 213–240). Baltimore: Paul H. Brookes.
Ahmed, Lombardino, and Leonard, 2000, for a recent
review). The specific di¤erences observed have varied
across studies. For example, symmetry of the right and
Specific Language Impairment in left perisylvian areas seems to be more likely in children
Children with SLI than in controls. Interestingly, this pattern
can also be seen in parents or siblings of children with
SLI even when they do not exhibit a language disorder.
Specific language impairment (SLI) is a term that is ap- Other studies have revealed a higher likelihood of atypi-
plied to children who show a significant deficit in their cal neuroanatomical patterns in children with SLI than
spoken language ability with no obvious accompanying in controls, but di¤erences among the children with SLI
problems such as hearing impairment, mental retarda- in the particular pattern seen (e.g., ventricular enlarge-
tion, or neurological damage. This type of language dis- ment, central volume loss).
order is regarded as developmental in nature because Although a diagnosis of SLI is not given to children
a¤ected children exhibit language learning problems unless they meet the criteria noted above, many children
from the outset. with SLI nevertheless show subtle weaknesses in other
Although SLI is receiving increased attention in the areas. For example, as a group, these children are slower
research and clinical literature, it is not a newly dis- and less accurate on nonlinguistic cognitive tasks such as
covered disorder. Children meeting the basic definition mental rotation (e.g., Miller et al., 2001), and less coor-
of SLI have been described in the literature since the dinated than their typically developing same-age peers
1800s, but have been given a wide range of clinical (Powell and Bishop, 1992; Hill, 2001). These findings
labels. More recent clinical labels used for children have led to proposals about the possible causes of SLI,
with SLI include developmental aphasia, developmental but the presence of children with SLI who show none
dysphasia, and developmental language disorder. The of these accompanying weaknesses raises the possibility
last continues to be used in the DSM-IV classification that SLI and subtle cognitive and motor weakness are
system, with the subtypes of ‘‘expressive’’ and ‘‘receptive comorbid. That is, the conditions that cause SLI fre-
and expressive’’ (American Psychiatric Association, quently co-occur with conditions that cause these other
1994). These subtypes acknowledge that some children problems, but the latter are not responsible for SLI.
with SLI may have significant limitations primarily in For children with SLI whose language problems
the area of language production, whereas others may are still present at 5 years of age, di‰culties with lan-
have major limitations in both the comprehension and guage may continue into adolescence and even adult-
production of language. hood (Bishop and Adams, 1990; Beitchman et al., 1996).
The prevalence of SLI is estimated to be approxi- Comparisons of young adults with a history of SLI and
mately 7% among 5-year-olds, based on epidemiological same-age adults with no such history reveal di¤erences
data (Tomblin et al., 1997). Males outnumber females; favoring the latter on a range of spoken production and
the most recent evidence suggests a ratio of approxi- comprehension tasks (Tomblin, Freese, and Records,
mately 1.5 : 1. 1992).
Children with SLI are two to three times more likely Children with SLI are at greater risk for reading defi-
than typically developing children to have parents or cits than children with typical language development.
siblings with a history of language problems (Tallal, This observation can be explained in part by the fact
Ross, and Curtiss, 1989; Tomblin, 1989; Tallal et al., that children with SLI and those with developmental
2001). For children with family histories of language dyslexia are overlapping populations (McArthur et al.,
problems, there is reason to suspect a genetic basis rather 2000). For example, prospective study of children from
than a primary environmental basis. Concordance rates homes with a positive history of dyslexia reveals signifi-
for SLI are considerably higher for monozygotic twins cantly more di‰culties with spoken language than chil-
than for same-sex dizygotic twins (Bishop, North, and dren with no such family history (Scarborough, 1990).
Donlan, 1995). Rapid progress is being made in the ge- The language di‰culties experienced by children with
netic study of SLI. For a well-studied three-generational SLI cover most or all areas of language, including
Specific Language Impairment in Children 403
vocabulary, morphosyntax, phonology, and pragmatics. ‘‘representational deficit for dependent relationships’’
However, these areas of language are rarely a¤ected to account assumes that children with SLI fail to grasp that
the same degree. In English, vocabulary and pragmatic movement or checking of grammatical features is oblig-
skills are often relative strengths, whereas phonology atory (van der Lely, 1998).
and especially morphosyntax are relative weaknesses. Other types of accounts assume that children with
This profile is not seen in all English-speaking children SLI might have the potential to acquire normal gram-
meeting the criteria for SLI. For example, some children mar but have limitations in processing that slow their
with SLI show notable word-finding problems. There identification and interpretation of the relevant input
have been several attempts at determining whether the and their ability to retrieve this information for produc-
di¤erences seen among children with SLI constitute dis- tion. In some cases the processing limitation is assumed
tinct subtypes or instead represent di¤erent points on a to be quite general (Johnston, 1994; Ellis Weismer,
continuum. Resolution of this issue will be important, 1996). In other cases the limitation is assumed to be
as identification of the correct phenotype of SLI will be specific to particular operations, such as phonological
necessary for further progress in the genetic study of this processing (Chiat, 2001) or the processing of brief or
disorder. rapidly presented auditory information (e.g., Tallal et al.,
The heterogeneity of SLI notwithstanding, certain 1996).
symptoms may have the potential to serve as ‘‘clinical Future research on SLI will make two types of con-
markers’’ of SLI. For English-speaking children, two tributions. Most obviously, greater understanding of
measures seem especially promising. One is a measure of this disorder should lead to more e¤ective methods of
the children’s use of grammatical morphemes pertain- assessment, treatment, and prevention. In addition, be-
ing to grammatical tense and agreement, such as regular cause the language disorder seen in SLI may in many
past -ed, third person singular -s, and copula and auxil- cases occur in the absence of accompanying impair-
iary forms of be (Rice and Wexler, 1996). The second is ments, it constitutes a challenge for theories of language
a measure of children’s ability to repeat nonsense words learning to explain.
containing several syllables (e.g., Dollaghan and Camp- See also language disorders in school-age chil-
bell, 1998). Both of these measures are quite accurate in dren: overview; speech disorders in children: a psy-
distinguishing children with SLI from their normally cholinguistic perspective.
developing age mates.
The linguistic profile of relative strengths and weak- —Laurence B. Leonard
nesses in SLI seems to be shaped to a significant degree References
by the language being acquired. For example, children
with SLI acquiring inflectionally rich languages such as Ahmed, S., Lombardino, L., and Leonard, C. (2001). Specific
Italian and Hebrew are not as severely impaired as their language impairment: Definitions, causal mechanisms,
English-speaking counterparts in their use of grammati- and neurobiological factors. Journal of Medical Speech-
cal inflections pertaining to tense and agreement. On the Language Pathology, 9, 1–15.
American Psychiatric Association. (1994). Diagnostic and sta-
other hand, Swedish-speaking children with SLI show tistical manual of mental disorders—Fourth edition. Wash-
more serious problems in using appropriate word order ington, DC: Author.
than do children with SLI acquiring English (see Leo- Beitchman, J., Wilson, B., Brownlee, E., Walters, H., and
nard, 1998, for a recent review). Lancee, W. (1996). Long-term consistency in speech/
Evidence for the e‰cacy of intervention is abundant language profiles: I. Developmental and academic out-
in the literature. For example, for preschool-age children comes. Journal of the American Academy of Child and
with SLI, approaches such as recasting have been rela- Adolescent Psychiatry, 35, 804–814.
tively successful (Camarata and Nelson, 1992; Fey, Bishop, D. (2002). Putting language genes in perspective.
Cleave, and Long, 1997). However, although the gains Trends in Genetics, 18, 57–59.
made in intervention usually go well beyond those that Bishop, D., and Adams, C. (1990). A prospective study of the
relationship between specific language impairment, phono-
can be expected by maturation alone, no intervention logical disorders and reading retardation. Journal of Child
approach has led to dramatic and rapid language gains Psychology and Psychiatry, 31, 1027–1050.
by these children on a consistent basis. This is especially Bishop, D., North, T., and Donlan, C. (1995). Genetic basis of
true when gains are defined in terms of use in spontane- specific language impairment: Evidence from a twin study.
ous speech. Developmental Medicine and Child Neurology, 37, 56–71.
Attempts to explain the nature of SLI vary consider- Camarata, S., and Nelson, K. E. (1992). Treatment e‰ciency
ably. Most of these accounts focus on the extraordinary as a function of target selection in the remediation of child
grammatical deficits often seen in children with SLI. It is language disorders. Clinical Linguistics and Phonetics, 6,
possible to classify these alternative accounts according 167–178.
to their principal assumptions. Some accounts assume Chiat, S. (2001). Mapping theories of developmental language
impairment: Premises, predictions and evidence. Language
that children with SLI lack particular types of gram- and Cognitive Processes, 16, 113–142.
matical knowledge. For example, the ‘‘extended optional Dollaghan, C., and Campbell, T. (1998). Nonword repeti-
infinitive’’ account assumes that children with SLI go tion and child language impairment. Journal of Speech,
through a protracted period during which they assume Language, and Hearing Research, 41, 1136–1146.
that tense and agreement are optional rather than oblig- Ellis Weismer, S. (1996). Capacity limitations in working
atory in main clauses (Rice and Wexler, 1996). The memory: The impact on lexical and morphological learning
by children with language impairment. Topics in Language van der Lely, H. (1998). SLI in children: Movement, economy,
Disorders, 17, 33–44. and deficits in the computational-syntactic system. Lan-
Fey, M., Cleave, P., and Long, S. (1997). Two models of guage Acquisition, 7, 161–192.
grammar facilitation in children with language impair-
ments: Phase 2. Journal of Speech, Language, and Hearing
Research, 40, 5–19. Further Readings
Fisher, S., Vargha-Khadem, F., Watkins, K., Monaco, A., and
Pembrey, M. (1998). Localisation of a gene implicated in a Bishop, D. (1997). Uncommon understanding: Development and
severe speech and language disorder. Nature Genetics, 18, disorders of comprehension in children. Hove, U.K.: Psy-
168–170. chology Press.
Hill, E. (2001). Non-specific nature of specific language Catts, H., Fey, M., Zhang, X., and Tomblin, J. B. (2001).
impairment: A review of the literature with regard to con- Estimating the risk of future reading di‰culties in kinder-
comitant motor impairment. International Journal of Lan- garten children: A research-based model and its classroom
guage and Communication Disorders, 36, 149–171. implementation. Language, Speech, and Hearing Services in
Johnston, J. (1994). Cognitive abilities of children with lan- Schools, 32, 38–50.
guage impairment. In R. Watkins and M. Rice (Eds.), Clahsen, H., Bartke, S., and Göllner, S. (1997). Formal fea-
Specific language impairments in children (pp. 107–121). tures in impaired grammars: A comparison of English
Baltimore: Paul H. Brookes. and German SLI children. Journal of Neurolinguistics, 10,
Leonard, L. (1998). Children with specific language impairment. 151–172.
Cambridge, MA: MIT Press. Conti-Ramsden, G., Botting, N., and Faragher, B. (2001).
McArthur, G., Hogben, J., Edwards, V., Heath, S., and Men- Psycholinguistic markers for specific language impair-
gler, E. (2000). On the ‘‘specifics’’ of specific reading dis- ment (SLI). Journal of Child Psychology and Psychiatry, 42,
ability and specific language impairment. Journal of Child 741–748.
Psychology and Psychiatry, 41, 869–874. Conti-Ramsden, G., Botting, N., Simkin, Z., and Knox, E.
Miller, C., Kail, R., Leonard, L., and Tomblin, J. B. (2001). (2001). Follow-up of children attending infant language
Speed of processing in children with specific language im- units: Outcomes at 11 years of age. International Journal of
pairment. Journal of Speech, Language, and Hearing Re- Language and Communication Disorders, 36, 207–219.
search, 44, 416–433. Dromi, E., Leonard, L., Adam, G., and Zadunaisky-Ehrlich,
Powell, R., and Bishop, D. (1992). Clumsiness and perceptual S. (1999). Verb agreement morphology in Hebrew-speaking
problems in children with specific language impairment. children with specific language impairment. Journal of
Developmental Medicine and Child Neurology, 41, 159–165. Speech, Language, and Hearing Research, 42, 1414–1431.
Rice, M., and Wexler, K. (1996). Toward tense as a clinical Gillam, R. (Ed.). (1998). Memory and language impairment in
marker of specific language impairment in English-speaking children and adults. Gaithersburg, MD: Aspen.
children. Journal of Speech and Hearing Research, 39, Gopnik, M., and Crago, M. (1991). Familial aggregation of a
1239–1257. developmental language disorder. Cognition, 39, 1–50.
Scarborough, H. (1990). Very early language deficits in dys- Hansson, K., Nettelbladt, U., and Leonard, L. (2000). Specific
lexic children. Child Development, 61, 1728–1743. language impairment in Swedish: The status of verb mor-
SLI Consortium. (2002). A genomewide scan identifies two phology and word order. Journal of Speech, Language, and
novel loci in specific language impairment. American Jour- Hearing Research, 43, 848–864.
nal of Human Genetics, 70, 384–398. Jakubowicz, C., and Nash, L. (2001). Functional categories
Tallal, P., Hirsch, L., Realpe-Bonilla, T., Miller, S., Brzusto- and syntactic operations in (ab)normal language acquisi-
wicz, L., Bartlett, C., et al. (2001). Familial aggregation in tion. Brain and Language, 77, 321–339.
specific language impairment. Journal of Speech, Language, Leonard, L. (2000). Specific language impairment across
and Hearing Research, 44, 1172–1182. languages. In D. Bishop and L. Leonard (Eds.), Speech
Tallal, P., Miller, S., Bedi, G., Byma, G., Wang, X., Nagara- and language impairments in children (pp. 115–129). Hove,
jan, S., et al. (1996). Language comprehension in language- U.K.: Psychology Press.
learning impaired children improved with acoustically Leonard, L., Eyer, J., Bedore, L., and Grela, B. (1997). Three
modified speech. Science, 271, 81–84. accounts of the grammatical morpheme di‰culties of
Tallal, P., Ross, R., and Curtiss, S. (1989). Familial aggrega- English-speaking children with specific language impair-
tion in specific language impairment. Journal of Speech and ment. Journal of Speech, Language, and Hearing Research,
Hearing Disorders, 54, 167–173. 40, 741–753.
Tomblin, J. B. (1989). Familial concentration of developmental McGregor, K. (1997). The nature of word-finding errors in
language impairment. Journal of Speech and Hearing Dis- preschoolers with and without word-finding deficits. Journal
orders, 54, 287–295. of Speech, Language, and Hearing Research, 40, 1232–1244.
Tomblin, J. B. (1999). Epidemiology and familial transmission Paradis, J., and Crago, M. (2000). Tense and temporality:
of specific language impairment. Paper presented at the A comparison between children learning a second language
annual meeting of the American Association for the Ad- and children with SLI. Journal of Speech, Language, and
vancement of Science, Anaheim, CA. Hearing Research, 43, 834–847.
Tomblin, J. B., Freese, P., and Records, N. (1992). Diagnosing Paul, R. (2000). Predicting outcomes of early expressive
specific language impairment in adults for the purpose of language delay: Ethical implications. In D. Bishop and
pedigree analysis. Journal of Speech and Hearing Research, L. Leonard (Eds.), Speech and language impairments in
35, 832–843. children (pp. 195–209). Hove, U.K.: Psychology Press.
Tomblin, J. B., Records, N., Buckwalter, P., Zhang, X., Smith, Plante, E. (1996). Phenotypic variability in brain-behavior
E., and O’Brien, M. (1997). The prevalence of specific studies of specific language impairment. In M. Rice (Ed.),
language impairment in kindergarten children. Journal of Toward a genetics of language (pp. 317–335). Baltimore:
Speech, Language, and Hearing Research, 40, 1245–1260. Paul H. Brookes.
Syntactic Tree Pruning 405
Rice, M., Wexler, K., and Hershberger, S. (1998). Tense over Friedmann and Grodzinsky (1997) to account for these
time: The longitudinal course of tense acquisition in chil- seemingly unrelated deficits and for the dissociations
dren with specific language impairment. Journal of Speech, between spared and impaired abilities within and across
Language, and Hearing Research, 41, 1412–1431. languages. The TPH is a linguistic generalization, for-
Thal, D., and Katich, J. (1997). Issues in early identification of
mulated within the generative grammar framework,
language impairment: Does the early bird always catch the
worm? In K. Cole, P. Dale, and D. Thal (Eds.), Assessment and was suggested to account for production only.
of communication and language (pp. 1–28). Baltimore: Paul (For a syntactic account of agrammatic comprehension,
H. Brookes. see trace deletion hypothesis; Grodzinsky, 2000.)
According to the TPH, individuals with agrammatic
aphasia are unable to project the syntactic tree up to
its highest node, and their syntactic tree is ‘‘pruned.’’ As
Syntactic Tree Pruning a result, syntactic structures and elements that require
the high nodes of the tree are impaired, but structures
and elements that involve only low nodes are preserved
agrammatism is a syntactic deficit following damage (Fig. 1).
to the left hemisphere, usually in Broca’s area and its According to syntactic theories within the genera-
vicinity (Zurif, 1995). The traditional view concerning tive tradition (e.g., Chomsky, 1995), sentences can be
speech production in agrammatism was that syntactic represented as phrase markers or syntactic trees. In these
ability is completely lost, and agrammatic aphasics rely syntactic trees, content and function words are repre-
on nonlinguistic strategies to concatenate words into sented in di¤erent nodes (head nodes and phrasal nodes)
sentences (Goodglass, 1976; Berndt and Caramazza, (Fig. 1). Functional nodes include, among others,
1980; Caplan, 1985), or that all functional elements are inflectional nodes: an agreement phrase (Agrs P), which
impaired in agrammatic speech production (Grodzinsky, represents agreement between the subject and the verb in
1990; Ouhalla, 1993). However, in recent years empirical person, gender, and number, and a tense phrase (TP),
evidence has accumulated to suggest that the deficit is representing tense inflection of the verb. Finite verbs
actually finer-grained. move from V 0 , their base-generated position within the
Speech production in agrammatism shows an intri- VP, to Agr 0 and then to T 0 in order to check (or collect)
cate and intriguing pattern of deficit. Individuals with their inflection. Thus, the ability to correctly inflect verbs
agrammatism correctly inflect verbs for agreement but for agreement and tense crucially depends on the AgrP
substitute tense inflection; they produce well-formed and TP nodes.
yes/no questions (in some languages), but not Wh ques- The highest phrasal node in the tree is the comple-
tions; they can produce untensed embedding but not mentizer phrase (CP), which hosts complementizers
full relative sentences, and coordination markers but such as ‘‘that’’ and Wh morphemes such as ‘‘who’’ and
not subordination markers. The tree pruning hypothesis ‘‘what’’ that moved from the base-generated position
(TPH) was suggested by Friedmann (1994, 2001) and within the VP. Thus, the construction of embedded
Figure 1. A syntactic tree (Pollock,

1989). The arch represents a possible
impairment site according to the tree
pruning hypothesis. Nodes below the
arch are intact and nodes above it are
impaired.
sentences and Wh questions depends on the CP node complementizer, use direct instead of indirect speech, or
being intact and accessible. produce an ungrammatical sentence (Menn and Obler,
Crucially, the nodes are hierarchically ordered in 1990; Hagiwara, 1995; Friedmann, 1998, 2001). The
the syntactic tree: the lowest node is the verb phrase, the di‰culty posed by embedded constructions is explained
nodes above it are the agreement phrase and the tense by tree pruning, as full relative clauses and sentential
phrase (in this order according to Pollock, 1989), and the embeddings require the CP, and when the CP is
complementizer phrase is placed at the highest point of unavailable, these structures are impaired. Interestingly,
the syntactic tree. embedded sentences that do not require the CP, such as
The TPH uses this hierarchical order and suggests reduced relatives (‘‘I saw the boy crying’’), are produced
that in agrammatism, the syntactic tree is pruned from a correctly by individuals with agrammatism.
certain node and up. Persons with agrammatic aphasia Tree pruning and the inaccessibility of CP cause a
who cannot project the syntactic tree up to the TP node deficit in another important set of structures, questions.
are not able to produce structures that require TP or the Seminal treatment studies by Shapiro, Thompson, and
node above it, CP. Persons with agrammatic aphasia their group (e.g., Thompson and Shapiro, 1995) show
whose tree is pruned at a higher point, CP, are unable that persons with agrammatic aphasia cannot produce
to produce structures that involve the CP. Importantly, well-formed Wh questions. Other studies show that in
nodes below the pruning site are intact, and therefore English, both Wh and yes/no questions are impaired,
structures that require only low nodes, such as AgrP and but in languages such as Hebrew and Arabic, Wh ques-
VP, are well-formed in agrammatic production. tions are impaired but yes/no questions are intact
How does tree pruning account for the intricate (Friedmann, 2002). Again, these dissociations between
pattern of loss and sparing in agrammatic production? and within languages are a result of the unavailability
The implications of tree pruning for three syntactic of the CP: agrammatic aphasics encounter severe di‰-
domains—verb inflection, embedding, and question culties when trying to produce Wh questions because
production—are examined here. Wh morphemes (who, what, where, etc.) reside at CP.
Inflections are impaired in agrammatic production, Yes/no questions in English also require an element at
but in a selective way. In Hebrew and in Palestinian CP, the auxiliary (‘‘Do you like cream cheese’’?), and are
Arabic, tense inflection was found to be severely im- therefore impaired too. Yes/no questions in Hebrew and
paired, whereas agreement inflection was almost intact Arabic, on the other hand, do not require any overt ele-
in a set of constrained tasks such as sentence complement in CP (‘‘You like hummus?’’), and this is why they
tion, elicitation, and repetition (Friedmann, 1994, 2001; are produced correctly.
Friedmann and Grodzinsky, 1997). Subsequent studies The tree pruning hypothesis is also instrumental in
reported a similar dissociation between tense and agree- describing di¤erent degrees of agrammatism severity.
ment in Spanish, Dutch, German, and English. If tense Clinical work shows that some individuals with agram-
and agreement reside in di¤erent nodes in the syntactic matism use a wider range of syntactic structures than
tree, and if TP is higher than AgrP, this dissociation is others and retain more abilities, such as verb inflection,
explained by tree pruning: the TP is inaccessible, and whereas other individuals use mainly simple sentences
therefore tense inflection is impaired, whereas the AgrP and substitute inflections. It is possible to characterize
node, which is below the pruning site, remains intact, the milder impairment as pruning at a higher site in the
and subsequently agreement inflection is intact. This can tree, at CP, whereas the more severe impairment results
also account for the findings from German and Dutch, from pruning at a lower position, TP. Thus, the more
according to which individuals with agrammatic aphasia mildly impaired individuals show impairment in Wh
frequently use nonfinite verbs in sentence-final position, questions and embedding, but their ability to inflect
instead of the required fully inflected main verbs in sec- verbs for both tense and agreement is relatively intact.
ond position (Kolk and Heeschen, 1992; Bastiaanse and More severely impaired individuals, who are impaired
van Zonneveld, 1998). The lack of TP and CP prevents also in TP, show impaired tense inflection in addition to
persons with agrammatic aphasia from moving the verbs impairment in questions and embedding. In both degrees
to T to collect the required inflection, and to C to second of severity, agreement inflection is intact. Crucially, no
position, and therefore they produce the verbs in a non- individual was found to exhibit a deficit in a low node
finite form, which does not require verb movement to without a deficit in higher nodes. In other words, there
high nodes. Verbs that do not move to high nodes stay in was no TP deficit without a deficit in CP, and no deficit
their base-generated position within VP, which is the in AgrP without a deficit in TP and CP.
sentence-final position (see Friedmann, 2000). Finally, tree pruning provides a principled explana-
Individuals with agrammatic aphasia are also known tion for the e¤ect that treatment in one syntactic domain
to use only simple sentences and to avoid embedded has on other domains. Thompson and Shapiro, for
sentences. When they do try to produce an embedded example, report that following question production
sentence, either a relative clause (such as ‘‘I saw the girl treatment, their patients started using sentential embed-
that the grandmother drew’’) or a sentential complement ding. This can be explained if treatment has enhanced
of a verb (‘‘The girl said that the grandmother drew the accessibility of the syntactic node that is common for
her’’), they fail, and stop before the complementizer (the the two structures, CP. Similarly, the decrease in verb
embedding marker ‘‘that,’’ for example), omit the omission that has been reported to accompany tense
Syntactic Tree Pruning 407
inflection improvement can be explained by enhanced Thompson, C. K., and Shapiro, L. P. (1995). Training sentence
accessibility to the inflectional node TP. production in agrammatism: Implications for normal and
Current linguistic theory thus provides a useful tool- disordered language. Brain and Language, 50, 201–224.
box to account for the complicated weave of spared and Zurif, E. B. (1995). Brain regions of relevance to syntactic
processing. In L. Gleitman and M. Liberman (Eds.), An
impaired abilities in speech production in agrammatic
invitation to cognitive science (2nd ed., vol. 1, pp. 381–397).
aphasia. The hierarchical structure of the syntactic tree Cambridge, MA: MIT Press.
enables an account of the highly selective syntactic defi-
cit in agrammatic production in terms of syntactic tree Further Readings
pruning.
Bastiaanse, R., and Grodzinsky, Y. (Eds.). (2000). Grammati-
—Naama Friedmann cal disorders in aphasia: A neurolinguistic perspective. Lon-
don: Whurr.
References Benedet, M. J., Christiansen, J. A., and Goodglass, H. (1998).
Bastiaanse, R., and van Zonneveld, R. (1998). On the relation A cross-linguistic study of grammatical morphology in
between verb inflection and verb position in Dutch agram- Spanish- and English-speaking agrammatic patients. Cor-
matic aphasics. Brain and Language, 64, 165–181. tex, 34, 309–336.
Berndt, R. S., and Caramazza, A. (1980). A redefinition De Bleser, R., and Luzzatti, C. (1994). Morphological pro-
of Broca’s aphasia: Implications for a neuropsychologi- cessing in Italian agrammatic speakers: Syntactic imple-
cal model of language. Applied Psycholinguistics, 1, 225– mentation of inflectional morphology. Brain and Language,
278. 46, 21–40.
Caplan, D. (1985). Syntactic and semantic structures in Friedmann, N., and Grodzinsky, Y. (2000). Split inflection in
agrammatism. In M.-L. Kean (Ed.), Agrammatism. New neurolinguistics. In M.-A. Friedemann and L. Rizzi (Eds.),
York: Academic Press. The acquisition of syntax: Studies in comparative devel-
Chomsky, N. (1995). The minimalist program. Cambridge, opmental linguistics (pp. 84–104). Geneva: Longman Lin-
MA: MIT Press. guistics Library Series.
Friedmann, N. (1994). Morphology in agrammatism: A disso- Friedmann, N., Wenkert-Olenik, D., and Gil, M. (2000). From
ciation between tense and agreement. Master’s thesis, Tel theory to practice: Treatment of agrammatic production in
Aviv University. Hebrew based on the Tree Pruning Hypothesis. Journal of
Friedmann, N. (1998). Functional categories in agrammatic Neurolinguistics, 13, 250–254.
production: A cross-linguistic study. Doctoral dissertation, Grodzinsky, Y., Shapiro, L. P., and Swinney, D. A. (Eds.).
Tel Aviv University. (2000). Language and the brain: Representation and pro-
Friedmann, N. (2000). Moving verbs in agrammatic processing. San Diego, CA: Academic Press.
duction. In R. Bastiaanse and Y. Grodzinsky (Eds.), Kean, M. L. (1995). The elusive character of agrammatism.
Grammatical disorders in aphasia: A neurolinguistic perspec- Brain and Language, 50, 369–384.
tive (pp. 152–170). London: Whurr. Rizzi, L. (1994). Some notes on linguistic theory and language
Friedmann, N. (2001). Agrammatism and the psychological development: The case of root infinitives. Language Acqui-
reality of the syntactic tree. Journal of Psycholinguistic Re- sition, 3, 371–393.
search, 30, 71–90. Ruigendijk, E., van Zonneveld, R., and Bastiaanse, R. (1999).
Friedmann, N. (2002). Question production in agramma- Case assignment in agrammatism. Journal of Speech, Lan-
tism: The Tree Pruning Hypothesis. Brain and Language, guage, and Hearing Research, 42, 962–971.
80, 160–187. Swinney, D., and Zurif, E. B. (1995). Syntactic processing in
Friedmann, N., and Grodzinsky, Y. (1997). Tense and agree- aphasia. Brain and Language, 50, 225–239.
ment in agrammatic production: Pruning the syntactic tree. Thompson, C. K., and Shapiro, L. P. (1994). A linguistic spe-
Brain and Language, 56, 397–425. cific approach to treatment of sentence production deficits
Goodglass, H. (1976). Agrammatism. In H. Whitaker and in aphasia. Clinical Aphasiology, 22, 307–323.
H. A. Whitaker (Eds.), Studies in neurolinguistics (vol. 1, Zurif, E., and Swinney, D. (1994). The neuropsychology of
pp. 237–260). New York: Academic Press. language. In M. A. Gernsbacher (Ed.), Handbook of psy-
Grodzinsky, Y. (1990). Theoretical perspectives on language cholinguistics (pp. 1055–1074). Orlando: Academic Press.
deficits. Cambridge, MA: MIT Press.
Grodzinsky, Y. (2000). The neurology of syntax: Language
use without Broca’s area. Behavioral and Brain Sciences, 23,
1–71. Trace Deletion Hypothesis
Hagiwara, H. (1995). The breakdown of functional categories
and the economy of derivation. Brain and Language, 50,
92–116. Some aphasic syndromes implicate grammar. Known
Kolk, H. H. J., and Heeschen, C. (1992). Agrammatism, para- for almost a century to be grammatically impaired in
grammatism and the management of language. Language speech production (see syntactic tree pruning), indi-
and Cognitive Processes, 7, 89–129. viduals with Broca’s and Wernicke’s aphasia are now
Menn, L., and Obler, L. (1990). Agrammatic aphasia: A
cross-language narrative sourcebook. Philadelphia: John
known to have receptive grammatical deficits as well.
Benjamins. The trace deletion hypothesis (TDH) is a collection of
Ouhalla, J. (1993). Functional categories, agrammatism and ideas about the proper approach to linguistic deficits
language acquisition. Linguistische Berichte, 143, 3–36. subsequent to brain damage in adults, particularly
Pollock, J. Y. (1989). Verb movement, Universal Grammar Broca’s aphasia. Support for a grammatical interpreta-
and the structure of IP. Linguistic Inquiry, 20, 365–424. tion of aphasia comes from a dense body of research from
many laboratories and varied test paradigms, adminis- Broca’s aphasics perform at above chance levels on
tered to large groups of aphasic speakers of a variety of nontransformational sentences and at chance levels on
languages. transformationally derived sentences.
This article explains why a precise characterization of What do such findings mean? Standard principles of
receptive deficits in aphasia is important, and how we error analysis dictate that a binary choice design allows
came about to know it. It first aims to demonstrate that for three logically possible outcomes: above-, below-,
Broca’s and Wernicke’s aphasia (and perhaps other syn- and at-chance performance levels. Above-chance perfor-
dromes) are syntactically selective disorders in which the mance is virtually normal; below-chance amounts to
line dividing impaired from preserved syntax is fine and systematic reversal of roles in SPM; at-chance means
amenable to a precise characterization. guessing, as if the subject were tossing a coin prior
The comprehension of sentences containing gram- to responding. Indeed, the response pattern of Broca’s
matical transformations is impaired in aphasia. Trans- aphasics on transformational sentences resembles guess-
formations, or syntactic movement rules, are important ing behavior. That said, the burden on the TDH is
intrasentential relations. In their various theoretical to provide a deductive explanation of why left anterior
guises, these rules are designed to explain dependency lesions, which allow normal comprehension of sentences
relations between positions in a sentence. As a first without transformations, lead to guessing when trans-
pass, we sacrifice precision for clarity, and say that formational movement is at issue.
transformationally moved constituents are found in The foregoing was a simplified introductory discus-
noncanonical positions (e.g., a subject in a passive sen- sion. In reality, things are somewhat more complicated.
tence, The teacher was watched by the student; The deletion of a trace in certain cases does not hinder
a questioned element in object questions, Which man performance in aphasia. Our characterization needs re-
did Susan see ?). Each of these sentences has finement. The leading idea is to view aphasic sentence
a nontransformational counterpart (e.g., The student interpretation as a composite process—an interaction
watched the teacher; Which man saw Susan?). Roughly, between incomplete syntax (i.e., representations lacking
in transformationally derived sentences, a constituent is traces) and a compensatory cognitive strategy. The in-
phonetically present in one (bolded) position but is terpretation of moved constituents, as we saw, depends
interpreted semantically in another, ‘‘empty’’ position crucially on the trace; without traces, the semantic role
( ), which is annotated by a t (for trace of move- of a moved constituent cannot be determined. Moved
ment) and serves as a link between the semantic role of constituents (bolded) are thus uninterpretable. A non-
the moved constituent and its phonetic realization. linguistic, linear order–based cognitive strategy is
Traces are therefore crucial for the interpretation of invoked to try and salvage uninterpreted NPs hNP1 ¼
transformational sentences. agent; NP2 ¼ themei. In English, constituents moved to
When aphasic comprehension is tested on the con- a clause-initial position will thus be agents. In certain
trast between transformational and nontransformational cases (e.g., subject questions), the strategy compensates
sentences, a big di¤erence is found: Broca’s aphasics correctly: in the subject relative the man who t loves
understand active sentences, subject questions, and the Mary is tall, the head of the relative clause, the man, is
like normally, yet fail on their transformational coun- moved, and receives its semantic interpretation (or the-
terparts. This has led to the claim that in receptive matic role) via the trace. A deleted trace blocks this
language, Broca’s aphasics cannot represent traces of process, and the strategy is invoked, assigning agent-
movement. These traces are deleted from the syntactic hood to the man and yielding the correct semantics: NP1
representations they build, hence TDH. This generaliza- (the man) ¼ agent by strategy, and NP2 (Mary) ¼ theme
tion helps localize grammatical operations in the brain. by the remaining grammar. In other cases the TDH sys-
Furthermore, the highly selective character of this deficit tem results in error: In object relatives—the man who
has major theoretical ramifications for linguistic theory Mary loves t is tall—the agent role assigned by the
and the theory of sentence processing. strategy (acting subsequent to trace deletion) gives rise to
How exactly does the TDH work? Let us look at a misleading representation: NP1 (the man) ¼ agent by
receptive tasks commonly used in research on brain- strategy, and NP2 (Mary) ¼ agent by the grammar. The
language relations. These are sentence-picture-matching result is a semantic representation with two potential
(SPM), grammaticality judgment (GJ), and measure- agents for the depicted action, which leads the patients
ments of reaction time (RT) during comprehension. to guessing behavior. The selective nature of the aphasic
We can consider the SPM method first. A sentence con- comprehension deficit is captured, which is precisely
taining two arguments is presented (e.g., The student what the TDH is designed to explain.
watched the teacher); it is ‘‘semantically reversible’’—the Languages with structural properties di¤erent from
lexical content does not reveal who did what to whom. English lend further support to the TDH. Thus, Chinese,
The task, however, requires exactly that: subjects are Japanese, German and Dutch, Spanish, and Hebrew
usually requested to choose between two action pic- have di¤erent properties, and the performance of Bro-
tures in which the roles are reversed (student watching ca’s aphasics is determined by the TDH as it interacts
teacher, teacher watching student). Reliance on syntax with the particular grammar of each language. Japanese
is critical. On being given a series of such sentences, active sentences, for example, have two configurations:
Trace Deletion Hypothesis 409
[Taro-ga Hanako-o nagutta (Taro hit Hanako)—Subject of structures that linguists assume have a firm neuro-
Object Verb], [Hanako-o Taro-ga t nagutta—Object biological basis; they a¤ord an unusual view on the inner
Subject t Verb]. These simple structures mean the same workings of the human sentence processing device; they
and are identical on every dimension except movement. connect localized neural tissue and linguistic concepts
Broca’s aphasics are above chance in comprehending the in a more detailed way than ever before. Finally, there
former and at chance level on the former, in keeping appears to be clinical (therapeutic) value to the view that
with the TDH. A similar finding is obtained in Hebrew. Broca’s aphasia and Wernicke’s aphasia are grammati-
In Chinese, an otherwise SVO language like English, cally selective: although currently experimental, prelimi-
heads of relative clauses (annotated by the subscript h) nary results suggest that therapeutic methods guided by
follow the relative (unlike English, in which they precede this view are somewhat e‰cacious.
it, as the example shows):
—Yosef Grodzinsky
(1) [t zhuei gou] de mauh hen da
chase dog that cat very big Further Readings
‘‘the cath that [t chased the dog] was very big’’
Ben-Shachar, M., Hendler, T., Kahn, I., Ben-Bashat, D.,
(2) [mau zhuei t] de gouh hen xiao and Grodzinsky, Y. (2001). Grammatical transformations
cat chased that dog very small activate Broca’s region: An fMRI study. Paper presented at
‘‘the dogh that [the cat chased t] was very small’’ a meeting of the Cognitive Neuroscience Society, New
York.
This structural contrast leads to remarkable contrastive Caplan, D., Alpert, N., and Waters, G. (1998). E¤ects of syn-
performance in Broca’s aphasia: in Chinese subject rela- tactic structure and propositional number on patterns of
tives (1), the head of the relative (the cat) moves to the regional blood flow. Journal of Cognitive Neuroscience, 10,
front of the sentence, lacks a role, and is assigned agent 541–552.
Goodglass, H. (1968). Studies in the grammar of aphasics. In
by the strategy, which leads to a correct representation
S. Rosenberg and J. Koplin (Eds.), Developments in applied
in which the cat indeed chases the dog. In Chinese, the psycholinguistics research. New York: Macmillan.
head (mau) also moves, yet to sentence-final position, Grodzinsky, Y. (1986). Language deficits and the theory of
and the strategy (incorrectly) assigns it the theme role. syntax. Brain and Language, 27, 135–159.
This representation now has two themes, the dog and Grodzinsky, Y. (1990). Theoretical perspectives on language
the cat, and the result is guessing. Similar considera- deficits. Cambridge, MA: MIT Press.
tions that hold for the object relatives are left to the Grodzinsky, Y. (2000). The neurology of syntax: Language
reader. English and Chinese thus yield mirror-image use without Broca’s area. Behavioral and Brain Sciences, 23,
results, which correlate with a relevant syntactic contrast 1–71.
between the two languages. Further intriguing cross- Grodzinsky, Y., and Finkel, L. (1998). The neurology of empty
categories: Aphasics’ failure to detect ungrammaticality.
linguistic contrasts exist.
Journal of Cognitive Neuroscience, 10, 281–292.
Moving to other experimental tasks, we encounter Grodzinsky, Y., Pierce, A., and Marakovitz, S. (1991). Neuro-
remarkable cross-methodological consistency. When the psychological reasons for a transformational analysis of
detection of a violation of grammaticality critically verbal passive. Natural Language and Linguistic Theory, 9,
depends on traces, the TDH predicts disability on the 431–453.
part of Broca’s aphasics. This is indeed the case. When Grodzinsky, Y., Piñango, M., Zurif, E., and Drai, D. (1999).
given GJ tasks, they show di¤erential performance be- The critical role of group studies in neuropsychology:
tween sentences with and without traces, again in keep- Comprehension regularities in Broca’s aphasia. Brain and
ing with the TDH. Finally, the rich literature on timed Language, 67, 134–147.
language reception in Broca’s aphasia (RT) suggests Hagiwara, H. (1993). The breakdown of Japanese passives and
Q-role assignment principles by Broca’s aphasics. Brain and
that on-line computation of trace-antecedent relations is Language, 45, 318–339.
compromised. Just, M. A., Carpenter, P. A., Keller, T. A., Eddy, W. F., and
This type of deficit is not restricted to Broca’s apha- Thulborn, K. R. (1996). Brain activation modulated by
sia. On at least some tests, Wernicke’s aphasics perform sentence comprehension. Science, 274, 114–116.
like Broca’s aphasics. There are contrasts between the Schwartz, M. F., Linebarger, M. C., Sa¤ran, E. M., and Pate,
two groups, to be sure, yet contrary to past views, there D. C. (1987). Syntactic transparency and sentence inter-
are overlapping deficits. Although this new picture is just pretation in aphasia. Language and Cognitive Processes, 2,
beginning to unfold, independent evidence from func- 85–113.
tional imaging of normal populations (fMRI) supports Shapiro, L. P., Gordon, B., Hack, N., and Killackey, J. (1993).
it. Both Broca’s and Wernicke’s regions of the healthy Verb-argument structure processing in complex sentences
in Broca’s and Wernicke’s aphasia. Brain and Language, 45,
brain are involved in transformational analysis, although 423–447.
likely in di¤erent ways. Shapiro, L. P., and Levin, B. A. (1990). Verb processing during
This series of cross-methodological findings from sentence comprehension in aphasia. Brain and Language,
di¤erent languages and populations, and the TDH as a 38, 21–47.
generalization over them, have a host of important the- Shapiro, L. P., and Thompson, C. K. (1994). The use of
oretical implications. They show that the natural classes linguistic theory as a framework for treatment studies in
aphasia. In P. Lemme (Ed.), Clinical aphasiology (vol. Zurif, E. B. (1995). Brain regions of relevance to syntactic
22). Austin, TX: Pro-Ed. processing. In L. Gleitman and M. Liberman (Eds.), An
Swinney, D., and Zurif, E. (1995). Syntactic processing in invitation to cognitive science (2nd ed., vol. 1). Cambridge,
aphasia. Brain and Language, 50, 225–239. MA: MIT Press.
Thompson, C. K., Shapiro, L. P., Ballard, K. J., Jacobs, B. J., Zurif, E. B., and Caramazza, A. (1976). Linguistic structures in
Schneider, S. S., and Tait, M. E. (1997). Training and gen- aphasia: Studies in syntax and semantics. In H. Whitaker
eralized production of WH- and NP-movement structures and H. H. Whitaker (Eds.), Studies in neurolinguistics (vol.
in agrammatic aphasia. Journal of Speech, Language, and 2). New York: Academic Press.
Part IV: Hearing
Amplitude Compression in Hearing termined frequency weighting factor. This cube root
dependence had first been described by Fletcher the year
Aids before (Fletcher, 1923a). Fletcher and Steinberg con-
cluded that
In the latter part of the 1980s, wide dynamic range com- it became apparent that the non-linear character of the ear[’s]
pression (WDRC) amplification was introduced into the transmitting mechanism was playing an important part in
hearing aid market. Within a few years it was widely determining the loudness of the complex tones (p. 307).
recognized as a fundamentally important new amplifica-
tion strategy. Within 10 years nearly every hearing aid Power law relations between the intensity of the
manufacturer had developed a WDRC product. physical stimulus and the psychophysical response are
Compression is useful as a processing strategy be- examples of Stevens’ law. Fletcher’s 1923 loudness
cause it compensates for the loss of cochlear outer hair growth equation, which for tones was found to be
cells, which compress the dynamic range of sound within L(I ) z I 1=3 , where L is the loudness and I is the acoustic
the cochlea. Sensorineural hearing loss is characterized intensity, established the important special case of Ste-
by loudness recruitment, which results from damage to vens’ law for sound intensity and pure-tone loudness.
the outer hair cells. WDRC compensates for this hair Their method is described in the caption of Figure 1. We
cell disorder, ideally restoring the limited dynamic range now know that Fletcher and Steinberg were observing
of the recruiting ear to that of the normal ear. This the compression induced by the cochlear outer hair cells
article reviews the history of loudness research, loudness (OHCs).
recruitment, cochlear compression e¤ects (such as the
upward spread of masking) that result from and char- Loudness Additivity
acterize OHC compression, and finally, outer hair cell In 1933 Fletcher and Munson published their seminal
physiology. The WDRC processing strategy is ex- paper on loudness. This paper detailed (1) the relation of
plained, and a short history of the development of isoloudness across frequency (loudness level, or phons);
WDRC hearing aids is provided. (2) their loudness growth argument, described below; (3)
a model showing the relation of masking to loudness;
Compression and Loudness and (4) the basic idea behind the critical band (critical
Acoustical signal intensity is defined as the flow of ratio).
acoustic energy in watts per meter squared (w/m 2 ). Regarding the second point, rather than thinking di-
Loudness is the perceptual intensity, measured in either rectly in terms of loudness growth, they tried to find a
sones or loudness units (LU). One sone is defined as the formula to describe how the loudnesses of several stimuli
loudness of a 1 kHz tone at 40 dB SPL, while 1 LU is combine. From loudness experiments with low- and
defined as the loudness at threshold. Zero loudness cor- high-pass speech and complex tones, and other unpub-
responds to zero intensity.1 lished experiments over the previous 10 years, they
For the case of pure tones, one sone is A975 LU. showed that, across critical bands, loudness (not inten-
Isoloudness intensity contours were first determined in sity) adds. Fletcher’s working hypothesis (Fletcher and
1927 by Kingsbury (Kingsbury, 1927; Fletcher, 1929, Steinberg, 1924) was that each signal is nonlinearly
p. 227). Such curves describe the relation between compressed in narrow bands (critical bands) by the coch-
equally loud tones (or narrow bands of noise) at di¤erent lea, neurally coded, and the resulting band rates are
frequencies. The intensity of an equally loud 1 kHz tone added.2 The 1933 experiment clearly showed how loud-
is called the loudness level, which has units of phons, ness (i.e., the neural rate, according to Fletcher’s model)
measured in w/m 2 . In 1923 Fletcher, and again in 1924 adds. Fletcher and Munson also determined the cochlear
Fletcher and Steinberg, published the first key papers on compression function G(I ) described below for tones
the measurement of the loudness for speech signals and speech. We now know that this function dramati-
(Fletcher, 1923a; Fletcher and Steinberg, 1924). In the cally changes with sensorineural hearing loss.
1924 paper the authors state Today this model concept is called loudness additivity.
ðy Their hypothesis was that when two equally loud tones
are presented together but separated in frequency so that
10a=30 ¼ G( f )10a( f )=30 df
0 they do not mask each other, the result is ‘‘twice as
loud.’’ The verification of this assumption lies in the
. . . the use of the above formula involved a summation of the predictive ability of the additivity assumption. For ex-
cube root of the energy rather than the energy.
ample, they showed that 10 tones that are all equally
where a is the relative intensity in dB SL, a is the ‘‘ef-
fective’’ loudness level, and G( f ) is an empirically de-
2There seems to be some confusion about what is added
within critical bands. Clearly, pressure must add within a criti-
1Fletcher and Munson (1933) were able to measure the cal band, or else we would not hear beats. Many books and
loudness below the single pure-tone threshold by using 10 papers assume that intensity adds within each critical band.
equally loud tones. This proves that the loudness at threshold is This is true in the ensemble sense for random signals, but such
not zero (Buus, Musch, and Florentine, 1998). a scheme will not work for tones on a single trial basis.
414 Part IV: Hearing
Figure 1. E¤ect of low- and high-pass filtering on the speech 17% of its original energy. The corresponding relative level for
loudness level. The broadband speech is varied in level until it 1 kHz high-pass-filtered speech is 7%. These functions are
is equal in loudness to the low-pass-filtered speech. This is shown as the solid lines in the figure. The high- and low-pass
repeated for each value of the filter cuto¤ frequency. The loudnesses do not add to 1 since the two solid lines cross at
experiment was then repeated for the high-pass speech. The about 11%. After taking the cube root, however, the loudness
percent reduction of the equally loud broadband speech energy curves cross at 50% (i.e., at 0.8 kHz, 0:125 1=3 ¼ 0:5), and
is plotted against the filter cuto¤ frequency. For example, if therefore sum to 100%. A level of 11.3 m BARS (dynes/cm 2 )
broadband speech is to be equal in loudness to speech that has corresponds to 1.13 Pa, which is close to 95 dB SPL. (From
been low-pass-filtered to 1 kHz, it must be reduced in level to Fletcher, 1929, p. 236.)
loud (they will be at di¤erent intensities, of course), of a (I ) found by Fletcher in di¤erent papers published
when played together, are 10 times louder, as long as between 1933 and 1953 are shown in Figure 2.
they do not mask each other. As another example,
Fletcher and Munson found that loudness additivity The Result. These two-tone loudness matching experi-
held for signals ‘‘between the two ears’’ as well as for ments showed that for f1 between 0.8 and 8.0 kHz, and
signals ‘‘in the same ear.’’ When the tones masked each f2 far enough away from f1 (above or below) so that
other (namely, when their masking patterns overlapped),
additivity still holds, but over an attenuated set of pat-
terns (Fletcher and Munson, 1933). Their 1933 model is
fundamental to our present understanding of auditory
sound processing.
The Method. A relative scale factor (gain) a may be

defined either in terms of the pressure or in terms of the
intensity. Since it is the voltage on the earphone that is
scaled, the most convenient definition of a is in terms of
the pressure, P. It is typically expressed in dB, given by
20 log10 (a).
Two equally loud tones were matched in loudness
by a single tone scaled by a . The asterisk indicates
this special value of a. The resulting definition of a is
given by
L(a P) ¼ 2L(P); (1)
which says that, when the single tone pressure, P, is
scaled by a ¼ a , the loudness, L(a P), is twice as loud
as the unscaled signal. Given the relative loudness level
(in phons) of ‘‘twice as loud,’’ defined by a (I ), the Figure 2. Loudness and a as a function of the loudness level, in
loudness growth function G(I ) may be found by graphi- phons. When a is 9 dB, loudness increases as the cube root of
cal methods or by numerical recursion, as shown in intensity. When a is 3 dB, loudness is proportional to inten-
Fletcher (1953, p. 190) and in Allen (1996b). The values sity. (From Fletcher, 1953.)
Amplitude Compression in Hearing Aids 415
there is no masking, the relative level a was found to be

A9 dB (ca. 1953) for P1 above 40 dB SPL. This value
decreased linearly to 2 dB for P1 at 0 phons, as shown in
Figure 2.
From this formulation, Fletcher and Munson found
that at 1 kHz, and above 40 dB SPL, the pure-tone
loudness G is proportional to the cube root of the signal
intensity [G(I ) ¼ (P=Pref ) 2=3 ] because a ¼ 2 3=2 (9 dB).3
Below 40 dB SPL, loudness was frequently assumed
to be proportional to the intensity [G(I ) ¼ (P=Pref ) 2 ,
a ¼ 2 1=2 , or 3 dB]. Figure 2 shows the loudness growth
curve and a given in Fletcher (1953, p. 192, Table 31) as
well as the 1938 and 1933 papers. As may be seen from
the figure, in 1933 they found values of a as high as
11 dB near 55 dB SL. Furthermore, the value of a at
Figure 3. Shown here is a recruitment-type loss corresponding
low levels is not 3 dB but is closer to 2 dB. Fletcher’s to a variable loss of gain on a log-log scale. The upper curve
statement that loudness is proportional to intensity (a is corresponds to the normal loudness curve; the lower curve
3 dB near threshold) was an idealization that was ap- corresponds to a simulated recruiting hearing loss. For an in-
pealing, but not supported by actual results. tensity level change from 56–60 dB, the loudness change is
smaller for the recruiting ear (0.5 sones) than in the normal
Recruitment and the Rate of Loudness Growth ear (1 sone). The belief that the loudness slope in the damaged
ear is greater led to the concept that the JND in the damaged ear
Once loudness had been quantified and modeled in 1933 should be smaller (this was the rationale behind the SISI test)
by Fletcher and Munson, Mark Gardner, a close per- (see Martin, 1986, p. 160). Both conclusions are false.
sonal friend and colleague of Harvey Fletcher, began
measuring the loudness growth of hearing-impaired
subjects. In about 1934 Gardner first discovered the ef-
fect that has become known as loudness recruitment 1.0 sone in the normal ear and 0.5 sones in the recruiting
(Gardner, 1994), first reported by Steinberg and Gardner ear. While the slope looks steeper on a log plot, the
in 1937. actual rate of loudness growth (in sones) in the recruiting
In terms of the published record, Fowler, a New York ear is smaller. Its misdefinition as ‘‘the abnormally rapid
ear, nose, and throat physician, is credited with the dis- growth of loudness’’ has lead to some serious conceptual
covery of recruitment in 1936. Fowler was in close touch errors about loudness and hearing loss. Correct state-
with the work being done at Bell Labs and was friendly ments about loudness recruitment include ‘‘the abnormal
with Wegel and Fletcher (they published papers to- growth of loudness’’ or ‘‘the abnormally rapid growth of
gether). Fowler made loudness measurements on his relative loudness DL=L (or log loudness).’’
many hearing-impaired patients and was the first to
publish the abnormal loudness growth results. Fowler Fowler’s Mistake. After learning from Wegel about the
coined the term recruitment (Fowler, 1936). yet unpublished recruitment measurements of Steinberg
Steinberg and Gardner (1937) were the first to cor- and Gardner, E. P. Fowler attempted to use recruitment
rectly identify recruitment as a loss of compression. to diagnose middle ear disease (Fowler, 1936). In cases
Since most sensorineural hearing loss is cochlear in ori- of hearing loss involving financial compensation, Fowler
gin, it follows that the loss of compression is in the stated that recruitment was an ‘‘ameliorating’’ factor
cochlea. Those interested in the details are referred to the (Fowler, 1942). In other words, he viewed recruitment as
following articles (Neely and Allen, 1997; Allen, 1997a; a recovery from hearing loss—its presence indicated a
Allen, 1999a). reduced hearing loss at high intensities. Thus, given two
people with equal threshold losses, the person having the
Loudness Growth in the Recruiting Ear. Figure 3 shows least amount of recruitment was given greater financial
a normal loudness growth function along with a simu- compensation (the loss could be due to middle ear dis-
lated recruiting loudness growth function. It is necessary ease, and the individual would receive greater compen-
to plot these functions on a log-log (log loudness versus sation than someone having a similar sensorineural loss).
dB SPL) scale because of the dynamic ranges of loudness In my view, it was Fowler’s poor understanding of
and intensity. The use of the dB and log loudness has recruitment that led to such terms as complete recruit-
resulted in a misinterpretation of the rate (slope) of ment versus partial recruitment and hyper-recruitment.
recruitment. In the figure we see that for a 4 dB change Complete recruitment means that the recruiting ear and
in intensity about 58 dB SPL, the loudness changes by the normal ear perceive the same loudness at high
intensities. Steinberg and Gardner described such a loss
3Since G(I ) ¼ (I=Iref ) b ¼ (P=Pref ) 2b , L(P) ¼ (P=Pref ) 2b . as a variable loss (i.e., sensorineural loss) and partial
Thus, Equation 1 gives (a P) 2b ¼ 2P 2b , or (a ) 2b ¼ 2, giving recruitment as a mixed loss (i.e., having a conductive
a ¼ 2 1=2b . When b ¼ 1=3, a ¼ 2 3=2 . component that acts as a frequency-dependent fixed
Figure 4. Masking data for a 400 Hz masker. The ab-

scissa is the intensity of the masker Im while the ordinate
is the threshold intensity of the probe Ip (Im ; fp ) (the
maskee), each in dB SL. Each curve corresponds to a
probe of a di¤erent frequency, labeled in kHz. Two
dashed lines are superimposed on the heavy curves cor-
responding to fp of 0.45 kHz (slope ¼ 1.0 dB/dB) and
3 kHz (slope ¼ 2.4 dB/dB). The curves for fp of 1, 2,
and 4 kHz are shown by light lines. Probe frequencies
below 1 kHz are shown as light dashed lines. (Data
from Wegel and Lane, 1924.)
attenuation). They, and Fowler, verified the conductive close to linear. Such near-linear growth is called Weber’s
component by estimating the air-bone gap. law. The masked-threshold probe intensity4 Ip is equal
Steinberg and Gardner attempted to set the record to the masker intensity Im plus 1 JND DI , namely
straight. They clearly understood what they were dealing
Ip (Im ) ¼ Im þ DI (Im ):
with, as is indicated in the following quote (Steinberg
and Gardner, 1937, p. 20): The masking appears to be linear because the relative
JND (e.g., DI =I A 0:1) is small. As the intensity of the
Owing to the expanding action of this type of loss it would
be necessary to introduce a corresponding compression in the masker is increased, the variations in the JND DI (Im )
amplifier in order to produce the same amplification at all with respect to the masker intensity Im appear negligible,
levels. making Ip (Im ) appear linear. Weber’s law is therefore
observed when the probe is within a critical bandwidth of
This model of hearing and hearing loss, along with the masker. One sees deviations from Weber’s law when
the loudness models of Fletcher and Munson (1933), are plotting more sensitive measures, such as DI (Im )=Im
basic to the eventual quantitative understanding of (Riesz, 1928).
cochlear signal processing and the cochlea’s role in de-
tection, masking, and loudness in normal and impaired Upward Spread of Masking. The suppression threshold,
ears. The work by Fletcher (1950) and Steinberg and Is ( fp ; fm ), is defined as the smallest masker intensity
Gardner (1937), and work on modeling hearing loss and such that the slope of Ip (Im ; fp ) with respect to Im is
recruitment by Allen (1991) support this view. greater than 1. Since the probe slope is close to 2.4 dB/
dB over a range of intensities, this threshold is best esti-
Compression and Masking mated from the intercept of the Ip (Im ; fp ) regression line
In 1922, one year after publishing the first threshold with the abscissa. For the 3 kHz probe, the suppression
measurements with Wegel, Fletcher published mea- threshold intensity is 60 dB SL. Such suppression is
surements on the threshold of hearing in the presence of only seen for probes greater than the masker frequency
a masking tone (Fletcher, 1923a, 1923b). Wegel and ( fp > fm ). For probes that are su‰ciently higher in fre-
Lane’s classic and widely referenced paper on masking, quency than the masker (e.g., fp b 2 kHz in Fig. 4), the
and their theory of the cochlea, soon followed, in 1924. masking is close to zero dB SL until the masker intensity
In Figure 4 we reproduce one of the figures from reaches the suppression threshold at about 50–60 dB SL.
Fletcher’s 1923 publication (which later appeared in In other words, the masked threshold, defined as the in-
the 1924 Wegel paper) showing the upward spread of tensity where the masking of the probe begins, and the
masking due to a 400 Hz tone. As we shall see, these suppression threshold are nearly the same. The suppres-
curves characterize the nonlinear compressive e¤ects of sion threshold for the dashed-line, superimposed on the
outer hair cell compression. ‘‘solid-fat’’ fp ¼ 3 kHz probe curve in Figure 4, is 60 dB
SL; its slope is 2.4 dB/dB. For every 1 dB increase in
Critical Band Masking. When the probe is near the
masker in frequency, as in the case of the 0.45 kHz 4An asterisk is used to indicate that the intensity is at
probe tone shown in Figure 4, the growth of masking is threshold.
the masker intensity Im , the probe threshold intensity

Ip (Im ; fm ; fp ) must be increased by 2.4 dB to return it to
its detection threshold (Delgutte, 1990; Allen, 1997c).
Namely, above Is ( fp ¼ 3 kHz, fm ¼ 0:4 kHz) ¼ 60 dB
SL (i.e., Im > 10 6 Im ),
2:4
Ip (Im ) Im

¼ : (2)
Im Is
From Figure 4, a surprising and interesting crossover
occurs near 65–70 dB for the 1 kHz probe. As high-
lighted by the dashed box, the 1 kHz probe threshold
curve crosses the 0.45 kHz probe threshold curve. At
high levels, there is more masking at 1 kHz than at the
probe frequency. This means that the masker excitation
pattern peak has shifted toward the base of the cochlea
(i.e., toward the stapes). Follow-up forward masking
studies have confirmed this observation (Munson and
Gardner, 1950, Fig. 8). McFadden (1986) presents an
excellent and detailed discussion of this interesting ‘‘half- Figure 5. This sketch shows a conceptual view of the e¤ect of a
octave shift’’ e¤ect that is recommended reading for all low-frequency suppressive masker on a high-frequency near-
serious students of hearing loss. threshold probe, as a function of place. The abcissa for A and
C is suppressor (masker) intensity in dB, while B and D are a
Downward Spread of Masking. For probes lower than cochlear place axis, where the base (stapes end) is at the origin.
the masker frequency (Fig. 4, 0.25 kHz), while the Panels A and B show the IHC cilia response R of a high-
threshold is low, the masking is weak, since it has a slope frequency, low-level probe, of fixed 20 dB SL intensity, being
that is less than linear. This may be explained by the suppressed by a high-intensity, low-frequency ( fm f fp ) vari-
able intensity suppressive masker having intensities Is of 50, 60,
migration of the more intense high-frequency (basal) and 70 dB SL. Panels A and B correspond to the isoprobe level
masker excitation pattern away from the weaker probe of 20 dB SL. Even though the probe input intensity is fixed at
excitation pattern (Allen, 1999b). 20 dB SL, the cilia response to the probe Rp is strongly sup-
pressed by the masker above the suppression threshold, indi-
The Physiology of Compression cated by the vertical dashed line in A and C. The lower panels
C and D show what happens when the high-frequency probe
What is the source of Fletcher’s tonal cube root loudness intensity is returned to threshold, indicated by Ip (Im ). To re-
growth (i.e., Stevens’ Law)? Today we know that the store the probe to threshold requires an increase of 1 dB/dB of
basilar membrane motion is nonlinear in intensity, as suppressor level, due to the linear suppressor growth in the
first described by Rhode in 1971, and that cochlear high-frequency tail region of the probe, at Xp .
OHCs are the source of the basilar membrane non-
linearity. The history of this insight is both interesting
and important. threshold. The close relationship between the two e¤ects
In 1937, Lorente de Nó theorized that abnormal has only recently been appreciated (Allen, 1997c, 1999b).
loudness growth associated with hearing loss (i.e., The 2TS and upward spread of masking (USM) e¤ects
recruitment) is due to hair cell damage (Lorente de Nó, are important to the hearing aid industry because they
1937). From noise trauma experiments on humans one quantify the normal cochlear compression that results
may conclude that recruitment occurs in the cochlea from OHC processing. To fully appreciate the USM and
(Carver, 1978). Animal experiments have confirmed this 2TS, we need to describe the role of the OHC in non-
prediction and have emphasized the importance of OHC linear cochlear processing. In Figure 5 the operation of
loss (Liberman and Kiang, 1978; Liberman and Dodds, USM/2TS is summarized in terms of neural excitation
1984). This loss of OHCs causes a loss of the basilar patterns.
membrane compression (Pickles, 1982, p. 287). It fol-
lows that the cube root tonal loudness growth starts with Cochlear Nonlinearity: How?
the nonlinear compression of basilar membrane motion
due to stimulus-dependent voltage changes within the We still do not know precisely what controls the basilar
OHC. membrane nonlinearity, although we know that it results
from OHC sti¤ness and length changes (He and Dallos,
Two-Tone Suppression. The neural correlate of the 2000), which are a function of the OHC membrane
2.4 dB/dB psychoacoustic suppression e¤ect (the up- voltage (Santos-Sacchi and Dilger, 1987). This voltage is
ward spread of masking) is called two-tone suppression determined by shearing displacement of the hair cell cilia
(2TS) (Sachs and Abbas, 1974; Fahey and Allen, 1985; by the tectorial membrane (TM). The most likely cause
Delgutte, 1990). Intense low-frequency tones attenuate of nonlinear basilar membrane mechanics is changes
low-level high-frequency tones to levels well below their in the micromechanical impedances within the organ of
Corti. This conclusion follows from ear canal impedance The formula for the Johnson RMS thermal electrical
measurements, expressed in terms of nonlinear power membrane noise voltage jVc j due to cell membrane
reflectance, defined as the retrograde to incident power leakage currents is given by5 hjVc j 2 i ¼ 4kTBR, where B
ratio (Allen et al., 1995). In a transmission line, the is the cell membrane electrical bandwidth, k is Boltz-
reflectance of energy is determined by the ratio of the mann’s constant, T is the temperature in degrees Kelvin,
load impedance at a given point divided by the local and R is the cell membrane leakage resistance. The cell
characteristic impedance of the line. It is this ratio that is bandwidth is limited by the membrane capacitance C.
level dependent (i.e., nonlinear). The relation between the cell RC time constant, t ¼ RC,
and the cell bandwidth is given by B ¼ 1=t, leading to
Two Models. It is still not clear how the cochlear gain rffiffiffiffiffiffiffiffiffi
4kT
is reduced, and that is the subject of intense research. jVc j ¼ : (3)
There are two basic but speculative theories. The first is C
a popular but qualitative theory, referred to as the coch- The cell membrane capacitance C has been determined
lear amplifier (Kim et al., 1980). The second is a more to be about 9.6 pF for the IHC (Kros and Crawford,
physical and quantitative theory that requires two basic 1990) and 20 pF for the OHC. From Equation 3,
assumptions. The first assumption is that the tectorial Vc ¼ 21 mV for IHCs at body temperature (T ¼ 310 K).
membrane acts as a bandpass filter on the basilar mem- Although the maximum DC voltage across the cilia is
brane signal (Allen, 1980). The second assumption is 120 mV, the maximum RMS change in cell voltage that
that the OHCs dynamically ‘‘tune’’ the basilar mem- has been observed is about 30 mV (I. J. Russell, per-
brane (i.e., the cochlear partition) by changing its net sonal communication). The ratio of 30 mV to the noise
sti¤ness, causing a dynamic migration in the character- floor voltage (21 mV), expressed in dB, is 63 dB. Thus it
istic place with intensity (Allen, 1997b). Migration is is impossible for the IHC to code the 120 dB dynamic
known to occur (McFadden, 1986), so this assumption is range of the acoustic signal. Because it is experimentally
founded on experimental dogma. observed that, taken as a group, IHCs do code a wide
We cannot yet decide which, if either, of these two dynamic range, the nonlinear motion of the basilar
theories is correct, but for the present discussion, it is not membrane must be providing compression within the
important. The gain of the inner hair cell (IHC) cilia mechanics of the cochlea prior to IHC detection (Allen
excitation function is signal dependent, compressing the and Neely, 1992; Allen, 1997a).
120 dB dynamic range of the acoustic stimulus to less
than 60 dB. When the OHC voltage becomes depolar- Summary. Based on a host of data, the physical source
ized, the OHC compliance increases, and the character- of cochlear hearing loss and recruitment is now clear.
istic frequency (CF) of the basilar membrane shifts The dynamic range of IHCs is limited to about 50 dB.
toward the base, reducing the nonlinear wide dynamic The dynamic range of the sound level at the eardrum,
range compression. however, is closer to 100–120 dB. Thus, there is a di‰-
culty in matching the dynamic range at the drum to that
Cochlear Nonlinearity: Why? of the IHC. This is the job of the OHCs.
It is known that OHCs act as nonlinear elements. For
The discussion above leaves unanswered why the OHCs example, the OHC soma axial sti¤ness, Kohc , depends
compress the signal on the basilar membrane. The an- directly on the voltage drop across the cell membrane,
swer to this question has to do with the large dynamic Vohc . As the OHC cilia excitation is varied from ‘‘soft’’
range of the ear. In 1922 Fletcher and Wegel were the to ‘‘loud,’’ the OHC membrane voltage is depolarized,
first to use electronic instruments to measure the thresh- causing the cell to increase its compliance (and length).
old and upper limit of human hearing (Fletcher and The result is compression due to a decrease in the IHC
Wegel, 1922a, 1922b), thereby establishing the 120 dB (cochlear) signal gain.
dynamic range of the cochlea.
The IHCs are the cells that process the sound before it Multiband Compression
is passed to the auditory nerve. Based on the Johnson
(thermal) noise within the IHC, it is possible to accu- During the two decades from 1965 to 1985, the clinical
rately estimate a lower bound on the RMS voltage audiological community was attempting to answer the
within the IHC. From the voltage drop across the cilia, question: Are compression hearing aids better than a
we may estimate the upper dynamic range of the cell. well-fitted linear hearing aid? A number of researchers
The total dynamic range of the IHC must be less than concluded that linear fitting is always superior to com-
this ratio, or less than 65 dB (e.g., 55–60 dB) (Allen, pression. When properly adjusted, linear filtering is close
1997b). The dynamic range of hearing is about 120 dB. to optimum for speech whose level has been adjusted for
Thus, the IHC does not have a large enough dynamic
range to code the dynamic range of the input signal. 5While the thermal noise is typically dominated by the shot
Spread-of-excitation models and neuron threshold dis- noise, the shot noise is more di‰cult to estimate. Since we are
tribution of neural rate do not address this fundamental trying to bound the dynamic range, the thermal noise is a
problem. Nature’s solution to this problem is the OHC- better choice for this purpose. The shot noise reduces the dy-
controlled basilar membrane compression. namic range further, strengthening the argument.
optimum listening. Papers that fall in this category in- build a multiband compression hearing aid as an inter-
clude Braida et al. (1979) and Lippmann et al. (1981). nally funded venture. Eventually Bell Labs built a digital
However, Lippmann et al. are careful to point out the wearable hearing aid prototype. It quickly became ap-
flaw in preadjusting the level (see p. 553). parent that the best processing strategy compromise was
Further criticisms were made by Plomp (1988, 1994), a two-band compression design that was generically
who argued that compression would reduce the modu- similar to the Villchur scheme. With my colleague Vin-
lation depth of the speech. However, compression of a cent Pluvinage, we designed digital hardware wearable
broadband signal does not reduce the modulations in hearing aids, and with the help of Joe Hall and David
sub-bands. Berkley of AT&T, and Patricia Jeng, Harry Levitt,
All these results placed the advocates of compres- Arlene Newman, and many others from City University
sion in a defensive minority position. Villchur vigo- of New York, we developed a fitting procedure and ran
rously responded to the challenge of Plomp, saying that several field trials (Allen et al., 1990). AT&T licensed its
Plomp’s argument was flawed (Villchur, 1989). The filter hearing aid technology to ReSound on February 27, 1987.
bandwidths used in WDRC hearing aids are not narrow Unlike today, in 1990 multiband compression was
enough to reduce the modulations in critical bandwidths. widely unaccepted, both clinically and academically
Other important papers arguing for compression include (Dillon, 2001). Why is this? It was, and remains, di‰cult
Hickson (1994), Killion (1996a, 1996b), Killion et al. to show quantitatively the nature of the improvement of
(1990), and Mueller and Killion (1996). A physiology WDRC. It is probably fair to say that only with the
paper that is frequently cited in the compression litera- success of ReSound’s WDRC hearing aid in the mar-
ture is Ruggero and Rich (1991). ketplace has the clinical community come to accept
Other work that found negative results used com- Villchur’s claims.
pression parameters that were not reasonable and time It may be possible to clarify the acceptance issue by
constants that were too slow. Long time constants with presenting two common views of what WDRC is and
compression produce very di¤erent results and are not in why it works. One’s adopted view strongly influences
the category of syllabic compression. Such systems typi- how he or she thinks about compression. They are the
cally have artifacts, such as noise ‘‘pumping,’’ or they articulation index (AI) view and the loudness view.
simply do not react quickly enough to follow a lively The articulation index view is based on the observa-
conversation. Imagine, for example, a listening situation tion that speech has a dynamic range of about 30 dB in
with a quiet and a loud talker having a conversation. In one-third octave frequency bands (French and Stein-
this situation, the compressor gain must operate at syl- berg, 1947). The assumption is that the AI will increase
labic rates to be e¤ective. The use of multiple bands in a recruiting ear as the compression is increased, if the
ensures that a signal in one frequency band does not speech is held at a fixed loudness. This view has led to
control the gain in another band. Slow-acting compres- unending comparisons between the optimum linear
sion (AGC) may be fine for watching television, but not hearing aid and the optimum compression hearing aid.
for conversational speech. Such systems might be viewed The loudness view is based on restoring the natural
as a replacement for a volume control (Dillon, 1996, dynamic range of all sounds to provide the impaired lis-
2001; Moore et al., 1985; Moore, 1987). tener with all the speech cues in a more natural way. Soft
A key advocate of compression was Ed Villchur, who sounds for normals should be soft for the impaired ear,
critically recognized the importance of Steinberg and and loud sounds should be loud. According to this view,
Gardner’s observations on recruitment as a loss of com- loudness is used as an index of audibility, and complex
pression. He vigorously promoted the idea of compres- arguments about JNDs, speech discrimination, and
sion amplification hearing aids. Personally supporting modulation transfer functions just confound the issue.
the cost of the research with dollars from his very This view is supported by the theory that OHCs com-
successful loudspeaker business, he contracted David press the IHC signals.
Blackmer of dbx to produce a multiband compression Neither of these arguments deals with important and
hearing aid for experimental purposes. Using his experi- complex issues such as changing of the critical band with
mental multiband compression hearing aid, Villchur hearing loss, or the temporal dynamics of the compres-
experimented on hearing-impaired individuals, and sion system. Analysis of these important details is inter-
found that Steinberg and Gardner’s observations and esting only after the signals are placed in the audible
predictions were correct (Villchur, 1973, 1974). Villchur range.
clearly articulated the point that a well-fitted compres-
sion hearing aid improved the dynamic range of audi- Summary
bility and that what counted, in the end, was audibility.
In other words, ‘‘If you can’t hear it, you can’t under- This article has reviewed the early research on loudness,
stand it.’’ This had a certain logical appeal. loudness recruitment, and masking, which are relevant
Fred Waldhauer, a Bell Labs analog circuit designer to compression hearing aid development. The outer hair
of some considerable ability, heard Villchur speak about cell is damaged in sensorineural hearing loss, and this
his experiments on multiband compression. After the causes the cochlea to have reduced dynamic range.
breakup of the Bell System in 1983, Waldhauer pro- When properly designed and fitted, WDRC has
posed to AT&T management that Bell Labs design and proved to be the most e¤ective speech processing
strategy we can presently provide for sensorineural Dillon, H. (1996). Compression? Yes, but for low or high fre-
hearing loss compensation. It works because it supple- quencies, for low or high intensities, and with what response
ments the OHC compressors, which are damaged with times? Ear and Hearing, 17, 287–307.
sensorineural hearing loss. Dillon, H. (2001). Hearing aids. Sydney, Australia: Boomerang
Press.
Fahey, P. F., and Allen, J. B. (1985). Nonlinear phenomena as
Acknowledgments observed in the ear canal, and at the auditory nerve. Journal
I would especially like to thank one anonymous review, of the Acoustical Society of America, 77, 599–612.
Harvey Dillon, Brent Edwards, Ray Kent, Mead Kill- Fletcher, H. (1923a). Physical measurements of audition and
their bearing on the theory of hearing. Journal of the
ion, Harry Levitt, Ryuji Suzuki, and Ed Villchur.
Franklin Institute, 196, 289–326.
—Jont B. Allen Fletcher, H. (1923b). Physical measurements of audition and
their bearing on the theory of hearing. Bell System Tech-
References nology Journal, 2, 145–180.
Fletcher, H. (1929). Speech and hearing. New York: Van Nos-
Allen, J. (1999a). Derecruitment by multiband compression in trand.
hearing aids. In C. Berlin (Ed.), The e¤erent auditory system Fletcher, H. (1950). A method of calculating hearing loss for
(chap. 4, pp. 73–86). San Diego, CA: Singular Publishing speech from an audiogram. Journal of the Acoustical Soci-
Group. ety of America, 22, 1–5.
Allen, J. (1999b). Psychoacoustics. In J. Webster (Ed.), Wiley Fletcher, H. (1953). Speech and hearing in communication.
encyclopedia of electrical and electronics engineering (vol. Huntington, NY: Krieger.
17, pp. 422–437). New York: Wiley. Fletcher, H., and Munson, W. (1933). Loudness, its definition,
Allen, J., and Neely, S. (1992). Micromechanical models of the measurement, and calculation. Journal of the Acoustical
cochlea. Physics Today, 45(7), 40–47. Society of America, 5, 82–108.
Allen, J. B. (1980). Cochlear micromechanics: A physical Fletcher, H., and Steinberg, J. (1924). The dependence of the
model of transduction. Journal of the Acoustical Society of loudness of a complex sound upon the energy in the various
America, 68, 1660–1670. frequency regions of the sound. Physical Review, 24, 306–
Allen, J. B. (1991). Modeling the noise damaged cochlea. In P. 317.
Dallos, C. D. Geisler, J. W. Matthews, M. A. Ruggero, and Fletcher, H., and Wegel, R. (1922a). The frequency-sensitivity
C. R. Steele (Eds.), The mechanics and biophysics of hearing of normal ears. Proceedings of the National Academy of
(pp. 324–332). New York: Springer-Verlag. Science, 8(1), 5–6.
Allen, J. B. (1996). Harvey Fletcher’s role in the creation of Fletcher, H., and Wegel, R. (1922b). The frequency-sensitivity
communication acoustics. Journal of the Acoustical Society of normal ears. Physical Review, 19, 553–565.
of America, 99, 1825–1839. Fowler, E. (1936). A method for the early detection of oto-
Allen, J. B. (1997a). DeRecruitment by multiband compression sclerosis. Archives of Otolaryngology, 24, 731–741.
in hearing aids. In W. Jesteadt et al. (Eds.), Modeling sen- Fowler, E. (1942). A simple method of measuring percentage
sorineural hearing loss (pp. 99–112). Mahwah, NJ: Erl- of capacity for hearing speech. Archives of Otolaryngology,
baum. 36, 874–890.
Allen, J. B. (1997b). OHCs shift the excitation pattern via French, N., and Steinberg, J. (1947). Factors governing the in-
BM tension. In E. Lewis, G. Long, R. Lyon, P. Narins, telligibility of speech sounds. Journal of the Acoustical So-
C. Steele, and E. Hecht-Poinar (Eds.), Diversity in auditory ciety of America, 19, 90–119.
mechanics (pp. 167–175). Singapore: World Scientific Gardner, M. (1994). Personal communication.
Press. He, D., and Dallos, P. (2000). Properties of voltage-dependent
Allen, J. B. (1997c). A short history of telephone psychophy- somatic sti¤ness of cochlear outer hair cells. Journal of the
sics. Journal of the Audiologic Engineering Society, Reprint Association for Research in Otolaryngology, 1, 64–81.
4636, pp. 1–37. Hickson, L. (1994). Compression amplification in hearing aids.
Allen, J. B., Hall, J. L., and Jeng, P. S. (1990). Loudness American Journal of Audiology, 11, 51–65.
growth in 1/2-octave bands (LGOB): A procedure for the Killion, M. (1996a). Compression: Distinctions. Hearing Re-
assessment of loudness. Journal of the Acoustical Society of view, 3(8), 29.
America, 88, 745–753. Killion, M. (1996b). Talking hair cells: What they have to say
Allen, J. B., Shaw, G., and Kimberley, B. P. (1995). Charac- about hearing aids. In C. Berlin (Ed.), Hair cells and hear-
terization of the nonlinear ear canal impedance at low ing aids. San Diego, CA: Singular Publishing Group.
sound levels. ARO, 18, 190 (abstr. 757). Killion, M., Staab, W., and Preves, D. (1990). Classifying auto-
Braida, L., Durlach, N., Lippmann, R., Hicks, B., Rabinowitz, matic signal processors. Hearing Instruments, 41(8), 24–26.
W., and Reed, C. (1979). Hearing aids: A review of past Kim, D., Neely, S., Molnar, C., and Matthews, J. (1980). An
research on linear amplification, amplitude compression, active cochlear model with negative damping in the coch-
and frequency lowering. American Speech and Hearing As- lear partition: Comparison with Rhode’s ante- and post-
sociation, Monograph, 19. mortem results. In G. van den Brink and F. Bilsen (Eds.),
Buus, S., Musch, H., and Florentine, M. (1998). On loudness Psychological, physiological and behavioral studies in hearing
at threshold. Journal of the Acoustical Society of America, (pp. 7–14). Delft, The Netherlands: Delft University Press.
104, 399–410. Kingsbury, B. (1927). A direct comparison of the loudness of
Carver, W. F. (1978). Loudness balance procedures. In J. Katz pure tones. Physical Review, 29, 588–600.
(Ed.), Handbook of clinical audiology (2nd ed., chap. 15, pp. Kros, C., and Crawford, A. (1990). Potassium currents in inner
164–178). Baltimore: Williams and Wilkins. hair cells isolated from the guinea-pig cochlea. Journal of
Delgutte, B. (1990). Two-tone suppression in auditory-nerve Physiology, 421, 263–291.
fibres: Dependence on suppressor frequency and level. Liberman, M., and Dodds, L. (1984). Single neuron labeling
Hearing Research, 49, 225–246. and chronic cochlear pathology: III. Stereocilia damage and
Assessment of and Intervention with Children Who Are Deaf or Hard of Hearing 421
alterations of threshold tuning curves. Hearing Research, Villchur, E. (1974). Simulation of the e¤ect of recruitment on
16, 55–74. loudness relationships in speech. Journal of the Acoustical
Liberman, M., and Kiang, N. (1978). Acoustic trauma in cats. Society of America, 56, 1601–1611.
Acta Otolaryngologica, Supplement, 358, 1–63. Villchur, E. (1989). Comments on: The negative e¤ect of am-
Lippmann, R., Braida, L., and Durlach, N. (1981). Study of plitude compression on multichannel hearing aids in the
multichannel amplitude compression and linear amplifica- light of the modulation transfer function. Journal of the
tion for persons with sensorineural hearing loss. Journal of Acoustical Society of America, 86, 425–427.
the Acoustical Society of America, 69, 524–534. Wegel, R., and Lane, C. (1924). The auditory masking of one
Lorente de No, R. (1937). The diagnosis of diseases of the pure tone by another and its probable relation to the
neural mechanism of hearing by the aid of sounds well dynamics of the inner ear. Physical Review, 23, 266–285.
above threshold. Transactions of the American Otological
Society, 27, 219–220.
Martin, F. N. (1986). Introduction to audiology (3rd ed.). En-
glewood Cli¤s, NJ: Prentice-Hall.
McFadden, D. (1986). The curious half-octave shift: Evidence Assessment of and Intervention with
of a basalward migration of the traveling-wave envelope Children Who Are Deaf or Hard of
with increasing intensity. In R. Salvi, D. Henderson, R.
Hamernik, and V. Coletti (Eds.), Applied and basic aspects Hearing
of noise-induced hearing loss (pp. 295–312). New York:
Plenum Press.
Moore, B. (1987). Design and evaluation of a two-channel The purpose of communication assessment of children
compression hearing aid. Journal of Rehabilitation Research with educationally significant hearing loss di¤ers from
and Development, 24, 181–192. the purpose of assessing children with language or
Moore, B., Laurence, R., and Wright, D. (1985). Improve- learning disabilities. Since the diagnosis of a hearing
ments in speech intelligibility in quiet and in noise produced disability has already been made, the primary goal of
by two-channel compression hearing aids. British Journal of communication assessment is to determine the impact of
Audiology, 19, 175–187. the hearing loss on language, speech, auditory skills, or
Mueller, H., and Killion, M. (1996). Available: http://
cognitive, social-emotional, educational and vocational
www.compression.edu. Hearing Journal, 49, 44–46.
Munson, W. A., and Gardner, M. B. (1950). Loudness pat- development, not to diagnose a disability. It is critical to
terns: A new approach. Journal of the Acoustical Society of determine the rate of language and communication de-
America, 22, 177–190. velopment and to identify strategies that will be most
Neely, S. T., and Allen, J. B. (1997). Relation between the rate beneficial for optimal development.
of growth of loudness and the intensity DL. In W. Jesteadt Plateaus in language development at the 9–10-year
et al. (Eds.), Modeling sensorineural hearing loss (pp. 213– age level, in reading development at the middle third
222). Mahwah, NJ: Erlbaum. grade to fourth grade level (Holt, 1993), and in speech
Pickles, J. O. (1982). An introduction to the physiology of hear- intelligibility at about 10 years (Jensema, Karchmer,
ing. London: Academic Press. and Trybus, 1978) have been reported in the literature.
Plomp, R. (1988). The negative e¤ect of amplitude com-
pression in multichannel hearing aids in the light of the
The language plateaus appear to be the result of devel-
modulation-transfer function. Journal of the Acoustical So- opmental growth, which ranges from 43%–53% for
ciety of America, 83, 2322–2327. children with profound hearing loss using hearing aids
Plomp, R. (1994). Noise, amplification, and compression: (Boothroyd, Geers, and Moog, 1991; Geers and Moog,
Considerations of three main issues in hearing aid design. 1988) to 60%–65% of the normal range of development
Ear and Hearing, 15, 2–12. for children with severe loss using hearing aids (Booth-
Rhode, W. (1971). Observations of the vibration of the basilar royd, Geers, and Moog, 1991) and for children with
membrane in squirrel monkeys using the Mossbauer tech- profound hearing loss using cochlear implants (Blamey
nique. Journal of the Acoustical Society of America, 64, et al., 2001). In contrast, in a study of 150 children,
158–176. Yoshinaga-Itano et al. (1998) reported that children with
Riesz, R. (1928). Di¤erential intensity sensitivity of the ear for
pure tones. Physical Review, 31, 867–875.
hearing loss only who were early-identified (within the
Ruggero, M., and Rich, N. (1991). Furosemide alters organ of first 6 months of life) had mean language levels at 90%
Corti mechanics: Evidence for feedback of outer hair cells of the rate of normal language development through the
upon basilar membrane. Journal of Neuroscience, 11, 1057– first 3 years of life. A study of children in Nebraska
1067. (Moeller, 2000) reported similar levels of language de-
Sachs, M., and Abbas, P. (1974). Rate versus level func- velopment (low-average range) for a sample of 5-year-
tions for auditory-nerve fibers in cats: Tone-burst stimuli. old children receiving early intervention services in the
Journal of the Acoustical Society of America, 56, 1835–1847. first 11 months of life. Later-identified children were able
Santos-Sacchi, J., and Dilger, J. P. (1987). Whole cell currents to achieve language development commensurate with
and mechanical responses of isolated outer hair cells. Hear- the early-identified/intervened group when their families
ing Research, 35, 143–150.
Steinberg, J., and Gardner, M. (1937). Dependence of hearing
were rated as ‘‘high parent involvement’’ in the inter-
impairment on sound intensity. Journal of the Acoustical vention services.
Society of America, 9, 11–23. With the advent of universal newborn hearing
Villchur, E. (1973). Signal processing to improve speech intel- screening, the population of children who are deaf or
ligibility in perceptive deafness. Journal of the Acoustical hard of hearing will change rapidly during the next
Society of America, 53, 1646–1657. decade. By 2001, 35 states had passed legislation to
establish universal newborn hearing screening programs, ciated with greater gains in expressive language (Mus-
five additional states had legislation in progress, and five selman and Churchill, 1991). At present, studies of
states had established programs without legislation. The causality have been insu‰cient to determine the e‰cacy
age at which hearing loss is identified should drop dra- or superiority of various intervention strategies. How-
matically throughout the United States, and this drop ever, some interventions that are theoretically grounded
should be accompanied by intervention services, begin- and are characteristic of programs that demonstrate
ning in the newborn period. In the state of Colorado, optimal language development are parent-centered,
infants referred from UNHS programs are being identi- provide objective developmental data, assist parental
fied with hearing loss at 6–8 weeks of age and enter into decisions about methodology, based on the devel-
intervention programs almost immediately thereafter. opmental progress of the individual child, include a
For the population of children identified with hearing strong counseling component aimed at reducing parental
loss within the first 2 months of life, baseline communi- stress and assisting parents in the resolution of their
cation assessments are typically conducted at 6-month grief, and provide guidance in parent-child interaction
intervals during the first 3 years of life (Stredler-Brown strategies.
and Yoshinaga-Itano, 1994; Yoshinaga-Itano, 1994). The language development of early-identified (within
Almost all of the infant assessment instruments are the first 6 months of life) children with hearing loss
parent questionnaires that address the development of is similar to their nonverbal development, particularly
receptive and expressive language (e.g., MacArthur in regard to symbolic play development (Snyder and
Communicative Development Inventories, Minnesota Yoshinaga-Itano, 1999; Yoshinaga-Itano and Snyder,
Child Development Inventory, Vineland Social Maturity 1999; Mayne et al., 2000). About 60% of the variance in
Scales), auditory skills, early vocalizations, cognitive, early language development is predicted by nonverbal
fine motor, gross motor, self-help, and personal-social/ cognitive measures such as symbolic play and age at
social-emotional issues. Videotaped analysis of parent- identification of hearing loss. Mode of communication,
child interaction style and spontaneous speech and lan- degree of hearing loss, socioeconomic status, ethnicity,
guage production is frequently included. and sex were not shown to predict language develop-
Spontaneous speech samples should be analyzed to ment. These results contrast sharply with the school-age
identify the number of di¤erent consonant phones. The literature, in which race, ethnicity, and socioeconomic
number of di¤erent consonant phones produced in a status are primary predictors of reading achievement
spontaneous 30-minute language sample taken in the (Holt, 1993).
home between 9 months and 50 months of age is a good In order to maintain the successful language develop-
predictor of speech intelligibility (Yoshinaga-Itano and ment of the early-identified children, the purpose of
Sedey, 2000). The primary development of speech for evaluation in the first 5 years of life should be to monitor
children with mild to moderate hearing loss is con- and chart the longitudinal developmental progress of the
centrated between 2 and 3 years of age, while the pre- child, with two primary goals: (1) to achieve language
school years, ages 3–5 years, are a significant growth development commensurate with nonverbal cognitive
period for children with moderate to severe hearing loss. development, and (2) to achieve language development
Speech development for children with profound hearing in children with hearing loss only, within the normal
loss who use hearing aids is very slow in the first 5 years range of development. In an analysis of almost 250 chil-
of life. Although 75% of children with mild through se- dren, an 80% probability of language within the low-
vere hearing loss achieved intelligible speech by 5 years normal range in the first 5 years of life was reported if
of age, only 20% of children with profound hearing loss a child identified with hearing loss had been born in
who used conventional amplification achieved this level a Colorado hospital with a universal newborn hear-
by age 5. Level of expressive language and degree of ing screening program (Yoshinaga-Itano, Coulter, and
hearing loss were the two primary predictors of speech Thomson, 2000).
intelligibility. In addition to an analysis of communication skills,
Videotaped interactions of parent-child communica- cognitive development, and age at identification, as-
tion can be analyzed for maternal bonding and emo- sessments should include information about the social-
tional availability (Pressman et al., 1999), turn-taking emotional development of both parents and children.
(Musselman and Churchill, 1991), use of pause time, Relationships have been found between language devel-
maintenance of topic, topic initiation, attention-getting opment and parental stress (Pipp-Siegel, Sedey, and
devices, the development of symbolic play, symbolic Yoshinaga-Itano, 2002), emotional availability (Press-
gesture, and communication intention strategies (com- man et al., 1999) parent involvement (Moeller, 2000),
ments, requests, answers, commands) of both the parent grief resolution (Pipp-Siegel, 2000), development of
and the child (Yoshinaga-Itano, 1994). These analyses sense of self (Pressman, 2000), and mastery motivation
provide important information for the family and in- (Pipp-Siegel et al., 2002). The relationships examined in
tervention provider to help design strategies for opti- these studies do not establish causes, but it is plausible
mal development. Reciprocal relationships have been that intervention strategies focused on these areas may
reported. Parents adjust language as their child’s lan- enhance the language development of young children
guage improves (Cross, Johnson-Morris, and Nienhuys, with hearing loss. Counseling strategies with parent sign
1985), and low levels of maternal turn control are asso- language instruction enhanced language development
Assessment of and Intervention with Children Who Are Deaf or Hard of Hearing 423
among children using total communication (Greenberg, Boothroyd, A., Geers, A. E., and Moog, J. S. (1991). Practical
Calderon, and Kusche, 1984). implications of cochlear implants in children. Ear and
Assessment strategies during the preschool period, Hearing, 12, 81S–89S.
ages 3–5 years, continue to focus on receptive and ex- Cross, T. G., Johnson-Morris, J. E., and Nienhuys, T. G.
(1985). Linguistic feedback and maternal speech: Compar-
pressive vocabulary, but their primary focus shifts to the
isons of mothers addressing hearing and hearing-impaired
development of syntax and morphology and pragmatic children. First Language, 1, 163–189.
language skills (Yoshinaga-Itano, 1999). Spontaneous Geers, A. E., and Moog, J. S. (1988). Predicting long-
language sample analysis is recommended for expressive term benefits from single-channel cochlear implants in pro-
syntax analysis. There is a shift from parent question- foundly hearing-impaired children. American Journal of
naire developmental assessments and assessments of Otology, 9, 169–176.
spontaneous communication to clinical or school-based Gray, C. D., and Hosie, J. A. (1996). Deafness, story under-
elicited assessments at this age period. Testers need to standing, and theory of mind. Journal of Deaf Studies and
ensure full access to the information being presented, Deaf Education, 1, 217–233.
either through fully functioning amplification devices, Greenberg, M. T., Calderon, R., and Kusche, C. A. (1984).
Early intervention using simultaneous communication with
adequate speech reading accessibility, or the skills of a
deaf infants: The e¤ects on communicative development.
fluent signer. Child Development, 55, 607–616.
During the school-age period, standardized assess- Holt, J. (1993). Stanford Achievement Test–8th Edition:
ments consist of regularly administered tests of language Reading comprehension subgroup results. American Annals
(receptive and expressive vocabulary, syntax, prag- of the Deaf, 138, 172–175.
matics, and phonology), reading and writing, mathe- Jackson, D. W., Paul, P. V., and Smith, J. C. (1997). Prior
matics, other content areas, and social-emotional knowledge and reading comprehension ability of deaf ado-
development (Yoshinaga-Itano, 1997). Researchers hy- lescents. Journal of Deaf Education and Deaf Studies, 2,
pothesize that there may be as many as four possible 172–184.
routes to literacy for children with hearing loss: (1) Jensema, C. J., Karchmer, M. A., and Trybus, R. J. (1978).
The rated speech intelligibility of hearing-impaired children:
spoken language to printed language decoded to speech,
Basic relationships. Washington, DC: Gallaudet College
(2) English-based signs to printed English, (3) American O‰ce of Demographic Studies.
Sign Language (ASL) to print, with English-based Mayne, A. M., Yoshinaga-Itano, C., Sedey, A. L., and Carey,
signs as an intermediary, and (4) ASL to print (Mussel- A. (2000). Expressive vocabulary development of infants
man, 2000). Assessments of knowledge of English se- and toddlers who are deaf or hard of hearing. Volta Review,
mantics, syntax, and phonological processing, accuracy 100(5), 1–28.
and speed of word identification, and orthographic Moeller, M. P. (2000). Early intervention and language devel-
encoding should be included, since several studies have opment in children who are deaf and hard of hearing. Pe-
found that these variables are significantly related to diatrics, 106, E43.
reading comprehension (Musselman, 2000). Among Musselman, C. (2000). How do children who can’t hear learn
to read an alphabetic script? A review of the literature on
children who use sign language, finger-spelling ability
reading and deafness. Journal of Deaf Studies and Deaf Ed-
and general language competence in either ASL or En- ucation, 5, 9–31.
glish should be included (Musselman, 2000). For all Musselman, C., and Churchill, A. (1991). A comparison of the
children, assessments should also focus on the meta- interaction between mothers and deaf children in auditory/
linguistic and metacognitive strategies (knowing how oral and total communication pairs. American Annals of the
to use and think about language) used by the students Deaf, 136, 5–16.
in person-to-person and written communication (Gray Paul, P. (1996). Reading vocabulary knowledge and deaf-
and Hosie, 1996). ‘‘Theory of mind’’ assessments ness. Journal of Deaf Studies and Deaf Education, 1, 3–
provide information about the cognitive ability of the 15.
student to understand a variety of di¤erent perspectives Pipp-Siegel, S. (2000). Resolution of grief of parents with young
children with hearing loss. Unpublished manuscript, De-
(Strassman, 1997). Students need to develop strategies
partment of Speech, Language, and Hearing Sciences. Uni-
to acquire and elaborate world knowledge, elaborate versity of Colorado, Boulder.
vocabulary knowledge (both conversational and writ- Pipp-Siegel, S., Sedey, A., Mayne, A., and Yoshinaga-Itano, C.
ten), and to use this knowledge to make inferences (in press). Mastery motivation predicts expressive language
in social, communicative interactions and reading/ in young children with hearing loss. Journal of Deaf Educa-
academic situations (Paul, 1996; Jackson, Paul, and tion and Deaf Studies.
Smith, 1997). Pipp-Siegel, S., Sedey, A. L., and Yoshinaga-Itano, C. (2002).
Predictors of parental stress in mothers of young children
—Christine Yoshinaga-Itano with hearing loss. Journal of Deaf Studies and Deaf Educa-
tion, 7, 1–17.
References Pressman, L. (2000). Early self-development in children with
hearing loss. Unpublished doctoral dissertation, Depart-
Blamey, P. J., Sarant, J. Z., Paatsch, L. E., Barry, J. G., Bow, ment of Speech, Language, and Hearing Sciences. Univer-
C. P., Wales, R. J., et al. (2001). Relationships among sity of Colorado, Boulder.
speech perception, production, language, hearing loss, and Pressman, L., Pipp-Siegel, S., Yoshinaga-Itano, C., and Deas,
age in children with impaired hearing. Journal of Speech, A. (1999). The relation of sensitivity to child expressive
Language, and Hearing Research, 44, 264–285. language gain in deaf and hard-of-hearing children whose
caregivers are hearing. Journal of Deaf Studies and Deaf shape, although sensitivity continues to improve into
Education, 4, 294–304. childhood. Several behavioral tests are used to assess
Snyder, L., and Yoshinaga-Itano, C. (1999). Specific play infant hearing, including conditioned head turning by
behaviors and the development of communication in chil- infants older than about 6 months and observation of
dren with hearing loss. Volta Review, 100, 165–185.
subtle behavioral responses to sounds in younger infants.
Strassman, B. K. (1997). Metacognition and reading in chil-
dren who are deaf: A review of the research. Journal of Deaf It is of clinical significance that for infants younger than
Studies and Deaf Education, 2, 140–149. 4–6 months, behavioral methods lack reliability for in-
Stredler-Brown, A., and Yoshinaga-Itano, C. (1994). dividual assessment (Bourland, Tharpe, and Ashmead,
F.A.M.I.L.Y. assessment: A multidisciplinary evaluation 2000). Infant-adult di¤erences reflect true sensory di¤er-
tool. In J. Roush and N. Matkin (Eds.), Infants and toddlers ences as well as nonsensory factors such as attention, but
with hearing loss (pp. 133–161). Baltimore: York Press. at least in older infants only about 4 dB of the infant-
Yoshinaga-Itano, C. (1994). Language assessment of infants adult di¤erence is attributable to nonsensory factors
and toddlers with significant hearing losses. Seminars in (Nozza and Henson, 1999). Electrophysiological mea-
Hearing, 15, 128–147. sures (primarily the auditory brainstem response, ABR)
Yoshinaga-Itano, C. (1997). The challenge of assessing lan-
reflect nearly adultlike hearing sensitivity from early in-
guage in children with hearing loss. Speech, Language, and
Hearing Services in Schools, 28, 362–373. fancy (Werner, Folsom, and Mancl, 1994). Although
Yoshinaga-Itano, C. (1999). Assessment and Intervention of electrophysiological measures are less susceptible than
preschool-aged deaf and hard-of-hearing children. In J. behavioral methods to inattention and o¤-task behavior,
Alpiner and P. McCarthy (Eds.), Rehabilitative audiology: another reason for the discrepancy between methods is
Children and adults (pp. 140–177). Baltimore: Williams and that the ABR reflects only peripheral processing. That is,
Wilkins. in early infancy, auditory sensitivity may be constrained
Yoshinaga-Itano, C., Coulter, D., and Thomson, V. (2000). by neural immaturity central to the processes measured
The Colorado Newborn Hearing Screening Project: E¤ects by the ABR.
on speech and language development for children with
hearing loss. Journal of Perinatology, 20, S132–S137.
Yoshinaga-Itano, C., and Sedey, A. (2000). Early speech de- Frequency Resolution
velopment in children who are deaf or hard of hearing:
Interrelationships with language and hearing. Volta Review, Like absolute sensitivity, frequency resolution ap-
100, 181–212. proaches adult values by 6 months after birth. The
Yoshinaga-Itano, C., Sedey, A., Coulter, D., and Mehl, A. segregation of auditory processing by frequency is dem-
(1998). Language of early- and later-identified children with onstrated by the enhanced e¤ectiveness of masking
hearing loss. Pediatrics, 102, 1161–1171. sounds that are closer in frequency to the signal. Infants’
Yoshinaga-Itano, C., and Snyder, L. (1999). The relationship cochlear frequency resolution has been studied via sup-
of language and symbolic play in deaf and hard-of-hearing pression tuning curves from otoacoustic emissions. Such
children. Volta Review, 100, 135–164. frequency resolution is adultlike at full-term birth (Bar-
gones and Burns, 1988; Abdala, 1998). Beyond the coch-
lea, resolution for higher frequencies (ca. 4 kHz) is not
Audition in Children, Development of finely tuned for the first 3 months after birth, but by 6
months resolution is adultlike, as seen in behavioral and
ABR masking studies (Spetner and Olsho, 1990; Abdala
Research on auditory development intensified in the
and Folsom, 1995). Some studies indicate improvement
mid-1970s, expanding our understanding of auditory
processes in infants and children. Recent comprehensive in frequency resolution up to age 4 years, but Hall and
Grose (1991) showed that when certain methodological
reviews of this literature include those by Aslin, Jusczyk,
issues are taken into account, children have adultlike
and Pisoni (1998), Werner and Marean (1996), and
frequency resolution by 4 years. Despite the early devel-
Werner and Rubel (1992). Two recurrent issues are
opment of frequency resolution, selective attention based
whether the limitations of testing methods cause true
on frequency is not well developed even by 9 months
abilities to be underestimated, and whether auditory de-
(Bargones and Werner, 1994).
velopment is substantially complete during infancy or
continues into childhood.
Temporal Processing
Absolute Sensitivity
There may be a more protracted course for temporal
Auditory sensitivity is within about 10–15 dB of adult processing than for some other aspects of auditory de-
values by 6 months after birth, but behavioral and elec- velopment. Adults detect acoustic gaps of 2–5 ms,
trophysiological methods show di¤erent trends during whereas infants require gaps up to ten times longer
early infancy. Behavioral thresholds at 1 month are ap- (Werner et al., 1992). However, infant gap detection
proximately 40 dB above adult values, with substantial may be adult-like when tested in conditions that mini-
improvement through 6 months and gradual improve- mize adaptation e¤ects (Trehub, Schneider, and Hender-
ment through the early school years (Schneider et al., son, 1995; Trainor et al., 2001). Gap detection improves
1986; Olsho et al., 1988). By 6 months the audibility through 5–10 years (Wightman et al., 1989). Hall and
curve (threshold plotted against frequency) is adultlike in Grose (1994) found that older children and adults had
Audition in Children, Development of 425
similar time constants for detection of amplitude modu- Speech Perception

lation, but that preschool-aged children needed larger
As early as the 1960s, investigators proposed that speech
modulations. Discrimination between small di¤erences
was processed by a separate perceptual module of a
in duration improves during infancy and into middle
larger language system unique to humans. Although
childhood (Jensen and Ne¤, 1993). The coarseness of
speech is still considered a special signal, the existence
temporal acuity in infants and children is surprising,
of an innate and specialized speech-processing mode
since typically developing children do well at linguistic
processing. But individual di¤erences in infants’ tempo- unique to humans remains in question (Aslin, Jusczyk,
and Pisoni, 1998). Since the initial studies of infant
ral resolution are associated with language development
speech perception (Eimas et al., 1971), many inves-
at 2 years (Trehub and Henderson, 1996). Besides tem-
poral acuity, another consideration is how sound is tigators have shown that mechanisms used to perceive
speech are in place long before infants utter their first
integrated across time, reflected by improvement of de-
words at around 12 months of age. For example, infants
tection thresholds as a sound remains on longer. Adult
can discriminate between most English speech sound
detection improves as the signal extends up to ¼ s,
pairs much as adults do (Kuhl, 1987), based on param-
whereas infants have steeper improvement curves ex-
eters such as voice onset time, manner, and place of ar-
tending to longer durations (Berg and Boswell, 1995). A
ticulation. Despite infants’ ability to categorize speech
related finding is that forward masking operates over
sounds, the boundaries between categories continue to
longer masker-signal intervals in 3-month-olds than in 6-
sharpen into early childhood (Nittrouer and Studdert-
month-olds or adults, implying that a temporal aspect of
Kennedy, 1987). Also, children age 3–4 years are not as
masking is mature by about 6 months (Werner, 1999).
adept as older children and adults at perceiving speech
sounds from brief-onset information (Ohde and Haley,
Intensity Resolution and Loudness 1997). Infants also can generalize phoneme recognition
across di¤erent talkers, fundamental frequencies, and
Early studies of intensity discrimination showed that
positions within a syllable (Eilers, Wilson, and Moore,
neonates and young infants detect large intensity
1977; Miller, Younger, and Morse, 1982). The ability to
changes. Sinnott and Aslin (1985) reported intensity dif-
categorize vowels, also known as equivalence classes,
ference limens for 7- to 9-month-olds of 3–12 dB, com-
has been demonstrated in infants as young as 2 months
pared to 1.8 dB for adults. Other studies indicate that
of age (Marean, Werner, and Kuhl, 1992). Changes in
between birth and 12 months, the intensity di¤erence
speech perception as a result of postnatal experience oc-
limen ranges from 2 to 5 dB (Bull, Eilers, and Oller,
1984; Tarquinio, Zelazo, and Weiss, 1990). Although cur within the first year of life, as shown by selective
perception of speech sounds from the native language
infants are good at discerning changes in intensity, this
(see Werner and Marean, 1996, chap. 6). As early per-
ability may not be adultlike until 6 years of age (Jensen
and Ne¤, 1993). Research on loudness perception by ception of linguistic input is so remarkable, any obstacle
to the receipt of audible and clear speech sounds by an
infants has not been reported, but there has been some
infant, such as hearing loss, should be suspected of ad-
work with children. Studies using magnitude estimation
versely a¤ecting language development.
and cross-modality matching have demonstrated that
4- to 7-year-olds described the growth of loudness of a
tone similarly to adults (Collins and Gescheider, 1989).
Spatial Hearing
Serpanos and Gravel (2000), also using cross-modality Sound localization is rather crude in early infancy,
matching paradigms, reported that loudness growth improving nearly to adult levels by 1 year and slowly
functions of children 4–12 years of age were similar to thereafter. Newborns turn their heads toward sound
those of adults. sources, but this response wanes, reappearing in brisk
form around 4 months (Clifton, 1992). This trend, along
with the onset of response to the precedence e¤ect (re-
Pitch and Music Perception lated to reflected sounds) at 4 months, suggests an
Research on pitch and music perception suggests that by increasing role of the auditory cortex in sound local-
late in the first year of life, infants extract relational ization. The low responsivity to sounds from birth to
information across frequencies. At 7 months, infants 4 months makes it di‰cult to assess hearing ability
categorize octave tonal complexes by fundamental fre- behaviorally in this age range, as noted above in the
quency, even if the fundamental is missing and cochlear discussion of auditory sensitivity. Precision of sound lo-
distortions are ruled out (Clarkson and Clifton, 1995; calization, as measured by the minimum audible angle,
Montgomery and Clarkson, 1997). A substantial litera- improves from 20 –25 at 4 months to 8 –10 at 12
ture on music perception in infants shows sensitivity to months, with further gradual changes to adultlike values
melodic contours, even when transposed across octaves of 1 –3 by 5 years (Ashmead, Clifton, and Perris, 1987;
(Trehub, 2001). Although findings on pitch and music Morrongiello, Fenwick, and Chance, 1990). This im-
suggest integrative processing, other work in nonmusical provement may entail integration across several sound
contexts indicates that infants are better able to discrim- localization cues, since sensitivity to single cues is better
inate absolute than relative spectral di¤erences (Sa¤ran than predicted from localization (Ashmead et al., 1991).
and Griepentrog, 2001). A phenomenon related to spatial hearing is masking
level di¤erences, which are smaller in children less than 7 Grose, J. H., Hall, J. W. III, and Dev, M. B. (1997). MLD
years old than in older children or adults (Grose, Hall, in children: E¤ects of signal and masker bandwidths. Jour-
and Dev, 1997). Regarding distance, 6-month-olds nal of Speech, Language, and Hearing Research, 40, 955–
distinguish between sounds within reach versus those 959.
Hall, J. W. III, and Grose, J. H. (1991). Notched-noise mea-
beyond reach, even if sound pressure level is removed as
sures of frequency selectivity in adults and children using
a distance cue (Litovsky and Clifton, 1992). fixed-masker-level and fixed-signal-level presentation. Jour-
See also physiological bases of hearing; pediatric nal of Speech and Hearing Research, 34, 651–660.
audiology: the test battery approach; hearing loss Hall, J. W. III, and Grose, J. H. (1994). Development of tem-
screening: the school-age child. poral resolution in children as measured by the temporal
modulation transfer function. Journal of the Acoustical So-
—Daniel H. Ashmead and Anne Marie Tharpe ciety of America, 96, 150–154.
Jensen, J. K., and Ne¤, D. L. (1993). Development of basic
References auditory discrimination in preschool children. Psychological
Abdala, C. (1998). A developmental study of distortion prod- Science, 4, 104–107.
uct otoacoustic emission (2f1 –f2 ) suppression in humans. Kuhl, P. K. (1987). Perception of speech and sound in early
Hearing Research, 121, 125–138. infancy. In P. Salapatek and L. Cohen (Eds.), Handbook of
Abdala, C., and Folsom, R. C. (1995). The development of infant perception (pp. 275–381). New York: Academic
frequency resolution in humans as revealed by the auditory Press.
brain-stem response recorded with notched-noise masking. Litovsky, R. Y., and Clifton, R. K. (1992). Use of sound-
Journal of the Acoustical Society of America, 98, 921–930. pressure level in auditory distance discrimination by 6-
Ashmead, D. H., Clifton, R. K., and Perris, E. E. (1987). Pre- month-old infants and adults. Journal of the Acoustical So-
cision of auditory localization in human infants. Devel- ciety of America, 92, 794–802.
opmental Psychology, 23, 641–657. Marean, G. C., Werner, L. A., and Kuhl, P. K. (1992). Vowel
Ashmead, D. H., Davis, D. L., Whalen, T. A., and Odom, categorization by very young infants. Developmental Psy-
R. D. (1991). Sound localization and interaural time dis- chology, 28, 396–405.
crimination in human infants. Child Development, 62, 1211– Miller, C. L., Younger, B. A., and Morse, P. A. (1982). The
1226. categorization of male and female voices in infancy. Infant
Aslin, R. N., Jusczyk, P. W., and Pisoni, D. B. (1998). Speech Behavior and Development, 5, 143–159.
and auditory processing during infancy: Constraints on and Montgomery, C. R., and Clarkson, M. G. (1997). Infants’
precursors to language. In W. Damon, D. Kuhn, and R. S. pitch perception: Masking by low- and high-frequency
Siegler (Eds.), Handbook of child psychology (5th ed.), Vol. noises. Journal of the Acoustical Society of America, 102,
2: Cognition, perception, and language (pp. 147–198). New 3665–3672.
York: Wiley. Morrongiello, B. A., Fenwick, K. D., and Chance, G. (1990).
Bargones, J. Y., and Burns, E. M. (1988). Suppression tuning Sound localization acuity in very young infants: An
curves for spontaneous otoacoustic emissions in infants and observer-based testing procedure. Developmental Psychol-
adults. Journal of the Acoustical Society of America, 83, ogy, 26, 75–84.
1809–1816. Nittrouer, S., and Studdert-Kennedy, M. (1987). The role of
Bargones, J. Y., and Werner, L. A. (1994). Adults listen selec- coarticulatory e¤ects in the perception of fricatives by chil-
tively; infants do not. Psychological Science, 5, 170–174. dren and adults. Journal of Speech and Hearing Research,
Berg, K. M., and Boswell, A. E. (1995). Temporal summation 30, 319–329.
of 500-Hz tones and octave-band noise bursts in infants and Nozza, R. J., and Henson, A. M. (1999). Unmasked thresholds
adults. Perception and Psychophysics, 57, 183–190. and minimum masking in infants and adults: Separating
Bourland, C., Tharpe, A. M., and Ashmead, D. H. (2000). sensory from nonsensory contributions to infant-adult
Behavioral auditory assessment of young infants: Method- di¤erences in behavioral thresholds. Ear and Hearing, 20,
ological limitations or natural lack of auditory responsive- 483–496.
ness? American Journal of Audiology, 9, 124–130. Ohde, R. N., and Haley, K. L. (1997). Stop-consonant and
Bull, D., Eilers, R. E., and Oller, D. K. (1984). Infants’ dis- vowel perception in 3- and 4-year-old children. Journal of
crimination of intensity variation in multisyllable stimuli. the Acoustical Society of America, 102, 3711–3722.
Journal of the Acoustical Society of America, 76, 13–17. Olsho, L. W., Koch, E. G., Carter, E. A., Halpin, C. F., and
Clarkson, M. G., and Clifton, R. K. (1995). Infants’ pitch Spetner, N. B. (1988). Pure-tone sensitivity of human
perception: Inharmonic tonal complexes. Journal of the infants. Journal of the Acoustical Society of America, 84,
Acoustical Society of America, 98, 1372–1379. 1316–1324.
Clifton, R. K. (1992). The development of spatial hearing in Sa¤ran, J. R., and Griepentrog, G. J. (2001). Absolute pitch in
human infants. In L. A. Werner and E. W. Rubel (Eds.), infant auditory learning: Evidence for developmental reor-
Developmental psychoacoustics (pp. 135–157). Washington, ganization. Developmental Psychology, 37, 74–85.
DC: American Psychological Association. Schneider, B. A., Trehub, S. E., Morrongiello, B. A., and
Collins, A. A., and Gescheider, G. A. (1989). The measure- Thorpe, L. A. (1986). Auditory sensitivity in preschool
ment of loudness in individual children and adults by abso- children. Journal of the Acoustical Society of America, 79,
lute magnitude estimation and cross-modality matching. 447–452.
Journal of the Acoustical Society of America, 85, 2012–2021. Serpanos, Y. C., and Gravel, J. S. (2000). Assessing growth of
Eilers, R. E., Wilson, W. R., and Moore, J. M. (1977). Devel- loudness in children by cross-modality matching. Journal of
opmental changes in speech discrimination in infants. Jour- the American Academy of Audiology, 11, 190–202.
nal of Speech and Hearing Research, 20, 766–780. Sinnott, J. M., and Aslin, R. N. (1985). Frequency and inten-
Eimas, P., Siqueland, E., Jusczyk, P., and Vigorito, J. (1971). sity discrimination in human infants and adults. Journal of
Speech perception in infants. Science, 171, 303–306. the Acoustical Society of America, 78, 1986–1992.
Auditory Brainstem Implant 427
Spetner, N. B., and Olsho, L. W. (1990). Auditory frequency

resolution in human infancy. Child Development, 61, 632–
652.
Tarquinio, N., Zelazo, P. R., and Weiss, M. J. (1990). Recov-
ery of neonatal head turning to decreased sound pressure
level. Developmental Psychology, 26, 752–758.
Trainor, L. J., Samuel, S. S., Desjardins, R. N., and Sonnadara,
R. R. (2001). Measuring temporal resolution in infants
using mismatch negativity. Neuroreport, 12, 2443–2448.
Trehub, S. E. (2001). Musical predispositions in infancy.
Annals of the New York Academy of Sciences, 930, 1–16.
Trehub, S. E., and Henderson, J. L. (1996). Temporal res-
olution in infancy and subsequent language development.
Trehub, S. E., Schneider, B. A., and Henderson, J. L. (1995).
Gap detection in infants, children, and adults. Journal of the
Werner, L. A. (1999). Forward masking among infant and Figure 1. Overview of the ABI device and placement.
adult listeners. Journal of the Acoustical Society of America,
105, 2445–2453.
Werner, L. A., Folsom, R. C., and Mancl, L. R. (1994). The The success of that early attempt led to the develop-
relationship between auditory brainstem response latencies ment of a more sophisticated multichannel ABI device
and behavioral thresholds in normal hearing infants and (Brackmann et al., 1993; Shannon et al., 1993; Otto
adults. Hearing Research, 77, 88–98. et al., 1998, 2002). The first commercial multichannel
Werner, L. A., and Marean, G. C. (1996). Human auditory ABI was developed in a collaborative e¤ort between the
development. Boulder, CO: Westview Press. House Ear Institute, Cochlear Corporation, and the
Werner, L. A., Marean, G. C., Halpin, C. F., Spetner, N. B., Huntington Medical Research Institutes. The multi-
and Gillenwater, J. M. (1992). Infant auditory temporal
acuity: Gap detection. Child Development, 63, 260–272.
channel ABI was approved by the U.S. Food and Drug
Werner, L. A., and Rubel, E. W. (Eds.). (1992). Developmental Administration in October 2000.
psychoacoustics. Washington, DC: American Psychological Several commercial ABI devices are available, all ba-
Association. sically similar to the original device. ABIs are virtually
Wightman, F., Allen, P., Dolan, T., Kistler, D., and Jamieson, identical in design to cochlear implants except for the
D. (1989). Temporal resolution in children. Child Develop- electrode assembly (Fig. 1). The electrode assembly is a
ment, 60, 611–624. flat, paddle-like structure with platinum electrical con-
tacts along one side. The overall size of the assembly is
generally 2–3 mm 8 mm and is designed to fit within
Auditory Brainstem Implant the lateral recess of the IV ventricle. The electrical con-
tacts are 0.5–1.0 mm in diameter, which is su‰cient for
keeping the electrical charge density at the stimulated
People who are deafened by bilateral acoustic tumors are neurons within safe limits (Shannon, 1992). All ABI
not candidates for a cochlear implant because tumor re- devices have an external speech processor unit that con-
moval often severs the auditory nerve. The auditory tains a microphone to pick up the acoustic sound, a sig-
brainstem implant (ABI) is designed for those patients. nal processor to convert the acoustic sound to electrical
It is intended to bypass the auditory nerve and electri- signals, and a transmitter/receiver to send the signals
cally stimulate the human cochlear nucleus. across the skin to the implanted portion of the device.
Neurofibromatosis type 2 (NF2) is a genetic dis- The implanted unit decodes the received signal and pro-
order that causes multiple tumors of the cranial nerves duces controlled electrical stimulation of the electrodes.
and spinal cord, among other symptoms. The gene The most widely used ABI electrode array (Fig. 2;
causing NF2 has been located on chromosome 22 manufactured by Cochlear Corp.) consists of 21 plati-
(Rouleau et al., 1993; Trofatter et al., 1993). The defin- num disk contacts, each 700 mm in diameter. The con-
ing symptom of the disease is bilateral tumors originat- tacts are placed in three rows along a Silastic rubber
ing on the Schwann cells of the vestibular branch of carrier that is 8 mm 2.5 mm (Fig. 3).
nerve VIII (vestibular schwannomas). These tumors are Present ABI electrodes are designed to be placed
life-threatening, and their removal usually produces bi- within the lateral recess of the IV ventricle. Anatomical
lateral deafness. Patients with NF2 cannot benefit from studies (Moore and Osen, 1979) of the human cochlear
a cochlear implant because they have no auditory nerve nucleus complex and imaging studies of early ABI pa-
that can be stimulated from an intracochlear electrode. tients demonstrated that this location produced the most
In 1979 William House and William Hitselberger e¤ective auditory results and the fewest nonauditory side
attempted to provide auditory sensations for an NF2 e¤ects (Shannon et al., 1997; Otto et al., 1998, 2002).
patient by placing a single pair of electrodes in the Electrical stimulation in the human brainstem can po-
cochlear nucleus following tumor removal (Edgerton, tentially produce activation of many nonauditory struc-
House, and Hitzelberger, 1984; Eisenberg, et al., 1987). tures (cranial nerves VII, IX, X, and cerebellum, for
Figure 2. Implantable portion of the ABI, showing the receiver/

stimulator and electrode array.
Figure 3. Close-up view of the 21-electrode array, which con-
sists of 21 platinum disks mounted on a Silastic substrate. The
example). Fortunately, the human cochlear nucleus fabric mesh backing is intended to encourage fibrous ingrowth
complex almost completely surrounds the opening of the to fix the electrode array in position.
lateral recess of the IV ventricle, and the levels of current
delivered to the ABI are not large enough to activate
electrical current range. We estimate that ABI patients
other brainstem nuclei more than about 2 mm away may be able to discriminate only 10 amplitude steps
(Shannon, 1989, 1992; Shannon et al., 1997).
within their dynamic range, in contrast to 20–40 steps
Vestibular schwannomas can be visualized and re-
for cochlear implants and 200–250 steps for acoustic
moved via several surgical approaches, of which the ret- hearing. Some ABI patients have relatively small di¤er-
rosigmoid and translabyrinthine approaches are the
ences in pitch across their electrode array, while others
most common. The translabyrinthine approach allows
show a large change in pitch. In general, patients who
better visualization of the mouth of the lateral recess
do better at speech recognition tend to have a larger
following tumor removal and thus better access for
pitch range across their electrode array, but not all
placement of the ABI electrode array (Brackmann et al.,
patients who have a large pitch range have good speech
1993).
recognition.
Of the first 80 patients implanted with the multi-
Most ABI patients have some electrodes that cause
channel ABI at the House Ear Institute, 86% received
nonauditory side e¤ects. Almost all of these nonauditory
su‰cient auditory sensations that they could use the ABI
e¤ects are benign and produce tingling sensations along
in daily life. For most ABI users the primary benefit is as the body on the side ipsilateral to the ABI. Nonauditory
an aid to lipreading, since only a few ABI patients can
sensations are produced from stimulation of the cer-
understand words with the ABI without lipreading. For
ebellar flocculus (which causes a sensation of eye jitter)
most patients the present ABI device functions at a level and from antidromic activation of the cerebellar pe-
similar to that of single-channel cochlear implants, even
duncle. In patients with an intact facial nerve on the
when many electrodes are used in the ABI speech proc-
implanted side there is a chance of activation of the
essor. ABI patients are able to detect sound and are able
facial nerve, causing facial tingling and even motor acti-
to discriminate sounds based on coarse temporal prop-
vation. In most cases, electrodes that produce non-
erties (Shannon and Otto, 1990; Otto et al., 2002). On
auditory side e¤ects are simply turned o¤ and not
average, ABI patients receive a 30% improvement in
stimulated.
speech understanding when the sound from the ABI is One of the possible reasons for the limited success
added to lipreading alone (Fig. 4). A few ABI patients
of the ABI is that the present electrode is placed on
(10%) achieve significant word and sentence recognition
the surface of the cochlear nucleus. Unfortunately, the
with the device, and a few (4/80) can actually converse in tonotopic axis of the human cochlear nucleus is orthog-
a limited fashion on the telephone.
onal to the surface of the nucleus (Moore and Osen,
Most ABI patients perceive variations in amplitude
1979), and thus orthogonal to the axis of the electrode
and temporal cues but receive little, if any, spectral cues.
array as well. To obtain better access to the tonotopic
ABI patients are able to perceive changes in pitch with
dimension of the human cochlear nucleus requires pene-
changes in pulse rate only up to about 150 Hz, which is
trating electrodes (McCreery et al., 1998). A penetrating
about an octave lower than observed for cochlear im-
electrode ABI system is presently under development. Its
plant listeners and for temporal pitch discrimination for
initial trial is anticipated for 2002.
normal-hearing listeners. Typically, the dynamic range
of amplitude for the ABI is only 6 dB or less in terms of —Robert V. Shannon
Auditory Brainstem Response in Adults 429
Figure 4. Speech recognition results from

the first 55 multichannel ABI patients.
Lower part of each bar shows the percent
correct recognition of simple sentence
materials using lipreading alone. The up-
per part of each bar shows the improve-
ment in recognition obtained when the
ABI was added to lipreading.
References Shannon, R. V. (1992). A model of safe levels for electrical

stimulation. IEEE Transactions in Biomedical Engineering,
Brackmann, D. E., Hitselberger, W. E., Nelson, R. A., Moore, 39, 424–426.
J. K., Waring, M., Portillo, F., et al. (1993). Auditory Shannon, R. V., Fayad, J., Moore, J. K., Lo, W., O’Leary, M.,
brainstem implant: I. Issues in surgical implantation. Otto, S., et al. (1993). Auditory brainstem implant: II. Post-
Otolaryngology–Head and Neck Surgery, 108, 624–634. surgical issues and performance. Otolaryngology–Head and
Edgerton, B. J., House, W. F., Hitselberger, W. (1984). Hear- Neck Surgery, 108, 635–643.
ing by cochlear nucleus stimulation in humans. Annals of Shannon, R. V., Moore, J., McCreery, D., and Portillo, F.
Otology, Rhinology, and Laryngology, 91, 117–124. (1997). Threshold-distance measures from electrical stimu-
Eisenberg, L. S., Maltan, A. A., Portillo, F., Mobley, J. P., and lation of human brainstem. IEEE Transactions on Rehabili-
House, W. F. (1987a). The central electroauditory prosthe- tation Engineering, 5, 1–5.
sis: Clinical Results. In J. D. Andrade, J. J. Brophy, D. E. Shannon, R. V., and Otto, S. R. (1990). Psychophysical mea-
Detmer, S. W. Kim, R. A. Normann, D. B. Olsen, and sures from electrical stimulation of the human cochlear nu-
R. L. Stephen (Eds.), Artificial Organs (pp. 491–505). New cleus. Hearing Research, 47, 159–168.
York: VCH Publishing. Terr, L. I., Fayad, J., Hitselberger, W. E., and Zakhary, R.
Eisenberg, L. S., Maltan, A. A., Portillo, F., Mobley, J. P., and (1990). Cochlear nucleus anatomy related to central elec-
House, W. F. (1987b). Electrical stimulation of the auditory troauditory prosthesis implantation. Otolaryngology–Head
brain stem structure in deafened adults. Journal of Rehabil- and Neck Surgery, 102, 717–721.
itation Research and Development, 24, 9–22. Trofatter, J. A., MacCollin, M. M., Rutter, J. L., Murrell,
McCreery, D. G., Shannon, R. V., Moore, J. K., Chatterjee, J. R., Duyao, M. P., Parry, D. M., et al. (1993). A novel
M., and Agnew, W. F. (1998). Accessing the tonotopic moesin-, ezrin-, radixin-like gene is a candidate for the neu-
organization of the ventral cochlear nucleus by intra- rofibromatosis 2 tumor suppressor. Cell, 72, 791–800.
nuclear microstimulation. IEEE Transactions on Rehabili-
tation Engineering, 6, 391–399.
Moore, J. K., and Osen, K. K. (1979). The cochlear nuclei in
man. American Journal of Anatomy, 154, 393–417.
Otto, S. A., Brackmann, D. E., Hitselberger, W. E., Shannon, Auditory Brainstem Response in Adults
R. V., and Kuchta, J. (2002). The multichannel auditory
brainstem implant update: Performance in 60 patients.
Journal of Neurosurgery, 96, 1063–1071. The auditory brainstem response (ABR) is a series of five
Otto, S. A., Shannon, R. V., Brackmann, D. E., Hitselberger, to seven neurogenic potentials, or waves, that occur
W. E., Staller, S., and Menapace, C. (1998). The multi- within the first 10 ms following acoustic stimulation
channel auditory brainstem implant: Performance in 20 (Sohmer and Feinmesser, 1967; Jewett, Romano, and
patients. Otolaryngology–Head and Neck Surgery, 118, Williston, 1970). The potentials are the scalp-recorded
291–303. synchronous electrical activity from groups of neurons in
Rouleau, G. A., Merel, P., Luchtman, M., Sanson, M., response to a rapid-onset (<1 ms) stimulus. An example
Zucman, C., Marineau, C., et al. (1993). Alteration in
a new gene encoding a putative membrane-organizing pro-
of these potentials, with their most common labeling
tein causes neurofibromatosis type 2. Nature, 363, 515– scheme using Roman numerals, is shown in Figure 1.
521. Waves I and II are generated in the auditory nerve, wave
Shannon, R. V. (1989). Threshold functions for electrical stim- III is predominantly from the cochlear nucleus, and
ulation of the human cochlear nucleus. Hearing Research, waves IV and V are predominantly from the superior
40, 173–178. olivary complex and lateral lemniscus (Moller, 1993).
uated in absolute and relative terms. Absolute latency is

measured from the arrival of the stimulus at the ear to
the positive peak of waves I, III, and V. Relative latency
is measured between relevant peaks within the same ear
or between ears. The absolute latency reflects the state of
the auditory system to the generation site of each wave
but may be a¤ected by conductive or cochlear pathol-
ogy, making it di‰cult to isolate any delay from neural
pathology. Interpeak intervals allow an estimation of
neural conduction time and are less dependent on pe-
ripheral pathology than absolute latencies are. Interaural
Figure 1. The normal ABR elicited by a high-level click stimu- values limit variability by using the nonsuspect ear as a
lus, with waves I–VI labeled. control in a patient with unilateral pathology, but inter-
aural values also may be a¤ected by asymmetrical con-
ductive or cochlear pathology.
The amplitudes of the waves are more variable than
The ABR is valuable in audiology and neurology be- the latencies and therefore are less useful for determining
cause of its reliability and highly predictable changes in normality of the waveform. Amplitudes, measured from
many pathological conditions a¤ecting the auditory sys- the positive peak to the averaged baseline or from posi-
tem. The ABR may be used in a number of ways in tive peak to subsequent negative trough, are a¤ected by
adults. Perhaps the most common use is in the diagnostic the quality of the electrode contact, physiological noise
assessment of a hearing loss, either to determine the site levels of the patient, and amplitudes of adjacent waves.
of the lesion or to determine the function of the neural Consequently, absolute amplitude values are little used
system. A second use is to estimate auditory sensitivity in except in the case of absent waves. Relative amplitudes,
patients who are unable or unwilling to provide accurate particularly the amplitude ratio of waves I and V, may
behavioral thresholds. A third use is for monitoring the be useful measures to control for measurement variables.
auditory nerve and brainstem pathways during surgery A decrement in wave V amplitude relative to wave I
for auditory nerve tumors or vestibular nerve section. amplitude may suggest auditory nerve or low brainstem
For diagnostic and neural monitoring purposes, the pathology.
latencies and amplitudes of the most reliable waves, For the estimation of auditory threshold, wave V may
waves I, III, and V, are analyzed. For estimation of au- be traced to its detection threshold, which is typically
ditory sensitivity, the lowest detection level for wave V is defined as the lowest stimulus level at which wave V can
used to approximate auditory threshold. Responses are be seen. The ABR does not measure hearing per se, but
replicated to ensure reliability of the waveforms. correlates with auditory sensitivity in most cases. Clicks,
Because the potentials are small (<1 mV) and em- which are broadband stimuli, provide estimates of aver-
bedded in high levels of background electrical noise, age sensitivity in the range of 2000–4000 Hz because of
several techniques are used to enhance the visibility of cochlear physiology that biases the response to the high-
the potentials (ASHA, 1988). Surface electrodes are frequency region of the cochlea (Fowler and Durrant,
attached to the scalp of the patient, with the placement 1993). For better definition of thresholds across the fre-
in line with the orientation of the dipoles of the neural quency range, frequency-specific stimuli, such as tone
generators. Pairs of electrodes are used such that one pips or clicks with ipsilateral masking, may be used
electrode picks up positive activity and one electrode (Stapells, Picton, and Durieux-Smith, 1993). Because of
picks up negative activity from the neural generators. limitations in the signals and the cochlea, thresholds for
Di¤erential amplification/common mode rejection en- frequencies below 1000 Hz are di‰cult to obtain, and
hances the electrical activity that is di¤erent between two other methods, such as middle or late auditory-evoked
electrodes (the potentials) and reduces the activity that is potentials or steady-state evoked potentials, may yield
the same between electrodes (the random background better results or additional information (see Hall, 1992;
electrical noise). The physiological response is filtered to Jacobson, 1993).
eliminate the extraneous electroencephalographic activ- Clicks at high presentation levels are the most com-
ity and external line noise. Signal averaging increases the monly used stimuli for diagnostic ABRs because their
magnitude of the time-locked response and minimizes abrupt rise times elicit the necessary degree of synchrony
the random background activity. Artifact rejection to obtain the full complement of waves. Although the
eliminates unusually high levels of electrical activity that normal ABR to a click is dominated by neurons asso-
are impossible to eliminate through averaging. ciated with 2000–4000 Hz, a peripheral hearing loss at
The diagnostic interpretation of the ABR is based on those frequencies may e¤ectively filter the stimulus,
the latencies and amplitudes of the component waves. causing di¤erent frequency regions to dominate the re-
The latencies, or times of occurrence, of the waves are sponse in di¤erent hearing losses, which will a¤ect the
the more reliable measures because latencies for a given ensuing ABR latencies (see Fowler and Durrant, 1993).
person remain stable across recording sessions unless ABR latencies from suspect ears are compared with the
intervening pathology has occurred. The latency is eval- norms at equivalent sound pressure levels at the cochlea.
Auditory Brainstem Response in Adults 431
Figure 3. The top panel illustrates the ABR for a person with a
mild conductive hearing loss, with wave V labeled to the visual
detection threshold of 70 dB pSPL. The lower panel shows the
latencies for waves I, III, and V plotted above the normative
values, indicated by the bounded areas.
latency (Bauch and Olsen, 1990). Figure 2 includes the

ABRs for high-level signals to threshold for a person
Figure 2. The top panel illustrates the normal ABR elicited by
with normal hearing, along with the latencies of those
clicks from 110 dB pSPL to 30 dB pSPL, with wave V labeled
down to the visual detection threshold of 40 dB pSPL. The waves plotted against the normal range. All latencies are
lower panel shows the latencies of waves I, III, and V plotted within the normal limits, and the threshold is 40 dB peak
against the normative values for those waves, indicated by the sound pressure level (pSPL), which equals 10 dB nHL
bounded areas. (normalized hearing level).
Conductive Hearing Loss. The absolute latencies are

The typical e¤ects of auditory pathology on the latencies prolonged and the amplitudes are reduced relative to the
and thresholds of the ABR are discussed below. degree of the conductive component of a hearing loss,
but the interwave intervals are normal because the
Normal Hearing. Normative latencies may vary some- neural system is intact. Consequently, a person with a
what among clinics, but are typically derived from the 30-dB conductive hearing loss will produce an ABR to a
mean and G2 standard deviations of the latencies of 90-dB nHL click that is approximately equivalent to the
waves from a jury of listeners with normal hearing. ABR to a 60-dB nHL click for a person with normal
Examples of normative values for waves I, III, and V hearing. An example of a waveform and the resulting
are shown as the bounded areas in the lower panel in latencies from a 30-dB flat, conductive hearing loss are
Figure 2. Typical normal interpeak intervals are shown in Figure 3. All absolute latencies are prolonged,
<2.51 ms for interpeak interval I–III, <2.31 ms for but the I–V latency di¤erence is within normal limits.
III–V, <4.54 ms for I–V, and <0.4 ms for interaural V The threshold is 70 dB pSPL.
moderate, flat cochlear hearing loss, with wave V labeled to its
vestibular schwannoma, with the reliable waves labeled. The
visual detection threshold of 100 dB pSPL. The lower panel
lower panel shows the latencies for waves I and V plotted
shows the latencies for waves I, III, and V plotted within the
against the normative values for those waves. Wave I is within
normative values, indicated by the bounded areas.
normal limits, whereas wave V is significantly prolonged,
yielding a prolonged interwave I–V interval. The threshold
for wave V does not correspond to the behavioral auditory
Cochlear Hearing Loss. The degree and configuration threshold.
of the hearing loss a¤ect the latencies of the waves in a
cochlear hearing loss, although typically the I–V interval
is normal because the neural system is intact. Most mild morphology of the ABR depending on the type, loca-
to moderate cochlear losses do not a¤ect the latencies tion, and size of the pathology. E¤ects may include
of the ABR for high-level click stimuli, although the absence of waves, prolonged absolute latencies or inter-
amplitudes of the waves may be reduced. Severe high- wave intervals, or prolonged interaural wave V latencies.
frequency hearing losses may reduce the amplitudes and In Figure 5, the waveform and resulting latencies are
prolong the absolute latencies of the waves with little shown from a patient with an acoustic neuroma and a
e¤ect on the I–V latency interval, although wave I may mild, high-frequency hearing loss. Wave I is within nor-
be absent. An example of the waveform and resulting mal limits at the two highest levels, but the absolute
latencies from a moderate, flat hearing loss are shown in latency for wave V, and consequently the I–V interval,
Figure 4. Absolute and interwave latencies are within is prolonged beyond the normal limits. The wave V
normal limits, and the threshold is 100 dB pSPL. threshold may be variable in people with retrocochlear
pathology, and may not provide a useful estimation of
Retrocochlear Hearing Loss. Retrocochlear pathology behavioral threshold.
refers to any neural pathology of the auditory system Finally, for intraoperative monitoring, the ABR is
that is beyond the cochlea and may include such dis- used to warn the surgeon of possible damage to the au-
orders as acoustic neuromas, multiple sclerosis, brain- ditory nerve during surgery for auditory nerve tumors or
stem strokes, and head trauma. Retrocochlear pathology resection of the vestibular nerve, particularly when the
may produce a variety of e¤ects on the latencies and goal includes preservation of hearing. The amplitudes
Auditory Neuropathy in Children 433
and latencies of waves I and V are monitored during the audiogram shape and lesion location. Archives of Otolar-
surgery when compromise of the nerve is a possibility. yngology, 103, 605–622.
Wave I provides an index of cochlear function and Cone-Wesson, B. (1995). How accurate are bone-conduction
neural function peripheral to the tumor and wave V ABR tests? American Journal of Audiology, 4, 14–19.
Davis, H., and Hirsch, S. K. (1979). A slow brain stem re-
provides an index of neural function central to the tu-
sponse for low-frequency audiometry. Audiology, 18, 445–
mor. Significant prolongations of the latency of wave V 461.
or a decline in the amplitude of wave V may suggest Durrant, J. D., and Fowler, C. G. (1996). ABR protocols for
damage to the nerve, which then may be at risk for per- dealing with asymmetric hearing loss. American Journal of
manent injury. Occasionally, the surgical technique may Audiology, 5, 5–6.
be modified in an attempt to avoid permanent injury to Goldstein, R., and Aldrich, W. M. (1999). Evoked potential
the nerve. audiometry: Fundamentals and applications. Boston: Allyn
and Bacon.
—Cynthia G. Fowler Hood, L. J. (1998). Clinical applications of the auditory brain-
stem response. San Diego, CA: Singular Publishing Group.
References Jewett, D. L. (1970). Volume conducted potentials in response
to auditory stimuli as detected by averaging in the cat.
ASHA Audiologic Evaluation Working Group on Auditory Electroencephalography and Clinical Neurophysiology, 28,
Evoked Potential Measurements. (1988). The short latency 609–618.
auditory evoked potentials. Rockville, MD: American Musiek, F. E., and Baran, J. A. (1986). Neuroanatomy, neu-
Speech-Language-Hearing Association. rophysiology, and central auditory assessment: Part I. Brain
Bauch, C. D., and Olsen, W. O. (1990). Comparison of ABR stem. Ear and Hearing, 7, 207–219.
amplitudes with TIPtrode and mastoid electrodes. Ear and Stapells, D. R. (1994). Low-frequency hearing and the audi-
Hearing, 11, 463–467. tory brainstem response. American Journal of Audiology, 3,
Fowler, C. G., and Durrant, J. D. (1993). The e¤ects of pe- 11–13.
ripheral hearing loss on the ABR. In J. Jacobson (Ed.), Wiedemayer, H., Fauser, B., Sandalcioglu, I. E., Schafer, H.,
Principles and applications in auditory evoked potentials (pp. and Stolke, D. (2002). The impact of neurophysiological
237–250). Needham, MA: Allyn and Bacon. intraoperative monitoring on surgical decisions: A critical
Hall, J. W., III (1992). Handbook of auditory evoked responses. analysis of 423 cases. Journal of Neurosurgery, 96, 55–62.
Boston: Allyn and Bacon. Wrege, K., and Starr, A. (1981). Binaural interaction in human
Jacobson, J. (Ed.). (1993). Principles and applications in audi- auditory brainstem evoked potentials. Archives of Neurol-
tory evoked potentials. Needham, MA: Allyn and Bacon. ogy, 38, 572–580.
Jewett, D. L., Romano, M. N., and Williston, J. S. (1970).
Human auditory evoked potentials: Possible brain stem
components detected on the scalp. Science, 167, 1517–1518.
Moller, A. R. (1993). Neural generators of auditory evoked
potentials. In J. Jacobson (Ed.), Principles and applications Auditory Neuropathy in Children
in auditory evoked potentials (pp. 23–46). Needham, MA:
Allyn and Bacon.
Sohmer, H., and Feinmesser, M. (1967). Cochlear action The disorder now known as auditory neuropathy (AN)
potentials recorded from the external ear in man. Annals of has been defined only within the past 10 years (Starr et
Otology, Rhinology, and Laryngology, 76, 427–438. al., 1991), although references to patients with this dis-
Stapells, D., Picton, T. W., and Durieux-Smith, A. (1993). order appeared as early as the 1970s and 1980s (Fried-
Electrophysiologic measures of frequency-specific auditory man, Schulman, and Weiss, 1975; Ishii and Toriyama,
function. In J. Jacobson (Ed.), Principles and applications in 1977; Worthington and Peters, 1980; Kraus et al., 1984;
auditory evoked potentials (pp. 251–283). Needham, MA:
Jacobson, Means, and Dhib-Jalbut, 1986). This disorder
Allyn and Bacon.
is particularly deleterious when it occurs in childhood
because it causes significant disturbance of encoding of
Further Readings temporal features of sound, which severely limits speech
Borg, E., and Lovqvist, L. (1982). A lower limit for auditory perception and, consequently, the development of oral
brainstem response. Scandinavian Audiology, 11, 277–278. language skills.
Brown, C. J., Hughes, M. L., Luk, B., Abbas, P. J., Wolaver, Patients with AN have three key characteristics. First,
A., and Gervais, J. (2000). The relationship between EAP they have a hearing disorder in the form of elevated
and EABR thresholds and levels used to program the Nu- pure-tone thresholds (which can vary from slight to
cleus 24 speech processor: Data from adults. Ear and Hear- profound) or significant dysfunction of hearing in noise.
ing, 21, 151–163. Second, they have evidence of good outer hair cell func-
Burkhard, R. (1984). Sound pressure level measurement tion, in the form of either present otoacoustic emissions
and spectral analysis of brief acoustic transients. Electro- or an easily recognized cochlear microphonic compo-
encephalography and Clinical Neurophysiology, 58, 83–91.
nent. Third, they have evidence of neural dysfunction at
Chiappa, K. H., Gladstone, K. J., and Young, R. R. (1979).
Brainstem auditory evoked responses: Studies of waveform the level of the primary auditory nerve. This condition
variations in 50 normal human subjects. Archives of Neu- manifests with an abnormal or absent auditory brain-
rology, 36, 81–87. stem response (ABR), beginning with wave I. The pres-
Coats, A. C., and Martin, J. L. (1977). Human auditory nerve ence of hearing dysfunction in quiet, and of poor or
action potentials and brainstem evoked responses: E¤ects of absent ABR, distinguish this disorder from central
auditory dysfunction, in which hearing and ABR are been isolated (Butinar et al., 1999; Kalaydjieva et al.,
both normal. 1996, 1998). Other syndromes that have been associated
The presence of normal outer hair cell function and with AN include Charcot-Marie-Tooth syndrome, Frie-
the absence of wave I of the ABR narrow the po- dreich’s ataxia, Ehlers-Danlos syndrome, and Stevens-
tential sites of lesion in AN to (1) the inner hair cell, (2) Johnson syndrome (Sininger and Oba, 2001).
the synaptic junction between the inner hair cell and Many patients with AN show no risk factors. How-
the auditory nerve, and (3) the peripheral portion of the ever, other health issues often associated with AN in
auditory nerve. There is evidence to support the first infants include hyperbillirubinemia and prematurity
and third possibilities, but no direct evidence of synaptic (Stein et al., 1996; Sininger and Oba, 2001). As we begin
disorder. to detect hearing loss in the newborn period with
Starr (2001) has found that approximately one-third screening, it will be possible to obtain more data on
of all patients with AN have symptoms of peripheral those factors that may be directly associated with AN.
nerve disease. Approximately 80% of adults with AN The pure-tone hearing loss in patients with AN ranges
demonstrate concomitant peripheral neuropathy, while from slight to profound, with a greater percentage of
no patients less than 5 years old show clinical evidence of severe and profound loss than in patients with sensory
peripheral nerve disorder. Peripheral neuropathy that hearing loss. Any configuration of loss can occur, but
was not evident in some of the younger patients emerged low-frequency emphasis or flat configurations are most
in children who were followed over time. In patients with often seen. Only rarely is AN unilateral, but such cases
other peripheral nerve involvement, disease of the pri- have been described.
mary portion of the auditory nerve would be the most The symptoms and hearing loss of patients with AN
parsimonious explanation for the auditory disorder. can fluctuate dramatically on a day-to-day basis or more
More direct evidence of primary auditory nerve dis- frequently. The most dramatic instance of rapid changes
ease in humans was reported by Spoendlin (1974) from in symptoms has been described as ‘‘temperature-
postmortem temporal bone histologic studies of two sensitive auditory neuropathy.’’ Starr et al. (1998) de-
sibling adult patients with moderate hearing loss. These scribed three such patients in whom severe symptoms,
individuals had a full complement of inner and outer including nearly complete loss of hearing and ABR, ac-
hair cells but significant loss of spiral ganglion cells. companied a slight fever. The symptoms in these patients
Similar findings were reported by Nadol (2001) in a pa- would remit, leaving nearly normal auditory function,
tient with Charcot-Marie-Tooth syndrome and hearing as soon as the core temperature returned to normal.
loss. Symptoms of AN can progress slowly over time, and the
Harrison (1999, 2001) has shown that both carbopla- symptoms seen in newborns sometimes improve in the
tin treatment and anoxia can induce isolated inner hair first few months of life.
lesions in chinchillas. The same animals had otoacoustic Patients with AN have poor performance on all au-
emissions and abnormal ABR results. Amatuzzi et al. ditory tasks involving temporal processing. For example,
(2001) recently reported an autopsy analysis of the tem- AN patients have dramatically abnormal gap detection
poral bones of three premature infants; findings included thresholds and temporal modulation transfer functions
isolated inner hair cell loss with a full complement of when compared with patients with sensory hearing loss
outer hair cells and auditory neurons. These infants had or normal hearing. In contrast, loudness functions, such
failed ABR screening while in the neonatal intensive care as intensity discrimination and temporal integration, are
unit. This study presented the first evidence in humans normal for subjects with AN (Zeng et al., 2001).
that an isolated inner hair cell disorder is a possible ex- Speech perception ability is notably reduced in these
planation for AN. patients, especially in regard to the degree of hearing loss
No currently available clinical tools can provide data (Sininger and Oba, 2001). Poor speech discrimination
to distinguish between the inner hair cell and the audi- can be directly linked to reduced temporal processing
tory nerve as site of lesion in AN. Because young chil- (Zeng et al., 2001).
dren with AN often do not show evidence of other The ABR is abnormal or absent in cases of audi-
peripheral nerve involvement, it is particularly di‰cult tory neuropathy. In all cases except profound loss, the
to know what the underlying pathology of their AN threshold of the ABR does not correspond to the audi-
might be. However, it is also not clear how any distinc- tory thresholds, and for this reason the ABR cannot be
tion in pathology could be used to remediate the hearing used to predict hearing levels in children with AN. In
di‰culties in these patients. some cases of AN, a wave V can be distinguished, but it
is usually small in amplitude, extended in latency, and
Description of Patients the response threshold is unrelated to pure-tone hearing
levels.
Most patients with AN have disease onset in child- The ABR waveform of the patient with AN, when
hood (Sininger and Oba, 2001). The sex distribution recorded with electrodes placed at the mastoid or other-
is approximately equal. Close to half of patients with wise near the ear, will usually show evidence of a large
AN have a family history or an associated genetic cochlear microphonic (CM) component. This pattern
syndrome, indicating a genetic basis for the disorder. can be easily distinguished from an early neural response
In some cases, specific chromosomal anomalies have because the CM waveform will invert with stimulus
Auditory Neuropathy in Children 435
Figure 1. Audiologic findings in a 7-year-old boy with auditory sient evoked otoacoustic emission. C, Auditory brainstem
neuropathy. A, Audiogram. Speech awareness threshold: right response. Study was performed using insert earphones, with a
ear ¼ 20 dB HL, left ear ¼ 25 dB HL. Speech discrimination: click stimulus rate of 25/s at 80 dB nHL. Rarefaction and
right ear ¼ 28%, left ear ¼ 8%. Typanometry was within nor- condensation graphs are overlaid; right ear and left ear.
mal limits; the acoustic reflex threshold was absent. B, Tran-
polarity and demonstrate a mirror image when the question is whether children with moderate hearing loss
waveforms corresponding to the two stimuli are super- and AN who do not receive benefit from hearing aids
imposed. Also, unlike neural response, the CM compo- should be considered for cochlear implants. Results in
nent does not change latency with a decrease in stimulus patients with moderate loss due to AN and cochlear
level and is relatively undisturbed by noise masking. The implants may be available in the near future to help
CM can be evidence of either inner or outer hair cell answer this important question.
function. The CM can be distinguished from stimulus
artifact by a simple procedure. If an insert earphone is Case Report
used, clamping the tubing will cause the CM to disap- In a 7-year-old boy with AN, the medical history was
pear, because no e¤ective stimulus will reach the ear. At unremarkable. Speech development was normal before
the same time, stimulus artifact from the transducer will the age of 5 years, but after that time, response to sound
remain after the tubing is clamped. was inconsistent, and di‰culty with speech production
In most cases of AN, an otoacoustic emission (OAE) was noted. At age 7 years the audiogram in Figure 1 was
is seen, regardless of the degree of hearing loss. How- obtained, revealing a moderate hearing loss bilaterally.
ever, the OAE has diminished over time in some of these Tympanometry was normal, acoustic reflexes were ab-
patients, for unknown reasons (Deltenre et al., 1997). sent, and otoacoustic emissions were present. ABR test-
The CM component can be substituted for the OAE as ing revealed no response to 80 dB nHL stimuli, but
evidence of hair cell function. evidence of a cochlear microphonic component was seen
Brainstem reflexes involving the auditory system, in- in the recording. Magnetic resonance imaging of the
cluding middle ear muscle reflexes and olivocochlear brain and cranial nerves VII and VIII was normal, as
reflexes (suppression of OAE with contralateral noise), was a neurological examination. Amplification and FM
are almost universally absent in patients with AN. systems were used sparingly, with little success. This
Again, unlike in sensory loss, this finding is true regard- child relies heavily on speechreading, supplemented with
less of the degree of hearing loss present. manual communication as necessary.
This child represents a possible late-onset case of AN
Rehabilitation Strategies of unknown etiology. His audiogram shows the often
seen nonuniform configuration with peaks and valleys.
Inherently poor timing in the auditory system of patients Significant fluctuations in his hearing have been noted
with AN requires specialized approaches to the devel- over time. He is also typical in that he relies very heavily
opment of speech and language. The use of visual in- on visual cues, including speechreading, supplemented
formation to supplement speech perception is most by signs, for receptive communication.
important. Manual communication in a total communi-
cation system (oral speech and sign language simulta- —Yvonne S. Sininger
neously) is often recommended. For very young children
with no language system, this approach helps to ensure References
that a language system will develop regardless of the Amatuzzi, M. G., Northrop, C., Liberman, M. C., Thornton,
capacity of the auditory system to process oral speech. A., Halpin, C., Herrmann, B., et al. (2001). Selective inner
In mild cases of AN (those with mild or moderate hear- hair cell loss in premature infants and cochlea pathological
ing thresholds), a cued-speech approach can be useful patterns from neonatal intensive care unit autopsies.
(Cone-Wesson, Rance, and Sininger, 2001). Archives of Otolaryngology–Head and Neck Surgery, 127,
Standard amplification does not provide the same 629–636.
degree of benefit for patients with AN as in patients with Butinar, D., Zidar, J., Leonardis, L., Popovic, M., Kalayd-
conductive or sensory hearing loss. However, for many jieva, L., Angelicheva, D., et al. (1999). Hereditary audi-
tory, vestibular, motor, and sensory neuropathy in a
children, some advantages can be gained from amplifi-
Slovenian Roma (Gypsy) kindred. Annals of Neurology, 46,
cation, including lower thresholds for the detection of 36–44.
environmental sounds and even small increases in speech Cone-Wesson, B., Rance, G., and Sininger, Y. S. (2001). Am-
perception ability. Parents should be cautioned that the plification and rehabilitation strategies for patients with au-
benefits of amplification will be limited, and a trial pe- ditory neuropathy. In Y. S. Sininger and A. Starr (Eds.),
riod should be used to determine if any help is a¤orded Auditory neuropathy: A new perspective on hearing disorders
by the hearing aids. (pp. 233–249). Albany, NY: Singular/Thompson Learning.
Cochlear implants have been used in many children Deltenre, P., Mansbach, A. L., Bozet, C., Clercx, A., and
with AN (Trautwein, Sininger, and Nelson, 2000; Shal- Hecox, K. E. (1997). Auditory neuropathy: A report on
lop et al., 2001; Trautwein et al., 2001). Electrical three cases with early onsets and major neonatal illnesses.
Electroencephalography and Clinical Neurophysiology (Ire-
stimulation of the auditory nerve may reintroduce the land), 104, 17–22.
temporal encoding through neural synchrony, necessary Friedman, S. A., Schulman, R. H., and Weiss, S. (1975).
for speech perception. Most of the children with AN Hearing and diabetic neuropathy. Archives of Internal
who have received cochlear implants perform similarly Medicine, 135, 573–576.
to other deaf children, but good performance is not al- Harrison, R. V. (1999). An animal model of auditory neurop-
ways achieved (Trautwein et al., 2001). An important athy. Ear and Hearing, 19, 355–361.
Auditory Scene Analysis 437
Harrison, R. V. (2001). Models of auditory neuropathy based Worthington, D. W., and Peters, J. F. (1980). Quantifiable
on inner hair cell damage. In Y. S. Sininger and A. Starr hearing and no ABR: Paradox or error? Ear and Hearing, 1,
(Eds.), Auditory neuropathy: A new perspective on hearing 281–285.
disorders (pp. 51–66). Albany, NY: Singular/Thompson Zeng, F.-G., Oba, S., Garde, S., Sininger, Y. S., and Starr, A.
Learning. (2001). Psychoacoustics and speech perception in auditory
Ishii, T., and Toriyama, M. (1977). Sudden deafness with se- neuropathy. In Y. S. Sininger and A. Starr (Eds.), Auditory
vere loss of cochlear neurons. Annals of Otology, Rhinology, neuropathy: A new perspective on hearing disorders (pp.
and Laryngology, 86, 541–547. 141–164). Albany, NY: Singular/Thompson Learning.
Jacobson, G. P., Means, E. D., and Dhib-Jalbut, S. (1986).
Delay in the absolute latency of auditory brainstem re-
sponse (ABR) component P1 in acute inflammatory de-
myelinating disease. Scandinavian Audiology, 15, 121–124. Auditory Scene Analysis
Kalaydjieva, L., Hallmayer, J., Chandler, D., Savov, A.,
Nikolova, A., Angelicheva, D., et al. (1996). Gene mapping
in Gypsies identifies a novel demyelinating neuropathy on
Sensory systems such as hearing probably evolved in
chromosome 8q24. Nature Genetics, 14, 214–217.
Kalaydjieva, L., Nikolova, A., Turnev, I., Petrova, J., Hris- order for organisms to determine objects in their envi-
tova, A., Ishpekova, B., et al. (1998). Hereditary motor and ronment, allowing them to navigate, feed, mate, and
sensory neuropathy: Lom, a novel demyelinating neurop- communicate. Objects vibrate and as a result produce
athy associated with deafness in gypsies. Brain, 121, 399– sound. An auditory system provides the neural architec-
408. ture for the organism to process sound, and thereby to
Kraus, N., Ozdamar, O., Stein, L., and Reed, N. (1984). Ab- learn something about these objects or sound sources. In
sent auditory brain stem response: Peripheral hearing loss most situations, many sound sources are present at the
or brain stem dysfunction? Laryngoscope, 94, 400–406. same time, but the sound from these various sources
Nadol, J. B., Jr. (2001). Primary cochlear neuronal degenera- arrives at the organism as one complex sound field, not
tion. In Y. S. Sininger and A. Starr (Eds.), Auditory neurop-
as separate, individual sounds. The challenge for the
athy: A new perspective on hearing disorders (pp. 99–140).
Albany, NY: Singular/Thompson Learning. auditory system is to process this complex sound field so
Shallop, J. K., Peterson, A., Facer, G. W., Fabry, L. B., and that the individual sound sources can be determined.
Driscoll, C. L. W. (2001). Cochlear implants in five cases of That is, the auditory system is presented with a complex
auditory neuropathy: Postoperative findings and progress. auditory scene, and the auditory images in this scene are
Laryngoscope, 111, 555–562. the sound sources (Bregman, 1990). The auditory system
Sininger, Y. S., and Oba, S. (2001). Patients with auditory must be capable of performing auditory scene analysis if
neuropathy: Who are they and what can they hear? In Y. S. it is to determine the sources of sounds.
Sininger and A. Starr (Eds.), Auditory neuropathy: A new Auditory scene analysis is not undertaken in the au-
perspective on hearing disorders (pp. 15–35). Albany, NY: ditory periphery (the cochlea and auditory nerve). The
Singular/Thompson Learning.
Spoendlin, H. (1974). Optic and cochleovestibular degener-
auditory periphery provides a spectral-temporal neural
ations in hereditary ataxias: II. Temporal bone pathology in code for the acoustic information contained within the
two cases of Friedreich’s ataxia with vestibulo-cochlear dis- auditory scene. That is, the auditory nerve relays to the
orders. Brain, 97, 41–48. auditory brainstem the coding performed by the cochlea.
Starr, A. (2001). The neurology of auditory neuropathy. In This neural code provides the central nervous system
Y. S. Sininger and A. Starr (Eds.), Auditory neuropathy: A with information about the spectral components that
new perspective on hearing disorders (pp. 37–49). Albany, make up the auditory scene in terms of their frequencies,
NY: Singular/Thompson Learning. levels, and timing. The central auditory nervous system
Starr, A., McPherson, D., Patterson, J., Don, M., Luxford, must then analyze this peripheral neural code so that the
W. M., Shannon, R., et al. (1991). Absence of both auditory individual sound sources that generated the scene can be
evoked potentials and auditory percepts dependent on tim-
ing cues. Brain, 114, 1157–1180.
determined.
Starr, A., Sininger, Y. S., Winter, M., Derebery, J., Oba, S., What information might be preserved in the periph-
and Michalewski, H. (1998). Transient deafness due to eral code that is usable by the central auditory system
temperature-sensitive auditory neuropathy. Ear and Hear- for auditory scene analysis? Several cues have been sug-
ing, 19, 169–179. gested. They include frequency separation, temporal
Stein, L., Tremblay, K., Pasternak, J., Banerjee, S., Linde- separation, spatial separation, level di¤erences in spec-
mann, K., and Kraus, N. (1996). Brainstem abnormalities tral profiles, asynchronous onsets and o¤sets, harmonic
in neonates with normal otoacoustic emissions. Seminars in structure, and temporal modulation (Yost, 1992). These
Hearing, 17, 197–213. are properties of the sound generated by sound sources
Trautwein, P., Shallop, J., Fabry, L., and Friedman, R. (2001). that may be preserved in the peripheral code. As an ex-
Cochlear implantation of patients with auditory neurop-
athy. In Y. S. Sininger and A. Starr (Eds.), Auditory
ample, if two sound sources each vibrate with a di¤erent
neuropathy: A new perspective on hearing disorders (pp. frequency, the two frequencies will be mixed into a single
203–231). Albany, NY: Singular/Thompson Learning. auditory scene arriving at the listener. The auditory sys-
Trautwein, P. G., Sininger, Y. S., and Nelson, R. (2000). tem could ascertain that two frequencies are pres-
Cochlear implantation of auditory neuropathy. Journal of ent, indicating two sound sources. We know that this is
the American Academy of Audiology, 11, 309–315. possible since within certain boundary conditions, the
auditory periphery codes for the frequency content of The example of the missing fundamental pitch can be
any complex sound. generalized to describe acoustic situations in which the
The example of two sound sources each producing a auditory system segregates sounds into more than one
di¤erent frequency is the basis of a set of experiments source. A naturally occurring sound source is unlikely to
designed to investigate auditory scene analysis. Imagine have all but one of its frequency components harmoni-
that the sound coming from each of the two sources is cally related. Thus, in the example cited above, it is un-
pulsed on and o¤ so that the sound from one source (one likely that a single sound source would have a spectrum
frequency) is on when the sound from the other source with frequency components at 300, 400, 550, 600, and
(a di¤erent frequency) is o¤. The perception of the stim- 700 Hz (the 550-Hz component is the inharmonic com-
ulus condition could be described as a single sound with ponent that replaced the 500-Hz component). In fact,
an alternating pitch. However, since each sound could be when one of the harmonics of a series of harmonically
from a di¤erent source, the perception of the stimulus related frequency components is ‘‘mistuned’’ from the
condition could also be that of two sound sources, each harmonic relationship (550 Hz in the example), listeners
producing a pulsing tone of a particular frequency. Each are likely to perceive two pitches (as if there were two
of these perceptions is possible given the exact fre- sound sources), one associated with the 100-Hz har-
quencies and timing used in the experiment. When the monic relationship and the other with the frequency of
perception is one of two di¤erent sound sources, the the mistuned harmonic (Hartmann, McAdams, and
percept is often described as if two perceptual streams Smith, 1990). That is, the 550-Hz mistuned harmonic
were running side by side, each stream representing a is perceptually segregated as a separate pitch from the
sound source. The stimulus conditions that lead to this 100-Hz complex pitch associated with the rest of the
form of stream segregation are likely to be those that harmonically related components. Such dual pitch per-
promote the segregation of sound from one source from ception suggests there were two potential sound sources.
that of another source (Bregman, 1990). Many of the In this case, the auditory system appears to be using a
parameters listed above have been studied using this wide frequency range (300–700 Hz) to process these two
auditory streaming paradigm. In general, stimulus pa- pitches, and hence perceives two potential sound sources.
rameters associated with frequency are more likely to The complex pitch example can also be used to ad-
support stream segregation (Kubovy, 1987), but most of dress the role of stimulus onset as another potential cue
the parameters listed can support auditory stream segre- used for auditory scene analysis. Two sound sources may
gation under certain conditions. each produce a harmonically related spectrum, such that
Experiments to study auditory scene analysis, such as one sound source may have frequency components at
auditory streaming, require listeners to process sound 150, 300, 450, 600, and 750 Hz (harmonics of 150 Hz)
over a large range of frequencies and over time. Since a and another at 233, 466, 699, and 832 Hz (harmonics of
great deal of work in auditory perception and psycho- 233 Hz). When presented in isolation these two sounds
acoustics has concentrated on short-time processing in will produce pitches of 150 and 233 Hz. However, if the
narrow frequency regions, less is known about auditory two stimuli are added together so that they come on and
processing across wide regions of the spectra and longer go o¤ together, it is unlikely that two pitches will be
periods of time. Therefore, obtaining a better under- perceived. The perception of two pitches can be recov-
standing of cross-spectral processing and long-time ered if one of the complex sounds comes on slightly be-
processing is very important for revealing processes and fore the other one, even though both sounds remain on
mechanisms that may assist auditory scene analysis together thereafter. Thus, if the sound from one source
(Yost, 1992). comes on (or in some cases goes o¤ ) before another
One of the traditional examples of cross-spectral sound, then the asynchronous onsets (or o¤sets) promote
processing that relates to auditory scene analysis is the sound source segregation, aiding in auditory scene anal-
processing of the pitch of complex sounds, such as the ysis (Darwin, 1981).
pitch of the missing fundamental (see pitch perception). If two sounds have di¤erent temporal patterns of
For these spectrally complex stimuli, usually ones that modulation they may be perceptually segregated on the
have a harmonic structure, a major perceptual aspect of basis of temporal modulation (especially if the modula-
the stimulus is the perception of a single pitch (Moore, tion is amplitude modulated; Moore and Alcantara,
1997). Conditions such as the pitch of the missing fun- 1996). Detecting a tonal signal in a wideband-noise
damental suggest that the auditory system uses a wide background is improved if the noise is amplitude modu-
range of frequencies to determine the complex pitch and lated, suggesting that the modulation helps segregate the
that this complex pitch may be the defining acoustic tone from the noise background (Hall, Haggard, and
characteristic of a harmonic sound source, such as the Fernandes, 1984).
musical note of a piano key. A sound consisting of the Sounds from spatially separated sources may help in
frequencies 300, 400, 500, 600, and 700 Hz would most determining the auditory scene. The ability of the audi-
likely have a perceived pitch of 100 Hz (the missing tory system to use sound to locate objects in the real
fundamental in the sound’s spectrum). The 100-Hz pitch world also appears to help segregate one sound source
may help in determining the existence of a sound source from another (Yost, 1997). However, the ability to de-
with this 100-Hz harmonic structure. termine the instruments (sound sources) in an orchestral
Auditory Training 439
piece played over a single loudspeaker suggests that Auditory Training

having actual sources in our immediate environment at
di¤erent locations is not required for auditory scene
analysis. Auditory training includes a collection of activities, the
Thus, in order to use sound to determine something goal of which is to change auditory function, auditory
about objects in our world, the central auditory system behaviors, or the ways in which individuals approach
must process the neural code for sound in order to parse auditory tasks. Auditory training most commonly is
that code into subsets of neural information where each associated with the rehabilitation of individuals with
subset may be the neural counterpart of a sound source. hearing loss, but it has been used with other populations
Several di¤erent parameters of sound sources are pre- that have presumed di‰culties with auditory processing,
served by the neural code and may form the basis of such as children with specific language impairment,
auditory scene analysis. Determining the sources of phonologic disorder, dyslexia, and autism (Wharry,
sound requires processing across a wide range of fre- Kirkpatrick, and Stokes, 1987; Bettison, 1996; Merze-
quencies and over time, and is required for organisms to nich et al., 1996; Habib et al., 1999). Auditory training
successfully cope with their environments. has been applied to children diagnosed with central au-
ditory processing and to adults learning a second lan-
—William A. Yost guage (Solma and Adepoju, 1995; Musiek, 1999). It also
has been used experimentally to assess the plasticity of
References speech perceptual categories and to determine the neu-
rological substrates of speech perception learning and
Bregman, A. S. (1990). Auditory scene analysis: The perceptual organization (Werker and Tees, 1984; Bradlow et al.,
organization of sound. Cambridge, MA: MIT Press.
Darwin, C. J. (1981). Perceptual grouping of speech compo- 1997; Tremblay et al., 1997, 1998, 2001; Wang et al.,
nents di¤ering in fundamental frequency and onset time. 1999).
Quarterly Journal of Experimental Psychology, 33A, 185– Most auditory training programs are organized
207. around three parameters: auditory processing approach,
Hall, J. W. III, Haggard, M., and Fernandes, M. A. (1984). auditory skill, and stimulus di‰culty level (Erber and
Detection of noise by spectro-temporal pattern analy- Hirsh, 1978; Erber, 1982; Tye-Murray, 1998). When
sis. Journal of the Acoustical Society of America, 76, 50– implementing an auditory training program, the first
58. decision is whether to use a top-down (synthetic) or a
Hartmann, M., McAdams, S., and Smith, B. K. (1990). Hear- bottom-up (analytic) processing approach or a combi-
ing a mistuned harmonic in an otherwise periodic complex
nation of both. Most normal-hearing adults tend to rely
tone. Journal of the Acoustical Society of America, 88,
1712–1724. primarily on top-down processing when listening to on-
Kubovy, M. (1987). Concurrent pitch segregation. In W. A. going speech, but if the goal for a patient is to learn or
Yost and C. S. Watson (Eds.), Audiotry processing of com- relearn an auditory skill, a more bottom-up processing
plex sounds. Hillsdale, NJ: Erlbaum. approach may be warranted. The skill to be learned or
Moore, B. C. J. (1997). An introduction to the psychology of relearned (e.g., detection, discrimination, identification,
hearing (3rd ed.). London: Academic Press. and comprehension) and the complexity of the stimuli
Moore, B. C. J., and Alcantara, J. I. (1996). Vowel identifica- are largely dictated by the status of the patient’s auditory
tion based on amplitude modulation. Journal of the Acous- skills and the goals of auditory training. The stimulus
tical Society of America, 99, 2332–2343. di‰culty can be manipulated by changing such factors
Yost, W. A. (1992). Auditory image perception. Hearing Re-
as set size, acoustic similarity, speed, linguistic complex-
search, 56, 8–19.
Yost, W. A. (1997). The cocktail party e¤ect: 40 years later. ity, lexical familiarity, visual cues, contextual support,
In R. Gilkey and T. Anderson (Eds.), Localization and and environmental acoustics. It also can be adjusted by
spatial hearing in real and virtual environments. Hillsdale, digitally manipulating specific acoustic parameters such
NJ: Erlbaum. as formant transition duration, f1 intensity, or noise
burst spectra. Typically, the training starts with skill and
stimulus levels at which the patient just exhibits di‰-
Further Readings culty. Then the skill and stimulus di‰culty are system-
Hartmann, W. M. (1988). Pitch perception and the organiza- atically increased as performance improves until the
tion and integration of auditory entities. In G. W. Edelman, training goal is attained. For example, the final goal
W. E. Gall, and W. M. Cowan (Eds.), Auditory function: might be that the patient will reduce fricative place con-
Neurobiological bases of hearing. New York: Wiley. fusions to 25% in connected discourse. In a bottom-up
Moore, B. C. J. (1997). An introduction to the psychology of approach, the patient might work on fricative place dis-
hearing (3rd ed.). London: Academic Press. crimination in CV syllables with the same vowel, then
Yost, W. A. (1992). Auditory perception and sound source de-
termination. Current Directions in Psychological Sciences, 1, with di¤erent vowels, words, phrasal and sentence struc-
12–15. tures, and finally at the discourse level. Speaking rate,
Yost, W. A., and Stanley, S. (1993). Auditory perception. In vocal intensity, environmental acoustics, and contextual
W. A. Yost, R. R. Fay, and A. Popper (Eds.), Psycho- cues can be adjusted at each level to increase the listen-
acoustics. New York: Springer-Verlag. ing demands. At the discourse level, the conversational
demands also can be increased systematically. As with more strongly advocated for infants and children with
most skills, learning likely is facilitated by cycling across hearing loss than for adults, but even fewer interpretable
a number of di‰culty levels and stimuli, having frequent studies have been reported to support its application in
training sessions, and varying the duration and location children and infants.
of the training. Although supporting literature is limited with respect
In a more top-down approach, the patient might start to auditory training of hearing-impaired populations,
at the discourse level and work on evaluating and using perceptual training studies with normal-hearing indi-
context to predict topic and word choices. The focus is viduals suggest that the impact of auditory training
less on hearing the place cues and more on increasing the on perception may be underestimated. For example,
awareness of various contextual cues that can be used to normal-hearing adults and children have been trained to
predict the topic flow within discourse. Initially, familiar perceive non-native speech contrasts (Werker and Tees,
topics and speakers may be used in quiet conditions with 1984; Bradlow et al., 1997; Wang et al., 1999). Although
visual cues provided, and then the discourse material can not all speech contrasts can be learned equally well, and
be increased in complexity, unfamiliar and multiple adults usually fail to reach native speaker performance
speakers can be introduced, as well as noise and visual levels, the e¤ects of training are retained over months
distractions. As a result, the patient becomes better able and show generalization within and across sound cate-
to fill information gaps when individual sound segments gories (McClaskey, Pisoni, and Carrell, 1983; Lively et
or even entire words and phrases are misperceived. With al., 1994; Tremblay et al., 1997). Digitally manipulating
this type of training approach, patients usually receive specific acoustic parameters of speech does not always
counseling on how to manipulate context so that they improve speech perception in expected ways, but shap-
reduce listening di‰culty, and how to recover if a pre- ing speech perception by gradually adjusting more di‰-
dictive or perceptual error results in a communication cult acoustic properties is under investigation in various
breakdown. disordered populations and may prove fruitful in the fu-
Auditory training is not routinely used with all adults ture for persons with hearing loss (Bradlow et al., 1999;
with hearing loss but tends to be reserved for individuals Habib et al., 1999; Merzenich et al., 1996; Thibodeau,
who have sustained a recent change in auditory status Friel-Patti, and Britt, 2001). Furthermore, Kraus and
or a substantive increase in auditory demands. For colleagues (Kraus et al., 1995; Tremblay et al., 1997)
example, adults with sudden deafness, recent cochlear have argued that auditory training impacts the physiol-
implant recipients, people switching to dramatically ogy of the central auditory system and might result in
di¤erent hearing aids with di¤erent signal-processing cortical and subcortical reorganization. If neural reor-
schemes, students entering college, or individuals who ganization occurs after the fitting of a hearing aid or
are beginning a new job that is auditorily demanding cochlear implant, then it is likely that these types of
might benefit from auditory training (see cochlear patients would be sensitive to intensive auditory training
implants; evaluation of cochlear implant candidacy during the reorganization period (Kraus, 2001; Purdy,
in adults; auditory brainstem implant). Patients who Kelly, and Thorne, 2001).
do not make expected improvements in audition and See also auditory brainstem implant; speech dis-
speech after the fitting of a hearing aid or sensory im- orders secondary to hearing impairment acquired in
plant also are reasonable candidates for auditory train- adulthood; speech tracking; speechreading training
ing (see speech perception indices). The fact that most and visual tracking.
adults do not elect to receive auditory training and typi-
cally are not referred for auditory training may be a —Sheila Pratt
consequence of the limited data documenting the e¤ec- References
tiveness and e‰cacy of auditory training programs. Only
a small number of studies have been published that have Bettison, S. (1996). The long-term e¤ects of auditory training
on children with autism. Journal of Autism and Devel-
assessed auditory training outcomes in adults with hear- opmental Disorders, 26, 361–374.
ing loss. Rubenstein and Boothroyd (1987) found only Bradlow, A. R., Kraus, N., Nicol, T. G., McGee, T. J., Cun-
modest benefit with sentence- and syllable-level auditory ningham, J., Zecker, S. G., and Carrell, T. D. (1999).
training with adults who had been successful hearing aid E¤ects of lengthened formant transition duration on dis-
wearers, but they did observe maintenance of gains that crimination and neural representation of synthetic CV syl-
were obtained. Walden et al. (1981) found that adults lables by normal and learning-disabled children. Journal of
newly fitted with hearing aids benefited significantly the Acoustical Society of America, 106, 2086–2096.
from systematic consonant discrimination training, while Bradlow, A. R., Pisoni, D., Akahane-Yamada, R., and Toh-
Kricos and Holmes (1996) found that older adults with kura, Y. (1997). Training Japanese listeners to identify
previous hearing aid experience did not benefit from English /r/ and /l/: IV. Some e¤ects of perceptual learning
on speech production. Journal of the Acoustical Society of
consonant and vowel discrimination training but did America, 101, 2299–2310.
benefit from active listening training. Auditory training Erber, N. (1982). Auditory training. Washington, D.C.: A.G.
usually focuses on speech and language stimuli, but mu- Bell Association.
sic perceptual training programs have been developed Erber, N., and Hirsh, I. (1978). Auditory Training. In H. Davis
for cochlear implant recipients and appear to be e¤ective and S. R. Silverman (Eds.), Hearing and Deafness (pp. 358–
(Gfeller et al., 1999). In addition, auditory training is 374). New York: Rinehart and Winston.
Auditory Training 441
Gfeller, K., Witt, S. A., Kim, K., Adamek, M., and Co¤man, Walden, B. E., Erdman, S. A., Montgomery, A. A., Schwartz,
D. (1999). Preliminary report of a computerized music D. M., and Prosek, R. A. (1981). Some e¤ects of training on
training program for adult cochlear implant recipients. speech recognition by hearing-impaired adults. Journal of
Journal of the Academy of Rehabilitative Audiology, 32, Speech and Hearing Research, 24, 207–216.
11–27. Wang, Y., Spence, M. M., Jongman, A., and Sereno, J. A.
Habib, M., Espesser, R., Rey, V., Giraud, K., Bruas, P., and (1999). Training American listeners to perceive Mandarin
Gres, C. (1999). Training dyslexics with acoustically modi- tones. Journal of the Acoustical Society of America, 106,
fied speech: Evidence of improved phonological perfor- 3649–3658.
mance. Brain and Cognition, 40, 143–146. Werker, J., and Tees, R. (1984). Cross-language speech per-
Kraus, N. (2001). Auditory pathway encoding and neural ception: Evidence for perceptual reorganization during the
plasticity in children with hearing problems. Audiology and first year of life. Infant Behavior and Development, 7, 49–63.
Neuro-Otology, 6, 221–227. Wharry, R. E., Kirkpatrick, S. W., and Stokes, K. D. (1987).
Kraus, N., McGee, T., Carrell, T. D., King, C., Tremblay, K., Auditory training: E¤ects on auditory retention with the
and Nicol, T. (1995). Central auditory system plasticity with learning disabled. Perceptual and Motor Skills, 65, 1000.
speech discrimination training. Journal of Cognitive Neuro-
science, 7, 25–32. Further Readings
Kraus, N., McGee, T., Carrell, T. D., and Sharma, T. (1995).
Neurophysiologic bases of speech discrimination. Ear and Alcantara, J., Cowan, R. S., Blamey, P., and Clark, G. (1990).
Hearing, 16, 19–37. A comparison of two training strategies for speech recog-
Kricos, P., and Holmes, A. (1996). E‰cacy of audiologic re- nition with an electrotactile speech processor. Journal of
habilitation for older adults. Journal of the American Acad- Speech and Hearing Research, 33, 195–204.
emy of Audiology, 7, 219–229. Blamey, P. J., and Alcantra, J. I. (1994). Research in auditory
Lively, S. E., Pisoni, D. B., Yamada, R. A., Tohkura, Y., and training. In J. P. Gagne and N. Tye-Murray (Eds.), Re-
Yamada, T. (1994). Training Japanese listeners to identify search in audiological rehabilitation: Current trends and
English /r/ and /l/. III. Long-term retention of new phonetic future directions. Journal of the Academy of Rehabilitation
categories. Journal of the Acoustical Society of America, 96, Audiology, 27, 161–191.
2067–2087. Carhart, R. (1947). Auditory training. In H. Davis (Ed.),
McClaskey, C. L., Pisoni, D. B., and Carrell, T. D. (1983). Hearing and deafness (pp. 276–299). New York: Rinehart.
Transfer of training of a new linguistic contrast in voicing. Durity, R. P. (1982). Auditory training for severely hearing-
Perception and Psychophysics, 34, 323–330. impaired adults. In D. G. Sims, G. G. Walter, and R. L.
Merzenich, M., Jenkings, W., Johnston, P., Schreiner, C., Whitehead (Eds.), Deafness and communication: Assessment
Miller, S. L., and Tallal, P. (1996). Temporal processing and training (pp. 296–311). Baltimore: Williams and Wilkins.
deficits of language-learning impaired children ameliorated Erber, N. P., and Lind, C. (1994). Communication therapy:
by training. Science, 217, 77–81. Theory and practice. In J. P. Gagne and N. Tye-Murray
Musiek, F. (1999). Habilitation and management of audi- (Eds.), Research in audiological rehabilitation: Current
tory processing disorders: Overview of selected procedures. trends and future directions. Journal of the Academy of Re-
Journal of the American Academy of Audiology, 10, 329–342. habilitation Audiology, 27, 267–287.
Purdy, S. C., Kelly, A. S., and Thorne, P. R. (2001). Auditory Gagne, J. P. (1994). Visual and audiovisual speech perception
evoked potentials as measures of plasticity in humans. training: Basic and applied research. In J. P. Gagne and N.
Audiology and Neuro-Otology, 6, 211–215. Tye-Murray (Eds.), Research in audiological rehabilitation:
Rubenstein, A., and Boothroyd, A. (1987). E¤ect of two Current trends and future directions. Journal of the Academy
approaches to auditory training on speech recognition by of Rehabilitation Audiology, 27, 317–336.
hearing-impaired adults. Journal of Speech and Hearing Garstecki, D. C. (1981). Audio-visual training paradigm for
Research, 30, 153–160. hearing impaired adults. Journal of the Academy of Reha-
Solma, R. T., and Adepoju, A. A. (1995). The e¤ect of aural bilitative Audiology, 14, 223–228.
feedback in second language vocabulary learning. Journal Gold, J., Bennett, P. J., and Sekuler, A. B. (1999). Signal but not
of Behavioral Education, 5, 433–445. noise changes with perceptual learning. Nature, 402, 176–178.
Thibodeau, L. M., Friel-Patti, S., and Britt, L. (2001). Psy- Houston, K. T., and Montgomery, A. A. (1997). Auditory-
choacoustic performance in children completing Fast visual integration: A practical approach. Seminars in Hear-
ForWord training. American Journal Speech-Language- ing, 18, 141–151.
Pathology, 10, 248–257. Hutchinson, K. (1990). An analytic distinctive feature
Tremblay, K., Kraus, N., Carrell, T. D., and McGee, T. approach to auditory training. Volta Review, 92, 5–13.
(1997). Central auditory system plasticity: Generalization to Pisoni, D. B., Aslin, R. N., Perey, A. J., and Hennessy, B. L.
novel stimuli following listening training. Journal of the (1982). Some e¤ects of laboratory training on identification
Acoustical Society of America, 102, 3762–3773. and discrimination of voicing contrasts in stop consonants.
Tremblay, K., Kraus, N., and McGee, T. (1998). The time Journal of Experimental Psychology, 8, 297–314.
course of auditory perceptual learning: Neurophysiological Spitzer, J. B., Leder, S. B., and Giolas, T. G. (1993). Rehabili-
changes during speech-sound training. Neuroreport: An In- tation of late-deafened adults: Modular program manual. St.
ternational Journal for the Rapid Communication of Re- Louis: Mosby.
search in Neuroscience, 9, 3557–3560. Tallal, P., Miller, S. L., Bedi, G., Byma, G., Wang, X.,
Tremblay, K., Kraus, N., McGee, T., Ponton, C., and Otis, B. Nagarajan, S. S., et al. (1986). Language comprehension in
(2001). Central auditory plasticity: Changes in the N1-P2 language-learning impaired children improved with acous-
complex after speech-sound training. Ear and Hearing, 22, tically modified speech. Science, 271, 81–84.
79–90. Tye-Murray, N. (1997). Communication training for hard-of-
Tye-Murray, N. (1998). Foundations of aural rehabilitation. San hearing adults and older teenagers: Speechreading, listening,
Diego, CA: Singular Publishing Group. and using repair strategies. Austin, TX: Pro-Ed.
Classroom Acoustics normal hearers (Crandell and Smaldino, 2000, 2001).

While it is recognized that listeners with SNHL experi-
ence greater speech recognition deficits in noise than
The acoustic environment of a classroom is a critical normal hearers, a number of populations of children
factor in the psychoeducational and psychosocial devel- with ‘‘normal hearing’’ sensitivity also experience sig-
opment of children with normal hearing and children nificant di‰culties recognizing speech in noise. These
with hearing impairment. Inappropriate levels of class- populations of normal-hearing children include young
room reverberation, noise, or both can deleteriously af- children (less than 15 years old); those with conductive
fect speech perception, reading and spelling ability, hearing loss; children with a history of recurrent otitis
classroom behavior, attention, concentration, and aca- media; those with a language disorder, articulation
demic achievement. Poor classroom acoustics have also disorder, dyslexia, or learning disability; non-native
been shown to negatively a¤ect teacher performance speakers of English, and others with various degrees
(Crandell, Smaldino, and Flexer, 1995). This article of hearing impairment or developmental delays. Due
discusses several acoustic factors that can compromise to high background noise levels, the range of S/Ns in
communication between teacher and child. These classrooms has been reported to be between approxi-
acoustic factors include (1) the level of the background mately 7 and þ5 dB.
noise in the classroom, (2) the relative intensity of the
information-carrying components of the speech signal to Reverberation
a non-information-carrying signal or noise (signal-to- Reverberation refers to the prolongation or persistence
noise ratio, S/N), (3) the degree to which the temporal of sound within an enclosure as sound waves reflect o¤
aspects of the information-carrying components of the hard surfaces (bare walls, ceilings, windows, floor) in the
speech signal are degraded (reverberation), and (4) the room. Operationally, reverberation time (RT) refers to
distance from the speaker to the listener. the amount of time it takes for a sound, at a specific
frequency, to decay 60 dB (or one-millionth of its origi-
Background Noise nal intensity) following termination of the signal. Exces-
sive reverberation degrades speech recognition through
Background noise refers to any auditory disturbance
the masking of direct and early-reflected energy by
that interferes with what a listener wants or needs to reverberant energy. Generally speaking, speech recogni-
hear. Background noise levels in classrooms are often
tion tends to decrease as the RT of the environment
too high for children to perceive speech accurately. In
increases. Speech recognition in individuals with nor-
general, background classroom noise a¤ects a child’s
mal hearing is often not compromised until the RT
speech recognition by reducing, or masking, the highly
exceeds approximately 1.0 s (Nabelek and Pickett,
redundant acoustic and linguistic cues available in the
1974a, 1974b). Listeners with SNHL, however, need
teacher’s voice. Because the energy of consonant pho-
considerably shorter RTs (0.4–0.5 s) for maximum
nemes is considerably less than that of vowel phonemes,
speech recognition (Crandell, 1991; Crandell, Smaldino,
background noise in a classroom often masks the con-
and Flexer, 1995). Studies have also indicated that the
sonants more than the vowels. Loss of consonant infor-
populations of ‘‘normal-hearing’’ children discussed
mation has a great e¤ect on speech recognition because previously also have greater speech recognition di‰-
the vast majority of the cues important for accurate
culties in reverberation than do young adults with nor-
speech recognition are carried by the consonants.
mal hearing. Unfortunately, RTs for classrooms are
In most listening environments, the fundamental de- often reported to range from 0.4 to 1.2 s.
terminant for speech recognition is not the overall level
of the room noise, but rather the relationship between E¤ects of Noise and Reverberation on Speech
the intensity of the signal and the intensity of the back-
Recognition
ground noise at the listener’s ear. This relationship is
referred to as the signal-to-noise ratio (S/N). Speech In all educational settings, noise and reverberation com-
recognition ability tends to be highest at favorable S/Ns bine in a synergistic manner to adversely a¤ect speech
and decreases as the S/N of the listening environment is recognition. That is, noise and reverberation a¤ect
reduced. Speech recognition in adults with normal hear- speech recognition more than would be expected from
ing is not severely degraded until the S/N approximates an examination of their individual e¤ects on speech per-
0 dB (speech and noise are at equal intensities). The ception. It appears that this occurs because reverberation
speech recognition performance of children with sen- fills in the temporal gaps in the noise, making the noise
sorineural hearing loss (SNHL), however, is reduced more steady state in nature and a more e¤ective masker.
in noise when compared with the performance of chil- As with noise and reverberation in isolation, research
dren with normal hearing (Finitzo-Hieber and Tillman, indicates that listeners with hearing impairment and
1978). Specifically, children with SNHL require the S/N ‘‘normal-hearing’’ children experience greater speech
to be improved by 4–12 dB, and by an additional recognition di‰culties in noise and reverberation than
3–6 dB in rooms with moderate levels of reverberation, adult normal listeners (see Crandell, Smaldino, and
for them to obtain recognition scores equal to those of Flexer, 1995).
Classroom Acoustics 443
Speaker–Listener Distance Acoustic Modifications of the Classroom

In classrooms, the acoustics of a teacher’s speech signal The fundamental strategy for improving speech per-
changes as it travels to the child. The direct sound is ception within a classroom is acoustic modification of
that sound which travels from the teacher to a child that environment. The most e¤ective procedure for
without striking other surfaces within the classroom. The achieving this goal is through appropriate planning
power of the direct sound decreases with distance be- with contractors, school o‰cials, architects, acoustical
cause the acoustic energy spreads over a larger area as it consultants, audiologists, and teachers for the hearing
travels from the source. Specifically, the direct sound impaired before the design and construction of the
decreases 6 dB in SPL with every doubling of distance building. Acoustic guidelines for hearing-impaired and
from the sound source. This phenomenon, called the in- ‘‘normal-hearing’’ populations indicate the following: (1)
verse square law, occurs because of the geometric diver- S/Ns should exceed þ15 dB, (2) unoccupied noise levels
gence of sound from the source. Because the direct should not exceed 30–35 dBA, and (3) RTs should not
sound energy decreases so quickly, only those listeners surpass 0.4–0.6 s (ASHA, 1995). Unfortunately, such
who are seated close to the speaker will be hearing direct guidelines are rarely achieved in most listening envi-
sound energy. At slightly farther distances from the ronments. Crandell and Smaldino (1995) reported that
speaker, early sound reflections will reach the listener. none of the 32 classrooms in which they measured sound
Early sound reflections are those sound waves that arrive levels met recommended criteria for noise levels. Only
at a listener within very short time periods (approxi- 27% of the classroom met criteria for reverberation. A
mately 50 ms) after the arrival of the direct sound. Early new American National Standards Institute (ANSI
sound reflections are often combined with the direct S12.60) document entitled ‘‘Acoustical Performance
sound and may actually increase the perceived loudness Criteria, Design Requirements and Guidelines for
of the sound. This increase in loudness can improve Schools’’ (ANSI, 2002) promises to set a national stan-
speech recognition in listeners with normal hearing. As a dard for acoustics in classrooms where learning occurs.
listener moves farther away from a speaker, reverbera- If acoustic modifications to learning spaces cannot re-
tion dominates the listening environment. As sound duce noise and reverberation to appropriate levels, then
waves strike multiple classroom surfaces, they generally hearing assistive technologies, such as frequency modu-
decrease in loudness, owing to the increased path length lation (FM) systems, should be implemented for chil-
they travel as well as the partial absorption that occurs dren with ‘‘normal hearing’’ or hearing impairment.
at each reflection o¤ the classroom surfaces. Some re- —Carl C. Crandell and Joseph J. Smaldino
verberation is necessary to reinforce the direct sound and
to enrich the quality of the sound. However, reverbera-
tion can lead to acoustic distortions of the speech signal, References
including temporal smearing and masking of important
American National Standards Institute. (2002). S12.60, Acous-
perceptual cues. tical Performance Criteria, Design Requirements and Guide-
Speech recognition in classrooms depends on the dis- lines for Schools. New York: ANSI.
tance of the child from the teacher. If the child is within Crandell, C. (1991). Classroom acoustics for normal-hearing
the critical distance of the classroom (the point at which children: Implications for rehabilitation. Education Audiol-
the intensity of the direct sound and reflected sound are ogy Monographs, 2, 18–38.
equal in intensity), reflected sound waves have minimal Crandell, C., and Smaldino, J. (1995). An update of classroom
e¤ects on speech recognition. The critical distance in acoustics for children with hearing impairment. Volta Re-
many classrooms is often 5–6 feet from the teacher. Be- view, 1, 4–12.
yond the critical distance, however, the reflections can Crandell, C., and Smaldino, J. (2000). Room acoustics and
amplification. In M. Valente, R. Roeser, and H. Hosford-
compromise speech recognition if there is enough of a Dunn (Eds.), Audiology: Treatment strategies. New York:
spectrum or intensity change in the reflected sound to Thieme.
interfere with the recognition of the direct sound. Over- Crandell, C., and Smaldino, J. (2001). Rehabilitative tech-
all, speech recognition scores tend to decrease until the nologies for individuals with hearing loss and abnormal
critical distance of the classroom is reached. Beyond the hearing. In J. Katz (Ed.), Handbook of audiology. New
critical distance, recognition ability tends to remain es- York: Williams and Wilkins.
sentially constant unless the classroom is very large (such Crandell, C., Smaldino, J., and Flexer, C. (1995). Sound field
as an auditorium). In such environments, speech recog- FM amplification: Theory and practical applications. San
nition may continue to decrease as a function of in- Diego, CA: Singular Press Publishing Group.
creased distance. These findings suggest that speech Finitzo-Hieber, T., and Tillman, T. (1978). Room acoustics
e¤ects on monosyllabic word discrimination ability for
recognition ability can be maximized by decreasing the normal and hearing-impaired children. Journal of Speech
distance between a speaker and listener only within the and Hearing Research, 21, 440–458.
critical distance of the classroom. Thus, preferential Nabelek, A., and Pickett, J. (1974a). Monaural and binaural
seating, while not a bad idea to improve visual per- speech perception through hearing aids under noise and re-
ception, may only minimally improve auditory speech verberation with normal and hearing-impaired listeners.
perception. Journal of Speech and Hearing Research, 17, 724–739.
Nabelek, A., and Pickett, J. (1974b). Reception of consonants

in a classroom as a¤ected by monaural and binaural listen-
ing, noise, reverberation, and hearing aids. Journal of the
Further Readings
Bess, F., Sinclair, J., and Riggs, D. (1984). Group amplifica-
tion in schools for the hearing-impaired. Ear and Hearing,
5, 138–144.
Bradley, J. (1986). Speech intelligibility studies in classrooms. Figure 1. Decision matrix. Abbreviations: pl, number of pa-
Journal of the Acoustical Society of America, 80, 846–854. tients with hearing loss; pn, number of patients with normal
Crandell, C. (1992). Classroom acoustics for hearing-impaired hearing; ht, number of hits; ms, number of misses; fa, number
children. Journal of the Acoustical Society of America, 92(4), of false alarms; cr, number of correct rejections.
2470.
Crandell, C., and Bess, F. (1986). Speech recognition of chil-
dren in a ‘‘typical’’ classroom setting. ASHA, 29, 87.
Crandell, C., and Smaldino, J. (1996). Sound field amplifica-
the potential for error. That is, a particular score could,
tion in the classroom: Applied and theoretical issues. In with finite probability, be produced by either condition.
F. Bess, J. Gravel, and A. Tharpe (Eds.), Amplification for CDA is well-suited to deal with this type of problem.
children with auditory deficits (pp. 229–250). Nashville, TN: For the discussions and examples in this article, we
Bill Wilkerson Center Press. will assume that we are trying to identify hearing loss. Of
Gengel, R. (1971). Acceptable signal-to-noise ratios for aided course, CDA can be used with diagnostic tests that are
speech discrimination by the hearing impaired. Journal of designed to identify a variety of diseases and conditions.
Auditory Research, 11, 219–222.
Harris, C. (1991). Handbook of acoustical measurements and Decision Matrix
noise control. New York: McGraw-Hill.
Houtgast, T. (1981). The e¤ect of ambient noise on speech in- A patient is given a test, and the outcome of the test is
telligibility in classrooms. Applied Acoustics, 14, 15–25. either positive or negative for hearing loss. The test is
Kodaras, M. (1960). Reverberation times of typical elementary fallible and makes errors, reflecting the ‘‘noise’’ in the
school settings. Noise Control, 6, 17–19. testing process. There are four possible outcomes, deter-
Markides, A. (1986). Speech levels and speech-to-noise ratios. mined by the test result and the hearing of the patient.
British Journal of Audiology, 20, 115–120.
Niemoeller, A. (1968). Acoustical design of classrooms for the
These outcomes are represented by a 2 2 decision ma-
deaf. American Annals of the Deaf, 113, 1040–1045. trix (Fig. 1). If the patient has hearing loss, a positive
Olsen, W. (1988). Classroom acoustics for hearing-impaired test is called a hit and a negative result a miss. If the
children. In F. Bess (Ed.), Hearing impairment in children. patient has normal hearing, a positive result is a false
Parkton, MD: York Press. alarm and a negative result is a correct rejection. Di¤er-
Siebein, G., Crandell, C., and Gold, M. (1997). Principles of ent terminology is sometimes used. A hit is a true posi-
classroom acoustics: Reverberation. Education Audiology tive; a miss is a false negative; a false alarm is a false
Monographs, 5, 32–43. positive; a correct rejection is a true negative.
The elements of the matrix in Figure 1 represent the
number of hits, misses, false alarms, and correct rejec-
Clinical Decision Analysis tions when the test is given to a number of patients. The
hits and correct rejections are correct decisions, whereas
misses and false alarms are errors. A basic property is
Clinical decision analysis (CDA) is a quantitative strat- that the number of hits plus misses always equals the
egy for making clinical decisions. The techniques of number of patients with the condition to be identified,
CDA are largely derived from the theory of signal de- e.g., hearing loss. The number of false alarms plus cor-
tection, which is concerned with extracting signals from rect rejections always equals the number of patients
noise. The theory of signal detection has been used to without the condition, e.g., normal hearing.
study the detection of auditory signals by the human
observer. This article provides a brief overview of CDA. Hit Rate, etc.
More comprehensive discussions of CDA and its ap-
plication to audiological tests can be found in Turner, The number of hits, misses, false alarms, and correct
Robinette, and Bauch (1999), Robinette (1994), and rejections can be used to calculate a variety of measures
Hyde, Davidson, and Alberti (1990). of test performance. The most basic measures are hit rate
Assume that we are using a clinical test to distinguish (HT), miss rate (MS), false alarm rate (FA), and correct
between two conditions, such as disease versus no dis- rejection rate (CR). Hit rate (also called true positive
ease or hearing loss versus normal hearing. Most tests rate and sensitivity) is the percentage of hearing loss
would produce a range of scores for each condition and patients correctly identified as positive for hearing loss;
therefore could be thought to contain some ‘‘noise’’ in miss rate (also called false negative rate) is the percent-
their results. More important, there may be an overlap in age of hearing loss patients incorrectly identified as neg-
the scores produced by a test for each condition, creating ative. False alarm rate (also called false positive rate) is
Clinical Decision Analysis 445
the percentage of normal-hearing patients incorrectly

called positive; correct rejection rate (also called true
negative rate and specificity) is the percentage of normal-
hearing patients correctly identified as negative. These
measures are calculated by the following equations:
ht
HT ¼
pl
ms
MS ¼
pl
fa
FA ¼
pn
cr
CR ¼
pn
where pl ¼ number of patients with hearing loss, pn ¼
number of patients with normal hearing, ht ¼ number
of hits, ms ¼ number of misses, fa ¼ number of false
alarms, and cr ¼ number of correct rejections. The
above equations yield a decimal value a 1.0, which can
be converted to a percent. In all calculations, the decimal
form is used. While all four measures can be calculated,
only two, usually HT and FA, need be considered. This
is because HT þ MS ¼ 100% and FA þ CR ¼ 100%;
thus, MS and CR can always be determined from HT
and FA.
The measures of test performance, HT, FA, MS, and
CR, can also be expressed as probabilities. HT is the
probability of a positive test result if the patient has
hearing loss (Pr[þ/L]); FA is the probability of a positive
result if the patient has normal hearing (Pr[þ/N]); MS is
the probability of a negative result given hearing loss
(Pr[/L]), and CR is the probability of a negative result
given normal hearing (Pr[/N]).
Probability Distribution Curve

Consider a theoretical test that produces a score from 0
Figure 2. Probability distribution curves for a theoretical test. to 100. The test is administered to two groups of pa-
A, One distribution (black bars) corresponds to test perfor- tients, one with hearing loss and one with normal hear-
mance for patients with hearing loss, the other (hatched bars) ing. Each group will produce a range of scores on the
for patients with normal hearing. Possible test scores are di-
test. We could plot a histogram of scores for each pop-
vided into ten intervals: 0–10, 11–20, 21–30; . . . ; 91–100. Each
bar indicates the probability of a test score within the interval. ulation, i.e., the number of patients in a group with a
Thus, the black bar located between test score 20 and 30 indi- score between 0 and 10, 10 and 20, etc. Next, we divide
cates that the probability of obtaining a score of 21 to 30 is the number of patients in each score range by the total
17% for a patient with hearing loss. A criterion of 50 is indi- number of patients in the group to obtain the probability
cated by the heavy arrow. Any score equal to or less than the distribution curve (PDC). Essentially, the PDC gives the
criterion is considered positive for hearing loss. Any score probability of obtaining a particular score, or range of
greater than the criterion is negative for hearing loss. Hit rate scores, for each group of patients. PDCs for a theoretical
and false alarm rate are shown for each criterion from 20 to 70. test are shown in Figure 2. Note that there are two
B, Distribution of scores for patients with hearing loss. Bars to PDCs, one for hearing loss and one for normal hearing,
the left of the criterion (50) correspond to hits; bars to the right
and that the two distributions are di¤erent.
correspond to misses. The number in parentheses above each
bar is the height of the bar, that is, the probability of a score in Because the two PDCs in Figure 2 do not completely
the corresponding interval. Hit rate is the sum of the proba- overlap, we may use this test to identify hearing loss.
bilities indicated by the bars corresponding to hits. Miss rate
is likewise determined using bars corresponding to misses. C,
Distribution of scores for patients with normal hearing. The bars the sum of the probabilities indicated by the bars correspond-
to the left of the criterion correspond to false alarms; the bars to ing to false alarms. The correct rejection rate is likewise deter-
the right correspond to correct rejections. The false alarm rate is mined using bars corresponding to correct rejections.
First, however, we must establish a criterion to deter-

mine if a test score is positive or negative for hearing
loss. In Figure 2, we set the criterion at 50. Normal-
hearing patients have, on average, higher scores than
hearing loss patients; therefore, a score greater than 50 is
negative and a score less than or equal to 50 is positive
for hearing loss. Because there is some overlap in the two
PDCs, the criterion divides the two PDCs into four
regions corresponding to hits, misses, false alarms, and
correct rejections. Below the criterion (a50), hearing loss
patients constitute the hits and the normal-hearing
patients constitute the false alarms. Above the criterion,
the hearing loss patients are the misses and the normal-
hearing patients are the correct rejections. Hit rate is the
total probability that hearing loss patients will have a
test score a 50. This equals the sum of the probabilities
for all of the bars below the criterion that correspond to
hearing loss patients.
We can select any criterion for a test, but di¤erent Figure 3. Receiver operating characteristic (ROC) curve. The
criteria will produce di¤erent test performance. If the values calculated in Figure 2 are plotted to form the ROC
criterion is increased (e.g., from 50 to 60), there is an curve. The numbers in parentheses correspond to the criteria
increase in HT, which is good, but there is also an in- used in Figure 2 to determine hit rate and false alarm rate. The
crease in FA, which is bad. If the criterion is reduced dashed line indicates chance performance, that is, HT ¼ FA.
(e.g., from 50 to 40), the FA will be reduced, which is The curve that lies closest to the point, HT/FA ¼ 100/0%, is,
good, but so will the HT, which is bad. Thus, we see that in general, the best test. The test from Figure 2 is therefore
better than ‘‘Test X’’ because the ROC curve lies above the
there is usually a trade-o¤ between HT and FA when we ROC curve for Test X and is closer to HT/FA ¼ 100/0%.
adjust the criterion.
Another interesting result occurs with extreme crite-
ria. We could set the criterion at 100 and call all results
positive for hearing loss. This would produce an HT of
sult given hearing loss (Pr[þ/L]), but the probability that
100%; however, FA would also equal 100%. Likewise,
the patient has hearing loss given a positive test result
with a criterion of 0, FA ¼ 0%, but also HT ¼ 0%.
(Pr[L/þ]). This probability is called a posterior proba-
Thus, HT and FA can be manipulated by changing the
bility. Another posterior probability is Pr[N/]; this is
criterion, but the trade-o¤ between HT and FA limits
the probability of normal hearing given a negative test
the value of this strategy. Because any HT or FA can be
result. These two posterior probabilities are important
obtained by adjusting the criterion, both HT and FA are
because they are the probability of being correct given
needed to evaluate the performance of a test.
a particular test result. There are two other posterior
The ability of a test to distinguish patients is related
probabilities, Pr[L/] and Pr[N/þ], which are the
to the amount of overlap of the PDCs. If in Figure 2 the
probability of being incorrect given a test result. The
two PDCs completely overlapped, then the test would be
posterior probabilities have been given other names:
useless. For any criterion, we would have HT ¼ FA and
predictive value and information content. These mea-
MS ¼ CR. If there was absolutely no overlap in the
sures are identical to the posterior probabilities and thus
PDCs, then the test would be perfect. It would be pos-
provide the same information.
sible to set the criterion such that HT ¼ 100% and
Because Pr[L/þ] þ Pr[N/þ] ¼ 100% and Pr[L/] þ
FA ¼ 0%.
Pr[N/] ¼ 100%, we need calculate only two of the four
Because HT and FA vary significantly with the crite-
posterior probabilities. To calculate the posterior prob-
rion, we can visualize this relationship using a receiver
abilities we need HT and FA for the test, plus the prev-
operating characteristic (ROC) curve, which is a plot of
alence (PD) of the disease or condition in the test
HT versus FA for di¤erent criteria. The HT/FA data
population. Prevalence is the percentage of the test pop-
from Figure 2 are plotted in Figure 3. The shape of the
ulation that has the disease or condition (e.g., hearing
ROC curve is determined by the PDCs.
loss) at the time of testing. The posterior probabilities
are calculated by the following equations:
Posterior Probabilities
Consider this situation. We are testing a patient in the 1
Pr[L/þ] ¼
clinic, and the test result is positive. We know the hit rate (FA) (1 PD)
1þ
of the test, but hit rate is the probability of a positive (HT) (PD)
result given hearing loss. We do not know if the patient
has hearing loss, but we do know the test result. Hit rate 1
Pr[N/] ¼
tells us little about the accuracy of the test result. What (1 HT) (PD)
1þ
we want is not hit rate, the probability of a positive re- (1 FA) (1 PD)
Cochlear Implants 447
The posterior probabilities can also be calculated References

from the number of hits, misses, false alarms, and cor-
rect rejections. Sometimes this is an easier strategy than Hyde, M. L., Davidson, M. J., and Alberti, P. W. (1990). Au-
ditory test strategies. In J. T. Jacobson and J. L. Northern
using the equations above:
(Eds.), Diagnostic audiology (pp. 295–322). Needham
ht Heights, MA: Allyn and Bacon.
Pr[L/þ] ¼ Robinette, M. S. (1994). Integrating audiometric results. In
ht þ fa J. Katz (Ed.), Handbook of clinical audiology (pp. 181–
cr 196). Baltimore: Williams and Wilkins.
Pr[N/] ¼ Turner, R. G., Robinette, M. S., and Bauch, C. D. (1999).
cr þ ms Clinical decisions. In F. E. Musiek and W. F. Rintelmann
The probability of being correct with a positive test (Eds.), Contemporary perspectives in hearing assessment
(pp. 437–463). Needham Heights, MA: Allyn and Bacon.
result is the number of hits divided by the total number
of positive test results (hits plus false alarms). Likewise,
the probability of being correct with a negative test re-
sult is the number of correct rejections divided by the Cochlear Implants
total number of negative results (correct rejections plus
misses).
When the prevalence of a condition (e.g., hearing A cochlear implant is a surgically implantable device
loss) is small, the probability of a positive result being that provides hearing sensation to individuals with se-
correct (Pr(L/þ)) is small, even for a test with high HT vere to profound hearing loss who cannot benefit from
and small FA. For example, consider a test with HT/ conventional hearing aids. By electrically stimulating the
FA ¼ 99/5%. Even with this test, which is better than auditory nerve directly, a cochlear implant bypasses
any audiological test, the probability of a positive result damaged or undeveloped sensory structures in the coch-
being correct is only 29% for PD ¼ 2%. There would be lea, thereby providing usable information about sound
2.5 false alarms for each hit. When prevalence is small, to the central auditory nervous system.
we should expect more false alarms than hits. Although it has been known since the late 1700s that
Now consider Pr[N/], the probability of being cor- electrical stimulation can produce hearing sensations
rect with a negative test result. When prevalence is low, (see Simmons, 1966), it was not until the 1950s that the
Pr[N/] is very large, 99þ% for the example above. potential for true speech understanding was demon-
Only when prevalence is high is there a significant vari- strated. Clinical applications of cochlear implants were
ation in Pr[N/] with test performance. pioneered by research centers in the United States, Eu-
rope, and Australia. By the 1980s, cochlear implants had
E‰ciency become a clinical reality, providing safe and e¤ective
speech perception benefit to adults with profound hear-
HT, FA, and PD can also be used to calculate e‰ciency ing impairment. Since that time, the devices have be-
(EF). EF is the percentage of total test results that are come more sophisticated, and the population that can
correct and is calculated by benefit from implants has expanded to include children
EF ¼ HT PD þ (1 FA) (1 PD) as well as adults with some residual hearing sensitivity
(Wilson, 1993; Shannon, 1996; Loizou, 1998; Osberger
Like the posterior probabilities, e‰ciency is a function of and Koch, 2000).
disease prevalence. When prevalence is small, the false The function of a cochlear implant is to provide
alarm rate drives e‰ciency more than the hit rate. Thus, hearing sensation to individuals with severe to profound
a test with a poor HT and a small FA could have a hearing impairment. Typically, people with that level of
higher EF than a test with a high HT and a modest FA. impairment have absent or malfunctioning sensory cells
Because of this, EF is not always a useful measure. in the cochlea. In a normal ear, sound energy is con-
verted to mechanical energy by the middle ear, and
Conclusion the mechanical energy is then converted to mechanical
fluid motion in the cochlea. Within the cochlea, the sen-
Clinical decision analysis provides us with a variety of
sory cells—the inner and outer hair cells—are sensitive
measures of test performance. These measures are useful
transducers that convert that mechanical fluid motion
for evaluating di¤erent tests and for understanding the
limitations of a particular test. In general, these mea- into electrical impulses in the auditory nerve. Cochlear
implants are designed to substitute for the function of
sures of performance are not su‰cient to identify the
the middle ear, cochlear mechanical motion, and sen-
‘‘best’’ test. To determine the best test for a particular
sory cells, transforming sound energy into the elec-
application, we may need to also consider other issues,
trical energy that will initiate impulses in the auditory
such as the cost or morbidity of the test. The decision as
nerve.
to the best test is based on a cost-benefit analysis, not
All cochlear implant systems comprise both internal
simply measures of test performance. Nevertheless, these
and external components (Fig. 1). Sound enters the
measures of performance are essential to execute a cost-
microphone, and the signal is then sent to the speech
benefit analysis.
processor, which manipulates and converts the acous-
—Robert G. Turner tic signal into a special code (i.e., speech-processing
advantage of the place-to-frequency coding mechanism

used by the normal cochlea. Information about low-
frequency sound is sent to electrodes at the apical end of
the array, whereas information about high-frequency
sounds is sent to electrodes nearer the base of the
cochlea. The ability to take advantage of the place-
frequency code is limited by the number and pattern of
surviving auditory neurons in an impaired ear. Un-
fortunately, attempts to quantify neuronal survival with
electrophysiologic or radiographic procedures before
implantation have been unsuccessful (Abbas, 1993).
The first multi-electrode arrays were straight and thin
(22 electrode bands on the 25-mm-long array) to mini-
mize the occupied space within the scala tympani (Clark
et al., 1983). Advances in electrode technology have led
to the development of precurved, spiral-shaped arrays to
follow the shape of the scala tympani, allowing the con-
tacts to sit close to the target neurons (Fayad, Luxford,
and Linthicum, 2000; Tykocinski et al., 2002). The
advantages of the precurved array are an increase in
Figure 1. External and internal components of a cochlear im-
spatial selectivity, a reduction in channel interaction,
plant system. Sound is picked up by the microphone located in and a reduction in the current required to reach thresh-
the headpiece and converted into an electrical signal, which is old and comfortable listening levels. In addition, the
then sent to the speech processor via a cable. The signal is electrode contacts are oriented toward the spiral gan-
encoded into a speech-processing strategy and is sent from the glion cells, and a ‘‘positioner,’’ a thin piece of Silastic,
speech processor (body-worn or behind-the-ear) to the trans- can be inserted behind the array to achieve even greater
mitter in the headpiece. The signal is transmitted to the internal spatial selectivity and improve speech recognition per-
receiver through transcutaneous inductive coupling of radio- formance (Zwolan et al., 2001).
frequency (RF) signals. The receiver/stimulator sends the sig- For all systems, electrical current is passed between
nal to the electrodes, which stimulate the cochlea with electrical an active electrode and an indi¤erent electrode. If the
current.
active and indi¤erent electrodes are remote, the stimula-
tion is termed monopolar. When the active and indi¤er-
ent electrodes are close to each other, the stimulation
strategy). The transmitter, which is located inside the is referred to as bipolar. Bipolar stimulation focuses
headpiece, sends the coded electrical signal to the inter- the current within a restricted area and presumably
nal components. The internal device contains the re- stimulates a small localized population of auditory nerve
ceiver, which decodes the signal from the speech fibers (Merzenich and White, 1977; van den Honert and
processor, and an electrode array, which stimulates the Stypulkowski, 1987). Monopolar stimulation, on the
cochlea with electrical current. The implanted electronics other hand, spreads current over a wider area and a
are encased in one of two biocompatible materials, tita- larger population of neurons. Less current is required to
nium or ceramic. The entire system is powered by bat- achieve adequate loudness levels with monopolar stimu-
teries located in the speech processor, which is worn on lation; more current is required for bipolar stimulation.
the body or behind the ear. The use of monopolar or bipolar stimulation is deter-
The transmission link enables information to be sent mined by the speech-processing strategy and each indi-
from the external parts of the implant system to the vidual’s response to electrical stimulation.
implanted components. For all current systems, the Two types of stimulation are currently used in coch-
connection is made through transcutaneous inductive lear implants, analog and pulsatile. Analog stimulation
coupling of radio-frequency (RF) signals. In this consists of electrical current that varies continuously in
scheme, an RF carrier signal—in which the important time. Pulsatile stimulation consists of trains of square-
code is embedded—is sent across the skin to the receiver. wave biphasic pulses. The pattern of stimulation can be
The receiver extracts the embedded code and determines either simultaneous or nonsimultaneous (sequential).
the stimulation pattern for the electrodes. Most cochlear With simultaneous stimulation, more than one electrode
implant systems also employ back telemetry, which is stimulated at the same time. With nonsimultaneous
allows the internal components to send information back stimulation, electrodes are stimulated in a specified se-
to the external speech processor to assess the function of quence, one at a time. Typically, analog stimulation is
the implanted electronics and electrode array. simultaneous and pulsatile stimulation is sequential.
The first cochlear implants consisted of a single elec- To represent speech faithfully, the coding strategy
trode, but since the mid-1980s, nearly all devices have must reflect three parameters in its electrical stimulation
multiple electrodes contained in an array. Typically, code: frequency, amplitude, and time. Frequency (pitch)
cochlear implant electrodes are inserted longitudinally information is conveyed by the site of stimulation, am-
into the scala tympani of the cochlea to take potential plitude (loudness) is encoded by the amplitude of the
Cochlear Implants 449
Figure 2. Mean pre- and postimplant scores on

speech perception tests for 51 adults with post-
lingual deafness. Performance was assessed on
CNC monosyllabic words, Central Institute for
the Deaf (CID) sentences, Hearing in Noise Test
(HINT) sentences, and HINT sentences in back-
ground noise (þ10 signal-to-noise ratio). Stimuli
were recorded and presented in the sound field at
70 dB SPL.
stimulus current, and temporal cues are conveyed by the system. Electrical threshold and most comfortable lis-
rate and pattern of stimulation. The first multichannel tening levels are determined for each electrode, and
devices extracted limited information from the acoustic other psychophysical parameters of the speech-process-
input signal (Millar, Tong, and Clark, 1984). Advances ing scheme are programmed into the speech processor.
in signal-processing technology have led to the develop- Multiple visits to the implant center are necessary during
ment of more sophisticated processing schemes. One the first months of implant use as the individual grows
type of strategy is referred to as ‘‘n of m,’’ in which a accustomed to sound and as tolerance for loudness
specified number of electrodes out of a maximum num- increases.
ber available are stimulated (Seligman and McDermott, Most adults who acquire a severe to profound hearing
1995). With this type of processing, the incoming sound loss after language is acquired (postlingual hearing loss)
is analyzed to identify the filters (frequency regions) with demonstrate dramatic improvements in speech under-
the greatest amount of energy, and then a subset of fil- standing after relatively limited implant experience
ters is selected and the corresponding electrodes are (Fig. 2). Improvements in technology have led to incre-
stimulated. mental improvements in benefit, which in turn have led
In another approach, referred to as continuous inter- to expanded inclusion criteria (Skinner et al., 1994;
leaved sampling (CIS), trains of biphasic pulses are Osberger and Fisher, 1999). There are large individual
delivered to the electrodes in an interleaved or non- di¤erences in outcome, and although there is no reliable
overlapping fashion to minimize electrical field inter- method to predict postimplant performance, age at onset
actions between stimulated electrodes (Wilson et al., of significant hearing loss, duration of the loss, and de-
1991). The amplitudes of the pulses delivered to each gree of preoperative residual hearing significantly a¤ect
electrode are derived by modulating them with the speech recognition abilities (Tyler and Summerfield,
envelopes of the corresponding bandpassed waveforms. 1996; Rubinstein et al., 1999). Many adults are able to
With analog stimulation, the incoming sound is sepa- converse on the telephone, and cochlear implants can
rated into di¤erent frequency bands, compressed, and improve the quality of life (Knutson et al., 1998). Adults
delivered to all electrodes simultaneously (Eddington, with congenital or early-acquired deafness and children
1980). In the most recent implementation of this type (see cochlear implants in children) also derive sub-
of processing, the digitized signal is transmitted to the stantial benefit from cochlear implants.
receiver; then, following digital-to-analog conversion, Technological advances will continue, with higher
the analog waveforms are sent simultaneously to all processing speeds o¤ering the potential to stimulate the
electrodes (Kessler, 1999). Bipolar electrode coupling is auditory nerve fibers in a manner that more closely
typically used to limit the area over which electrical cur- approximates that of normal hearing. Studies are under
rent spreads to reduce channel interaction, which is fur- way to evaluate the benefit of bilateral implants (Gantz
ther reduced with the use of spiral-shaped electrodes. et al., 2002). New miniaturization processes will result
Cochlear implant candidacy is determined only after in smaller behind-the-ear processors and, eventually, a
comprehensive evaluations by a team of highly skilled fully implantable system with rechargeable battery
professionals (see cochlear implants in adults: technology. In the early days of cochlear implants, few
candidacy). The surgery is performed under general people realized that this technology would become the
anesthesia and requires about 1–2 hours, either as an most successful of all prostheses of the central nervous
inpatient or outpatient procedure. Approximately 4 system.
weeks following surgery, the individual returns to the
clinic to be fitted with the external components of the —Mary Joe Osberger
References Tyler, R. S., and Summerfield, A. Q. (1996). Cochlear implan-

tation: Relationships with research on auditory deprivation
Abbas, P. (1993). Electrophysiology. In R. S. Tyler (Ed.), and acclimatization. Ear and Hearing, 17(Suppl. 3), 38–50.
Cochlear implants: Audiologic foundations (pp. 317–355). van den Honert, C., and Stypulkowski, P. H. (1987). Single
San Diego, CA: Singular Publishing Group. fiber mapping of spatial excitation patters in the electrically
Clark, G., Shepherd, R., Patrick, J., Black, R., and Tong, Y. stimulated auditory nerve. Hearing Research, 29, 195–206.
(1983). Design and fabrication of the banded electrode Wilson, B. S. (1993). Signal processing. In R. S. Tyler (Ed.),
array. Annals of the New York Academy of Sciences, 405, Cochlear implants: Audiological foundations (pp. 35–85).
191–201. San Diego, CA: Singular Publishing Group.
Eddington, D. (1980). Speech discrimination in deaf subjects Wilson, B. S., Finley, C. C., Lawson, D. T., Wolford, R. D.,
with cochlear implants. Journal of the Acoustical Society of Eddington, D. K., and Rabinowitz, W. M. (1991). Better
America, 68, 885–891. speech recognition with cochlear implants. Nature, 352,
Fayad, J. N., Luxford, W., and Linthicum, F. H. (2000). The 236–238.
Clarion electrode positioner: Temporal bone studies. Amer- Zwolan, T., Kileny, P. R., Smith, S., Mills, D., Koch, D. B.,
ican Journal of Otology, 21, 226–229. and Osberger, M. J. (2001). Adult cochlear implant patient
Gantz, B. J., Tyler, R. S., Rubinstein, J. T., Wolaver, A., performance with evolving electrode technology. Otology
Lowder, M., Abbas, P., et al. (2002). Binaural cochlear and Neurotology, 22, 844–849.
implants placed during the same operation. Otology and
Neurotology, 23, 169–180. Further Readings
Kessler, D. K. (1999). The Clarion Multi-Strategy cochlear
implant. Annals of Otology, Rhinology, and Laryngology, Clark, G., Tony, Y., and Patrick, J. F. (1990). Cochlear pros-
108(Suppl. 177), 8–16. theses. New York: Churchill Livingstone.
Knutson, J. F., Murray, K. T., Husarek, S., Westerhouse, K., Loizou, P. C. (1999). Introduction to cochlear implants. IEEE
Woodworth, G., Gantz, B. J., et al. (1998). Psychological Engineering in Medicine and Biology, January/February,
change over 54 months of cochlear implant use. Ear and 32–42.
Hearing, 19, 191–201. Loizou, P. C. (1999). Signal-processing techniques for cochlear
Loizou, P. C. (1998). Mimicking the human ear. IEEE Signal implants. IEEE Engineering in Medicine and Biology, May/
Processing Magazine, September: 101–130. June, 34–46.
Merzenich, M. M., and White, M. W. (1977). Cochlear im- Moller, A. R. (2001). Neurophysiologic basis for cochlear and
plant: The interface problem. In F. T. Hambrecht and J. B. auditory brainstem implants. American Journal of Audiol-
Reswick (Eds.), Functional electrical stimulation: Applica- ogy, 10, 68–77.
tions in neural prostheses (pp. 321–340). New York: Marcel Schindler, R. A., and Merzenich, M. M. (1985). Cochlear
Dekker. implants. New York: Raven Press.
Millar, J., Tong, Y., and Clark, G. (1984). Speech processing Spelman, F. A. (1999). The past, present, and future of coch-
for cochlear implant prostheses. Journal of Speech and lear prostheses. IEEE Engineering in Medicine and Biology,
Hearing Research, 27, 280–296. May/June, 27–33.
Osberger, M. J., and Fisher, L. (1999). SAS-CIS preference Tyler, R. S. (1993). Cochlear implants: Audiological founda-
study in postlingually deafened adults implanted with the tions. San Diego, CA: Singular Publishing Group.
Clarion cochlear implant. Annals of Otology, Rhinology, Waltzman, S. B., and Cohen, N. L. (2000). Cochlear implants.
and Laryngology, 108(Suppl. 177), 74–79. New York: Thieme.
Osberger, M. J., and Koch, D. B. (2000). Cochlear implants. In
R. E. Sandlin (Ed.), Textbook of Hearing Aid Amplification
(2nd ed., pp. 673–698). San Diego, CA: Singular Publishing Cochlear Implants in Adults:
Group.
Rubinstein, J. T., Parkinson, W. S., Tyler, R. S., and Gantz, Candidacy
B. J. (1999). Residual speech recognition and cochlear im-
plant performance: E¤ects of implantation criteria. Ameri-
can Journal of Otology, 20, 445–452. There are many potential advantages to measuring the
Seligman, P., and McDermott, H. (1995). Architecture of the benefit obtained from cochlear implants. These include:
Spectra 22 speech processor. Annals of Otology, Rhinology,
and Laryngology, 104(Suppl. 166), 139–141.
Determining selection criteria
Shannon, R. V. (1996). Cochlear implants: What have we
Setting the cochlear implant: selecting and modifying
learned and where are we going? Seminars in Hearing, 17, programming options
403–415. Monitoring performance
Simmons, F. B. (1966). Electrical stimulation of the auditory
nerve in man. Archives of Otolaryngology, 84, 2–54. The most important reason for the evaluation of coch-
Skinner, M. W., Clark, G. M., Whitford, L. A., Seligman, lear implants is in the selection criteria process. Specifi-
P. M., Staller, S. J., Shipp, D. B., et al. (1994). Evaluation cally, speech perception tests are critical to determine
of a new spectral peak coding strategy for the Nucleus 22 whether a hearing aid user might do better with a coch-
channel cochlear implant system. American Journal of lear implant. Here we focus on this subject and review
Otology, 15(Suppl. 2), 15–27. some of the tests used in evaluation.
Tykocinski, M., Cohen, L. T., Pyman, B. C., Roland, T.,
Treaba, C., Palamara, J., et al. (2000). Comparison of elec- Selection Criteria
trode position in the human cochlea using various peri-
modiolar electrode arrays. American Journal of Otology, 21, Guidelines traditionally have depended on how much
205–211. benefit is obtained from hearing aids, and how much
Cochlear Implants in Adults: Candidacy 451
Table 1. Principles Involved When Considering Monaural Implantation

Binaural Test Results Monaural Implant Decision
If binaural scores are high, and both ears are contributing Do not implant.
If binaural scores are high, and one ear is not contributing Consider implanting poorer ear to improve spatial hearing.
If binaural scores are medium, and both ears are contributing Implant ear with shorter duration and/or better thresholds.
equally
If binaural scores are medium, one ear is contributing more Implant poorer ear, to improve spatial hearing.
than the other, and best monaural ¼ binaural (no binaural Implant better ear, to improve speech in quiet.
benefit)
If binaural scores are medium, one ear is contributing more Do not implant, to preserve current performance levels.
than the other, and best monaural < binaural (is binaural Implant poorer ear, to improve binaural benefit and
benefit) preserve better ear.
Implant better ear, to improve speech in quiet.
If binaural scores are medium, one ear is not contributing, Implant poorer ear, to improve spatial hearing and speech
and best monaural ¼ binaural (no binaural benefit) in quiet.
If binaural scores are poor, with one ear contributing more Implant better ear, to improve speech in quiet.
than the other
If binaural scores are poor, with ears contributing equally Implant ear with shorter duration and/or better thresholds.
benefit might be expected from a cochlear implant. This has the advantage that linguistic and cognitive factors
is di‰cult, because there are limited databases with are minimized, and clinicians are familiar with the tests.
such information. Specific guidelines for selection crite- When isolated word lists are used, the vocabulary should
ria change regularly, are influenced by company and be common words.
clinic protocols and depend on whether the device is
investigational. Generally, the ‘‘best aided’’ performance Ongoing Speech. Sentence perception includes the
(with appropriately fit binaural hearing aids) is used to processing of information at a more rapid, natural rate,
determine if an implant is desirable. When considering a compared to the presentation of isolated words (Silver-
monaural implant, some centers implant the poorer ear man and Hirsh, 1955; Boothroyd, Hanin, and Hnath,
to ‘‘save’’ the better ear or to allow the patient to use a 1985; Nilsson, Soli, and Sullivan, 1994). Performance
cochlear implant in one ear and a hearing aid (better ear) can be a¤ected by memory, learning, and familiarity
in the other. At other centers, however, the implant is with items as a result of repeated use of the test lists. In
placed in the better ear to maximize implant perfor- addition, a patient who is more willing to guess or who is
mance in an ear with more nerve fibers. better at using contextual clues may score higher than
First, we shall discuss general guidelines. We refer to someone who has similar speech perception abilities but
poor, medium, and high speech perception scores, real- is less willing to guess. Therefore, it is important that
izing that this is arbitrary. The actual values will depend sentence length be short, lots of sentences be available,
on the test, and will change as overall implant perfor- and the test-retest reliability of the materials be known.
mance improves. The principles we are promoting in-
volve binaural testing and determining the contribution
Speech Perception Testing in Noise. Many realistic lis-
from each ear individually.
tening situations involve background noise, resulting in
Table 1 lists the principles involved when considering
di¤erences in speech perception that are not apparent
monaural implantation. Speech perception tests are
when testing is performed in quiet. Background noise
conducted typically with the patient using hearing aids
can result in a ‘‘floor e¤ect’’ (near 0% correct). There-
testing the right, left, and binaural conditions. The
fore, in some situations a favorable signal-to-noise ratio
results from each individual ear can then be compared
(S/N) is selected individually, or testing that adaptively
and the best monaural condition can be determined. In
varies the S/N is used (Plomp and Mimpen, 1979; Levitt,
addition, the best monaural condition can be compared
1992).
to the binaural condition to determine if there is a bin-
aural advantage. Table 2 lists the principles involved in
Subjective Ratings. Quality ratings measure a more
considering binaural implantation.
global attribute of speech. For example, ratings might
include the ease of listening or clarity of sound. We have
Protocols for Evaluation
developed a quality rating test that includes realistic lis-
This section reviews some of the tests used in the evalu- tening situations. Patients are asked to listen to each
ation process. sound and to rate it on a scale from 0 (unclear) to 100
(clear). Figure 1 shows the results from an adult cochlear
Isolated Words. The presentation of isolated words implant patient comparing a high-rate, roving, n-of-m
(Peterson and Lehiste, 1962; Tillman and Carhart, 1966) strategy (Wilson, 2000) to a SPEAK strategy (Skinner
Table 2. Principles Involved When Considering Binaural Implantation

Binaural Test Results Binaural Implant Decision
If binaural scores are high, and both ears are contributing Do not implant.
If binaural scores are high, and one ear is not Implant poorer ear to improve spatial hearing.
contributing
If binaural scores are medium, and both ears are Implant binaurally.
contributing equally Do not implant (conservative approach).
If binaural scores are medium, one ear is contributing Implant poorer ear, to improve spatial hearing.
more than the other, and best monaural ¼ binaural (no Implant better ear, to improve speech in quiet.
binaural benefit) Implant binaurally, to improve spatial hearing and speech in quiet.
If binaural scores are medium, one ear is contributing Do not implant, to preserve current performance levels.
more than the other, and best monaural < binaural (is Implant poorer ear, to improve binaural benefit and to preserve
binaural benefit) better ear.
Implant better ear, to improve speech in quiet.
Implant binaurally, to improve speech in quiet and spatial hearing.
If binaural scores are medium, one ear is not contributing, Implant poorer ear, to improve spatial hearing and speech in quiet.
and best monaural ¼ binaural (no binaural benefit) Implant binaurally, to improve speech in quiet and spatial hearing.
If binaural scores are poor, with one ear contributing Implant binaurally.
more than the other
If binaural scores are poor, with ears contributing equally Implant binaurally.
Figure 1. Average subjective ratings for

a high rate, roving, n-of-m strategy,
worn bilaterally, compared to bilateral
SPEAK.
Figure 2. Response form from the Audiovisual Feature Test. The

patient hears one of the ten items and must point to the item that
was presented.
Cochlear Implants in Adults: Candidacy 453
Figure 3. Eight-speaker localization test set-up. Speakers are at

15.5. (From Van Hoesel and Clark, 1999.)
et al., 1997). The test was su‰ciently sensitive to show a

strategy preference for all sounds. Information such as
this can be extremely helpful when selecting and mod-
ifying programming options.
Speech Features. It is advantageous to determine what

kinds of speech sounds are perceived to understand
which features are being transmitted by the cochlear
implant, and how to focus rehabilitation. We use a
variety of consonant and vowel tests. The Iowa Medial
Consonant Test (Tyler, Preece, and Lowder, 1983) pre-
sents items in an ‘‘ee/C/ee’’ (13-choice version) or ‘‘aa/
C/aa’’ (16- and 24-choice versions) context, where /C/
represents a variety of consonants (e.g., ‘‘aa/P/aa,’’ ‘‘aa/
M/aa,’’ ‘‘aa/D/aa,’’ etc.).
For adult patients who are poor performers and for
children, the Audiovisual Speech Feature Test is easier
and can be used (Tyler, Fryauf-Bertschy, and Kelsay,
1991). Figure 2 shows the items included in the test.
Spatial Hearing. Another attempt to make testing

more realistic includes measurement of spatially separate
speech and noise and the localization of sounds (Shaw,
1974; Middlebrooks and Green, 1991; Wightman and
Kistler, 1997). More individuals are being fitted with
either binaural implants or have a hearing aid and a
cochlear implant. We typically measure speech from the
front and noise from the right or left. This allows for the
measurement of the ‘‘head shadow’’ e¤ect and a ‘‘bin-
aural squelch e¤ect.’’
Localization is based on interaural time, amplitude,
and spectral di¤erences between ears (Mills, 1972;
Wightman and Kistler, 1992). For binaural cochlear
implant recipients the fine details of this information
may not be available. To test localization, everyday
sounds are presented through 8 loudspeakers. The loud-
speakers are arranged in an arc, the patient is asked to Figure 4. Localization results from one adult bilateral patient
indicate which speaker the sound came from (Fig. 3). (a) Left cochlear implant only; (b) right cochlear implant only;
Figure 4a shows results from one patient who was tested (c) both implants on at the same time).
wearing only the right cochlear implant, Figure 4b only
the left cochlear implant, and Figure 4c with both im-
plants at the same time.
Conclusions Wightman, F. L., and Kistler, D. J. (1992). The dominant role

of low-frequency interaural time di¤erences in sound local-
We have provided a brief overview of some of the more ization. Journal of the Acoustical Society of America, 91,
common issues involved in evaluating adult cochlear 1648–1661.
implant users. The most important task is to determine Wightman, F. L., and Kistler, D. J. (1997). Monaural sound
candidacy. Speech perception tests measuring binaural localization revisited. Journal of the Acoustical Society of
hearing aid benefit are needed to determine either mon- America, 101, 1050–1063.
aural or binaural cochlear implant candidacy. Several Wilson, B. S. (2000). Strategies for representing speech infor-
mation with cochlear implants. In J. K. Niparko, K. I.
di¤erent tests measure a wide range of potential hearing
Kirk, N. K. Mellon, A. M. Robbins, D. L. Tucci, and D. S.
abilities. Wilson (Eds.), Cochlear implants: Principles and practices
—Richard S. Tyler and Shelley Witt (pp. 129–172). Philadelphia: Lippincott.
References Further Readings

Boothroyd, A., Hanin, L., and Hnath, T. (1985). A sentence Dillon, H. (2001). Binaural and bilateral considerations in
test of speech perception: Reliability, set equivalence, and hearing aid fitting. In Hearing aids (pp. 370–403). New
short-term learning (Internal Report RCI 10). New York: York: Thieme.
City University of New York, Speech and Sciences Re- Dorman, M. F. (1993). Speech perception by adults. In R. S.
search Center. Tyler (Ed.), Cochlear implants: Audiological foundations
Levitt, H. (1992). Adaptive procedures for hearing aid pre- (pp. 145–190). San Diego, CA: Singular Publishing Group.
scription and other audiologic applications. Journal of the Kirk, I. K. (2000). Challenges in the clinical investigation of
American Academy of Audiology, 3, 119–131. cochlear implant outcomes. In J. K. Niparko, K. I. Kirk, N.
Middlebrooks, J. C., and Green, D. M. (1991). Sound local- K. Mellon, A. M. Robbins, D. L. Tucci, and D. S. Wilson
ization by human listeners. Annual Review in Psychology, (Eds.), Cochlear implants: Principles and practices (pp. 225–
42, 135–159. 259). Philadelphia: Lippincott.
Mills, A. W. (1972). Auditory localization. In J. W. Tobias
(Ed.), Foundations of Modern Auditory Theory, Vol 2 (pp.
301–348). New York: Academic Press.
Nilsson, M., Soli, S. D., and Sullivan, J. (1994). Development Cochlear Implants in Children
of the Hearing In Noise Test for the measurement of speech
reception thresholds in quiet and in noise. Journal of the
Acoustical Society of America, 95, 1085–1099. The substantial benefit derived from cochlear implants
Peterson, F. E., and Lehiste, I. (1962). Revised CNC lists for by adults (see cochlear implants and cochlear im-
auditory tests. Journal of Speech and Hearing Disorders, plants in adults: candidacy) led to the application
27(1), 62–70. of these devices in children. Unlike adults, however,
Plomp, R., and Mimpen, A. M. (1979). Speech-reception most pediatric candidates acquire their deafness before
threshold for sentences as a function of age and noise.
speech and language are learned (prelingually deafened).
Journal of the Acoustical Society of America, 66, 1333–
1342. Thus, children must depend on an auditory prosthesis
Shaw, E. A. G. (1974). Transformation of sound pressure level to learn the auditory code underlying spoken language—
from the free field to the eardrum in the horizontal plane. a formidable task, given the exquisite temporal and
Journal of the Acoustical Society of America, 56, 1848–1861. frequency-resolving powers of the normal ear. On the
Silverman, S. R., and Hirsh, I. (1955). Problems related to the other hand, young children may be the most successful
use of speech in clinical audiometry. Annals of Otology users of implantable auditory prostheses because of the
Rhinology and Laryngology, 64, 1234–1244. plasticity of the central nervous system.
Skinner, M. W., Holden, L. K., Holden, T. A., Demorest, The challenges faced in determining candidacy in
M. E., and Fourakis, M. S. (1997). Speech recognition at children require balancing the risks of surgery versus the
simulated soft, conversational, and raised-to-loud vocal
e¤orts by adults with cochlear implants. Journal of the
potential benefits of early implantation for the acquisi-
Acoustical Society of America, 101(6), 3766–3782. tion of spoken language. Initially, the use of cochlear
Tillman, T., and Carhart, R. (1966). An expanded test for implants in children was highly controversial. Thus,
speech discrimination utilizing CNC monosyllabic words. candidacy requirements were stringent, and the first
(Northwestern University Auditory Test No. 6 Technical children to receive cochlear implants were older (school-
report No. SAM-TR-66-55). San Antonio, Texas: USAF age or adolescents) and demonstrated no benefit from
School of Aerospace Medicine, Brooks Air Force Base. conventional hearing aids—not even sound awareness—
Tyler, R. S., Fryauf-Bertschy, H., and Kelsay, D. (1991). Au- even after many years of use and rehabilitation. These
diovisual Feature Test for Young Children. Iowa City, IA: children were considered ‘‘ideal’’ cochlear implant can-
The University of Iowa. didates because their hearing could be reliably evaluated
Tyler, R. S., Preece, J. P., and Lowder, M. W. (1983). The
Iowa Cochlear Implant Test Battery. Iowa City, IA: The
and it was obvious that no improvement in their audi-
University of Iowa. tory skills would occur with conventional hearing aids.
Van Hosesel, R. M., and Clark, G. M. (1999). Speech results The first devices used with children were single-
with a bilateral multi-channel cochlear implant subject for channel implants (see cochlear implants). Even though
spatially separated signal and noise. Australian Journal of performance was limited, the children who received
Audiology, 21, 23–28. these devices derived more benefit from their implants
Cochlear Implants in Children 455
than from conventional hearing aids (Thielemeir et al.,

1985; Robbins, Renshaw, and Berry, 1991). A small
percentage of these children achieved remarkable levels
of word recognition through listening alone, although
many of them had early acquired deafness with some
normal auditory experience prior to the onset of their
hearing loss (Berliner et al., 1989). Benefits also were
documented in speech production and language acquisi-
tion (Osberger, Robbins, Berry, et al., 1991). Clearly, the
early pioneering work with single-channel devices dem-
onstrated the safety and e¤ectiveness of implantable au-
ditory prostheses in children and paved the way for the
acceptance of cochlear implants as a medical treatment
for profound deafness.
Eventually children received multichannel cochlear
implants, especially as results indicated superior out-
Figure 2. Mean pre- and postimplant performance on Infant-
comes with these devices compared with single-channel
Toddler Meaningful Auditory Integration scale by age at im-
implants (Osberger, Robbins, Miyamoto, et al., 1991). plant (statistically significant di¤erence between groups after 3
Since that time, numerous research studies have docu- months of implant use).
mented the substantial benefits that children with pro-
found hearing loss obtain from multichannel cochlear
implants (see Kirk, 2000; Waltzman, 2000). Numerous
speech perception tests have been developed to assess Identification of hearing loss at an early age has also
implant candidacy and benefit, even in very young chil- contributed to implantation in children at increasingly
dren (Kirk, 2000; Zimmerman-Phillips, Robbins, and younger ages. Evidence suggests that children receiv-
Osberger, 2000). A finding common to all studies is the ing implants at a younger age achieve higher levels of
long time course over which children acquire auditory, performance with their devices than children receiving
speech, and language skills, even with multichannel implants at an older age (Fryauf-Bertschy et al., 1997;
devices (Tyler et al., 2000) (Fig. 1). This is not unex- Waltzman and Cohen, 1998). Significant di¤erences in
pected, given the number of years required for similar postimplant outcome have been documented in children
skill acquisition by hard-of-hearing children who use who receive implants before age 3 years. Children who
conventional hearing aids. received cochlear implants between ages 12 and 23
With continued clinical experience, improvements in months demonstrated better auditory skills after im-
technology, and documented benefits, cochlear implants plantation than children who received implants between
gained greater acceptance, and candidacy criteria were the ages of 24 and 36 months (Fig. 2) (Osberger et al.,
expanded. Children received implants at increasingly 2002). Thus, a di¤erence of as little as 1 year in age at
younger ages, and it is now common practice to place the time of implantation had a significant impact on
implants in children as young as 2 years, with a growing the rate of auditory skill development in these young
trend for children as young as 12 months of age to re- children.
ceive cochlear implants (Waltzman and Cohen, 1998). Even though the current trend is to provide implants
to children at younger ages, older children continue to
receive cochlear implants (Osberger et al., 1998). Some
of these children have residual hearing and demonstrate
benefit from conventional hearing aids. Implantation is
often delayed because it takes longer to determine
whether a plateau in auditory development has been
reached. In addition, the audiological candidacy criteria
were more stringent when these children were younger,
and thus they were not considered appropriate candi-
dates because they had too much hearing. Over time,
however, audiological criteria in children have been
expanded for implants (Zwolan et al., 1997). Following
implantation, children with preoperative speech percep-
tion abilities demonstrate remarkable auditory recogni-
tion skills and achieve higher levels of performance with
their implants than they did with hearing aids (Fig. 3).
Figure 1. Mean pre- and postimplant scores on phoneme rec- Other factors besides age influence cochlear implant
ognition (Phonetically Balanced-Kindergarten test) achieved benefit in children. Communication method also impacts
by children during Clarion cochlear implant clinical trials the postimplant performance in children. Most studies
(mean age at implant ¼ 5 years). have found that children who use oral communication
cation strategies and conversational skills have also been

reported (Tait, 1993; Nicholas, 1994), and more children
with implants demonstrate higher levels of reading
achievement than reported for their peers with hearing
aids (Spencer, Tomblin, and Gantz, 1999). Nonetheless,
even with marked improvements in performance, chil-
dren with cochlear implants remain delayed in linguistic
development compared to children of the same chrono-
logical age with normal hearing. However, children with
cochlear implants do not continue to fall farther behind
in their language performance, as has been reported for
their profoundly hearing-impaired peers with hearing
aids. As deaf children receive implants at younger ages,
Figure 3. Mean pre- and postimplant performance on two open- the gap between their skills and the skills of their age-
set speech perception tests (Lexical Neighborhood and Multi- matched peers with normal hearing will lessen.
syllabic Neighborhood tests) (recorded administration) and one Speech production skills also improve after implanta-
closed-set test (Early Speech Perception Monosyllable Word tion. Studies have shown improved production of seg-
test) (live-voice administration) (mean age at implant ¼ 9 mental and suprasegmental features of speech and
years).
overall speech intelligibility (Tobey, Geers, and Brenner,
1994; Robbins et al., 1995). Dramatic improvements in
speech production are often apparent after only several
(audition, speaking, lipreading) achieve higher levels of months of implant use, even in very young children
performance with their implants than do children who who had little or no auditory experience prior to im-
are educated using total communication (English-based plantation. In very young children, improvements in
sign language with audition, speaking, lipreading) vocalizations are usually the most noticeable changes
(Meyer et al., 1998). The trend for better implant per- following implantation (Zimmerman-Phillips, Robbins,
formance in children who use oral communication has and Osberger, 2000).
been shown in older children (Fig. 4) as well as in very Cochlear implants are now accepted as an e¤ective
young children (Osberger et al., 2002). This finding treatment for profound deafness. Many profoundly deaf
indicates that oral education programs more e¤ectively children gain access to the auditory and linguistic code
emphasize the use of auditory information provided by of spoken language with these devices, an accomplish-
an implant than do total communication programs. In ment realized by only a limited number of deaf children
fact, since multichannel cochlear implants became avail- with hearing aids. Profoundly deaf children with coch-
able, there has been a dramatic increase in the number of lear implants often function as well as children with less
educational programs that employ oral communication, severe hearing impairments with hearing aids (Booth-
because a greater number of children have the potential royd and Eran, 1994; Meyer et al., 1998). Consequently,
to acquire spoken language through audition. deaf children with implants acquire spoken language
In addition to auditory perceptual benefits, children vicariously through incidental learning, requiring fewer
with cochlear implants show significant improvement in special support services in school. Evidence suggests that
their receptive and expressive language development (see more deaf children who use implants are being main-
Robbins, 2000). Improvements in the use of communi- streamed in regular classrooms than their peers with
hearing aids (Francis et al., 1999). Thus, the long-term
educational costs for children with cochlear implants will
be less than for deaf children with hearing aids, resulting
in a net savings to society. In addition, cochlear implants
have a positive e¤ect on the quality of life of deaf chil-
dren, and have also been found to be a cost-e¤ective
treatment for deafness (Cheng et al., 2000). Whereas the
impact of cochlear implants on educational and voca-
tional achievement will take many years to establish, it is
clear that these devices have changed the lives of many
deaf children and their families.
—Mary Joe Osberger
References
Figure 4. Mean pre- and postimplant performance by commu-
nication mode for older children (mean age at implant ¼ 9 Berliner, K. I., Tonokawa, L. L., Dye, L. M., and House,
years) on the Early Speech Perception Monosyllable Word test) W. F. (1989). Open-set speech recognition in children with a
(live-voice administration) (statistically significant postimplant single-channel cochlear implant. Ear and Hearing, 10, 237–
di¤erences between groups). 242.
Cochlear Implants in Children 457
Boothroyd, A., and Eran, O. (1994). Auditory speech percep- Tobey, E. A., Geers, A., and Brenner, C. (1994). Speech pro-
tion capacity of child implant users expressed as equivalent duction results and speech feature acquisition. Volta Re-
hearing loss. Volta Review, 96, 151–168. view, 96, 109–130.
Cheng, A. K., Rubin, H. R., Powe, N. R., Mellon, N. K., Tyler, R. S., Teagle, H. F. B., Kelsay, D. M. R., Gantz, B. J.,
Francis, H. W., and Niparko, J. K. (2000). Cost-utility Woodworth, G. G., and Parkinson, A. J. (2000). Speech
analysis of the cochlear implant in children. Journal of the perception by prelingually deaf children after six years of
American Medical Association, 284, 850–856. cochlear implant use: E¤ects of age at implantation. Annals
Francis, H. W., Koch, M. E., Wyatt, R., and Niparko, J. K. of Otology, Rhinology, and Laryngology, 109(Suppl. 185),
(1999). Trends in educational placement and cost-benefit 82–84.
considerations in children with cochlear implants. Archives Waltzman, S. B. (2000). Variables a¤ecting speech perception
of Otolaryngology–Head and Neck Surgery, 125, 499–505. in children. In S. B. Waltzman and N. L. Cohen (Eds.),
Fryauf-Bertschy, H., Tyler, R. S., Kelsay, D. M., Gantz, B. J., Cochlear implants (pp. 199–206). New York: Thieme.
and Woodworth, G. G. (1997). Cochlear implant use by Waltzman, S. B., and Cohen, N. L. (1998). Cochlear implan-
prelingually deafened children: The influences of age at im- tation in children younger than 2 years old. American Jour-
plant and length of device use. Journal of Speech, Language, nal of Otology, 19, 158–162.
and Hearing Research, 40, 183–199. Zimmerman-Phillips, S., Robbins, A. M., and Osberger, M. J.
Kirk, K. I. (2000). Challenges in the clinical investigation of (2000). Assessing cochlear implant benefit in very young
cochlear implant outcomes. In J. K. Niparko (Ed.), Coch- children. Annals of Otology, Rhinology, and Laryngology,
lear implants: Principles and practices (pp. 225–258). Balti- 109(Suppl. 185), 42–43.
more: Lippincott Williams and Wilkins. Zwolan, T. A., Zimmerman-Phillips, S., Ashbaugh, C. J.,
Meyer, T. A., Svirsky, M. A., Kirk, K. I., and Miyamoto, Hieber, S. J., Kileny, P. R., and Telian, S. A. (1997).
R. T. (1998). Improvements in speech perception by chil- Cochlear implantation of children with minimal open-set
dren with profound prelingual hearing loss: E¤ects of de- speech recognition. Ear and Hearing, 18, 240–251.
vice, communication mode, and chronological age. Journal
of Speech, Language, and Hearing Research, 41, 846–858. Further Readings
Nicholas, J. (1994). Sensory aid use and the development of
communicative function. Volta Review, 96, 181–198. Brown, C. J., and McDowall, D. W. (1999). Speech production
Osberger, M. J., Fisher, L., Zimmerman-Phillips, S., Geier, L., results in children implanted with the Clarion implant.
and Barker, M. J. (1998). Speech recognition performance Annals of Otology, Rhinology, and Laryngology, 108(Suppl.
of older children with cochlear implants. American Journal 177), 110–112.
of Otology, 19, 152–157. Buss, E., Labadie, R. F., Brown, C. J., Gross, A. J., Grose,
Osberger, M. J., Robbins, A. M., Berry, S., Todd, S., Hesketh, J. H., and Pillsbury, H. C. (2002). Outcome of cochlear
L., and Sedey, A. (1991). Analysis of the spontaneous implantation in pediatric auditory neuropathy. American
speech samples of children with cochlear implants or tactile Journal of Otology and Neurotology, 23, 328–332.
aids. American Journal of Otology, 12(Suppl.), 173–181. Christiansen, J. B., and Leigh, I. W. (2002). Cochlear implants
Osberger, M. J., Robbins, A. M., Miyamoto, R. T., Berry, in children: Ethics and choices. Washington, DC: Gallaudet
S. W., Myres, W. A., Kessler, K. S., and Pope, M. L. University Press.
(1991). Speech perception abilities of children with cochlear Dawson, P. W., Blamey, P. J., Dettman, S. J., Barker, E. J.,
implants, tactile aids, or hearing aids. American Journal of and Clark, G. M. (1995). A clinical report on receptive vo-
Otology, 12(Suppl.), 105–115. cabulary skills in cochlear implant users. Ear and Hearing,
Osberger, M. J., Zimmerman-Phillips, S., and Koch, D. B. 16, 287–294.
(2002). Cochlear implant candidacy and performance trends Geers, A. E., and Moog, J. S. (1989). Evaluating speech per-
in children. Annals of Otology, Rhinology, and Laryngology, ception skills: Tools for measuring benefits of cochlear
111(Suppl. 189), 62–65. implants, tactile aids, and hearing aids. In E. Owens and
Robbins, A. M. (2000). Rehabilitation after cochlear implan- D. K. Kessler (Eds.), Cochlear implants in young deaf chil-
tation. In J. K. Niparko (Ed.), Cochlear Implants: Princi- dren. Boston: Little, Brown.
ples and Practices (pp. 323–364). Baltimore: Lippincott Kirk, K. I., Pisoni, D. B., and Osberger, M. J. (1995). Lexical
Williams and Wilkins. e¤ects on spoken word recognition by pediatric cochlear
Robbins, A. M., Kirk, K. I., Osberger, M. J., and Ertmer, D. implant users. Ear and Hearing, 16, 470–481.
(1995). Speech intelligibility of implanted children. Annals Nevins, M. E., and Chute, P. M. (1996). Children with cochlear
of Otology, Rhinology, and Laryngology, 104(Suppl. 166), implants in educational settings. San Diego, CA: Singular
399–401. Publishing Group.
Robbins, A. M., Renshaw, J. J., and Berry, S. W. (1991). Nikparko, J. K. (Ed.). (2000). Cochlear implants: Princi-
Evaluating meaningful auditory integration in profoundly ples and practices. Baltimore: Lippincott Williams and
hearing-impaired children. American Journal of Otology, Wilkins.
12(Suppl.), 151–164. Osberger, M. J., and Fisher, L. (2000). Preoperative predictors
Spencer, L., Tomblin, J., and Gantz, B. J. (1999). Reading of postoperative implant performance in children. Annals of
skills in children with multichannel cochlear-implant expe- Otology, Rhinology, and Laryngology, 109(Suppl. 185), 44–
rience. Volta Review, 99, 193–202. 46.
Tait, D. M. (1993). Video analysis: A method of assessing Robbins, A. M., Green, J., and Bollard, P. M. (2000). Lan-
changes in preverbal and early linguistic communica- guage development in children following one year of
tion after cochlear implantation. Ear and Hearing, 14, 378– Clarion implant use. Annals of Otology, Rhinology, and
389. Laryngology, 109(Suppl. 105), 94–95.
Thielemeir, M., Tonokawa, L. L., Peterson, B., and Eisenberg, Robbins, A. M., Svirsky, M., and Kirk, K. I. (1997). Children
L. S. (1985). Audiological results in children with a cochlear with implants can speak, but can they communicate? Oto-
implant. Ear and Hearing, 6(Suppl.), 27S–35S. laryngology–Head and Neck Surgery, 117, 155–160.
Ryugo, D. K., Limb, C. J., and Redd, E. E. (2000). Brain Distinction Between a True Ear Advantage and
plasticity: The impact of the environment as it relates to an Observed Ear Advantage
hearing and deafness. In J. K. Niparko (Ed.), Cochlear im-
plants: Principles and practices (pp. 33–56). Baltimore: A true ear advantage is thought to reflect di¤erences in
Lippincott Williams and Wilkins. the transmission capacities of the auditory channels from
Tobey, E. A., and Hasenstab, S. (1991). E¤ects of Nucleus the RE and LE to a common, centrally located process-
multichannel cochlear implant upon speech production in ing area that, for speech, is located in the left hemi-
children. Ear and Hearing, 12(Suppl.), 48S–54S. sphere. An observed ear advantage may arise from at
Waltzman, S. B., and Cohen, N. L. (2000). Cochlear implants.
New York: Thieme. least four sources: a true ear advantage, decision varia-
Waltzman, S. B., Scalchunes, V., and Cohen, N. L. (2000). bles, stimulus variables, and measurement error (Speaks,
Performance of multiply handicapped children using coch- Niccum, and Carney, 1982). Proper counterbalancing of
lear implants. American Journal of Otology, 21, 329–335. experimental conditions can control stimulus variables,
measurement error can be minimized by presenting a
su‰cient number of listening trials (Repp, 1977; Speaks,
Niccum, and Carney, 1982), and the role of decision
Dichotic Listening variables will be addressed subsequently.
Dichotic listening refers to listening to two di¤erent sig-

Optional Psychophysical Procedures
nals presented simultaneously through earphones, one
signal to the left ear (LE) and a di¤erent signal to the Kimura (1961) used a recall task. Listeners received
right ear (RE). Although the results have been expressed three pairs of spoken digits and were asked to recall as
in di¤erent ways, the most common approach is to cal- many of the six digits as possible. A second procedure—
culate %RE , %LE , and the di¤erence score (%RE %LE ). the most commonly used—involves a two-ear recogni-
The di¤erence score describes the percentage ear advan- tion task. One pair of signals is presented, the listener is
tage and may be a right ear advantage (REA), left ear to attend ‘‘equally’’ to both the RE and LE, and the lis-
advantage (LEA), or no ear advantage (NoEA). tener provides two responses from a closed message set
Dichotic listening is a psychophysical process, the (Studdert-Kennedy and Shankweiler, 1970; Berlin et al.,
testing of which is used to assess certain aspects of cen- 1972; Speaks, Niccum, and Carney, 1982). In a variation
tral auditory function. The outcomes of experiments of the two-ear recognition task, the listener attends to
have led to the development of ear–brain hypotheses; an both ears but provides only one response (Repp, 1977).
REA is accepted as evidence of left hemispheric domi- A third procedure employs an ear-monitoring task in
nance for processing and an LEA as evidence of right which two signals are presented but the listener is asked
hemispheric dominance. A NoEA is sometimes inter- to attend selectively to one ear and supply only one re-
preted to mean that brain dominance has not been well sponse (Hayden, Kirsten, and Singh, 1979). Although
established (Gerber and Goldman, 1971). those three tasks can be used with speech signals, their
When the signals are speech (usually consonant-vowel use for nonspeech signals is more problematic because
[CV] nonsense syllables), the commonly reported out- the listener is required to both recall and name the sig-
come is an REA, and interpretation of the REA in rela- nals heard.
tion to left hemispheric dominance has been based on A fourth technique applies the theory of signal detec-
four assumptions: (1) ipsilateral auditory pathways are tion (TSD) and involves a yes/no target-monitoring task
suppressed during dichotic stimulation (Milner, Taylor, (Katsuki et al., 1984). The TSD approach can be used
and Sperry, 1968); (2) information from each ear arrives with either speech or nonspeech signals, but the details
at the contralateral hemisphere in equivalent states will be described for speech signals. The message set
(Studdert-Kennedy and Shankweiler, 1970); (3) the left consists of six CV nonsense syllables. On a given test
hemisphere, which is language dominant for at least 95% block of 40 dichotic trials, one syllable is designated the
of the right-handed population and about 70% of the target. Contralateral interference is provided by the
left-handed population (Penfield and Roberts, 1959; other five syllables, each appearing equally often. The
Annett, 1975; Rasmussen and Milner, 1977), is princi- target is present on only half (20) of the trials within a
pally responsible for extracting phonetic information listening block; hence the a priori probability of target-
from the di¤erent signals presented to the RE and LE present trials is 0.50. The listener attends to a designated
(Studdert-Kennedy and Shankweiler, 1970; Studdert- ear and responds ‘‘yes’’ to vote that the target was pres-
Kennedy, Shankweiler, and Pisoni, 1972); and (4) the ent or ‘‘no’’ to vote that the target was not present. Over
lower LE score implies ‘‘loss of information’’ during the course of six listening blocks, each of the six syllables
interhemispheric transmission from the right hemisphere serves as the target, and contralateral interference is
to the left hemisphere via the corpus callosum (Studdert- provided by the other five syllables. The scores for
Kennedy and Shankweiler, 1970; Studdert-Kennedy, each ear are expressed by P(C)max, a d 0 -based statistic,
Shankweiler, and Pisoni, 1972; Berlin et al., 1973; the ear advantage is given by P(C)maxRE P(C)maxLE ,
Brady-Wood and Shankweiler, 1973; Cullen et al., 1974; and listener criterion is expressed by b. The e¤ects of
Repp, 1976). Each assumption is critical to the validity decision variables on the outcome of the yes/no target-
of an ear–brain hypothesis. monitoring task are minimized because both hit and
Dichotic Listening 459
false alarm responses are incorporated in the calculation Reliability of Ear Advantage Scores, d
of P(C)max.
The reliability of ear-advantage scores has been reported
principally for the simple ear-di¤erence score, d. Ryan
Choice of Metric and McNeil (1974) tested listeners with two blocks of 30
When the task is yes/no target monitoring, the ear CV nonsense syllables per block and reported a correla-
advantage should be expressed by P(C)maxRE tion between blocks of þ0.80. That outcome compares
P(C)maxLE . For the commonly used two-ear recognition favorably with the test-retest correlation of þ0.74 re-
task, several metrics have been proposed, which will be ported by Blumstein, Goodglass, and Tartter (1975).
presented here in the context of their dependence on These authors emphasized, however, that the correlation
performance level, Po , where Po ¼ (%RE þ %LE )=2. (1) d coe‰cient might not be sensitive to reversals in direc-
is the simple di¤erence score, %RE %LE , and d is tion of the advantage between test and retest: an REA
unconstrained by performance level only when Po ¼ on one block and an LEA on a second block. In fact,
50%. (2) Because performance level imposes a ceiling reversals occurred for 29% of their listeners. Pizzamiglio,
on d, Halwes (1969) proposed an ‘‘index of DePascalis, and Vignati (1974) reported a similar out-
laterality,’’ which will be symbolized as Li . Li ¼ come: 30% of their listeners who were tested with digits
[(R L)=(R þ L)]100, and the analysis is confined to evidenced a reversal in direction of the ear advantage.
those trials in which only one response is correct. (3) Other investigators have reported test-retest correlations
POC reflects the percentage of correct responses of þ0.64 (Catlin, Van Derveer, and Teicher, 1976) and
(Harshman and Krashen, 1972). (4) POE reflects the þ0.66 (Speaks and Niccum, 1977).
percentage of erroneous responses (Harshman and Repp (1977) contended that the poor reliability likely
Krashen, 1972). (5–6) POC 0 and POE 0 are linear trans- was due in part to an insu‰cient number of trials. He
formations of POC and POE (Repp, 1977). POC 0 is applied the Spearman-Brown predictive formula to the
unconstrained when Po < 50%, and POE 0 is uncon- Blumstein, Goodglass, and Tartter (1975) data and esti-
strained when Po > 50%. (7) f is the geometric mean of mated that a reliability coe‰cient of þ0.90 should be
POC 0 and POE 0 (Kuhn, 1973) and is unconstrained realized with a 240-trial test (eight blocks of 30 pairs of
only when Po ¼ 50%. (8) e is a disjunctive use of POC 0 syllables per block). Speaks, Niccum, and Carney (1982)
and POE 0 , meaning that e ¼ POC 0 when Po < 50% provided empirical support for Repp’s prediction. They
and e ¼ POE 0 when Po > 50%. Thus, e is independent tested 24 listeners with 20 blocks of 30 pairs of CV syl-
of Po . lables per block. The split-half reliability coe‰cient was
The maximum value of the various indices can be only þ0.62 when scores for block 1 were compared with
constrained by Po , but the dependence apparently is not scores for block 2. The coe‰cient improved to þ0.91,
particularly strong. Speaks, Niccum, and Carney (1982) however, when scores for blocks 1 through 6 were com-
reported that the correlations of the various metrics with pared with scores for blocks 7 through 12. Moreover, the
Po ranged from 0.18 to þ0.17. Moreover, all inter- standard error of measurement diminished from 13.09
correlations among the metrics were b0.95. for the block 1–2 comparison to 5.66 for the block 1–6
What, then, is the metric of choice? We (Speaks, and 7–12 comparison. They concluded that use of fewer
Niccum, and Carney, 1982) have argued that the ideal than 180 listening trials is likely to generate unreliable
metric should reflect two properties. One is the propor- data.
tion of all trials on which an ear advantage occurs,
which is the ratio of single-correct (SC) trials to total Properties of the Ear Advantage
trials: p(SC) ¼ SC=(SC þ DC þ DE), where DC refers
An examination of scores of reports on the dichotic ear
to double-correct trials and DE refers to double-error
advantage with CV nonsense syllables shows one fairly
trials. The second property is the magnitude of ear ad-
consistent result. The mean ear advantage for a group of
vantage, p(EAmag ), on those trials in which an ear ad-
listeners is almost always an REA with a magnitude in
vantage occurs, which is the ratio of the di¤erence
the range of 5%–12%. In most instances, however, there
between single-correct trials for the right ear and single-
have been an insu‰cient number of listening trials to
correct trials for the left ear to total single-correct trials:
permit a more detailed analysis of relevant statistical
p(EAmag ) ¼ (SCRE SCLE )=SC, and p(EAmag ) ¼ Li . If
properties. We will refer principally then to an experi-
we assume that the two properties are of equal impor-
ment reported by Speaks, Niccum, and Carney (1982)
tance, we can derive a single metric for the ear advan- on 24 listeners with the two-ear recognition task. Each
tage by computing the product of p(SC) and p(EAmag ),
listener received 20 blocks of 30 pairs of CV nonsense
and that weighted value ¼ d, the simple ear-di¤erence
syllables per block (600 dichotic trials per listener). Thus,
score. Finally, if we assume that scores for each ear and the properties of interest were derived from 1200 re-
the ear advantage are normally distributed for the indi-
sponses for each listener and from 28,800 responses (24
vidual listener, the utility of d can be improved by a z-
listeners 1200 per listener) for the group. We will note
score transformation, a d 0 -like statistic that incorporates
only a few of the more salient properties.
variability of the ear advantage for the individual lis-
tener as an error term: dZ ¼ (%RE %LE )=s, where 1. Mean percent scores for the group were 71.7% for the
2
s ¼ [(sRE 2
þ sLE )=2] 1=2 . RE and 65.1% for the LE, and the ear advantage, d,
was therefore þ6.6% (an REA). The RE and LE nificant (McNemar’s w 2 for correlated proportions) at
scores for the 20 listening blocks presented to the 24 the 0.05 level or less for 12 listeners (50%). Three (12.5%)
listeners were plotted as cumulative percentage dis- of the 24 listeners had a significant LEA. Katsuki et al.
tributions and fitted with normal integrals. That (1984) tested 20 listeners with a yes/no target-monitoring
analysis suggested that the distributions of recogni- task and the ear advantage for individual listeners was
tion scores for the two ears could be conceptualized tested on transformations (F ¼ [2 arcsin P(C)max] 1=2 ) on
as two overlapping normal curves with di¤erent P(C)max for each ear. Thirteen (65%) of 20 listeners had
means (RE ¼ 71.7%, LE ¼ 65.1%) but equal var- a significant REA, and two (10%) had a significant LEA.
iances (sRE ¼ 8.9%, sLE ¼ 8.8%). Thus, a true ear A similar outcome was reported by Wexler, Halwes,
advantage equals the di¤erence between the means of and Heninger (1981). They tested 31 listeners and found
the RE and LE probability density functions. that 14 (45%) had a significant REA and one (3%) had a
2. The block-to-block measures of the ear advantage significant LEA. They, however, placed a di¤erent in-
also were distributed as a normal curve with a mean terpretation on their data by disregarding the fact that
of 6.6% and a mean intralistener standard deviation 16 listeners did not have a significant ear advantage and
of 10.8% G 2.5%, where the mean sigma was calcu- emphasizing that 14 (93%) of the 15 listeners who had a
lated as the square root of the mean of the variances. significant ear advantage had a significant REA. To jus-
3. The mean/sigma ratio for the group of listeners was tify dismissing the 16 listeners with no ear advantage
very small: 6.6%/10.7% ¼ 0.62. Thus, the typical lis- from their analysis assumes that processing for speech
tener with a small ear advantage, on the order of must always be lateralized to either the left or the right
6.6%, should be expected to evidence block-to-block hemisphere and that failure to obtain an observed ear
reversals in the direction of ear advantage such as advantage must be due principally to measurement
those that have been reported in the literature. The error. They suggested that increasing the length of the
relation between the size of the mean/sigma ratio and dichotic test might reduce measurement error and lead
reversals in ear advantage can be illustrated by results to a larger number of listeners having a significant REA.
obtained for three of the 24 listeners. For L-1, d ¼ But, as we have seen, even with 20 listening blocks
23.3% (REA), s ¼ 9.7%, and X=s ¼ 2.40. An REA (Speaks, Niccum, and Carney, 1982), the mean ear ad-
occurred on all 20 listening blocks. For L-2, d ¼ vantage for a group of 24 listeners was only 6.6%
26.5% (LEA), s ¼ 10.1%, and X=s ¼ 2.62. An (REA), and only 12 of 24 listeners had a significant
LEA occurred on 19 of the 20 listening blocks, and REA. We acknowledge that most, perhaps 80% or so,
a NoEA was observed on one block. For L-3, listeners who have a dichotic ear advantage have an
d ¼ 0.5% (a nonsignificant REA), s ¼ 10.7%, and REA rather than an LEA. From a di¤erent perspective,
X=s ¼ 0.05. Of the 20 listening blocks, 4 were LEA, however, no more than two-thirds of listeners tested ap-
10 were NoEA, and 6 were REA. pear to have an REA. Because a large proportion of the
right-handed population is known to be left hemispheric
How Should the Dichotic Ear Advantage Be dominant for speech and language processing, but a
much smaller proportion evidence an REA, we believe
Interpreted? that speculations about the neurological bases for
An REA is widely accepted as evidence of left hemi- listener responses to dichotic stimulation are at best
spheric dominance for processing, and an LEA is viewed fragile.
as evidence of right hemispheric dominance for process-
—Charles Speaks
ing. The REA commonly reported for speech is thought
to arise from ‘‘loss of information’’ during interhemi-
spheric transmission from the right hemisphere to the References
left hemisphere via the corpus callosum.
Annett, M. (1975). Hand preference and the laterality of cere-
There is general agreement that about 95% of the
bral speech. Cortex, 11, 305–328.
right-handed population (and perhaps 70% of the left- Berlin, C., Lowe-Bell, S., Cullen, J., Thompson, C., and Loo-
handed population) is left hemispheric dominant for vis, C. (1973). Dichotic speech perception: An interpretation
the processing of speech and language (Penfield and of right-ear advantage and temporal o¤set e¤ects. Journal
Roberts, 1959; Geschwind and Levitsky, 1968; Annett, of the Acoustical Society of America, 53, 699–709.
1975). If the direction of ear advantage reflects the di- Berlin, C., Lowe-Bell, S., Cullen, J., Thompson, C., and Staf-
rection of hemispheric laterality, it seems reasonable to ford, M. (1972). Is speech ‘‘special?’’ Perhaps the temporal
assume that something approaching 95% of the right- lobectomy patient can tell us. Journal of the Acoustical So-
handed population should have an REA for speech sig- ciety of America, 52, 702–705.
nals presented dichotically. No outcome approaching Blumstein, S., Goodglass, H., and Tartter, V. (1975). The re-
liability of ear advantage in dichotic listening. Brain and
95% has been reported, and unfortunately, the signifi-
Language, 2, 226–236.
cance of the observed ear advantage for individual lis- Brady-Wood, S., and Shankweiler, D. (1973). E¤ects of am-
teners has rarely been tested statistically. In the Speaks, plitude variation on an auditory rivalry task: Implications
Niccum, and Carney (1982) experiment in which the concerning the mechanisms of perceptual asymmetries.
two-ear recognition task was used with 24 listeners, 18 Haskins Laboratory Status Reports on Speech Research, 34,
had an observed REA, but the advantage was only sig- 119–126.
Electrocochleography 461
Catlin, J., Van Derveer, N. J., and Teicher, R. (1976). Mon- dence from a dichotic study. Cognitive Psychology, 3, 455–
aural right-ear advantage in a target-identification task. 466.
Brain and Language, 3, 470–481. Wexler, E., Halwes, T., and Heninger, G. (1981). Use of a
Cullen, J., Thompson, C., Hughes, L., Berlin, C., and Samson, statistical significance criterion in drawing inferences about
D. (1974). The e¤ects of varied acoustic parameters on per- hemispheric dominance for language function from dichotic
formance in dichotic speech tasks. Brain and Language, 1, listening data. Brain and Language, 13, 13–18.
307–322.
Gerber, S. J., and Goldman, P. (1971). Ear preference for
dichotically presented verbal materials as a function of re-
port strategies. Journal of the Acoustical Society of America, Electrocochleography
49, 1163–1168.
Geschwind, N., and Levitsky, W. (1968). Human brain: left-
right asymmetries in temporal speech region. Science, 161, Electrocochleography (ECochG) refers to the general
186–187. method of recording the stimulus-related potentials
Halwes, T. (1969). E¤ects of dichotic fusion in the perception of of the cochlea and auditory nerve. The product of
speech. Doctoral dissertation, University of Minnesota, ECochG—the electrocochleogram, or ECochGm—is
Minneapolis. shown in Figure 1. As depicted in this figure, the com-
Harshman, R., and Krashen, S. (1972). An ‘‘unbiased’’ proce- ponents of interest may include the cochlear micro-
dure for comparing degree of lateralization of dichotically phonic (CM), cochlear summating potential (SP), and
presented stimuli. UCLA Working Papers on Phonetics, 23, auditory nerve action potential (AP). Detailed descrip-
3–12.
tions of these electrical events are abundant in the hear-
Hayden, M., Kirsten, E., and Singh, S. (1979). Role of dis-
tinctive features in dichotic perception of 21 English con- ing science literature and are beyond the scope of this
sonants. Journal of the Acoustical Society of America, 65, review. For a more thorough discussion of the history of
1039–1046. these potentials as recorded in humans, see Ferraro
Katsuki, J., Speaks, C., Penner, S., and Bilger, R. (1984). Ap- (2000).
plication of the theory of signal detection to dichotic listen- The popularity of ECochG as a clinical tool emerged
ing. Journal of Speech and Hearing Research, 27, 444–448. in the early 1970s, following the discovery and applica-
Kimura, D. (1961). Some e¤ects of temporal-lobe damage on tion of the auditory brainstem response (ABR). The
auditory perception. Canadian Journal of Psychology, 15, development and refinement of noninvasive recording
156–165.
Kuhn, G. M. (1973). The phi coe‰cient as an index of ear
di¤erences in dichotic listening. Cortex, 9, 447–457.
Milner, B., Taylor, S., and Sperry, R. W. (1968). Lateralized
suppression of dichotically presented digits after commissu-
ral section in man. Science, 161, 184–185.
Penfield, W., and Roberts, L. (1959). Speech and brain mecha-
nisms. Princeton, NJ: Princeton University Press.
Pizzamiglio, L., DePascalis, C., and Vignati, A. (1974). Stabil-
ity of dichotic listening test. Cortex, 10, 203–205.
Rasmussen, T., and Milner, B. (1977). The role of early left-
brain injury in determining lateralization of cerebral speech
functions. In S. J. Diamond and D. A. Blizard (Eds.), Evo-
lution and lateralization of the brain. Annals of the New York
Academy of Science, 299, 355–369. [Special issue]
Repp, B. (1976). Dichotic ‘‘masking’’ of voice onset
time. Journal of the Acoustical Society of America, 59, 183–
194.
Repp, B. (1977). Measuring laterality e¤ects in dichotic listen-
ing. Journal of the Acoustical Society of America, 42, 720–
737.
Ryan, W., and McNeil, M. (1974). Listener reliability for a
dichotic task. Journal of the Acoustical Society of America,
56, 1922–1923.
Speaks, C., and Niccum, N. (1977). Variability of the ear
advantage in dichotic listening. Journal of the American
Audiological Society, 3, 52–57.
Speaks, C., Niccum, N., and Carney, E. (1982). Statistical Figure 1. Components of the click-evoked human electro-
properties of responses to dichotic listening with CV non- cochleogram. Top tracings display responses to rarefaction (R)
sense syllables. Journal of the Acoustical Society of America, and condensation (C) polarity clicks. Adding separate R and
72, 1185–1194. C responses (middle tracing) enhances the cochlear Summat-
Studdert-Kennedy, M., and Shankweiler, D. (1970). Hemi- ing Potential (SP) and auditory nerve Action Potential (AP).
spheric specialization for speech perception. Journal of the Subtracting R and C responses (bottom tracing), enhances
Acoustical Society of America, 48, 579–594. the Cochlear Microphonic (CM). (From American Speech-
Studdert-Kennedy, M., Shankweiler, D., and Pisoni, D. (1972). Language-Hearing Association, 1988, p. 9, based on data from
Auditory and phonetic processes in speech perception: Evi- Coats, 1981.)
techniques also has facilitated the current clinical use of

ECochG.
The technical capability to record cochlear and audi-
tory nerve potentials in humans has led to a variety of
clinical applications for ECochG. Among the more
popular applications are
1. To diagnose, assess, and monitor Ménière’s disease/
endolymphatic hydrops (MD/ELH) and to assess
and monitor treatment strategies for these disorders
2. To enhance wave I of the ABR Figure 2. Construction of a tympanic membrane electrode
3. To monitor cochlear and auditory nerve function (foam rubber tip can be replaced with soft cotton). (From
during operations that involve the auditory periphery Ferraro, 2000, p. 434.)
(Ruth, Lambert, and Ferraro, 1988; Ferraro and
Krishnan, 1997).
Table 1. ECochG Recording Parameters
ECochG Recording Techniques Electrode Array
Primary (þ) Tympanic membrane
Transtympanic Versus Extratympanic ECochG. The Secondary () Contralateral earlobe or mastoid
terms transtympanic (TT) and extratympanic (ET) refer process
to the two general techniques for recording ECochG. Common Nasion or ipsilateral earlobe
The TT approach involves passing a needle electrode
Signal Averaging Settings
through the TM to rest on the cochlear promontory,
whereas ET recordings are generally made from the sur- Timebase 10 milliseconds
Amplification Factor 50,000–100,000 (Extratympanic—
face of the ear canal or TM. The primary advantage of
ET)
TT ECochG is that this ‘‘near-field’’ approach produces Filter Bandpass 5 Hz–3000 Hz
large components with relatively little signal averaging. Repetitions 750–1000
The major limitation of TT ECochG is that it is invasive.
ET recordings require more signal averaging, and the Stimuli
components tend to be smaller in magnitude than their Type Broadband Clicks (BBC), Tonebursts
TT counterparts. However, this approach is generally (TB)
painless and can be performed by nonphysicians in non- Duration (BBC) 100 microsecond electrical pulse
Envelope (TB) 2 millisecond linear rise/fall, 5–10 msec
medical settings.
plateau
TM o¤ers a good compromise between ear canal and Polarity Rarefaction and Condensation (BBC),
TT recording sites with respect to component magni- Alternating (TB)
tudes and signal averaging time without being invasive Repetition Rate 11.3/sec
or painful (Lambert and Ruth, 1988; Ferraro, Black- Level 95–85 dB HL (125–115 dB pe SPL)
well, et al., 1994; Ferraro, Thedinger, et al., 1994). In
addition, the waveform patterns that lead to the inter-
pretation of the TT ECochGm are preserved in TM
recordings (Ferraro, Thedinger, et al., 1994).
Figure 2 is a drawing of the TM electrode (or Stimulus Considerations. The broadband click tends to
‘‘tymptrode’’) used in our clinic and laboratory, which is be the most popular stimulus for short-latency AEPs
a modification of the device described several years ago because it excites synchronous discharges from a large
by Stypulkowski and Staller (1987). Details regarding population of neurons to produce well-defined neural
the fabrication and placement of the tymptrode can be components. However, the brevity of the click makes it a
found in Ferraro (1997, 2000). less than ideal stimulus for studying cochlear potentials
such as the CM and SP, whose durations are stimulus
dependent. The use of tonal stimuli can overcome some
Recording Parameters. ECochG components occur of these limitations, and also provide for a higher degree
within the first few milliseconds of electrophysiological of response frequency specificity than clicks (Durrant
activity following stimulus onset. Table 1 illustrates the and Ferraro, 1991; Ferraro, Blackwell, et al., 1994; Fer-
parameters used in our laboratory and clinic for record- raro, Thedinger, et al., 1994; Margolis et al., 1995).
ing the SP and AP together, which is usually the pattern Stimulus polarity is an important factor for ECochG.
of interest when ECochG is used in the diagnosis of Presenting clicks or tonebursts in alternating polarity
MD/ELH. It is important to note that the bandpass inhibits the presence of stimulus artifact and CM, which
setting of the analog filter of the preamplifier must be are both dependent on stimulus phase. Thus, alternating-
wide enough to accommodate the recording of both di- polarity stimuli may be preferable when the amplitudes
rect and alternating current potentials (i.e., the SP and of the SP and AP are of interest (as in the determination
AP, respectively). of the SP/AP amplitude ratio for the diagnosis of MD/
Figure 3. Normal electrocochleogram recorded from the tym-

panic membrane to clicks presented in alternating polarity at
80 dB HL. The amplitudes of the Summating Potential (SP)
and Action Potential (AP) can be measured from peak-to-peak Figure 4. Normal electrocochleogram recorded from the tym-
(left panel), or with reference to a baseline value (right panel). panic membrane to a 2000-Hz toneburst presented in al-
Amplitude/time scale is 1.25 mV/1 msec per gradation. Insert ternating polarity at 80 dB HL. Action Potential (AP) and
phone delay is 0.90 msec. (From Ferraro, 2000, p. 435.) its first negative peak (N1 ) are seen at the onset of the re-
sponse. Summating Potential (SP) persists as long as the stim-
ulus. SP amplitude is measured at midpoint of response
(point B), with reference to a baseline value (point A). Ampli-
ELH). Recording separate responses to condensation tude (microvolts)/time (milliseconds) scale at lower right.
and rarefaction clicks also is useful, as certain subjects (From Ferraro, Blackwell, et al., 1994, p. 19.)
with MD/ELH display abnormal AP latency di¤erences
to clicks of opposing polarity (Margolis et al., 1992;
Margolis et al., 1995; Orchik, Ge, and Shea, 1997; Sass, quencies recorded from both the TM and promontory
Densert, and Arlinger, 1997). (TT) of the same patient. An important aspect illus-
When ECochG is performed to help diagnose MD/ trated in this figure is that the amplitudes of toneburst-
ELH, stimulus presentation should begin at a level near SPs in normal-hearing subjects are generally negative
the maximum output of the stimulus generator to evoke in regard to baseline amplitude, and are very small.
a well-defined SP-AP complex. Masking of the con- Another noteworthy aspect of Figure 5 is that although
tralateral ear is not a concern for conventional ECochG the amplitudes of the TM responses are approximately
since the magnitude of any electrophysiological response ¼ that of the promontory responses (note amplitude
from the nontest ear is very small and ECochG compo- scales), the corresponding patterns of the TM and TT
nents are generated prior to crossover of the auditory recordings at each frequency are virtually identical.
pathway.
Clinical Applications
Interpretation of the ECochGm
Figure 3 depicts a normal ECochGm to click stimuli MD/ELH. As mentioned earlier, ECochG has emerged
recorded from the TM. Component amplitudes can be as one of the more powerful tools in the diagnosis,
measured from peak to peak (left panel) or using a assessment, and monitoring of MD/ELH, primarily
baseline reference (right panel). AP-N1 latency is mea- through the measurement of the SP and AP. Examples
sured from stimulus onset to the peak of N1 and should
be identical to the latency of ABR wave I at the same
stimulus level. When using a tubal insert transducer
(highly recommended), these values will be delayed by a
factor proportional to the length of the tubing. Although
labeled in Figure 3, N2 has received little interest for
ECochG applications.
Also as shown in Figure 3, SP and AP amplitudes are
made from the leading edge of both components. The
resultant values are used to derive the SP/AP amplitude
ratio, which ranges from approximately 0.1 to 0.5 in
normal subjects.
Figure 4 depicts a normal ECochGm evoked by a
2000-Hz toneburst. As opposed to click-evoked re-
sponses, where the SP normally appears as a small
shoulder preceding the AP, the SP to tonebursts persists
Figure 5. Tympanic Membrane (TM) recorded electrocochleo-
as long as the stimulus. The AP and its N1 in turn are grams evoked by tonebursts of di¤erent frequencies presented
seen at the onset of the response. SP amplitude is mea- at 80 dB HL. Stimulus frequency in kilohertz indicated at the
sured with reference to baseline amplitude, and at the right of each waveform. Amplitude (microvolts)/time (milli-
midpoint of the waveform to minimize the influence of seconds) scale at lower right. (From Ferraro, Blackwell, et al.,
the AP. Figure 5 illustrates toneburst SPs at several fre- 1994, p. 20.)
Figure 7. Comparison of electrocochleograms recorded from

the tympanic membrane between the a¤ected (right) and un-
a¤ected (left) sides of a patient with endolymphatic hydrops.
The shaded areas include the AP and SP components. SP am-
plitude is measured at point B with respect to point and is ab-
normally enlarged on the a¤ected side. Arrows indicate the
AP-N1. Stimulus was a 1000-Hz toneburst (2 msec rise/fall,
Figure 6. Abnormal responses to clicks recorded from the 10 msec plateau) presented at 90 db HL. (From Ferraro, 1993,
promontory (TT) (upper panel) and tympanic membrane p. 37.)
(TM) (lower panel) of the a¤ected ear of the same patient.
Both TT and TM responses display an enlarged Summating
Potential (SP)/Action Potential (AP) amplitude ratio. ‘‘Base’’
indicates reference for SP and AP amplitude measurements.
Amplitude (microvolts)/time (milliseconds) scale at lower right. rarefaction clicks (Margolis et al., 1992, 1995), and
Stimulus onset delayed by approximately 2 msec. (From Fer- measuring the respective ‘‘areas’’ of the SP and AP to
raro, Thedinger, et al., 1994, p. 27.) derive the SP/AP area ratio (Ferraro and Tibbils, 1999).
Enhancement of Wave I. In a high percentage of hard-

of-hearing subjects, including those with acoustic
of this application are shown in Figures 6 (click-evoked tumors, wave I of the ABR may be unrecordable in the
ECochGms), and 7 (toneburst-evoked ECochGms). The presence of wave V (Hyde and Blair, 1981; Cashman
upper tracings in Figure 6 were measured from the and Rossman, 1983). This situation precludes the mea-
promontory (TT), whereas the lower waveforms repre- surement of the I–V and I–III interwave intervals, which
sent TM recordings. The SP/AP amplitude ratios are are key features of the ABR for neurodiagnostic ap-
enlarged under each condition. The rationale for this plications. Under these and other less than optimal
finding remains unclear, but may relate to the nature of recording conditions, using an ECochG approach for
the SP as a distortion product of transduction processes recording the ABR has considerable utility (Ferraro and
in the cochlea. In particular, ELH may augment this Ferguson, 1989). Figure 8 exemplifies this application in
distortion and thus increase the amplitude of the SP. a patient with hearing loss. The top tracing represents
Enlarged SP/AP amplitude ratios also have been re- the ABR recorded with surface electrodes at the vertex
ported for perilymphatic fistulas (Ackley, Ferraro, and and ear lobes, and wave I is absent in the presence of
Arenberg, 1994), which suggests that the fluid pressure wave V. When the TM is used as a recording site, how-
of the scala media may be the underlying feature to ever (bottom tracing), wave I is clearly present, permit-
which ECochG is specific. ting the measurement of the I–V interwave interval.
Figure 7 illustrates the di¤erence between right and
left SPs evoked by 2000-Hz tonebursts in a patient with Intraoperative Monitoring. Intraoperative monitoring
MD/ELH on the right side. A pronounced, negative SP of inner ear and auditory nerve status during operations
is seen on the a¤ected side of this particular patient, that involve the peripheral auditory system has emerged
whereas the una¤ected side shows a normal pattern. as an important application for ECochG. Such mon-
The reported incidence of an enlarged SP and SP/AP itoring usually is done to help the surgeon avoid poten-
amplitude ratio in the general Ménière’s population is tial trauma to the ear and nerve in an e¤ort to preserve
approximately 60%–65% (Gibson, Mo¤at, and Rams- hearing, to identify anatomic landmarks (such as the
den, 1977; Coats, 1981; Kumagami, Nishida, and endolymphatic sac), or to help predict postoperative
Masaaki, 1982). Testing patients when they are experi- outcome (Lambert and Ruth, 1988; Gibson and Aren-
encing symptoms of MD/ELH has been shown to berg, 1991; Wazen, 1994). Figure 9 illustrates a series
improve this percentage (Ferraro, Arenberg, and Has- of ECochG responses recorded from a patient undergo-
sanein, 1985). Other approaches to increasing the sensi- ing endolymphatic shunt decompression surgery for
tivity of ECochG include measuring the AP-N1 latency treatment of MD/ELH. A reduction in the SP/AP am-
di¤erence between responses to condensation versus plitude ratio was observed during the course of surgery,
Figure 8. ABR recorded with a vertex

(þ)-to-ipsilateral earlobe () electrode
array, and ECochG-ABR recorded with
a vertex (þ)-to-ipsilateral tympanic
membrane () electrode array from a
patient with hearing loss (audiogram at
right). Wave I is absent in the conven-
tional ABR tracings but recordable with
the ECochG-ABR approach. (From
Ferraro and Ferguson, 1989, p. 165.)
and this patient reported improvement in symptoms

postoperatively.
Summary
ECochG continues to be a useful clinical tool in the
identification and evaluation of inner ear and auditory
nerve disorders. Although this article has addressed the
currently more popular clinical applications of ECochG,
others will emerge as our knowledge of auditory physi-
ology continues to improve, and the technical aspects of
recording the electrical events associated with hearing
become more sophisticated. Current areas in need of
additional research include the standardization of re-
cording and stimulus parameters, and studies designed
to improve the sensitivity of ECochG to MD/ELH and
other cochlear disorders.
—John A. Ferraro
References
Figure 9. Series of electrocochleograms recorded from a patient Ackley, R. S., Ferraro, J. A., and Arenberg, I. K. (1994). Di-
undergoing endolymphatic shunt surgery for treatment of agnosis of patients with perilymphatic fistula. Seminars in
Ménière’s disease. Baseline tracing (1), drilling on mastoid Hearing, 15, 37–41.
bone (2), probing for endolymphatic duct (3), inserting pros- American Speech-Language-Hearing Association. (1988). The
thesis (4), closing (5). Tracing 5 shows a reduction in the SP/ short latency auditory evoked potentials: A tutorial paper
AP amplitude ratio compared to tracing 1, and this patient by the Working Group on Auditory Evoked Potential
reported improvement in symptoms postoperatively. (From Measurements of the Committee on Audiologic Evaluation.
Ferraro, 2000, p. 446.) Cashman, M., and Rossman, R. (1983). Diagnostic features
of the auditory brainstem response in identifying cerebel-
lopontine angle tumors. Scandinavian Audiology, 12, 35–
41.
Coats, A. C. (1981). The summating potential and Ménière’s
disease. Archives of Otolaryngology, 104, 199–208.
Durrant, J. D., and Ferraro, J. A. (1991). Analog model of
human click-elicited SP and e¤ects of high-pass filtering.
Ear and Hearing, 12, 144–148.
Ferraro, J. A. (1997). Laboratory exercises in auditory evoked
potentials. San Diego, CA: Singular Publishing Group.
Ferraro, J. A. (2000). Electrocochleography. In R. J. Roeser,
M. Valente, and H. Hosfort-Dunn (Eds.), Audiology diag-
nosis (pp. 425–450). New York: Thieme.
Ferraro, J. A., Arenberg, I. K., and Hassanein, R. S. (1985). Further Readings

Electrocochleography and symptoms of inner ear dysfunc-
tion. Archives of Otolaryngology, 111, 71–74. Aran, J. M., and Lebert, G. (1968). Les responses nerveuse
Ferraro, J. A., Blackwell, W., Mediavilla, S. J., and Thedinger, cochleaires chex l’homme, image du fonctionnement de
B. (1994). Normal summating potential to tonebursts l’oreille et nouveau test d’audiometrie objectif. Revue de
recorded from the tympanic membrane in humans. Journal Laryngologie, Otologie, Rhinologie (Bordeaux), 89, 361–
of the American Academy of Audiology, 5, 17–23. 365.
Ferraro, J. A., and Ferguson, R. (1989). Tympanic ECochG Coats, A. C. (1986). Electrocochleography: Recording tech-
and conventional ABR: A combined approach for the nique and clinical applications. Seminars in Hearing, 7,
identification of wave I and the I–V interwave interval. Ear 247–266.
and Hearing, 3, 161–166. Coats, A. C., and Dickey, J. R. (1970). Non-surgical recording
Ferraro, J. A., and Krishnan, G. (1997). Cochlear potentials of human auditory nerve action potentials and cochlear
in clinical audiology. Audiology and Neuro-Otology, 2, microphonics. Annals of Otology, Rhinology, and Laryngol-
241–256. ogy, 29, 844–851.
Ferraro, J. A., Thedinger, B., Mediavilla, S. J., and Blackwell, Conlon, B. J., and Gibson, W. P. (2000). Electrocochle-
W. (1994). Human summating potential to tonebursts: Ob- ography in the diagnosis of Ménière’s disease. Acta Oto-
servations on TM versus promontory recordings in the same Laryngologica, 120, 480–483.
patient. Journal of the American Academy of Audiology, 6, Dallos, P., Schoeny, Z. G., and Cheatham, M. A. (1972).
217–224. Cochlear summating potentials: Descriptive aspects. Acta
Ferraro, J. A., and Tibbils, R. (1999). SP/AP area ratio in the Otolaryngologica, 301(Suppl.), 1–46.
diagnosis of Ménière’s disease. American Journal of Audiol- Durrant, J. D. (1986). Combined EcochG-ABR approach ver-
ogy, 8, 21–27. sus conventional ABR recordings. Seminars in Hearing, 7,
Gibson, W. P. R., and Arenberg, I. K. (1991). The scope of 289–305.
intraoperative electrocochleography. In I. K. Arenberg Ferraro, J. A., Best, L. G., and Arenberg, I. K. (1983). The use
(Ed.), Proceedings of the Third International Symposium and of electrocochleography in the diagnosis, assessment and
Workshops on the Surgery of the Inner Ear (pp. 295–303). monitoring of endolymphatic hydrops. Otolaryngologic
Amsterdam: Kugler. Clinics of North America, 16, 69–82.
Gibson, W. P. R., Mo¤at, D. A., and Ramsden, R. T. Gibson, W. P. R. (1978). Essentials of electric response audio-
(1977). Clinical electrocochleography in the diagnosis metry. New York: Churchill and Livingstone.
and management of Ménière’s disorder. Audiology, 16, Gibson, W. P. R., Arenberg, I. K., and Best, L. G. (1988).
389–401. Intraoperative electrocochleographic parameters follow-
Hyde, M. L., and Blair, R. L. (1981). The auditory brainstem ing nondestructive inner ear surgery utilizing a valved
response in neuro-otology: Perspectives and problems. shunt for hydrops and Ménière’s disease. In J. G. Nadol
Journal of Otololaryngolgy, 10, 117–125. (Ed.), Proceedings of the Second International Sympo-
Kumagami, H., Nishida, H., and Masaaki, B. (1982). Electro- sium on Ménière’s Disease. Amsterdam: Kugler and
cochleographic study of Ménière’s disease. Archives of Ghedini.
Otology, 108, 284–288. Goin, D. W., Staller, S. J., Asher, D. L., and Mischke, R. E.
Lambert, P., and Ruth, R. A. (1988). Simultaneous recording (1982). Summating potential in Ménière’s disease. Laryn-
of noninvasive ECoG and ABR for use in intraoperative goscope, 92, 1381–1389.
monitoring. Otolaryngology–Head and Neck Surgery, 98, Kiang, N. S. (1965). Discharge patterns of single nerve fibers in
575–580. the cat’s auditory nerve (Research Monograph No. 35).
Margolis, R. H., Levine, S. M., Fournier, M. A., Hunter, L. L., Cambridge, MA: MIT Press.
Smith, L. L., and Lilly, D. J. (1992). Tympanic electro- Kitahara, M., Takeda, T., and Yazama, T. (1981). Electro-
cochleography: Normal and abnormal patterns of response. cochleography in the diagnosis of Ménière’s disease. In
Audiology, 31, 18–24. K. H. Volsteen (Ed.), Ménière’s disease: Pathogenesis, di-
Margolis, R. H., Rieks, D., Fournier, M., and Levine, S. M. agnosis and treatment (pp. 163–169). New York: Thieme-
(1995). Tympanic electrocochleography for diagnosis of Stratton.
Ménière’s disease. Archives of Otolaryngology–Head and Kobayashi, H., Arenberg, I. K., Ferraro, J. A., and Van der
Neck Surgery, 121, 44–55. Ark, G. (1993). Delayed endolymphatic hydrops following
Orchik, J. G., Ge, X., and Shea, J. J. (1997). Action potential acoustic tumor removal with intraoperative and postopera-
latency shift by rarefaction and condensation clicks in tive auditory brainstem response improvements. Acta Oto-
Ménière’s disease. American Journal of Otolaryngology, 14, laryngolica (Stockholm), 504(Suppl.), 74–78.
290–294. Koyuncu, M., Mason, S. M., and Shinkwin, C. (1994).
Ruth, R. A., Lambert, P. R., and Ferraro, J. A. (1988). Elec- E¤ect of hearing loss in electrocochleographic investiga-
trocochleography: Methods and clinical applications. tion of endolymphatic hydrops using tone-pip and click
American Journal of Otolaryngology, 9, 1–11. stimuli. Journal of Laryngology and Otology, 108, 125–
Sass, K., Densert, B., and Arlinger, S. (1997). Recording 130.
techniques for transtympanic electrocochleography in clini- Laureano, A. N., Murray, D., McGrady, M. D., and Camp-
cal practice. Acta Otolaryngologica (Stockholm), 118, 17– bell, K. C. M. (1995). Comparison of tympanic membrane-
25. recorded electrocochleography and the auditory brainstem
Stypulkowski, P., and Staller, S. (1987). Clinical evaluation response in threshold determination. American Journal of
of a new ECoG recording electrode. Ear and Hearing, 8, Otolaryngology, 16, 209–215.
304–310. Levine, S. M., Margolis, R. H., Fournier, E. M., and Winzen-
Wazen, J. J. (1994). Intraoperative monitoring of auditory burg, S. M. (1992). Tympanic electrocochleography for
function: Experimental observation, and new applications. evaluation of endolymphatic hydrops. Laryngoscope, 102,
Laryngoscope, 104, 446–455. 614–622.
Electronystagmography 467
Moriuchi, H., and Kumagami, H. (1979). Changes of AP, SP 45 angle. The diagnosis of BPPV is based on charac-
and CM in experimental endolymphatic hydrops. Audiol- teristic clinical findings on the Dix-Hallpike test. These
ogy, 22, 258–260. findings include (1) torsional nystagmus and vertigo that
Morrison, A. W., Mo¤at, D. A., and O’Connor, A. F. (1980). occur when the patient is placed in the provoking posi-
Clinical usefulness of electrocochleography in Ménière’s
tion, (2) a delay in the onset of vertigo and nystagmus,
disease: An analysis of dehydrating agents. Otolaryngologic
Clinics of North America, 11, 703–721. and (3) a duration of vertigo and nystagmus of less than
Ng, M., Srireddy, S., Horlbeck, D. M., and Niparko, J. K. 1 minute.
(2001). Safety and patient experience with transtympanic
electrocochleography. Laryngoscope, 111, 792–795.
Orckik, D. J., Shea, J. J., Jr., and Ge, X. (1993). Trans- Ocular Motor Tests
tympanic electrocochleography in Ménière’s disease using
clicks and tone-bursts. American Journal of Otology, 14, The purpose of ocular motor tests is to test non-
290–294. vestibular eye movements. These movements include the
Ruben, R., Sekula, J., and Bordely, J. E. (1960). Human saccadic system, the smooth pursuit system, the opto-
cochlear responses to sound stimuli. Annals of Otorhinolar- kinetic (OPK) system, and the gaze-holding mechanism.
yngology, 69, 459–476.
Ruth, R. A. (1990). Trends in electrocochleography. Journal of Abnormalities in the ocular motor systems generally lo-
the American Academy of Audiology, 1, 134–137. calize CNS lesions; however, acute peripheral vestibular
Ruth, R. A. (1994). Electrocochleography. In J. Katz (Ed.), lesions may also cause abnormal findings.
Handbook of clinical audiology (4th ed., pp. 339–350). Bal- The saccadic system rapidly changes the direction of
timore: Williams and Wilkins. the eye to acquire the image of an object of interest.
Schmidt, P., Eggermont, J., and Odenthal, D. (1974). Study of Disorders of the saccadic system can include slowing of
Ménière’s disease by electrocochleography. Acta Otolar- saccadic eye movements, impaired saccadic accuracy,
yngologica, 316(Suppl.), 75–84. and impaired reaction time. The preferred stimulus for
Schoonhoven, R., Fabius, M. A. W., and Grote, J. J. (1995). testing the saccadic system is a random sequence para-
Input/output curves to tonebursts and clicks in extra-
digm presented with an array of light-emitting diodes
tympanic and transtympanic electrocochleography. Ear and
Hearing, 16, 619–630. (LEDs) controlled by a computer (Baloh et al., 1975).
Measurement parameters on the saccade test include la-
tency, velocity, and accuracy (Fig. 1).
The smooth pursuit system allows continuous, clear
vision of objects moving within the visual environment.
Patients with impaired smooth pursuit require frequent
corrective saccades to keep up with the target, producing
Electronystagmography cogwheeling or saccadic pursuit responses. Most com-
puterized ENG systems use a sinusoidal paradigm that
o¤ers precise control of frequency and amplitude of
Electronystagmography (ENG) refers to a battery of smooth pursuit testing. The most important measure-
tests used to evaluate the vestibular system. The tests ment parameter of smooth pursuit is gain. Gain is cal-
include (1) the Dix-Hallpike test, (2) ocular motor tests, culated as the ratio of eye velocity to target velocity.
(3) a search for positional and/or spontaneous nys- Figure 2 shows gain and EOG recording during smooth
tagmus, (4) the caloric test, and (5) the failure of fixation pursuit for a normal subject.
suppression test. During ENG testing, eye movement The optokinetic system serves to hold images of the
is measured to determine the presence of peripheral environment on the retina during sustained head rota-
(vestibular nerve and/or end-organ) vestibular or central tion. There are two optokinetic stimuli: (1) partial field
nervous system (CNS) dysfunction. Two methods for devices, which include the light bar and a small motor-
recording eye movement are electro-oculography (EOG) ized drum, and (2) full-field devices (preferred), such
and video-nystagmography (VNG). EOG is a recording as an optokinetic projector or large optokinetic drum,
of the corneoretinal potential with surface electrodes that fill at least 90% of the visual field. The measure-
placed on the face, whereas VNG measures eye movement parameter of the OPK test is gain (ratio of eye ve-
ment through the use of infrared video cameras mounted locity to field velocity). Gain should be at least 0.5 and
in goggles worn by the patient during testing. symmetrical.
The gaze tests determine whether a patient has gaze-
evoked nystagmus. Gaze-evoked nystagmus occurs when
Dix-Hallpike Test a leaky neural integrator causes the eyes to drift back
toward the primary position, necessitating corrective
The Dix-Hallpike maneuver is a provocative positioning saccades. Thus, the eyes cannot hold their position when
test for benign paroxysmal positioning vertigo (BPPV) looking at an eccentric target. Gaze-evoked nystagmus
(Dix and Hallpike, 1952). The patient is seated upright can be caused by drugs, cerebellar disease, brainstem
with the head turned 45 toward the test ear and then lesions, and acute peripheral vestibular lesions. The
quickly lowered into a supine position with the head stimulus for the gaze tests is a light bar target positioned
hanging o¤ the bed or table and still positioned at the 20 or 30 right, left, up, or down from the center.
Figure 1. Sample of ENG recordings of saccades produced with a random-sequence paradigm in a normal subject. Data points rep-
resent peak velocity, accuracy, and latency measurements for each rightward and leftward saccade.
Figure 2. Sample of ENG recording of smooth pursuit produced with a sinusoidal paradigm in a normal subject. Data points repre-
sent gain values for each rightward and leftward eye movement across target frequency.
Electronystagmography 469
Figure 3. Plots of slow-component velocity from nystagmus elicited by bithermal water caloric irrigation as a function of time in a
normal subject. The four panels represent irrigation of the left and right ears at warm (44 C) and cool (30 C) water temperatures.
Spontaneous Nystagmus and to perform mental alerting tasks to avoid central

suppression.
To determine the presence of spontaneous nystagmus, Positional (and spontaneous) nystagmus is classified
the subject is seated upright with eyes closed, the head according to the direction of the fast phase. The direc-
positioned straight ahead (0 ), and the subject engaged tion of nystagmus can be fixed or changing. Direction-
in mental alerting tasks to prevent suppression of nys- changing nystagmus can be geotropic (beating toward
tagmus. Spontaneous nystagmus is typically horizontal the earth) or ageotropic (beating away from the earth).
jerk nystagmus and usually occurs due to a lesion to Direction-changing nystagmus is an abnormal, non-
the peripheral vestibular system causing an imbalance in localizing finding that is most often associated with pe-
the tonic signals arriving at the oculomotor neurons. The ripheral vestibular disease (Lin et al., 1986). Positional
imbalance produces a constant drift of the eyes in one nystagmus can also be direction-changing in a single
direction, interrupted by fast components in the opposite head position. This clinical finding is rare and indicates a
direction. If the imbalance results from a peripheral ves- central pathology. There is evidence that both structural
tibular lesion, then the pursuit system can cancel it. and metabolic factors can alter the specific gravity of the
Thus, when the patient opens his or her eyes, the nys- cupula in the semicircular canals and cause positional
tagmus disappears. The clinical finding of spontaneous nystagmus (Honrubia, 2000). Positional nystagmus can
nystagmus suggests an uncompensated peripheral lesion, also be caused by brainstem lesions, and most clinicians
typically on the side opposite the direction of nystagmus. place the burden of proof on the ocular motor function
The proof of the side of lesion, however, lies in the ca- tests. That is, if the ocular motor tests are within normal
loric test. limits, then it is doubtful that a brainstem lesion exists.
Positional Nystagmus
Positional nystagmus occurs when a subject is placed in
The Caloric Test
the following static positions: sitting; supine; supine, The caloric test (Fitzgerald and Hallpike, 1942) is the
head turned left; supine, head turned right; right lateral; most important test in the ENG test battery and the
left lateral; pre-irrigation position; and head-hanging only clinical vestibular test that can lateralize a
straight, right, and left. The patient is asked to close his vestibular deficit. The caloric test stimulates the hori-
or her eyes to eliminate the e¤ects of visual suppression zontal semicircular canal and involves measurement of
Figure 4. Plots of slow-component velocity from nystagmus elicited by bithermal caloric irrigation in a patient with a right peripheral
vestibular lesion. No response was elicited from warm and cool water irrigation to the right ear, resulting in a right unilateral
weakness.
slow-component eye velocity. The key principle of the by which the average SCVs provoked by right ear irri-
caloric test is the convection current created in the hori- gation di¤er in intensity from those provoked by left ear
zontal semicircular canal by changing the temperature of irrigations. The formula for unilateral weakness is:
the endolymph. This convection current causes utriculo-
petal endolymph flow and increased neural firing of the (RW þ RC) (LW þ LC)
UW(%) ¼ 100
primary vestibular a¤erent nerve during warm water or RW þ RC þ LW þ LC
air irrigation, so that nystagmus beats toward the ear
that is stimulated. Irrigation with cool water or air pro- where RW, RC, LW, and LC are the peak SCVs of
duces utriculofugal flow and a decrease in neural firing the responses to right warm, right cool, left warm, and
of the primary vestibular a¤erent nerve, so that nys- left cool irrigations, respectively. Most laboratories use
tagmus beats away from the stimulated ear. an interear di¤erence of greater than 20%–25% to de-
The fundamental assumption of the caloric test is that termine if a unilateral weakness exists (Baloh and Hon-
all four caloric irrigations of a given patient are equally rubia, 1990; Jacobson, Newman, and Kartush, 1997).
strong. Variables that a¤ect the strength of the caloric Unilateral weakness identifies a peripheral vestibular
stimulus include stimulus temperature, stimulus dura- deficit on the weak side and is the most definitive finding
tion, flow rate or volume, mental alerting procedures, on the ENG test for identifying and lateralizing a pe-
size and shape of the external auditory canal, type of ripheral vestibular deficit.
stimulus, patient age, and patient medication. The slow- A spontaneous nystagmus, such as in the case of an
component velocity (SCV) of the nystagmus has proved acute peripheral vestibular lesion, can shift the baseline
to be the most sensitive parameter of vestibular function of the caloric response. The caloric responses will be
and is currently the standard metric used for evaluating skewed toward the direction of the spontaneous nys-
the caloric test (Henricksson, 1956). Because average tagmus. The intensity di¤erence between right-beating
SCVs vary widely in normal subjects, a ratio of right and left-beating caloric nystagmus is called directional
ear to left ear responses is used to analyze the caloric preponderance (DP). The formula for directional pre-
test results (Barber and Wright, 1973). Normal and ponderance is:
symmetrical bithermal caloric nystagmus is shown in (RW þ LC) (RC þ LW)
Figure 3. Unilateral weakness is a caloric asymmetry DP(%) ¼ 100
RW þ LC þ RC þ LW
that results when one labyrinth is less sensitive to caloric
irrigation than the other labyrinth (Fig. 4). The unilat- where RW, RC, LW, and LC are the peak SCVs of the
eral weakness is calculated by determining the amount responses to right warm, right cool, left warm, and left
Frequency Compression 471
cool irrigations, respectively. A directional preponder- Ford, C. R., and Stockwell, C. W. (1978). Reliabilities of air
ance of 30% or greater is abnormal (Baloh and Honru- and water caloric responses. Archives of Otolaryngology,
bia, 1990). Directional preponderance is considered a 104, 380–382.
nonlocalizing finding and usually reflects the presence of Furman, J. M., and Cass, S. P. (1996). Balance disorders: A
case-study approach. Philadelphia: F. A. Davis Co.
spontaneous or positional nystagmus.
Furman, J. M., and Jacob, R. G. (1993). Jongkees’ formula re-
Other caloric abnormalities include failure of fixa- evaluated: Order e¤ects in the response to alternate binaural
tion suppression and bilateral weakness. Failure of fix- bithermal caloric stimulation using closed-loop irrigation.
ation suppression indicates that a patient is unable to Acta Otolaryngologica, 113, 3–10.
suppress visually the nystagmus by more than 50% of Goebel, J. (2001). Practical management of the dizzy patient.
peak SCV, suggesting central vestibular involvement. Philadelphia: Lippincott, Williams and Wilkins.
Bilateral weakness occurs with weak or absent caloric Herdman, S. (2000). Vestibular rehabilitation. Philadelphia:
responses in both ears. Bilateral weakness usually indi- F. A. Davis Co.
cates a bilateral peripheral vestibular deficit, but it also Jacobson, G. P., Calder, J. A., Shepherd, V. A., Rupp, K. A.,
may occur with CNS pathology. and Newman, C. W. (1995). Reappraisal of the mono-
thermal warm caloric screening test. Annals of Otology,
—Faith W. Akin Rhinology, and Laryngology, 104, 942–945.
Karlsen, E. A., Mikhail, H. H., Norris, C. W., and Hassanein,
R. S. (1992). Comparison of responses to air, water, and
References closed-loop caloric irrigators. Journal of Speech and Hear-
Baloh, R. W., and Honrubia, V. (1990). Clinical neurophys- Leigh, J., and Zee, D. S. (1993). The neurology of eye move-
iology of the vestibular system. Philadelphia: F. A. Davis ments. Philadelphia: F. A. Davis Co.
Co. Shepard, N. T., and Telian, S. A. (1996). Practical management
Baloh, R. W., Konrad, H. R., Sills, A. W., and Honrubia, V. of the balance disorder patient. San Diego, CA: Singular
(1975). The saccade velocity test. Neurology, 25, 1071–1076. Publishing Group.
Barber, H. O., and Wright, G. (1973). Positional nystagmus in
normals. Advances in Oto-Rhino-Laryngology, 19, 276–285.
Dix, R., and Hallpike, C. S. (1952). The pathology, sympto-
matology and diagnosis of certain common disorders of the
vestibular system. Annals of Otology, Rhinology, and Laryn- Frequency Compression
gology, 6, 987–1016.
Fitzgerald, G., and Hallpike, C. S. (1942). Studies in human
vestibular function: I. Observations on the directional pre-
Frequency compression (or frequency lowering) is a
ponderance (‘‘nystagmusbereitschaft’’) of caloric nystagmus
resulting from cerebral lesions. Brain, 62, 115–137. general term applied to attempts to lower the spectrum
Henricksson, N. G. (1956). Speed of slow component and of the acoustic speech signal to better match the re-
duration in caloric nystagmus. Acta Otolaryngologica, 46, sidual hearing of listeners with severe to profound high-
1–29. frequency sensorineural impairment accompanied by
Honrubia, V. (2000). Quantitative vestibular function tests and better hearing at the low frequencies. This pattern of
the clinical examination. In S. Herdman (Ed.), Vestibular hearing loss is common to a number of di¤erent etiol-
rehabilitation (pp. 105–171). Philadelphia: F. A. Davis. ogies of hearing loss (including presbycusis, noise expo-
Jacobson, G. P., Newman, C. W., and Kartush, J. M. (1997). sure, ototoxicity, and various genetic syndromes) and
Handbook of balance function testing. San Diego, CA: Sin- arises from greater damage to the basal region relative to
gular Publishing Group.
the apical region of the cochlea (see noise-induced
Lin, J., Elidan, J., Baloh, R. W., and Honrubia, V. (1986).
Direction-changing positional nystagmus: Incidence and hearing loss; ototoxic medications; presbyacusis).
meaning. American Journal of Otolaryngology, 4, 306– The major e¤ect of high-frequency hearing loss on
310. speech reception is a degraded ability to perceive sounds
whose spectral energy is dominated by high frequencies,
in some cases extending to 10 kHz or beyond. Perceptual
Further Readings studies have documented the di‰culties of listeners with
Bamiou, D. E., Davies, R. A., McKee, M., and Luxon, L. M. high-frequency loss in the reception of high-frequency
(1999). The e¤ect of severity of unilateral vestibular dys- sounds (including plosive, fricative, and a¤ricate con-
function on symptoms, disabilities and handicap in vertigi- sonants) and have demonstrated that this pattern of
nous patients. Clinical Otolaryngology, 24, 31–38. confusion is similar to that observed by normal-hearing
Barber, H. O., and Stockwell, C. (1980). Manual of electro- listeners deprived of high-frequency cues through the use
nystagmography. Shambaugh, IL: ICS Medical Corp. of low-pass filtering (Wang, Reed, and Bilger, 1978).
Beynon, G. J. (1997). A review of management of benign Traditional hearing aids attempt to treat this pattern of
paroxysmal positional vertigo by exercise therapy and by hearing loss by delivering frequency-dependent amplifi-
repositioning manoeuvres. British Journal of Audiology, 31,
cation to overcome the loss at high frequencies. Such
11–26.
Bhansali, S. A., and Honrubia, V. (1999). Current status of amplification, however, may not lead to improved per-
electronystagmography testing. Otolaryngology–Head and formance and has even been shown to degrade the
Neck Surgery, 120, 419–426. speech reception ability of some listeners with severe
Coats, A. (1976). The air caloric: A parametric study. Archives to profound high-frequency loss (Hogan and Turner,
of Otolaryngology, 102, 343–354. 1998).
The goal of frequency lowering is to recode the high- time (Fairbanks, Everitt, and Jaeger, 1954) prior to the
frequency components of speech into a lower frequency application of slow playback. Time compression can
range that is matched to the residual capacity of a lis- be accomplished in di¤erent ways, including the elimi-
tener’s hearing. Frequency lowering has been accom- nation of a fixed duration of speech at a given rate of
plished through a variety of di¤erent techniques. These interruption or eliminating pitch periods from voiced
methods have arisen primarily from attempts at band- speech. When the time-compression and slow-playback
width reduction in the telecommunications industry, factors are chosen to be equal, the long-term duration
rather than being driven by the perceptual needs of of the speech signal can be preserved while at the same
hearing-impaired listeners. This article summarizes and time frequencies are lowered proportionally. Funda-
updates the review of the literature on frequency low- mental frequency can be a¤ected di¤erently, depending
ering published by Braida et al. (1979). For each of seven on the particular characteristics of the time-compression
di¤erent categories of signal processing, the major char- scheme, including being lowered, remaining unchanged,
acteristics of each method are described and a brief or being severely distorted (see Braida et al., 1979). The
summary of results obtained with it is provided. intelligibility of speech processed by this technique for
normal-hearing listeners is similar to that described
Slow Playback above for slow playback; that is, bandwidth reduction
by factors greater than 20% lead to severe decrease
The earliest approach to frequency lowering was the in performance (Danilo¤, Shriner, and Zemlin, 1968;
playback of recorded speech at a slower speed than that Nagafuchi, 1976). Results of studies in hearing-impaired
used in recording. Each spectral component is scaled listeners (Mazor et al., 1977; Turner and Hurtig, 1999)
lower in frequency by a multiplicative factor equal to the indicate that improvements for time-compressed slow-
slowdown factor. Although this method is not suitable playback speech compared to conventional linear or
for real-time applications (because of the inherent time high-pass amplification may be observed under certain
dilation of the resulting waveform) and leads to severe conditions. Small benefits, on the order of 10–20 per-
alterations in temporal relations between speech sounds, centage points, are most likely to be observed for small
it is nonetheless important to understand its e¤ects amounts of frequency lowering (bandwidth reduction
because it is a component of many other frequency- factors in the range of 10%–30%), for female rather
lowering schemes. An important characteristic of this than male voices, and for individuals who receive little
method is its preservation of the proportional relation- aid from conventional high-pass amplification. A wear-
ship between spectral components, including the rela- able aid that operates on the basic principles of time-
tion between the short-term spectral envelope and the compressed slow playback (the AVR Transonic device)
fundamental frequency (F 0) of voiced speech. A nega- has been evaluated in children with profound deafness
tive consequence of proportional lowering, however, is (Davis-Penn and Ross, 1993; MacArdle et al., 2001)
the shifting of F 0 into an undesirably low frequency and in adults with high-frequency impairment (Parent,
range (particularly for male voices). Results obtained in Chmiel, and Jerger, 1997; McDermott et al., 1999). A
studies of the e¤ects of slow playback on speech recep- high degree of variability is observed across studies and
tion conducted with normal-hearing listeners (Ti¤anny across subjects within a given study, with substantial
and Bennett, 1961; Danilo¤, Shriner, and Zemlin, 1968) improvements noted for certain subjects and negligible
indicate that reductions in bandwidth up to roughly 25% e¤ects or degradations for others.
produce only small losses in intelligibility, bandwidth
reductions of 50% cause moderate losses in intelligibility,
and bandwidth reductions of 66% or greater lead to se-
Frequency Shifting
vere loss in intelligibility. These studies have shown that Another technique for frequency lowering employs
the voices of females and children are more resistant to heterodyne processing, which uses amplitude modula-
lowering than male voices (presumably because the fun- tion to shift all frequency components in a given band
damental and formant frequencies are higher for women downward by a fixed displacement. This process leads to
than for men), that the e¤ects of lowering are similar for the overlaying, or aliasing, of high-frequency and low-
the reception of consonants and vowels, and that per- frequency components. Aliasing is generally avoided by
formance with lowered speech materials improves with the removal of low-frequency components through fil-
practice. In a study of slow playback in listeners with tering before modulation. Systems that employ shifting
high-frequency sensorineural hearing loss, Bennett and of the entire spectrum have a number of disadvantages:
Byers (1967) found beneficial e¤ects for modest degrees although temporal and rhythmic patterns of speech re-
of frequency lowering (up to a 20% reduction in band- main normal, the harmonic relationships of voiced
width) but that greater degrees of lowering led to a sub- sounds are greatly altered, fundamental frequency is
stantial reduction in performance. severely modified, and low-frequency components im-
portant to speech recognition are removed to prevent
aliasing. Even mild degrees of frequency shifting (e.g., a
Time-Compressed Slow Playback
400-Hz shift for male voices) have been found to inter-
A solution to the time dilation inherent to slow playback fere substantially with speech reception ability (Ray-
was introduced by techniques that compress speech in mond and Proud, 1962).
Frequency Compression 473
Frequency Transposition envelope). The e¤ect of degree of lowering in vocoder-

based systems appears to be comparable to that de-
When frequency shifting is restricted to the high- scribed above for slow playback and time-compressed
frequency components of speech (rather than to the slow playback (Fu and Shannon, 1999). A number of
entire speech spectrum), the process is referred to as fre- studies conducted with vocoder-based lowering systems
quency transposition. This approach has been incorpo- have demonstrated improved speech reception with
rated into several di¤erent wearable or desktop aids training, both for normal-hearing (Takefuta and Swi-
(Johansson, 1966; Velmans, 1973) whose basic operation gart, 1968; Posen, Reed, and Braida, 1993) and for
involves shifting speech frequencies in the region above hearing-impaired listeners (Ling and Druz, 1967;
3 or 4 kHz into a lower frequency region, and adding McDermott and Dean, 2000). When performance with
these processed components to the original unprocessed vocoding systems is compared to baseline systems
speech signal. Generally, the most careful and con- employing low-pass filtering to an equivalent bandwidth
trolled studies of frequency transposition indicate that for normal listeners or conventional amplification for
benefits are quite modest. Transposition can render high- impaired listeners, however, the benefits of lowered
frequency speech cues audible to listeners with severe speech appear to be quite modest. One possible reason
to profound high-frequency hearing loss (Rees and Vel- for the lack of success of some of these systems (despite
mans, 1993); however, these cues may interfere with the apparent promise of this approach) may have been
information normally associated with the reception of the failure to distinguish between voiced and unvoiced
low-frequency speech components (Ling, 1968). There is sounds. Systems in which processing is suppressed
evidence to suggest that transposition aids may be more when the input signal is dominated by low-frequency
useful in training in speech production than in improving energy (Posen, Reed, and Braida, 1993) lead to better
speech reception in deaf children (Ling, 1968). performance (compared to systems with no inhibi-
tions in processing for voiced sounds) based on their
Zero-Crossing-Rate Division ability to enhance the reception of high-frequency sounds
Another approach to frequency lowering lies in an at- while not degrading the reception of low-frequency
tempt to reduce the zero-crossing rate of the speech sounds.
signal. In these schemes, bands of speech are extracted
by filtering and the filter outputs are converted to lower Frequency Warping
frequency sounds having reduced zero-crossing rates.
Evaluations of a system in which processing was applied A more recent approach to frequency lowering incor-
only to high-frequency components and inhibited during porates digital signal-processing techniques developed
voiced speech (Guttman and Nelson, 1968) indicated no for correcting ‘‘helium speech.’’ The speech signal is
benefits for processed materials on a large-set word rec- segmented pitch synchronously, processed to achieve
ognition task for normal-hearing listeners with simulated nonuniform spectral warping, dilated in time to achieve
hearing loss. Use of this system as a speech-production frequency lowering, and resynthesized with the original
aid for hearing-impaired children indicates that, follow- periodicity. Both the overall bandwidth reduction and
ing extensive training, the ability to produce selected the relative compression of high- and low-frequency
high-frequency sounds was improved, while at the same components can be specified. These methods roughly
time the ability to discriminate these same words audi- extrapolate the variance associated with increased length
torily showed no such improvements (Guttman, Levitt, of the vocal tract and include the following character-
and Bellefleur, 1970). istics: they preserve the temporal and rhythmic prop-
erties of speech, they leave F 0 of voiced sounds
unaltered, they allow for independent manipulation of
Vocoding F 0 and spectral envelope, and they compress the short-
An important class of frequency-lowering systems for term spectrum in a continuous and monotonic fashion.
the hearing-impaired is based on the channel vocoder Studies of speech reception with frequency-warped
(Dudley, 1939), which was originally developed to speech indicate that spectral transformations that lead
achieve bandwidth reduction in telecommunications to greater lowering of the high frequencies relative to
systems. Vocoding systems analyze speech into contigu- the low frequencies are superior to those with linear
ous bandpass filters whose output envelopes are detected lowering or with greater lowering of low relative to
and low-pass-filtered for transmission. These signals are high frequencies (Allen, Strong, and Palmer, 1981; Reed
then used to control the amplitudes of corresponding et al., 1983). Improvements in the ability to identify
channels. For frequency lowering, the set of synthesis frequency-warped speech with training have been noted
filters correspond to lower frequencies than the asso- for normal and hearing-impaired listeners (Reed et al.,
ciated analysis filters. Vocoding systems appear to have 1985). Improved ability to discriminate and identify
a number of advantages, including operation in real time high-frequency consonants has been demonstrated with
and flexibility in terms of the choice of analysis and such warping transformations compared to low-pass fil-
synthesis filters (which can allow for di¤erent degrees of tering for substantial reductions in bandwidth (up to a
lowering in di¤erent regions of the spectrum as well as factor of 4 or 5). Overall performance, however, is simi-
for independent manipulation of F 0 and the spectral lar for lowering and low-pass filtering, owing to reduced
performance for the lowering schemes on sounds with Davis-Penn, W., and Ross, M. (1993). Pediatric experiences
substantial low-frequency energy. with frequency transposing. Hearing Instruments, 44, 26–
32.
Summary and Conclusions Dudley, H. (1939). Remaking speech. Journal of the Acoustical
Society of America, 11, 169–177.
Attempts at frequency lowering through a variety of Fairbanks, G., Everitt, W. L., and Jaeger, R. P. (1954).
di¤erent methods have met with only limited success. Method for time or frequency compression-expansion of
Frequency lowering leads to a reduction in bandwidth of speech. IRE Transactions on Audio, AU-2, 7–12.
the original speech signal and to the creation of new Fu, Q. J., and Shannon, R. V. (1999). Recognition of spec-
speech codes which may sound unnatural to the un- trally degraded and frequency-shifted vowels in acoustic
trained ear. Evidence from a number of di¤erent studies and electric hearing. Journal of the Acoustical Society of
America, 105, 1889–1900.
indicates that performance on frequency-lowered speech
Guttman, N., Levitt, H., and Bellefleur, P. (1970). Articulation
can improve with familiarization and training in the training of the deaf using low-frequency surrogate fricatives.
use of frequency-lowered speech. Many of these same Journal of Speech and Hearing Research, 13, 19–29.
studies, however, also indicate that even after extended Guttman, N., and Nelson, J. R. (1968). An instrument that
practice, performance with the coded speech signals creates some artificial speech spectra for the severely hard of
does not exceed that achieved with appropriate base- hearing. American Annals of the Deaf, 113, 295–302.
line conditions (e.g., speech filtered to an equivalent Hogan, C. A., and Turner, C. W. (1998). High-frequency au-
bandwidth in normal-hearing listeners or conventional dibility: Benefits for hearing-impaired listeners. Journal of
amplification with appropriate frequency gain char- the Acoustical Society of America, 104, 432–441.
acteristics in hearing-impaired listeners). Although Johansson, B. (1966). The use of the transposer for the
management of the deaf child. International Audiology, 5,
frequency-lowering techniques can lead to large im-
362–373.
provements in the reception of high-frequency sounds, Ling, D. (1968). Three experiments on frequency transposition.
they may at the same time lead to detrimental e¤ects on American Annals of the Deaf, 113, 283–294.
the reception of vowels and consonants whose spectral Ling, D., and Druz, W. S. (1967). Transposition of high-
energy is concentrated at low frequencies. Because of the frequency sounds by partial vocoding of the speech spec-
need to use the region of low-frequency residual hearing trum: Its use by deaf children. Journal of Auditory Research,
for recoding high-frequency sounds, the low-frequency 7, 133–144.
components of speech may be altered as well through MacArdle, B. M., West, C., Bradley, J., Worth, S., Mackenzie,
the overlaying of coded signals onto the original un- J., and Bellman, S. C. (2001). A study of the application of
processed speech or through wholesale lowering of the a frequency transposition hearing system in children. British
Journal of Audiology, 35, 17–29.
entire speech signal. In listeners with high-frequency im-
Mazor, M., Simon, H., Scheinberg, J., and Levitt, H. (1977).
pairment accompanied by good residual hearing in the Moderate frequency compression for the moderately hear-
low frequencies, benefits for frequency lowering have ing impaired. Journal of the Acoustical Society of America,
been observed for listeners with severe to profound high- 62, 1273–1278.
frequency loss using mild degrees of lowering (no greater McDermott, H. J., and Dean, M. R. (2000). Speech perception
than 30% reduction in bandwidth). For children with with steeply sloping hearing loss: E¤ects of frequency
profound deafness (whose residual low-frequency hear- transposition. British Journal of Audiology, 34, 353–361.
ing may be quite limited), frequency lowering appears McDermott, H. J., Dorkos, V. P., Dean, M. R., and Ching,
to be more e¤ective as a speech production training aid T. Y. C. (1999). Improvements in speech perception with
for specific groups of phonemes rather than as a speech use of the AVR TranSonic frequency-transposing hearing
aid. Journal of Speech, Language, and Hearing Research, 42,
perception aid.
1323–1335.
—Charlotte M. Reed and Louis D. Braida Nagafuchi, M. (1976). Intelligibility of distorted speech sounds
shifted in frequency and time in normal children. Audiology,
References 15, 326–337.
Parent, T. C., Chmiel, R., and Jerger, J. (1997). Comparison of
Allen, D. R., Strong, W. J., and Palmer, E. P. (1981). Experi- performance with frequency transposition hearing aids and
ments on the intelligibility of low-frequency speech codes. conventional hearing aids. Journal of the American Acad-
Journal of the Acoustical Society of America, 70, 1248– emy of Audiology, 8, 355–365.
1255. Posen, M. P., Reed, C. M., and Braida, L. D. (1993). Intelli-
Bennett, D. N., and Byers, V. W. (1967). Increased intelligibil- gibility of frequency-lowered speech produced by a channel
ity in the hypacusic by slow-play frequency transposition. vocoder. Journal of Rehabilitation Research and Develop-
Journal of Auditory Research, 7, 107–118. ment, 30, 26–38.
Braida, L. D., Durlach, N. I., Lippmann, R. P., Hicks, B. L., Raymond, T. H., and Proud, G. O. (1962). Audiofrequency
Rabinowitz, W. M., and Reed, C. M. (1979). Hearing aids: conversion. Archives of Otolaryngology, 76, 60–70.
A review of past research on linear amplification, amplitude Reed, C. M., Hicks, B. L., Braida, L. D., and Durlach, N. I.
compression, and frequency lowering (ASHA Monograph (1983). Discrimination of speech processed by low-pass fil-
No. 19). Rockville, MD: American Speech and Hearing tering and pitch-invariant frequency lowering. Journal of the
Association. Acoustical Society of America, 74, 409–419.
Danilo¤, R. G., Shriner, T. H., and Zemlin, W. R. (1968). In- Reed, C. M., Schultz, K. I., Braida, L. D., and Durlach, N. I.
telligibility of vowels altered in duration and frequency. (1985). Discrimination and identification of frequency-
Journal of the Acoustical Society of America, 44, 700–707. lowered speech in listeners with high-frequency hearing im-
Functional Hearing Loss in Children 475
pairment. Journal of the Acoustical Society of America, 78, Risberg, A. (1965). The transposer and a model for speech
2139–2141. perception. STL-QPSR, 4, 26–30. Stockholm: KTH.
Rees, R., and Velmans, M. (1993). The e¤ect of frequency Rosenhouse, J. (1990). A new transposition device for the deaf.
transposition on the untrained auditory discrimination of Hearing Journal, 43, 20–25.
congenitally deaf children. British Journal of Audiology, 27, Sher, A. E., and Owens, E. (1974). Consonant confusions
53–60. associated with hearing loss above 2000 Hz. Journal of
Takefuta, Y., and Swigart, E. (1968). Intelligibility of speech Speech and Hearing Research, 17, 669–681.
signals spectrally compressed by a sampling-synthesizer Zue, V. W. (1970). Translation of diver’s speech using digital
technique. IEEE Transactions on Auditory Electroacoustics, frequency warping. MIT Research Laboratory of Electronics
AU-16, 271–274. Quarterly Progress Report, 101, 175–182.
Ti¤anny, W. R., and Bennett, D. N. (1961). Intelligibility of
slow played speech. Journal of Speech and Hearing Re-
search, 4, 248–258. Functional Hearing Loss in Children
Turner, C. W., and Hurtig, R. R. (1999). Proportional fre-
quency compression of speech for listeners with sensori-
neural hearing loss. Journal of the Acoustical Society of Functional hearing loss (FHL) is frequently forgotten or
America, 106, 877–886.
Velmans, M. L. (1973). Speech imitation in simulated deafness, misdiagnosed in the pediatric population, despite the
using visual cues and ‘‘recoded’’ auditory information. fact that it is well documented (Bowdler and Rogers,
Language and Speech, 16, 224–236. 1989). The diagnosis is often missed in children because
Wang, M. D., Reed, C. M., and Bilger, R. C. (1978). A com- of lack of awareness of its manifestations (Pracy et al.,
parison of the e¤ects of filtering and sensorineural hearing 1996), its incidence (Barr, 1963), and its multiple causes
loss on patterns of consonant confusions. Journal of Speech (Broad, 1980).
and Hearing Research, 21, 5–36. Functional hearing loss is one of several terms used to
describe a hearing loss that cannot be ascribed to an or-
Further Readings ganic cause (Aplin and Rowson, 1986). In FHL, actual
Beasley, D. S., Mosher, N. L., and Orchik, D. J. (1976). Use of
audiometric thresholds are inconsistent with the volun-
frequency shifted/time compressed speech with hearing- tary thresholds of a patient. The term pseudohypacusis
impaired children. Audiology, 15, 395–406. was coined by Carhart (1961) to describe a condition in
Block, von R., and Boerger, G. (1980). Hörverbessernde Ver- which a person presents with a hearing loss not consis-
fahren mit Bandbreitenkompression. Acustica, 45, 294–303. tent with clinical or audiologic evaluation. Nonorganic
Boothroyd, A., and Medwetsky, L. (1992). Spectral distribu- hearing loss is used interchangeably with FHL and
tion of /s/ and the frequency-response of hearing aids. Ear pseudohypacusis in that these terms do not comment on
and Hearing, 13, 150–157. the intent, conscious or subconscious, of the patient be-
Ching, T. Y. C., Dillon, H., and Byrne, D. (1998). Speech rec- ing tested (Radkowski, Cleveland, and Friedman, 1998).
ognition of hearing-impaired listeners: Predictions from au- Terms such as psychogenic hearing loss and conversion
dibility and the limited role of high-frequency amplification.
Journal of the Acoustical Society of America, 103, 1128–
deafness imply that the cause of the auditory disturbance
1140. is psychological, whereas malingering suggests the con-
David, E. E., and McDonald, H. S. (1956). Note on pitch- scious and deliberate adoption or fabrication of a
synchronous processing of speech. Journal of the Acoustical hearing loss for personal gain (Stark, 1966). FHL often
Society of America, 28, 1261–1266. appears as an overlay on an organic impairment. As
Denes, P. B. (1967). On the motor theory of speech perception. such, the term functional overlay is used to describe an
In W. W. Dunn (Ed.), Models for the perception of speech exaggeration of an existing hearing loss.
and visual form (pp. 309–314). Cambridge, MA: MIT Press. The incidence of FHL in children is not well docu-
Foust, K. O., and Gengel, R. W. (1973). Speech discrimination mented. Valid estimates are lacking, owing to lack of
by sensorineural hearing impaired persons using a trans- consensus on the definition of FHL and the absence of a
poser hearing aid. Scandinavian Audiology, 2, 161–170.
Hicks, B. L., Braida, L. D., and Durlach, N. I. (1981). Pitch-
concerted e¤ort to collect such data. Estimates range
invariant frequency lowering with nonuniform spectral from 1% to 12%, but the validity of these figures is
compression. Proceedings of the IEEE International Con- undermined by the lack of standard criteria for diagnos-
ference on Acoustics, Speech, and Signal Processing (pp. ing FHL in children (Broad, 1980). Pracy et al. (1996)
121–124). New York: IEEE. suggest that the incidence is higher than expected and
Ling, D. (1972). Auditory discrimination of speech transposed cannot be compared with the reported incidence in adults.
by a sample-and-hold process. In G. Fant (Ed.), Pro- A number of studies that do report incidence suggest that
ceedings of the International Symposium on Speech Com- FHL occurs more than twice as often in girls as in boys
munication and Profound Deafness, Stockholm (1970) (pp. (Dixon and Newby, 1959; Brockman and Hoversten,
323–333). Washington, DC: A.G. Bell Association. 1960; Campanelli, 1963). In pediatric studies of FHL, the
Oppenheim, A. V., and Johnson, D. H. (1972). Discrete
representation of signals. Proceedings of the IEEE, 60,
condition is typically diagnosed in adolescence (Berger,
681–691. 1965; Radkowski, Cleveland, and Friedman, 1998).
Reed, C. M., Power, M. H., Durlach, N. I., Braida, L. D., The ability to describe a characteristic audiometric
Foss, K. K., Reid, J. A., et al. (1991). Development and and behavioral profile of FHL in children is made
testing of low-frequency speech codes. Journal of Rehabili- problematic by its multiple manifestations and defi-
tation Research and Development, 28, 67–82. nitions. Aplin and Rowson (1990) described four
subgroups of FHL: (1) cases involving emotional prob- to conduct, is appropriate only for unilateral hearing
lems that typically preceded the audiologic examination; losses. The Doerfler-Stewart test (1946), Bekesy audio-
(2) cases in which mild anxiety or conscious mechanisms metry (Jerger and Herer, 1961), and the Lombard test
produce a hearing loss during an audiologic examina- (Black, 1951) require special equipment and are rarely
tion; (3) cases of malingering (the hearing loss is delib- used in adults in the twenty-first century.
erately and consciously assumed for the purposes of Cross-check procedures such as otoacoustic emissions
financial gain); and (4) cases in which the audiologic and acoustic reflex testing can be used to determine the
discrepancies are artifactual as a result of lack of under- reliability of pure-tone thresholds (Radkowski, Cleve-
standing or inattention during audiometric testing. land, and Friedman, 1998). However, for actual thresh-
The hallmark of FHL in children is inconsistency in old determination, the auditory brainstem response
audiometric test results. The diagnosis of FHL in chil- (ABR) measurement has been shown to be e¤ective.
dren is generally easier than in adults, because children Yoshida, Noguchi, and Uemura (1989) performed ABR
are less able to consistently produce erroneous results on audiometry with 39 school-age children presenting with
repeated testing (Pracy et al., 1996). A common presen- suspected FHL and found normal hearing in 65 ears of
tation of possible FHL is the child who demonstrates 35 patients. Although ABR audiometry is not a hearing
no di‰culty in conversational speech but has a volun- test, Sanders and Lazenby (1983) suggest it can be a
tary pure-tone audiogram that suggests di‰culty in powerful tool in the identification and quantification of
speech recognition (Stark, 1966). Speech audiometric FHL.
results that are better than the pure-tone results are an- After detecting FHL and determining true hearing
other common indicator of FHL in children (Aplin and levels, Hosoi, Murata, and Levitt (1999) suggest it is es-
Rowson, 1990). Behavioral responses to speech audio- sential that information about the cause of the func-
metry typical of FHL in children include responding to tional hearing loss be obtained. Reports on FHL in
only one syllable of a test word or to only one phoneme children cite lack of attention, deflection of attention,
presented at a given intensity (Pracy et al., 1996). Hosoi, school di‰culties, a history of abuse, and emotional
Tsuta, Murata, and Levitt (1999) reported several indi- problems as possible causes, among others.
cators of FHL in children seen during audiometric Barr (1963) found that the extra attention paid to
testing. These indicators included (1) poor test-retest re- children because of their purported hearing di‰culty had
liability, (2) a saucer-shaped audiometric configuration, encouraged them, consciously or unconsciously, to feign
(3) the absence of a shadow curve with a severe to pro- hearing loss. Children with previous knowledge of ear
found unilateral hearing loss, and (4) a discrepancy be- problems may use lack of hearing and withdrawal from
tween the speech reception threshold and the pure-tone communication as a response to problems experienced at
average. school (Aplin and Rowson, 1986). A history of trauma
Misunderstanding, confusion, or unfamiliarity with immediately preceding the complaint of hearing loss was
the test directions or procedures must be determined be- reported in 18 patients studied by Radkowski, Cleve-
fore further testing for FHL is initiated. After reinstruc- land, and Friedman (1998). Drake et al. (1995) found
tion, modification of speech audiometry is undertaken that FHL may be an indicator of child abuse.
when FHL in a child is suspected. Pracy et al. (1996) A high suspicion in approaching children will not de-
described successful use of the Fournier technique lay early intervention in cases of an organic hearing loss,
(1956), in which the level of speech presentation is but failure to recognize FHL can be costly and poten-
varied, thereby tricking the patient into responding at a tially hazardous to the pediatric patient (Radkowski,
level at which he or she had not previously responded. Cleveland, and Friedman, 1998). Unnecessary radio-
Other techniques using speech audiometry include use of graphic studies or exploratory surgical intervention,
an ascending threshold determination technique (Harris, inappropriate amplification, and financial and psycho-
1958) and presentation of informal conversation at levels social costs are among the possible outcomes of in-
below the voluntary pure-tone thresholds. appropriate diagnosis of FHL in children. Spraggs,
Hosoi et al. (1999) proposed suggestion audiometry Burton, and Graham (1994) reported on five adult
as a useful technique to detect FHL in children as well as patients who underwent assessment for cochlear im-
to determine true hearing levels. In suggestion audio- plantation and were found to have nonorganic hearing
metry, standard audiometric procedures are preceded by loss. Similar findings have not been reported in the pe-
an information session on the benefits of using a hearing diatric population, but with the proliferation of cochlear
aid. The child is shown a hearing aid, given information implants in children, accurate diagnosis of hearing loss is
about it, and ultimately wears the hearing aid in the o¤ essential.
position during testing. The hearing aid is without tub- Once the diagnosis of FHL in a child has been estab-
ing or an earmold; consequently, the earphone used lished, a nonconfrontational approach is recommended.
during testing is placed over the auricle with the hearing Giving the child the opportunity to ‘‘save face’’ often
aid. Hosoi et al. found that 16 of 20 children diagnosed can be achieved with reinstructing, reassuring, retesting,
with FHL showed a significant change in hearing level and convincing the child that the hearing loss will im-
following the suggestion technique. prove. Retesting with accurate results is often accom-
The use of adult-oriented tests of pseudohypacusis plished during one visit but may require follow-up visits.
has had limited success in children demonstrating FHL. Labeling the child a malingerer is detrimental to the
The Stenger test (Chaiklin and Ventry, 1965), while easy child and decreases the probability of obtaining accurate
Genetics and Craniofacial Anomalies 477
behavioral thresholds. If the child is labeled as exhibiting Pracy, J., and Bowdler, D. (1996). Pseudohypacusis in
FHL, the chance to back out gracefully is lost, and the children. Clinical Otolaryngology and Allied Sciences, 21,
functional component of the hearing loss may be so- 383–384.
lidified (Pracy et al., 1996). Pracy and Bowdler (1996) Pracy, J., Walsh, R., Mepham, G., and Bowdler, D. (1996).
Childhood pseudohypacusis. International Journal of Pedi-
advocate an approach that treats the child as if the
atric Otorhinolaryngology, 37, 143–149.
hearing loss were real, followed by the use of speech Radkowski, D., Cleveland, S., and Friedman, E. (1998).
audiometry techniques to determine actual hearing Childhood pseudohypacusis inpatients with high risk for
thresholds. In recalcitrant cases, ABR may be required actual hearing loss. Larygoscope, 108, 1534–1538.
for threshold determination. Psychiatric referral is rarely Sanders, J., and Lazenby, B. (1983). Auditory brain stem re-
necessary or desirable and should be reserved only for sponse measurement in the assessment of pseudohypoa-
the intractable child (Bowdler and Rogers, 1989). cusis. American Journal of Otology, 4, 292–299.
Spraggs, P., Burton, M., and Graham, J. (1994). Nonorganic
—Patricia McCarthy hearing loss in cochlear implant candidates. American
References Journal of Otology, 15, 652–657.
Stark, E. (1966). Functional hearing loss in children. Illinois
Aplin, D., and Rowson, V. (1986). Personality and functional Medical Journal, 130, 628–631.
hearing loss in children. British Journal of Clinical Psychol- Yoshida, M., Noguchi, A., and Uemura, T. (1989). Functional
ogy, 25, 313–314. hearing loss in children. International Journal of Pediatric
Aplin, D., and Rowson, V. (1990). Psychological character- Otorhinolaryngology, 17, 287–295.
istics of children with functional hearing loss. British Jour-
nal of Audiology, 24, 77–87. Further Readings
Barr, B. (1963). Psychogenic deafness in school children.
Audiology, 2, 125–128. Hayes, R. (1992). Non-organic hearing loss in young persons.
Berger, K. (1965). Nonorganic hearing loss in children. La- British Journal of Audiology, 26, 347–350.
ryngoscope, 75, 447–457. Noble, W. (1987). The conceptual problem of functional hear-
Black, J. (1951). The e¤ect of delayed side-tone upon vocal rate ing loss. British Journal of Audiology, 21, 1–3.
and intensity. Journal of Speech and Hearing Disorders, 16, Ventry, I., and Chaiklin, J. (1965). Multidiscipline study of
56–60. functional hearing loss. Journal of Auditory Research, 5,
Bowdler, D., and Rogers, J. (1989). The management of 179–272.
pseudohypacusis in school-age children. Clinical Otolaryn-
gology and Allied Sciences, 14, 211–215.
Broad, R. (1980). Developmental and psychodynamic issues
related to causes of childhood functional hearing loss. Child Genetics and Craniofacial Anomalies
Psychiatry and Human Development, 11, 49–58.
Brockman, S., and Hoversten, G. (1960). Pseudo-neural hypa-
cusis in children. Laryngoscope, 70, 825–839. Genetic factors contribute to more than half of all
Campanelli, P. (1963). Simulated hearing losses in school chil- congenital hearing losses and are also responsible for
dren following identification audiometry. Journal of Audi-
later-onset hearing losses. Understanding the factors
tory Research, 3, 91–108.
Carhart, R. (1961). Tests for malingering. Transactions of the underlying hereditary hearing loss requires locating the
American Academy of Opthalmology and Otolaryngology, genes responsible for hearing loss and defining the spe-
65, 437. cific mechanisms and functions of those genes. From a
Chaiklin, J., and Ventry, I. (1965). The e‰ciency of audio- clinical standpoint, this information may contribute to
metric measures used to identify functional hearing loss. improved management strategies for individuals with
Journal of Auditory Research, 5, 219–230. hereditary hearing loss and their families. Accurate de-
Dixon, R., and Newby, H. (1959). Children with non-organic termination of the auditory characteristics associated
hearing problems. Archives of Otolaryngology, 70, 619–623. with various genetic abnormalities requires the use of
Doerfler, L., and Stewart, K. (1946). Malingering and psycho- measures sensitive to subtle aspects of auditory function.
genic deafness. Journal of Speech Disorders, 11, 181–186.
Drake, A., Makielski, K., McDonald-Bell, C., and Atcheson,
B. (1995). Two new otolaryngologic findings in child abuse. Hereditary Hearing Loss. Congenital (hereditary) hear-
Archives of Otolaryngology–Head and Neck Surgery, 121, ing loss occurs in approximately 1–2 of 1000 births,
1417–1420. and at least 50% of all cases of hearing loss have a ge-
Fournier, J. (1956). La depistage de la stimulation auditive. In netic origin (Morton, 1991). Although hereditary hear-
Exposes annuels d’oto-rhino’laryngologie (pp. 107–126). ing losses may occur in conjunction with other disorders
Paris: Masson. as part of a syndrome, the majority of cases are non-
Harris, D. (1958). A rapid and simple technique for the detec- syndromic. Later-onset hereditary hearing loss occurs at
tion of non-organic hearing loss. Archives Otolaryngology, various ages from the first decade to later in life.
68, 758–760.
Hosoi, H., Tsuta, Y., Murata, T., and Levitt, H. (1999). Sug-
gestion audiometry for non-organic hearing loss (pseudo-
Chromosomes. Human cells contain 23 pairs of chro-
hypoacusis) in children. International Journal of Pediatric mosomes (22 pairs of autosomes and two sex chromo-
Otorhinolaryngology, 47, 11–21. somes). Hereditary material in the form of DNA is
Jerger, J., and Herer, G. (1961). An unexpected dividend in carried as genes on chromosomes. Cells reproduce by
Bekesy audiometry. Journal of Speech and Hearing Dis- mitosis (meiosis for the sex chromosomes), where chro-
orders, 26, 390–391. mosomes divide, resulting in two genetically similar
cells. Errors can occur during mitotic or meiotic division, fecting all frequencies (Liu and Xu, 1994). Autosomal
resulting in cells with chromosomal abnormalities and dominant nonsyndromic hereditary hearing loss tends to
an individual with a chromosomal defect. be less severe, more often delayed in onset, progressive,
and a¤ecting high frequencies. Patients with X-linked
Genotype and Phenotype. Genotype describes an indi- hearing loss generally have prelingual onset but are
vidual’s genetic constitution. Phenotype relates to the clinically diverse.
physical characteristics of an individual and can include Mutations in the GJB2 (connexin 26) gene may ex-
information obtained from physiological, morphologi- plain greater than 50% of autosomal recessive deafness
cal, and biochemical studies. Auditory tests contribute to in some populations (Zelante et al., 1997). The GJB2
the phenotypic description. gene encodes the protein connexin 26 (Cx26), thought to
be essential for maintenance of high potassium in the
Inheritance Patterns scala media of the inner ear. Several mutations in the
Hereditary hearing loss follows several patterns of in- GJB2 gene have been associated with hearing loss, and
heritance. Autosomal recessive inheritance occurs in mutation sites vary among world populations. A 35delG
70%–80% of individuals with nonsyndromic hearing mutation is common in some Mediterranean-based
loss. To display a recessive trait, a person must acquire populations (Denoyelle et al., 1997), while a 167delT
one abnormal gene for the trait from each parent. mutation is most common in the Ashkenazi Jewish pop-
Parents are heterozygous for the trait since they each ulation (Morell et al., 1998). Cx26 mutations are gener-
carry one abnormal gene and one normal gene. Thus, ally responsible for recessive deafness, although they
recessively inherited defects appear among the o¤spring have been observed in dominant deafness.
of phenotypically normal parents who are both carriers Hearing losses associated with Cx26 mutations are
of a single recessive gene for the trait. When both cochlear in nature but vary widely in degree, ranging
parents are carriers, the chance of a child receiving two from mild to profound, and stability (e.g., Cohn et al.,
copies of the abnormal gene and displaying the pheno- 1999; Denoyelle et al., 1999; Mueller et al., 1999; Wilcox
type is 25%. The parents’ chance of having a carrier et al., 2000). Hearing losses resulting from the same
child is 50%, and there is a 25% chance of having a child genetic mutations show wide variability in degree and
with no gene for the defect. In cases of nonsyndromic progression. Furthermore, audiometric characteristics
recessive hearing loss, a genetic source may be suspected are not directly linked to a particular type of mutation
in families with two or more occurrences of the disorder. (e.g., Cohn et al., 1999; Sobe et al., 2000).
Recessive inheritance occurs more commonly in non-
syndromic than in syndromic hearing loss. Chromosomal Defects. Down syndrome is the most
In autosomal dominant inheritance, a single copy of common autosomal defect. The a¤ected individual has
an abnormal gene can result in hearing loss; thus, an an additional chromosome 21 (trisomy 21), for a total
a¤ected parent has a 50% chance of passing that gene of 47 chromosomes, or a translocation trisomy. Down
to their child. Autosomal dominant hereditary hearing syndrome is characterized by mental retardation and a
loss occurs in approximately 15%–20% of nonsyndromic number of craniofacial and other characteristics. Hear-
hearing loss and is more commonly associated with syn- ing loss may be congenital, sensory, and there is a high
dromic hearing loss. Other inheritance patterns that can incidence of middle ear disorders. Trisomy 13 and 18
result in hearing loss are X-linked, at a rate of 2%–3%, syndromes, less common and with more dramatic ab-
and mitochondrial, which occurs in less than 1% of normalities, are characterized by inner ear dysplasias
cases. involving the organ of Corti and stria vacularis, external
and middle ear malformations, cleft lip and palate, and
Variability in Hereditary Hearing Loss. Phenotypic other defects.
and genetic heterogeneity is pronounced, with reports of
more than 400 forms of syndromic and nonsyndromic
hereditary hearing loss (Gorlin, Toriello, and Cohen,
Syndromes Associated with Hearing Loss
1955). Considerable variation exists among hereditary There are far too many syndromes associated with
hearing losses, between dominant and recessive hear- hearing loss to include in this brief entry. A useful
ing losses, among various forms of either recessive or method of classifying hearing loss was provided by
dominant hearing loss, and even among persons with Konigsmark and Gorlin (1976), where genetic and met-
the same genetic mutations. Furthermore, the same abolic hearing losses were divided into major categories
genes have been found responsible for both syndromic depending on the organ system or metabolic defect
and nonsyndromic hearing loss, and have been asso- involved.
ciated with both autosomal dominant and recessive Usher syndrome is the most common syndrome asso-
transmission. ciated with hearing loss and eye abnormalities, specifi-
Hereditary hearing losses range from mild to pro- cally retinitis pigmentosa. There are several types and
found (Nance and Sweeney, 1975). In subjects with subtypes and various genetic loci. Other syndromes in-
autosomal recessive nonsyndromic hereditary hearing volving vision are Cockayne syndrome and Alstrom
loss, onset of the hearing loss tends to be congenital, se- disease, associated with retinal disorders. Treacher Col-
vere to profound in degree, stable over time, and af- lins syndrome, Goldenhar syndrome (hemifacial micro-
Genetics and Craniofacial Anomalies 479
somia), Crouzon syndrome (craniofacial dysostosis), quency range, type, and progression of a hearing loss,
Apert syndrome, otopalatal-digital syndrome, and and whether one or both ears are a¤ected. Other, more
osteogenesis imperfecta are all associated with muscu- sensitive measures (such as otoacoustic emissions, e¤er-
loskeletal disease. ent reflexes, and auditory-evoked potentials) are neces-
Waardenburg syndrome, characterized by displaced sary to understand the nature of a hearing loss in more
medial canthi, white forelock, heterochromia, and broad detail.
nasal root, is the most prominent hearing syndrome The majority of hereditary hearing losses are non-
involving the integumentary system. Alport syndrome syndromic, with no associated disorders that might raise
is a combination of progressive hearing loss, progres- the index of suspicion or aid in diagnosis. Furthermore,
sive renal disease, and ocular lens abnormalities. Pen- since the majority of nonsyndromic hearing losses are
dred syndrome, mucopolysaccharidosis, and Jervell and recessively inherited, parents are not a¤ected by hearing
Lange-Nielsen syndrome are associated with metabolic loss. There may be no history of hearing loss in the
and other abnormalities. The diverse syndromes asso- family, so this risk factor would not be an indicator to
ciated with neurological disorders include Friedreich raise suspicion of hearing loss. Thus, identification of
ataxia and acoustic neuromas and neural deafness. nonsyndromic, and particularly recessively inherited,
Craniofacial anomalies associated with hearing hearing loss is particularly challenging clinically.
loss may be sporadic, inherited, due to disturbances See also speech disorders: genetic transmission.
during embryonic development, of toxic origin, or re-
—Linda J. Hood
lated to chromosomal abnormalities. These maldevelop-
ments may be of unknown origin or related to known
syndromes. References
Inner ear dysplasias include Michel deafness, which is
rare and involves complete inner ear dysplasia, Mondini Cohn, E. S., Kelley, P. M., Fowler, T. W., et al. (1999). Clini-
deafness, and Scheibe deafness (Schuknecht, 1974). In cal studies of families with hearing loss attributable to
mutations in the connexin 26 gene (GJB2/DFNB1). Pedi-
Mondini deafness, the bony cochlear capsule is flat-
atrics, 103, 546–550.
tened, with underdevelopment of the apical turn of the Denoyelle, F., Martin, S., Weil, D., et al. (1999). Clinical fea-
cochlea and possible saccular and endolymphatic in- tures of the prevalent form of childhood deafness, DFNB1,
volvement. Hearing loss is typically moderate to pro- due to a connexin-26 gene defect: Implications for genetic
found but varies widely. Scheibe deafness involves the counseling. Lancet, 353, 1298–1303.
membranous portion of the cochlea and saccule, greater Denoyelle, F., Weil, D., Maw, M. A., et al. (1997). Prelingual
in basal portions, and is the most common of the inner deafness: High prevalence of a 30delG mutation in the
ear dysplasias. connexin 26 gene. Human Molecular Genetics, 6, 2173–
External and middle ear anomalies are associated 2177.
with improper development of the first and second Gorlin, R. J., Toriello, H. V., and Cohen, M. M. (1955). He-
reditary hearing loss and its syndromes. Oxford: Oxford
branchial clefts and arches, which are also responsible University Press.
for lower jaw and other structures. Middle ear abnor- Konigsmark, B. W., and Gorlin, R. J. (1976). Genetic and
malities include absence or fusion of the ossicles or metabolic deafness. Philadelphia: Saunders.
abnormalities of the eustachian tube or middle ear cav- Liu, X., and Xu, L. (1994). Nonsyndromic hearing loss: An
ity. Middle ear anomalies may be suspected whenever analysis of audiograms. Annals of Otology, Rhinology, and
other branchial arch anomalies such as external ear Laryngology, 103, 428–433.
atresia, cleft palate, micrognathia, Treacher Collins syn- Morell, R., Kim, H. J., Hood, L. J., et al. (1998). Mutations in
drome, Pierre Robin syndrome, and low-set auricles are the connexin 26 gene (GJB2) among Ashkenazi Jews with
present. Skeletal defects, such as those associated with nonsyndromic recessive deafness. New England Journal of
Apert syndrome, Klippel-Feil syndrome, and Paget dis- Medicine, 339, 1500–1505.
Morton, N. E. (1991). Genetic epidemiology of hearing im-
ease, and connective tissue disorders, such as those re- pairment. In R. J. Ruben, T. R. van de Water, and K. P.
lated to Hunter-Hurler or Möbius syndromes, may also Steel (Eds.), Genetics of hearing impairment. Annals of the
indicate the presence of middle ear anomalies. Malde- New York Academy of Sciences, 630, 16–31.
velopment of the external ear includes preauricular tags, Mueller, R. F., Nehammer, A., Middleton, A., et al. (1999).
microtic or deformed pinna, or partial or complete atre- Congenital non-syndromal sensorineural hearing impair-
sia of the external canal. The presence of external or ment due to connexin 26 gene mutations: Molecular and
middle ear anomalies may indicate additional malfor- audiological findings. International Journal of Pediatric
mations or reduced hearing, depending on the structures Otorhinolaryngology, 50, 3–13.
and degree of involvement. Nance, W. E., and Sweeney, A. (1975). Genetic factors in
deafness of early life. Otolaryngolic Clinics of North Amer-
ica, 8, 19–48.
Evaluation and Classification of Hearing Loss Schuknecht, H. F. (1974). Pathology of the ear. Cambridge,
MA: Harvard University Press.
The characteristics of a hearing loss are important in Sobe, T., Vreugde, S., Shahin, H., et al. (2000). The prevalence
understanding relationships, or lack of relationships, be- and expression of inherited connexin 26 mutations asso-
tween genotype and phenotype. The audiogram provides ciated with nonsyndromic hearing loss in the Israeli popu-
a general description of the degree, configuration, fre- lation. Human Genetics, 106, 50–57.
Wilcox, S. A., Saunders, K., Osborn, A. H., et al. (2000). High Audibility measures usually combine information
frequency hearing loss correlated with mutations in the about the availability of amplified acoustical speech cues
GJB2 gene. Human Genetics, 106, 399–405. with weighting factors proportional to the importance of
Zelante, L., Gasparini, P., Estivill, X., et al. (1997). Connexin those cues for speech recognition. The availability of
26 mutations associated with the most common form of
cues may be limited by sensitivity thresholds or compet-
non-syndromic neurosensory autosomal recessive deafness
(DFNB1) in Mediterraneans. Human Molecular Genetics, ing noises, as well as by the speech level. The importance
6, 1605–1609. of cues for speech intelligibility varies with frequency
and type of speech. Studies of normal-hearing listeners
and listeners with mild to moderate hearing impairments
Hearing Aid Fitting: Evaluation of have shown that measures of weighted audibility can
provide rather accurate predictions of speech intelligi-
Outcomes bility scores for these individuals in a laboratory setting.
There are several well-researched approaches to obtain-
ing audibility measures (e.g., Studebaker, 1992; ANSI,
The outcomes of hearing aid fitting can be assessed in
terms of the technical merit of the device in situ or in 1997). Some methods incorporate the e¤ects of age,
speech level, or hearing loss to improve the accuracy
terms of the extent to which the device alleviates the
of speech intelligibility predictions for elderly hearing-
daily problems of the hearing-impaired person and his or
her family. Early e¤orts to measure outcomes focused impaired listeners who are exposed to high-level ampli-
fied speech.
mainly on technical merit. It was assumed that if the
Measuring the recognition of amplified speech is per-
instrument was technically superior, real-life outcomes
haps the most venerable approach to evaluating hear-
would be proportionately superior. However, this is
ing aid fitting outcomes. Many standardized tests are
not always the case. Real-life problems associated with
available, in both open-set and closed-set varieties, with
hearing impairment are complicated by issues such as
stimuli ranging from nonsense syllables to sentences. The
personality, lifestyle, environment, and family dynamics.
Thus, it is now recognized that real-life outcomes of a popularity of speech intelligibility testing as a measure of
outcome is rooted in its high level of face validity. The
fitting must be assessed separately from the technical
most frequently cited reason for obtaining amplification
adequacy of the hearing aid. These two types of out-
comes are evaluated at di¤erent times after fitting. is a need to improve communication ability. A measure
of improved speech understanding consequent on hear-
Technical merit data are often obtained as part of the
ing aid fitting addresses that need in an attractively
verification process conducted immediately after fitting.
direct manner. For many years, this measure was the
Sometimes these data may prompt modifications of the
bedrock of hearing aid fitting outcome evaluation.
fitting. Alleviation of real-life problems is evaluated after
Unfortunately, in order to achieve a useful level of sta-
the hearing-impaired person has had time to use the
tistical power, speech intelligibility tests must include a
hearing aid on a daily basis. This evaluation is usually
large number of test items. This requirement has limited
made after at least 2 weeks of use of the device.
the recent use of speech intelligibility testing mostly to
Evaluation of Technical Merit research applications (e.g., Gatehouse, 1998).
The importance of producing amplified sounds that
The technical merit of a fitted hearing aid may be re- are acceptably loud has long been recognized. Although
flected in both acoustical and psychoacoustical data. using loudness data to facilitate fitting protocols has
Acoustical outcomes include real ear probe microphone been advocated for many years, interest in measuring
measures (such as insertion gain, aided response, and the loudness of amplified sounds following the fitting
saturation response) and audibility measures (such as has burgeoned since the widespread acceptance of wide
articulation index and speech intelligibility index). dynamic range compression devices. With these instru-
Psychoacoustical outcomes include speech recognition ments, it is appealing to assess the extent to which environ-
scores, aided loudness assessment, and ratings of quality, mental sounds, including speech, have been ‘‘normalized’’
clarity, pleasantness, or other dimensions. by amplification. Interest in this type of outcome data is
Real ear probe microphone measures provide infor- increasing, and some of the measurement issues have
mation about the sound delivered to the eardrum of the been addressed (e.g., Cox and Gray, 2001).
particular patient (e.g., Mueller, Hawkins, and North- Formal ratings of aspects of amplified sounds, such as
ern, 1992). This takes into consideration the physical quality, clarity, distortion, and the like, have been used
di¤erences among patients and the di¤erences between frequently in assessing technical merit in research appli-
real ears and standard couplers such as the 2 cm 3 cou- cations but have not often been advocated for clinical
plers used in the ANSI measurement standard (Ameri- use. These types of measures are commonly performed
can National Standards Institute [ANSI], 1996). One with listeners supplying a rating on a semantic di¤eren-
advantage of these measures is their ability to confirm tial scale, such as the 11-point version developed by
the extent to which the fitting is congruent with pre- Gabrielsson and Sjogren (1979). This approach has the
scriptive fitting target values. Real ear probe micro- advantage of permitting quantification of dimensions of
phone data are also valuable for troubleshooting fitting amplified sounds that are not psychoacoustically acces-
problems. sible via other metrics.
Hearing Aid Fitting: Evaluation of Outcomes 481
Evaluation of Real-World Impact not to be sensitive to the changes that result from hear-
ing aid use (Bess, 2000).
The real-life e¤ectiveness of a hearing aid is measured It is not unusual to observe that an individual who
using subjective data provided by the hearing-impaired reports substantial hearing aid benefit is nevertheless not
person or significant others. Numerous questionnaires satisfied with the device or does not use amplification
have been developed and standardized specifically for very often. These observations suggest that daily use and
the purpose of assessing hearing aid fitting outcomes, hearing aid satisfaction are additional, distinct dimen-
and many others have been conscripted to serve this ap- sions of real-world outcome that require separate as-
plication (Noble, 1998; Bentler and Kramer, 2000). In sessment (e.g., Cox and Alexander, 1999).
addition to standardized questionnaires, there is strong
support for use of personalized instruments in which Relationship Between Technical Merit and
the patient identifies the items and thus creates a cus- Real-World Impact
tomized questionnaire (e.g., Dillon, James, and Ginis,
1997). Regardless of which type of inventory is used, Numerous studies have shown that measures of technical
there are at least seven di¤erent domains of subjective merit are not strongly predictive of real-world outcomes
outcomes of hearing aid fitting that can be assessed. of hearing aid fitting (e.g., Souza et al., 2000; Walden
They include residual activity limitations, residual par- et al., 2000). Principal components analyses in studies
ticipation restrictions, impact on others, use, benefit, using multiple outcome measures often show that the
satisfaction, and quality-of-life changes. Most invento- two types of measures tend to occupy separate factors
ries do not assess all of these domains. (e.g., Humes, 1999). Many researchers feel that both
Residual activity limitations are the di‰culties the types of data are essential for a full description of hear-
hearing aid wearer continues to have in everyday tasks ing aid fitting outcome.
such as understanding speech, localizing sounds, and the See also hearing aids: prescriptive fitting.
like. The activity limitations experienced by a specific —Robyn M. Cox
individual will depend on the demands of that person’s
lifestyle. Residual participation restrictions are the un-
resolved problems or barriers the hearing aid wearer References
encounters to involvement in situations of daily life. This American National Standards Institute. (1996). American Na-
also di¤ers with individuals but can include such cir- tional Standard Specification of Hearing Aid Characteristics
cumstances as participation in church services, bridge (ANSI S3.22–1996). New York: Acoustical Society of
clubs, and so on. ICF (2001) provides a full discussion of America.
activity limitations and participation restrictions. American National Standards Institute. (1997). American Na-
Hearing impairment often places a heavy burden on tional Standards Method for the Calculation of the Speech
the family and friends as well as on the involved indi- Intelligibility Index (ANSI S3.5–1997). New York: Acous-
tical Society of America.
vidual. In fact, encouragement (or compulsion) by sig- Bentler, R. A., and Kramer, S. E. (2000). Guidelines for
nificant others is sometimes the major factor prompting choosing a self-report outcome measure. Ear and Hearing,
an individual to seek a hearing aid. The relief provided 21(4, Suppl.), 37S–49S.
by amplification for the problems in the family constel- Bess, F. H. (2000). The role of generic health-related quality of
lation (i.e., the impact on others) is an important out- life measures in establishing audiological rehabilitation
come dimension but one that has received relatively little outcomes. Ear and Hearing, 21(4, Suppl.), 74S–79S.
attention to date. Cox, R. M., and Alexander, G. C. (1999). Measuring satisfac-
A measure of benefit quantifies change in a hearing- tion with amplification in daily life: The SADL Scale. Ear
related dimension of functioning as a result of using and Hearing, 20, 306–320.
amplification. Benefit may be measured directly, in terms Cox, R. M., and Gray, G. A. (2001). Verifying loudness
perception after hearing aid fitting. American Journal of
of degree of change (small versus large), or it may be Audiology, 10, 91–98.
computed by comparing aided and unaided performance Dillon, H., James, A., and Ginis, J. (1997). The Client Ori-
on a particular dimension. Typical dimensions on which ented Scale of Improvement (COSI) and its relationship to
subjective benefit is measured are activity limitations and several other measures of benefit and satisfaction provided
participation restrictions. Hearing-specific question- by hearing aids. Journal of the American Academy of
naires are typically used to quantify hearing aid benefit. Audiology, 8, 27–43.
Sometimes general, non-hearing-specific question- Gabrielsson, A., and Sjogren, H. (1979). Perceived sound
naires are used to determine changes that result from quality of hearing aids. Scandinavian Audiology, 8, 159–169.
hearing aid provision. These types of data tend to be Gatehouse, S. (1998). Speech tests as measures of outcome.
interpreted as reflecting changes in general quality of life. Scandinavian Audiology, 27(Suppl. 49), 54–60.
Humes, L. E. (1999). Dimensions of hearing aid outcome.
A recent large-scale study found that hearing aid use was Journal of the American Academy of Audiology, 10, 26–39.
significantly associated with improvements in many as- ICF. (2001). International Classification of Functioning, Dis-
pects of life, including social life and mental health ability, and Health. Geneva: World Health Organization.
(Kochkin and Rogin, 2000). Despite the importance of Kochkin, S., and Rogin, C. (2000). Quantifying the obvious:
these e¤ects for individuals, functional health status The impact of hearing instruments on quality of life. Hear-
measures that are often used to gauge quality of life tend ing Review, 7, 6–34.
Mueller, H. G., Hawkins, D. B., and Northern, J. L. (1992). Hearing Aids: Prescriptive Fitting
Probe microphone measurements: Hearing aid selection
and assessment. San Diego, CA: Singular Publishing
Group. Prescriptive procedures are used in hearing aid fittings to
Noble, W. (1998). Self-assessment of hearing and related func- select an appropriate amplification characteristic based
tions. London, U.K.: Whurr.
on measurements of the auditory system. The advan-
Souza, P. E., Yueh, B., Sarubbi, M., and Loovis, C. F. (2000).
Fitting hearing aids with the Articulation Index: Impact on tages of using a prescriptive procedure as opposed to an
hearing aid e¤ectiveness. Journal of Rehabilitation Research evaluative or other approach are (1) they can be used
and Development, 37, 473–481. with all populations in a time-e‰cient way, (2) they help
Studebaker, G. A. (1992). The e¤ect of equating loudness on the clinician select a suitable parameter combination
audibility-based hearing aid selection procedures. Journal of from among an almost unlimited number possible in
the American Academy of Audiology, 3, 113–118. modern hearing aids and settings, and (3) they can be
Walden, B. E., Surr, R. K., Cord, M. T., Edwards, B., and verified. On the negative side, there is little interaction
Olsen, L. (2000). Comparison of benefits provided by dif- with the client when fitting hearing aids according to
ferent hearing aid technologies. Journal of the American a prescriptive procedure, and any two people with the
Academy of Audiology, 11, 540–560.
same type of loss may have di¤erent preferences for the
loudness and the tone of sounds.
More than fifty prescriptive procedures for fitting
Further Readings hearing aids have been presented. The procedures vary
with respect to the type of amplification characteristic
Cox, R. M., and Alexander, G. C. (1995). The Abbreviated that is prescribed, the type of data the procedure is based
Profile of Hearing Aid Benefit. Ear and Hearing, 16, 176– on, and the aim of the procedure.
186. The parameter most commonly prescribed in hearing
Cox, R. M., Alexander, G. C., and Gray, G. A. (1999).
Personality and the subjective assessment of hearing
aids is gain as a function of frequency. In linear devices,
aids. Journal of the American Academy of Audiology, 10, 1– only one gain/frequency response is prescribed, which
13. applies to all input levels that do not cause the hearing
Dillon, H. (2001). Assessing the outcomes of hearing rehab- aid to saturate. If the hearing aid is nonlinear (contains
ilitation. In H. Dillon, Hearing aids (pp. 349–369). New compressor amplifiers), gain varies as a function of both
York: Thieme. frequency and input level. In that case, gain/frequency
Dillon, H., and So, M. (2000). Incentives and obstacles to the curves are prescribed for di¤erent input levels, or the
routine use of outcomes measures by clinicians. Ear and static compression parameters are prescribed for selected
Hearing, 21(4, Suppl.), 2S–6S. frequencies. To avoid excessive loudness when listening
Gagne, J.-P., McDu¤, S., and Getty, L. (1999). Some limi- to high-intensity input levels, the maximum output of
tations of evaluative investigations based solely on normed
outcome measures. Journal of the American Academy of
the hearing device must also be prescribed.
Audiology, 10, 46–62. Some procedures prescribe the amplification char-
Gatehouse, S. (1999). Glasgow Hearing Aid Benefit Profile: acteristic based on threshold levels only. Others use
Derivation and validation of a client-centered outcome suprathreshold loudness judgments, such as the most
measure for hearing aid services. Journal of the American comfortable level (MCL), the loudness discomfort level
Academy of Audiology, 10, 80–103. (LDL), or the entire loudness scale. Supporters of
Humes, L. E., Halling, D., and Coughlin, M. (1996). Reliabil- threshold-based procedures argue that loudness data are
ity and stability of various hearing-aid outcome measures in di‰cult to measure and unreliable, especially in children
a group of elderly hearing-aid wearers. Journal of Speech and special populations, and that preferred gain and
and Hearing Research, 39, 923–935. maximum output can be adequately predicted from
Kricos, P. B. (2000). The influence of non-audiological varia-
bles on audiological rehabilitation outcomes. Ear and
threshold levels. The argument for using individually
Hearing, 21(4, Suppl.), 7S–14S. measured loudness data is that the fitting will be more
Larson, V. D., Williams, D. W., Henderson, W. G., Luethke, accurate because hearing aid users with the same audio-
L. E., Beck, L. B., No¤singer, D., et al. (2000). E‰cacy of gram can perceive loudness di¤erently. Table 1 lists
3 commonly used hearing aid circuits: A crossover trial. some of the most widely used prescription procedures
Journal of the American Medical Association, 284, 1806– developed to date. They are categorized according to
1813. which parameters are prescribed and the data used.
Studebaker, G. A. (1991). Measures of intelligibility and qual- Most procedures for fitting linear devices share the
ity. In G. A. Studebaker, F. H. Bess, and L. B. Beck (Eds.), general aim of amplifying speech presented at an aver-
The Vanderbilt Hearing Aid Report II (pp. 185–194). Park- age level to a comfortable level situated approximately
ton, MD: York Press.
Ventry, I. M., and Weinstein, B. E. (1982). The Hearing
halfway between threshold and LDL. The rationale is
Handicap Inventory for the Elderly: A new tool. Ear and that such a response provides optimum speech under-
Hearing, 3, 128–134. standing and comfortable listening in general situations.
Weinstein, B. E. (1997). Outcome measures in the hearing aid Despite this common rationale, the assumptions and
fitting/selecting process (Trends in Amplification, No. 2(4)). underlying operational principles behind each procedure
New York: Woodland. vary, producing very di¤erent formulas. The assump-
Hearing Aids: Prescriptive Fitting 483
Table 1. An Overview of Widely Used Prescriptive Hearing Aid Fitting Procedures

Linear Amplification Nonlinear Amplification Output
Threshold based Berger FIG6 (Figure 6) POGO
POGO, POGO II (Prescription of NAL-NL1 (National Acoustic Laboratories’ NAL-SSPL
Gain/Output) nonlinear formula, version 1)
NAL-R, NAL-RP (National
Acoustic Laboratories)
Use supra-threshold CID (Central Institute for the Deaf) LGOB (Loudness Growth of Octave Bands) CID
loudness data DSL* (Desired Sensation Level) IHAFF (Independent Hearing Aid Fitting Forum) MSU*
MSU* (Memphis State University) DSL(i/o)* (Desired Sensation Level, input/output IHAFF
function)
ScalAdapt (Adaptive fitting by category loudness
scaling)
*Supra-threshold loudness data may be measured or predicted from threshold levels.
tions presented include the following: (1) The audibility quency region is audible (Ching, Dillon, and Byrne,
of all speech components is important for speech under- 1998). Consequently, the goal of achieving equal loud-
standing (e.g., DSL; Seewald, Ross, and Spiro, 1985). (2) ness is progressively relaxed within the NAL-NL1 rule
Speech discrimination is highest when speech is pre- as hearing loss increases.
sented at levels above MCL (e.g., MSU; Cox, 1988). (3) The rationale behind most nonlinear prescription
Speech is best understood when speech bands at di¤erent procedures, however, is loudness normalization. Exam-
frequencies have equal loudness (e.g., NAL-R; Byrne ples are LGOB (Humes et al., 1996), FIG6 (Killion and
and Dillon, 1986; and CID; Skinner et al., 1982). (4) For Fikret-Pasa, 1993), IHAFF (Cox, 1995), DSL[i/o] (Cor-
hearing aid users with mild to moderate losses, speech nelisse, Seewald, and Jamieson, 1995), and ScalAdapt
presented at average input levels is restored to the MCL (Kiessling, Schubert, and Archut, 1996). The assump-
when providing gain equal to about half the amount of tion behind this rationale is that ‘‘normal hearing’’ is
threshold loss (e.g., Berger, Hagberg, and Rane, 1977; best for speech understanding and for listening to envi-
and POGO; McCandless and Lyregaard, 1983). ronmental sounds. Loudness normalization is achieved
Some of these procedures take the shape of the by applying the gain needed to make narrow-band stim-
speech spectrum into consideration when prescribing uli of any input level just as loud for the impaired ear
gain at each frequency (DSL, MSU, CID, and NAL-R), as they are for normal ears. This rationale maintains
whereas others introduce a reduction in the low- the interfrequency variation of loudness that normally
frequency gain to avoid upward spread of masking from occurs for speech (Fig. 1). Consequently, loudness nor-
low-frequency ambient noise (Berger, POGO). Either malization is not consistent with the principles of equal-
way, the net result is that, even for a flat hearing loss, izing loudness across frequency and deemphasizing
less gain is prescribed in the low than in the high fre- loudness at those frequencies where loss is greatest,
quencies. The NAL-R procedure di¤ers from the other principles that have emerged from research into linear
linear procedures in two respects. First, the gain pre- amplification.
scribed at any frequency is a¤ected by the degree of loss
at other frequencies. Second, it is the only procedure that
is well supported by direct empirical data (e.g., Byrne
and Cotton, 1988).
One procedure for fitting nonlinear devices, NAL-
NL1 (Dillon, 1999), follows the common rationale of
procedures for fitting linear devices by aiming at max-
imizing speech intelligibility for any input level. To
avoid amplifying all input levels to a most comfortable
level, which probably would make the loudness of envi-
ronmental sounds unacceptable, the rationale uses the
constraint that for any input level, the overall loudness
of speech must not exceed normal loudness. This proce-
dure prescribes gain/frequency responses that make the
speech bands approximately equal in loudness (Fig. 1), Figure 1. Graph illustrating loudness perception of speech
which is in agreement with several procedures for fitting when the interfrequency variation of intensity for speech is
linear devices. As hearing loss at any frequency be- maintained (loudness normalization) and when the intensity
comes severe or profound, the ear becomes less able to of speech bands has been equalized to maximize speech
extract information, even when the signal in that fre- intelligibility.
Figure 2. Audiogram and prescribed insertion gain curves for Figure 3. Audiogram and prescribed insertion gain curves for
a person with a moderate flat hearing loss. For IHAFF, the a person with a moderate to severe low-frequency hearing
targets are calculated based on average threshold-dependent loss. For IHAFF, the targets are calculated based on average
loudness data (Cox, personal communication). Note that the threshold-dependent loudness data (Cox, personal communi-
NAL-NL1 rule does not prescribe insertion gain at frequencies cation). Note that the NAL-NL1 rule does not prescribe inser-
where it is doubtful that amplification will contribute to speech tion gain at frequencies where it is doubtful that amplification
intelligibility. will contribute to speech intelligibility.
Because of the di¤erent assumptions and principles Such variations can be programmed into di¤erent mem-
used by the various procedures, di¤erent procedures ories in a multimemory hearing aid.
prescribe di¤erent amplification characteristics for the Some procedures also prescribe the maximum output
same type of hearing loss (Figs. 2–5). The di¤erences are of the hearing aid known as the saturation sound pres-
more pronounced for flat and reverse sloping loss than sure level (SSPL). It is important to have the output level
for the more common high-frequency sloping loss. of the hearing aid correctly adjusted. If the SSPL is too
A recent evaluation of loudness normalization versus high, the hearing aid can cause discomfort or damage to
speech intelligibility maximization suggests that when the hearing aid user. On the other hand, if the SSPL is
the di¤erence in prescription between the two rationales too low, the hearing aid user may experience insu‰cient
is substantial, hearing aid users prefer and perform loudness and excessive saturation. Most procedures that
better with the speech intelligibility maximization ratio- prescribe SSPL aim at avoiding discomfort. In those
nale (Keidser and Grant, 2001). cases the SSPL is set equal to or just below the hearing
The gain/frequency curves may be prescribed accord- aid user’s discomfort level; examples are CID, MSU,
ing to the acoustic input the client is likely to experience. POGO, and IHAFF. Only one procedure, NAL-SSPL,
Simple variations applied to the amplification charac- considers both the maximum output level and the mini-
teristic prescribed to compensate for the hearing loss mum output level and prescribes a level halfway between
have proved useful for compensating for defined changes these two extremes (Dillon and Storey, 1998). This is
in the acoustic input (Keidser, Dillon, and Byrne, 1996). also the only procedure for prescribing the output level
Hearing Aids: Prescriptive Fitting 485
Figure 4. Audiogram and prescribed insertion gain curves for a

Figure 5. Audiogram and prescribed insertion gain curves for a
person with a gently sloping high-frequency hearing loss. For
person with a steeply sloping high-frequency hearing loss. For
IHAFF, the targets are calculated based on average threshold-
IHAFF, the targets are calculated based on average threshold-
dependent loudness data (Cox, personal communication). Note
dependent loudness data (Cox, personal communication). Note
that the NAL-NL1 rule does not prescribe insertion gain at
that the NAL-NL1 rule does not prescribe insertion gain at
frequencies where it is doubtful that amplification will contrib-
frequencies where it is doubtful that amplification will contrib-
ute to speech intelligibility.
ute to speech intelligibility.
that has been experimentally evaluated. It was found to client has tried the hearing aid in everyday listening
provide an SSPL that did not require fine-tuning for 80% environments.
of clients (Storey et al., 1998). The most commonly used prescription procedures
Many prescription procedures target a sensorineural are readily available in an electronic format, either as
loss of mild to moderate degree. Appropriate adjust- specifically designed computer programs, in programs
ments to the prescriptions may be needed if prescribing provided by hearing aid manufacturers for fitting their
amplification for clients with a conductive component programmable devices, or in equipment for measuring
(Lybarger, 1963), and a severe to profound loss (POGO real-ear gain.
II: Schwartz, Lyregaard, and Lundt, 1988; NAL-RP: See also hearing aid fitting: evalution of out-
Byrne, Parkinson, and Newall, 1990). Some adjustments comes; hearing aids: sound quality.
are also needed for clients who are fitted with one versus
—Gitte Keidser and Harvey Dillon
two hearing aids (Dillon, 2001).
When the hearing aid has been adjusted according References
to the prescriptive procedures, the setting can be veri-
fied against the prescribed target, either in a hearing aid Berger, R. A., Hagberg, E. N., and Rane, R. L. (1977). Pre-
scription of hearing aids. Kent, OH: Herald.
test box or in the real ear. Verifying the prescriptive pa-
Byrne, D., and Cotton, S. (1988). Evaluation of the National
rameters in the real ear allows individual configura- Acoustic Laboratories’ new hearing aid selection procedure.
tions of the ear canal and the acoustic coupling between Journal of Speech and Hearing Research, 31, 178–186.
ear and hearing aid to be taken into consideration. Byrne, D., and Dillon, H. (1986). The National Acoustic Lab-
For some clients fine-tuning may be needed after the oratories’ (NAL) new procedure for selecting the gain and
frequency response of a hearing aid. Ear and Hearing, 7, Further Readings

257–265.
Byrne, D., Parkinson, A., and Newall, P. (1990). Hearing aid Barker, C., Dillon, H., and Newall, P. (2001). Fitting low ratio
gain and frequency response requirements for the severely/ compression to people with severe and profound hearing
profoundly hearing impaired. Ear and Hearing, 11, 295– losses. Ear and Hearing, 22, 130–141.
300. Bustamante, D., Braida, L. (1987). Multiband compression
Ching, T. Y. C., Dillon, H., and Byrne, D. (1998). Speech rec- limiting for hearing-impaired listeners. Journal of Rehabili-
ognition of hearing-impaired listeners: Predictions from au- tative Research and Development, 24, 149–160.
dibility and the limited role of high frequency amplification. Byrne, D. (1983). Theoretical prescriptive approaches to
Journal of the Acoustical Society of America, 103, 1128– selecting the gain and frequency response of a hearing aid.
1140. Monographs in Contemporary Audiology, 4(1).
Cornelisse, L., Seewald, R., and Jamieson, D. (1995). The Byrne, D. (1986). E¤ects of frequency response characteristics
input/output formula: A theoretical approach to the fitting on speech discrimination and perceived intelligibility and
of personal amplification devices. Journal of the Acoustical pleasantness of speech for hearing-impaired listeners. Jour-
Society of America, 97, 1854–1864. nal of the Acoustical Society of America, 80, 494-504.
Cox, R. (1988). The MSU hearing instrument prescription Byrne, D. (1996). Hearing aid selection for the 1990s: Where
procedure. Hearing Instruments, 39, 6–10. to? Journal of the American Academy of Audiology, 7, 377–
Cox, R. (1995). Using loudness data for hearing aid selection: 395.
The IHAFF approach. Hearing Journal, 48(2), 39–44. Byrne, D., Dillon, H., Katsch, R., Ching, T., and Keidser, G.
Dillon, H. (1999). NAL-NL1: A new prescriptive fitting pro- (2001). The NAL-NL1 procedure for fitting non-linear
cedure for non-linear hearing aids. Hearing Journal, 52(4), hearing aids: Characteristics and comparisons with other
10–16. procedures. Journal of the American Academy of Audiology,
Dillon, H. (2001). Hearing aids. Sydney, Australia: Boomerang 12, 37–51.
Press. Danaher, E. S., and Pickett, J. M. (1975). Some masking
Dillon, H., and Storey, L. (1998). The National Acoustic Lab- e¤ects produced by low-frequency vowel formants in per-
oratories’ procedure for selecting the saturation sound sons with sensorineural loss. Journal of Speech and Hearing
pressure level of hearing aids: Theoretical derivation. Ear Research, 18, 79–89.
and Hearing, 19, 255–266. Dillon, H., and Murray, N. (1987). Accuracy of twelve
Humes, L. E., Pavlovic, C., Bray, V., and Barr, M. (1996). methods for estimating the real ear gain of hearing aids. Ear
Real-ear measurement of hearing threshold and loudness. and Hearing, 8, 2–11.
Trends in Amplification, 1(4), 121–135. Gatehouse, S. (1993). Role of perceptual acclimatization in the
Keidser, G., Dillon, H., and Byrne, D. (1996). Guidelines for selection of frequency responses for hearing aids. Journal of
fitting multiple memory hearing aids. Journal of the Ameri- the American Academy of Audiology, 4, 296–306.
can Academy of Audiology, 7, 406–418. Harford, E., and Barry, J. (1965). A rehabilitative approach to
Keidser, G., and Grant, F. (2001). Comparing loudness nor- the problem of unilateral hearing impairment: Contralateral
malization (IHAFF) with speech intelligibility maximiza- routing of signals (CROS). Journal of Speech and Hearing
tion (NAL-NL1) when implemented in a two-channel Disorders, 30, 121–138.
device. Ear and Hearing, 22, 501–515. Hogan, C. A., and Turner, C. W. (1998). High-frequency au-
Kiessling, J., Schubert, M., and Archut, A. (1996). Adaptive dibility: Benefits for hearing-impaired listeners. Journal of
fitting of hearing instruments by category loudness scaling the Acoustical Society of America, 104, 432–441.
(ScalAdapt). Scandinavian Audiology, 25, 153–160. Kamm, C., Dirks, D. D., and Mickey, M. R. (1978). E¤ect of
Killion, M. C., and Fikret-Pasa, S. (1993). The 3 types of sen- sensorineural hearing loss on loudness discomfort level and
sorineural hearing loss: Loudness and intelligibility consid- most comfortable level judgements. Journal of Speech and
erations. Hearing Journal, 46(11), 31–36. Hearing Disorders, 21, 668–681.
Lybarger, S. (1963). Simplified fitting system for hearing aids. Keidser, G. (1995). The relationship between listening con-
Cantonsburg, PA: Radioear Co. ditions and alternative amplification schemes for multiple
McCandless, G. A., and Lyregaard, P. E. (1983). Prescription memory hearing aids. Ear and Hearing, 16, 575–586.
of gain/output (POGO) for hearing aids. Hearing Instru- Killion, M. C., and Monser, E. L. (1980). Corfig coupler re-
ments, 34, 16–21. sponse for flat insertion gain. In G. A. Studebaker and
Schwartz, D., Lyregaard, P., and Lundh, P. (1988). Hearing I. Hochberg (Eds.), Acoustical factors a¤ecting hearing aid
aid selection for severe-to-profound hearing loss. Hearing performance (pp. 147–168). Baltimore: University Park
Journal, 41(2), 13–17. Press.
Seewald, R., Ross, M., and Spiro, M. (1985). Selecting ampli- Macrae, J. (1994). A review of research into safety limits for
fication characteristics for young hearing-impaired children. amplification by hearing aids. Australian Journal of Audiol-
Ear and Hearing, 6, 48–53. ogy, 16, 67–77.
Skinner, M., Pascoe, D., Miller, J., and Popelka, G. (1982). Moore, B. C., Alcantara, J. I., and Glasberg, B. R. (1998).
Measurements to determine the optimal placement of Development and evaluation of a procedure for fitting
speech energy within the listener’s auditory area: A basis for multi-channel compression hearing aids. British Journal of
selecting amplification characteristics. In G. Studebaker, Audiology, 32, 177–195.
and F. Bess (Eds.), The Vanderbilt hearing-aid report (pp. Mueller, H. G., Hawkins, D. B., and Northern, J. L. (1992).
161–169). Upper Darby, PA: Monographs in Comtempo- Probe microphone measurements: Hearing aid selection and
rary Audiology. assessment. San Diego, CA: Singular Press Publishing
Storey, L., Dillon, H., Yeend, I., and Wigney, D. (1998). The Group.
National Acoustic Laboratories’ procedure for selecting the Plomp, R. (1978). Auditory handicap of hearing impairment
saturation sound pressure level of hearing aids: Experimen- and the limited benefit of hearing aids. Journal of the
tal validation. Ear and Hearing, 19, 267–279. Acoustical Society of America, 63, 533–549.
Hearing Aids: Sound Quality 487
Rankovic, C. (1991). An application of the articulation index There is also good agreement among studies that too
to hearing aid fitting. Journal of Speech and Hearing Re- much high-frequency emphasis degrades sound quality.
search, 34, 391–402. This type of amplification is characterized by descrip-
Skinner, M. W. (1980). Speech intelligibility in noise-induced tions of sound as shrill, harsh, and tinny (e.g., Gabriels-
hearing loss: E¤ects of high-frequency compensation. Jour-
son and Sjogren, 1979; Gabrielsson, Schenkman, and
nal of the Acoustical Society of America, 67, 306–317.
Swan, I., and Gatehouse, S. (1987). Optimum side for fitting a Hagerman, 1988). Thompson and Lassman (1970),
monaural hearing aid. 3. Preference and benefit. British Neuman and Schwander (1987), and Leijon et al. (1991)
Journal of Audiology, 21, 205–208. found that the sound quality of a flat frequency response
Turner, C. W., Humes, L. E., Bentler, R. A., and Cox, R. M. was preferred to a response with extreme high-frequency
(1996). A review of past research on changes in hearing aid emphasis. The results of these studies point to the need
benefit over time. Ear and Hearing, 17(3, Suppl.), 14S–28S. for balance between the low- and high-frequency energy
Walker, G. (1997). Conductive hearing impairment and pre- for good sound quality. Of course, the optimum balance
ferred hearing aid gain. Australian Journal of Audiology, 19, for any person depends on the way that person’s hearing
81–89. loss varies with frequency. It has also been realized that
the frequency response requirements for good sound
quality for music di¤er from those for speech. An ex-
Hearing Aids: Sound Quality tended low-frequency response is more important for
music than for speech (e.g., Franks, 1982).
A smoother frequency response has better sound
The electroacoustic characteristics of a hearing aid di¤er quality than a frequency response with peaks. Davis and
considerably from those of a high-fidelity communica- Davidson (1996) found that hearing-impaired listeners
tion system. The hearing aid has a relatively narrow preferred to listen to speech processed through a hearing
bandwidth, and the frequency response of the hearing aid with a moderate amount of damping that smoothed
aid is almost never flat. Because hearing loss usually the large resonant peak in the frequency response of
increases with increasing frequency, the frequency re- the hearing aid. This preference was true for both male
sponse of the hearing aid typically provides increased and female voices in quiet and in noise. Smoothing of
gain as a function of frequency in order to obtain audi- the frequency response resulted in judgments of greater
bility for the hearing aid user. While hearing-impaired brightness, clarity, distinctness, fullness, nearness, and
listeners may require considerable gain in order to ob- openness. Similarly, van Buuren, Festen, and Houtgast
tain audibility, the output of the hearing aid must be (1996) investigated the e¤ect of adding single and mul-
limited in order to prevent the amplified sound from be- tiple peaks to a smooth frequency response. Hearing-
coming uncomfortably loud when input levels are high. impaired subjects rated pleasantness on a five-point
A number of studies have been carried out to derating scale. The smooth frequency response was rated
termine the e¤ect of manipulating the electroacoustic as having better sound quality than any of the frequency
characteristics of a hearing aid on sound quality. Sound responses with peaks. Based on the results of the study,
quality is a multidimensional construct. Gabrielsson and the researchers recommended that peak-to-valley ratios
Sjogren (1979) identified eight important ‘‘dimensions’’ in the frequency response of the hearing aid should not
related to the manipulation of the frequency response exceed 5 dB.
of loudspeakers, headphones, and hearing aids. These In spite of the general agreement among studies about
dimensions are clarity, fullness, brightness, shrillness, the e¤ect of frequency response on sound quality, Pre-
loudness, spaciousness, nearness, and disturbing sounds minger and Van Tasell (1995) have shown intersubject
(distortion). The ‘‘overall impression of quality’’ consists di¤erences with regard to their ratings of the various
of some weighting of these various dimensions. dimensions of sound quality as a function of the manip-
The e¤ect of manipulating the frequency response ulation of frequency response. They suggested that
of the hearing aid has been assessed in many studies. measures of speech quality be used to select among al-
The bandwidth, amount of low-frequency versus high- ternative frequency responses yielding similar speech in-
frequency gain, and the smoothness of the frequency re- telligibility (close to 100%).
sponse will all a¤ect the sound quality of speech. Several Much of the research described above was carried out
studies have shown that the presence of low-frequency with linear hearing aids, with testing carried out at a
content is a strong determinant of good sound quality single input level. However, many current hearing aids
for hearing-impaired listeners listening at comfortable are nonlinear, which means that the frequency response
listening levels (Punch, 1978; Punch et al., 1980; Punch characteristics change as a function of the input level.
and Parker, 1981; Tecca and Goldstein, 1984; Punch and Full evaluation of sound quality would require testing
Beck, 1986). However, the bandwidth yielding better with various signals at multiple input levels and deter-
sound quality changes as a function of the input level to mining optimum sound quality at each level.
the hearing aid and the amount of amplification pro- The method of output limiting is another hearing aid
vided by the hearing aid. When hearing-impaired sub- parameter that has an important e¤ect on sound quality.
jects listen at higher levels, a frequency response with less Output limiting is used in a hearing aid to prevent
low-frequency amplification yields better sound quality amplified sounds from becoming uncomfortably loud
(Tecca and Goldstein, 1984). and to protect the ear from excessively loud sounds that
might cause further damage to the hearing. Major quality revealed that clarity, pleasantness, background
methods of output limiting include peak clipping, com- noise, loudness, and overall impression all showed neg-
pression limiting, or wide dynamic range compression. ative e¤ects of increasing compression ratio (Neuman
Peak clipping causes the generation of signals in the et al., 1998).
output signal that are not in the input signal. Harmonic Release time also a¤ects sound quality. If short re-
distortion (integer multiples of the input signal) and lease times are used, low-level noise is amplified in the
intermodulation distortion (combinations of the har- pauses between words. This amplification of low-level
monics caused by sums and di¤erences of the harmonic noise has been found to have a negative e¤ect on the
distortion) are caused by peak clipping. The coherence perceived sound quality. Neuman and colleagues (1998)
between the input signal and the output signal is pre- found that hearing-impaired listeners’ ratings of the
dictive of sound quality. Moderate amounts of peak clarity, pleasantness, and overall quality of speech in
clipping degrade the sound quality of speech in quiet and quiet and speech in noise all decreased as release time
in noise, and the sound quality of music (Fortune and was decreased from 1000 ms to 60 ms (single-band-
Preves, 1992; Palmer et al., 1995; Kozma-Spytek, Kates, compression hearing aid). Ratings of the amount of
and Revoile, 1996). There is also an interaction between background noise increased as release time decreased.
the frequency response shaping of the hearing aid and It is clear that the electroacoustic characteristics of a
the clipping level. This interaction is subject-dependent hearing aid have a significant e¤ect on sound quality.
(Kozma-Spytek, Kates, and Revoile, 1996). Fortune Sound quality has been recognized as an important fac-
and Preves (1992) found specifically that hearing tor in the acceptability of a hearing aid to the user, and
aids having less distortion (higher coherence) were per- because of individual di¤erences among listeners, it has
ceived as having better clarity and brightness, as pro- been suggested that sound quality should be considered
ducing less discomfort, and as yielding better overall a factor in hearing aid fitting (e.g., Gabrielsson and
sound quality. Sjogren, 1979; Kuk and Tyler, 1990; Preminger and Van
Compression is a nonlinear form of amplification in Tasell, 1995; Lunner et al., 1997). Past research has
which the gain of the amplifier is decreased as the input shown that characteristics of the listeners, characteristics
to the amplifier increases. Compression amplification of the signal being amplified, and characteristics of the
may be used to limit output (compression limiting) or amplification system all a¤ect sound quality. Applica-
to fit a wide range of signals into the listener’s dynamic tion of sound quality judgments to the fitting of non-
range (wide dynamic range compression). In general, linear and multimemory hearing aids should be helpful
compression limiting preserves sound quality better than in determining the appropriate settings for these devices
peak clipping. Compression limiting does not generate (e.g., Keidser, Dillon, and Byrne, 1995).
harmonic and intermodulation distortion and yields See also hearing aid fitting: evaluation of out-
higher coherence values. Hawkins and Naidoo (1993) comes; hearing aids: prescriptive fitting.
compared the e¤ect of asymmetrical peak clipping and —Arlene C. Neuman
compression limiting on sound quality and clarity of
speech in quiet, speech in noise, and music. For both References
sound quality and clarity, and for all three types of Boike, K. T., and Souza, P. E. (2000). E¤ect of compression
stimuli, compression limiting was preferred to peak clip- ratio on speech recognition and speech-quality ratings with
ping under conditions in which the hearing aid input was wide dynamic range compression amplification. Journal of
high enough to cause limiting. Speech Language and Hearing Research, 43, 456–468.
For hearing aids utilizing wide dynamic range com- Davis, L. A., and Davidson, S. A. (1996). Preference for and
pression, compression ratio, attack, and release time all performance with damped and undamped hearing aids by
a¤ect the sound quality of the processed signal. The ef- listeners with sensorineural hearing loss. Journal of Speech
fect of compression variables also depends on whether and Hearing Research, 39, 483–493.
Fortune, T. W., and Preves, D. A. (1992). Hearing aid satura-
the signal of interest occurs in quiet or in noise. Several tion and aided loudness discomfort. Journal of Speech and
studies have shown that high compression ratios have a Hearing Research, 35, 175–185.
negative e¤ect on sound quality (Neuman et al., 1994, Franks, J. R. (1982). Judgments of hearing aid processed mu-
1998; Boike and Souza, 2000). Neuman and colleagues sic. Ear and Hearing, 3, 18–23.
(1994, 1998) found that compression ratios higher than Gabrielsson, A., Schenkman, B. N., and Hagerman, B. (1988).
3 : 1 significantly degraded the sound quality of a single- The e¤ects of di¤erent frequency responses on sound qual-
band-compression hearing aid. Compression ratios that ity judgments and speech intelligibility. Journal of Speech
did not significantly degrade sound quality in quiet, and Hearing Research, 31, 166–177.
degraded sound quality in noise. The sound quality of Gabrielsson, A., and Sjogren, H. (1979). Perceived sound
linear amplification (no compression) was preferred quality of hearing aids. Scandinavian Audiology, 8, 159–169.
Hawkins, D. B., and Naidoo, S. V. (1993). Comparison of
when background noise levels were high. Boike and sound quality and clarity with asymmetrical peak clipping
Souza (2000) also found that speech quality ratings and output limiting compression. Journal of the American
decreased with increasing compression ratio for speech Academy of Audiology, 4, 221–228.
in noise. Compression ratio did not significantly degrade Keidser, G., Dillon, H., and Byrne, D. (1995). Candidates for
quality for speech in quiet. Research to determine the multiple frequency response characteristics. Ear and Hear-
e¤ect of compression on specific dimensions of sound ing, 16, 575–586.
Hearing Loss and the Masking-Level Di¤erence 489
Kozma-Spytek, L., Kates, J. M., and Revoile, S. G. (1996). Barker, C., and Dillon, H. (1999). Client preference for com-
Quality ratings for frequency-shaped peak-clipped speech: pression threshold in single channel wide dynamic range
Results for listeners with hearing loss. Journal of Speech and compression hearing aids. Ear and Hearing, 20, 127–139.
Hearing Research, 39, 1115–1123. Crain, T. R., and Van Tasell, D. J. (1994). E¤ect of peak clip-
Kuk, F. K., and Tyler, R. S. (1990). Relationship between ping on speech recognition threshold. Ear and Hearing, 15,
consonant recognition and subjective ratings of hearing 443–453.
aids. British Journal of Audiology, 24, 171–177. Kuk, F. K. (1991). Perceptual consequence of vents in hearing
Leijon, A., Lindkvist, A., Ringdahl, A., and Israelsson, B. (1991). aids. British Journal of Audiology, 24, 163–169.
Sound quality and speech reception for prescribed hearing Kuk, F. K., Harper, T., and Doubek, K. (1994). Preferred real-
aid frequency responses. Ear and Hearing, 12, 251–260. ear insertion gain on a commercial hearing aid at di¤erent
Lunner, T., Hellgren, J., Arlinger, S., and Elberling, C. (1997). speech and noise levels. Journal of the American Academy of
A digital filterbank hearing aid: Three digital signal pro- Audiology, 5, 99–109.
cessing algorithms—User preference and performance. Ear Kuk, F. K., Tyler, R. S., and Mims, L. (1990). Subjective rat-
and Hearing, 18, 373–387. ings of noise-reduction hearing aids. Scandinavian Audiol-
Neuman, A. C., Bakke, M. H., Hellman, S., and Levitt, H. ogy, 19, 237–244.
(1994). E¤ect of compression ratio in a slow-acting com- Naidoo, S. V., and Hawkins, D. B. (1997). Monaural/binaural
pression hearing aid: Paired-comparison judgments of preferences: E¤ect of hearing aid circuit on speech intelligi-
quality. Journal of the Acoustical Society of America, 96, bility and sound quality. Journal of the American Academy
1471–1478. of Audiology, 8, 188–202.
Neuman, A. C., Bakke, M. H., Mackersie, C., Hellman, S., and Preminger, J. E., and Van Tasell, D. J. (1995). Quantifying the
Levitt, H. (1998). The e¤ect of compression ratio and release relation between speech quality and speech intelligibility.
time on the categorical rating of sound quality. Journal of the Journal of Speech and Hearing Research, 39, 714–725.
Neuman, A. C., and Schwander, T. J. (1987). The e¤ect of fil-
tering on the intelligibility and quality of speech in noise. Hearing Loss and the Masking-Level
Journal of Rehabilitation Research and Development, 24,
127–134. Di¤erence
Palmer, C. V., Killion, M. C., Wilber, L. A., and Ballad, W. J.
(1995). Comparison of two hearing aid receiver-amplifier
combinations using sound quality judgments. Ear and The masking-level di¤erence (MLD) (Hirsh, 1948) refers
Hearing, 16, 597–598. to a binaural paradigm in which masked signal detection
Preminger, J. E., and Van Tasell, D. J. (1995). Measurement of is contrasted between conditions di¤ering with respect
speech quality as a tool to optimize the fitting of a hearing to the availability of binaural di¤erences cues. The most
aid. Journal of Speech and Hearing Research, 38, 726–736. common MLD paradigm has two conditions. In the
Punch, J. L. (1978). Quality judgments of hearing-aid pro- first, NoSo, both the masker and signal are presented in
cessed speech and music by normal and otopathologic phase to the two ears. In this condition, the composite
listeners. Journal of the American Auditory Society, 3, 179– stimulus of signal plus noise contains no binaural di¤er-
188.
ence cues. In the second condition, NoSp, the masker is
Punch, J. L., and Beck, L. B. (1986). Relative e¤ects of low-
frequency amplification on syllable recognition and speech presented in phase to the two ears, but the signal is pre-
quality. Ear and Hearing, 7, 57–62. sented 180 out of phase at the two ears. In this condi-
Punch, J. L., Montgomery, A. A., Schwartz, D. M., Walden, tion, the composite stimulus of signal plus noise contains
B. E., Prosek, R. A., and Howard, M. T. (1980). Multi- binaural di¤erence cues of time and amplitude. There
dimensional scaling of quality judgments of speech signals are many other MLD conditions, but an underlying
processed by hearing aids. Journal of the Acoustical Society similarity is that all involve at least one condition in
of America, 68, 458–466. which the addition of the signal results in a change in
Punch, J. L., and Parker, C. A. (1981). Pairwise listener pref- the distribution of interaural time di¤erences, inter-
erences in hearing aid evaluation. Journal of Speech and aural amplitude di¤erences, or both interaural time and
amplitude di¤erences. For a broadband masker and a
Tecca, J. E., and Goldstein, D. P. (1984). E¤ect of low-
frequency hearing aid response on four measures of speech 500-Hz signal frequency, the threshold for the NoSp
perception. Ear and Hearing, 5, 22–29. condition is approximately 15 dB better than that for the
Thompson, G., and Lassman, F. (1970). Listener preference NoSo condition, reflecting the sensitivity of the auditory
for selective vs. flat amplification for a high-frequency system to the small interaural di¤erences that are intro-
hearing-loss population. Journal of Speech and Hearing Re- duced when the Sp signal is presented in the No noise.
search, 12, 594–606. The magnitude of the MLD is most robust at relatively
van Buuren, R. A., Festen, J. M., and Houtgast, T. (1996). low signal frequencies, but under specific circumstances,
Peaks in the frequency response of hearing aids: Evaluation the MLD can be quite large at high frequencies (Mc-
of the e¤ects on speech intelligibility and sound quality. Fadden and Pasanen, 1978). Whereas the anatomical
stage of processing most critical for the MLD has its
locus in the auditory brainstem, the MLD also hinges
Further Readings upon the fidelity of more peripheral auditory processing.
Balfour, P. B., and Hawkins, D. B. (1992). A comparison of
sound quality judgments for monaural and binaural hearing Neurological Disorders. Some of the most prominent
aid processed stimuli. Ear and Hearing, 13, 331–339. applications of the MLD to clinical populations have
concerned patients with lesions a¤ecting the auditory

nerve and auditory brainstem. The rationale for using
the MLD in such cases was based on the assumption
that the critical stages of auditory processing underlying
the MLD occur in the low or mid-brainstem. It was
reasoned that lesions a¤ecting the transmission of fine
timing information within this region would be asso-
ciated with reduced MLDs. The results from several
audiological investigations have supported this assump-
tion. For example, reduced MLDs have been reported
in listeners with tumors of the auditory nerve and
low brainstem, and in listeners with multiple sclerosis
(Quaranta and Cervellera, 1974; Olsen and No¤singer,
1976; Olsen, No¤singer, and Carhart, 1976; Lynn et al.,
1981). Poor binaural performance in such cases has been Figure 1. MLDs for a 500-Hz pure tone presented in a
attributed to gross changes in the temporal discharge 58 dB/Hz, 100-Hz wide band of noise centered on 500 Hz. The
patterns in the peripheral auditory nervous system, due open circles represent data from young adults and the filled
to either physical pressure on the nerve or, in the case of triangles represent data from elderly adults. All listeners had
multiple sclerosis, demyelination of low brainstem neu- normal hearing thresholds. Data are adapted from Grose et al.
ral tissue. In additional support of the idea that the (1994).
MLD is determined by relatively peripheral auditory
processes, several studies have indicated that the MLD is
usually not reduced in listeners having specifically corti- logical testing because of its sensitivity to neural audi-
cal auditory lesions (e.g., Bocca and Antonelli, 1976). tory dysfunction. Unfortunately, the MLD is a¤ected by
Binaural tests other than the MLD have also been a wide range of hearing pathologies, making the clini-
used to probe for the existence of peripheral auditory cal specificity of this test relatively poor. For example,
neural disorder. For example, a test of interaural time the MLD is often reduced in listeners with cochlear
discrimination termed phase response audiometry has hearing loss (e.g., Olsen and No¤singer, 1976; Hall,
been applied to patients having neural lesions in the au- Tyler, and Fernandes, 1984; Jerger, Brown, and Smith,
ditory periphery (Nilsson and Liden, 1976; Almqvist, 1984). MLDs are particularly likely to be reduced in
Almqvist, and Johnson, 1989). In general, such patients cases of asymmetrical cochlear hearing loss, but reduced
have been found to have a reduced ability to discrimi- MLDs are also quite common in cases of symmetrical
nate changes in interaural time di¤erences. hearing loss. Although reduced MLDs in cochlear hear-
ing loss may sometimes be accounted for in terms of a
Hearing Dysfunction Related to Aging. Presbyacusis relatively low sensation level of stimulation or in terms
refers not only to the cochlea-based losses of threshold of stimulation asymmetry, in some studies MLDs in lis-
sensitivity that typically accompany the normal aging teners with cochlear hearing loss (particularly Ménière’s
process but also to possible auditory neural dysfunction disease) are reduced more than would be expected from
that may coexist with (or exist independently of ) coch- stimulus level and asymmetry factors (Schoeny and
lear loss. In general, results from studies of the MLD in Carhart, 1971; Sta¤el et al., 1990). Whereas such find-
the elderly indicate reduced MLDs with advancing lis- ings undermine the MLD as a test that can di¤erentiate
tener age. MLDs are often reduced in presbycusic lis- between cochlear and retrocochlear sites of lesion, they
teners, particularly when hearing loss is present at the point to the potential of this test for understanding the
frequencies of the test stimulus. Of greater interest is the e¤ects of cochlear hearing loss on the processing of bin-
fact that MLDs are sometimes reduced in elderly lis- aural information.
teners (Fig. 1) even when the audiograms of the listeners One general finding associated with cochlear hearing
do not indicate an age-related hearing loss (e.g., Grose, loss is variability of results among listeners. This clearly
Poth, and Peters, 1994). Such findings are usually inter- holds true for the MLD. It is not presently obvious what
preted in terms of abnormal auditory neural processing accounts for the variability in the size of the MLD across
in the aging auditory system. The nature of the underly- listeners with cochlear hearing loss. Some possible fac-
ing neural abnormality accounting for the reduced tors include the cause of the hearing loss, whether a
MLDs in elderly listeners is unknown. It is possible that particular case of hearing loss is associated with a sub-
such a dysfunction could make a significant contribution stantial reduction in the number of nerve fibers con-
to the overall hearing disability associated with aging, as tributing information for binaural analysis, and whether
the MLD measures the kinds of auditory function that the cochlear disease state may a¤ect the symmetry of the
underlie, at least in part, our abilities to localize sound frequency or place encodings at the two cochleae. Al-
sources and to hear desired signals in noise backgrounds. though there is controversy on this point, it has been
speculated that some forms of cochlear hearing impair-
Cochlear Hearing Loss. As reviewed above, the MLD ment may be associated with a reduced ability of the
has potential relevance to site of lesion clinical audio- auditory nerve to phase lock (Woolf, Ryan, and Bone,
Hearing Loss and the Masking-Level Di¤erence 491
1981). This could contribute to reduced MLDs, and fore possible that MLDs could be substantially reduced
could also account for the poor interaural time dis- because of this factor in cases of conductive hearing loss.
crimination found in many cochlear-impaired listeners. Studies of binaural hearing in listeners with conduc-
It is likely that several causes contribute to reduced tive hearing impairment have found that binaural hear-
MLDs among cochlear-impaired listeners. The challenge ing is relatively poor in many subjects (Jonkees and van
of determining the specific mechanisms underlying the der Veer, 1957; Nordlund, 1964; Quaranta and Cervel-
results in particular patients remains. lera, 1974; Hall and Derlacki, 1986). The factors of
hearing asymmetry and the bone conduction route of
Conductive Hearing Loss. In some listening conditions, stimulation (discussed above) probably contribute to this
conductive hearing loss can be considered in terms of a poor binaural hearing. However, it is likely that addi-
simple attenuation of sound. In this sense, performance tional factors are involved. One relevant finding is that
in an ear with conductive hearing loss would be expected binaural hearing does not always return to normal im-
to be similar to that in a normal ear stimulated at lower mediately following middle ear surgery. In studies of
level. The situation for some aspects of binaural hearing adults having otosclerosis, binaural hearing has been
is more complicated. If the conductive loss is di¤erent in found to remain abnormal up to 1–2 years following the
the two ears, the associated attenuation will be asym- restoration of a normal audiogram (Hall and Derlacki,
metrical. This asymmetry could reduce the e‰ciency of 1986; Hall, Grose, and Pillsbury, 1990; Magliulo et al.,
binaural hearing. Colburn and Hausler (1980) also 1990). As indicated in Figure 2, reduced MLDs have
pointed out that another possible source of poor binau- also been found in children with normal audiograms at
ral hearing in conductive impairment is related to bone the time of testing but with a history of hearing loss due
conduction. For sound presented via headphones, both to otitis media with e¤usion (Moore, Hutchings, and
the air conduction route of stimulation and the bone Meyer, 1991; Pillsbury, Grose, and Hall, 1991; Hall,
conduction route of stimulation are theoretically rele- Grose, and Pillsbury, 1995). It has been speculated that
vant. In normal-hearing listeners, the influence of bone- some of the di‰culties in the binaural hearing of con-
conducted sound on the stimuli reaching the cochleae is ductively impaired listeners may be related to a reduc-
probably of no material consequence. In cases of con- tion in the e‰ciency of the neural processing of binaural
ductive hearing loss, however, the bone-conducted sound di¤erence cues (Hall, Grose, and Pillsbury, 1995).
could have a significant e¤ect on the composite wave- The results of MLD studies indicate that binaural
forms reaching the cochleae and could materially a¤ect analysis is often negatively a¤ected in most general types
the distribution of interaural di¤erence cues. It is thereof auditory dysfunction. Whereas poor performance is
Figure 2. MLDs for children without (open circles) and with The data for the normal control group are repeated in the six
(filled triangles) a history of otitis media with e¤usion. The panels. The region between the lines represented the 95% pre-
data were obtained sequentially, before and after the children diction interval for the normal group. Data are adapted from
with a history of otitis media received tympanostomy tubes. Hall et al. (1995).
due to abnormal neural function in cases of retrococh- Nordlund, B. (1964). Directional audiometry. Acta Oto-
lear hearing disorders, the reasons for poor performance Laryngologica, 57, 1–18.
in cases of cochlear and conductive loss are often less Olsen, W., and No¤singer, D. (1976). Masking level di¤erences
clear, particularly when degree and symmetry of hearing for cochlear and brainstem lesions. Annals of Otology, Rhi-
nology, and Laryngology, 85, 1–6.
loss are not su‰ciently explanatory. Identification of the
Olsen, W., No¤singer, D., and Carhart, R. (1976). Masking-
particular factors resulting in abnormal MLD results in level di¤erences encountered in clinical populations.
particular individuals remains a challenge. Audiology, 15, 287–301.
—Joseph W. Hall Pillsbury, H. C., Grose, J. H., and Hall, J. W. (1991). Otitis
media with e¤usion in children: Binaural hearing before and
References after corrective surgery. Archives of Otolaryngology–Head
Almqvist, U., Almqvist, B., and Jonsson, K. E. (1989). Phase Quaranta, A., and Cervellera, G. (1974). Masking level di¤er-
audiometry: A rapid method for detecting cerebello-pontine ences in normal and pathological ears. Audiology, 13, 428–
angle tumours? Scandinavian Audiology, 18, 155–159. 431.
Bocca, E., and Antonelli, A. (1976). Masking level di¤erence: Schoeny, Z., and Carhart, R. (1971). E¤ects of unilateral
Another tool for the evaluation of peripheral and cortical Ménière’s disease on masking level di¤erences. Journal of
defects. Audiology, 15, 480–487. the Acoustical Society of America, 50, 1143–1150.
Colburn, H. S., and Hausler, R. (1980). Note on the modeling Sta¤el, J. G., Hall, J. W. III., Grose, J. H., and Pillsbury, H. C.
of binaural interaction in impaired auditory systems. In (1990). NoSo and NoS pi detection as a function of masker
G. v. d. Brink and F. A. Bilsen (Eds.), Physical, physiolog- bandwidth in normal-hearing and cochlear-impaired lis-
ical and behavioral studies in hearing. Netherlands: Delft teners. Journal of the Acoustical Society of America, 87,
University. 1720–1727.
Grose, J. H., Poth, E. A., and Peters, R. W. (1994). Masking Woolf, N. K., Ryan, A. F., and Bone, R. C. (1981). Neural
level di¤erences for tones and speech in elderly listeners with phase-locking properties in the absence of cochlear outer
relatively normal audiograms. Journal of Speech and Hear- hair cells. Hearing Research, 4, 335–346.
Hall, J. W., and Derlacki, E. L. (1986). E¤ect of conductive Further Readings
hearing loss and middle ear surgery on binaural hearing.
Annals of Otology, Rhinology, and Laryngology, 95, 525– Besing, J. M., and Koehnke, J. (1995). A test of virtual audi-
530. tory localization. Ear and Hearing, 16, 220–229.
Hall, J. W., Grose, J. H., and Pillsbury, H. C. (1990). Predict- Colburn, H. S. (1973). Theory of binaural interaction based on
ing binaural hearing after stapedectomy from pre-surgery auditory nerve data: I. General strategy and preliminary
results. Archives of Otolaryngology–Head and Neck Sur- results on interaural discrimination. Journal of the Acousti-
gery, 116, 946–950. cal Society of America, 54, 1458–1470.
Hall, J. W., Grose, J. H., and Pillsbury, H. C. (1995). Long- Colburn, H. S. (1977). Theory of binaural interaction based on
term e¤ects of chronic otitis media on binaural hearing in auditory-nerve data: II. Detection of tones in noise. Journal
children. Archives of Otolaryngology–Head and Neck Sur- of the Acoustical Society of America, 61, 525–533.
gery, 121, 847–852. Dolan, T. R., and Robinson, D. E. (1967). An explanation of
Hall, J. W., Tyler, R. S., and Fernandes, M. A. (1984). Factors masking-level di¤erences that result from interaural inten-
influencing the masking level di¤erence in cochlear hearing- sive disparities of noise. Journal of the Acoustical Society of
impaired and normal-hearing listeners. Journal of Speech America, 42, 977–981.
and Hearing Research, 27, 145–154. Gabriel, K. J., Koehnke, J., and Colburn, H. S. (1992). Fre-
Hirsh, I. J. (1948). Binaural summation and interaural inhibi- quency dependence of binaural performance in listeners
tion as a function of the level of the masking noise. Journal with impaired binaural hearing. Journal of the Acoustical
of the Acoustical Society of America, 20, 205–213. Society of America, 91, 336–347.
Jerger, J., Brown, D., and Smith, S. (1984). E¤ect of peripheral Green, D. M. (1978). An introduction to hearing. Hillsdale, NJ:
hearing loss on the MLD. Archives of Otolaryngology, 110, Erlbaum.
290–296. Hausler, R., Colburn, H. S., and Marr, E. (1983). Sound lo-
Jonkees, L., and van der Veer, R. (1957). Directional hearing calization in subjects with impaired hearing. Acta Otola-
capacity in hearing disorders. Acta Otolaryngologica, 48, ryngology, Supplement, 400.
465–474. Hawkins, D. B., and Wightman, F. L. (1980). Interaural time
Lynn, G. E., Gilroy, J., Taylor, P. C., and Leiser, R. P. (1981). discrimination ability of listeners with sensori-neural hear-
Binaural masking level di¤erences in neurological disorders. ing loss. Audiology, 19, 495–507.
Archives of Otolaryngologica, 107, 357. Je¤ress, L. A., Blodgett, H. C., and Deatherage, B. H.
Magliulo, G., Gagliardi, M., Muscatello, M., and Natale, A. (1952). The masking of tones by white noise as a func-
(1990). Masking level di¤erence before and after surgery. tion of the interaural phases of both components: I. 500
British Journal of Audiology, 24, 117–121. cycles. Journal of the Acoustical Society of America, 24,
McFadden, D. M., and Pasanen, E. G. (1978). Binaural de- 523–527.
tection at high frequencies with time-delayed waveforms. Kaigham, J. G., Koehnke, J., and Colburn, H. S. (1992). Fre-
Journal of the Acoustical Society of America, 63, 1120–1131. quency dependence of binaural performance in listeners
Moore, D. R., Hutchings, M. E., and Meyer, S. E. (1991). with impaired binaural hearing. Journal of the Acoustical
Binaural masking level di¤erences in children with a history Society of America, 91, 336–347.
of otitis media. Audiology, 30, 91–101. Langford, T., and Je¤ress, L. (1964). E¤ect of noise cross-
Nilsson, R., and Liden, G. (1976). Sound localization with correlation on binaural signal detection. Journal of the
phase audiometry. Acta Oto-Laryngologica, 81, 291–299. Acoustical Society of America, 36, 1455–1458.
Hearing Loss and Teratogenic Drugs or Chemicals 493
Licklider, J. C. R. (1948). The influence of interaural phase to note that most studies of intrauterine ototoxicity
relations upon the masking of speech by white noise. Jour- have been retrospective and have not controlled for
nal of the Acoustical Society of America, 20, 150–159. other factors, such as diuretic use, that could have acted
McFadden, D. (1968). Masking level di¤erences determined synergistically.
with and without interaural disparities in masker inten-
Isotretinoin is a prescriptive retinoid (a kind of vita-
sity. Journal of the Acoustical Society of America, 44, 212–
223. min A derivative) used to treat persistent and severe
Smoski, W. J., and Trahiotis, C. (1986). Discrimination of in- cystic acne. Its teratogenic e¤ects include abnormalities
teraural temporal disparities by normal-hearing listeners of the ophthalmologic, cardiovascular, vestibular, audi-
and listeners with high-frequency sensori-neural hearing tory, and central nervous systems. Specific auditory sys-
loss. Journal of the Acoustical Society of America, 79, 1541– tem abnormalities include enlargement of the saccule
1547. and utricle, shortening of the cochlea, and malformation
Webster, F. A. (1951). The influence of interaural phase on of the external ear (Schuknecht, 1993; Westerman, Gil-
masked thresholds: The role of interaural time deviation. bert, and Schondel, 1994). Both conductive and sensori-
Journal of the Acoustical Society of America, 23, 452–462. neural hearing losses have been reported in newborns as
Zurek, P. M. (1986). Consequences of conductive auditory im-
a result of maternal use of isotretinoin during gestation.
pairment for binaural hearing. Journal of the Acoustical
Society of America, 80, 466–472. Proper distribution of isotretinoin currently requires
informed patient consent, a negative pregnancy test in
the 2 weeks before treatment is initiated, and the use of
contraceptives from 1 month before to 1 month after use
Hearing Loss and Teratogenic Drugs or of the drug (Dyer, Strasnick, and Jacobson, 1998).
Probably the best-known nonprescription drug with
Chemicals devastating e¤ects on the unborn is thalidomide. This
over-the-counter sedative was available for a short pe-
Most causes of hearing loss in newborn infants are he- riod of time in the late 1950s. Prescribed for morning
reditary and cannot be prevented. However, about 30% sickness, when it was consumed during a susceptible
of cases of hearing loss in newborns have been linked period of fetal development, the implications for the
to teratogenic factors, and many of these cases are pre- baby were catastrophic. When thalidomide was ingested
ventable. Teratogens are factors capable of causing during the first trimester, when fetal limbs di¤erenti-
physical defects in the developing fetus or embryo and ate, babies were born with limb buds rather than
are typically grouped into four categories: infectious, fully formed limbs. A variety of ear anomalies were
chemical, physical, and maternal agents. During intra- also noted, including atresia of the external auditory
uterine life, the fetus is protected from many teratogens meatus, cochlear malformations, and absent acoustic
by the placenta, which serves as a filter to prevent and vestibular nerves (Dyer, Strasnick, and Jacobson,
the toxic substances from entering the fetus’ system. The 1998). Thalidomide was withdrawn from the market but
placenta, however, is not a perfect filter and cannot pre- has subsequently been reintroduced in lesser dosage for
vent entry of all teratogens. Prenatal susceptibility use in a number of immunological and inflammatory
to teratogens and the severity of the insult are quite disorders.
variable. Four factors believed to contribute to this Although numerous other prescription and over-the-
variability are dosage of the agent, the timing of the ex- counter drugs are known to have ototoxic e¤ects in
posure, the susceptibility of the host, and interactions adults, their ototoxic potential in the fetus has not yet
with exposure to other agents. This entry discusses ter- been demonstrated. Those agents currently under suspi-
atogenic chemicals that contribute to hearing loss in the cion but not proven to be ototoxic in utero include,
newborn. It should be kept in mind that much is still among others, a variety of antibiotics, including some
unknown about chemical teratogens and their ultimate aminoglycosides and tetracyclines; anti-inflammatory
impact on the developing auditory system. agents; chloroquine, an antimalarial drug; chemo-
therapeutic drugs; and diuretics. For additional reviews
Drugs of prescription and over-the-counter ototoxic medica-
tions, see Dyer, Strasnick, and Jacobson (1998) and
Of the prescription or over-the-counter medications, one Strasnick and Jacobson (1995).
of the best-studied groups known to have an adverse The e¤ects of recreational drug use by pregnant
e¤ect on the auditory system is the aminoglycocides. women have been di‰cult to document, for a number of
There is considerable information on the ototoxic e¤ects reasons. First, the e¤ects are multifactorial, meaning
of aminoglycocides in adults. Some aminoglycocides are that one drug can interact with another, and many con-
thought to be toxic to both the auditory and vestibular sumers are polydrug users. In addition to using more
systems. Aminoglycosides can cross the placenta, caus- than one drug at a time, drug-using mothers may not
ing sensorineural hearing loss and labyrinthine damage be receiving proper prenatal health care, another factor
in the fetus. This fetal ototoxicity can occur even in the contributing to premature delivery that can compromise
absence of ototoxicity in the mother. Intrauterine oto- the health of a newborn. Furthermore, mothers who
toxicity has been reported for streptomycin, dihydro- consume drugs may not accurately or honestly report the
streptomycin, kanamycin, and gentamicin. It is important range and degree of their drug use.
Ethyl alcohol is clearly the most widely abused drug Other Chemical Teratogens
in the United States. Fetal alcohol syndrome, first de-
scribed in the early 1970s, is a pattern of anomalies Limited data are available on other possible chemical
resulting from maternal consumption of alcohol during teratogens. At least one study has supported the thesis
pregnancy. The exact amount of maternal alcohol con- that maternal use of trimethadione, an antiseizure medi-
sumption required to cause fetal damage is uncertain, cation, can occasionally result in hearing deficits in the
but the e¤ects are believed to be related more to the fetus (Jones, 1997). Although extensive examination has
amount consumed than to the timing of the consump- not been conducted to date, maternal opium and nico-
tion (Dyer, Strasnick, and Jacobson, 1998). Approxi- tine use have not been found to result in peripheral
mately 2 in 1000 newborns su¤er from fetal alcohol hearing loss in newborns.
syndrome (Strasnick and Jacobson, 1995). The syn- See also ototoxic medications.
drome consists of multiple congenital anomalies, in- —Anne Marie Tharpe
cluding prenatal and postnatal growth retardation,
craniofacial dysmorphology, developmental delay, and
behavioral aberrations. The characteristic cranial fea-
References
tures include microcephaly, narrow forehead, small nose Church, M. W., and Abel, E. L. (1998). Fetal alcohol syn-
and midface, a long, thin upper lip, and micrognathia. drome: Hearing, speech, language, and vestibular disorders.
The primary auditory concern with children diagnosed Obstetrics and Gynecological Clinics of North America, 25,
with fetal alcohol syndrome appears to be conduc- 85–97.
tive hearing loss secondary to recurrent otitis media Church, M. W., and Gerkin, K. P. (1988). Hearing disorders in
children with fetal alcohol syndrome: Findings from case
(Church and Gerkin, 1988). This may be the result of
reports. Pediatrics, 82, 147–154.
first and second pharyngeal arch malformations leading Cone-Wesson, B., and Spingarn, A. (1990). E¤ects of maternal
to eustachian tube dysfunction. Sensorineural hearing cocaine abuse on neonatal ABR: Premature and small-for-
loss has also been reported at higher rates in children dates infants. Journal of the American Academy of Audiol-
with fetal alcohol syndrome than in the general popula- ogy, 1, 52(A).
tion (Gerber, Epstein, and Mencher, 1995; Church and Cone-Wesson, B., and Wu, P. (1992). Audiologic findings in
Abel, 1998). infants born to cocaine-abusing mothers. Infant-Toddler
Although animal research has demonstrated ototoxic Invervention: The Transdisciplinary Journal, 2, 25–35.
e¤ects when fetuses are exposed to cocaine, similar find- Dyer, J. J., Strasnick, B., and Jacobson, J. T. (1998). Terato-
ings have not been demonstrated in humans. The inci- genic hearing loss: A clinical perspective. American Journal
of Otology, 19, 671–678.
dence of prenatal cocaine exposure has been reported to
Gerber, S. E., Epstein, L., and Mencher, L. S. (1995). Recent
range from 11% to 14% of live births. Pregnant women changes in the etiology of hearing disorders: Perinatal drug
metabolize cocaine slower than other people, making exposure. Journal of the American Academy of Audiology,
them more sensitive to small amounts of the drug. 6(5), 371–377.
Metabolites of cocaine have been found in the urine of Jones, K. L. (1997). Fetal trimethadione syndrome. In K. L.
exposed infants for up to 10 days after birth. If the Jones (Ed.), Smith’s recognizable patterns of human malfor-
cocaine-exposed infant is born full term with normal or mation (5th ed., p. 564). Philadelphia: Saunders.
near normal birth weight, there appears to be no pe- Salamy, A., Eldredge, L., Anderson, J., and Bull, D. (1990).
ripheral hearing loss, according to auditory brainstem Brainstem transmission time in infants exposed to cocaine
response studies (Cone-Wesson and Wu, 1992). It does in utero. Journal of Pediatrics, 117, 627–629.
Schuknecht, H. F. (1993). Pathology of the ear (2nd ed., p. 177).
appear, however, that neurotoxic e¤ects are present in
Philadelphia: Lea and Febiger.
cocaine-exposed infants whether they are full-term or Strasnick, B., and Jacobson, J. T. (1995). Teratogenic hear-
low birth weight (Cone-Wesson and Spingarn, 1990; ing loss. Journal of the American Academy of Audiology, 6,
Salamy et al., 1990). The ingestion of cocaine by preg- 28–38.
nant women results in vasoconstriction of blood vessels Westerman, S. T., Gilbert, L. M., and Schondel, L. (1994).
delivering nutrients and oxygen to the developing fetus, Vestibular dysfunction in a child with embryonic exposure
which can cause hypoxia. As such, infants with intra- to accutane. American Journal of Otology, 15, 400–403.
uterine exposure to cocaine are at risk for a variety of
disordered neurobehaviors. It is still unclear, however, Further Readings
whether infants of cocaine-abusing mothers who are
born prematurely and of low birth weight are at greater Joint Committee on Infant Hearing. (2000). Year 2000 posi-
risk for neurobehavioral problems than premature, low- tion statement: Principles and guidelines for early hearing
birth-weight infants who are not exposed to cocaine. detection and intervention programs. American Journal of
Audiology, 9, 9–29.
Babies whose mothers are addicted to heroine or Shepard, T. H. (1989). Catalog of teratogenic agents. Balti-
methadone are also addicted at birth and will show signs more: Johns Hopkins University Press.
of narcotic withdrawal. The child may be hyperirritable Shih, L., Cone-Wesson, B., and Reddix, B. (1988). E¤ects of
for several months and may continue to be hyperactive. maternal cocaine abuse on the neonatal auditory system.
No direct link between maternal heroine use and con- International Journal of Pediatric Otorhinolaryngology, 15,
genital hearing loss has been made. 245–251.
Hearing Loss Screening: The School-Age Child 495
Hearing Loss Screening: The School- Table 1. School-Age Children in Need of Regular Hearing
Screening and Monitoring
Age Child Parent/care provider, health care provider, teacher, or other
school personnel have concerns regarding hearing, speech,
Among school-age children in the United States, it is language, or learning abilities
estimated that nearly 15% have abnormal hearing in one
Family history of late or delayed-onset hereditary hearing
loss
or both ears (Niskar et al., 1998). With newborn hearing Recurrent or persistent otitis media with e¤usion for more
screening now available in nearly every state, many sen- than 3 months
sorineural hearing losses are identified prior to school Craniofacial anomalies, including those with morphological
entry. Even so, comprehensive hearing screening of abnormalities of the pinna and ear canal
school-age children is important, for several reasons. Stigmata or other findings associated with a syndrome
First, it will be years before universal infant hearing known to include sensorineural or conductive hearing loss
screening is fully implemented. Second, late-onset sen- Head trauma with loss of consciousness
sorineural loss may occur in the weeks or months fol- Reported exposure to potentially damaging noise levels or
lowing newborn screening, especially in young children ototoxic drugs
with complicated birth histories (Centers for Disease
Control and Prevention, 1997). Third, mild sensori-
neural loss can escape detection even when newborn resources for the program’s implementation and main-
hearing screening is provided (Joint Committee on In- tenance. Hearing screening in the school-age population
fant Hearing, 2000). In school-age children, acquired satisfies each of these criteria; however, ongoing evalua-
sensorineural hearing loss may occur as a result of dis- tion of new technologies and protocols is needed to de-
ease or noise exposure. The e¤ects of sensorineural termine optimal methodology and pass-fail criteria.
hearing loss in children are well documented. Even mild, The Panel on Audiologic Assessment of the American
high-frequency, unilateral sensorineural hearing loss Speech-Language-Hearing Association (ASHA, 1997a)
can have important developmental consequences (Bess, recommends that school-age children be screened on
Dodd-Murphy, and Parker, 1998). More severe losses initial school entry, annually from kindergarten through
are likely to a¤ect the development of speech, language, third grade, and in grades 7 and 11. The panel also rec-
academic performance, and social-emotional develop- ommends screening children at entry to special educa-
ment (Gallaudet University, 1998). In addition to sen- tion, those who repeat a grade, and any who are newly
sorineural loss, hearing screening is needed to identify admitted to the school system. More aggressive hearing
children with conductive hearing loss. In nearly all cases, screening is recommended for children with one or more
conductive hearing loss in school-age children is due to of the high-risk indicators listed in Table 1.
otitis media. The incidence of otitis media with e¤usion Guidelines and position statements of ASHA and the
(OME) is highest during the infant-toddler period and American Academy of Audiology (AAA) recommend
declines substantially by school age. Even so, otitis me- visual inspection of the external ear to detect conspicu-
dia is the most frequent primary diagnosis in children ous signs of disease or malformation (AAA, 1997;
less than 15 years old (American Academy of Pediatrics, ASHA, 1997b). The outer ear examination is followed
1994). The hearing loss associated with OME, although by otoscopic inspection. Screening personnel must have
mild and rarely permanent, can occur throughout child- the knowledge and skill required to conduct visual in-
hood and may result in medical complications as well spection and otoscopy, not for diagnostic purposes but
as potentially adverse e¤ects on communication and to identify obvious signs of ear disease, impacted ceru-
development. men, or foreign objects that may compromise the valid-
ity of the screening or indicate the need for medical
Principles and Methods referral.
Current guidelines and position statements recom-
The purpose of hearing screening is to identify children mend that hearing screening of school-age children be
most likely to have a hearing or middle ear disorder conducted on an individual basis, using manually
needing medical, audiologic, or other interventions. administered pure tones delivered via earphones or insert
Thus, the goal of a hearing screening program is to receivers at 20 dB hearing level (HL) for the frequencies
identify asymptomatic individuals with an increased 1000, 2000, and 4000 Hz (AAA, 1997; ASHA, 1997a).
likelihood of hearing impairment, so that diagnostic In order to pass, the child must respond to all three fre-
follow-up is applied only to that subset of individuals. quencies in each ear. Although failure to respond may
To justify the resources needed to provide a comprehen- simply be due to lack of cooperation or motivation,
sive screening program, several important assumptions hearing loss should be suspected until appropriately
must be met. The problem must be considered signifi- ruled out. When referral is indicated, evaluation by an
cant, both to individuals a¤ected and to society; there audiologist is needed to determine the nature and degree
must be appropriate screening tools with acceptable of hearing loss.
performance criteria; there must be e¤ective treatment Routine pure-tone screening at 20 dB HL, however, is
for those identified; and there must be su‰cient financial inadequate for the identification of OME (Melnick,
Eagles, and Levine, 1964; Roush and Tait, 1985). Con- Professionals and support personnel from many dis-
sequently, many institutional screening programs in- ciplines are now involved in hearing and middle ear
clude acoustic immittance (tympanometry and related screening. The screening procedures are not di‰cult with
measures) as part of the hearing screening program. Be- most school-age children; however, personnel must be
cause OME has low incidence in the school-age popula- competent and appropriately supervised. Furthermore,
tion, ASHA guidelines recommend routine middle ear those responsible for the program must ensure that
screening only to age 6 (ASHA, 1997b). screening personnel are employed in a manner consistent
Evoked otoacoustic emissions (EOAEs) have become with state licensure, professional scope of practice, and
an essential tool in evaluating peripheral auditory func- other regulatory requirements. For this reason it is rec-
tion. EOAEs, which are present in nearly all ears with ommended that institutional screening programs be
normal cochlear function, have at least two important conducted under the general supervision of an audiolo-
advantages over behavioral pure-tone screening. They gist. School-age children who are di‰cult to test because
are objective and, for most school-age children, easy to of developmental disabilities or other factors should be
measure. Because EOAEs are usually absent in ears with screened by an audiologist.
more than mild hearing loss, they are potentially well- Audiometers and tympanometric screening instru-
suited for school-age screening (Driscoll, Kei, and Mc- ments must undergo full calibration by a qualified tech-
Pherson, 1997; McPherson et al., 1998). But EOAES are nician at least once each year. The American National
influenced by middle ear disease as well as by cochlear Standards Institute (ANSI) has established specific
dysfunction. While this is often cited as a disadvantage requirements for the calibration of these instruments
of EOAEs, Nozza and colleagues have suggested that (ANSI S3.39-1987; S3.6-1996). In addition to formal
EOAEs, in conjunction with tympanometry, may be calibration measurements, ANSI standards advise rou-
useful as part of a multistage screening program to de- tine visual inspection and daily listening checks.
tect hearing loss and otitis media (Nozza, Sabo, and Because school-age children are at risk for permanent
Mandel, 1997; Nozza, 2001). Before such an approach hearing loss from exposure to high-intensity noise, a
can be endorsed for routine screening of school-age comprehensive program of hearing screening for school-
children, further clinical trials are needed to determine age children should include information on prevention of
the sensitivity, specificity, and predictive value of acoustic trauma through the use of appropriate hearing
EOAEs and tympanometry, in comparison to conven- protection (Anderson, 1991). The New York League for
tional procedures. Based on currently available research, the Hard of Hearing (2001) has developed useful mate-
EOAEs are likely to play an increasing role in school- rials for educating children about the dangers of noise-
age screening (Nozza, 2001). induced hearing loss and how to prevent it.
Parental permission must be obtained prior to con-
Other Considerations ducting hearing and middle ear screening procedures
unless consent has already been obtained as part of an
An ideal acoustical environment is rarely available in
institutional enrollment process or admissions proce-
schools and other settings where screening is often
dure. Failure to obtain informed consent not only is
conducted. Although few programs make on-site noise
unprofessional but could lead to negative public rela-
measurement as part of evaluating a test environment
tions and possible legal action. Parents must be informed
prior to screening, the time and expense involved in
of screening procedures and their purpose. In addition to
unnecessary follow-up and audiologic referral can far
informed consent, strict confidentiality must be ensured.
exceed the cost of providing a noise survey. Thus, a pre-
Discussion of screening outcomes or distribution of re-
screening noise survey that includes measurement of
sults should occur only with parents’ knowledge and
ambient noise levels at several third-octave bands (Table
consent, and mechanisms used to transmit screening
2) should be conducted when there is uncertainty re- results must be in secure data formats. Most screening
garding adequacy of the screening environment (ANSI
programs will already have institutional guidelines. If
S3.1-1999).
guidelines do not exist, they must be implemented ac-
cording to institutional protocols as well as state and
federal laws. Screening personnel must be familiar with
these policies and maintain compliance at all times
Table 2. Maximum Permissible Ambient Noise Levels (in dB (Roush, 2000).
SPL) for One-Third Octave Bands, for Screening at 20 dB HL Undetected hearing loss in a school-age child is a
Using the Pure-Tone Frequencies 1000, 2000, and 4000 Hz serious matter. Not only is appropriate intervention
(ANSI S3.1-1999) denied, hearing loss can be mistaken for a developmental
Pure-Tone Frequency (Hz) disability or attention deficit. Even so, mass screening of
school-age children is an expensive and time-consuming
Stimulus/Transducer 1000 2000 4000
endeavor that requires systematic review and ongoing
Screening at 20 dB HL with 41 49 52 evaluation. This includes careful examination of screen-
supra-aural earphones ing outcomes, tracking of referrals, and communication
Screening at 20 dB HL with 62 64 65 with agencies and health care providers to whom refer-
insert earphones
rals are made. In recent years, despite an enormous in-
Hearing Protection Devices 497
crease in studies related to newborn hearing screening, New York League for the Hard of Hearing. (2001). Noise
there has been remarkably little research aimed at the and its e¤ects on children. Available: www.lhh.org/noise/
school-age population. In particular, there is a need to children/index.htm.
further examine the role of EOAEs, alone and in com- Niskar, A. S., Kieszak, S. M., Holmes, A., Esteban, E., Rubin,
C., and Brody, D. B. (1998). Prevalence of hearing loss
bination with other tests, to determine the optimal
among children 6 to 19 years of age. Journal of the Ameri-
screening battery for identification of hearing loss and can Medical Association, 279(14), 1071–1075.
middle ear disease in the school-age population. Nozza, R. (2001). Screening with otoacoustic emissions beyond
—Jackson Roush the newborn period. Seminars in Hearing, 22, 415–426.
Nozza, R., Sabo, D., and Mandel, E. (1997). A role for oto-
References acoustic emissions in screening for hearing impairment and
middle ear disorders in school-age children. Ear and Hear-
American Academy of Audiology. (1997). Report and Position ing, 18, 227–239.
Statement: Identification of hearing loss and middle ear Roush, J. (2000). Screening for hearing loss and otitis media in
dysfunction in preschool and school-age children. Audiology children. San Diego, CA: Singular/Thomson Learning.
Today, 9(3), 18–22. Roush, J., and Tait, C. (1985). Pure tone and acoustic immit-
American Academy of Pediatrics. (1994). Practice guideline: tance screening of preschool-aged children: An examination
Managing otitis media with e¤usion in young children. Pe- of referral criteria. Ear and Hearing, 6, 245–250.
diatrics, 94(5).
American National Standards Institute. (1987). Specifications Further Readings
for instruments to measure aural acoustic impedance and ad-
mittance (ANSI S3.39-1987, R1996). New York: Acousti- American Speech-Language-Hearing Association. (1997).
cal Society of America. Guidelines for audiologic screening. Rockville, MD: Author.
American National Standards Institute. (1991). Maximum Bess, F. H., and Hall, J. W. III (Eds.). (1992). Screening chil-
permissible ambient noise levels for audiometric test rooms dren for auditory function. Nashville, TN: Bill Wilkerson
(ANSI S3.1-1991). New York: Acoustical Society of Center Press.
America. Cadman, D., Chambers, L., Feldman, W., and Sackett, D.
American National Standards Institute. (1996). Specifications (1984). Assessing the e¤ectiveness of community screening
for audiometers (ANSI S3.6-1996). New York: Acoustical programs. Journal of the American Medical Association,
Society of America. 252, 1580–1585.
American Speech-Language-Hearing Association. (1997a). DeConde Johnson, C. (2002). Hearing and immittance screen-
Guidelines for audiologic screening for hearing impairment: ing. In J. Katz (Ed.), Handbook of clinical audiology. Phil-
School-age children, 5 through 18 years. Rockville, MD: adelphia: Lippincott Williams and Wilkins.
Author. Nozza, R. (1996). Pediatric hearing screening. In F. N. Martin
American Speech-Language-Hearing Association. (1997b). and J. G. Clark (Eds.), Hearing care for children. Needham
Guidelines for screening infants and children for outer and Heights, MA: Allyn and Bacon.
middle ear disorders: Birth through 18 years. Rockville, MD:
Author.
Anderson, K. (1991). Hearing conservation in the public
schools revisited. Seminars in Hearing, 12, 340–358. Hearing Protection Devices
Bess, F. H., Dodd-Murphy, J., and Parker, R. A. (1998).
Children with minimal sensorineural hearing loss: Preva-
lence, educational performance, and functional status. Ear Hearing protection devices (HPDs) are, as a practical
and Hearing, 19, 339–354. matter, the first line of defense against hearing loss
Centers for Disease Control and Prevention. (1997). Serious caused by excessive noise. Other ways to reduce expo-
hearing impairment among children aged 3–10 years— sure to loud sound (engineering control of noise sources,
Atlanta, Georgia, 1991–1993. MMWR, 46, 1073–1076. reduction of noise in the transmission path between
Driscoll, C., Kei, J., and McPherson, B. (2001). Transient a source and an individual) can indeed be more e¤ec-
evoked otoacoustic emissions in 6-year-old school children:
A comparison with pure tone screening and tympanometry.
tive, but are often more costly or more di‰cult to
International Journal of Pediatric Otolaryngology, 57, 67–76. manage.
Gallaudet University Center for Assessment and Demographic HPDs can be classified by type (active versus passive),
Study. (1998). Thirty years of the annual survey of deaf and by form (e.g., earplugs and earmu¤s of various types),
hard-of-hearing children and youth: A glance over the dec- and by e¤ect. In all cases the goal is the same: to atten-
ades. American Annals of the Deaf, 142(2), 72–76. uate the magnitude of sound reaching the cochlea, thus
Joint Committee on Infant Hearing. (2000). Year 2000 posi- limiting acoustical insult to the end-organ of hearing (see
tion statement. Available: http://www.pediatrics.org/cgi/ noise-induced hearing loss). Passive devices accom-
content/full/106/4/798. plish this through blockage of the airborne transmission
McPherson, B., Kei, J., Smyth, V., Latham, S., and Loscher, J. path to the inner ear. Active devices seek to mechani-
(1998). Feasibility of community-based hearing screening
using transient evoked otoacoustic emissions. Public Health,
cally or electronically respond to noise to reduce signal
112, 147–152. amplitudes presented to the auditory system.
Melnick, W., Eagles, E., and Levine, H. (1964). Evaluation of Earplugs fall into five categories (Berger, 2000): (1)
a recommended program of identification audiometry in closed-cell foam devices designed to be manually com-
school-age children. Journal of Speech and Hearing Dis- pressed, then inserted into the ear canal, where they ex-
orders, 29, 3–13. pand to approximate their initial size (e.g., the Aearo
Company E-A-R plug), (2) preformed devices available irritation. If hearing protectors (perhaps combined with
in di¤erent diameters to accommodate di¤erent ear hearing loss) render speech communication di‰cult, or if
canals (e.g., the PlastiMed V-51R plug), (3) malleable they limit audibility of other signals deemed important,
devices intended to fit a range of ear canals, (4) semi- users may reject them. For obvious reasons, earmu¤s
insert devices held in the ear canal by means of a plastic should not be used in conjunction with hearing aids.
or metal band, and (5) devices made from ear mold These and other issues are discussed in detail by Berger
impressions taken from individual ear canals. Most ear- (2000).
plugs are created from plastics (polyvinyl chloride, poly- Real environments in which hearing protectors might
urethane, silicone, or acrylic); malleable earplugs often provide benefit di¤er tremendously in noise amplitude,
consist of wax-impregnated cotton or fiberglass enclosed spectrum, and duration. Noise exposure is normally in-
in a thin plastic container. Most earplugs are passive, dexed by time-weighted average (TWA) sound pressure
that is, they are not intended to respond di¤erently to levels sampled using integrating meters or personal noise
di¤ering noise exposures. Some active earplugs employ dosimeters. In the United States, such measurements
metal slugs or other material intended to move within are specified by Federal regulation (Occupational Safety
the plug when stimulated by sudden acoustic overpres- and Health Administration [OSHA], 1983, CFR Part
sures, thus increasing attenuation in response to im- 1910.95) and the subject of technical standards (Ameri-
pulsive noise. For various reasons, it is di‰cult or can National Standards Institute [ANSI], S12.19 1996).
impossible to objectively measure the attenuation of Among other details, exposure is to be indexed using a
such devices or to estimate their real-world benefit. A slow meter ballistic characteristic and an A-weighting
recent addition to the array of earplugs are those o¤ered network (a high-pass filter useful in predicting the e¤ects
by Etymotic Research for users with specific needs (e.g., of broadband noise on hearing). TWA levels are single-
musicians) who seek flat attenuation across specified number values used to describe noise exposure and de-
frequency ranges. termine actions to protect workers from noise-induced
Earmu¤s are designed as integral components of hearing loss in the workplace (OSHA, 1983).
safety helmets or as separate devices that surround In 1979, the Environmental Protection Agency (EPA)
the outer ear and are held in place by headbands that issued a regulation intended to promote laboratory
extend over or behind the head or beneath the chin. measurement of hearing protector attenuation for the
Some of these are combined with communication sys- purpose of combining such information with exposure
tems intended to increase the signal-to-noise ratio of data to estimate protective e¤ect. The EPA regulation
messages electronically routed to earphones placed (CFR40 Part 211) built on previous technical standards
within headsets. At present, most earmu¤s are passive (ANSI, 1974), and invoked a single-number index, called
devices, but several have been developed as active sys- the Noise Reduction Rating (NRR), to be included
tems. Indeed, most active hearing protectors are based in hearing protector product labels. Computation of
on earmu¤s, owing to the space required for sound NRRs from averaged behavioral real-ear attenuation-
sensing and processing components. Active protectors at-threshold (REAT) data assume temporally continu-
employ one of two (or both) methods to attenuate ous band-limited noise stimuli with equal energy per
sound. One method senses sound with a microphone, octave (pink noise), and address intersubject variability
then processes sound delivered through an earphone by by doubling the standard deviation of threshold shifts,
means of automatic gain control circuitry: when incident then subtracting that value from mean threshold shift
sound exceeds a certain level, further increases are elec- for each noise band. Adjusted attenuation values are
tronically clipped or otherwise squelched. The other summed logarithmically across stimulus noise bands to
method samples incident sound, reverses the phase of the yield an NRR in decibels. Because the NRR method
signal, and electronically adds the reversed signal within also assumes measurement of unprotected levels indexed
the mu¤ enclosure to partially cancel the incident sound. with a C-weighting network (which has a flatter fre-
Because of incident signal changes and processing speed quency response than the A-weighting network used to
requirements, devices employing additive cancellation measure exposure), an additional 7 dB must be sub-
techniques are more e¤ective at relatively low frequen- tracted from the NRR to estimate A-weighted noise
cies (e.g., below 500 Hz; see Nixon, McKinley, and levels when a hearing protector is in place (see OSHA,
Steuver, 1992). Some active noise reduction methods 1983, Appendix B). Finally, because it was recognized
appear similar to (or may benefit from) methods that how hearing protectors are placed in a subject’s ears
employed in hearing aids. (plugs) or on a subject’s head (mu¤s) could a¤ect out-
Beyond type and form, HPDs di¤er in weight, com- comes, the EPA method specified experimenter fitting of
fort, uniformity of fit to individuals, compatibility with HPDs during laboratory testing.
other protective or prosthetic devices, and compatibility Because real-ear attenuation methods performed fol-
with individual user health status. Using eyeglasses with lowing the procedures stipulated by EPA designate ex-
earmu¤s, for example, can create acoustic leaks that re- perimenter fitting of HPDs, it is to be expected that
duce attenuation performance. Similarly, a subject with resulting NRRs will be larger than what would be found
excessive cerumen or a middle ear e¤usion should not with subject fitting of HPDs. Because all extant methods
use earplugs. Use of hearing protectors in hot, humid for measuring REATs use temporally continuous noise,
environments can be uncomfortable and can cause skin results of such measurements cannot be generalized
Hearing Protection Devices 499
to impulse noise (e.g., gunfire) or impact noise (e.g., If only experimenter-fit data are available, NIOSH
forging). (1998) currently recommends derating of NRRs based
Shortly after the inception of the current OSHA on type of hearing protector: 25% for earmu¤s, 50% for
Hearing Conservation Rule (OSHA, 1983), the National formable earplugs, and 70% for all other earplugs. In
Institute of Occupational Safety and Health (NIOSH) the case of double protection (plugs and mu¤s), the
recommended that labeled NRRs be derated to estimate OSHA Technical Manual (OSHA, 1999) recommends
e¤ectiveness in the field. Six schemes are noted in using the EPA NRR for the better protector, minus
Appendix B of the Hearing Conservation Rule. These 7 dB, dividing the result by 2 (a 50% derating), then
di¤er based on available measurement devices and data, adding 5 dB to the field-adjusted NRR to account for
but generally reduce the estimated benefit of hearing the second protector.
protectors. For example, if only A-weighted noise expo- Rather clearly, much work remains to be done to
sure data are available, 7 dB is subtracted from the improve the prediction of real-world benefit of hearing
NRR. For both A-weighted and C-weighted exposure protectors (Berger and Lindgren, 1992; Berger, 1999).
data, the resulting corrected NRR is further reduced by One promising approach involves methods similar to in
50%. vivo real-ear gain measurements of hearing aids (now a
Subsequent research over two decades suggests that common practice), together with modification of com-
NRRs derated in this manner still overestimate the at- monly used personal noise dosimeters. This approach
tenuation of hearing protectors in real-world situations. requires the ability to simultaneously measure exposure
Various factors contribute to this inaccuracy, including level and the sound level generated within the ear canal
overestimation associated with (1) experimenter fit, (2) of the wearer of a hearing protector. If both are mea-
highly trained test subjects (whose small standard de- sured with the same filtering schemes (preferably, the
viations of REATs produce higher NRRs), and (3) C-weighting network; ideally with both A and C net-
di¤erences in patterns of use of hearing protectors in works), the signed di¤erence between the two would
laboratory and field settings (NIOSH, 1998). index attenuation due to the hearing protector. If such
Other pertinent generalizations include the following: measurements can be adapted to field use (e.g., with a
(1) overall, earmu¤s provide the most protection, foam two-channel noise dosimeter), it may be possible to add
and formable earplugs provide the next greatest protec- useful information to what otherwise can be determined
tion, and all other insert types provide less, and (2) ide- about the performance of at least some HPDs. ANSI
ally, individuals should be fitted individually for hearing S12.42 (1995) addresses some of these issues for earmu¤s
protectors (NIOSH, 1998). Generally, both earplugs and communication headsets, but only for laboratory
and earmu¤s provide greater attenuation at frequencies measurements. Because the use of probe microphones
above 500 Hz than at lower frequencies (Berger, 2000). with earplugs is likely to produce reactive measurement
Chapter 4 of the revised NIOSH criteria document e¤ects, this approach may not be suitable for insert
(NIOSH, 1998) o¤ers details about estimated real-world devices.
NRRs for 84% of wearers of hearing protectors, based It is generally recognized that e¤ective use of hearing
on several independent studies. Labeled NRRs for single protectors in the workplace or elsewhere is influenced by
protectors range from 11 to 29 dB, while weighted mean factors that go beyond the physical performance of these
NRR84 values range from 0.1 to 14.3 dB. devices. As summarized by NIOSH (1998), these factors
To address these problems, existing standards for include convenience and availability, comfort and ease
measuring HPD attenuation were revised (Royster et al., of fit, compatibility with other safety equipment, and
1996; Berger et al., 1998) to include subject-fit methods worker belief that the device can be worn e¤ectively, will
with audiometrically proficient listeners naive about indeed prevent hearing loss, and will still permit hearing
HPDs. The resulting standard is ANSI S12.6 (1997). of important sounds.
(A companion standard, ANSI S12.42 (1995), specifies a
test fixture method and a microphone-in-real-ear method —Michael R. Chial
for measuring insertion loss useful for quality control
and product development work with earmu¤s.) References
In 1995 the National Hearing Conservation Associa- American National Standards Institute. (1974). S3.19-1974
tion proposed alternative labeling requirements in which American National Standard measurement of real-ear pro-
only subject-fit real-ear attenuation data (ANSI S12.6- tection of hearing protectors and physical attenuation of ear
1997, Method B) are reported. The revised NRR(SF) mu¤s. New York: Author.
information generally suggests less protection than American National Standards Institute. (1995). S12.42-1995
NRRs based on experimenter fitting. Alternatively, the American National Standard microphone-in-real-ear and
NHCA (1995) suggests labeling to include high, me- acoustic test fixture methods for the measurement of insertion
loss of circumaural hearing protection devices. New York:
dium, and low NRRs based on statistical distributions of Author.
measured subject-fit REATs. This proposal has been American National Standards Institute. (1996). S12.19-1996
endorsed by several other organizations. As of this writ- American National Standard measurement of occupational
ing, however, the EPA NRR labeling requirement re- noise exposure. New York: Author.
mains based on the experimenter-fit method specified in American National Standards Institute. (1997). S12.6-1997
ANSI S3.19 (1974). American National Standard methods for measuring of the
real-ear attenuation of hearing protectors. New York: American National Standards Institute [Archive]. Available:
Author. http://www.ansi.org/. [Accessed April 12, 2002.]
Berger, E. H. (1999). Hearing protector testing: Let’s get real Berger, E. H. Earlog series (1–21) [Archive]. Available: http://
[using the new ANSI Method-B data and the NRR(SF)]. www.cabotsafety.com/html/industrial/earlog.htm. [Accessed
Earlog 21. Indianapolis, IN: Aearo Co. Available: http:// April 12, 2002.]
www.cabotsafety.com/html/industrial/earlog21.htm. [Ac- Council for Accreditation in Occupational Hearing Conserva-
cessed April 12, 2002.] tion [Archive]. Available: http://www.caohc.org/. [Accessed
Berger, E. H. (2000). Hearing protection devices. In E. H. April 12, 2002.]
Berger, L. H. Royster, J. D. Royster, D. P. Driscoll, and Franks, J. R., and Berger, E. H. (1998). Hearing protection-
M. Layne (Eds.), The noise manual (5th ed., chap. 10). personal protection: Overview. In Encyclopedia of oc-
Fairfax, VA: American Industrial Hygiene Association. cupational health and safety (31.11–31.15). Geneva:
Berger, E. H., Franks, J. R., Behar, A., Casali, J. G., Dixon- International Labour Organization.
Ernst, C., Kieper, R. W., et al. (1998). Development of a National Hearing Conservation Association. (2002). Contem-
new standard laboratory protocol for estimating the field porary references: Hearing protection research. Available:
attenuation of hearing protection devices: Part III. The va- http://www.hearingconservation.org/cr3.html. [Accessed
lidity of using subject-fit data. Journal of the Acoustical So- April 12, 2002.]
ciety of America, 103, 665–672. Noise Pollution Clearinghouse [Archive]. Available: http://
Berger, E. H., and Lindgrin, F. (1992). Current issues in hear- www.nonoise.org/. [Accessed April 12, 2002.]
ing protection. In A. L. Dancer, D. Henderson, R. J. Salvi,
and R. P. Hamernik (Eds.), Noise-induced hearing loss
(chap. 33). St. Louis: Mosby–Year Book.
Environmental Protection Agency. (1979). Noise labeling Masking
requirements for hearing protectors. Code of Federal
Regulations 40CFR Part 211. Available: http://www
.nonoise.org/lawlib/cfr/40/40cfr211.htm. [Accessed April A major goal of the basic audiologic evaluation is as-
12, 2002.] sessment of auditory function of each ear. There are sit-
National Hearing Conservation Association. (1995). Recom-
uations during both pure-tone and speech audiometry,
mendations of the NHCA Task Force on Hearing Protector
E¤ectiveness. Available: http://www.hearingconservation however, when the nontest ear can contribute to the
.org/pos6.htm. [Accessed April 12, 2002.] observed response from the test ear. Whenever it is sus-
National Institute of Occupational Safety and Health. (1998). pected that the nontest ear is responsive during evalua-
Criteria for a recommended standard: Occupational noise tion of the test ear, a masking stimulus must be applied
exposure, revised criteria 1988 (DHHS [NIOSH] Publica- to the nontest (i.e., contralateral) ear in order to elimi-
tion No. 98-126). Cincinnati, OH: National Institute of nate its participation.
Occupational Safety and Health. Available: http://www Cross-hearing occurs when a stimulus presented to the
.cdc.gov/niosh/98-126.html. [Accessed April 12, 2002.] test ear ‘‘crosses over’’ and is perceived in the nontest
Nixon, C. W., McKinley, R. L., and Steuver, J. W. (1992). ear. It is the result of limited interaural attenuation dur-
Performance of active noise reduction headsets. In A. L.
ing both air- and bone-conduction testing. Interaural at-
Dancer, D. Henderson, R. J. Salvi, and R. P. Hamernik
(Eds.), Noise-induced hearing loss (chap. 34). St. Louis: tenuation refers to the reduction of energy between ears.
Mosby–Year Book. Generally, it represents the amount of separation be-
Occupational Safety and Health Administration. (1983). Oc- tween ears during testing. Specifically, it is the decibel
cupational noise exposure-hearing conservation amend- di¤erence between the hearing level of the signal at the
ment. Code of Federal Regulations 29 CFR Part 1910.95. test ear and the hearing level reaching the nontest ear
Available: http://www.osha.gov/pls/oshaweb/owadisp cochlea.
.show_document?p_table=STANDARDS&p_id=9735. A major factor that a¤ects interaural attenuation
[Accessed April 12, 2002.] is the transducer type: air-conduction versus bone-
Occupational Safety and Health Administration. (1999). Noise conduction. Two types of earphones are commonly
measurement. In OSHA technical manual (sect. III, chap.
used during air-conduction audiometry. Supra-aural
5). Available: http://www.osha.gov/dts/osta/otm/otm_iii/
otm_iii_5.html. [Accessed April 12, 2002.] earphones use cushions that press against the pinna,
Royster, J. D., Berger, E. H., Merry, C. J., Nixon, C. W., while insert earphones are coupled to the ear by insertion
Franks, J. R., Behar, A., et al. (1996). Development of a into the ear canal.
new standard laboratory protocol for estimating the field Interaural attenuation for supra-aural earphones
attenuation of hearing protection devices: Part I. Research varies across frequency and subject, ranging from about
of Working Group 11, Accredited Standards Committee 40 dB to 80 dB (e.g., Coles and Priede, 1970; Snyder,
S12, Noise. Journal of the Acoustical Society of America, 99, 1973; Killion, Wilber, and Gudmundsen, 1985; Sklare
1506–1526. and Denenberg, 1987). The smallest reported value of
interaural attenuation for speech is 48 dB (e.g., Snyder,
1973; Martin and Blythe, 1977). When making a deci-
Further Readings sion about the need for contralateral masking during
Alberti, P. W. (Ed.). (1982). Personal hearing protection in in- clinical practice, a single value defining the lower limit of
dustry. New York: Raven Press. interaural attenuation is most useful (Studebaker, 1967).
American Industrial Hygiene Association [Archive]. Available: The majority of audiologists use an interaural attenua-
http://www.aiha.org/. [Accessed April 12, 2002.] tion value of 40 dB for all air-conduction measurements,
Masking 501
both pure-tone and speech, when making a decision The major factor to consider when making a deci-
about the need for contralateral masking (Martin, sion about the need for contralateral masking during
Champlin, and Chambers, 1998). bone-conduction audiometry is whether the unmasked
Commonly used insert earphones are the ER-3A bone-conduction threshold (unmasked BC) suggests the
(Etymotic Research, 1991) and the E-A-RTONE 3A presence of a significant conductive component in the
(E-A-R Auditory Systems, 1997). A major advantage of test ear. Specifically, the use of contralateral masking
the 3A insert earphone is increased interaural attenua- is indicated whenever the results of unmasked bone-
tion for air-conducted sound, particularly in the lower conduction audiometry suggest the presence of an air-
frequencies. Consequently, the need for contralateral bone gap in the test ear (AB GapT ) of 15 dB or greater:
masking is significantly reduced during air-conduction
audiometry. Based on currently available data, conser- AB GapT b 15 dB
vative estimates of interaural attenuation for 3A insert
where
earphones with deeply inserted foam eartips are 75 dB
at 1000 Hz and below and 50 dB at frequencies above AB GapT ¼ ACT Unmasked BC
1000 Hz (Killion, Wilber, and Gudmundsen, 1985;
Sklare and Denenberg, 1987). The smallest reported Contralateral masking is indicated during speech
value of interaural attenuation for speech is 20 dB audiometry whenever the presentation level of the
greater when using 3A insert earphones with deeply speech signal in dB HL at the test ear (PLT ) minus
inserted foam eartips (Sklare and Denenberg, 1987) than interaural attenuation equals or exceeds the best pure-
when using a supra-aural arrangement (Snyder, 1973; tone bone-conduction threshold in the nontest ear (Best
Martin and Blythe, 1977). Consequently, a value of BCNT ):
60 dB represents a conservative estimate of interaural
attenuation for speech when using 3A insert earphones. PLT IA b Best BCNT
Interaural attenuation is greatly reduced during bone-
Because speech is a broadband signal, it is necessary
conduction audiometry. Regardless of the placement
to consider bone-conduction hearing sensitivity at more
of a bone vibrator (i.e., mastoid versus forehead), it is
than a single pure-tone frequency. The most conser-
generally agreed that interaural attenuation for bone-
vative approach involves considering the best bone-
conducted sound is negligible and should be considered conduction threshold in the 250- to 4000-Hz frequency
0 dB (e.g., Hood, 1960; Sanders and Rintelmann, 1964;
range (Coles and Priede, 1975).
Studebaker, 1967; Dirks, 1994).
Clinical Masking Procedures
When to Mask
Although there are many di¤erent approaches to clinical
Contralateral masking is required during pure-tone masking, each addresses two basic questions. First, what
air-conduction audiometry when the unmasked air-
is the minimum level of noise that is required to just
conduction threshold obtained in the test ear (ACT )
mask the cross-hearing signal in the nontest ear? Stated
minus the apparent bone-conduction threshold (i.e.,
di¤erently, this is the minimum masking level that is
the unmasked bone-conduction threshold) in the nontest
needed to prevent undermasking (i.e., the test signal
ear (BCNT ) equals or exceeds interaural attenuation
continues to be perceived in the nontest ear). Second,
(IA):
what is the maximum level of noise that can be pre-
ACT BCNT b IA sented to the nontest ear that will not shift or change the
true threshold in the test ear? Stated di¤erently, this is
Many audiologists will obtain air-conduction thresh- the maximum masking level that can be used without
olds prior to measurement of bone-conduction thresh- overmasking.
olds. A preliminary decision about the need for Masking refers to ‘‘the process by which the threshold
contralateral masking can be made by comparing the of hearing for one sound is raised by the presence of an-
air-conduction thresholds of the two ears. When the other (masking) sound’’ (ANSI S3.6-1996, p. 5). The
air-conduction threshold in the test ear minus the air- purpose of contralateral masking is to reduce the sensi-
conduction threshold in the nontest ear (ACNT ) equals tivity of the nontest ear to the test stimulus. The masking
or exceeds interaural attenuation, masking should be noise typically used during pure-tone audiometry is
applied to the nontest ear: narrow-band noise centered geometrically around the
audiometric test frequency. Speech spectrum noise (i.e.,
ACT ACNT b IA weighted random noise for the masking of speech) is
typically used during speech audiometry. Masking noise
It is important to remember, however, that cross-hearing is calibrated in e¤ective masking level (dB EM) (ANSI
for air-conducted sound occurs primarily through the S3.6-1996). E¤ective masking level for pure tones refers
mechanism of bone conduction. Consequently, it will be to the dB HL to which detection threshold is shifted by a
necessary to reevaluate the need for contralateral mask- given level of noise. E¤ective masking level for speech
ing during air-conduction testing following the measure- refers to the dB HL to which the speech recognition
ment of unmasked bone-conduction thresholds. threshold (SRT) is shifted by a given level of noise.
The introduction of contralateral masking can pro- an occlusion e¤ect may be created in the nontest ear.
duce a small threshold shift in the test ear even when The nontest ear consequently can become more sensi-
masking level is insu‰cient to produce overmasking. tive to bone-conducted sound for test frequencies below
Wegel and Lane (1924) referred to this phenomenon 2000 Hz, particularly when using supra-aural earphones
as central masking. Central masking has been reported (e.g., Elpern and Naunton, 1963; Goldstein and Hayes,
to a¤ect thresholds during both pure-tone and speech 1965; Berger and Kerivan, 1983; Dean and Martin,
audiometry (e.g., Lidén, Nilsson, and Anderson, 1959; 2000). Consequently, the minimum masking level must
Studebaker, 1962; Dirks and Malmquist, 1964; Martin, be increased by the amount of the occlusion e¤ect. There
Bailey, and Pappas, 1965; Martin, 1966; Martin and is evidence suggesting that the occlusion e¤ect is de-
DiGiovanni, 1979). Although the threshold shift gener- creased significantly when using deeply inserted E-A-R
ally is considered to be about 5 dB, variable results have foam plugs, the eartips used with 3A insert earphones
been reported across subjects and studies. (Berger and Kerivan, 1983; Dean and Martin, 2000).
The most popular method for obtaining masked pure- The clinician can use either individually determined
tone threshold was first described by Hood in 1957 (Martin, Butler, and Burns, 1974; Dean and Martin,
(Hood, 1960; Hood’s original paper was reprinted in the 2000) or fixed occlusion e¤ect values (i.e., based on av-
United States in 1960). The Hood method is also re- erage data reported in the literature) when calculating
ferred to as the plateau, shadowing, or threshold shift minimum masking level. When using supra-aural ear-
procedure. The goal of the plateau procedure is to es- phones, the following fixed occlusion e¤ect values are
tablish the hearing level at which the pure-tone threshold recommended for clinical use: 30 dB at 250 Hz, 20 dB at
remains unchanged with increments in masking level. 500 Hz, and 10 dB at 1000 Hz. When using 3A insert
The masking ‘‘plateau’’ represents a range of masking earphones with deeply inserted foam eartips, the follow-
levels (e.g., 15–20 dB) over which the pure-tone thresh- ing values are recommended: 10 dB at 250 and 500 Hz,
old remains unchanged. The recommended clinical pro- and 0 dB at frequencies of 1000 Hz and higher. It should
cedure is summarized below: be noted that the occlusion e¤ect is decreased or absent
in ears with conductive hearing impairment (e.g., Mar-
1. Masking noise is introduced to the nontest ear at a
tin, Butler, and Burns, 1974; Studebaker, 1979). If the
minimum masking level. The pure-tone threshold is
nontest ear exhibits a potential air-bone gap of 20 dB or
then reestablished.
more, the occlusion e¤ect should not be added to the
2. The level of the tone or noise is increased subse-
minimum masking level.
quently by 5 dB. If there is a response to the tone in
The American Speech-Language-Hearing Association
the presence of the noise, the level of the noise is (ASHA, 1990) has published guidelines for audiometric
increased by 5 dB. If there is no response to the tone
symbols and procedures for graphic representation of
in the presence of the noise, the level of the tone is
frequency-specific audiometric findings. Di¤erent sym-
increased in 5-dB steps until a response is obtained. bols are used to represent pure-tone thresholds obtained
3. A plateau has been reached when the level of the
with and without contralateral masking. The reader is
noise can be increased over a range of 15–20 dB
referred to ASHA’s 1990 guidelines and the article pure-
without shifting the threshold of the tone. This corre-
tone threshold assessment.
sponds to a response to the tone at the same hearing
The optimal masking level during speech audiometry
level at three to four consecutive masking levels.
is one that falls above the minimum and below the
4. The masked pure-tone threshold corresponds to the
maximum masking levels (Lidén, Nilsson, and Ander-
hearing level of the tone at which a masking plateau
son, 1959; Studebaker, 1979; Konkle and Berry, 1983).
has been established.
The goal is to select a masking level that falls at the
Formulas have been proposed for the calculation of middle of the masking plateau. This concept was origi-
minimum masking level during pure-tone threshold nally discussed by Luscher and König in 1955 (cited in
audiometry (e.g., Lidén, Nilsson, and Anderson, 1959; Studebaker, 1979).
Studebaker, 1964). The simplified method described by Minimum masking level (MMin ), adapted from Lidén,
Martin (1967, 1974) is recommended for clinical use. Nilsson, and Anderson (1959), can be summarized using
Specifically, minimum masking level (MMin ) (i.e., ‘‘initial the following equation:
masking’’ level) during air-conduction testing is equal to
the air-conduction threshold of the nontest ear plus a MMin ¼ PLT IA þ Max AB GapNT
safety factor of at least 10 dB:
PLT represents the presentation level of the speech signal
MMin ¼ ACNT þ 10 dB in dB HL at the test ear, IA is the interaural attenuation
Minimum masking level during bone-conduction audio- value for speech, and Max AB GapNT is the maximum
metry is determined similarly. However, it is also neces- air-bone gap in the nontest ear in the 250–4000-Hz fre-
sary to account for the occlusion e¤ect (OE): quency range. PLT IA, an estimate of the hearing level
of the speech signal that has reached the nontest ear,
MMin ¼ ACNT þ OE þ 10 dB
represents the minimum masking level required. How-
Whenever an earphone covers or occludes the non- ever, the presence of air-bone gaps in the nontest ear will
test ear during masked bone-conduction audiometry, reduce the e¤ectiveness of the masker. Consequently, the
Masking 503
minimum masking level must be increased by the size References

of the air-bone gap. Following the recommendation of
Coles and Priede (1975), the maximum air-bone gap in American National Standards Institute. (1996). American Na-
the nontest ear should be considered when determining tional Standard specification for audiometers (ANSI S3.6-
1996). New York: Author.
minimum masking level. American Speech-Language-Hearing Association. (1990).
Maximum masking level (MMax ), adapted from Guidelines for audiometric symbols. ASHA, Supplement 2,
Lidén, Nilsson, and Anderson (1959), can be summar- 25–30.
ized using the following equation: Berger, E. H., and Kerivan, J. E. (1983). Influence of physio-
logical noise and the occlusion e¤ect on the measurement of
MMax ¼ Best BCT þ IA 5 dB
real-ear attenuation at threshold. Journal of the Acoustical
Best BCT represents the best bone-conduction threshold Society of America, 74, 81–94.
in the test ear in the frequency range from 250 through Coles, R. R. A., and Priede, V. M. (1970). On the misdiagnosis
4000 Hz, and IA is the interaural attenuation value for resulting from incorrect use of masking. Journal of Laryn-
gology and Otology, 84, 41–63.
speech. There is the assumption that the best bone-
Coles, R. R. A., and Priede, V. M. (1975). Masking of the non-
conduction threshold is most susceptible to the e¤ects of test ear in speech audiometry. Journal of Laryngology and
overmasking. If Best BCT þ IA is just su‰cient to pro- Otology, 89, 217–226.
duce overmasking, then a slightly lower masking level Dean, M. S., and Martin, F. N. (2000). Insert earphone depth
than the calculated value must be used clinically. Be- and the occlusion e¤ect. American Journal of Audiology, 9,
cause masking level is typically adjusted using a 5-dB 131–134.
step size, a value of 5 dB is subsequently subtracted from Dirks, D. D. (1994). Bone-conduction threshold testing. In
the calculated value. J. Katz (Ed.), Handbook of clinical audiology (4th ed., pp.
Although Studebaker (1962) originally described an 132–146). Baltimore: Williams and Wilkins.
equation for calculating midmasking level during pure- Dirks, D. D., and Malmquist, C. (1964). Changes in bone-
conduction thresholds produced by masking in the non-
tone bone-conduction testing, the basic principles un-
test ear. Journal of Speech and Hearing Research, 7, 271–
derlying the midplateau method also can be applied 278.
e¤ectively during speech audiometry. A direct approach E-A-R Auditory Systems. (1997). Instructions for the use of
to calculating the midmasking level (MMid ) involves E-A-RTONE 3A insert earphones (rev. ed.). Indianapolis,
determining the arithmetic mean of the minimum and IN: Author.
maximum masking levels: Elpern, B. S., and Naunton, R. F. (1963). The stability of the
occlusion e¤ect. Archives of Otolaryngology, 77, 376–382.
MMid ¼ (MMin þ MMax )=2 Etymotic Research. (1991). ER-3A Tubephone insert earphone.
Yacullo (1999) has described a simplified approach Elk Grove Village, IL: Author.
Goldstein, D. P., and Hayes, C. S. (1965). The occlusion e¤ect
to selecting an appropriate level of contralateral mask-
in bone-conduction hearing. Journal of Speech and Hearing
ing during speech audiometry. Although this approach Research, 8, 137–148.
proves most e¤ective during assessment of suprathresh- Hood, J. D. (1960). The principles and practice of bone-
old speech recognition, it can also be applied during conduction audiometry. Laryngoscope, 70, 1211–1228.
threshold measurement. Stated simply, e¤ective masking Killion, M. C., Wilber, L. A., and Gudmundsen, G. I. (1985).
level is equal to the presentation level of the speech sig- Insert earphones for more interaural attenuation. Hearing
nal in dB HL at the test ear minus 20 dB: Instruments, 36, 34–36.
Konkle, D. F., and Berry, G. A. (1983). Masking in speech
dB EM ¼ PLT 20 dB audiometry. In D. F. Konkle and W. F. Rintelmann (Eds.),
Principles of speech audiometry (pp. 285–319). Baltimore:
The procedure proves very e¤ective given the following University Park Press.
two prerequisite conditions: (1) there are no signifi- Lidén, G., Nilsson, G., and Anderson, H. (1959). Masking in
clinical audiometry. Acta Oto-Laryngologica, 50, 125–136.
cant air-bone gaps (i.e., b15 dB) in either ear, and (2) Martin, F. N. (1966). Speech audiometry and clinical masking.
speech is presented at a moderate sensation level (i.e., Journal of Auditory Research, 6, 199–203.
30–40 dB) relative to the measured or estimated SRT. Martin, F. N. (1967). A simplified method for clinical masking.
Given these two prerequisites, the selected masking level Journal of Auditory Research, 7, 59–62.
will occur approximately at midplateau. Martin, F. N. (1974). Minimum e¤ective masking levels in
The plateau masking procedure, described earlier as a threshold audiometry. Journal of Speech and Hearing Dis-
popular method for obtaining masked pure-tone thresh- orders, 39, 280–285.
old, also can be applied e¤ectively during measurement Martin, F. N., Bailey, H. A. T., and Pappas, J. J. (1965). The
of both speech detection and speech recognition thresh- e¤ect of central masking on threshold for speech. Journal of
olds (i.e., SDT and SRT). A major advantage of the Auditory Research, 5, 293–296.
Martin, F. N., and Blythe, M. E. (1977). On the cross hearing
plateau procedure is that information about bone-con- of spondaic words. Journal of Auditory Research, 17, 221–
duction sensitivity in each ear is not required when 224.
selecting appropriate masking levels. The reader is re- Martin, F. N., Butler, E. C., and Burns, P. (1974). Audiometric
ferred to Yacullo (1996) for further discussion. Bing test for determination of minimum masking levels for
bone conduction tests. Journal of Speech and Hearing Dis-
—William S. Yacullo orders, 39, 148–152.
Martin, F. N., Champlin, C. A., and Chambers, J. A. (1998). Smith, B. L., and Markides, A. (1981). Interaural attenuation
Seventh survey of audiometric practices in the United for pure tones and speech. British Journal of Audiology, 15,
States. Journal of the American Academy of Audiology, 9, 49–54.
95–104. Zwislocki, J. (1953). Acoustic attenuation between the ears.
Martin, F. N., and DiGiovanni, D. (1979). Central masking Journal of the Acoustical Society of America, 25, 752–759.
e¤ects on spondee threshold as a function of masker sensa-
tion level and masker sound pressure level. Journal of the
American Auditory Society, 4, 141–146. Middle Ear Assessment in the Child
Sanders, F. W., and Rintelmann, W. F. (1964). Masking in
audiometry. Archives of Otolaryngology, 80, 541–556.
Sklare, D. A., and Denenberg, L. J. (1987). Interaural attenu- Current clinical methods of assessment of the middle ear
ation for Tubephone insert earphones. Ear and Hearing, 8,
298–300. in children include otoscopic examination, acoustic im-
Snyder, J. M. (1973). Interaural attenuation characteristics in mittance measures, and reflectometry. When assessing
audiometry. Laryngoscope, 73, 1847–1855. middle ear function, a good otoscopic evaluation is the
Studebaker, G. A. (1962). On masking in bone-conduction first step. Examination of the ear canal for any obstruc-
testing. Journal of Speech and Hearing Research, 5, 215– tions that would preclude placement of a probe such as
227. is used for acoustic immittance measures is essential.
Studebaker, G. A. (1964). Clinical masking of air- and bone- Often, cerumen in the ear canal becomes impacted, even
conducted stimuli. Journal of Speech and Hearing Disorders, in children, and thereby confounds tympanometric mea-
29, 23–35. sures. Even when not impacted, cerumen can clog the
Studebaker, G. A. (1967). Clinical masking of the non-test
immittance probe and cause invalid measurements.
ear. Journal of Speech and Hearing Disorders, 32, 360–
371. Otoscopy can also be useful in identifying middle ear
Studebaker, G. A. (1979). Clinical masking. In W. F. Rintel- disorders such as middle ear e¤usion. Examination of
mann (Ed.), Hearing assessment (pp. 51–100). Baltimore: the tympanic membrane can provide evidence of fluid by
University Park Press. its opacity or color. An opaque or yellow membrane, for
Wegel, R. L., and Lane, G. I. (1924). The auditory masking of example, might indicate middle ear e¤usion. Clearer ev-
one pure tone by another and its probable relation to the idence comes when a fluid meniscus or bubbles can be
dynamics of the inner ear. Physics Review, 23, 266–285. seen through a transparent tympanic membrane. Some-
Yacullo, W. S. (1996). Clinical masking procedures. Needham one skilled with pneumatic otoscopy can examine the
Heights, MA: Allyn and Bacon. membrane’s mobility by applying slight changes in air
Yacullo, W. S. (1999). Clinical masking in speech audiometry:
pressure in the ear canal. However, the ability to use a
A simplified approach. American Journal of Audiology, 8,
106–116. pneumatic otoscope for the diagnosis of middle ear
e¤usion is highly variable. Of course, in the clinical
examination associated with the diagnosis of middle
Further Readings ear e¤usion, visual examination of the ear canal and
Chaiklin, J. B. (1967). Interaural attenuation and cross-hearing tympanic membrane is important so that other con-
in air-conduction audiometry. Journal of Auditory Research, ditions (e.g., cholesteatoma, retraction pocket) may be
7, 413–424. identified.
Dirks, D. D., and Swindeman, J. G. (1967). The variability of
occluded and unoccluded bone-conduction thresholds. Acoustic Immittance: Tympanometry
Fletcher, H. (1940). Auditory patterns. Review of Modern Tympanometry, unlike pneumatic otoscopy, is a derived
Physics, 12, 47–65. physiological measure that requires instrumentation that
Gelfand, S. A. (2001). Clinical masking. In Essentials of meets a standard of the American National Standards
audiology (2nd ed., pp. 291–317). New York: Thieme. Institute (ANSI S3.39, 1987) and is a test that is easily
Hawkins, J. E., and Stevens, S. S. (1950). Masking of pure
administered. The principles of acoustic immittance
tones and of speech by white noise. Journal of the Acoustical
Society of America, 22, 6–13. measurement are covered elsewhere.
Hodgson, W., and Tillman, T. (1966). Reliability of bone con- Acoustic immittance is a term used to describe the
duction occlusion e¤ects in normals. Journal of Auditory ability of energy to flow through the middle ear. The
Research, 6, 141–151. word immittance is a combination of the words imped-
Hosford-Dunn, H., Kuklinski, A. L., Raggio, M., and Hagg- ance (opposition to the flow of energy) and admittance
erty, H. S. (1986). Solving audiometric masking dilemmas (ease with which energy flows). Today, most immit-
with an insert masker. Archives of Otolaryngology–Head tance instruments measure acoustic admittance, so the
and Neck Surgery, 112, 92–95. description that follows will consider only acoustic ad-
Lidén, G., Nilsson, G., and Anderson, H. (1959). Narrow- mittance measures. Tympanometry is a dynamic mea-
band masking with white noise. Acta Oto-Laryngologica,
surement of acoustic admittance as air pressure in the
50, 116–124.
Martin, F. N. (1980). The masking plateau revisited. Ear and ear canal is varied. The tympanogram is the graph of
Hearing, 1, 112–116. acoustic admittance in mmhos (or in equivalent volume
Naunton, R. F. (1960). A masking dilemma in bilateral units such as milliliters) versus air pressure in deca-
conduction deafness. Archives of Otolaryngology, 72, 753– pascals (daPa). To estimate admittance, a tone is deliv-
757. ered to the ear via a probe hermetically sealed in the ear
Middle Ear Assessment in the Child 505
canal. The probe tone is the force that activates the Pattern Classification of Tympanograms
middle ear system, while a microphone that is connected
to a separate opening in the probe records sound pres- Historically, the most common way to interpret tympa-
sure in the ear canal. A third opening in the probe con- nometric data has been to classify tympanograms ac-
nects to a pneumatic system for varying air pressure in cording to their pattern. The most notable and widely
the ear canal. used pattern classification scheme was proposed by
The first measurement made in tympanometry is an Jerger (1970). This pattern classification scheme is easy
estimate of the volume of the space between the probe to use, with patterns that are easy to identify and few in
tip and the tympanic membrane and is called ear canal number. The pattern classifications were based originally
volume or equivalent volume. This is done with a low- on the height of the tympanogram (in arbitrary ‘‘com-
frequency (226-Hz) probe tone when the pressure in the pliance’’ units) and the tympanometric peak pressure.
ear canal is set to a very high value (either positive or The patterns are identified by the letters A, B, and C,
negative). The extreme pressure sti¤ens the tympanic with subclassifications used to better define them. A
membrane such that admittance of the middle ear is normal tympanogram is A, with As and Ad representing
diminished to (theoretically) zero. Thus, the acoustic abnormally low peak compliance and abnormally high
admittance detected by the instrument is a measure of peak compliance, respectively, in the presence of normal
the admittance of the volume of air enclosed in the ear tympanometric peak pressure. Type B is a tympanogram
canal and, with the 226-Hz probe tone only, can be with no peak, and type C is a tympanogram with a high
expressed as an equivalent volume of air in the ear canal. negative tympanometric peak pressure.
This ear canal volume measure is useful for determining In children, the most common reason for assessment
the validity of the probe fit in the ear canal and, when of middle ear function is to identify middle ear e¤usion.
larger than normal, suggests an opening in the tympanic As a result, much of the published research on tympan-
membrane. This is useful when trying to determine if ometry in children relates to identification of middle ear
there is a perforation in the tympanic membrane or if a e¤usion. In most studies of the ability of tympanometry
pressure equalization tube is functioning. to identify ears with middle ear e¤usion, the type B
The normal tympanogram has the shape of an in- tympanogram (no peak) has high predictive ability; that
verted V, with the peak amplitude referred to as peak is, a tympanogram that has no peak will very likely be
admittance or static admittance. When the admittance associated with an ear with e¤usion. However, the op-
attributed to the ear canal volume is automatically sub- posite is not necessarily true: ears with middle ear e¤u-
tracted from the dynamic admittance measure, peak sion most often produce tympanograms that cannot be
admittance is referred to as peak compensated acoustic classified as type B. Rarely is an ear with middle ear ef-
admittance. The magnitude of the peak in mmhos of fusion associated with a type A (normal peak, normal
admittance carries diagnostic information. Because there tympanometric peak pressure), so many ears with mid-
is developmental change in acoustic admittance varia- dle ear e¤usion fall in the C category (normal peak
bles, peak admittance in a given case must be compared admittance, abnormally negative tympanometric peak
with age-appropriate normative values. Low peak ad- pressure). However, many ears with no middle ear e¤u-
mittance when tympanometric peak pressure (the air sion also fall into the C category. Subdivisions of the C
pressure in the ear canal at which the peak occurs) is category have emerged to try to improve the diagnostic
within normal range suggests that the middle ear space is ability of tympanometry, but even with more specific
well aerated but that there is a condition that is reducing information regarding tympanometric peak pressure, the
the ability of sound to flow through. A condition that ability of the C category to help with identification of
increases sti¤ness in the middle ear system, such as middle ear e¤usion varies widely across studies (Orchik,
stapes fixation, might result in such a pattern. Dunn, and McNutt, 1978).
If peak admittance is greater than the normal range, Because of the ambiguity in the C category, even
with tympanometric peak pressure within the normal when subdivided, attempts to further improve identifi-
range, there is a condition of the middle ear that is in- cation of middle ear e¤usion have included addition of
creasing admittance, such as an ossicular disarticulation. the acoustic reflex test, in which case an equivocal tym-
Because middle ear function is being inferred from ad- panometric pattern accompanied by an absent acoustic
mittance changes based on sound reflected from the reflex is considered evidence of middle ear e¤usion. Also,
tympanic membrane, abnormalities of that membrane schemes with very detailed categories have been devel-
can influence the measurement. For example, scarring oped (e.g., Cantekin et al., 1980).
on the membrane can also cause an abnormally high There are several drawbacks to the A, B, C pattern
admittance tympanogram in the presence of an other- classification scheme. One is that the patterns were de-
wise normal middle ear system. termined based on clinical observations and not on sta-
In an ear with eustachian tube dysfunction, in which tistical analysis of performance in a controlled clinical
middle ear pressure varies and is not equalized easily, trial with a reference standard such as myringotomy for
tympanometry can give valuable information. The tym- the diagnosis of middle ear e¤usion. Many studies that
panometric peak pressure is a reasonably good estimate have been done to determine the performance charac-
of middle ear pressure, so a negative value is often diag- teristics of the pattern classification of tympanograms
nostic of eustachian tube dysfunction. used predetermined categories rather than collecting
tympanometric data and then using statistical techniques Tympanometric width is the width of the tympano-
to determine which characteristics are best able to pre- gram in daPa at half the peak admittance. The more
dict middle ear status. A second drawback is that tym- rounded or flat the tympanogram becomes, the greater
panograms produced using instruments meeting the 1987 will be the tympanometric width. Tympanometric width
ANSI standard are based on absolute physical quantities has good diagnostic value for identification of middle ear
and are not directly comparable to the tympanograms e¤usion in children (Margolis and Heller, 1987; Nozza
using arbitrary compliance units on which the A, B, C et al., 1994). Using quantitative analysis, Nozza et al.
scheme was developed. (1994) reported that using tympanometric width greater
A third drawback to the A, B, C classification scheme than 275 daPa as a criterion for identification of middle
is that it does not incorporate very much information ear e¤usion had a sensitivity of 81% and a specificity of
about the shape, or gradient, of the tympanogram. Data 82% in a group of children undergoing myringotomy
from studies using absolute physical units of immittance and tube surgery. This was the best performance of any
have shown that tympanogram shape carries useful in- single tympanometric variable. For peak admittance, a
formation regarding middle ear status that is not readily cuto¤ of 0.3 mmho separated the ears, with a sensitivity
available in the A, B, C classification scheme (Brooks, and specificity nearly as good as the best cuto¤ for tym-
1968; Koebsell and Margolis, 1986; Nozza et al., 1992, panometric width. Interestingly, tympanometric peak
1994). pressure alone was the worst at separating ears with and
without middle ear e¤usion and contributed nothing
Quantitative Analysis of Tympanograms when used in combination with other variables. This
suggests that the weight that tympanometric peak pres-
Instruments developed since 1987, when the ANSI stan- sure carries in the pattern classification scheme is prob-
dard became e¤ective, provide immittance information ably not warranted when it comes to identification of
using absolute physical quantities rather than arbitrary ears with middle ear e¤usion. The acoustic reflex alone
compliance values such as were used when the pattern was examined as well, and overall performance was only
classification scheme was developed. There are several fair because specificity was poor. Too often, no acoustic
advantages to using a quantitative analysis of tympano- reflex can be measured in children, even in the absence
metric data. First, such measures are standardized of middle ear e¤usion. Also, because the reflex relies on
for all instruments and, therefore, across clinics and an auditory system and elements of the central nervous
laboratories. Second, tympanometric shape can be system that are su‰ciently intact, an immittance crite-
quantified. Third, unlike the pattern categories, the rion for identification of middle ear e¤usion that in-
data from current instruments that measure actual cludes the acoustic reflex will not be applicable to
physical quantities of admittance are on continua that children with high degrees of hearing impairment or
permit statistical analyses. Measures that may be used in certain neurological problems.
a quantitative analyses include peak compensated Because the data from studies such as that of Nozza
acoustic admittance, tympanometric peak pressure, ab- et al. (1994) come from children from a special popula-
solute gradient, relative gradient, and tympanometric tion, those undergoing myringotomy and tube surgery,
width. they may not be representative of the general pediatric
The absolute and relative gradients and tympano- population, in whom middle ear assessment is so impor-
metric width are measures used to quantify the shape tant. Recommendations for criteria for identification of
of the tympanogram. To compute absolute gradient, a middle ear e¤usion in the general population can be
horizontal line is drawn between the sides of the tym- found in guidelines for screening that have been pub-
panogram at the point where the tympanogram is lished by the American Academy of Audiology (1997)
100 daPa wide. This serves as a temporary baseline from and the American Speech-Language-Hearing Associa-
which the distance to the peak of the tympanogram is tion (ASHA, 1997). The American Academy of Audiol-
measured in mmho. This value is the absolute gradient ogy suggests using tympanometric variables of peak
and increases in value as tympanogram shape becomes admittance or tympanometric width. In both guidelines,
more sharply peaked. Relative gradient is determined by the notion is that a positive result on tympanometric
dividing the absolute gradient by the peak admittance. screening should not result in immediate referral but
This results in a ratio that can range from 0 to 1. Again, should indicate the need for rescreening in 6 to 8 weeks.
the greater the relative gradient, the more sharply Because middle ear e¤usion is a transient, often self-
peaked is the tympanogram. Few instruments report limiting disorder, referrals based on a single test result in
absolute gradient, but some will report relative gradient a high over-referral rate. ASHA recommends that an ear
to characterize the rate of change of the tympanogram in pass the tympanometric screen when peak admittance is
the region of the peak (i.e., tympanogram shape). Abso- b0.3 mmho or tympanometric width is <200 daPa
lute and relative gradients have been examined in the when screening children from the general population for
past for their contribution to the diagnosis of middle middle ear e¤usion. The American Academy of Audiol-
ear e¤usion (Brooks, 1968). The lower the gradient, the ogy uses a similar protocol, with b0.2 mmho a passing
more rounded or flat the tympanogram and the greater indication for peak admittance and <250 daPa a passing
the likelihood of middle ear e¤usion. indication for tympanometric width.
Middle Ear Assessment in the Child 507
High-Frequency Tympanometry in Infants aural acoustic impedance and admittance (aural acoustic
immittance) (ANSI S3.39). New York: Acoustical Society
With universal newborn hearing screening increasing the of America.
number of young infants referred for rescreening and American Speech-Language-Hearing Association. (1997).
audiological assessments, it is important to consider the Guidelines for audiologic screening. Rockville, MD: Author.
special circumstances related to middle ear assessment in Brooks, D. N. (1968). An objective method of detecting fluid in
that population. It has long been known that tympan- the middle ear. International Audiology, 7, 280–286.
ometry using low-frequency probe tones is unreliable in Cantekin, E. I., Bluestone, C. D., Fria, T. J., Stool, S. E.,
Beery, Q. C., and Sabo, D. L. (1980). Identification of otitis
young infants. Infants with middle ear e¤usion can pro-
media with e¤usion in children. Annals of Otology, Rhinol-
duce tympanograms that appear normal, presumably ogy and Laryngology, 89(Suppl. 68), 190–195.
because of distensibility of the walls of the ear canal. Jerger, J. (1970). Clinical experience with impedance audio-
Recent work using multifrequency tympanometry has metry. Archives of Otolaryngology, 92, 311–324.
demonstrated that young infants (<4 months) with mid- Koebsell, K. A., and Margolis, R. H. (1986). Tympanometric
dle ear e¤usion might produce a normal tympanogram gradient measured from normal preschool children. Audiol-
with a 226-Hz probe tone but an abnormal tympano- ogy, 25, 149–157.
gram when a high-frequency probe, 1000 Hz in particu- Margolis, R. H., and Heller, J. W. (1987). Screening tympan-
lar, is used (McKinley, Grose, and Roush, 1997; Rhodes ometry: Criteria for medical referral. Audiology, 26, 197–
et al., 1999; Purdy and Williams, 2000). Use of a 1000- 208.
McKinley, A. M., Grose, J. H., and Roush, J. (1997). Multi-
Hz probe tone is recommended now for identification of
frequency tympanometry and evoked otoacoustic emissions
middle ear e¤usion in young infants. in neonates during the first 24 hours of life. Journal of the
American Academy of Audiology, 8, 218–223.
Reflectometry Nozza, R. J., Bluestone, C. D., Kardatzke, D., and Bachman,
R. N. (1992). Towards the validation of aural acoustic
An alternative method for identification of middle ear
immittance measures for diagnosis of middle ear e¤usion in
e¤usion uses a measurement called acoustic reflec- children. Ear and Hearing, 13, 442–453.
tometry. The acoustic reflectometer generates a broad Nozza, R. J., Bluestone, C. D., Kardatzke, D., and Bachman,
band sound in the ear canal and measures the sound en- R. N. (1994). Identification of middle ear e¤usion by aural
ergy reflected back from the tympanic membrane. The acoustic admittance and otoscopy. Ear and Hearing, 15,
instrument is hand-held and has a speculum-like tip that 310–323.
is put into the entrance of the ear canal. No hermetic Orchik, D. J., Dunn, J. W., and McNutt, L. (1978). Tympa-
seal is required, thus making the test desirable for use nometry as a predictor of middle ear e¤usion. Archives of
with children. The relationship between the known out- Otorhinolaryngology, 104, 4–6.
put of the device and the resultant sound in the ear canal Purdy, S. C., and Williams, M. J. (2000). High frequency tym-
panometry: A valid reliable immittance test protocol for
provides diagnostic information. An early version of the
young infants? New Zealand Audiological Society Bulletin,
instrument used only the sound pressure level in the ear 109, 9–24.
canal in the diagnostic decision. It was later determined Rhodes, M. C., Margolis, R. H., Hirsch, J. E., and Napp, A. P.
that, by plotting out the reflectivity data on frequency by (1999). Hearing screening in the newborn intensive care
amplitude axes, better diagnostic information regarding nursery: Comparison of methods. Otolaryngology–Head
middle ear e¤usion could be derived. The current version and Neck Surgery, 120, 799–808.
of the instrument analyzes automatically the frequency-
amplitude relationship in the reflected sound and dis-
plays a number between 1 and 5 to indicate the like- Further Readings
lihood of an e¤usion. This instrument is getting some Brooks, D. N. (1969). The use of the electro-acoustic imped-
favorable use in primary care settings for the identifica- ance bridge in the assessment of middle ear function. Inter-
tion of both acute otitis media and asymptomatic otitis national Audiology, 8, 563–569.
media with e¤usion. Cantekin, E. I. (1983). Algorithm for diagnosis of otitis media
See also hearing loss screening: the school-age with e¤usion. Annals of Otology, Rhinology, and Laryngol-
child; otitis media: effects on children’s language; ogy, 92(Suppl. 107), 6.
pediatric audiology: the test battery approach; Fiellau-Nikolajsen, M. (1980). Tympanometry and middle
physiological bases of hearing; tympanometry. ear e¤usion: A cohort-study in three-year-old children. In-
ternational Journal of Pediatric Otorhinolaryngology, 2, 39–
—Robert J. Nozza 49.
Finitzo, T., Friel-Patti, S., Chinn, K., and Brown, O. (1992).
References Tympanometry and otoscopy prior to myringotomy: Issues
in diagnosis of otitis media. International Journal of Pediat-
American Academy of Audiology. (1997). Position statement: ric Otorhinolaryngology, 24, 101–110.
Identification of hearing loss and middle ear dysfunction in Grimaldi, P. M. G. B. (1976). The value of impedance testing
preschool and school-age children. Audiology Today, 9(3), in diagnosis of middle ear e¤usion. Journal of Laryngology
21–23. and Otology, 90, 141–152.
American National Standards Institute. (1987). American na- Haughton, P. M. (1977). Validity of tympanometry for middle
tional standard specifications for instruments to measure ear e¤usions. Archives of Otolaryngology, 103, 505–513.
Liden, G. (1969). The scope and application of current audio- other factors such as chemicals, solvents, and toxic sub-
metric tests. Journal of Laryngology and Otology, 83, 507– stances such as carbon monoxide can be significant and
520. are an active area of investigation (Morata and Dunn,
Marchant, C. D., McMillan, P. M., Shurin, P. A., et al. (1986). 1995).
Objective diagnosis of otitis media in early infancy by tym-
The database available on noise-induced hearing loss
panometry and ipsilateral acoustic reflex thresholds. Journal
of Pediatrics, 109, 590–595. caused by exposure to steady-state noise is substantial.
Northern, J. L. (1992). Special issues concerned with screening Some of the data are from laboratory studies of tempo-
for middle ear disease in children. In F. H. Bess and J. W. rary e¤ects in human subjects (Davis et al., 1950; Mills
Hall III (Eds.), Screening children for auditory function. et al., 1970; Melnick and Maves, 1974) and both tem-
Nashville, TN: Bill Wilkerson Center Press. porary and permanent e¤ects in laboratory animals
Nozza, R. J. (1995). Critical issues in acoustic-immittance (Miller, Watson, and Covell, 1963; Mills, 1973). Other
screening for middle-ear e¤usion. Seminars in Hearing, data are from studies of permanent hearing loss in
16(1), 86–98. humans in occupational settings (Taylor et al., 1965;
Nozza, R. J. (1998). Identification of otitis media. In F. Bess Burns and Robinson, 1970; Johnson, 1991). These and
(Ed.), Children with hearing impairment: Contemporary
other field studies of noise-induced permanent hearing
trends (pp. 207–214). Nashville, TN: Bill Wilkerson Center
Press. loss (see Johnson, 1991) are the scientific bases of inter-
Shanks, J. E. (1984). Tympanometry. Ear and Hearing, 5, 268– national (International Organization for Standardiza-
280. tion [ISO], 1990) and American standards (American
Teele, D. W., and Teele, J. (1984). Detection of middle ear ef- National Standards Institute [ANSI], 1996), which pres-
fusion by acoustic reflectometry. Journal of Pediatrics, 104, ent methods to estimate noise-induced permanent
832–838. threshold shifts as a function of A-weighted sound level
Toner, J. G., and Mains, B. (1990). Pneumatic otoscopy and and years of exposure time. The development and ac-
tympanometry in the detection of middle ear e¤usion. Clin- ceptance of these standards represents many years of
ics of Otolaryngology, 15, 121–123. work and intense debate. Regulations for industry are
Van Camp, K. J., Margolis, R. H., Wilson, R. H., Creten,
given by the U.S. Department of Labor (Occupational
W. L., and Shanks, J. E. (1986). Principles of tympanom-
etry. ASHA Monograph, 1–88. Safety and Health Administration [OSHA], 1983).
Vaughan-Jones, R., and Mills, R. P. (1992). The Welch Allyn Data from humans and animals exposed to a wide
Audioscope and Microtymp: Their accuracy and that of variety of steady-state noises suggest that the range of
pneumatic otoscopy, tympanometry and pure tone audio- human audibility can be categorized with respect to risk
metry as predictors of otitis media with e¤usion. Journal of of acoustic injury of the ear and noise-induced hearing
Laryngology and Otology, 106, 600–602. loss. This categorization is shown in Figure 1, where the
Wilber, L. A., and Feldman, A. S. (1976). Acoustic impedance range of human audibility is bounded by the threshold of
and admittance: The measure of middle ear function. Balti-
more: Williams and Wilkins.
Noise-Induced Hearing Loss
Hearing loss a¤ects about 28 million Americans. The

two most common causes of sensorineural hearing loss
are aging and exposure to noise. Noise exposure can in-
jure the ear and produce loss of hearing. The injury and
hearing loss can be temporary—that is, fully recoverable
after the noise exposure is terminated—or permanent.
When hearing loss is measured at a postexposure time of
2 weeks, it is considered to be permanent, inasmuch as
very little additional recovery occurs at postexposure
times in excess of 2 weeks (Miller, Watson, and Covell,
1963; Mills, 1973). The presence and severity of a tem-
porary or permanent hearing loss depend on several
factors and the susceptibility of the individual. Acousti-
cally, the level (intensity), spectrum (frequency), and
temporal properties (duration, intermittency, number)
Figure 1. Categorization of the range of human audibility with
of the exposures are the most pertinent properties. respect to acoustic injury of the ear and noise-induced hearing
Nonacoustic factors such as interactions with various loss. (From Mills, J. H., et al., 1993, Hazardous Exposure to
medicines or drugs, eye color, smoking, sex, and other Steady-State and Intermittent Noise. Working Group Report,
personal characteristics of an individual (except aging Committee on Hearing, Bioacoustics and Biomechanics, Na-
and hearing loss) are second-order e¤ects, or are not tional Research Council. Washington, DC: National Academy
consistently observed (Ward, 1995). Interactions with Press. Reproduced with permission.)
Noise-Induced Hearing Loss 509
audibility at one extreme and the threshold of pain (Yost ear play a prominent role. The external auditory meatus
and Nielsen, 1985) at the other. Of course, sounds below (essentially a tube open at one end with a length of about
the threshold of audibility are inaudible and present no 25 mm) has a resonant frequency of about 3 kHz and a
risk of noise-induced hearing loss. Sounds in excess of gain of about 20 dB. Thus, the typical industrial or en-
the threshold of pain present a risk of acoustic injury of vironmental noise, which may have a flat or slightly
the ear and noise-induced hearing loss even with one, downward-sloping spectrum when measured in the field,
short exposure (see Mills et al., 1993). Between the will have a peak at 3 kHz because of the external ear
extremes of pain and audibility are acoustic injury canal. Thus, the ‘‘4-kHz notch’’ in the audiogram that is
thresholds (open triangles in Fig. 1). These thresholds characteristic of noise-induced hearing loss (and head
define the highest levels of noise that will not produce a injuries) may reflect the acoustic properties of the exter-
noise-induced threshold shift regardless of the duration nal ear. In the middle ear, the acoustic reflex, the con-
of exposure, the number of exposures, or the temporal sensual contraction of the stapedial and tensor tympani
properties of the exposure. The data points in Figure 1 muscles, may reduce the level of intense sounds. In ad-
are from temporary threshold shift experiments for oc- dition, the e¤erent innervation of outer hair cells may
tave bands of noise from 63 Hz to 4 kHz in octave steps. have a protective role (Maison and Liberman, 2000).
Data at lower and higher frequencies are extrapolations. Although there has been a substantial e¤ort, the ana-
Thus, between the extremes of the threshold of pain and tomical, chemical, and biological bases of noise-induced
audibility are two categories: no risk and risk. The no- temporary threshold shift are unknown. The pathologi-
risk category can also be described as ‘‘e¤ective quiet’’ cal anatomy associated with noise-induced permanent
(Ward, Cushing, and Burns, 1976). The region bounded hearing loss involves the organ of Corti, especially the
by safe levels on the low side and threshold of pain on hair cells. Loss of outer hair cells is the most prominent
the high side is the area where the risk of hearing loss anatomical feature of permanent noise-induced loss, and
and acoustic injury of the ear depends on the parameters is almost always greater than the loss of inner hair cells.
of the noise exposure as well as on the susceptibility of This greater loss of outer than inner hair cells may occur
the individual. In qualitative terms, risk increases with for several reasons, including the direct shearing forces
noise level, duration, number of exposures, and suscep- on the outer hair cell stereocilia, which are embedded in
tibility of the individual. Although individual di¤erences the tectorial membrane. The correlations between loss of
can be substantial, no method has been developed that hair cells, both inner and outer, and permanent thresh-
allows the a priori identification of those individuals old shift are very high, ranging from 0.6 to 0.8, depend-
who are most susceptible to noise-induced hearing loss. ing on frequency (Hamernik et al., 1989). Even with
Quantitative relations between noise-induced hearing such high correlations there remains considerable vari-
loss and exposure parameters are given in ANSI S3.44- ance between hair cell loss and permanent threshold
1996. shift. This variance can be reduced by consideration of
Whereas the database is massive for noise-induced the status of the stereocilia (Liberman and Dodds,
hearing losses produced by exposure to continuous noise 1984). With degeneration of inner hair cells following
(see Johnson, 1991), it is unimpressive for intermittent severe exposures, there can be retrograde degeneration
(quiet periods of a few seconds to a few hours) and time- of auditory nerve fibers, as indicated by losses of spiral
varying (level fluctuations greater than 10 dB) noises. ganglion cells. Neural degeneration is not restricted to
On a qualitative basis, there is agreement that intermit- the auditory nerve but progresses throughout the as-
tent and fluctuating noises are less hazardous than con- cending auditory system (Morest, 1982). Regeneration
tinuous noises, presumably because the ‘‘quiet’’ periods of hair cells has been observed after intense exposures to
allow time for the ear to recover. Because of regulatory noise. This dramatic e¤ect has been reported only for the
e¤orts in noise control and a perceived need for sim- cochlea of various species of birds. Regenerating sensory
plicity, several single-number correction factors have cells have not been observed in the cochlea of mammals.
evolved. That is, as an exposure is increased from 4 Reactive oxygen species and oxidative stress have
hours to 8 hours, what change in noise level is needed to been implicated in the production of noise-induced
maintain an equal risk of hearing loss? The equal-energy hearing loss and in age-related hearing loss as well
rule specifies a 3-dB reduction in noise level for a dou- (Ohlemiller et al., 2000). It is believed that acute im-
bling of exposure duration. This rule is incorporated into pairment of antioxidant defenses promotes cochlear in-
the ISO 1999 standard. Other standards and regulations jury, and conversely, strengthening antioxidant defenses
use 4-dB, 5-dB, and 6-dB rules, as well as other more should provide protection, including possibly rescuing
complicated schemes (see Ward, 1991). It is likely that cells that are in the early stages of injury. E¤orts at pre-
each of these single-number rules may apply only to a vention and rescue are in the early stage of development,
restricted set of exposure conditions. With the continued with some promising initial results (Hu et al., 1997).
absence of needed data, the e¤ects of intermittence on Additional protective functions can be obtained by con-
noise-induced hearing loss may always be a contentious ditioning exposures or exposures that protect the ear
issue. from subsequent noise (Canlon, Borg, and Flock, 1988).
Although the biological bases of noise-induced hear- A related phenomenon is improvements (reductions) in
ing loss have been studied extensively, with the greatest threshold shifts observed during the course of an ex-
emphasis on the cochlea, both the external and middle tended sequence of intermittent exposure to noise
(Miller, Watson, and Covell, 1963; Clark, Bohne, and Hamernik, R. P., Ahroon, W. A., Hsueh, K. D., Lei, S. F., and
Boettcher, 1987). Clearly, noise-induced hearing loss is Davis, R. I. (1993). Audiometric and histological di¤erences
not related simply to the sound level of the exposure. between the e¤ects of continuous and impulsive noise
Acoustic trauma refers to injury of the ear and per- exposures. Journal of the Acoustical Society of America, 93,
2088–2095.
manent hearing loss caused by exposure to an intense,
Hamernik, R. P., and Hsueh, K. D. (1993). Impulse noise:
short-duration sound. In contrast to the gradual loss Some definitions, physical acoustics and other considera-
of outer hair cells and stereocilia typically seen from tions. Journal of the Acoustical Society of America, 90, 189–
steady-state or intermittent exposures with sound levels 196.
less than 100–110 dB SPL, the injury to the organ of Hamernik, R. P., Patterson, J. H., Turrentine, G. A., and
Corti is more extensive, involving the tearing of mem- Ahroon, W. A. (1989). The quantitative relation between
branes, rupturing of cells, and mixing of cochlear fluids. sensory cell loss and hearing thresholds. Hearing Research,
At extremely high sound levels, the tympanic mem- 38, 199–211.
brane and middle ear can be injured, with a resultant Henderson, D., and Hamernik, R. P. (1986). Impulse noise:
conductive/mixed hearing loss. The most common form Critical review. Journal of the Acoustical Society of Amer-
ica, 80, 569–564.
of acoustic trauma is hearing loss associated with the
Hu, B. H., Zheng, X. Y., McFadden, S., and Henderson, D.
impulsive noises produced by small-arms gunfire (Clark, (1997). The protective e¤ects of R-PIA on noise-induced
1991). Hearing loss from impulses is related to the peak hearing loss. Hearing Research, 113, 198–206.
SPL of the impulse, duration, number of impulses, International Organization for Standardization. (1990). Acous-
and other variables (Henderson and Hamernik, 1986; tic determination of occupational noise exposure and esti-
Hamernik and Hsueh, 1993; Hamernik et al., 1993). mation of noise-induced hearing impairment. Geneva:
A longstanding issue is the interaction between noise- International Standard 1999.
induced hearing loss incurred throughout a person’s Johnson, D. L. (1991). Field studies: Industrial expo-
working lifetime and the hearing loss associated with sures. Journal of the Acoustical Society of America, 90, 170–
aging. This issue is particularly important for medical 181.
Liberman, M. C., and Dodds, L. W. (1984). Single-neuron
reasons, for litigation involving worker’s compensation
labeling and chronic cochlear pathology: III. Stereocilia
for occupational hearing loss, and in establishing noise damage and alterations of threshold tuning curves. Hearing
standards and damage-risk criteria. Both the current ISO Research, 16, 55–74.
(1990) and ANSI (1996) standards assume that noise- Maison, S. F., and Liberman, M. C. (2000). Predicting vul-
induced permanent threshold shifts add (in decibels) to nerability to acoustic injury with a noninvasive assay of
age-related threshold shifts, i.e., a 20-dB loss from noise olivocochlear reflex strength. Journal of Neuroscience, 20,
and a 20-dB loss from aging results in a loss of 40 dB. 4701–4707.
Some data from noise-exposed and aged animals do not Melnick, W., and Maves, M. (1974). Asymptotic threshold
support additivity in decibels but additivity in intensity: shift (ATS) in man from 24-hr exposures to continuous
i.e., 20 dB and 20 dB produces a loss of 23 dB (Mills noise. Annals of Otology, Rhinology, and Laryngology, 83,
820–829.
et al., 1997). The issues of additivity and medical-legal
Miller, J. D., Watson, C. S., and Covell, W. (1963). Deafening
aspects of noise-induced hearing loss are discussed by e¤ects of noise on the cat. Acta Otolaryngologica, Supple-
Dobie (2001). ment, 176, 1–91.
—John H. Mills Mills, J. H. (1973). Threshold shifts produced by exposure to
noise in chinchillas with noise-induced hearing losses. Jour-
References nal of Speech and Hearing Research, 16, 700–708.
Mills, J. H., Boettcher, F. A., and Dubno, J. R. (1997). Inter-
American National Standards Institute. (1996). Determina- action of noise-induced permanent threshold shift and age-
tion of occupational noise exposure and estimation of noise- related threshold shift. Journal of the Acoustical Society of
induced hearing impairment (ANSI S3.44-1996). New America, 101, 1681–1686.
York: Acoustical Society of America. Mills, J. H., Clark, W. W., Dobie, R. A., Humes, L. E., John-
Burns, W., and Robinson, D. W. (1970). Hearing and noise in son, D. J., Liberman, M. C., et al. (1993). Hazardous expo-
industry. London: Her Majesty’s Stationery O‰ce. sure to steady-state and intermittent noise. Working Group
Canlon, B. E., Borg, E., and Flock, A. (1988). Protection Report, Committee on Hearing, Bioacoustics and Bio-
against noise trauma by pre-exposure to a low-level acoustic mechanics, National Research Council. Washington, DC:
stimulus. Hearing Research, 34, 197–200. National Academy Press.
Clark, W. W. (1991). Noise exposure from leisure activities. Mills, J. H., Gengel, R. W., Watson, C. S., and Miller, J. D.
Journal of the Acoustical Society of America, 90, 155–163. (1970). Temporary changes of the auditory system due to
Clark, W. W., Bohne, B. A., and Boettcher, F. A. (1987). exposure to noise for one or two days. Journal of the
E¤ects of periodic rest on hearing loss and cochlear damage Acoustical Society of America, 48, 524–530.
following exposure to noise. Journal of the Acoustical Soci- Morata, T. C., and Dunn, D. D. (1995). Occupational hearing
ety of America, 82, 1253–1264. loss. Philadelphia: Hanley and Belfus.
Davis, H., Morgan, C. T., Hawkins, J. E., Galambos, R., and Morest, D. K. (1982). Degeneration in the brain following
Smith, F. W. (1950). Temporary deafness following expo- exposure to noise. In R. P. Hamernik, D. Henderson, and
sure to loud tones and noise. Acta Otolaryngologica, S88, R. J. Salvi (Eds.), New perspectives in noise-induced hearing
1–57. loss (pp. 87–93). New York: Raven Press.
Dobie, R. A. (2001). Medical-legal evaluation of hearing loss Occupational Safety and Health Administration, U.S. Depart-
(2nd ed.). San Diego, CA: Singular Publishing Group. ment of Labor. (1983). Occupational noise exposure: Hear-
Otoacoustic Emissions 511
ing conservation amendment. Final rule. Federal Register, Prascher, D., Henderson, D., and Salvi, R. J. (2002). Noise-
48, 9738. induced hearing loss. Cambridge, U.K.: Cambridge Univer-
Ohlemiller, K. K., McFadden, S. L., Ding, D. L., and Lear, sity Press.
P. M. (2000). Targeted mutation of the gene for cellular Salvi, R. J., Henderson, D., Hamernik, R. P., and Colletti, V.
glutathione peroxidase (Gpx1) increases noise-induced (1985). Basic and applied aspects of noise-induced hearing
hearing loss in mice. Journal of the Association for Research loss. New York: Plenum Press.
in Otolaryngology, 1, 243–254. Satalo¤, R. T., and Satalo¤, J. (1993). Occupational hearing
Taylor, W., Pearson, J., Mair, A., and Burns, W. (1965). Study loss (2nd ed.). New York: Marcel Dekker.
of noise and hearing in jute weaving. Journal of the Acous- Saunders, J. C., Cohen, Y. E., and Szymko, Y. M. (1991). The
tical Society of America, 38, 113–120. structural and functional consequences of acoustic injury
Ward, W. D. (1991). The role of intermittence in PTS. Journal in the cochlea and peripheral auditory system: A five year
of the Acoustical Society of America, 90, 164–169. update. Journal of the Acoustical Society of America, 90,
Ward, W. D. (1995). Endogenous factors related to suscepti- 136–146.
bility to damage from noise. Occupational Medicine, 10,
561–575.
Ward, W. D., Cushing, E. M., and Burns, E. M. (1976). Ef-
fective quiet and moderate TTS: Implications for noise Otoacoustic Emissions
exposure standards. Journal of the Acoustical Society of
America, 59, 160–165.
Yost, W. A., and Nielsen, D. W. (1985). Fundamentals of The 25th anniversary of the discovery of otoacoustic
hearing. New York: Holt, Rinehart, and Winston. emissions (OAEs), the acoustic energy produced by the
cochlea, was celebrated in 2003. Following Kemp’s
Further Readings (1978, 1979a, 1979b) breakthrough descriptions of the
four types of OAEs in humans—the click- or transient-
Axelsson, A., Borchgrevink, H., Hamernik, R. P., Hellstrom, evoked OAE (TEOAE), the distortion product OAE
P. A., Henderson, D., and Salvi, R. J. (1996). Scientific (DPOAE), the stimulus frequency OAE (SFOAE), and
basis of noise-induced hearing loss. New York: Thieme.
Bohne, B. A., and Rabbit, R. (1983). Holes in the reticular
the spontaneous OAE (SOAE)—interest turned to basic
lamina after noise exposure: Implications for continuing research issues. Existing models of cochlear function
damage in the organ of Corti. Hearing Research, 11, 41– were modified to reflect the existence of active processing
53. as implied by the mere reality of OAEs. Also, e¤orts
Cotanche, D. A. (1987). Regeneration of hair cell stereocilia were made to relate OAEs to parallel neural and psy-
bundles in the chick cochlea following severe acoustic choacoustical phenomena, and to describe emitted re-
trauma. Hearing Research, 30, 181–194. sponses in species used as research models, including
Dancer, A. L., Henderson, D., Salvi, R. J., and Hamernik, monkeys, gerbils, guinea pigs, and chinchillas (Zurek,
R. P. (1992). Noise-induced hearing loss. St. Louis: Mosby– 1985).
Year Book. During the early years of OAE study, another great
Hamernik, R. P., Henderson, D., and Salvi, R. P. (1982). New
perspectives on noise-induced hearing loss. New York: Raven
advance in the hearing sciences occurred when Brownell
Press. et al. (1985) discovered electromotility in isolated outer
Henderson, D., Prasher, D., Kopke, R., Salvi, R., and hair cells. The current consensus is that outer hair cell
Hamernik, R. (2001). Noise-induced hearing loss: Basic motility is due to the receptor-potential initiated move-
mechanisms, prevention and control. London: Noise Re- ments of atomic-sized ‘‘motor’’ molecules called prestin
search Network Publications. (Zheng et al., 2002) that are embedded in the lateral
Henderson, D., Salvi, R. J., Quaranta, A., McFadden, S. L., membrane of the outer hair cell. Our present under-
and Burkhard, R. F. (1999). Ototoxicity: Basic science and standing is that OAEs are generated as a by-product of
clinical applications. Annals of the New York Academy of these electromotile vibrations of outer hair cells (Brow-
Sciences, 884. nell, 1990).
Kryter, K. D. (1994). The handbook of hearing and the e¤ects
of noise. San Diego, CA: Academic Press.
As the initial basic studies on OAEs were ongoing,
Kujawa, S. G., and Liberman, M. C. (1999). Long-term sound the significant benefits of OAEs as a clinical test were
conditioning enhances cochlear sensitivity. Journal of Neu- being recognized. Thus, early on, four major applica-
rophysiology, 82, 863–873. tions of OAE testing in clinical settings became appar-
Kujawa, S. G., and Liberman, M. C. (2001). E¤ects of olivo- ent: the di¤erential diagnosis of hearing loss, hearing
cochlear feedback on distortion product otoacoustic emis- screening in di‰cult to test patients, serial monitoring of
sions in guinea pig. Journal of the Association for Research progressive hearing impairment conditions, and deter-
in Otolaryngology, 2, 268–278. mining the legitimacy of medicolegal claims involving
Liberman, M. C., and Gao, W. Y. (1995). Chronic cochlear de- compensatory payments for hearing loss. The rationale
e¤erentiation and susceptibility to permanent acoustic infor using OAEs in each of these major applications is
jury. Hearing Research, 90, 158–168.
Ohlemiller, K. K., McFadden, S. L., Ding, D. L., Reaume,
based on a significant beneficial feature of the measure,
A. G., Ho¤man, E. K., Scott, R. W., Wright, J. S., et al. including its specificity for testing the functional status of
(1999). Targeted deletion of the cystolic Cu/Zn-superoxide outer hair cells, the most fragile sensory receptors for
dismutase gene (SOD1) increases susceptibility to noise- hearing. This attribute in particular makes OAEs an
induced hearing loss. Audiology Neuro-Otology, 4, 237– ideal measure for determining the sensory component of
246. a sensorineural hearing loss. In addition, mainly because
the OAE is an objective response that is noninvasively

measured from the outer ear canal and thus can be
rapidly obtained, it is an ideal screening test for identi-
fying hearing impairment in newborns. Finally, because
OAEs are stable and reliably measured over long time
intervals, they are excellent for monitoring pathological
changes in cochlear function, particularly in individuals
regularly exposed to ototoxic drugs or excessive sounds.
One relatively new application of OAEs over the past
decade has been the use of emissions to measure the
intactness of the entire ascending and descending audi-
tory pathway (Collet et al., 1990). This capability is
based on the knowledge that the suppressive e¤ects of
cochlear e¤erents mainly a¤ect outer hair cell activity,
since these sensory cells are the primary targets of the
descending auditory system. Indeed, recent research in-
dicates that the susceptibility of the ear to the harmful
e¤ects of, for example, intense noise is likely determined
by the amount of indigenous e¤erent activity (Luebke,
Foster, and Stagner, 2002). That is, the more robust the
e¤erent activity, the more resistant the ear is to the
damaging e¤ects of loud sounds, and vice versa.
Of the two general classes of OAEs, SOAEs have not
been as clinically useful as the evoked OAEs, for several
reasons. Their prevalence in only about 50% of normal-
hearing individuals and the individually based unique-
ness of their frequencies and levels make it di‰cult to
develop SOAEs into a standardized test. However,
SOAEs have been linked to tinnitus in a subset of tin-
nitus patients with near-normal hearing (Penner, 1992).
In patients with SOAE-induced tinnitus, suppressing
the associated SOAEs eliminates the annoying tinnitus.
Interestingly, and contrary to expectations, since exces-
sive aspirin produces tinnitus in normal-hearing individ-
uals, high-dose aspirin su‰ces as a palliative in persons
with SOAE-induced tinnitus (Penner and Coles, 1992).
Concerning the three subclasses of evoked OAEs,
only TEOAEs and DPOAEs have proved to be clinically
useful. SFOAEs can be reliably measured only using ex-
pensive phase-tracking devices, since the emission must
be extracted from the ear canal sound at a time that the
eliciting stimulus is present at the identical frequency.
Figure 1. Audiometric and evoked OAE findings in a 37-year- Fortunately, the SFOAE is essentially the long-lasting
old man who had received intravenous infusions of cisplatin for version of the TEOAE, which is more straightforward to
testicular carcinoma. A, Pure-tone clinical audiogram for left measure and interpret.
(solid circles) and right (open circles) ear showing normal to Within a decade after the discovery of TEOAEs,
near-normal hearing thresholds from 250 Hz to 4 kHz, after
which hearing levels fell to 35–65 dB HL at 6 and 8 kHz. B, A
commercial equipment based on procedures used for
TEOAE spectrum for the left ear showing relatively normal evoking auditory brainstem responses was available
click-evoked emissions up to about 3.5 kHz. Note the progres- (Bray and Kemp, 1987). Figure 1 shows the results of
sive decrement in TEOAE levels above 1.5 kHz, which is a pure-tone audiometry (A) and tests of click-evoked
pattern typically observed for normal-hearing adults. The ‘‘re-
pro by frequency’’ values indicate excellent test-retest results
over the short recording session for frequencies up to 3 kHz. C,
DP-gram showing DPOAE levels for left (solid circles) and counterpart distribution of noise-floor values for the same
right (open circles) ears in response to moderate, equilevel pri- subjects. Note that the patient was exceptionally quiet, as his
mary tones, i.e., L1 ¼ L2 ¼ 65 dB SPL, from 800 Hz to 8 kHz, noise-floor curves (bold line ¼ left ear; stippled line ¼ right
at 10 points per octave. The bold dashed curves at the top of ear) tracked the lower distribution trajectory of the control
the plot represent the G1 SD of DPOAE values in response to population. In this example, the ototoxic drug caused outer
identical primaries for 100 ears from normal-hearing subjects; hair cell dysfunction for frequencies above about 2.5 kHz for
the bold dotted curves at the bottom of the plot indicate the the left ear and 4 kHz for the right ear.
Otoacoustic Emissions 513
TEOAEs (B) and DPOAEs (C ) in a 37-year-old patient which are often o¤set in level, the primary generation
receiving the ototoxic antitumor drug cisplatin. In this site is closer to f2 .
case, following a single infusion (Fig. 1A), a moderate As illustrated in Figure 1C, the patient’s emissions
high-frequency hearing loss was evident bilaterally for were relatively normal, as compared to the G1 SD dis-
frequencies over 4 kHz. The associated TEOAE spec- tribution of DPOAE levels for normal-hearing adults,
trum (Fig. 1B) illustrates the commonly measured prop- until about 3 kHz for the left ear (solid circles) and
erties for this emitted response, including its level, 4 kHz for the right ear (open circles). In this case, be-
frequency content and extent, and reliability, according cause DPOAEs are typically tested out to 8 kHz, they
to automatically computed reproducibility factors for detected the developing high-frequency hearing loss
five representative frequencies at 1, 2, 3, 4, and 5 kHz. associated with the ototoxic antitumor therapy.
Because the TEOAE is measured after the transient It is clear that applications of OAEs in the hearing
stimulus occurs, each ear produces a response that sciences and clinical audiology are varied. Without a
exhibits a unique spectral pattern. This idiosyncratic doubt, OAEs are useful experimentally for evaluating
property makes it di‰cult to develop a set of metrics and monitoring the status of cochlear function in animal
that describe the average TEOAE for normal-hearing models, and clinically in distinguishing cochlear from
individuals. Owing to this di‰culty in determining retrocochlear disorders. Moreover, their practical fea-
‘‘normal’’ TEOAEs in terms of frequencies and level tures make them helpful in the hearing screening of
values, they are most often described as being either newborns. Additionally, they have proved useful in
present or absent. Thus, one of the most popular uses of monitoring the e¤ects of agents such as ototoxins and
TEOAEs clinically is as a test for screening auditory loud sounds on cochlear function. In fact, there is accu-
function in newborns (Norton et al., 2000). mulating evidence that it is possible to detect such ad-
In the example of Figure 1B, representing the left ear verse e¤ects of drugs or noise on outer hair cell function
of the patient, even in the presence of a drug-induced using OAEs before a related hearing loss can be detected
high-frequency hearing loss, the TEOAE pattern ap- by pure-tone audiometry. In addition, OAEs provide a
pears fairly normal in that the click-elicited emission noninvasive means for assessing the integrity of the
typically falls o¤ for frequencies greater than 2 kHz, and cochlear e¤erent pathway. In general, OAEs supply
is seldom present at frequencies above 4 kHz in adult unique information about cochlear function in the pres-
ears. For newborns and older infants, the TEOAE is ence of hearing problems, and this capability makes
much more robust by about 10 dB and typically can be them ideal response measures in both the clinical and
measured out to about 6 kHz, indicating that smaller ear basic hearing sciences.
canals influence the acoustic characteristics of standard
click stimuli much di¤erently than do adult ears. Acknowledgments
Distortion product OAEs are elicited by presenting
two long-lasting pure tonebursts at f1 (lower frequency) This work was supported by grants from the Public
and f2 (higher frequency) simultaneously to the ear. The Health Service (DC 00613, DC03114).
frequencies and levels of the tonebursts or primary tones —Brenda L. Lonsbury-Martin
are important in that the largest DPOAEs are elicited by
f1 and f2 primaries that are within one-half octave of References
each other (i.e., f 2 =f 1 ¼ 1:22) with levels, L1 and L2 , that
are o¤set. For example, typical clinical protocols mea- Bray, P., and Kemp, D. T. (1987). An advanced cochlear echo
sure the 2f1 –f2 DPOAE, which is the largest DPOAE technique suitable for infant screening. British Journal of
in human ears, in response to primary-tone levels of Audiology, 21, 191–204.
L1 ¼ 65 and L2 ¼ 55 dB SPL (Gorga, Neely, and Dorn, Brownell, W. E. (1990). Outer hair cell electromotility and
otoacoustic emissions. Ear and Hearing, 11, 82–92.
1999).
Brownell, W. E., Bader, C. R., Bertrand, D., and Ribaupierre,
Figure 1C shows a DP-gram, i.e., DPOAE level as a Y. (1985). Evoked mechanical responses of isolated outer
function of test frequency, from about 800 Hz to 8 kHz, hair cells. Science, 227, 194–196.
in response to equilevel primary tones (L1 ¼ L2 ¼ 65 dB Collet, L., Kemp, D. T., Veuillet, E., Duclaux, R., Moulin, A.,
SPL). In this example, test frequency is represented by and Morgon, A. (1990). E¤ect of contralateral auditory
the geometric or logarithmic mean of f1 and f2 , although stimuli on active cochlear micro-mechanical properties in
it could also be represented by the f2 frequency. That is, human subjects. Hearing Research, 43, 251–261.
based on a combination of theoretical considerations, Gorga, M. P., Neely, S. T., and Dorn, P. A. (1999). Distortion
experimental studies, and observations of the generation product otoacoustic emission test performance for a priori
of DPOAEs in pathological ears, it is clear that these criteria and for multifrequency audiometric standards. Ear
and Hearing, 20, 345–362.
emissions are produced in the region of the primary Kemp, D. T. (1978). Stimulated acoustic emissions from within
tones. Based on further experimental work, it is likely the human auditory system. Journal of the Acoustical Soci-
that the DPOAE source is level-dependent, with the pri- ety of America, 64, 1386–1391.
mary generation site in response to higher level primaries Kemp, D. T. (1979a). Evidence of mechanical nonlinearity
of equal level (L1 ¼ L2 ) occurring around the geometric and frequency selective wave amplification in the cochlea.
mean frequency. In contrast, for lower level primaries, Archives of Otorhinolaryngology, 224, 37–45.
Kemp, D. T. (1979b). The evoked cochlear mechanical re- otoacoustic emissions. Journal of the Acoustical Society of
sponse and the auditory microstructure: Evidence for a new America, 111, 285–296.
element in cochlear mechanics. Scandinavian Audiology, 9, Hussain, D. M., Gorga, M. P., Neely, S. T., Keefe, D. H., and
35–47. Peters, J. (1998). Transient evoked otoacoustic emissions in
Luebke, A. E., Foster, P. K., and Stagner, B. B. (2002). A patients with normal hearing and in patients with hearing
multifrequency method for determining cochlear e¤erent loss. Ear and Hearing, 19, 434–449.
activity. Journal of the Association for Research in Otolar- Kim, D. O., Dorn, P. A., Neely, S. T., and Gorga, M. P.
yngology, 3, 16–25. (2001). Adaptation of distortion product otoacoustic emis-
Norton, S. J., Gorga, M. P., Widen, J. E., Folsom, R. C., sion in humans. Journal of the Association for Research in
Sininger, Y., Cone-Wesson, B., et al. (2000). Identification Otolaryngology, 2, 31–40.
of neonatal hearing impairment: A multicenter investiga- Knight, R. D., and Kemp, D. T. (2001). Wave and place fixed
tion. Ear and Hearing, 21, 348–356. DPOAE maps of the human ear. Journal of the Acoustical
Penner, M. J. (1992). Linking spontaneous otoacoustic emis- Society of America, 109, 1513–1525.
sions and tinnitus. British Journal of Audiology, 26, 115– Konrad-Martin, D., Neely, S. T., Keefe, D. H., Dorn, P. A.,
123. and Gorga, M. P. (2001). Sources of DPOAEs revealed by
Penner, M. J., and Coles, R. R. (1992). Indications for aspirin suppression experiments, inverse fast Fourier transforms,
as a palliative for tinnitus caused by SOAEs: A case study. and SFOAEs in normal ears. Journal of the Acoustical So-
British Journal of Audiology, 26, 91–96. ciety of America, 111, 1800–1809.
Zheng, J., Madison, L. D., Oliver, D., Fakler, B., and Dallos, Lalaki, P., Markou, K., Tsalighopoulos, M. G., and Daniilidis,
P. (2002). Prestin, the motor protein of outer hair cells. I. (2001). Transiently evoked otoacoustic emissions as a
Audiology and Neuro-Otology, 7, 9–12. prognostic indicator in idiopathic sudden hearing loss.
Zurek, P. M. (1985). Acoustic emissions from the ear: A sum- Scandinavian Audiology Supplement, 52, 141–145.
mary of results from humans and animals. Journal of the Lucertini, M., Moleti, A., and Sisto, R. (2002). On the detec-
Acoustical Society of America, 78, 340–344. tion of early cochlear damage by otoacoustic emission
analysis. Journal of the Acoustical Society of America, 111,
Further Readings 972–978.
Lukashkin, A. N., Lukashkina, V. A., and Russell, I. J. (2002).
Abdala, C. (2001). Maturation of the human cochlear ampli- One source for distortion product otoacoustic emissions
fier: Distortion product otoacoustic emission suppression generated by low- and high-level primaries. Journal of the
tuning curves recorded at low and high primary tone levels. Acoustical Society of America, 111, 2740–2748.
Journal of the Acoustical Society of America, 110, 1465– Martin, G. K., Jassir, D., Stagner, B. B., Whitehead, M. L.,
1476. and Lonsbury-Martin, B. L. (1998). Locus of generation for
Arnold, D. J., Lonsbury-Martin, B. L., and Martin, G. K. the 2f1 –f2 vs 2f2 –f1 distortion-product otoacoustic emissions
(1999). High-frequency hearing influences lower-frequency in normal-hearing humans revealed by suppression tuning,
acoustic distortion products. Archives of Otolaryngology, onset latencies, and amplitude correlations. Journal of the
125, 215–222. Acoustical Society of America, 103, 1957–1971.
Berlin, C. I., Hood, L. J., Hurley, A., and Wen, H. (1994). Owens, J. J., McCoy, M. J., Lonsbury-Martin, B. L., and
Contralateral suppression of otoacoustic emissions: An in- Martin, G. K. (1993). Otoacoustic emissions in children
dex of the function of the medial olivocochlear system. with normal ears, middle-ear dysfunction, and ventilating
Otolaryngology–Head and Neck Surgery, 110, 3–21. tubes. American Journal of Otology, 14, 34–40.
Di Girolamo, S., d’Ecclesia, A., Quaranta, N., Garozzo, A., Probst, R., Lonsbury-Martin, B. L., and Martin, G. K. (1991).
Evoli, A., and Paludetti, G. (2001). E¤ects of contralateral A review of otoacoustic emissions. Journal of the Acoustical
white noise stimulation on distortion product otoacoustic Society of America, 89, 2027–2067.
emissions in myasthenic patients. Hearing Research, 162, Shera, C. A., and Guinan, J. J., Jr. (1999). Evoked otoacoustic
80–84. emissions arise by two fundamentally di¤erent mechanisms:
Dorn, P. A., Piskorski, P., Gorga, M. P., Neely, S. T., and A taxonomy for mammalian OAEs. Journal of the Acousti-
Keefe, D. H. (1999). Predicting audiometric status from cal Society of America, 105, 782–798.
distortion product otoacoustic emissions using multivariate Stavroulaki, P., Apostolopoulos, N., Segas, J., Tsakanikos,
analysis. Ear and Hearing, 20, 149–163. M., and Adamopoulos, G. (2001). Evoked otoacoustic
Driesbach, L. E., and Siegel, J. H. (2001). Distortion-product emissions: An approach for monitoring cisplatin induced
otoacoustic emissions measured at high frequencies in ototoxicity in children. International Journal of Pediatric
humans. Journal of the Acoustical Society of America, 110, Otorhinolaryngology, 59, 47–57.
2456–2469. Sutton, L. A., Lonsbury-Martin, B. L., Martin, G. K., and
Engel-Yates, B., Zaaroura, S., Zlotogora, J., et al. (2002). Whitehead, M. L. (1994). Sensitivity of distortion-product
The e¤ects of a connexin 26 mutation—35delG—on oto- otoacoustic emissions in humans to tonal over-exposure:
acoustic emissions and brainstem auditory evoked poten- Time course of recovery and e¤ects of lowering L2 . Hearing
tials: Homozygotes and carriers. Hearing Research, 163, Research, 75, 161–174.
93–100. Telischi, F. F., Roth, J., Lonsbury-Martin, B. L., Balkany,
Fetterman, B. L. (2001). Distortion-product otoacoustic emis- T. J. (1995). Patterns of evoked otoacoustic emissions asso-
sions and cochlear microphonics: Relationships in patients ciated with acoustic neuromas. Laryngoscope, 105, 675–
with and without endolymphatic hydrops. Laryngoscope, 682.
111, 946–954. Whitehead, M. L., Stagner, B. B., Lonsbury-Martin, B. L., and
Howard, M. A., Stagner, B. B., Lonsbury-Martin, B. L., and Martin, G. K. (1994). Measurement of otoacoustic emis-
Martin, G. K. (2002). E¤ects of reversible noise exposure sions for hearing assessment. IEEE Medicine and Biology,
on the suppression tuning of rabbit distortion-product 13, 210–226.
Otoacoustic Emissions in Children 515
Otoacoustic Emissions in Children used to distinguish ears with hearing loss from those
with normal hearing. This dichotomous decision is made
whenever OAEs are used in screening programs, re-
Unidentified hearing loss in infants and young children gardless of the target population. The following discus-
can lead to delays in speech and language acquisition sion describes work in this area.
(Yoshinaga-Itano et al., 1998; Moeller, 2000). Identifi- The clinical value of OAE measurements was recog-
cation of hearing loss presents an additional challenge nized starting with their discovery (Kemp, 1978). Several
because these patients may be unable to provide volun- studies describe the accuracy with which OAEs identify
tary responses to sound. Otoacoustic emissions (OAEs) auditory status (e.g., Martin et al., 1990; Prieve et al.,
are an e¤ective means to identify hearing loss in young 1993; Gorga et al., 1993a, 1993b, 1996, 1997, 1999,
children because they are related to the integrity of the 2000; Glattke et al., 1995; Kim et al., 1996; Hussain et
peripheral auditory system and do not require voluntary al., 1998; Dorn et al., 1999; Harrison and Norton, 1999;
responses from the patient. Norton, Widen, et al., 2000). In general, both TEOAEs
OAEs are by-products of normal, nonlinear cochlear and DPOAEs identify auditory status with greater ac-
function, the source of which is the outer hair cell (OHC) curacy for middle and high frequencies than for lower
system. They may be evoked by single tones (stimulus frequencies. This occurs because noise levels decrease as
frequency, SFOAEs), pairs of tones (distortion prod- frequency increases during OAE measurements. The
uct, DPOAEs), or transient stimuli (transient evoked, noise interfering with OAE measurements (1) is acousti-
TEOAEs), and take from several seconds to several cal, (2) results mainly from patient breathing and/or
minutes to measure. Although all OAEs do not require a movement, and (3) contains mostly lower frequency en-
behavioral response, only TEOAEs and DPOAEs have ergy. Noise adds variability and reduces measurement
been used widely to identify hearing loss. Since OHC reliability. Thus, OAE test performance depends heavily
damage results in hearing loss, OAEs, which are gen- on the frequencies at which predictions about auditory
erated by OHCs, should be present when cochlear func- status are being made, in large part because noise level
tion is normal and reduced or absent when it is not. depends on frequency.
These facts have led to the application of OAE mea- Test performance also depends on stimulus level
surements in e¤orts to describe auditory function in (Whitehead et al., 1995; Stover et al., 1996; Harrison
humans, especially infants and young children. Un- and Norton, 1999). Moderate-level stimuli result in the
fortunately, OAE response properties from normal and fewest false positive and false negative errors. Lower or
impaired ears are not completely distinguishable; thus, higher stimulus levels decrease one of these error rates at
diagnostic errors are inevitable. Below, we provide the expense of increasing the rate of the other. This
brief descriptions of TEOAEs and DPOAEs in normal- occurs for simple reasons. If low stimulus levels are
hearing infants and young children, followed by a de- chosen, virtually every ear with hearing loss will fail
scription of these two OAEs in patients with hearing the test, resulting in a false negative rate of zero. How-
loss. Robinette and Glattke (2002) and Hall (2000) pro- ever, the number of ears with normal hearing not pro-
vide more background information and extensive refer- ducing responses will also increase as stimulus level
ence lists on OAEs. Norton, Gorga, et al. (2000) and decreases, increasing the false positive rate. If high-
Gorga, Norton, et al. (2000) provide comprehensive level stimuli are used, the vast majority of ears with
descriptions of OAEs in the perinatal period. normal hearing will produce responses, resulting in a
Infants and young children produce larger OAEs than low false positive rate. Unfortunately, some ears with
older children and adults (Prieve, Fitzgerald, and hearing loss, especially ears with mild or moderate
Schulte, 1997; Prieve, Fitzgerald, Schulte, and Kemp, losses, will produce a response to high-level stimulation,
1997; see Widen and O’Grady, 2002, for a review). increasing the false negative rate. Moderate-level stimuli
There are several explanations for this di¤erence. Very result in optimal combinations of false positive and
young children have not been exposed to environmental false negative rates. Thus, primary levels of 50–65 dB
factors that might result in OHC damage. Also, their SPL for DPOAE measurements or 80–85 dB pSPL
middle ears transmit energy to and from the cochlea for clicks during TEOAE measurements are recom-
di¤erently than adults (Keefe et al., 1993), which might mended.
alter OAE levels in the ear canal. In addition, infant ear Figure 1 shows representative examples of DPOAE
canal resonances show greater level at high frequencies, and TEOAE signal and noise levels for three hearing
compared to spectra measured in adult ear canals. If loss categories. In general, robust responses above the
stimuli di¤er in adult and infant ear canals, then re- noise floor are observed when hearing is normal (top
sponses may di¤er as well. Finally, the space between the row). When borderline normal hearing or mild hearing
measuring microphone and the eardrum is smaller in loss exists (middle row), the response is either reduced in
infants than in adults. If equivalent OAEs were generated level or absent. In cases of moderate or greater hearing
in the cochlea, that signal would be larger in the infant loss (bottom row), the response typically does not exceed
ear canal because it was recorded in a smaller space. the noise floor, even when the noise level is low. These
While di¤erences in infant and adult OAEs exist, the examples are consistent with general response patterns
larger question revolves around whether OAEs can be in these hearing loss categories, but it is important to
OAE test performance also depends on the audio-

metric criterion defining the border between normal
hearing and hearing loss. Both TEOAEs and DPOAEs
perform best when thresholds a20–30 dB HL are used
to define normal hearing. There are data suggesting that
TEOAEs are more sensitive than DPOAEs to mild
hearing loss (for a review, see Harris and Probst, 1997).
However, direct comparisons failed to reveal large dif-
ferences in test performance between TEOAEs and
DPOAEs when audiometric criterion was varied (Gorga
et al., 1993b; Norton, Widen, et al., 2000). Still, some
di¤erences across frequency have been observed.
TEOAEs tend to perform better at detecting hearing loss
for lower frequencies while DPOAEs tend to perform
better at detecting high-frequency hearing loss (Gorga et
al., 1993b; Kemp, 1997), because of how each measure-
ment is made.
During TEOAE measurements, a fast Fourier trans-
form (FFT) is performed on the ear canal waveform.
The status of the cochlear region associated with specific
frequencies is determined by examining the energy (or
signal-to-noise ratio) at those frequencies. For example,
one would conclude that the 1000-Hz region of the
cochlea is functioning if energy is observed in the FFT
at 1000 Hz. For DPOAE measurements, two tones are
presented simultaneously (f1 and f2 ), interact at the
cochlea place close to where f2 is represented, and pro-
Figure 1. OAE (circles) and noise (triangles) levels as a function duce distortion products (DP), the most prominent of
of frequency, with DPOAE and TEOAE data shown in the left which is the 2f1 –f2 DP. The level of this component is
and right columns, respectively. Note that the y-axis is not then measured to determine if the cochlea is functioning
the same for DPOAEs and TEOAEs. Representative examples at the point of its initial generation (f2 ). However, 2f1 –f2
are shown for normal hearing (top row), borderline normal
hearing/mild hearing loss (middle row), and moderate to severe
occurs at a frequency that is about one-half octave lower
hearing loss (bottom row). See Gorga et al. (1997) for an ex- than f2 . Thus, the measured response may occur in a re-
planation of the shaded areas in the DPOAE column. Al- gion in which noise floors are less favorable, thus reduc-
though Hussain et al. (1998) developed a similar grid for ing measurement reliability. As a consequence, DPOAEs
TEOAE data, those data were collected with a di¤erent para- are less accurate than TEOAEs for lower frequencies.
digm, and thus cannot be applied to the present set of data. During TEOAE measurements, the first 2.5 ms of the
ear canal waveform following stimulation usually is
zeroed to ensure that stimulus artifact does not contam-
inate the measured response. However, TEOAE energy
remember that both measurements will produce some generated in the high-frequency (basal) end of the
diagnostic errors. cochlea will return with the shortest latency. Zeroing the
One of these errors (false positives or false negatives) first 2.5 ms of the ear canal signal may remove some of
may be more important for certain clinical applications. the high-frequency cochlear response. DPOAEs are not
For example, infants or young children who are brought susceptible to this problem. Thus, they predict cochlear
to a speech and hearing clinic or an otolaryngology status better than TEOAEs do at higher frequencies.
clinic because of concern about hearing loss are at higher Implicit in the above discussion is that errors are in-
risk than the general population. In this case, one might evitable regardless of OAE measurement, stimulus level,
choose stimuli or criteria that provide higher sensitiv- OAE criterion value, or the definition of normal hearing.
ities, despite the higher false positive rates, because Both TEOAEs and DPOAEs will miss some ears with
missing a hearing loss is the greater concern. In contrast, hearing loss and/or will incorrectly label some ears with
one might choose stimuli or criteria that provide higher normal hearing as hearing impaired. In addition, OAEs
specificity, despite the lower sensitivity, when the target are not useful measurements of sensory function when
population includes only well babies without risk for middle ear dysfunction exists, which is frequently the
hearing loss. In this group, the probability of hearing case in children. Furthermore, OAEs will not identify
loss is so low that it may be more important to minimize patients with pathologies central to the OHCs, because
false positive errors. It is impossible to recommend a OAEs test only the OHC system. Since the majority of
single set of stimuli and/or criteria because individual hearing losses arise from OHC damage, however, OAEs
clinics must decide which error is more important for are well-suited to the task of determining auditory status
their needs. in infants and children.
Otoacoustic Emissions in Children 517
Acknowledgments Keefe, D. H., Bulen, J. C., Arehart, K. H., and Burns, E. M.

(1993). Ear-canal impedance and reflection coe‰cient in
Much of the authors’ work summarized in this chapter human infants and adults. Journal of the Acoustical Society
was supported by grants from the NIH (NIDCD R01 of America, 94, 2617–2638.
DC2251 and DC R10 1958). Kemp, D. T. (1978). Stimulated acoustic emissions from within
See also otoacoustic emissions. the human auditory system. Journal of the Acoustical Soci-
ety of America, 64, 1386–1391.
—Michael P. Gorga, Stephen T. Neely, and Kemp, D. T. (1997). Otoacoustic emissions in perspective. In
Judith E. Widen M. S. Robinette and T. J. Glattke (Eds.), Otoacoustic
emissions: Clinical applications. New York: Thieme.
References Kim, D. O., Paparello, J., Jung, M. D., Smurzynski, J., and
Sun, X. (1996). Distortion product otoacoustic emission test
Dorn, P. A., Piskorski, P., Gorga, M. P., Neely, S. T., and of sensorineural hearing loss: Performance regarding sensi-
Keefe, D. H. (1999). Predicting audiometric status from tivity, specificity, and receiver operating characteristics.
distortion product otoacoustic emissions using multivariate Acta Oto-Laryngologica (Stockholm), 116, 3–11.
analyses. Ear and Hearing, 20, 149–163. Martin, G. K., Ohlms, L. A., Franklin, D. J., Harris, F. P.,
Glattke, T. J., Pafitis, I. A., Cummiskey, C., and Herer, G. R. and Lonsbury-Martin, B. L. (1990). Distortion product
(1995). Identification of hearing loss in children and young emissions in humans III: Influence of sensorineural hearing
adults using measures of transient otoacoustic emission loss. Annals of Otology, Rhinology, and Laryngology, Sup-
reproducibility. American Journal of Audiology, 4, 71–87. plement, 147, 30–42.
Gorga, M. P., Neely, S. T., Bergman, B., Beauchaine, K. L., Moeller, M. P. (2000). Early intervention and language devel-
Kaminski, J. R., Peters, J., et al. (1993a). Otoacoustic opment in children who are deaf and hard of hearing. Pe-
emissions from normal-hearing and hearing-impaired sub- diatrics, 106, E43.
jects: Distortion product responses. Journal of the Acousti- Norton, S. J., Gorga, M. P., Widen, J. E., Vohr, B. R., Fol-
cal Society of America, 93, 2050–2060. som, R. C., Sininger, Y. S., et al. (2000). Identification of
Gorga, M. P., Neely, S. T., Bergman, N. M., Beauchaine, neonatal hearing impairment: Transient otoacoustic emis-
K. L., Kaminski, J. R., Peters, J., Schulte, L., and Jesteadt, sions during the perinatal period. Ear and Hearing, 21, 425–
W. (1993b). A comparison of transient-evoked and distor- 442.
tion product otoacoustic emissions in normal-hearing and Norton, S. J., Widen, J. E., Gorga, M. P., Folsom, R. C.,
hearing-impaired subjects. Journal of the Acoustical Society Sininger, Y. S., Cone-Wesson, B., et al. (2000). Iden-
of America, 94, 2639–2648. tification of neonatal hearing impairment: Evaluation of
Gorga, M. P., Neely, S. T., and Dorn, P. A. (1999). DPOAE TEOAE, DPOAE, and ABR test performance. Ear and
test performance for a priori criteria and for multifrequency Hearing, 21, 508–528.
audiometric standards. Ear and Hearing, 20, 345–362. Prieve, B. A., Fitzgerald, T. S., and Schulte, L. E. (1997). Basic
Gorga, M. P., Neely, S. T., Ohlrich, B., Hoover, B., Redner, J., characteristics of click-evoked otoacoustic emissions in
and Peters, J. (1997). From laboratory to clinic: A large infants and children. Journal of the Acoustical Society of
scale study of distortion product otoacoustic emissions in America, 102, 2860–2870.
ears with normal hearing and ears with hearing loss. Ear Prieve, B. A., Fitzgerald, T. S., Schulte, L. E., and Kemp, D. T.
and Hearing, 18, 440–455. (1997). Basic characteristics of distortion product otoa-
Gorga, M. P., Nelson, K., Davis, T., Dorn, P. A., and Neely, coustic emissions in infants and children. Journal of the
S. T. (2000). DPOAE test performance when both 2f1 –f2 Acoustical Society of America, 102, 2871–2879.
and 2f2 –f1 are used to predict auditory status. Journal of the Prieve, B. A., Gorga, M. P., Schmidt, A. R., Neely, S. T.,
Acoustical Society of America, 107, 2128–2135. Peters, J., Schulte, L., et al. (1993). Analysis of transient-
Gorga, M. P., Norton, S. J., Sininger, Y. S., Cone-Wesson, B., evoked otoacoustic emissions in normal-hearing and
Folsom, R. C., Vohr, B. R., et al. (2000). Identification of hearing-impaired ears. Journal of the Acoustical Society of
neonatal hearing impairment: Distortion product otoacous- America, 93, 3308–3319.
tic emissions during the perinatal period. Ear and Hearing, Robinette, M. S., and Glattke, T. J. (Eds.). (2002). Otoacoustic
21, 400–424. emissions: Clinical applications (2nd ed.). New York:
Gorga, M. P., Stover, L. T., and Neely, S. T. (1996). The use of Thieme.
cumulative distributions to determine critical values and Stover, L. J., Gorga, M. P., Neely, S. T., and Montoya, D.
levels of confidence for clinical distortion product otoa- (1996). Towards optimizing the clinical utility of distortion
coustic emission measurements. Journal of the Acoustical product otoacoustic emissions. Journal of the Acoustical
Society of America, 100, 968–977. Society of America, 100, 956–967.
Hall, J. W. (2000). Handbook of otoacoustic emissions. San Whitehead, M. L., McCoy, M. J., Lonsbury-Martin,
Diego, CA: Singular Publishing Group. B. L., and Martin, G. K. (1995). Dependence of distortion
Harris, F. P., and Probst, R. (1997). Otoacoustic emissions and product otoacoustic emissions on primary levels in normal
audiometric outcomes. In M. S. Robinette and T. J. Glattke and impaired ears: I. E¤ects of decreasing L2 below L1 .
(Eds.), Otoacoustic emissions: Clinical applications. New Journal of the Acoustical Society of America, 97, 2346–
York: Thieme. 2358.
Harrison, W. A., and Norton, S. J. (1999). Characteristics of Widen, J. E., and O’Grady, G. M. (2002). Evoked otoacoustic
transient evoked otoacoustic emissions in normal-hearing emissions in evaluating children. In M. S. Robinette and
and hearing-impaired children. Ear and Hearing, 20, 75–86. T. J. Glattke (Eds.), Otoacoustic emissions: Clinical appli-
Hussain, D., Gorga, M. P., Neely, S. T., Keefe, D. H., and cations (2nd ed., pp. 375–515). New York: Thieme.
Peters, J. (1998). Transient otoacaoustic emissions in pa- Yoshinaga-Itano, C., Sedey, A. L., Coulter, D. K., and Mehl,
tients with normal hearing and in patients with hearing loss. A. L. (1998). Language of early- and late-identified children
Ear and Hearing, 19, 434–449. with hearing loss. Pediatrics, 102, 1161–1171.
Ototoxic Medications penicillin family, the cephalosporin family, and the

macrolide family (except in situations where dosages are
very high). Here we will discuss a few of the more com-
Many medications are ototoxic, meaning that they ad- mon ototoxic antibiotics.
versely a¤ect inner ear function. The toxicity can be The aminoglycosides are a large family of antibiotics
divided into two broad categories, cochleotoxicity and that are uniformly ototoxic. Streptomycin, the first clin-
vestibulotoxicity. Cochleotoxic medications a¤ect hear- ically used aminoglycoside, is now used only in treating
ing function and typically manifest with tinnitus (an ab- tuberculosis, because many gram-negative bacteria are
normal noise in the ear), decreased hearing, or both. resistant and because of substantial ototoxicity. Dihy-
Most cochleotoxins a¤ect hearing at the highest fre- drostreptomycin is no longer used in the United States,
quencies first, reflecting damage to hair cells of the coch- but streptomycin sulfate can still be obtained. Strepto-
lea. Vestibulotoxic medications a¤ect the part of the mycin is primarily a vestibulotoxin.
inner ear that senses motion—the vestibular system. Neomycin, isolated in 1949, is now mainly used
Vestibulotoxicity usually manifests with dizziness, un- topically because of renal toxicity and cochleotoxicity.
steadiness, and when severe, oscillopsia. Oscillopsia Hearing ototoxicity from oral absorption of neomycin
denotes inability of a person to see when the head is has been reported (Rappaport et al., 1986) and there
moving, although visual acuity may be normal when the may also be toxicity from eardrops in patients with per-
head is still. The most common vestibulotoxins, the forated eardrums.
aminoglycoside antibiotics, primarily damage the ves- Kanamycin, developed in 1957, has been replaced by
tibular hair cells. Ototoxicity is often irreversible, as newer aminoglycosides such as gentamicin, tobramycin,
humans lack the ability to regenerate hair cells. netilmicin, and amikacin. It is not thought to be as oto-
Ototoxic medications can be broken down into sev- toxic as neomycin.
eral broad groups. Chemotherapeutic agents are often Gentamicin is presently the biggest problem antibiotic
cochleotoxic. Many antibiotics are ototoxic, and those with respect to ototoxicity, as most of the other ototoxic
in the aminoglycoside family are all ototoxic to some antibiotics have been replaced. Gentamicin was released
degree, some being primarily cochleotoxic and others for clinical use in the early 1960s (Matz, 1993). Netilmi-
primarily vestibulotoxic. Diuretics, mainly the loop cin has equivalent ototoxicity to gentamicin (Tange
diuretics, are often cochleotoxic. Similarly, quinine et al., 1995). Hearing toxicity generally involves the high
derivatives such as antimalarials are also commonly frequencies first, but it is rarely severe. Vestibulotoxicity,
cochleotoxic. Many common medications in the non- rather than hearing toxicity, is the major problem re-
steroidal anti-inflammatory group, such as aspirin, are sulting from gentamicin use. Certain persons with mito-
cochletoxic. chondrial deletions in the 12S subunit are much more
susceptible to gentamicin than the general population
Chemotherapy Agents (Fischel-Ghodsian et al., 1997). The prevalence of this
mutation is not clear, but 1% of the population is a rea-
Chemotherapeutic agents are drugs generally used to
sonable estimate, based on available data. Gentamicin
treat cancer. Actinomycin, bleomycin, cisplatin, carbo-
accumulates in the inner ear with repeated dosing, and
platin, nitrogen mustard, and vincristine have all been
for this reason, most cases of toxicity are associated with
reported to be ototoxic. Their ototoxicity is generally a
durations of administration of 2 weeks or more.
direct result of their toxicity to other cells. While coch-
Vancomycin, although not an aminoglycoside, does
leotoxic, these medications are rarely encountered as a
have minor ototoxicity. However, it is often combined
source of vestibular dysfunction.
with aminoglycosides, and in this situation it potentiates
Cisplatin is currently the most widely used anticancer
the ototoxicity of gentamicin (Brummett et al., 1990) as
drug, and unfortunately, it is cochleotoxic. The toxicity
well as (probably) other aminoglycosides such as tobra-
of cisplatin is synergistic with that of gentamicin (Riggs
mycin. Vancomycin by itself, in appropriate doses, is not
et al., 1996), and high doses of cisplatin have been
ototoxic (Gendeh et al., 1998). Occasional persons do
reported to cause total deafness. In animals, cisplatin
appear idiosyncratically to have substantial vestibular
ototoxicity is related to lipid peroxidation, and the use of
toxicity from vancomycin. The reason why occasional
antioxidant agents is protective (Rybak et al., 2000).
persons are more sensitive is not clear but might resem-
Some chemotherapy medications also have central
ble the situation with gentamicin, where there is a sus-
nervous system toxicity, which can be confused with
ceptibility mutation (Fischel-Ghodsian et al., 1997).
vestibulotoxicity.
Eardrops may contain antibiotics, some of which
Antibiotics can be ototoxic when administered to persons with per-
forated eardrums. Cortisporin otic solution appears to
A large number of antibiotics have been reported to be the most ototoxic to the cochlea of guinea pigs.
be ototoxic in certain circumstances, including eryth- Ofloxacin eardrops have negligible toxicity (Barlow
romycin, gentamicin, streptomycin, dihydrostrepto- et al., 1995). Neomycin-containing eardrops have been
micin, tobramycin, netilmicin, amikacin, neomycin, reported to contribute to hearing loss (Podoshin, Fradis,
kanamycin, etiomycin, vancomycin, and capreomycin. and Ben David, 1989) in a relatively small way, but a
Antibiotics generally considered safe are members of the definitive assessment of risk has not yet been made. The
Ototoxic Medications 519
vestibulotoxicity of eardrops has so far not been studied, Ménière’s syndrome will develop a hearing disturbance
although case reports suggest that gentamicin-containing from a small amount of an NSAID.
drops are toxic (Marais and Rutka, 1998; Bath et al.,
1999).
Compounding Factors
There are several known interactions between anti-
biotics as well as with other agents. Vancomycin com- Noise exposure is the most common source of hear-
bined with gentamicin causes more vestibulotoxicity ing loss. Industrial exposure characteristically causes a
than either one alone. Loop diuretics potentiate amino- ‘‘noise notch,’’ with the hearing loss at mid- to high fre-
glycoside toxicity. Noise also may potentiate amino- quencies bilaterally. Guns and other unilateral sources of
glycoside ototoxicity. noise can cause more circumscribed lesions. Noise can
Delayed ototoxicity, meaning essentially toxicity that be a cofactor in medication-induced ototoxicity. Those
continues for several months after the drug has been who have hearing loss from an ototoxic antibiotic, for
stopped, occurs because the aminoglycosides are re- example, may be at much greater risk from noise (Aran,
tained within the inner ear much longer than in the 1995).
blood. Gentamicin has been reported to persist for more
than 6 months in animals (Dulon et al., 1993). Neomy-
cin, streptomycin, and kanamycin are also known to be
Protection from Ototoxins
eliminated from the inner ear slowly (Thomas, Marion, Little is known about protection from ototoxicity. Anti-
and Hinojosa, 1992). oxidants protect partially from noise or toxins in several
animal models (Rybak, Whitworth, and Somani, 1999).
Ototoxic Diuretics In theory, prevention of reactive oxygen species, neu-
tralization of toxic products, and blockage of the apop-
Loop diuretics are well-known cochleotoxins. Examples tosis pathway might provide protection from oxidative
include furosemide and ethacrinic acid (Rybak, 1993). stress, which is a common final pathway for ototoxicity.
Diuretics generally considered safe include chlorthiazide. Toxic waste products can be neutralized with gluta-
Loop diuretics are rarely a source of vestibulotoxicity. thione and derivatives (Rybak et al., 2000). Apoptosis
They are possibly a source of hearing disturbance. They can be blocked using capsase inhibitors. At this writing,
may be synergistic with aminoglycoside ototoxins such all of these approaches are investigational and are not
as gentamicin, neomycin, streptomycin, and kanamycin. being used clinically. Most also require delivery systems
It seems prudent to attempt to avoid exposure to these that go directly into the inner ear, and are therefore
agents if hearing is impaired. impractical for clinical use. For cochleotoxicity, noise
avoidance is likely helpful, but even here the story is
Quinine Derivatives complicated. Paradoxically, moderate amounts of noise
Numerous quinine derivatives, including quinidex, at- may protect from extreme amounts of noise. Neverthe-
abrine, plaquenil, quinine sulfate, mefloquine (Lariam), less, it seems prudent to avoid excessive noise exposure,
and chloroquine, have reported ototoxicity (Jung et al., particularly in situations where there has been a recent
1993). The toxicity is primarily cochleotoxic and is exposure to an ototoxin. Because aminoglycosides may
generally confined to tinnitus, but it can cause a syn- persist in the inner ear for more than 6 months, practi-
drome that includes tinnitus, sensorineural hearing loss, cally this advice implies long-term noise avoidance.
and vertigo. Some quinine derivatives taken for malaria
prevention can occasionally cause significant and long- Treatment of Ototoxicity
lasting tinnitus. Recent studies suggest that quinine
impairs outer hair cell motility. Because most cochleotoxicity is caused by damage to
hair cells, and because once dead, hair cells do not re-
generate in humans, treatment of completed ototox-
Aspirin, NSAIDs, and Other Analgesics icity, whether it be cochlear or vestibular, is limited to
Aspirin and other NSAIDs are commonly used, and substitution of other inputs, procedures that recalibrate
apparently only toxic to hearing (Jung et al., 1993). remaining function, and behavioral adaptations. For
These include ibuprofen, naproxen, piroxicam, diflu- cochleotoxicity, the approach is largely amplification.
nisal, indomethacin, etodolac, nabumetone, ketorolac Hearing aids and related devices (assistive devices such
tromethamine, diclofenac sodium, and the salicylates, as telephone amplifiers) can be helpful in those whose
aspirin and salsalate. hearing loss is subtotal. When hearing loss is complete,
Rarely, hearing loss is reported from other types of cochlear implants may be o¤ered.
analgesics, for example, hydrocodone/acetaminophen For vestibulotoxicity, a rehabilitation approach is
combination (Friedman et al., 2000; Oh, Ishiyama, and often very helpful. The goal of vestibular rehabilitation
Baloh, 2000). is to reduce symptoms of dizziness, oscillopsia, and un-
Permanent hearing disturbances are possible but rare. steadiness. Patients are instructed in and perform a daily
They are most commonly seen in individuals who exercise routine designed to recalibrate remaining ves-
take aspirin in large doses for long periods, such as for tibular input and to substitute other senses such as vision
the treatment of arthritis. Occasionally persons with and neck proprioception.
With reduced vestibular function, the eye movement Oh, A. K., Ishiyama, A., and Baloh, R. W. (2000). Deafness
generated for a given head movement is too small, associated with abuse of hydrocodone/acetaminophen.
resulting in oscillopsia or blurred vision. To train the Neurology, 54, 23–45.
brain to generate an eye movement of equal amplitude Podoshin, L., Fradis, M., and Ben David, J. (1989). Ototox-
icity of ear drops in patients su¤ering from chronic otitis
to the head movement, gaze stabilization exercises are
media. Journal of Laryngology and Otology, 103, 46–50.
performed. Exercises consist of focusing on an object Rappaport, B. Z., Fausti, S. A., Schechter, M. A., and Frey,
with continuous movements of the head for 1–2 minutes. R. H. (1986). A prospective study of high-frequency audi-
Exercises can be made more di‰cult by increasing the tory function in patients receiving oral neomycin. Scandi-
complexity of the visual background behind the object of navian Audiology, 15, 67–71.
regard. The complexity of the exercises is progressed Riggs, L. C., Brummett, R. E., Guitjens, S. K., and Matz, G. J.
gradually as symptoms resolve. (1996). Ototoxicity resulting from combined administra-
For the balance deficits that are common in vestibular tion of cisplatin and gentamicin. Laryngoscope, 106, 401–
ototoxicity, patients are given static and dynamic exer- 406.
cises that require the control of balance with reduced Rybak, L. P. (1993). Ototoxicity of loop diuretics. Otolar-
yngolic Clinics of North America, 26, 829–844.
sensory input, conflicting sensory input, reduced base of
Rybak, L. P., Husain, K., Morris, C., Whitworth, C., and
support, and during head movements. To help adapt to Somani, S. (2000). E¤ect of protective agents against cis-
their vestibular loss, patients are educated in environ- platin ototoxicity. American Journal of Otology, 21, 513–
mental and behavioral modifications to reduce the risk 520.
of falls. Rybak, L. P., Whitworth, C., and Somani, S. (1999). Applica-
tion of antioxidants and other agents to prevent cisplatin
—Timothy C. Hain and Janet Helminski ototoxicity. Laryngoscope, 109, 1740–1744.
Tange, R. A., Dreschler, W. A., Prins, J. M., Buller, H. R.,
References Kuijper, E. J., and Speelman, P. (1995). Ototoxicity and
nephrotoxicity of gentamicin vs netilmicin in patients with
Aran, J. M. (1995). Current perspectives on inner ear serious infections: A randomized clinical trial. Clinical Oto-
toxicity. Otolaryngology–Head and Neck Surgery, 112, laryngology, 20, 118–123.
133–144. Thomas, J., Marion, M. S., and Hinojosa, R. (1992). Neomy-
Barlow, D. W., Duckert, L. G., Kreig, C. S., and Gates, G. A. cin ototoxicity. American Journal of Otolaryngology, 13,
(1995). Ototoxicity of topical otomicrobial agents. Acta 54–55.
Oto-Laryngolica, 115, 231–235.
Bath, A. P., Walsh, R. M., Bance, M. L., and Rutka, J. A.
(1999). Ototoxicity of topical gentamicin preparations.
Laryngoscope, 109, 1088–1093. Pediatric Audiology: The Test Battery
Brummett, R. E., Fox, K. E., Jacobs, F., Kempton, J. B.,
Stokes, Z., and Richmond, A. B. (1990). Augmented gen- Approach
tamicin ototoxicity induced by vancomycin in guinea pigs.
Archives of Otolaryngology–Head and Neck Surgery, 116,
61–64. The auditory mechanism is a complex sensory system
Dulon, D., Hiel, H., Aurousseau, C., Erre, J. P., and Aran, and as such requires a wide selection of specific proce-
J. M. (1993). Pharmacokinetics of gentamicin in the sensory dures for assessing its functional integrity. In general,
hair cells of the organ of Corti: Rapid uptake and long term these procedures may be grouped according to the dif-
persistence. Comptes Rendus de l’Academie des Sciences, ferential information they supply about peripheral audi-
série III, Sciences de la Vie, 316, 682–687. tory disorders (i.e., external and middle ear, cochlea, and
Fischel-Ghodsian, N., Prezant, T. R., Chaltraw, W. E., Wendt, cranial nerve VIII), central auditory dysfunction (i.e.,
K. A., Nelson, R. A., Arnos, K. S., and Falk, R. E. (1997). neural pathways of the brainstem and auditory cortex),
Mitochondrial gene mutation is a significant predisposing and pseudohypacusis (i.e., hearing loss of nonorganic
factor in aminoglycoside ototoxicity. American Journal of
origin). The results of a battery of procedures contribute
Otolaryngology, 18, 173–178.
Friedman, R. A., House, J. W., Luxford, W. M., Gherini, S., information about the auditory processes that are nor-
and Mills, D. (2000). Profound hearing loss associated with mal as well as those that are abnormal.
hydrocodone/acetaminophen abuse. American Journal of In di¤erential audiologic assessment, the audiologist
Otology, 21, 188–191. seeks procedures that provide optimum information
Gendeh, B. S., Gibb, A. G., Aziz, N. S., Kong, N., and Zahir, about which levels of the auditory system are disordered.
Z. M. (1998). Vancomycin administration in continuous Patients can—and often do—have coexisting disorders
ambulatory peritoneal dialysis: The risk of ototoxicity. at several levels, with the most dominant problem
Otolaryngology–Head and Neck Surgery, 118, 551–558. masking clues to the presence of others. Because no sin-
Jung, T. T., Rhee, C. K., Lee, C. S., Park, Y. S., and Choi, gle test can represent the integrity of the entire auditory
D. C. (1993). Ototoxicity of salicylate, nonsteroidal anti-
inflammatory drugs, and quinine. Otolaryngolic Clinics of
system, the best overall measure is obtained by com-
North America, 26, 791–810. bining test results, whereby each test within the test bat-
Marais, J., and Rutka, J. A. (1998). Ototoxicity and topical tery evaluates some aspect of the auditory mechanism.
eardrops. Clinical Otolaryngology, 23, 360–367. Jaeschke, Guyatt, and Sackett (1994) propose that useful
Matz, G. J. (1993). Aminoglycoside cochlear ototoxicity. Oto- diagnostic tests distinguish among disorders or states
laryngolic Clinics of North America, 26, 705–712. that might otherwise be confused, add information be-
Pediatric Audiology: The Test Battery Approach 521
yond that otherwise available, and lead to a change in sponsibility. Therefore, the general rule to apply in pe-
management that is beneficial to the patient. diatric assessment when selecting appropriate audiologic
Some tests are designed specifically to assist in identi- tests is not to administer a test unless its results pro-
fying the site of the lesion, while others are designed to vide new information for patient management. In
determine the presence and nature of an auditory deficit. fact, the real advantage of tests in a test battery comes
The diagnostic outcomes sought from the pediatric pop- when negative correlation is determined between tests,
ulation vary little from adult counterparts. That is, indicating that each test tends to identify di¤erent dis-
audiologic tests are selected to di¤erentiate peripheral orders.
versus central hearing loss, conductive versus sensori- Not only does the test battery delineate hearing loss,
neural hearing loss, cochlear versus neural site of lesion, it also provides opportunities for making appropriate
and varying middle ear conditions. The audiologist nei- cross-checks. The cross-check principle in audiology,
ther expects nor gets complete agreement on all the originally outlined by Jerger and Hayes (1976), under-
di¤erent tests performed. Age, physical and cognitive/ girds the concept of a test battery approach so that a
intellectual conditions, individual variability, and pecu- single test is not interpreted in isolation, but various tests
liarities of di¤erent audiologic conditions can result in act as a cross-check on the final outcome. The principle
paradoxical outcomes and may a¤ect the consistency of is that the results of a single test are never accepted as
audiologic findings. Thus, an extensive battery of audio- conclusive proof of the nature or site of auditory dis-
logic tests is intended to provide a profile of data that order without support from at least one additional in-
may be compared with findings obtained in individuals dependent test. That is, the error inherent in any test
with previously documented auditory conditions. Evi- and in patient response behavior is recognized, and the
dence for a specific interpretation exists when the profile probability of an incorrect diagnosis is minimized when
of results is consistent with expected findings. the results of several tests lead to the same conclusion.
The test battery approach in pediatric audiology is Moreover, the test battery approach and cross-check
focused on confirming suspected hearing loss in infants principle provide a statistical advantage when compared
referred from universal newborn hearing screening pro- with the utilization of but a single test. The multiplicity
grams and the ongoing assessment of infants at risk for of judgments in the test battery renders the entire di¤er-
delayed-onset or progressive hearing loss (Joint Com- ential assessment more reliable and valid. Statistically,
mittee on Infant Hearing, 2000). Refinements in audio- multiple judgments from nonduplicative, negatively cor-
logic tests (e.g., conditioned behavioral tests including related tests lend safety to the interpretation of raw data
visual reinforcement audiometry and conditioned play when compared to the outcome and potential for error
audiometry; acoustic immittance; auditory-evoked po- from a single judgment. To implement the cross-check
tentials), as well as the addition of new audiologic tests strategy successfully, clinicians need to recognize the
(e.g., otoacoustic emissions), provide the audiologist importance of selecting tests based on the child’s physi-
with a sophisticated test battery from which to initiate cal status, developmental level, and test correlation.
clinical decisions (Folsom and Diefendorf, 1999). A test battery is paramount when the clinician is
When individual tests are combined into a test bat- evaluating children with multiple disabilities. These chil-
tery, results can be viewed from a holistic framework. In dren exhibit diverse medical problems that can diminish
this approach, findings across tests are integrated to es- the accuracy of behavioral and physiological hearing
tablish a working diagnosis that often goes beyond the tests. Complicating factors may include but are not lim-
sum of the individual parts. The use of a test battery ited to severe neurological, motor, and sensory prob-
o¤ers several advantages, including (1) avoidance of lems. These factors can adversely influence test results,
overgeneralizing the results from a single test, (2) in- in turn compromising the validity of a single test ap-
creasing the data set from which to draw conclusions, proach. In addition, the limitations of the tests them-
and (3) enhancing the confidence in a clinical decision as selves can impose barriers when evaluating children with
the number of test results consistent with a specific in- complex problems. Test constraints (e.g., limitations of
terpretation increases. Conversely, combining tests into behavioral observation audiometry [BOA] in eliciting an
a battery may not be advantageous, cost-e¤ective, or observable response; impact of developmental age on
time-e‰cient when the tests are highly correlated (that is, visual reinforcement audiometry [VRA]; middle ear pa-
di¤erent tests testing for the same disorder). The more thology compromising acoustic reflex measures; the im-
positive the test correlation, the less performance varies pact of central nervous system damage on the auditory
when tests are combined. When tests have high to maxi- brainstem response [ABR]) frequently dictate what pro-
mum positive correlation, test battery performance can- cedures are feasible for a child with special needs. No
not be better than the best single test in the battery; thus, test or clinician is infallible, and mistakes made with
there is no value in combining tests just to satisfy the infants and young children can have crucial implications
faulty assumption that more tests are always better. for medical and educational management.
Each test must be selected on the basis of the patient’s Gans and Gans (1993) tested children with special
complaints, and associated with the highest hit rate and needs by a test battery made up of BOA, VRA, ABR,
lowest false alarm rate for the suspected disorder. and the acoustic reflex. The primary goal of the study
When selecting tests as part of a test battery, it is was to rule out bilateral hearing loss greater than a
essential to balance quality patient care with fiscal re- mild degree. Stringent criteria were established for ruling
out hearing loss with each of the tests to minimize the Jerger, J., and Hayes, D. (1976). The cross-check principle
chances of missing a child (false negative error) with a in pediatric audiometry. Archives of Otolaryngology, 102,
moderate hearing loss or greater. The tests were per- 614–620.
formed in a serial manner until one test result ruled in Joint Committee on Infant Hearing. (2000). Year 2000 posi-
tion statement: Principles and guidelines for early hearing
essentially normal hearing. Once achieved, further test-
detection and intervention programs. American Journal of
ing was discontinued. Audiology, 9, 9–29.
The results demonstrated that BOA passed approxi-
mately 35% of the children, VRA passed approximately Further Readings
10%, acoustic reflex measurement passed approximately
22%, and ABR passed approximately 57%. Yet when Bess, F. H. (Ed.). (1998). Children with Hearing Loss: Con-
conducted in a serial strategy (individual tests adminis- temporary Trends. Nashville, TN: Bill Wilkerson Center
tered until one ‘‘normal’’ result was obtained), 80% of Press.
the children under study were determined to have hear- Diefendorf, A. O., and Gravel, J. S. (1996). Behavioral obser-
ing better than the cuto¤ criterion. Although ABR alone vation and visual reinforcement audiometry. In S. Gerber
(Ed.), Handbook of Pediatric Audiology. Washington, DC:
was better at predicting hearing sensitivity than the other Gallaudet University Press.
tests, the total percentage score accomplished by a serial Stach, B. A., Wolf, S. J., and Bland, L. (1993). Otoacoustic
test battery was more than 20% greater than for ABR emissions as a cross-check in pediatric hearing assessment:
alone. Factor analysis failed to find a strong relationship Case report. Journal of the American Academy of Audiol-
among the tests and suggests that di¤erent factors ogy, 4, 392–398.
caused changes in threshold estimation across di¤erent Turner, R. G., Frazer, G. J., and Shepard, N. T. (1984). For-
children. That is, successful outcomes were based on mulating and evaluating audiological test protocols. Ear
interactions between the individual’s disabilities and the and Hearing, 5, 321–330.
individual tests. These factors included but were not Turner, R. G., and Nielson, D. W. (1984). Application of
clinical decision analysis to audiological tests. Ear and
limited to low chronological or developmental age (neu-
Hearing, 5, 125–133.
rological, motor, skeletal, and respiratory abnormal- Turner, R. G., Shepard, N. T., and Frazer, G. J. (1984). Clin-
ities), medications, and conductive hearing loss. Certain ical performance of audiological and related diagnostic
factors will adversely influence the results of one test tests. Ear and Hearing, 5, 187–194.
more than another. Therefore, reliance on a single test
for a child with disabling conditions would give an er-
roneous clinical impression that a large proportion of
these children sustain hearing loss. Physiological Bases of Hearing
An audiologst prone to using a single-test approach
might be tempted to rely on ABR as the only test
method. This approach certainly relies on the assump-
Outer and Middle Ears
tion that the test of choice is a valid hearing test for all The cartilaginous pinna on the outside of the skull has
individuals and is not susceptible to variables that could a set of characteristic folds and curves, di¤erent for
lead to errors in outcome. The important findings from each individual, which set up a series of shadowings and
the work of Gans and Gans (1993) provide evidence that reflections of the sound wave. The result is that the
challenges this assumption for children with special spectrum of the sound, as transmitted to the concha
needs, and provides a strong rationale for using a battery (the opening of the ear canal), is modified according to
of appropriately selected tests. the direction and elevation of the sound’s source. Al-
The selection of individual tests for use in a test bat- though the main cue for sound localization comes from
tery must be supported by clinical and experimental evi- comparing the relative intensities and times of arrival of
dence. If individual tests and their use in test batteries the stimuli at the two ears, that comparison does not
are not evidence based, are not cost-e¤ective in out- give us information on the elevation of the sound source,
comes, and do not positively impact patients, we dimin- or whether the source is behind or in front of the head;
ish the quality of services provided. that information is provided by the pinna. Moreover, the
outer ear means that sound localization of a sort can be
—Allan O. Diefendorf and Michael K. Wynne
undertaken with only one ear.
The middle and inner ears are protected from the
References outside world by an ear canal, and the inner ear is fur-
ther protected by a middle ear cavity. The closed cavity
Folsom, R. C., and Diefendorf, A. O. (1999). Physiologic and of the middle ear, however, is a common site for infec-
behavioral approaches to pediatric hearing assessment. Pe- tion, in which pus and secretions in early stages, and the
diatric Clinics of North America, 46, 107–120.
formation of fibrous tissue in later stages, reduce the ef-
Gans, D., and Gans, K. D. (1993). Development of a hearing
test protocol for profoundly involved multi-handicapped ficiency of transmission of vibrations to the cochlea.
children. Ear and Hearing, 14, 128–140. The relatively dense, incompressible cochlear fluids,
Jaeschke, R., Guyatt, G., and Sackett, D. L. (1994). Users’ enclosed in a bony canal, with their movement limited
guides to the medical literature. Journal of the American by the membranes of the inner ear, need a higher
Medical Association, 271, 389–391. pressure of vibration for a certain amplitude of move-
Physiological Bases of Hearing 523
Figure 1. A cross-section of the cochlear

duct showing the division into three
scalae and the position of the sensory
apparatus, the organ of Corti. (From
Fawcett, D. W. [1986]. A textbook of
histology. Philadelphia: Saunders, Fig.
35.11. Reproduced with permission.)
ment than do sound waves in air. One job of the outer The mechanical vibration, transmitted to the cochlear
and middle ears is to transform the ratio (pressure/ fluids, causes a ripple-like motion (i.e., a traveling wave)
amplitude) of vibration from a low value suitable for the in the membranes dividing the scalae and hence in the
external air to a much higher value able to drive the organ of Corti, causing deflection of the stereocilia or
cochlear fluids e‰ciently. This is undertaken by (1) hairs on the hair cells (see Robles and Ruggero, 2001,
acoustic resonances in the outer ear canal, (2) the foot- for a review). Cyclical deflection of the stereocilia opens
plate of the stapes in the oval window being much and closes the mechanotransducer channels, causing
smaller than the tympanic membrane (so that forces positive and negative potential changes within the hair
from the sound vibration are concentrated in a small cells. There are two types of hair cell (Fig. 2). In the
area), and (3) a lever action in the vibration both of the outer hair cells, operation of the mechanotransducer
middle ear bones and of the tympanic membrane. channels with the resulting changes in electrical potential
Vibration through the middle ear is a¤ected by the induce, in ways that are controversial, a mechanical
middle ear muscles, the tensor tympani and the stapedius response that enhances the initial mechanical vibra-
muscle, with contraction of the muscles reducing sound tion. The result is that the amplitude of the mechanical
transmission. These muscles contract in response to self- traveling wave grows exponentially as it travels along
produced activity such as vocalizations, but also in re- the cochlear duct away from its point of introduction.
sponse to loud sounds, to give some partial protection of Because the dimensions and sti¤ness of the cochlear duct
the inner ear against acoustic trauma. and membranes change along the duct, at some point
along the duct, the ratio of mass of fluid that has to be
moved by the introduced vibration to the sti¤ness of the
The Cochlea membranes that assist in the moving becomes too great
The cochlea performs a spectral analysis, sorting the in- to permit the vibration to continue for that particular
coming mechanical vibration into its di¤erent frequency frequency of stimulation, and the wave dies out rapidly.
components, and transduces the sound, turning the me- For a single tone, the traveling wave therefore has a
chanical vibration into an electrical change that activates peak which is sharp and narrow, so that maximal stim-
the fibers of the auditory nerve. ulation of hair cells occurs along only a short region of
The cochlea, deep inside the temporal bone, has a the cochlear duct. The peak occurs near the base (i.e.,
spiral central cavity, which curves the long (35-mm) oval window end) for high-frequency tones and near the
cochlear duct into a small space 10 mm across. The ca- apex for low-frequency tones; for a spectrally complex
nal is divided by membranous partitions into three stimulus, the spatial pattern of vibration reflects the
spaces, or scalae, which run the length of the canal spectrum of the incoming sound. This is known as place
(Fig. 1). coding of frequency. In addition, the time pattern of
Figure 2. Detail of the organ of Corti in

the cochlear duct showing the position
of the inner and outer hair cells and
cochlear innervation. The modiolus
(center of the cochlear spiral) is to the
left in the figure. IPC, inner phalangeal
cell. (Modified with permission from
Ryan, A. F., and Dallos, P. [1984].
Physiology of the cochlea. In J. L.
Northern [Ed.], Hearing disorders. Bos-
ton: Little, Brown, Fig. 22–4.)
vibration at any point reflects the time pattern of the The enhancement of spectral contrast depends on the
(spectrally filtered) acoustic stimulus. spatial pattern of activity in the auditory nerve, i.e., on
The outer hair cells are particularly vulnerable com- the place coding of sound frequency, which is then
ponents of the cochlea, being readily damaged by insults emphasized by neural lateral inhibition between adjacent
such as loud sound, anoxia, and many ototoxic drugs. cell groups in the central auditory system, starting at the
They are also particularly vulnerable to degenerative cochlear nucleus. Auditory neural responses therefore
changes. The result of such processes is that the traveling become dominated by representations of the most in-
wave is reduced in amplitude, and the sharpness of its tense spectral peaks (such as vowel formants in the case
peak is reduced. The inner hair cells are still able to de- of speech), while responses to a lower level or a more
tect the vibrations, but the cochlea loses sensitivity (i.e., spectrally uniform background stimuli are reduced. In
there is a hearing loss), and there is a degradation in its some way that is not understood, the two mechanisms of
ability to perform the spectral analysis. frequency representation, namely, place coding and time
coding, are neurally integrated into a single percept.
Processing in the Auditory Nervous System Temporal information in the sound waveform is also
emphasized; the enhancement of temporal transitions
The inner hair cells make synaptic connections with the in the neural responses means that the response to a
a¤erent fibers of the auditory nerve, so that when the fluctuating stimulus consists substantially of bursts of
inner hair cells are stimulated in parallel with the outer neural activity when the stimulus intensity increases, no
hair cells, the auditory nerve fibers are activated, and activity when the intensity decreases, and very low levels
both the spectral and temporal patterns of vibration are of activity during steady portions of the stimulus.
thereby signaled to the central nervous system. At low The spatial location of the sound source is primarily
frequencies, the time pattern of neural firing can follow analyzed by comparing the sounds arriving at the two
the time pattern of the vibration of the organ of Corti at ears. A stimulus on, say, the left will strike the left ear
the point of innervation. This is known as the temporal first, and will also be more intense in the left ear. Both of
coding of sound frequency. However, at higher frequen- these cues are extracted by the nervous system to give an
cies the fluctuations cannot follow each cycle of the indication of direction, with a neural representation in
waveform, and therefore the stimulus can be signaled the central nervous system such that neurons on one side
only by a change in the mean firing rate. At low fre- of the body are driven most strongly by sounds origi-
quencies, therefore (below about 300 Hz), both place nating on the opposite side. The time cues are extracted
coding and time coding can contribute; at high frequen- by comparing the time of arrival of the nerve impulses
cies (above a few kHz), only place coding is operative. from the two ears at the medial superior olivary nucleus
However, temporal fluctuations in the envelope of the in the brainstem, while intensity di¤erences are detected
sound waveform, if below a few hundred Hz, are still primarily in the lateral superior olivary nucleus. The
represented in the overall pattern of the firing. information as to the location of the sound source is
Processing in the auditory brainstem extracts and integrated in the inferior colliculus with spectral and
enhances three features of the auditory stimulus. Spec- temporal pattern information that had been enhanced at
tral contrast in the sound stimulus is enhanced, temporal the cochlear nucleus, before reaching the auditory cortex
transitions are emphasized, and information on the via the specific thalamic nucleus, the medial geniculate
sound locus is extracted. body.
Pitch Perception 525
Cortical Analysis neural mechanisms underlying the functions, which re-

main elusive.
It is likely that there is a division of function between
di¤erent parts of the auditory cortex, situated on the su- —James O. Pickles
perior surface of the temporal lobe, buried in the lateral
(or sylvian) fissure. The more dorsal areas are likely to References
represent location in auditory space, while the more
Binder, J. R., Frost, J. A., Hammeke, T. A., Cox, R. W., Rao,
ventral areas are involved in the analysis of complex S. M., and Prieto, T. (1997). Human brain language areas
stimuli, such as of speech (Rauschecker and Tian, 2000). identified by functional magnetic resonance imaging. Jour-
Recording from individual cortical neurons suggests that nal of Neuroscience, 17, 353–362.
speech sounds are likely to be analyzed and represented Rauschecker, J. P., and Tian, B. (2000). Mechanisms and
only over a whole population, that is, they are repre- streams for processing of ‘what’ and ‘where’ in auditory
sented as a pattern of activity that is spread over a large cortex. Proceedings of the National Academy of Sciences of
number of neurons, where consideration of activity of the United States of America, 97, 11800–11806.
the whole population is necessary for the accurate speci- Robles, L., and Ruggero, M. A. (2001). Mechanics of the
fication of the speech sound. Within the population, in- mammalian cochlea. Physiological Reviews, 81, 1305–
1352.
dividual neurons or neural assemblies may be specialized
Wang, X. (2000). On cortical coding of vocal communication
for the detection of critical features, such as spectral sounds in primates. Proceedings of the National Academy of
peaks or rapid temporal transitions, such as are neces- Sciences of the United States of America, 97, 11843–11849.
sary for the specification of the sound (Wang, 2000).
One area traditionally associated with speech is Wer- Further Readings
nicke’s area, which lies just posterior to the primary
cortical area in the dominant (generally left) hemisphere. Dallos, P., Popper, A. N., and Fay, R. R. (Eds.). (1996). The
Lesions of Wernicke’s area result in defects in compre- cochlea. New York: Springer-Verlag.
hension and word selection. Further forward, on the Pickles, J. O. (1988). An introduction to the physiology of hear-
ing. London: Academic Press.
ventral frontal lobe, lesions of Broca’s area result in Popper, A. N., and Fay, R. R. (Eds.). (1992). The mammalian
deficits in the production of speech. While the lesion auditory pathway: Neurophysiology. New York: Springer-
data show that these areas are critical, functional mag- Verlag.
netic resonance imaging shows that listening to speech
activates a much wider range of areas (Fig. 3), with
much more extensive surrounding temporal, angular,
and frontal areas likely to be involved in both linguistic Pitch Perception
and semantic analysis (Binder et al., 1997). Although
these imaging studies give information on the cortical The study of pitch dates back to at least the time of
areas activated and suggest the potential for conceptu- Pythagoras, who formulated the relationship between
ally splitting a task into its di¤erent functional com- the length of a string and the pitch it would produce if it
ponents, they do not provide any information on the were strummed. The perception of pitch is the basis of
musical melody and the voicing of speech; moreover,
pitch is an attribute of the sound created by many
objects in our world.
While the common definition of pitch has to do with a
subjective attribute of sound and is scaled from low to
high, pitch is closely related to frequency, a physical at-
tribute of sound. The other physical attributes of sound
are level, temporal structure, and complexity (Rossing,
1990). The study of pitch perception is often linked to
the ability of the auditory system to process the fre-
quency content of sound. The physical attributes of
sound are derived from the fact that sound occurs when
objects vibrate. The rate at which an object vibrates in
an oscillatory manner is the frequency of the sound. If
an object vibrates back and forth in a regular and
repeatable manner 440 times in 1 s, it is said to have a
frequency of 440 cycles per second (cps), which is indi-
Figure 3. Functional magnetic resonance image of the human
cated as 440 Hz (Hertz). If this vibrating object gen-
cortex during a speech analysis task. Areas that are heavily
activated are shown in white. The section is sagittal through
erated sound, the sound would have a 440-Hz frequency.
the left hemisphere. (Modified with permission from Binder, If the vibrating object were a guitar string, a musician
J. R., et al. [1997]. Human brain language areas identified by would perceive the vibrating string to have a pitch of
functional magnetic resonance imaging. Journal of Neuro- 440 Hz. A higher rate of vibration would produce a
science, 17, 353–362.) higher pitch and a lower rate a lower pitch. Thus, for the
vibrating guitar string, pitch is the subjective attribute of musical scale expressed either by musical notes or by
frequency. The relationship between the physical attrib- cents.
utes of sound and pitch is not as simple as the guitar- The other measure of pitch is the mel scale (Stevens
string example suggests, and a few of the complexities and Volkman, 1940). The mel scale is an attempt to
will be described later in this article. That is, frequency measure pitch on a scale from low to high, as implied
and pitch are not synonymous. by the definition of pitch given above. A sound with a
Frequency can be measured using physical or objec- tonal frequency of 1000 Hz has a pitch of 100 mels. A
tive means to determine the rate of vibration (Rossing, sound that is judged to have a pitch twice as high has a
1990). The measurement of pitch requires perceptual 200-mel pitch, while a pitch judged to be twice as low
measurement techniques found in the research toolbox has a 50-mel pitch. Thus, the 1000-Hz tone serves as a
of the psychoacoustician or the musician. In many con- referent against which the pitch of other sounds can be
texts, the pitch of a test sound is determined by com- compared. A mel scale relates the perceived pitch of a
paring its perceived pitch with that of a standard sound. sound in mels to another variable, most often frequency.
The standard sound is usually a sound with a very regu- So a sound with a pitch of 300 mels is one that is per-
lar vibratory pattern, such as a tone or a periodic train of ceived as having a pitch that is three times higher than
brief impulses (a click train). If the comparison sound that of a 1000-Hz tone.
and the test sound are judged to have the same perceived
pitch, then the pitch of the test sound is the vibratory Complex Pitch
frequency of the comparison sound. So, in the example
involving the guitar string, the vibrating string would be Sounds can have complex patterns of vibration and as
perceived as having the same pitch as a tone or a click such are made up of vibrations of many di¤erent fre-
train with a 440-Hz frequency. quencies. These complex sounds have a spectrum of
many di¤erent frequencies. If certain frequencies are
dominant in the sound’s spectrum, then the pitch of the
Musical Pitch and Pitch Scales
sound may be perceived as representing the frequency
In music, pitch can have two di¤erent meanings. When of the dominant component in the spectrum. So the
the frequency of the vibrating guitar string doubles (e.g., pitch of simple sounds with regular patterns of vibration
from 440 to 880 Hz), the frequency has increased by an or complex sounds with dominant frequencies in the
octave. Musical sounds that are an octave apart in fre- sound spectrum is determined by these main frequencies.
quency are perceived as similar, as opposed to a less However, we can consider a complex sound that is the
similar perception for sounds that are not separated by sum of the following frequencies: 300, 400, 500, 600, and
an octave. This similarity means that a melody played at 700 Hz. This complex sound will have a perceived pitch
one octave is very recognizable when it is played in other of 100 Hz, even though the sound contains no frequency
octaves. Thus, musicians will say that sounds that di¤er at 100 Hz. Note that this complex sound consists of fre-
by an octave have the same pitch, even though they have quency components that are spaced 100 Hz apart and
di¤erent frequencies and may be judged to match a are all integer multiplies (harmonics) of 100 Hz. The
standard tone of a di¤erent frequency. Thus, pitch missing 100-Hz component is the fundamental frequency
can have two meanings in music, one referring to of the complex, because it is the highest frequency
octave relationships and one to the actual frequency of for which all of the other components are integer multi-
vibration. Pitch chroma is sometimes used to refer to ples (de Boer, 1976). The 100-Hz perceived pitch is that
octave relationships involving pitch, while pitch height of this missing fundamental, and so this type of per-
is used to refer to pitch as determined by vibratory ceived pitch is often referred to as the ‘‘pitch of the
frequency. For example, a 440-Hz tone and an 880-Hz missing fundamental’’ or ‘‘complex pitch.’’ Although
tone have the same pitch chroma but di¤erent pitch this sound’s spectrum has no frequency at its pitch, the
heights. pattern of vibration oscillates at 100 Hz. Thus, it is pos-
The notes of the musical scale represent di¤erent ratio sible that the pitch is not related as much to the fre-
intervals within an octave. Thus, musical pitch can be quency content of the sound as it is to the rate of
measured in terms of musical notes (the 12 notes of A, B, oscillation. Or perhaps the 100-Hz spacing of the fre-
C, D, E, F, G and the half-tones, or the sharps and flats). quency components is the crucial dimension of the sound
The octave can be divided into 1200 equal logarithmic that determines the pitch of the missing fundamental. If
units (ratio units) call cents. The 12 musical notes divide the spectrum of this sound is changed to be 325, 425,
this 1200-cent octave into 12 equal units, so that each 525, 625, and 725 Hz, the perceived pitch is now 104 Hz,
note is 100 cents (100 cents is sometimes referred to a which is neither the missing fundamental nor equal to
semitone). Each note represents a di¤erent ratio from the the 100-Hz spacing of the frequency components. In ad-
beginning of the octave; e.g., the ratio between the notes dition, the sound does not have a dominant pattern of
C and G is one-fifth. The pitch of the 440-Hz vibrat- vibratory oscillations at 104 Hz (Patterson, 1973). Thus,
ing guitar string is the note A in the middle range of neither simple frequency nor temporal properties of a
the musical octaves, and the note C would have a pitch sound can be used to predict complex pitch.
of 528 Hz in this same octave, or 264 Hz in the next Complex pitches, such as the pitch of the missing
lower octave. Therefore, another measure of pitch is the fundamental, suggest that pitch is not simply related to
Presbyacusis 527
frequency. It is also the case that changes in other phys- Presbyacusis

ical attributes of sound, such as sound level, can lead to
a change in pitch, reinforcing the observation that pitch
and frequency are not synonymous. Various theories of Presbyacusis is a general term to describe hearing loss in
pitch processing have been proposed to account for the older persons or the hearing loss associated with aging.
pitches of simple and complex sounds. These theories Hearing loss observed in older adults is a result of the
fall into three general categories (Moore, 1997). One set combined e¤ects of aging, long-term exposure to occu-
of theories, spectral theories, propose di¤erent means of pational and nonoccupational noise, the use of ototoxic
using the sound’s frequency spectrum to determine pitch. drugs, diet, disease, and other factors. In this case, the
Another group of theories, temporal theories, suggest term presbyacusis describes any hearing loss observed in
that aspects of the temporal oscillation of the sound’s an older person, regardless of cause. Alternatively, pres-
vibratory pattern are used by the auditory system to de- byacusis may refer specifically to the hearing loss that
termine pitch. Then there are theories that use a combi- increases with chronological age and is related only to
nation of spectral and temporal aspects of a sound to age-related deterioration in the auditory periphery and
predict its pitch (de Boer, 1976). Such theories have been central nervous system (CNS).
proposed for nearly a hundred years, and no one theory Currently, about 75% of the 28 million hearing-
has emerged that best accounts for all of the data related impaired individuals in the United States are 55 years
to pitch perception (Plomp, 1976). of age or older; the number of hearing-impaired indi-
Thus, whereas pitch is a major attribute of sound and viduals will increase as the population ages. Indeed,
is crucial to our ability to use sound to identify the ob- presbyacusis is the most prevalent of the chronic con-
jects in our world and to communicate, auditory science ditions of aging among men age 65 years and older, and
does not have a good explanation of how pitch is pro- the fifth most prevalent condition among older women,
cessed by the auditory system. It is clear that the way in following arthritis, cardiovascular diseases, and visual
which the auditory system processes both spectral and impairments (National Center for Health Statistics,
temporal information contributes to pitch processing, 1986). Age-related hearing loss in the United States has
but the details of how these processes operate is still a been well characterized by epidemiologic surveys such as
mystery. the Framingham Heart Study (Moscicki et al., 1985), the
Baltimore Longitudinal Study of Aging (Pearson et al.,
—William A. Yost
1995), and the Epidemiology of Hearing Loss Study of
the adult residents of Beaver Dam, Wisconsin (Cruick-
References shanks et al., 1998). Figure 1 shows the systematic
increase in thresholds with chronological age in Fra-
de Boer, E. (1976). Residue and auditory pitch perception. In
W. Ne¤ and W. Keidel (Eds.), Handbook of sensory physi- mingham subjects; thresholds at high frequencies are
ology (vol. V). Berlin: Springer-Verlag. higher for male than for female subjects. In the Beaver
Moore, B. C. J. (1997). An introduction to the psychology of Dam study, the prevalence of hearing loss was 45.9%,
hearing (3rd ed.). London: Academic Press. with hearing loss defined as average thresholds (0.5–
Patterson, R. (1973). Physical variables determining residue 4 kHz) greater than 25 dB in the worse ear. This is con-
pitch. Journal of the Acoustical Society of America, 53, sistent with the prevalence reported in the Framingham
1566–1572. study of 42%–47%. The prevalence of hearing loss in the
Plomp, R. (1976). Aspects of tone sensation. London: Academic Beaver Dam study varied greatly with sex and age,
Press. ranging from 10.2% for women 48–52 years of age to
Rossing, T. (1990). The science of sound (2nd ed.). Reading,
MA: Addison-Wesley.
96.6% for men 80–92 years of age. Another set of data,
Stevens, S. S., and Volkman, J. (1940). The relation of pitch to including hearing levels as a function of age, sex, and
frequency: A revised scale. American Journal of Psychology, history of occupational noise exposure, is part of an in-
53, 329–353. ternational standard (ISO 1999, 1990). Database A from
this standard may more closely represent aging e¤ects
on hearing, given that subjects were screened for oc-
Further Readings cupational and other noise exposure history. Epidemio-
de Cheveigne, A. (2001). The auditory system as a ‘separation logic surveys also provide estimates of the genetic
machine.’ In D. J. Breebaart, A. J. M. Houstma, A. Kohl- component of presbyacusis. Heritability coe‰cients sug-
rausch, A. F. Prijs, and R. Schoonhoven (Eds.), Physiolog- gest that as much as 55% of the variance in thresholds in
ical and psychophysical bases of auditory function. The older persons is genetically determined, are stronger in
Netherlands: Shaker Publishing. women than in men, and are comparable to those for
Hartmann, W. M. (1998). Signal, sounds, and sensation. New hypertension and hyperlipidemia (Gates, Couropmitree,
York: Academic Press.
Moore, B. C. J. (1986). Frequency selectivity in hearing. Lon-
and Myers, 1999).
don: Academic Press. A remarkable and consistent age-related change in
Roederer, J. G. (1973). Introduction to the physics and psycho- hearing occurs at frequencies above 8 kHz and begins as
physics of music. New York: Springer-Verlag. early as age 20–30 years (Stelmachowicz et al., 1989).
Yost, W. A. (2000). Fundamentals of hearing: An introduction Figure 2 shows thresholds in the conventional and
(4th ed.). New York: Academic Press. extended high frequencies for younger and older adults,
Figure 2. Mean pure-tone thresholds at frequencies from

0.25 kHz to 18 kHz for three groups of subjects aged 60–79,
grouped by pure-tone average at 1, 2, and 4 kHz (normal, mild
to moderate, and severe), and one group of younger subjects
with normal hearing. (Adapted with permission from Mat-
thews, L. J., et al., 1997, ‘‘Extended high-frequency thresholds
in older adults.’’ Journal of Speech, Language, and Hearing
Research, 40, 208–214. 6 American Speech-Language-Hearing
Association.)
and is not accurately predicted from the pure-tone

audiogram.
Studies of auditory behavior in older adults must
separate age-related e¤ects from those attributable
Figure 1. Mean pure-tone thresholds (in dB HL) for the better simply to reduced audibility resulting from elevated
ear of male and female participants in the Framingham Heart thresholds. One experimental method to minimize the
Study. Age ranges of participants are given at the right. confound of reduced audibility is to include only older
(Adapted with permission from Moscicki, E. K., et al., 1985, subjects whose pure-tone thresholds are equal to those
‘‘Hearing loss in the elderly: An epidemiologic study of the of younger subjects. When changes in auditory behavior
Framingham Heart Study cohort.’’ Ear and Hearing, 6, 184– in older adults are observed that are not attributable
190.) to reduced audibility, they may be due to age-related
changes in the auditory periphery, which provides an
impoverished input to a normal auditory CNS, or to the
combined e¤ects of an aging periphery and an aging
grouped according to average thresholds from 1 to CNS. For many behavioral measures, it may not be
4 kHz (Matthews et al., 1997). Thresholds are sub- possible to di¤erentiate between these outcomes. This is
stantially elevated at frequencies above 8 kHz, even particularly the case for tasks that require comparisons
for individuals with nearly normal thresholds at lower of temporal information across intervals of time or that
frequencies. assess binaural processing.
To supplement pure-tone thresholds, the Hearing Other than e¤ects related to their hearing loss, older
Handicap for the Elderly instrument (Ventry and Wein- adults probably do not have increased problems in
stein, 1982), a self-report questionnaire, has been used speech understanding relative to younger individuals, as
to compare older individuals’ assessment of their com- measured conventionally (i.e., monaurally, under ear-
munication abilities with objective measures of hearing phones, with highly redundant signals). Indeed, some
and with threshold-based estimates of hearing handicap, studies suggest that 70%–95% of the variance in mon-
such as that recommended by the American Academy of aural speech recognition scores may be accounted for
Otolaryngology–Head and Neck Surgery (AAO, 1979; by the variance in speech audibility (Humes, Christo-
Matthews et al., 1990). Discrepancies between objective pherson, and Cokely, 1992). Figure 3 shows scores on
and subjective measures are common, and the variance several speech recognition tests for three age groups with
in pure-tone thresholds within hearing handicap catego- nearly identical (within @3 dB) mean thresholds from
ries is large. Thus, whereas hearing loss is among the 0.25 to 8 kHz (Dubno et al., 1997). With hearing loss
most prevalent chronic conditions of aging, the impact held constant across age group, speech recognition
of hearing loss on communication abilities and daily scores also remained constant. Nevertheless, age-related
activities of older adults varies greatly among individuals di¤erences in speech recognition in noise may become
Presbyacusis 529
humans and animals. Older gerbils raised in quiet have

elevated thresholds of the compound action potential
(CAP) of the auditory nerve and shallow slopes of CAP
input-output functions (Hellstrom and Schmiedt, 1990).
These characteristics are also reflected in higher auditory
brainstem response (ABR) thresholds and shallower
slopes of ABR amplitude-intensity functions relative to
young gerbils (Boettcher, Mills, and Norton, 1993).
Similar findings have been observed in older humans.
Although these potentials produced by short-duration
signals are reduced in amplitude with age, amplitudes of
potentials arising from higher CNS centers in response
to long-duration signals, such as steady-state potentials
and N100 –P200 , may be una¤ected or even increase with
age. Abnormal recovery from adaptation or forward
masking and abnormal gap detection at the level of the
brainstem have also been observed in older animals and
humans (Walton, Orlando, and Burkard, 1999). In aging
Figure 3. Mean scores in percent correct (G1 standard error) for
six measures of speech recognition. The six measures are word
gerbils, the 80–90 mV dc resting potential in the scala
recognition for NU-6 monosyllabic words (PB), maximum media of the cochlea, known as the endocochlear poten-
word recognition (PBmax ), maximum synthetic sentence identi- tial (EP), is reduced substantially. In contrast to these
fication (SSImax ), keyword recognition for high-context and changes, nonlinear phenomena remain relatively intact.
low-context sentences from the SPIN test (SPIN-PH and SPIN- For example, transient otoacoustic emissions, reflecting
PL), and ‘‘percent hearing for speech’’ (SPIN-HFS). (Adapted the functioning of outer hair cells, are present in about
with permission from Dubno, J. R., et al., 1997, ‘‘Age-related 90% of older humans with normal hearing, but ampli-
and gender-related changes in monaural speech recognition.’’ tudes are reduced in a manner that is not predictable by
Journal of Speech, Language, and Hearing Research, 40, 444– either age or pure-tone thresholds. In aging gerbils, dis-
452. 6 American Speech-Language-Hearing Association.) tortion product otoacoustic emissions are present and
robust, but somewhat reduced in amplitude (Boettcher,
Gratton, and Schmiedt, 1995). Two-tone rate sup-
apparent in more realistic listening environments, such pression is observed in older gerbils, with age-related
as in the sound field with spatially separated speech and threshold shifts (Schmiedt, Mills, and Adams, 1990),
maskers, with competing sounds that have temporal or although older humans may have reduced suppression
spectral dips, or on tasks that require divided or selective measured psychophysically (Dubno and Ahlstrom,
attention. It remains unclear if these age-related changes 2001).
may be attributed to an aging auditory periphery or to The pathologic anatomy underlying these physiologic
the combined e¤ects of an aging periphery and an aging changes was most extensively described through studies
CNS. of human temporal bones by Schuknecht (1974). Ini-
Although older adults are the largest group of hearing tially, four categories of presbyacusis were identified,
aid wearers, their satisfaction with hearing aids is low. including sensory (degeneration of sensory cells), neural
More than 75% of individuals who are likely to benefit (largely loss of spiral ganglion cells), metabolic (degen-
from a hearing aid do not own one, a gap of approxi- eration of the lateral wall and stria vascularis, with re-
mately 20 million people (LaPlante, Hendershot, and duction in the protein Na,K-ATPase), and mechanical
Moss, 1992). Similar results were observed for hearing- (aging of sound-conducting structures of the inner ear).
impaired participants of the Framingham and Beaver Categories of presbyacusis were revised by Schuknecht
Dam studies, in which less than 20% were hearing aid and Gacek (1993) wherein atrophy of the stria vascularis
users. In examining the functional health status and was designated as the ‘‘predominant lesion’’ of the aging
psychosocial well-being of older individuals, Bess et al. ear, neuronal loss was ‘‘constant and predictable,’’ me-
(1989) concluded that hearing loss is a primary determi- chanical loss remained theoretical, and sensory presbya-
nant of function and that its impact is comparable to cusis was the ‘‘least important type of loss.’’ These
that of other chronic conditions a¤ecting this popula- histopathological findings are consistent with the physi-
tion. Thus, untreated hearing loss can have a negative ological evidence described above, such as reduced CAP
e¤ect on quality of life beyond that due to poorer com- amplitudes and reduced EP but robust otoacoustic
munication abilities (Mulrow et al., 1990). The potential emissions. Thus, most age-related changes in hearing
benefit to communication and quality of life, together can be accounted for by changes observed in the audi-
with new fitting options and improved technology, sug- tory periphery. Nevertheless, there are many age-related
gests that older adults should be encouraged to use anatomical, neurochemical, and neurophysiological
amplification. changes in the CNS. One prominent neurochemical
Evidence of age-related changes in the auditory sys- change is a loss of gamma-aminobutyric acid, which
tem is revealed in the physiological properties of aging may a¤ect the balance of inhibitory and excitatory
neurotransmission. The e¤ects of these and other CNS Journal of Speech, Language, and Hearing Research, 40,
changes on age-related hearing loss may be substantial 208–214.
but remain largely unknown. Matthews, L. J., Lee, F. S., Mills, J. H., and Schum, D. J.
(1990). Audiometric and subjective assessment of hearing
handicap. Archives of Otolaryngology–Head and Neck Sur-
Acknowledgments gery, 116, 1325–1330.
Moscicki, E. K., Elkins, E. F., Baum, H. M., and McNamara,
This work was supported (in part) by grants P50 P. M. (1985). Hearing loss in the elderly: An epidemiologic
DC00422 and R01 DC00184 from the National Institute study of the Framingham Heart Study cohort. Ear and
on Deafness and Other Communication Disorders (Na- Hearing, 6, 184–190.
tional Institutes of Health). Mulrow, C. D., Aguilar, C., Endicott, J. E., Tuley, M. R.,
Velez, R., Charlip, W. S., et al. (1990). Quality of life
—Judy R. Dubno and John H. Mills changes and hearing impairment: Results of a randomized
trial. Annals of Internal Medicine, 113, 188–194.
References National Center for Health Statistics. (1986). Prevalence
of Selected Chronic Conditions, United States, 1979–1981.
American Academy of Otolaryngology. (1979). Guide for the Vital and health statistics, Series 10, No. 155 (OHHS Publi-
evaluation of hearing handicap. Otolaryngology–Head and cation No. [PHS] 86-1583). Public Health Service. Wash-
Neck Surgery, 87, 539–551. ington, DC: U.S. Government Printing O‰ce.
Bess, F. H., Lichtenstein, M. J., Logan, S. A., and Burger, Pearson, J. D., Morrell, C. H., Gordon-Salant, S., Brant, L. J.,
M. C. (1989). Comparing criteria of hearing impairment in Metter, E. J., Klein, L. L., and Fozard, J. L. (1995). Gender
the elderly: A functional approach. Journal of Speech and di¤erences in a longitudinal study of age-associated hearing
Hearing Research, 32, 795–802. loss. Journal of the Acoustical Society of America, 97, 1196–
Boettcher, F. A., Gratton, M. A., and Schmiedt, R. A. (1995). 1205.
E¤ects of age and noise on hearing. Occupational Medicine, Schmiedt, R. A., Mills, J. H., and Adams, J. (1990). Tuning
10, 577–591. and suppression in auditory nerve fibers of aged gerbils
Boettcher, F. A., Mills, J. H., and Norton, B. L. (1993). Age- raised in quiet or noise. Hearing Research, 45, 221–236.
related changes in auditory evoked potentials of gerbils: I. Schuknecht, H. F. (1974). Pathology of the ear. Cambridge,
Response amplitudes. Hearing Research, 71, 137–145. MA: Harvard University Press.
Cruickshanks, K. J., Wiley, T. L., Tweed, T. S., Klein, B. E., Schuknecht, H. F., and Gacek, M. R. (1993). Cochlear pa-
Klein, R., Mares-Perlman, J. A., et al. (1998). Prevalence of thology in presbycusis. Annals of Otology, Rhinology, and
hearing loss in older adults in Beaver Dam, Wisconsin: The Laryngology, 102, 1–16.
Epidemiology of Hearing Loss Study. American Journal of Stelmachowicz, P. G., Beauchaine, K. A., Kalberer, A., and
Epidemiology, 148, 879–886. Jesteadt, W. (1989). Normative thresholds in the 8- to 20-
Dubno, J. R., and Ahlstrom, J. B. (2001). Psychophysical sup- kHz range as a function of age. Journal of the Acoustical
pression measured with bandlimited noise extended below Society of America, 86, 1384–1391.
and/or above the signal: E¤ects of age and hearing loss. Ventry, I. M., and Weinstein, B. E. (1982). The Hearing
Journal of the Acoustical Society of America, 110, 1058– Handicap Inventory for the Elderly: A new tool. Ear and
1066. Hearing, 3, 128–134.
Dubno, J. R., Lee, F. S., Matthews, L. J., and Mills, J. H. Walton, J. P., Orlando, M., and Burkard, R. (1999). Auditory
(1997). Age-related and gender-related changes in monaural brainstem response forward-masking recovery functions in
speech recognition. Journal of Speech, Language, and older humans with normal hearing. Hearing Research, 127,
Hearing Research, 40, 444–452. 86–94.
Gates, G. A., Couropmitree, N. N., and Myers, R. H. (1999).
Genetic associations in age-related hearing thresholds. Further Readings
Archives of Otolaryngology–Head and Neck Surgery, 125,
654–659. Anderer, P., Semlitsch, H. V., and Saletu, B. (1996). Multi-
Hellstrom, L. I., and Schmiedt, R. A. (1990). Compound channel auditory event-related brain potentials: E¤ects of
action potential input/output functions in young and quiet- normal aging on the scalp distribution of N1, P2, N2 and
aged gerbils. Hearing Research, 50, 163–174. P300 latencies and amplitudes. Electroencephalography and
Humes, L. E., Christopherson, L., and Cokely, C. (1992). Clinical Neurophysiology, 99, 458–472.
Central auditory processing disorders in the elderly: Fact or Bredberg, G. (1968). Cellular pattern and nerve supply of the
fiction? In J. Katz, N. Stecker, and D. Henderson (Eds.), human organ of Corti. Acta Otolaryngology, Supplement,
Central auditory processing (pp. 141–149). St. Louis: 236, 1–135.
Mosby–Year Book. Caspary, D. M., Milbrandt, J. C., and Helfert, R. H. (1995).
International Organization for Standards: Acoustics. (1990). Central auditory aging: GABA changes in the inferior col-
Determination of occupational noise exposure and estimation liculus. Experimental Gerontology, 30, 349–360.
of noise-induced hearing impairment [ISO 1999]. Geneva: Committee on Hearing, Bioacoustics, and Biomechanics
International Organization for Standards. (CHABA). (1988). Speech understanding and aging. Jour-
LaPlante, M. P., Hendershot, G. E., and Moss, A. J. (1992). nal of the Acoustical Society of America, 83, 859–893.
Assistive technology devices and home accessability fea- Dubno, J. R., Dirks, D. D., and Morgan, D. E. (1984). E¤ects
tures: Prevalence, payment, need, and trends. Advance data of age and mild hearing loss on speech recognition in
from vital and health statistics, no. 217. Hyattsville, MD: noise. Journal of the Acoustical Society of America, 76, 87–
National Center for Health Statistics. 96.
Matthews, L. J., Lee, F. S., Mills, J. H., and Dubno, J. R. Fitzgibbons, P. J., and Gordon-Salant, S. (1995). Age e¤ects
(1997). Extended high-frequency thresholds in older adults. on duration discrimination with simple and complex
Pseudohypacusis 531
stimuli. Journal of the Acoustical Society of America, 98, variate analyses. Journal of the Acoustical Society of Amer-
3140–3145. ica, 88, 2611–2624.
Frisina, R. D. (2001). Anatomical and neurochemical bases of Weinstein, B. E. (2000). Geriatric audiology. New York:
presbycusis. In P. R. Ho¤ and C. V. Mobbs (Eds.), Func- Thieme.
tional neurobiology of aging (pp. 531–548). San Diego, CA: Willott, J. F. (1991). Aging and the auditory system: Anatomy,
Academic Press. physiology, and psychophysics. San Diego, CA: Singular
Gordon-Salant, S., and Fitzgibbons, P. J. (1993). Temporal Publishing Group.
factors and speech recognition performance in young and
elderly listeners. Journal of Speech and Hearing Research,
36, 1276–1285.
Gratton, M. A., and Schulte, B. A. (1995). Alterations in Pseudohypacusis
microvasculature are associated with atrophy of the stria
vascularis in quiet-aged gerbils. Hearing Research, 82, 44–
52. Pseudohypacusis means, literally, a false elevation of
Grose, J. H., Poth, E. A., and Peters, R. W. (1994). Masking thresholds. In pseudohypacusis, intratest and intertest
level di¤erences for tones and speech in elderly listeners with audiometric inconsistencies cannot be explained by
relatively normal audiograms. Journal of Speech and Hear-
ing Research, 37, 422–428. medical examinations or a known organic condition
He, N.-J., Dubno, J. R., and Mills, J. H. (1998). Frequency (Ventry and Chaiklin, 1965). Authors of literature in this
and intensity discrimination measured in a maximum- area have called this condition exaggerated hearing loss,
likelihood procedure from young and aged normal-hearing nonorganic hearing loss, or functional hearing loss.
subjects. Journal of the Acoustical Society of America, 103, Exaggerated hearing loss implies intent, but some forms
553–565. of pseudohypacusis may have subconscious origins
He, N.-J., Horwitz, A. R., Dubno, J. R., and Mills, J. H. (Wolf et al., 1993). The intent of the listener cannot be
(1999). Psychometric functions for gap detection in noise determined with audiometric measures. Nonorganic
measured from young and aged subjects. Journal of the hearing loss implies that there is no physical basis for the
hearing loss; however, many adults have a false elevation
Humes, L. E., and Christopherson, L. A. (1991). Speech iden-
tification di‰culties of hearing-impaired elderly persons: of thresholds added to an existing loss. Some even pre-
The contributions of auditory processing deficits. Journal of sent with pseudohypacusis and an ear-related medical
Speech and Hearing Research, 34, 686–693. problem requiring immediate attention (Qui et al., 1998).
Humes, L. E., Watson, B. U., Christensen, L. A., Cokely, Functional hearing loss is the only synonym among
C. G., Halling, D. C., and Lee, L. (1994). Factors asso- these terms.
ciated with individual di¤erences in clinical measures of Monetary or psychological gain motivates most
speech recognition among the elderly. Journal of Speech and pseudohypacusics. Audiologists should be alert for this
Hearing Research, 37, 465–474. condition if the referral source is a lawyer, as in a medi-
Jerger, J., and Chmiel, R. (1997). Factor analytic structure of colegal case, or an organization documenting hearing for
auditory impairment in elderly persons. Journal of the
compensation purposes; however, there are cases where
Mills, J. H., Schmiedt, R. A., and Kulish, L. F. (1990). Age- the referral source does not provide any warning. There
related changes in auditory potentials of Mongolian gerbil. are even cases of persons with normal auditory systems
Hearing Research, 46, 301–310. presenting with longstanding false losses that have been
Moore, B. C. J., and Peters, R. W. (1992). Pitch discrimination misdiagnosed and the individuals inappropriately fitted
and phase sensitivity in young and elderly subjects and its with hearing aids.
relationship to frequency selectivity. Journal of the Acousti- Estimates of the prevalence rate for pseudohypacusis
cal Society of America, 91, 2881–2893. are between 2% and 5%, with higher rates observed
Rosen, S., Bergman, M., Plester, D., El-Mofty, A., and Hamad in some special populations, such as the military and
Satti, M. (1962). Presbyacusis study of a relatively noise-free industrial workers (Rintelmann and Schwann, 1999).
population in the Sudan. Annals of Otology, Rhinology, and
Pseudohypacusics show falsely elevated thresholds in
Laryngology, 71, 727–743.
Schneider, B. A. (1997). Psychoacoustics and aging: Implica- one or both ears, the degree of loss ranges from mild to
tions for everyday listening. Journal of Speech-Language profound, and the type of loss can be sensorineural or
Pathology and Audiology, 21, 111–124. mixed (Qui et al., 1998).
Schulte, B. A., and Schmiedt, R. A. (1992). Lateral wall NaK- Pseudohypacusics usually adopt an internal loud-
ATPase and endocochlear potential decline with age in ness yardstick that corresponds to the amount of their
quiet-reared gerbils. Hearing Research, 61, 35–46. ‘‘hearing loss’’ (Vaubel, 1976; Gelfand and Silman,
Strouse, A., Ashmead, D. H., Ohde, R. N., and Grantham, 1985). External sounds are compared with this internal
D. W. (1998). Temporal processing in the aging auditory yardstick, and pseudohypacusics only respond behav-
system. Journal of the Acoustical Society of America, 104, iorally to sounds that exceed this internal value. This is
2385–2399.
important to know, because modifications to hearing
Takahashi, G. A., and Bacon, S. P. (1992). Modulation detec-
tion, modulation masking, and speech understanding in tests that a¤ect loudness perception often have little
noise in the elderly. Journal of Speech and Hearing Re- or no e¤ect on thresholds of audibility in cooperative
search, 35, 1410–1421. adults. Many behavioral tests for pseudohypacusis that
van Rooij, J. C., and Plomp, R. (1990). Auditive and cognitive are used by audiologists are designed to disrupt loudness
factors in speech perception by elderly listeners: II. Multi- judgments.
The responsibility of the audiologist in the assessment too. However, this finding in most instances is a result of
of persons presenting with psuedohypacusis is to docu- the loudness of speech and tones growing at di¤erent
ment intertest and intratest inconsistencies and to quan- rates. Consistent with loudness-related issues, this test is
tify true thresholds as a function of frequency. Many most e¤ective when spondee thresholds are measured
methods exist for documenting inconsistent results, but using an ascending approach (beginning at a low level)
few measures exist for quantifying accurately true be- and pure-tone thresholds are measured using a descend-
havioral thresholds. ing approach (beginning at a high level) (Schlauch et al.,
When evaluated for intertest and intratest incon- 1996). This procedure identified 100% of pseudohy-
sistencies, the basic battery of audiologic tests, adminis- pacusics, with no incorrect identifications of cooperative
tered to nearly everyone entering the clinic, are the ones test subjects with hearing loss. A more conventional
that will likely identify persons presenting with this con- procedure that measured pure-tone thresholds with an
dition, given that many pseudohypacusics have nothing ascending approach identified only about 60% of per-
in their history that might raise suspicion. The basic sons with pseudohypacusis.
battery of tests that is used in the assessment of pseudo- Immittance measures are also routine tests that can
hypacusis di¤ers somewhat across clinics, but this group aid in the documentation of persons presenting with
of tests often includes pure-tone thresholds, spondee pseudohypacusis. Tympanometry and acoustic reflex
thresholds, and immittance (see tympanometry). thresholds are sensitive measures of middle ear status
Pure-tone threshold assessment provides several and provide some indication of the integrity of the au-
methods for identifying pseudohypacusis. Most clini- ditory system up to the superior olivary complex. For
cians routinely retest the threshold for a 1000 Hz tone as severe sensorineural losses, acoustic reflex thresholds are
a reliability check. Thresholds on retest are usually generally 10 dB or more above a person’s behavioral
within 10 dB of the first test in cooperative persons, threshold (Gelfand, 1994). Thresholds obtained at lower
whereas pseudohypacusics often show larger threshold levels suggest pseudohypacusis. Gelfand (1994) has pub-
di¤erences (Ventry and Chaiklin, 1965). Although this lished normative values of reflex thresholds for di¤erent
method is not particularly sensitive or specific for the degrees of loss.
identification of pseudohypacusis, deviations greater Numerous special tests were developed for assessing
than 10 dB can provide a warning to the clinician. In pseudohypacusis during the period immediately follow-
addition to poor reliability, pseudohypacusics do not ing World War II. Many of the tests developed during
demonstrate false positive responses (a response when a this time are confrontational, time-consuming, and inef-
tone is not presented) (Ventry and Chaiklin, 1965). By fective when a clinical decision theory analysis is done.
contrast, several false positive responses in a single test- An exception to this criticism is a computer implemen-
ing session are quite common in persons with tinnitus tation of a simple modification to Bekesy audiometry or
(Mineau and Schlauch, 1997). Unfortunately, audio- automated audiometry (Chaiklin, 1990), which still has
metric configuration is not a reliable diagnostic tool for application when testing the hearing of large groups of
pseudohypacusis (Ventry and Chaiklin, 1965). However, persons with a limited number of testers.
the presence of a flat loss, or equal hearing loss at each The Stenger test is a special test that is quick to ad-
audiometric frequency, has been reported as common in minister and, unlike most tests, has the capability of
several studies (Coles and Mason, 1984; Alpin and quantifying the actual thresholds of persons presenting
Kane, 1985). The absence of shadow responses in asym- with unilateral pseudohypacusis (Kinstler, Phelan, and
metrical losses is a reliable sign of pseudohypacusis Lavender, 1972). This test makes use of the finding that
(Rintelmann and Schwann, 1999). Shadow responses are when the same sound is presented simultaneously to
thresholds based on the response of the nontest ear when both ears, the listener only hears the sound in the ear
sound is presented to the poorer, test ear. They reflect a with the loudest percept. This test can be performed
limitation in the ability to isolate the two ears during a with tones or speech, but to be e¤ective, the asymmetry
hearing test when masking noise is not presented to the between ears should be 40 dB or more. Manipulation
nontest ear. of the sound levels in each ear is e¤ective in identify-
Spondee thresholds, a speech threshold for two- ing psuedohypacusis and quantifying actual behavioral
syllable words with equal stress on both syllables, are a thresholds (Rintelmann and Schwann, 1999).
quick measure that, when combined with pure-tone Otoacoustic emissions (OAEs) (Musiek, Bornstein,
thresholds, provide one of the most e¤ective tests for and Rintelmann, 1995) and auditory-evoked potentials
identification of pseudohypacusis. Spondee thresholds in (Saunders and Lazenby, 1983; Bars et al., 1994; Musiek,
cooperative adults usually fall within 10 dB of the aver- Bornstein, and Rintelmann, 1995) are useful physiologi-
age threshold for 500 Hz and 1000 Hz pure tones (PTA) cal measures for evaluating persons with pseudohypa-
(Carhart and Porter, 1971). Pseudohypacusics usually cusis. These special tests assess structures in the auditory
show larger di¤erences, with the spondee threshold being pathways. OAEs assess the outer hair cells in the cochlea
lower (better) than the PTA (Carhart, 1952). In some and the conductive pathway leading to the cochlea; these
instances, this di¤erence may reflect the naı̈veté of the emissions are often absent even with mild hearing losses.
listener (Frank, 1976), as is often the case in children. In However, auditory neuropathy cases, although rare,
other words, the listener may feign a loss for tones and show that persons can have essentially normal OAEs
not understand that speech thresholds are quantifiable, and a severe hearing loss (Sininger et al., 1995). This
Pseudohypacusis 533
possibility and the fact that many adults exaggerate Gelfand, S. A., and Silman, S. (1985). Functional hearing loss
existing losses make interpretation of OAEs in isolation and its relationship to resolved hearing levels. Ear and
somewhat ambiguous when evaluating pseudohypacusis. Hearing, 6, 151–158.
Estimation of the auditory brainstem response (ABR) Glattke, T. J. (1993). Short-latency auditory evoked potentials.
Austin, TX: Pro-Ed.
threshold, a type of evoked potential, has been advo-
Gorga, M. P., Neely, S. T., Bergman, B. M., Beauchaine,
cated as a useful measure for compensation cases (Bars K. L., Kaminski, J. R., Peters, J., et al. (1993). A compari-
et al., 1994), but this threshold assessment tool, while son of transient-evoked and distortion product otoacoustic
accurate in many situations, can yield misleading re- emissions in normal-hearing and hearing-impaired subjects.
sults in certain hearing-loss configurations (e.g., a rising Journal of the Acoustical Society of America, 94, 2639–
audiogram) (Glattke, 1993; Bars et al., 1994). ABR 2648.
threshold, like OAEs, does not assess the entire auditory Hall, J. W., III. (1992). Handbook of auditory evoked responses.
pathway as do behavioral measures. OAEs (Gorga et al., Needham Heights, MA: Allyn and Bacon.
1993) and ABR (Hall, 1992) are also ine¤ective tools for Hyde, M., Alberti, P., Matsumoto, N., and Li, Y. L. (1986).
assessing low frequencies, a critical region for consider- Auditory evoked potentials in audiometric assessment of
compensation and medicolegal patients. Annals of Otology,
ation in compensation cases. Other evoked potentials,
Rhinology, and Laryngology, 95, 514–519.
such as middle latency responses and the slow cortical Kinstler, D. B., Phelan, J. G., and Lavender, R. W. (1972).
potential, may hold promise, but like the ABR test, they The Stenger and speech Stenger tests in functional hearing
are expensive to administer (Hyde et al., 1986; Musiek, loss. Audiology, 11, 187–193.
Bornstein, and Rintelmann, 1995). Mineau, S. M., and Schlauch, R. S. (1997). Threshold mea-
The ABR threshold and OAEs are important for the surement for patients with tinnitus: Pulsed or continuous
complete documentation of intractable cases or persons tones. American Journal of Audiology, 6, 52–56.
presenting with severe to profound bilateral losses, but Musiek, F. E., Bornstein, S. P., and Rintelmann, W. F. (1995).
most cases of pseudohypacusis are resolved by a combi- Transient evoked otoacoustic emissions and pseudohypa-
nation of readministering patient instructions, informing cusis. Journal of the American Academy of Audiology, 6,
293–301.
the patient that there are inconsistent responses, and
Qui, W. W., Stucker, F. J., Shengguang, Y. S., and Welsh,
making multiple measurements of the audiogram using L. W. (1998). Current evaluation of pseudohypacusis:
an ascending approach. The validity of remeasured pure- Strategies and classification. Annals of Otology, Rhinology,
tone thresholds is evaluated using the PTA-spondee and Laryngology, 107, 638–647.
threshold-screening test described earlier. Patients whose Rintelmann, W. F., and Schwann, S. A. (1999). Pseudohypa-
thresholds are not resolved using this approach are cusis. In F. E. Musiek and W. F. Rintelmann (Eds.), Con-
scheduled for additional testing. Persons with obvious temporary perspectives in hearing assessment (pp. 415–436).
psychological problems are referred for counseling. Needham Heights, MA: Allyn and Bacon.
See also clinical decision analysis; otoacoustic Saunders, J. W., and Lazenby, B. B. (1983). Auditory brain
emissions; pure-tone threshold assessment; supra- stem response measurement in the assessment of pseudohy-
pacusis. American Journal of Otology, 4, 292–299.
threshold speech recognition; tinnitus.
Schlauch, R. S., Arnce, K. D., Morison Olson, L., Sanchez, S.,
—Robert S. Schlauch and Doyle, T. N. (1996). Identification of pseudohypacusis
using speech recognition thresholds. Ear and Hearing, 17,
References 229–236.
Sininger, Y. S., Hood, L. J., Starr, A., Berlin, C. I., and Picton,
Alpin, D. Y., and Kane, J. M. (1985). Variables a¤ecting pure T. W. (1995). Hearing loss due to auditory neuropathy.
tone and speech audiometry in experimentally simulated Audiology Today, 7, 11–13.
hearing loss. British Journal of Audiology, 19, 219–228. Vaubel, A. S. W. (1976). A study of the strategies of malingerers
Bars, D. M., Altho¤, L. K., Krueger, W. W., and Olsson, J. E. during pure-tone and speech audiometry. Unpublished mas-
(1994). Work related, noise induced hearing loss: Evaluation ter’s thesis, University of Minnesota, Minneapolis.
including evoked potential audiometry. Otolaryngology– Ventry, I. M., and Chaiklin, J. B. (Eds.). (1965). Multi-
Head and Neck Surgery, 110, 177–184. discipline study of functional hearing loss. Journal of Audi-
Carhart, R. (1952). Speech audiometry in clinical evaluation. tory Research, 5, 179–272.
Acta Otolaryngologica, 41, 18–42. Wolf, M., Birger, M., Shosan, J. B., and Kronenberg, J.
Carhart, R., and Porter, L. S. (1971). Audiometric configura- (1993). Conversion deafness. Annals of Otology, Rhinology,
tion and prediction of threshold for spondees. Journal of and Laryngology, 102, 349–352.
Chaiklin, J. B. (1990). A descending LOT-Bekesy screening test Further Readings
for functional hearing loss. Journal of Speech and Hearing
Disorders, 55, 67–74. Altschuler, M. W. (1982). Qualitative indicators of non-
Coles, R. R., and Mason, S. M. (1984). The results of cortical organicity: Informal observations and evaluation. In M. B.
electric response audiometry in medico-legal investigations. Kramer and J. M. Armbruster (Eds.), Forensic audiology
British Journal of Audiology, 18, 71–78. (pp. 59–68). Baltimore: University Park Press.
Frank, T. (1976). Yes-no test for nonorganic hearing loss. Beagley, H. A. (1973). The role of electrophysiological tests
Archives of Otolaryngology, 102, 162–165. in the diagnosis of non-organic hearing loss. Audiology, 12,
Gelfand, S. A. (1994). Acoustic reflex threshold tenth per- 470–480.
centiles and functional hearing impairment. Journal of the Bekesy, G. V. (1947). A new audiometer. Acta Otolaryn-
American Academy of Audiology, 5, 10–16. gologica, 35, 411–422.
Bordley, J., and Hardy, W. (1949). A study of objective crimination. These tests provide information about a
audiometry with the use of a psychogalvanometric response. listener’s ability to recognize, discriminate, or under-
Annals of Otology, Rhinology, and Laryngology, 58, 751– stand acoustic signals, such as speech, and usually are
760. conducted at moderate or higher signal levels. Tests of
Conn, M., Ventry, I. M., and Woods, R. W. (1972). Pure-
word recognition are clinical examples of such tests (see
tone average and spondee threshold relationships in simu-
lated hearing loss. Journal of Auditory Research, 12, 234– suprathreshold speech recognition).
239. Hearing tests are performed for two primary pur-
Durrant, J. D., Kesterson, R. K., and Kamerer, D. B. (1997). poses. One purpose is to identify hearing problems that
Evaluation of the nonorganic hearing loss suspect. Ameri- may be caused by ear disease or damage to auditory
can Journal of Otology, 18, 361–367. structures. In some cases a hearing loss may indicate a
Gold, S. R., Hunsaker, D. H., and Haseman, E. M. (1991). medical problem, such as an ear infection. Because
Pseudohypacusis in a military population. Ear, Nose, and medical treatment of ear diseases is successful more
Throat Journal, 70, 710–712. often in early stages of the disease process, it is critical
Harris, D. A. (1958). A rapid and simple technique for the that such problems be detected and treated as soon as
detection of non-organic hearing loss. Archives of Oto-
possible. A second purpose for hearing tests is to obtain
laryngology, 68, 758–760.
Kvaerner, K. J., Engdahl, B., Aursnes, J., Arnesen, A. R., and information important for rehabilitation planning. In
Mair, I. W. S. (1996). Transient evoked otoacoustic emis- those cases of hearing loss for which medical treatment is
sions: Helpful tool in the detection of pseudohypacusis. not an appropriate alternative, it is important that non-
Scandinavian Audiology, 25, 173–177. medical rehabilitative measures be considered based on
Martin, F. N. (1994). Pseudohypacusis. In J. Katz (Ed.), the communication needs of the a¤ected individual. In-
Handbook of clinical audiology (pp. 553–567). Baltimore: formation obtained from the hearing evaluation, for ex-
Williams and Wilkins. ample, is required to make decisions about the need for
Martin, F. N., Armstrong, T. W., and Champlin, C. A. (1994). personal amplification (such as a hearing aid) or for
A survey of audiological practices in the United States. other auditory rehabilitation services.
American Journal of Audiology, 3, 20–26.
A basic requirement for administration of hearing
Martin, F. N., and Shipp, D. B. (1982). The e¤ects of sophis-
tication on three threshold tests for subjects with simulated tests is an acoustic system that enables control of the
hearing loss. Ear and Hearing, 3, 34–36. signals presented to the listener. An audiometer is an
McCandless, G. A., and Lentz, W. E. (1968). Evoked response electronic instrument used to present controlled acoustic
(EEG) audiometry in non-organic hearing loss. Archives of signals to a listener in order to test auditory function.
Otolaryngology, 87, 123–128. In conventional pure-tone audiometry, the audiometer
Peck, J. E., and Ross, M. (1970). A comparison of the ascend- provides for the presentation of tones ranging in fre-
ing and the descending modes for the administration of the quency from 125 through 8000 hertz (Hz). A hearing
pure-tone Stenger test. Journal of Auditory Research, 10, level (HL) dial allows the tester to control the level (in
218–222. decibels, or dB) of a tone being presented to the listener.
Sohmer, H., Feinmesser, M., Bauberger-Tell, L., and Edel-
The HL control is graduated in steps of 10 and 5 dB
stein, E. (1977). Cochlear, brain stem, and cortical evoked
responses in non-organic hearing loss. Annals of Otology, (and sometimes smaller) and typically is adjustable over
Rhinology, and Laryngology, 86, 227–234. a range of 120 dB. When the HL dial is set to 0 dB, the
Spraggs, P. D. R., Burton, M. J., and Graham, J. M. (1994). output level of the audiometer corresponds to an average
Nonorganic hearing loss in cochlear implant candidates. normal HL at that specific frequency. This is referred
American Journal of Audiology, 15, 652–657. to as audiometric zero (American National Standards
Stenger, P. (1907). Simulation and dissimulation of ear diseases Institute [ANSI], 1996). This instrumental convenience
and their identification. Deutsche Medizinische Wochen- accounts for the di¤erences in absolute hearing sensitiv-
schrifit, 33, 970–973. ity (in dB sound pressure level, or SPL) across frequency
Sulkowski, W., Sliwinska-Kowalska, M., Kowalska, S., and in persons with normal hearing. Other controls on the
Bazydlo-Golinska, G. (1994). Electric response audiometry
audiometer enable the tester to route the test signal to
and compensational noise-induced hearing loss. Otolaryn-
gologia Polska, 48, 370–374. various receivers or transducers used to present the tones
to a listener. The two most common receivers used in
pure-tone audiometry are a set of earphones and a bone
conduction vibrator. Earphones provide for airborne
Pure-Tone Threshold Assessment acoustic signals; a bone conduction vibrator is used to
transmit vibratory energy through the skull to the inner
ear (cochlea). Diagnostic audiometers also provide
Audiometry is the measurement of hearing. Clinical masking signals (typically a noise) for presentation to the
hearing tests are designed to evaluate two basic aspects nontest ear during specific audiometric tests. A masking
of audition: sensitivity and recognition (or discrimina- noise is necessary for bone conduction audiometry and
tion). Hearing sensitivity measures are estimates of the for cases of substantial unilateral hearing loss in which
lowest level at which a person can just detect the pres- the test signal may be intense enough to be heard in the
ence of a test signal (Ward, 1964). Measures that require nontest ear. To rule out this possibility, the masking
an identification response or judgments of sound di¤er- noise is introduced in the nontest ear. The noise elevates
ences are tests of auditory acuity, recognition, or dis- the threshold in that ear (masks it) and eliminates it from
Pure-Tone Threshold Assessment 535
Figure 1. Audiogram form used for

recording pure-tone thresholds.
Symbols shown to the right of
the audiogram are those recom-
mended by the American Speech-
Language-Hearing Association
(ASHA, 1990).
the test situation, to ensure that subject responses are olds). Symbols used to record thresholds on the audio-
only for the test ear. gram also may be color coded, with red indicating
Pure-tone audiometry consists of threshold measures measures for the right ear and blue indicating results for
for air and bone conduction signals in each ear (sepa- the left ear.
rately). A pure-tone threshold is the lowest level at which Specific procedures have been developed for pure-
a person can just detect the presence of a tone. The tone audiometry so that thresholds are obtained in a
threshold measure is statistical in nature; it is a level at manner that minimizes test variability and are repeatable
which a listener responds to a criterion percentage of the over time and from clinic to clinic. These procedures are
signals presented. Clinically, threshold is usually defined based on research findings for persons with normal
as the lowest signal level at which the listener just detects hearing and persons with hearing impairment, and in-
50% of the tones presented. Audiometric thresholds are clude specifications on test frequencies, duration of test
influenced by a number of variables, including the intones, step size changes in signal level, and other proce-
structions to the listener, the positioning of the earphone dural variables (ASHA, 1978, 1997). The basic test pro-
(or other transducer) on the head, and the psychophys- tocol involves initially familiarizing the listener with the
ical threshold measurement technique used (Dancer and tones that will be heard and then determining thresholds
Ventry, 1976; Yantis, 1994). In addition, individuals for the tones. A bracketing technique is used whereby
may demonstrate threshold variability related to factors the tone level at a specific frequency is varied up and
such as motivation, the nature of the ear disorder, the down and the listener indicates whether the tone is au-
patient’s ability to comply with the test situation, and dible at each level. The tester determines the average
ongoing physiological changes inherent to the auditory hearing level at which the tone was heard approximately
system (Wilber, 1999). half the time over a series of presentations. This level
Threshold measures obtained from pure-tone audio- represents the pure-tone threshold at a specific frequency
metry are conventionally plotted on a graph called an and is recorded on the audiogram. Thresholds are
audiogram (Fig. 1). The audiogram enables the tester to obtained separately for air conduction using earphones
quickly see the extent to which thresholds for a listener and for bone conduction using a bone conduction vi-
deviate from normal. In Figure 1, HL (dB) is plotted on brator. Air conduction tones produced by the earphone
the linear vertical axis and signal frequency (Hz) is indi- are directed down the ear canal, through the middle ear,
cated on the logarithmic horizontal axis. The format of and then to the cochlea. In bone conduction testing,
the audiogram is such that one octave along the fre- however, a vibrator is used to transmit the signal
quency axis corresponds in dimensional scale to 20 dB through the bones of the skull to the cochlea. The vi-
on the HL axis (ANSI, 1996). Recommended symbols brator is placed on the forehead or the mastoid portion
for audiograms are also included in the legend of Figure of the temporal bone (behind the pinna), and thresholds
1 (American Speech-Language-Hearing Association are measured for the desired audiometric frequencies.
[ASHA], 1990). Note that separate symbols are used to All testing is performed in a sound-treated room that
indicate bone conduction thresholds and measures made meets standards for the exclusion of ambient noise
with a masking noise in the nontest ear (masked thresh- (ANSI, 1999).
Results of pure-tone audiometry, recorded on the Table 1. Degree Classifications of Hearing Loss (Handicap)
audiogram, provide a description of both the degree and Pure-Tone Average (dB HL) for 500, Handicap
type of hearing loss. Because we are mainly interested in 1000, and 2000 Hz in the Better Ear Classification
a person’s ability to hear everyday speech, the customary
a25 Not significant
procedure for classifying degree of hearing loss involves
26–40 Slight
a computation of the pure-tone average (PTA). A per- 41–55 Mild
son’s PTA is her or his average pure-tone thresholds for 56–70 Marked
the speech frequencies of 500, 1000, and 2000 Hz. Table 71–90 Severe
1 provides a degree classification of hearing loss based >90 dB Extreme
on average pure-tone thresholds in the better ear for
Adapted with permission from Davis, H. (1978). Hearing
these frequencies. In considering the purpose and results handicap standards for hearing, and medicolegal rules. In H.
of audiometric tests, it is important to distinguish the Davis and S. R. Silverman (Eds), Hearing and deafness (4th
terms hearing loss (or hearing impairment) and hearing ed., pp. 266–290). New York: Holt, Rinehart, and Winston.
Figure 2. Example audiograms for conductive (2a), sensorineural (2b), and mixed (2c) types of hearing loss. Further description is
provided in the text.
Pure-Tone Threshold Assessment 537
handicap (or disability) (ASHA, 1997; ASHA/CED, significant sensorineural hearing loss, for example, can
1998). The communicative handicap associated with a still experience an ear infection or other pathology that
given amount of hearing loss in dB may di¤er consider- may result in a conductive component in addition to the
ably across individuals. sensorineural loss. Finally, it should be understood that
The classification of hearing loss according to type is many patients with significant disorders or pathologies
based on comparisons of thresholds for air and bone of the auditory system may have no significant hearing
conduction test tones. Conductive hearing losses are al- loss (in dB). Patients with disorders of the central audi-
most always due to abnormalities of the outer or middle tory nervous system, for example, may demonstrate
ear. In such disorders, an interruption or blockage of normal pure-tone thresholds. Normal hearing sensitivity
sound conduction to the cochlea accounts for the hear- does not always indicate a normal auditory system
ing loss. Wax in the ear canal or breaks in the ossicular (Wiley, 1988).
chain (middle ear bones) are examples of conditions that
restrict the flow of sound from the outer ear and middle —Terry L. Wiley
ear to the cochlea and result in a conductive hearing loss.
The primary audiometric sign of a conductive hearing References
loss is an air–bone gap. An air–bone gap is present
when hearing sensitivity by bone conduction is signifi- American Speech-Language-Hearing Association, Joint Com-
cantly better than by air conduction. This can occur in mittee of the American Speech-Language-Hearing Associa-
tion (ASHA) and the Council on Education of the Deaf
cases of outer and middle ear disorders because the path
(CED). (1998). Hearing loss: Terminology and classifi-
of sound transmission for bone conduction is primarily cation. Position statement and technical report. ASHA,
through the bones of the skull directly to the inner ear, 40(Suppl. 18): 22–23.
essentially bypassing the a¤ected outer and middle ear American National Standards Institute. (1996). American Na-
structures. An audiogram for a conductive hearing loss tional Standard Specification for Audiometers (S3.6). New
is shown in Figure 2A. Note that although the air con- York: Author.
duction thresholds are elevated by 50–60 dB, bone con- American National Standards Institute. (1999). American Na-
duction thresholds are within normal limits (0 dB HL). tional Standard Maximum Permissible Ambient Noise Levels
The primary e¤ect of a conductive disorder is to reduce for Audiometric Test Rooms (S3.1). New York: Author.
the level of sound reaching the inner ear. The impair- American Speech-Language-Hearing Association. (1978).
Guidelines for manual pure tone threshold audiometry.
ment is primarily a loss in hearing sensitivity, not in ASHA, 20, 297–301.
speech understanding. If the sound level can be in- American Speech-Language-Hearing Association. (1990).
creased, the person will be able to hear and understand Guidelines for audiometric symbols. ASHA, 32(Suppl. 2),
speech. 25–30.
Hearing loss resulting from damage or disease to any American Speech-Language-Hearing Association. (1997).
portion of the inner ear or neural auditory pathways is Guidelines for audiologic screening. Rockville, MD: Author.
classified as sensorineural. Because the problem lies in Dancer, J., and Ventry, I. M. (1976). E¤ects of stimulus pre-
the inner ear or neural pathways (or both), there will be sentation and instructions in pure-tone thresholds and false-
an equal hearing loss for both air- and bone-conducted alarm responses. Journal of Speech and Hearing Disorders,
signals. An audiogram for a patient with a sensorineural 41, 315–324.
Davis, H. (1978). Hearing handicap, standards for hearing,
hearing loss is shown as Figure 2B. Notice that the bone and medicolegal rules. In H. Davis and S. R. Silverman
conduction thresholds are the same as the air conduc- (Eds.), Hearing and deafness (pp. 266–290). New York:
tion thresholds—there is no air–bone gap. In contrast to Holt, Rinehart, and Winston.
conductive hearing loss, sensorineural hearing loss typi- Ward, W. D. (1964). Sensitivity versus acuity. Journal of
cally involves both a loss in hearing sensitivity and a Speech and Hearing Research, 7, 294–295.
reduced ability to understand speech. Even when speech Wilber, L. A. (1999). Pure-tone audiometry: Air and bone
is made louder, the person will still have some di‰culty conduction. In F. E. Musiek and W. F. Rintelmann (Eds.),
in understanding. Contemporary perspectives in hearing assessment (pp. 1–20).
A mixed hearing loss is a combination of both con- Needham Heights, MA: Allyn and Bacon.
ductive and sensorineural losses. Figure 2C is an exam- Wiley, T. L. (1988). Audiologic evaluation. In D. Yoder and
R. Kent (Eds.), Decision making in speech-language pathol-
ple audiogram for a mixed hearing loss in both ears. ogy (pp. 4–5). Toronto: B. C. Decker.
Note that air conduction thresholds are poorer than Yantis, P. A. (1994). Puretone air-conduction testing. In
normal, averaging about 60 dB HL, and bone conduc- J. Katz (Ed.), Handbook of clinical audiology (4th ed., pp.
tion thresholds are also poorer than normal, averaging 97–108). Baltimore: Williams and Wilkins.
35 dB HL. There is an air–bone gap of approximately
25 dB, suggestive of some conductive hearing loss. In
addition, there is a loss by bone conduction, indicating Further Readings
some abnormality of the inner ear and/or auditory Bess, F. H., and Humes, L. E. (1995). Audiology: The funda-
nerve. mentals (2nd ed.). Baltimore: Williams and Wilkins.
The case of a mixed hearing loss underscores the fact Carhart, R., and Jerger, J. F. (1959). Preferred method for
that both sensorineural and conductive disorders can clinical determination of pure-tone thresholds. Journal of
exist simultaneously in the same patient. A person with a Speech and Hearing Disorders, 24, 330–345.
Carhart, R., and Porter, L. S. (1971). Audiometric configura- proposed earlier. The new term better reflected the rela-
tion and prediction of threshold for spondees. Journal of tionship between the index and what were then called
Speech and Hearing Research, 14, 486–495. articulation tests. Articulation tests were speech tests in
Eagles, E. L. (1972). Selected findings from the Pittsburgh which listeners were asked to identify speech sounds
study. Transactions of the American Academy of Ophthal-
spoken by a caller under conditions of interest to the
mology and Otolaryngology, 76, 343–348.
Harrell, R. W. (2002). Puretone evaluation. In J. Katz (Ed.), experimenter. The exact makeup of the articulation tests
Handbook of clinical audiology (5th ed., pp. 71–87). Balti- varied, but they usually consisted of a carrier sentence
more: Lippincott Williams and Wilkins. with a nonsense syllable test item at the end. Several
Henderson, D., Salvi, R. J., Boettcher, F. A., and Clock, A. E. callers would in turn utter the sentences via the test cir-
(1994). Neurophysiologic correlates of sensory-neural hear- cuit to crews of six to eight listeners. The listeners would
ing loss. In J. Katz (Ed.), Handbook of clinical audiology record what they heard phonetically. The circuit ar-
(4th ed., pp. 37–55). Baltimore: Williams and Wilkins. ticulation was equal to the average proportion of the
Joint Audiology Committee on Clinical Practice Algorithms sounds (or syllables) heard correctly by the listeners. The
and Statements. (2000). Audiology clinical practice algo- articulation index was devised as an alternative to this
rithms and statements. Audiology Today, pp. 32–49. [Spe-
procedure.
cial issue]
Konkle, D. F. (1995, April). Classifying terms, degrees of The articulation index is an index that describes the
hearing sensitivity. Hearing Instruments, 11. proportion of the total importance-weighted speech
Martin, F. N., and Clark, J. G. (2000). Introduction to audiol- signal that is audible to the listener under specified
ogy (7th ed., pp. 73–115). Needham Heights, MA: Allyn conditions. Normally, the index’s value (ranging from 0
and Bacon. to 1) is derived from physical measurements, including
Roeser, R. J., Buckley, K. A., and Stickney, G. S. (2000). Pure principally the speech signal’s intensity level, the level of
tone tests. In R. J. Roeser, M. Valente, and H. Hosford- any noise that may be present, and the characteristics of
Dunn (Eds.), Audiology diagnosis (pp. 227–251). New the transmission system that delivers the speech and
York: Thieme. noise to the listener’s ear. Reverberation e¤ects are
Salvi, R. J., Henderson, D., Hamernik, R., and Ahroon, W. A.
sometimes included. Index values are often modified
(1983). Neural correlates of sensorineural hearing loss. Ear
and Hearing, 4, 115–129. based on certain well-known performance characteristics
Van Cleve, J. V. (Ed.). (1987). Gallaudet encyclopedia of deaf of the human auditory system. Most commonly these
people and deafness (3 vols.). Blacklick, OH: McGraw-Hill. would include pure-tone thresholds and cross-frequency
Ward, W. D. (1983). The American Medical Association/ band spread of masking. The negative e¤ects of listening
American Academy of Otolaryngology formula for deter- at high signal levels might also be included, as well as
mination of hearing handicap. Audiology, 22, 313–324. positive factors, such as the e¤ect of the listener being
Wiley, T. L. (1980). Hearing disorders and audiometry. In T. J. able to see the talker’s face. The articulation index value
Hixon, L. D. Shriberg, and J. S. Saxman (Eds.), Introduc- can be used directly as an indicator of the relative quality
tion to communication disorders (pp. 490–529). Englewood of a communications system, or it can be used to predict
Cli¤s, NJ: Prentice-Hall.
average speech recognition success for the particular
Yost, W. A. (2000). Fundamentals of hearing (4th ed.). New
York: Academic Press. types of speech or speech elements under specified lis-
tening conditions through the use of an appropriate
transfer function.
The early history of development of the articulation
Speech Perception Indices index is di‰cult to follow with certainty because much
of this work was done at the Bell Telephone Labo-
ratories and was described only in internal company
The articulation index, or, as it is now known, the speech memos and reports. Most was not published in the sci-
intelligibility index, was originally developed by early entific literature for 10–25 years after it was carried out.
telephone engineer-scientists to describe and predict the Some was never published. Fortunately, copies of many
quality of telephone circuits (Fletcher, 1921; Collard, of the original documents from this early period recently
1930; French and Steinberg, 1947). Initially their moti- became available on a compact disk through the e¤orts
vation was to provide a method that would reduce the of C. M. Rankovic and J. B. Allen (Rankovic and Allen,
need for lengthy (and expensive) human articulation 2000).
tests to evaluate the merit of telephone circuit modifica- The first credible e¤ort to evaluate the e¤ectiveness of
tions. However, by 1947, with the appearance of the a communication system based on physical measure-
watershed papers of French and Steinberg (1947) and ments was that of H. Fletcher (1921). In an unpublished
Beranek (1947), it was evident that articulation theory Western Electric Laboratory report, Fletcher pointed
had the potential for much greater significance and out that a suitable measure (index) of circuit quality
broader application than was implied by these early must have the property of additivity. By this he meant
goals. that if a particular frequency range of speech heard
The term articulation index (AI) was coined at the alone has a quality value of Q1 and if a second frequency
Bell Telephone Laboratories and first appeared in com- range has a value Q2 , then the value of both ranges
pany memos as early as 1926 (French, 1926). It replaced heard at the same time should equal Q1 þ Q2 . It was
the term quality index that Harvey Fletcher (1921) had clear that articulation test scores did not possess this
Speech Perception Indices 539
property (i.e., A1 þ A2 0 A12 ). Therefore, Fletcher pro- method (Rankovic, 1997, 1998). An available computer
posed an intermediate variable, related to articulation, program implementing the procedure by H. Müsch
that would at least approximate this additivity property. (2001) now makes its practical use feasible.
He also proposed methods to derive the index’s value for The next major steps in the history of the articulation
a circuit from measures of received speech intensity, fre- index were taken by Karl Kryter. In two landmark
quency distortion, room and line noise, ‘‘asymmetric papers (Kryter, 1962a, 1962b), he described and vali-
distortion,’’ and other factors (Fletcher, 1921). dated a comprehensive method for calculating the artic-
The term articulation index first appeared in the pub- ulation index under a broad range of conditions. This
lished literature in the landmark paper by Bell Tele- work was based directly on the publications of Beranek
phone Laboratory scientists N. R. French and J. C. (1947) and French and Steinberg (1947). Kryter’s meth-
Steinberg in 1947. However, the term appears in internal ods became even more influential in 1969 when they
Bell Labs documents as a replacement for quality index were adopted, virtually intact, by the American National
as early as 1926. The expression quality theory, however, Standards Institute (ANSI) as the American National
continued in use for several more years before finally Standard Methods for the Calculation of the Articulation
being replaced by articulation theory. Index (ANSI S3.5-1969). For some 30 years this method
In a parallel development in England, John Collard, quite literally defined the articulation index.
working for International Telephone and Telegraph, An important development taking place in Europe
published a detailed description of an index with prop- during this period was that of the speech transmission
erties similar to those of the articulation index in a series index by T. Houtgast and H. J. M. Steeneken (Houtgast
of papers beginning in 1930. He called his index band and Steeneken, 1973; Steeneken and Houtgast, 1980).
articulation (or frequently just ‘‘the new unit’’). Speech Similar in many ways to the articulation index of French
test scores were called sound articulation. By 1939 Col- and Steinberg (1947), the speech transmission index
lard had created a mechanical band articulation calcu- added a unique method for incorporating and combining
lator (Pocock, 1939) that included many of the features the e¤ects of noise and reverberation into the calcula-
of more recent articulation index methods. It is unclear tion through measurements of the modulation transfer
whether the Bell Telephone Laboratory scientists were function (MTF). At first called the weighted MTF, the
aware of Collard’s work, but it is not cited in either their speech transmission index has found relatively wide-
published or unpublished reports. spread use in the area of architectural acoustics in an
During World War II, Fletcher, then director of abbreviated form known as the rapid speech transmis-
physical research at Bell Telephone and chairman of the sion index, or RASTI.
National Defense Research Committee (NDRC), pro- In 1997 ANSI published the American National
vided a description of at least some of the articulation Standard Method for Calculation of the Speech Intelligi-
index methods that had been developed at Bell Labs to bility Index (ANSI S3.5-1997). This renamed standard is
Leo Beranek, then at Harvard University (Allen, 1996). the direct successor of articulation index standard ANSI
Beranek was working on methods for improving com- S3.5-1969 and it is similar to the earlier standard in basic
munications for aircraft pilots as a part of the war e¤ort concept. However, there are many di¤erences in detail.
under a contract from the NDRC. Allen (1996) reports One of the more obvious is the change in the name of the
that following the war, Beranek persuaded the Bell Labs index to speech intelligibility index. The new name fi-
group to finally publish a description of their work on nally severs the connection with the now obsolete term,
the articulation index. The classic 1947 paper by N. R. articulation test, and also avoids confusion between its
French and J. C. Steinberg was the result. This paper abbreviation, AI, and the newer but more general use of
was soon followed by Beranek’s frequently cited paper this abbreviation to mean artificial intelligence.
on the articulation index (Beranek, 1947). These two Of the procedural di¤erences, one of the more funda-
papers together were to play highly influential roles in mental ones concerns the frequency importance function.
the future of the articulation index. The frequency importance function describes the relative
In 1950 Fletcher and his long-time Bell Labs associate importance of di¤erent frequency regions along the fre-
Rogers Galt published a detailed description of their quency scale for speech intelligibility. ANSI S3.5-1997
conception of the articulation index. Their goal in this provides a function for average speech, but it also pro-
paper was to cover a broader range of conditions than vides di¤erent functions for particular types of speech.
had ever been attempted before. ‘‘Telephony’’ was de- This change from the earlier standard implies that fre-
fined by the authors as referring to ‘‘any talker-listener quency importance is significantly dependent on the
combination’’ (p. 90). The results were seen as applica- characteristics of the speech material and not only
ble to sound recording and reproduction systems, public dependent on the characteristics of the auditory system.
address systems, and even hearing aids. In 1952, Fletcher In addition, ANSI S3.5-1997 provides for calculations
further extended the applications to persons with hearing based on the auditory critical band, uses newer methods
loss. Unfortunately, Fletcher and Galt’s attempt to for calculating spread of masking, includes a speech level
account for all aspects of the problem resulted in a distortion factor, uses di¤erent speech spectra for raised,
conceptualization and method that was too complex for loud, and shouted speech, and provides for use of the
most people to understand, and it was seldom used. modulation transfer function methods of Houtgast
Recently, there has been a renewed interest in the and Steeneken (1980). To ensure accuracy, a computer
program implementing the S3.5-1997 method was made Rankovic, C. M., and Allen, J. B. (2000). Study of speech and
available. This program lacks a user-friendly interface hearing at Bell Telephone Laboratories: The Fletcher years.
but provides an invaluable test of other implementations Compact disk. New York: Acoustical Society of America.
of the method. Steeneken, H. J. M., and Houtgast, T. (1980). A physical
method for measuring speech-transmission quality. Journal
—Gerald A. Studebaker of the Acoustical Society of America, 67, 318–326.
References Further Readings

Allen, J. B. (1996). Harvey Fletcher’s role in the creation of Ching, T. Y. C., Dillon, H., and Byrne, D. (1998). Speech rec-
communication acoustics. Journal of the Acoustical Society ognition of hearing-impaired listeners: Predictions from au-
of America, 99, 1825–1839. dibility and the limited role of high-frequency amplification.
American National Standards Institute. (1969). American Journal of the Acoustical Society of America, 103, 1128–
National Standard methods for the calculation of the Articu- 1140.
lation Index (ANSI S3.5-1969). New York: Acoustical So- Ching, T. Y., Dillon, H., Katsch, R., and Byrne, D. (2001).
ciety of America. Maximizing e¤ective audibility in hearing aid fitting. Ear
American National Standards Institute. (1997). American Na- and Hearing, 22, 212–224.
tional Standard methods for the calculation of the Speech Dubno, J. R., Dirks, D. D., and Schaefer, A. B. (1989). Stop-
Intelligibility Index (ANSI S3.5-1997). New York: Acous- consonant recognition for normal-hearing listeners and
tical Society of America. listeners with high-frequency hearing loss: II. Articulation
Beranek, L. L. (1947). The design of speech communication index predictions. Journal of the Acoustical Society of
systems. Proceedings of the Institute of Radio Engineers, 35, America, 85, 355–364.
880–890. Duggirala, V., Studebaker, G. A., Pavlovic, C. V., and Sher-
Collard, J. (1930). Calculation of the articulation of a tele- becoe, R. L. (1988). Frequency importance functions for a
phone circuit from circuit constants. Electrical Communica- feature recognition test. Journal of the Acoustical Society of
tion, 8, 141–163. America, 83, 2372–2382.
Collard, J. (1933). A new unit of circuit performance. Electrical Humes, L. E., Dirks, D. D., Bell, T. S., Ahlstrom, C., and
Communication, 11, 226–233. Kincaid, G. (1986). Application of the articulation index
Collard, J. (1934). The practical application of the new unit and the speech transmission index to the recognition of
of circuit performance. Electrical Communication, 12, 270– speech by normal-hearing and hearing-impaired listeners.
275. Journal of Speech and Hearing Research, 29, 447–462.
Fletcher, H. (1921). Study of speech and telephone quality. Kamm, C. A., Dirks, D. D., and Bell, T. S. (1985). Speech
Unpublished laboratory report No. 21839, Western Electric recognition and the articulation index for normal and
Company, April 8, 1921. Reproduced on compact disk hearing-impaired listeners. Journal of the Acoustical Society
compiled by C. M. Rankovic and J. B. Allen, 2000, Study of of America, 77, 281–288.
speech and hearing at Bell Telephone Laboratories: The Müsch, H., and Buus, S. (2001). Using statistical decision
Fletcher years. New York: Acoustical Society of America. theory to predict speech intelligibility: I. Model structure.
Fletcher, H. (1952). The perception of speech sounds by deaf- Journal of the Acoustical Society of America, 109, 2896–
ened persons. Journal of the Acoustical Society of America, 2909.
24, 490–497. Pavlovic, C. V. (1987). Derivation of primary parameters and
Fletcher, H., and Galt, R. H. (1950). The perception of speech procedures for use in speech intelligibility predictions.
and its relation to telephony. Journal of the Acoustical So- Journal of the Acoustical Society of America, 82, 413–422.
ciety of America, 22, 89–151. Pavlovic, C. V. (1993). Problems in the prediction of speech
French, N. R. (1926). Telephone quality. Unpublished memo- recognition performance of normal-hearing and hearing-
randum, October 19, 1926. Reproduced on compact disk impaired individuals. In G. A. Studebaker and I. Hochberg
compiled by C. M. Rankovic and J. B. Allen, 2000, Study of (Eds.), Acoustical factors a¤ecting hearing aid performance
speech and hearing at Bell Telephone Laboratories: The (pp. 221–234). Needham Heights, MA: Allyn and Bacon.
Fletcher years. New York: Acoustical Society of America. Pavlovic, C. V., and Studebaker, G. A. (1984). An evaluation
French, N. R., and Steinberg, J. C. (1947). Factors governing of some assumptions underlying the articulation index.
the intelligibility of speech sounds. Journal of the Acoustical Journal of the Acoustical Society of America, 75, 1606–
Society of America, 19, 90–119. 1612.
Houtgast, T., and Steeneken, H. J. M. (1973). The modulation Pavlovic, C. V., Studebaker, G. A., and Sherbecoe, R. L.
transfer function in room acoustics as a predictor of speech (1986). An articulation index based procedure for predicting
intelligibility. Acustica, 28, 66–73. the speech recognition performance of hearing-impaired
Kryter, K. (1962a). Methods for the calculation and use of individuals. Journal of the Acoustical Society of America,
the articulation index. Journal of the Acoustical Society of 80, 50–57.
America, 34, 1689–1697. Rankovic, C. M. (1997). Prediction of speech reception for
Kryter, K. D. (1962b). Validation of the articulation index. listeners with sensorineural hearing loss. In W. Jesteadt
Journal of the Acoustical Society of America, 34, 1698– (Ed.), Modeling sensorineural hearing loss. Mahwah, NJ:
1702. Erlbaum.
Müsch, H. (2001). Review and computer implementation of Rankovic, C. M. (1998). Factors governing speech reception
Fletcher and Galt’s method of calculating the articulation benefits of adaptive liner filtering for listeners with sensori-
index. Acoustics Research Letters Online, 2, 25–30. neural hearing loss. Journal of the Acoustical Society of
Pocock, L. C. (1939). The calculation of articulation for e¤ec- America, 103, 1043–1057.
tive rating of telephone circuits. Electrical Communication, Schroeder, M. (1981). Modulation transfer function: Definition
18, 120–132. and measurement. Acustica, 49, 179–182.
Speech Tracking 541
Studebaker, G. A., Gray, G. A., and Branch, W. E. (1999). mented by the Tactaid 7 vibrotactile aid. After about 30
Prediction and statistical evaluation of speech recognition hours of testing and training with Speech Tracking, the
test scores. Journal of the American Academy of Audiology, subject’s mean Tracking Rates in the two presentation
10, 355–370. conditions were 33.7 wpm for lip reading alone and
Studebaker, G. A., and Sherbecoe, R. L. (1993). Frequency
46.1 wpm for lip reading supplemented by the Tactaid 7.
importance functions for speech recognition. In G. A. Stu-
debaker and I. Hochberg (Eds.), Acoustical factors a¤ecting Despite the widespread acceptance of Speech Track-
hearing aid performance. Needham Heights, MA: Allyn and ing in research projects, its use as an evaluative tool has
Bacon. been severely criticized by Tye-Murray and Tyler (1988).
Studebaker, G. A., Sherbecoe, R. L., and Gilmore, C. (1993). These researchers cited a number of extraneous variables
Frequency-importance and transfer functions for the Audi- that they believed were extremely di‰cult to control.
tec of St. Louis recordings of the NU-6 word test. Journal of These included the characteristics of the speaker (seg-
Speech and Hearing Research, 36, 799–807. ment selection, ability to use cues to overcome block-
Zwicker, E. (1961). Subdivision of the audible frequency range ages, speaking style, articulatory patterns, etc.), the
into critical bands (Frequenzgruppen). Journal of the Acous- receiver (degree of assertiveness, language proficiency,
tical Society of America, 33, 248.
motivation, etc.), and the text (degree of syntactic com-
plexity, vocabulary, etc.). Hochberg, Rosen, and Ball
(1989), for example, found that text complexity could
Speech Tracking greatly influence Tracking Rate. In a study using the
same talker/receiver pairs, they found that Tracking
Rates varied from 62.9 wpm for ‘‘easy’’ materials (con-
Speech Tracking (sometimes called Connected or Con- trolled vocabulary readers designed for English-as-a-
tinuous Discourse Tracking) is a procedure developed by second-language learners) to 29.5 wpm for ‘‘di‰cult’’
De Filippo and Scott ‘‘for training and evaluating the materials (popular adult fiction).
reception of ongoing speech’’ (De Filippo and Scott, Tye-Murray and Tyler (1998) believed that these fac-
1978, p. 1186). In the Speech Tracking procedure, a tors made Speech Tracking unsuitable for across-subject
talker (usually a therapist or experimenter) reads from a test designs. They did, however, feel that with some
prepared text, phrase by phrase, for a predetermined modifications the procedure could be used for within-
time period, usually five or ten minutes. The task of the subject test designs. These recommendations included an
receiver (the person with hearing loss) is to repeat ex- insistence on a verbatim response, the use of only one
actly what the talker said. If the receiver does not give a speaker, training of the speaker/receiver pairs, the use of
verbatim response, the talker applies a strategy to over- appropriate texts, and limiting the repair strategies used
come the breakdown. This can take the form of simple to repetition and writing down a blocked word after
repetition, the use of clue words, or a paraphrase of three repeats.
the original segment. The therapist goes on to the next A number of groups (for example, Boothroyd, 1987;
phrase only when the receiver is able to repeat correctly Pichora-Fuller and Benguerel, 1991; Dempsey et al.,
every word of the original segment. 1992) have attempted to make the technique more
At the end of the time period, the number of words suitable for evaluative purposes through the use of
repeated correctly is counted and divided by the time computer-controlled recorded materials. These ap-
elapsed to derive the receiver’s Tracking Rate, expressed proaches ensure that the problems created by sender
in words per minute (wpm). For example, if a receiver di¤erences are controlled and minimized. Although
was able to repeat 150 words in five minutes, his Track- promising, none of these systems have become widely
ing Rate would be 30 wpm. The Tracking Rate repre- accepted and used.
sents the time taken for the text to be presented and The KTH Speech Tracking Procedure (Gnosspelius
repeated. Estimates of Tracking Rate for people with and Spens, 1992; Spens, 1992) represented another
normal hearing vary, but it is generally recognized that a attempt to more closely control the approach. This
rate of around 100 wpm (De Filippo, 1988) is obtained computer-controlled modification used live-voice pre-
if the same presentation and response rules are followed. sentations, but the segment length was predetermined
Since its introduction, Speech Tracking has also been and only one repair strategy—repetition—was allowed.
used extensively in investigations evaluating the e¤ec- The written form of any word repeated three times was
tiveness of sensory aids for people with hearing loss. automatically presented to the receiver via an LED dis-
These have included studies of cochlear implants (Rob- play. At the end of a Speech Tracking session the pro-
bins et al., 1985; Levitt et al., 1986), tactile aids (Brooks gram automatically calculated a number of measures
et al., 1986; Cowan et al., 1991; Plant, 1998), and direct including Tracking Rate, Ceiling Rate (time taken when
contact tactile approaches such as Tadoma (Reed et al., all words in a segment were correctly repeated after only
1992) and Tactiling (Plant and Spens, 1986). These one presentation), and the Proportion of Blocked Words
studies usually compare a receiver’s Tracking Rate in (the total number of words that had to be repeated di-
two or more presentation conditions such as aided and vided by the total number of words in the session). This
unaided lip reading. For example, Plant (1998) looked at approach has been used in a number of studies (for ex-
a subject’s Speech Tracking performance with materials ample, Plant, 1998; Ronnberg et al., 1998) but has not
presented via lip reading alone and lip reading supple- gained widespread acceptance.
Speech Tracking has also been widely used as a ceiver is then scored for the number of words correctly
training technique for use with adults (Plant, 1996) and identified and shown the written form of the segment. At
children (Tye-Murray, 1998) with profound hearing loss. the end of each part, the percentage of correct responses
When used for training, modifications can be made to is calculated, based on the number of words correctly
provide receivers with practice in the use of repair strat- identified. This approach provides the receiver with im-
egies. Owens and Raggio (1987), for example, provided mediate feedback on the correctness of her or his re-
receivers with a list of directives they could use when sponse and ensures that she or he is able to benefit from
they did not correctly repeat a segment. The receiver ongoing contextual information.
could ask the sender to: Speech Tracking is an innovative technique that can
be used for both testing and training the speech percep-
1. Say that again. tion skills of people with hearing loss. When used for
2. Say it another way. testing, however, a precise protocol must be followed to
3. Spell that word. minimize the e¤ects of sender, receiver, and text varia-
4. Write that word. bles. In training a less rigid approach can be used, and
5. Spell an important word. the approach may be modified to include practice in the
6. Write an important word. use of repair strategies.
Lunato and Weisenberger (1994) looked at the com- —Geo¤ Plant
parative e¤ectiveness of four repair strategies in Speech
Tracking. The repair strategies used were:
References
1. Verbatim repetition of a word or phrase.
2. The use of antonyms or synonyms as clues to the Boothroyd, A. (1987). CASPER. Computer assisted speech
perception evaluation and training. In Proceedings of the
identity of blocked words.
10th Annual Conference of the Rehabilitation Society of
3. Providing the receiver with phoneme-by-phoneme America (pp. 734–736). Washington, DC: Association for
correction of blocked words. the Advancement of Rehabilitation Technology.
4. Providing context by moving forward or backward in Brooks, P. L., Frost, B. J., Mason, J. L., and Gibson, D. M.
the text. (1986). Continuing evaluation of the Queen’s University
Tactile Vocoder. II: Identification of open set sentences and
These researchers reported that strategy 1 yielded the tracking narrative. Journal of Rehabilitation Research and
highest Tracking Rates and strategy 2 the lowest. Development, 23, 129–138.
When Speech Tracking is being used for training, Cowan, R. S. C., Blamey, P. J., Sarant, J. Z., Galvin, K. L.,
compromises can also be made in the receiver’s response Alcantara, J. I., Whitford, L. A., and Clark, G. M. (1991).
patterns. Owens and Raggio (1987), for example, argued Role of a multichannel electrotactile speech processor in a
for the use of nonverbatim responses in training sessions cochlear implant program for profoundly hearing impaired
adults. Journal of the Acoustical Society of America, 12, 39–
using Speech Tracking. While acknowledging the im-
46.
portance of a verbatim response for test purposes, they De Filippo, C. L. (1988). Tracking for speechreading training.
felt that in training it may be better to provide the re- Volta Review, 90, 215–239.
ceiver with practice in picking up the gist of the message De Filippo, C. L., and Scott, B. L. (1978). A method for
rather than expecting absolute identification at all times. training the reception of ongoing speech. Journal of the
Although Speech Tracking has become a widely used Acoustical Society of America, 63, 1186–1192.
training procedure, there are some people with hearing Dempsey, J. J., Levitt, H., Josephson, J., and Porazzo, J.
loss for whom it is unsuitable. These include people with (1992). Computer-assisted tracking simulation (CATS).
very poor speech reception skills, resulting in Tracking Journal of the Acoustical Society of America, 92, 701–710.
Rates of less than 20 wpm. At these levels receivers find Gnosspelius, J., and Spens, K.-E. (1992). A computer-based
speech tracking procedure. STL-QPSR (No. 1), 131–137.
the task extremely di‰cult and stressful. Others for Hochberg, I., Rosen, S., and Ball, V. (1989). E¤ects of text
whom the technique may be unsuitable include people complexity on connected discourse tracking rate. Ear and
with poor speech production skills. In such cases the Hearing, 10, 192–199.
sender may be unable to determine reliably whether Levitt, H., Waltzman, S. B., Shapiro, W. H., and Cohen, N. L.
the receiver gave the correct response. This necessitates (1986). Evaluation of a cochlear prosthesis using connected
the use of a written or a signed response, which serves to discourse tracking. Journal of Rehabilitation Research and
greatly reduce the Tracking Rate. Development, 23, 147–154.
Plant (1996, 1989) developed a modified version of Lunatto, K. E., and Weisenberger, J. M. (1994). Comparative
Speech Tracking designed to be used with such cases. e¤ectiveness of correction strategies in connected discourse
Simple stories are divided into parts, each consisting of tracking. Ear and Hearing, 15, 362–370.
Owens, E., and Raggio, M. (1987). The UCSF tracking pro-
200 words. Each part is in turn divided into short seg- cedure for evaluation and training of speech reception by
ments ranging in length from 4 to 12 words. The seg- hearing-impaired adults. Journal of Speech and Hearing
ments are presented for identification, and the receiver is Disorders, 52, 120–128.
asked to repeat as many words as he or she can. The Pichora-Fuller, M. K., and Benguerel, A. P. (1991). The design
receiver can use repair strategies to obtain additional in- of CAST (computer-aided speechreading training). Journal
formation if he or she experiences di‰culties. The re- of Speech and Hearing Research, 34, 202–212.
Speechreading Training and Visual Tracking 543
Plant, G. (1989). Commtrac: Modified Connected Discourse co-articulated during speech production. In addition,
Tracking Exercises for Hearing Impaired Adults. Chats- expectations about linguistic context may influence un-
wood, NSW, Australia: National Acoustic Laboratories. derstanding. Nevertheless some individuals are expert
Plant, G. (1996). Commtrac: Modified Speech Tracking Exer- speechreaders, scoring more than 80% correct on words
cises for Hearing-Impaired Adults. Somerville, MA: Hearing
in unrelated sentences, and demonstrate enhanced visual
Rehabilitation Foundation.
Plant, G. (1998). Training in the use of a tactile supplement to phonetic perception (Bernstein, Demorest, and Tucker,
lipreading: A long-term case study. Ear and Hearing, 19, 2000). Attempts to relate speechreading proficiency to
394–406. other sensory, perceptual, and cognitive function, in-
Plant, G., and Spens, K.-E. (1986). An experienced user of cluding neurophysiological responsiveness, have met
tactile information as a supplement to lipreading. An eval- with limited success (for a review, see Summerfield,
uative study. STL-QPSR (No. 1), 87–110. 1992).
Reed, C. M., Durlach, N. I., and Delhorne, L. A. (1992). From a historical perspective, speechreading was ini-
Natural methods of tactile communication. In I. Summers tially developed in Europe as a method to teach speech
(Ed.), Tactile aids for the hearing impaired (pp. 219–230). production to young children with hearing loss. Until the
London: Whurr.
1890s it was limited to children and was characterized by
Robbins, A. M., Osberger, M. J., Miyamoto, R. T., Kienle,
M. J., and Myres, W. (1985). Speech tracking performance a vision-only (unisensory) approach. Speechreading
in single channel cochlear implant subjects. Journal of training was based on analytic methods, which encour-
Speech and Hearing Research, 28, 565–578. aged perceivers to analyze mouth position to recognize
Ronnberg, J., Andersson, U., Lyxell, B., and Spens, K.-E. sounds, words, and sentences, or synthetic methods,
(1998). Vibrotactile speech tracking support: Cognitive pre- which encouraged perceivers to grasp a speaker’s whole
requisites. Journal of Deaf Studies and Deaf Education, 3, meaning (Gestalt). O’Neill and Oyer (1961) reviewed
143–156. several early distinctive methods that were adopted in
Spens, K.-E. (1992). Numerical aspects of the speech tracking America: Bruhn’s method (characterized by syllable
procedure. STL-QPSR (No. 1), 115–130. drill and close observation of lip movements), Nitchie’s
Tye-Murray, N. (1998). Foundations of aural rehabilitation:
method (which shifted from an analytical to a synthetic
Children, adults, and their family members. San Diego:
Singular Publishing Group. method), Kinzie’s method (Bruhn’s classification of
Tye-Murray, N., and Tyler, R. S. (1988). A critique of contin- sounds plus Nitchie’s basic psychological ideas), the Jena
uous discourse tracking as a test procedure. Journal of method (kinesthetic and visual cues), and film techniques
Speech and Hearing Disorders, 53, 226–231. (Mason’s visual hearing, Markovin and Moore’s con-
textual systemic approach). Gagné (1994) reviewed
present-day approaches: multimodal speech perception
Speechreading Training and Visual (integration of available auditory cues with those from
Tracking vision and other modalities), computer-based activi-
ties (interactive learning using full-motion video), and
conversational contexts (question-answer approach,
Speechreading (lipreading, visual speech perception), a e¤ective communication strategies, and training for
form of information processing, is defined by Boothroyd talkers with normal hearing to improve communication
(1988) as a ‘‘process of perceiving spoken language using behavior).
vision as the sole source of sensory evidence’’ (p. 77). When indicated, speechreading training is included in
Speechreading, a natural process in everyday communi- comprehensive programs of aural rehabilitation. At best,
cation, is especially helpful when communicating in the post-treatment gains from speechreading training are
noisy and reverberant conditions because facial motion modest, in the range of approximately 15%. Unfortu-
in speech production may augment or replace degraded nately, data on improvements related to visual speech
auditory information (Erber, 1969). Also, visual cues perception training are limited, and little is known about
have been shown to influence speech perception in in- the e‰cacy of various approaches. However, some indi-
fants with normal hearing (Kuhl and Meltzo¤, 1982), viduals demonstrate significant gains. Walden et al.
and speech perception phenomena, such as the McGurk (1977) have reported an increase in the number of visu-
e¤ect (MacDonald and McGurk, 1978), demonstrate the ally distinctive groups of phonemes (visemes) with which
influence of vision on auditory speech perception. consonants are recognized following practice. The iden-
To understand language, the speechreader directs tification responses following practice suggest that the
attention to, extracts, and uses linguistically relevant distinctiveness of visual phonetic cues related to place-
information from a talker’s face movements, facial of-articulation information is increased. Although it is
expressions, and body gestures. This information, which not clear what factors account for such improvements
may vary within and across talkers (for a review, see in performance, these results provide evidence for
Kricos, 1996), is integrated with other available sensory instances of learning in which perception is modified.
cues, such as auditory cues, as well as knowledge about Other studies show that the results are variable, both in
speech production and language in general to make performance on speechreading tasks and in gains related
sense of the visual information. However, the visual in- to learning programs. Improvements observed by Mas-
formation may be ambiguous because many sounds saro, Cohen, and Gesi (1993) suggest that repeated
look alike on the lips, are hidden in the mouth, or are testing experience may be as beneficial as structured
training. Initial changes may be due to nonsensory

factors such as increased familiarity with the task or
improved viewing strategies. In contrast, Bernstein et al.
(1991) suggest that speechreaders learn the visual pho-
netic characteristics of specific talkers after long periods
of practice. Treatment e‰cacy studies may be enhanced
by enrolling a larger number of participants, specifying
training methods, using separate materials for training
versus testing, evaluating asymptotic performance and
long-term e¤ects, determining whether the e¤ects of the
intervention are generalized to nontherapy situations,
and designing studies to control for factors such as the
motivation or test-taking behaviors of participants, as
well as personal attention directed toward participants
(for a review, see Gagné, 1994; Walden and Grant,
1993).
Current research has also focused on visual speech
perception performance in psychophysical experiments.
One research theme has centered on determining what
regions of the face contain critical motion that is used in
visual speech perception. Subjective comments from ex-
pert lipreaders suggest that movement in the cheek areas
may aid lipreading. Data from Lansing and McConkie
(1994) illustrate that eye gaze may shift from the mouth
to the cheeks, chin, or jaw during lipreading. Results
from Greenberg and Bode (1968) support the usefulness
of the entire face for consonant recognition. In contrast,
results from Ijsseldijk (1992) and Marassa and Lansing
(1995) indicate that information from the lips and mouth
region alone is su‰cient for word recognition. Massaro
(1998) reports that some individuals can discriminate
among a small set of test syllables without directly gaz-
ing at the mouth of the talker, and Preminger et al.
(1998) demonstrate that 70% of visemes in /a/ and /aw/
Figure 1. The photograph at the top shows a profile of the head
vowel contexts can be recognized when the mouth is
mounted hardware of the prototype S&R (Stampe and Rein-
masked. These diverse research findings underscore the gold) Eyelink system. The lightweight headband holds two
presence of useful observable visual cues for spoken custom-built ultra-miniature high-speed cameras mounted on
language at the mouth and in other face regions. adjustable rods to provide binocular eye monitoring with high
Eye-monitoring technology may be useful in under- spatial resolution (0.01 degrees) and fast sampling rates
standing the role of visual processes in speechreading (250 Hz). Each camera uses two infrared light–emitting diodes
(Lansing and McConkie, 1994). It provides information (IR LEDs) to illuminate the eye to determine pupil position.
about on-line processing of visual information and the The power supply is worn at the back of the head and coupled
tendency for perceivers to direct their eyes to the regions to a specialized image-processing card housed in a computer
of interest on a talker’s face. By moving the eyes, a per- that runs the eye-tracking software. The photograph at the
bottom shows the third camera on the headband that tracks the
ceiver may take advantage of the densely packed, highly
relative location for banks of IR LEDs a‰xed to each corner
specialized cone receptor cells in the fovea to inspect vi- of the computer monitor which displays full-motion video or
sual detail (Hallet, 1986). Eye monitoring has been used text. The relative location of the LEDs changes in relation
to study a variety of cognitive tasks, such as reading, to head movement and distance from the display and is used
picture perception, and face recognition, and to study to compensate for head motion to determine x; y eye-fixation
human-computer interaction (for a review, see Rayner, locations. An Ethernet link connects the experimental display
1984). The basic data obtained from eye monitoring computer to the eye-tracking computer with support for real-
reveals sequences of periods associated with perception time data transfer and control.
in which the eye is relatively stable (fixations) and high-
velocity jumps in eye position (saccades) during which
perception is inhibited. Distributions of saccadic infor-
mation (‘‘where’’ decisions) are quantified in terms of
length and direction, and distributions of fixations
(‘‘when’’ decisions) are quantified in terms of duration
and location. Experiments are designed to evaluate the
variance associated with cognitive processes. For exam-
Figure 2. The graph at the top shows the se-

quence and location of x; y eye fixations for a
perceiver who is speechreading the sentence,
‘‘You can catch the bus across the street.’’
The size of the markers is scaled in relative
units to illustrate di¤erences in total fixation
times directed at each x; y location. The as-
terisk-shaped markers enclosed in a circle are
used to show x; y fixation locations during
observable face motion associated with
speech, and the square-shaped markers show
locations prior to and following speech mo-
tion. The rectangles are used to illustrate the
regions of the talker’s face, ordered from top
to bottom, left to right: eye, left cheek, nose,
mouth, right cheek, chin. Region boundaries
accommodate dynamic face movements for
the production of the entire sentence. The
graph at the bottom half shows the corre-
sponding data record and includes speech-
reading followed by the reading of text. The
y-axis of the graph is scaled in units corre-
sponding to the measurement identified by the
number that has been superimposed: 1 ¼ x
(pixel) location of horizontal eye movements;
2 ¼ y (pixel) location of vertical eye move-
ments; 3 ¼ pupil size/10; 4 ¼ eye movement
velocity. The darker vertical bars show peri-
ods in which no eye data are available due to
eye blinks, and the light gray vertical bars
illustrate saccades that are defined by high-
velocity eye movements. Eye fixations are
identified by the lines 1 and 2 and separated
from one another by a vertical bar.
ple, distributions of fixation duration and of saccade culty. For interpretation, the eye movement records are
length di¤er across cognitive tasks such as reading versus linked to the spatial and temporal characteristics of face
picture perception. Various types of instrumentation are motion for each video frame or speech event (e.g., lips
available for eye-monitoring research, some of which opening).
include direct physical contact with the eyes to camera- Results from eye-monitoring studies demonstrate that
based video systems that determine eye rotation charac- speechreaders make successive eye gazes (fixations) to
teristics based on changes in the location of landmarks inspect the talker’s face or to track facial motion. The
such as the center of the pupil or corneal reflections, free talker’s eyes attract attention prior to and following
of error induced by translation related to head move- speech production (Lansing and McConkie, 2003), and
ment (for a review, see Young and Sheena, 1975). Fac- in the presence of auditory cues (Vatikiotis-Bateson,
tors such as cost, accuracy, ease of calibration, response Eiigisti, and Munhall, 1998). If auditory cues are not
mode, and demands of the participants and experimental available, perceivers with at least average proficiency at-
task must be considered in selecting an appropriate eye- tend to the talker’s mouth region for accurate sentence
monitoring system. An example of a system used in perception (Lansing and McConkie, 2003). However,
speechreading research is shown in Figure 1. The system some gazes are observed toward the regions adjacent to
is used to record the eye movements of the perceiver and the lips as well as toward the eyes of the talker. Similarly
to obtain a detailed record of the sequence and dura- for word understanding, speechreaders direct eye gaze
tion of fixations. A scan plot and sample record of eye most often and for longer periods of time toward the
movements are shown in Figure 2. Simultaneously, talker’s mouth than toward any other region of the face.
measurements are made of the accuracy of perception, Motion in regions other than the mouth may increase
e‰ciency of processing, or judgment of stimulus di‰- signal redundancy from that at the mouth and a¤ord a
natural context for observing detailed mouth motion. Gagné, J.-P. (1994). Visual and audiovisual speech perception
Task characteristics also influence where people look for training: Basic and applied research needs. Journal of the
information on the face of the talker (Lansing and Academy of Rehabilitative Audiology, 27, 133–159.
McConkie, 1999). Eye gaze is directed toward secondary Greenberg, H. J., and Bode, D. L. (1968). Visual discrimina-
tion of consonants. Journal of Speech and Hearing Re-
facial cues, located in the upper part of the face, with
search, 11, 466–471.
greater frequency for the recognition of intonation in- Hallet, P. E. (1986). Eye movements. In K. B. Bo¤, L. Kauf-
formation than for phonemic or stress recognition. Pho- man, and J. P. Thomas (Eds.), Handbook of perception
nemic and word stress information can be recognized and human performance: Sensory processes and perception
from cues located in the middle and lower parts of the face. (pp. 10-1–10-102). New York: Wiley.
Finally, new findings from brain imaging studies may Ijsseldijk, F. J. (1992). Speechreading performance under dif-
provide valuable insights into the neural underpinnings ferent conditions of video image, repetition, and speech
of basic processes in the visual perception of spoken rate. Journal of Speech and Hearing Research, 35, 466–
language (Calvert et al., 1997) and individual di¤erences 477.
in speechreading proficiency (Ludman et al., 2000). Kricos, P. (1996). Di¤erence in visual intelligibility across
talkers. In D. G. Storke and M. E. Hennecke (Eds.),
Although preliminary results have not yet identified
Speechreading by humans and machines: Models, systems,
speechreading-specific regions, measures in perceivers and applications (NATO ASI Series, Vol. 150, Series F:
with normal hearing indicate bilateral activation of Computer and systems sciences, pp. 43–53). Berlin:
the auditory cortext for silent speechreading (Calvert Springer-Verlag.
et al., 1997). Results from measures in perceivers with Kuhl, P. K., and Meltzo¤, A. N. (1982). The bimodal percep-
congenital onset of profound bilateral deafness (who rely tion of speech in infancy. Science, 218, 1138–1141.
on speechreading for understanding spoken language) Lansing, C. R., and McConkie, G. W. (1994). A new method
do not indicate strong left temporal activation (Mac- for speechreading research: Eyetracking observers’ eye
Sweeny et al., 2002). Functional magnetic resonance movements. Journal of the Academy of Rehabilitative
imaging may prove to be a useful tool to test hypotheses Audiology, 27, 25–43.
Lansing, C. R., and McConkie, G. W. (1999). Attention to fa-
about task di¤erences and the activation of primary
cial regions in segmental and prosodic visual speech per-
sensory processing areas, the role of auditory experi- ception tasks. Journal of Speech, Language, and Hearing
ence and plasticity, and neural mechanisms and sites of Research, 42, 526–539.
cross-modal integration in the understanding of spoken Lansing, C. R., and McConkie, G. W. (2003). Word identifi-
language. cation and eye fixation locations in visual and visual-plus-
Continued study and research in the basic processes auditory presentations of spoken sentences. Perception and
of speechreading are needed to determine research-based Psychophysics, 65, 536–552.
approaches to intervention, the relative advantages of Ludman, C. N., Summerfield, A. Q., Hall, D., Elliott, M.,
di¤erent approaches, and how specific approaches relate Foster, J., Hykin, J. L., et al. (2000). Lip-reading ability and
to individual needs. Additional insight into the basic patterns of cortical activation studied using fMRI. British
Society of Audiology, 34, 225–230.
processes of visual speech perception is needed to de-
MacDonald, J. W., and McGurk, H. (1978). Visual influences
velop and test a model of spoken word recognition that on speech perception processes. Perception and Psycho-
incorporates visual information, to optimize sensory- physics, 24, 253–257.
prosthetic aids, and to enhance the design of human- MacSweeney, M., Calvert, G. A., Campbell, R., McGuire,
computer interfaces. P. K., David, A. S., Williams, S. C. R., et al. (2002).
Speechreading circuits in people born deaf. Neuro-
—Charissa R. Lansing psychologia, 40, 801–807.
Marassa, L. K., and Lansing, C. R. (1995). Visual word rec-
References ognition in two facial motion conditions: Full-face versus
lips-plus-mandible. Journal of Speech and Hearing Re-
Bernstein, L. E., Demorest, M. E., Coulter, D. C., and search, 38, 1387–1394.
O’Connell, M. P. (1991). Lipreading sentences with vibro- Masarro, D. W. (1998). Perceiving talking faces: From speech
tactile vocoders: Performance of normal-hearing and perception to a behavioral principle (pp. 415–443). Cam-
hearing-impaired subjects. Journal of the Acoustical Society bridge, MA: MIT Press.
of America, 90, 2971–2984. Massaro, D., Cohen, M., and Gesi, A. (1993). Long-term
Bernstein, L. E., Demorest, M. E., and Tucker, P. E. (2000). training, transfer and retention in learning to lipread. Per-
Speech perception without hearing. Perception and Psycho- ception and Psychophysics, 53, 549–562.
physics, 62, 233–252. O’Neill, J. J., and Oyer, H. J. (1961). Visual communication
Boothroyd, A. (1988). Linguistic factors in speechreading. In for the hard of hearing. Englewood Cli¤s, NJ: Prentice
C. L. De Filippo and D. G. Sims (Eds.), New reflections in Hall.
speechreading. Volta Review, 90(5), 77–87. Preminger, J. E., Lin, H.-B., Payen, M., and Levitt, H. (1998).
Calvert, G. A., Bullmore, E. T., Brammer, M. J., Campbell, Selective masking in speechreading. Journal of Speech,
R., Williams, S. C. R., McGuire, P. K., et al. (1997). Acti- Language, and Hearing Research, 41, 564–575.
vation of auditory cortex during silent lipreading. Science, Rayner, K. (1984). Visual selection in reading, picture per-
276, 593–596. ception, and visual search: A tutorial review. In H. Bouma
Erber, N. P. (1969). Interaction of audition and vision in the and D. G. Bouwhuis (Eds.), Attention and performance:
recognition of oral speech stimuli. Journal of Speech and X. Control of language processes (pp. 67–96). London:
Hearing Research, 12, 423–425. Erlbaum.
Summerfield, Q. (1992). Lipreading and audio-visual speech Grant, K. W., and Walden, B. E. (1996). Spectral distribution
perception. Philosophical Transactions of the Royal Society of prosodic information. Journal of Speech and Hearing
of London, 335, 71–78. Research, 39, 228–238.
Vatikiotis-Bateson, E., Eigsti, I.-M., Yano, S., and Munhall, Grant, K. W., Walden, B. E., Seitz, P. F. (1998). Auditory-
K. (1998). Eye movements of perceivers during audiovisual visual speech recognition by hearing-impaired subjects:
speech perception. Perception and Psychophysics, 60, 926– Consonant recognition, sentence recognition, and auditory-
940. visual integration. Journal of the Acoustical Society of
Walden, B., and Grant, D. (1993). Research needs in rehabili- America, 103, 2677–2690.
tative audiology. In J. G. Alpiner and P. A. McCarthy Kaplan, H. (1996). Speechreading. In M. J. Moseley and S. J.
(Eds.), Rehabilitative audiology: Children and adults (pp. Bally (Eds.), Communication therapy: An integrated
500–528). Baltimore: Williams and Wilkins. approach to aural rehabilitation. Washington, DC: Gallau-
Walden, B., Prosek, R., Montgomery, A., Scherr, C., and det University Press.
Jones, C. (1977). E¤ects of training on the visual recogni- Lansing, C. R., and Bievenue, L. A. (1994). Using intelligent
tion of consonants. Journal of Speech and Hearing Re- computer-based teaching systems to evaluate the e¤ective-
search, 20, 130–145. ness of consonant recognition training. Volta Review, 96,
Young, L. R., and Sheena, D. (1975). Survey of eye movement 41–49.
recording methods. Behavioral Research Methods and In- Lansing, C. R., and Helegeson, C. L. (1995). Priming the visual
strumentation, 7, 394–429. recognition of spoken words. Journal of Speech and Hearing
Research, 38, 1377–1386.
Further Readings Lesner, S., Sandridge, S., and Kricos, P. (1987). Training
influences on visual consonant and sentence recognition.
Bernstein, L. E., Auer, E. T., Jr., and Tucker, P. E. (2001). Ear and Hearing, 8, 283–287.
Enhanced speechreading in deaf adults: Can short-term Lidestam, B., Lyxell, B., and Andersson, G. (1999). Speech-
training/practice close the gap for hearing adults? Journal of reading: Cognitive predictors and displayed emotion. Scan-
Speech, Language, and Hearing Research, 44, 5–18. dinavian Audiology, 28, 211–217.
Bernstein, L. E., Demorest, M. E., and Tucker, P. E. (1998). MacSweeney, M., Campbell, R., Calvert, G. A., McGuire,
What makes a good speechreader? First you have to find P. K., David, A. S., Suckling, J., et al. (2000). Dispersed
one. In R. Campbell, B. Dodd, and D. Burnham (Eds.), activation in the left temporal cortex for speech-reading
Hearing by eye: II. Advances in the psychology of speech- in congenitally deaf people. Proceedings of the Royal
reading and audiovisual speech (pp. 221–227). London: Society of London. Series B, Biological Sciences, 268, 451–
Erlbaum. 457.
Bernstein, L. E., Eberhardt, S. P., and Demorest, M. E. (1989). Pichora-Fuller, M. K., and Benguerel, A. P. (1991). The design
Single-channel vibrotactile supplements to visual perception of CAST computer-aided speechreading training. Journal of
of intonation and stress. Journal of the Acoustical Society of Speech and Hearing Disorders, 34, 202–212.
America, 85, 397–405. Rönnberg, J. (1995). What makes a good speechreader? In
Binnie, C. A., Jackson, P. L., and Montgomery, A. A. (1976). G. Plant and K. Spens (Eds.), Profound deafness and speech
Visual intelligibility of consonants: A lipreading screening communication (pp. 393–417). London: Whurr.
test with implications for aural rehabilitation. Journal of Rosenblum, L. D., Johnson, J. A., and Saldaña, H. M. (1996).
Speech and Hearing Disorders, 41, 530–539. Point-light facial displays enhance comprehension of speech
Binnie, C. A., Montgomery, A., and Jackson, P. (1974). Audi- in noise. Journal of Speech and Hearing Research, 39, 1159–
tory and visual contributions to the perception of con- 1170.
sonants. Journal of Speech and Hearing Research, 17, 619– Samar, V. J., and Sims, D. G. (1983). Visual evoked-response
630. correlates of speechreading performance in normal-hearing
Calvert, G. A., Campbell, R., and Brammer, M. J. (2000). adults: A replication and factor analytic extension. Journal
Evidence from functional magnetic resonance imaging of of Speech and Hearing Research, 26, 2–9.
crossmodal binding in the human heteromodal cortex. Cur- Sims, D. G., Von Feldt, J., Dowaliby, F., Hutchinson, K., and
rent Biology, 10, 649–657. Meyers, T. (1979). A pilot experiment in computer assisted
Cherry, R., and Small, S. (1993). E¤ect of interactive video speechreading instruction utilizing the Data Analysis Video
versus non-interactive video training on speech recognition Interactive Device DAVID. American Annals of the Deaf,
by hearing-impaired adults. Volta Review, 95, 135–137. 124, 618–623.
De Filippo, C., and Sims, D. (Eds.). (1988). New reflections on Storke, D. G., and Hennecke, M. E. (Eds.). (1996). Speech-
speechreading. Volta Review, 90(5), 1–313. reading by humans and machines: Models, systems, and
Demorest, M. E., Bernstein, L. E., and Dehaven, G. P. (1996). applications (NATO ASI Series, Vol. 150, Series F: Com-
Generalizability of speeechreading performance on non- puter and systems sciences). Berlin: Springer-Verlag.
sense syllables, words and sentences: Subjects with normal Summerfield, Q. (1989). Visual perception of phonetic gestures.
hearing. Journal of Speech and Hearing Research, 39, 697– In J. G. Mattingly (Ed.), Modularity and the motor theory of
713. speech perception (pp. 117–137). Hillsdale, NJ: Erlbaum.
Erber, N. P. (1988). Communication therapy for hearing- Tye-Murray, N., Tyler, R. S., Bong, R., and Nares, T. (1988).
impaired adults. Abbotsford, Victoria, Australia: Clavis Computerized laser videodisc programs for training
Publishing. speechreading and assertive communication behaviors.
Gagné, J.-P., Dinon, D., and Parsons, J. (1991). An evaluation Journal of the Academy of Rehabilitative Audiology, 21,
of CAST: A computer-aided speechreading program. Jour- 143–152.
nal of Speech and Hearing Research, 34, 213–221. Walden, B., Erdman, S., Montgomery, A., Schwartz, D., and
Garstecki, D. C. (1981). Auditory-visual training paradigm for Prosek, R. (1981). Some e¤ects of training on speech rec-
hearing-impaired adults. Journal of the Academy of Reha- ognition by hearing impaired adults. Journal of Speech and
bilitative Audiology, 14, 223–238. Hearing Research, 24, 207–216.
Suprathreshold Speech Recognition tener would likely hear few, if any, conversational
speech sounds and would be expected to have very poor
speech recognition. The second listener has normal
Suprathreshold refers to speech presented above the au- hearing in the low frequencies falling to a mild hearing
ditory threshold of the listener. Speech recognition is loss in the high frequencies. This listener might hear
generally defined as the percentage of words or sentences some, but not all, of the speech sounds. The inability to
that can be accurately heard by the listener. For exam- hear high-frequency consonants would likely result in
ple, a patient who could correctly repeat 40 out of 50 less than 100% speech recognition for a conversational-
words presented would have 80% speech recognition. level signal.
Because speech is a complex and continually varying Most commonly, the material used to measure speech
signal requiring multiple auditory discrimination skills, it recognition is a list of monosyllabic words. Typically
is not possible to accurately predict an individual’s each word is preceded by a carrier phrase, such as ‘‘Say
speech recognition from the pure-tone audiogram (Mar- the word——.’’ Most available monosyllabic word lists
shall and Bacon, 1981). Measurement of suprathreshold are open set; that is, the listener is not restricted to a
speech recognition allows clinicians to assess a patient’s predetermined list of possible responses. A number of
speech communication ability in a controlled and sys- standard lists have been developed with vocabulary
tematic manner. The results can help clinicians distin- levels appropriate for adults (Hirsh et al., 1952; Tillman
guish between di¤erent causes of hearing loss and plan and Carhart, 1966) or children. The lists exclude words
and evaluate audiological rehabilitation programs. that would be unfamiliar to most people, and word se-
Speech is a complex acoustic signal that varies from lection is balanced to maintain similar levels of di‰culty
moment to moment: from shouting to whispering, from across lists. Additionally, each list is phonetically bal-
clear speech in quiet to di‰cult to understand speech in anced; that is, the sounds in the words occur in the same
high ambient noise. Figure 1 shows the expected fre- proportion as in everyday speech. Test sensitivity is
quency and intensity of speech sounds for speech spoken enhanced by using a larger number of items, such as 50
at a conversational level in quiet. In general, the vowel instead of 25 words (Thornton and Ra‰n, 1978).
sounds contain lower frequency information and are Sentence tests are also available. These tests are more
more intense, while consonant sounds contain higher like ‘‘real speech’’ and thus presumably able to provide a
frequency information and are produced at a lower closer estimate of real-life communication performance.
intensity level. Which sounds are audible depends on However, sentence tests incorporate additional factors
the listener’s audiometric thresholds as well as on the besides simple audibility. With sentences, the listener
speaker and the level at which the words are spoken. For may be relying on linguistic, prosodic, or contextual cues
example, the sounds of shouted speech would be shifted in addition to auditory information. To limit contextual
to a higher intensity level and have a slightly di¤erent cues, sentence lists are available that use neutral context
pattern across frequency (Olsen, 1998). Figure 1 also (Kalikow, Stevens, and Elliot, 1977) or linguistically
shows audiograms for two di¤erent listeners. The first meaningless word combinations (Speaks and Jerger,
listener has a moderately severe hearing loss. This lis- 1965). Sentences also place greater demands on higher-
level processes such as auditory memory, which may be
a particular problem in older listeners (Chmiel and
Jerger, 1996).
For standard clinical testing, the participant is seated
in a sound booth and listens to speech presented to one
ear at a time through earphones. Speech may also be
presented through a speaker, although this does not
provide information specific to each ear. Such sound
field testing can be used to quantify the e¤ects of ampli-
fication. Recorded speech materials are preferred for
consistency, although speech recognition tests are also
administered using monitored live voice, during which
the tester speaks to the participant over a microphone
while monitoring her vocal strength. The entire list of
words is presented at the same level. After each word,
the participant responds by repeating or writing the
Figure 1. Audiogram showing expected frequency and intensity word. The speech recognition score is then expressed as
of speech sounds. Illustrative hearing thresholds are also shown the percentage of correct words at the given presentation
for a listener with a moderately severe hearing loss (circles) and
level in each ear.
for a listener with normal hearing in the low frequencies falling
to a mild loss in the high frequencies (triangles). In each case, Although most often presented in quiet, these materi-
speech sounds falling below the hearing threshold (i.e., at als may also be administered in noise. Many recordings
higher intensity levels) are audible to the listener; speech include a background of multitalker babble that mimics
sounds falling above the hearing threshold (i.e., at lower in- a more realistic listening situation; this increases the de-
tensity levels) are inaudible. gree to which test results characterize the listener’s
Suprathreshold Speech Recognition 549
hearing listener. PI functions for listeners with retro-

cochlear loss may demonstrate disproportionately low
scores as well as a phenomenon called rollover, in which
performance first improves with increasing presentation
level, and then degrades as the presentation level con-
tinues to increase.
In the clinic, speech recognition testing is often done
at only one or two levels in each ear to minimize test
time. One common approach is to select one or more
levels relative to the speech reception threshold. Selec-
tion of the specific presentation level is generally based
on providing adequate speech audibility, particularly at
frequencies containing important consonant informa-
Figure 2. Representative performance-intensity functions, ex- tion. An alternative approach is to present speech at the
pressed as percent of words correct at each presentation level, level the listener deems most comfortable. Because the
for a listener with normal hearing and for three listeners with listener’s most comfortable level may not be the same
di¤erent types of hearing loss. level at which she obtains a maximum score, testing
exclusively at the most comfortable level can lead to er-
roneous conclusions about auditory function (Ullrich
everyday communication abilities. For example, listeners and Grimm, 1976; Beattie and Warren, 1982).
with sensorineural loss require a more favorable signal- In summary, measurement of suprathreshold speech
to-noise ratio than listeners with normal hearing or con- recognition is an important part of an audiometric ex-
ductive hearing loss (Dubno, Dirks, and Morgan, 1984). amination. Test results can be a¤ected by a number of
When administering and interpreting suprathreshold factors, including the participant’s pure-tone sensitivity,
speech recognition tests it is important to consider not the amount of distortion produced by the hearing loss,
only the test environment but also the physical, linguis- the presentation level of the speech, the type of speech
tic, cognitive, and intellectual abilities of the listener. If material, the presence or absence of background noise,
the listener is unable to respond verbally or in writing, and even the participant’s age. A detailed understanding
tests are available where the listener can choose among of these factors is important when interpreting test
a set of picture responses (Ross and Lerman, 1970). results and drawing conclusions about an individual’s
Although most often used with children, these tests are overall communication ability.
also appropriate for adults with spoken word deficits,
including dysarthria or apraxia. One limitation of such —Pamela E. Souza
closed-set tests is that the chance of guessing correctly is
higher when only a fixed number of choices is available. References
However, scoring accuracy may be higher than with Beattie, R. C., and Warren, V. G. (1982). Relationships among
open-set tests because there are fewer chances for mis- speech threshold, loudness discomfort, comfortable loud-
interpretation of the response. For listeners who are not ness, and PB max in the elderly hearing impaired. American
proficient in English, recorded materials are available in Journal of Otolaryngology, 3, 353–358.
a number of other languages (provided, of course, that Chmiel, R., and Jerger, J. (1996). Hearing aid use, central au-
the tester has su‰cient knowledge of the test language to ditory disorder, and hearing handicap in elderly persons.
interpret responses). Journal of the American Academy of Audiology, 7, 190–
An important consideration is the presentation level. 202.
If multiple levels are tested, the percentage correct in- Dubno, J. R., Dirks, D. D., and Morgan, D. E. (1984). E¤ects
of age and mild hearing loss on speech recognition in noise.
creases with increasing presentation level in a char-
Journal of the Acoustical Society of America, 76, 87–96.
acteristic pattern (Fig. 2). This is referred to as the Hirsh, I. J., Davis, H., Silverman, S. R., Reynolds, E. G.,
performance intensity (PI) function. The rate of im- Eldert, E., and Benson, R. W. (1952). Development of
provement depends on the test material as well as patient materials for speech audiometry. Journal of Speech and
characteristics. Easier material (e.g., sentences contain- Hearing Disorders, 17, 321–337.
ing contextual cues) results in a greater rate of improve- Kalikow, D. N., Stevens, K. N., and Elliot, L. L. (1977).
ment with increases in level than more di‰cult material Development of a test of speech intelligibility in noise using
(e.g., nonsense words). The presentation level at which sentence materials with controlled word predictability.
the listener achieves a highest score is referred to as the Journal of the Acoustical Society of America, 61, 1337–
PB max, or maximum score for phonetically balanced 1351.
Marshall, L., and Bacon, S. P. (1981). Prediction of speech
words. A normal-hearing listener typically achieves
discrimination scores from audiometric data. Journal of
100% speech recognition at levels 30–40 dB above the Speech and Hearing Research, 2, 148–155.
SRT. Sensorineural hearing loss may restrict the PB Olsen, W. O. (1998). Average speech levels and spectra in var-
max to below 100%. Listeners with conductive hearing ious speaking/listening conditions: A summary of the Pear-
loss generally achieve 100% recognition, although they son, Bennett, and Fidell (1977) report. American Journal of
require a higher presentation level than would a normal- Audiology, 7, 1–5.
Ross, M., and Lerman, J. (1970). Picture identification test for

hearing-impaired children. Journal of Speech and Hearing
Research, 13, 44–53.
Speaks, C., and Jerger, J. (1965). Performance-intensity char-
acteristics of synthetic sentences. Journal of Speech and
Thornton, A. R., and Ra‰n, M. J. M. (1978). Speech-
discrimination scores modeled as a binomial variable. Jour-
nal of Speech and Hearing Research, 21, 507–518.
Tillman, T. W., and Carhart, R. (1966). An expanded test for
speech discrimination utilizing CNC monosyllabic words:
Northwestern University Auditory Test No. 6 (USAF School
of Aerospace Medicine Technical Report). Brooks Air
Force Base, TX.
Ullrich, K., and Grimm, D. (1976). Most comfortable listening
level presentation versus maximum discrimination for word
discrimination material. Audiology, 15, 338–347.
Further Readings Figure 1. Temporal integration curves according to the func-

Dubno, J. R., and Dirks, D. D. (1993). Factors a¤ecting per- tions shown in the legend. In curves A1 and A2 , the time con-
formance on psychoacoustic and speech-recognition tasks stant t ¼ 300 ms; in A1 , exponent m ¼ 1; in A2 , m ¼ 0.8. In
in the presence of hearing loss. In G. A. Studebaker and B1 , t ¼ 300 ms; in B2 , t ¼ 100 ms. The value of t ¼ 300 ms is
I. Hochberg (Eds.), Acoustical factors a¤ecting hearing aid indicated by a mark on the abscissa.
performance (pp. 235–253). Needham Heights, MA: Allyn
and Bacon.
Hall, J. W., and Mueller, H. G. (1997). Speech audiometry. In
Audiologists’ desk reference. Vol. I. Diagnostic audiology:
TI has a time limit. For a stimulus longer than this limit,
Principles, procedures, and practices (pp. 113–174). San Di- the loudness, or the detection (threshold) level, remains
ego, CA: Singular Publishing Group. relatively constant.
Kirk, K. I., Pisoni, D. B., and Miyamoto, R. C. (1997). E¤ects Interest in studying TI is fueled by the need to under-
of stimulus variability on speech perception in listeners stand auditory processing of speech—a signal that, by its
with hearing impairment. Journal of Speech, Language, and nature, changes rapidly in time. Better understanding of
Hearing Research, 40, 1395–1405. the temporal characteristics of hearing should help us
Olsen, W. O., and Matkin, N. D. (1991). Speech audiometry. improve means for enhancement of speech communica-
In W. F. Rintelmann (Ed.), Hearing assessment (pp. 39– tion in unfavorable listening environments, and of lis-
140). Austin, TX: Pro-Ed. teners with impaired hearing.
Olsen, W. O., Van Tasell, D. J., and Speaks, C. E. (1997).
Phoneme and word recognition for words in isolation and
Graphs of the relationship between the stimulus du-
in sentences. Ear and Hearing, 18, 175–188. ration (plotted on the horizontal coordinate, usually in
Penrod, J. P. (1994). Speech threshold and recognition/ milliseconds with a logarithmic scale) and the intensity
discrimination testing. In J. Katz (Ed.), Handbook of level at the threshold of hearing (plotted on the vertical
clinical audiology (pp. 147–164). Baltimore: Williams and coordinate in decibels, dB) are called temporal inte-
Wilkins. gration curves (TICs). Examples of TICs are shown in
Stach, B. A., Davis-Thaxton, M., and Jerger, J. (1995). Figure 1. The detection intensity level first declines as
Improving the e‰ciency of speech audiometry: Computer- the stimulus duration increases and then, beyond a time
based approach. Journal of the American Academy of limit called the critical duration, remains constant. The
Audiology, 6, 330–333. magnitude of TI can be expressed by the di¤erence be-
Thibodeau, L. M. (2000). Speech audiometry. In R. J. Roeser,
M. Valente, and H. Hosford-Dunn (Eds.), Audiology diag-
tween the detection levels of long and short signals. The
nosis (pp. 281–310). New York: Thieme. rate of decline of TICs is represented by slopes of the
curves, which too are often used as indicators of TI
magnitude. These slopes (they are negative) are usually
expressed as the ratio of the change of level (in dB) per
Temporal Integration tenfold increase in signal duration [(L2–L1)=dec], or per
doubling of signal duration. The slopes of the TICs and
the values of the critical duration represent summary
The term temporal integration (TI) refers to summation characteristics of TI. (The critical duration depends on a
of stimulus intensity during the duration of the stim- time constant t, a parameter in formulas describing
ulus. As duration increases, a sensation like loudness in- TICs.)
creases, or the sound level at which the stimulus can be
detected decreases. The stimuli may be various types of Factors A¤ecting TI
signals, such as tones or bands of noise. Similarly, short
succeeding stimuli can combine their energies and pro- The slope of the curves and the time constant depend on
vide a lower detection level than individual stimuli. The various factors, such as signal frequency, status of hear-
Temporal Integration 551
ing, and type of signals. The e¤ect of signal frequency on trode cochlear implant were considerably less steep than
TI is pronounced (Gerken, Bhat, and Hutchison-Clutter, 8 dB/dec typically observed with acoustical stimula-
1990) and depends on signal duration. TI, as well as t, is tion. The slopes varied widely across subjects and across
greater at lower frequencies than at higher ones (e.g., stimulated electrodes. When Shannon and Otto (1990)
Fastl, 1976; Nábělek, 1978; Florentine, Fastl, and Buus, used a device called the auditory brainstem implant
1988). At low frequencies and signal durations below (ABI) and positioned its electrodes near the cochlear
10 ms, the TIC slopes were found to be up to 15 dB/ nucleus of listeners, they obtained only a shallow TIC
dec (e.g., Green, Birdsall, and Tanner, 1957). At fre- over the range of 2- to 1000-ms signal duration.
quencies between 1 and 8 kHz and at signal durations
between 20 and 100 ms, the slopes are between 10 and
8 dB/dec (e.g., Zwislocki, 1969; Gerken, Bhat, and
Models
Hutchison-Clutter, 1990). The steeper slopes at short A number of models for temporal integration have been
signal durations as compared to slopes at longer signal proposed. The theoretical foundations for the mathe-
durations are attributed to the loss of contribution of matical description of TI are either deterministic or
some energy due to spectral broadening, or ‘‘splatter.’’ probabilistic. Deterministic models include power func-
When the frequency during the signal is not constant but tion models or exponential function models. One of the
is increasing, the slope between 20 ms and 80 ms of sig- deterministic models is described mathematically by
nal duration is smaller, about 9 dB/dec, than when the the power function t(It Iy ) ¼ Iy t ¼ const (Hughes,
frequency is decreasing—about 13 dB/dec (Nábělek, 1946), or in its more general form by It m ¼ C (Green et
1978). For broadband masking conditions, the values of al., 1957). In these equations t is the stimulus duration, It
TI for constant tones are similar to those without mask- is the threshold intensity at t, Iy is the threshold inten-
ing, but some influence of the level of the masker was sity for very long stimuli, t is the time constant of the
observed. This influence depends on signal duration. For integration process, m is the power function exponent,
signal durations between 2 and 10 ms, the TICs at me- and C is a constant. The exponent m determines the
dium masker levels are steeper than at low or high slope of the curves (A1 and A2 in Fig. 1). The slope
masker levels, and for signal durations over 20 ms, the 3 dB/doubling or 10 dB/dec corresponds to m ¼ 1.
TI values are not a¤ected by the masker level (Oxenham, Another model is the exponential one It =Iy ¼ 1=(1
Moore, and Vickers, 1997). Formby et al. (1994) inves- et=t ) (Feldkeller and Oetinger, 1956; Plomp and
tigated the influence of bandwidth of a noise signal Bouman, 1959). The curves B1 and B2 in Figure 1 cor-
masked by an uncorrelated broadband noise on TI and respond to this equation.
t. They found that both TI and t were related inversely Zwislocki (1960) developed a temporal summation
to the bandwidth, if the bandwidth was greater than the theory for two pulses separated in time and proposed a
critical band of hearing (CB), and were relatively invar- theory of TI for loudness (Zwislocki, 1969). In his model
iant if the bandwidth was smaller than the critical band. it is assumed that (1) a linear temporal integrator (with a
For gated signal and masker, Formby et al. (1994) time constant on the order of 200 ms) exists in the cen-
identified at least three cues for signal detection: (1) a tral nervous system; (2) a nonlinear transformation that
relative timing cue, (2) a spectral shape cue, and (3) a produces compression precedes the temporal summa-
traditional energy cue. The timing and spectral shape tion; and (3) neural excitation decreases exponentially
cues count most for the shortest (10 ms) and narrowest with a short time constant at the input to the integrator
(bandwidth ¼ 63 Hz) signals, respectively. When the that summates the central neural activity. (This last as-
signal is a series of tone pulses, and not single bursts, the sumption indicates that the term temporal integration
change of time interval between the pulses produces should not be interpreted as the integration of acoustic
smaller change of TI than the change in duration of energy per se.)
single bursts (Carlyon, Buus, and Florentine, 1990). Attempts to resolve an apparent discrepancy between
Listeners with hearing impairment generally show high temporal resolution of hearing and long time con-
less temporal integration than listeners with normal stants of temporal integration have led to a number of
hearing (e.g., Watson and Gengel, 1969; Gerken, Bhat, models employing short integration times (e.g., Penner,
and Hutchison-Clutter, 1990). No e¤ect of level of a 1978; Oxenham, Moore, and Vickers, 1997). Viemeister
broadband masker was found for listeners with impaired and Wakefield (1991) have not considered this discrep-
hearing at any signal duration. ancy to be a real problem. Their model is based on a
Loudness increases when the duration of a short sig- statistical probability approach and assumes multiple
nal increases. When the signal level changes, the TI for sampling. Taking their own data into account, Vie-
loudness changes; however, the change is not monotonic meister and Wakefield conclude that power integration
(Buus, Florentine, and Poulsen, 1999). The change occurs only for pulses separated in time by less than
is greatest at moderate sensation levels and depends about 5 ms, and that therefore their data are inconsistent
on signal duration. The e¤ect of signal level on TI with the classical view of TI involving long-term inte-
of loudness is greater at short than at long signal dura- gration. However, they find the data to be compatible
tions. with the notion that the input is sampled at a fairly
Donaldson, Viemeister, and Nelson (1997) found that high rate and the obtained samples (or ‘‘looks’’)
TICs for electrical stimulation with the Nucleus-22 elec- are stored in memory; while in the memory, the ‘‘looks’’
can be selectively accessed, weighted, and otherwise Fastl, H. (1976). Influence of test tone duration on auditory
processed. masking patterns. Audiology, 15, 63–71.
Dau, Kollmeier, and Kohlrausch (1997) proposed a Feldkeller, R., and Oetinger, R. (1956). Die Horbarkeits-
multichannel model. They describe the e¤ects of spectral grenzen von Impulsen verschiedener Dauer. Acustica, 6,
481–493.
and temporal integration in amplitude-modulation de-
Florentine, M., Fastl, H., and Buus, S. (1988). Temporal inte-
tection for a stochastic noise carrier. The model is based gration in normal hearing, cochlear impairment, and im-
on the concept of a modulation filter-bank. To integrate pairment simulated by masking. Journal of the Acoustical
information across frequency, the detection process in Society of America, 84, 195–203.
the model combines cues from all filters with an optimal Formby, C., Heinz, M. G., Luna, C. E., and Shaheen, M. K.
decision statistic. To integrate information across time, a (1994). Masked detection thresholds and temporal integra-
‘‘multiple-look’’ strategy, similar to that proposed by tion for noise band signals. Journal of the Acoustical Society
Viemeister and Wakefield (1991), is realized within the of America, 96, 102–114.
detection stage of the model. The temporal integration Gerken, G. M., Bhat, V. K., and Hutchison-Clutter, M.
involves a template that provides the basis for the opti- (1990). Auditory temporal integration and the power func-
tion model. Journal of the Acoustical Society of America, 88,
mal detector of the model. The length and the time con-
767–778.
stant of the template are variable: they change according Green, D. M., Birdsall, T. G., and Tanner, W. P., Jr. (1957).
to the task which the listener has to perform. Signal detection as a function of signal intensity and dura-
Although an extensive knowledge of temporal inte- tion. Journal of the Acoustical Society of America, 29, 523–
gration has been attained, many aspects of TI await 531.
further investigation. For example, evidence of some TI Haig, A. R., Gordon, E., De-Pascalis, V., Meares, R. A.,
mechanism at a higher stage of the auditory pathway Bahramali, H., and Harris, A. (2000). Gamma activity in
was found by Uppenkamp, Fobel, and Patterson (2001) schizophrenia: Evidence of impaired network binding?
when they compared the perception of short-frequency Clinical Neurophysiology, 111, 1461–1468.
sweeps and the physiological response to them in the Hughes, J. W. (1946). The threshold of audition for short
periods of stimulation. Proceedings of the Royal Society of
brainstem. The improved understanding of TI should
London. Series B: Biological Sciences, B133, 486–490.
provide a sounder basis for the development of means Michie, P. T. (2001). What has MMN revealed about the au-
for securing better speech communication in general and ditory system in schizophrenia? International Journal of
for listeners with special problems, like those with coch- Psychophysiology, 42, 177–194.
lear implants, in particular. Presently, TI studies are not Nábělek, I. V. (1978). Temporal summation of constant and
limited to traditional topics but also cover higher levels gliding tones at masked auditory threshold. Journal of the
of the brain, like the role of TI in establishing neural Acoustical Society of America, 64, 751–763.
representations of phonemes (Tallal et al., 1998), and Oxenham, A. J., Moore, B. C., and Vickers, D. A. (1997).
investigation of an association between a deficient TI Short-term temporal integration: Evidence for the influence
and mental disturbances in schizophrenia (Haig et al., of peripheral compression. Journal of the Acoustical Society
of America, 101, 3676–3687.
2000; Michie, 2001).
Penner, M. J. (1978). A power law transformation resulting in
a class of short-term integrators that produce time-intensity
Acknowledgments trades for noise bursts. Journal of the Acoustical Society of
Many thanks to Assoc. Prof. Lana S. Dixon for her help America, 63, 195–201.
in securing pertinent references. Plomp, R., and Bouman, M. A. (1959). Relation between
See also clinical decision analysis; cochlear hearing threshold and duration for tone pulses. Journal of
the Acoustical Society of America, 31, 749–758.
implants; masking; temporal resolution.
Shannon, R. V., and Otto, S. R. (1990). Psychophysical mea-
—Igor V. Nábělek sures from electrical stimulation of the human cochlear nu-
cleus. Hearing Research, 47, 159–168.
References Tallal, P., Merzenich, M. M., Miller, S., and Jenkins, W.
(1998). Language learning impairments: Integrating basic
Buus, S., Florentine, M., and Poulsen, T. (1999). Temporal science, technology, and remediation. Experimental Brain
integration of loudness in listeners with hearing losses of Research, 123, 210–219.
primarily cochlear origin. Journal of the Acoustical Society Uppenkamp, S., Fobel, S., and Patterson, R. D. (2001). The
of America, 105, 3464–3480. e¤ects of temporal asymmetry on the detection and percep-
Carlyon, R. P., Buus, S., and Florentine, M. (1990). Temporal tion of short chirps. Hearing Research, 158, 71–83.
integration of trains of tone pulses by normal and by coch- Viemeister, N. F., and Wakefield, G. H. (1991). Temporal in-
learly impaired listeners. Journal of the Acoustical Society of tegration and multiple looks. Journal of the Acoustical So-
America, 87, 260–268. ciety of America, 90, 858–865.
Donaldson, G. S., Viemeister, N. F., and Nelson, D. A. (1997). Watson, C. S., and Gengel, R. W. (1969). Signal duration and
Psychometric functions and temporal integration in electric signal frequency in relation to auditory sensitivity. Journal
hearing. Journal of the Acoustical Society of America, 101, of the Acoustical Society of America, 46, 989–997.
3706–3721. Zwislocki, J. J. (1960). Theory of temporal auditory summation.
Dau, T., Kollmeier, B., and Kohlrausch, A. (1997). Modeling Journal of the Acoustical Society of America, 32, 1046–1060.
auditory processing of amplitude modulation: II. Spectral Zwislocki, J. J. (1969). Temporal summation of loudness: An
and temporal integration. Journal of the Acoustical Society analysis. Journal of the Acoustical Society of America, 46,
of America, 102, 2906–2919. 431–441.
Temporal Resolution 553
Further Readings tion have focused on intensity variations in an attempt to

separate purely temporal from spectro-temporal resolv-
Algom, D., Rubin, A., and Cohen-Raz, L. (1989). Binaural ing capabilities (see auditory scene analysis). Tempo-
and temporal integration of the loudness of tones and
ral resolution is limited by auditory inertia resulting
noises. Perception and Psychophysics, 46, 155–166.
Bacon, S. P., Hicks, M. L., and Johnson, K. L. (2000). Tem- from mechanical and/or electrophysiological transduc-
poral integration in the presence of o¤-frequency maskers. tion processes. Such a limitation e¤ectively smoothes or
Journal of the Acoustical Society of America, 107, 922–932. attenuates the intensive changes of a stimulus, which
Buus, S. (1999). Temporal integration and multiple looks, reduces the salience of those changes. Impaired tem-
revisited: Weights as a function of time. Journal of the poral resolution may be conceptualized as an increase
Acoustical Society of America, 105, 2466–2475. in this smoothing process, and thus a loss of temporal
Cacace, A. T., Margolis, R. H., and Relkin, E. M. (1991). information.
Threshold and suprathreshold temporal integration e¤ects The influence of hearing impairment on temporal
in the crossed and uncrossed human acoustic stapedius re- resolution depends on the site of lesion. For example,
flex. Journal of the Acoustical Society of America, 89, 1255–
1261.
conductive hearing loss is often modeled as a simple
Csepe, V., Pantev, C., Hoke, M., Ross, B., and Hampson, S. attenuation characteristic and thus should not alter
(1997). Mismatch field to tone pairs: Neuromagnetic evi- temporal resolution, given su‰cient stimulus levels.
dence for temporal integration at the sensory level. Electro- Damage at the level of the cochlea, however, involves
encephalography and Clinical Neurophysiology, 104, 1–9. more than attenuation. Reduced outer hair cell function
Fu, Q. J., and Shannon, R. V. (2000). E¤ect of stimulation rate is associated with a reduction in sensitivity, frequency
on phoneme recognition by Nucleus-22 cochlear implant selectivity, and compression at the level of the basilar
listeners. Journal of the Acoustical Society of America, 107, membrane. Each of these might influence the percep-
589–597. tion of intensity changes. For example, a loss of basilar
Garner, W. R., and Miller, G. A. (1947). The masked thresh- membrane compression might provide a more salient
old of pure tones as a function of duration. Journal of Ex-
perimental Psychology, 37, 293.
representation of intensity changes and thus lead to im-
Hall, J. W., and Fernandes, M. A. (1983). Temporal integra- proved performance on temporal resolution tasks in-
tion, frequency resolution, and o¤-frequency listening in volving such changes. Reduced frequency selectivity is
normal-hearing and cochlear-impaired listeners. Journal of analogous to broadening of a filter characteristic, which
the Acoustical Society of America, 74, 1172–1177. is associated with a shorter temporal response. This too
Kollmeier, B. (Ed.). (1995). Psychoacoustics, speech, and hear- might lead to improved temporal resolution. A loss of
ing aids. Proceedings of the summer school and interna- inner hair cell function, however, would reduce the
tional symposium, Bad Zwischenahn, Germany, 31 Aug.–5 quality and amount of information transmitted to the
Sept. 1995. World Scientific Singapore, 1996. central auditory pathway, and might therefore lead to
Moore, B. C. J. (1997). An introduction to the psychology of poor coding of temporal features. The altered neural
hearing (4th ed.). London: Academic Press.
Oxenham, A. J. (1998). Temporal integration at 6 kHz as a
function associated with a retrocochlear lesion may also
function of masker bandwidth. Journal of the Acoustical lead to a less faithful representation of the temporal fea-
Society of America, 103, 1033–1042. tures of a sound.
Sheft, S., and Yost, W. A. (1990). Temporal integration in Numerous techniques have been used to probe
amplitude modulation detection. Journal of the Acoustical temporal resolution abilities; however, the two most
Society of America, 88, 796–805. common techniques are temporal gap detection and
Yost, W. A. (1991). Fundamentals of hearing: An introduction. amplitude modulation detection (Fig. 1). Following the
San Diego, CA: Academic Press.
Temporal Resolution
Sensory systems function as change detectors in many

respects. They quickly adapt to steady-state stimuli and
are easily excited by the introduction of novel stimuli.
The pattern of changes in an acoustic stimulus conveys
information about the nature of the sound source and
the message being transmitted by the sender. Therefore
the identification, discrimination, and interpretation of
acoustic events depend on the ability of the auditory Figure 1. Schematic diagram of a two-interval, forced-choice
system to faithfully encode the temporal features of psychophysical paradigm used to estimate gap detection
thresholds (top row) and sinusoidal amplitude modulation
those events. This ability to respond to changes in an (SAM) detection thresholds (bottom row). Stimulus waveforms
acoustic stimulus has been termed temporal resolution. are shown for each of two observation intervals. A broadband
Although most natural acoustic signals are characterized noise standard is shown in interval 1 and a noise with a tem-
by changes in intensity as well as changes in the acoustic poral gap (64 ms) or amplitude modulation (6 dB) is shown in
spectrum over time, investigations of temporal resolu- interval 2. Correct and incorrect responses are listed.
notion of auditory inertia, Plomp (1964) investigated tensity resolution for sinusoids (Glasberg and Moore,
the rate of decay of auditory sensation by measuring the 1989) and gap detection for band-limited noise is corre-
minimum detectable silent interval between two broad- lated with intensity resolution for band-limited noise
band noise pulses as a function of the relative level of the (Eddins and Manegold, 2001). This highlights the po-
two pulses. When the pulses surrounding the gap were tential role of intensity resolution in a task such as gap
equal in level, the minimum detectable gap was about detection. Poor gap detection thresholds may result from
3 ms. Gap detection thresholds deteriorate as stimulus poor intensity resolution, poor temporal resolution, or a
level falls below about 30 dB sensation level (e.g., combination of the two. Listeners with cochlear implants
Plomp, 1964; Penner, 1977; Buus and Florentine, 1983; o¤er a unique perspective on temporal resolution in that
Florentine and Buus, 1984). Thus, reduced audibility the auditory periphery, save for the auditory nerve, is
associated with hearing loss may result in longer than bypassed. Gap detection in such listeners, using electrical
normal gap detection thresholds. For patients with con- stimulation via the implant, is as good as gap detection
ductive or sensorineural hearing loss, gap detection for listeners with normal hearing using acoustic stimula-
thresholds for broadband noise are longer than normal tion (e.g., Shannon, 1989; Moore and Glasberg, 1988).
at low stimulus levels. At higher stimulus levels, gap This is consistent with the notion that gap detection may
thresholds are within normal limits for conductive loss not be strongly dependent upon cochlear processes.
but remain longer than normal for listeners with sensori- While gap detection thresholds may be strongly in-
neural hearing loss (Irwin, Hinchcli¤, and Kemp, 1981). fluenced by a listener’s intensity resolution, the am-
To gauge temporal resolution in di¤erent frequency plitude modulation detection paradigm provides an
regions, one may measure gap detection thresholds using opportunity to separate the a¤ects of intensity resolution
band-limited noise. Results from listeners with normal from temporal resolution. A modulation detection
hearing reveal that gap thresholds improve with increas- threshold is obtained by determining the minimum depth
ing stimulus level up to about 30 dB sensation level (e.g., of modulation necessary to discriminate an unmodulated
Buus and Florentine, 1983) and improve with increasing from a sinusoidally amplitude-modulated stimulus. With
noise bandwidth (e.g., Shailer and Moore, 1983; Eddins, this technique, temporal resolution can be more com-
Hall, and Grose, 1992), but vary little with frequency pletely described as the change in modulation threshold
region when noise bandwidth (in Hz) is held constant over a range of fluctuation rates (modulation frequen-
(e.g., Eddins et al., 1992). With hearing loss of cochlear cies). With the assumption that intensity resolution does
origin, gap detection is often worse than normal using not vary with modulation frequency, a separate index of
band-limited noise (e.g., Fitzgibbons and Wightman, intensity resolution may be obtained from modulation
1982; Fitzgibbons and Gordon-Salant, 1987); however, detection thresholds at very low modulation frequencies.
this is not true for all listeners with cochlear hearing loss If loudness recruitment associated with cochlear hearing
(e.g., Florentine and Buus, 1984; Glasberg and Moore, loss has a negative influence on gap detection in narrow-
1989; Grose, Eddins, and Hall, 1989). Thus, cochlear band noise, as suggested above, then one might predict
hearing loss does not necessarily result in poorer than that recruitment would enhance the perception of fluc-
normal temporal resolution. tuations introduced by amplitude modulation. Using
Temporal gap detection thresholds measured for broadband noise carriers, this does not seem to be the
sinusoidal stimuli do not vary substantially with stimulus case. Modulation detection thresholds for listeners with
frequency from 400 to 2000 Hz, but increase substan- cochlear hearing loss may be normal or worse than nor-
tially at and below 200 Hz (e.g., Shailer and Moore, mal, but are not better than normal (Bacon and Vie-
1987; Moore, Peters, and Glasberg, 1992). Although lis- meister, 1985; Bacon and Gleitman, 1992). Similarly,
teners with hearing impairment may have worse than modulation detection using band-limited noise is not
normal gap detection thresholds for noise stimuli, gap worse than normal in listeners with cochlear hearing
detection thresholds for tonal stimuli are normal when loss (e.g., Moore, Shailer, and Schooneveldt, 1992; Hall
compared at equivalent sound pressure levels and are et al., 1998). Modulation detection with tonal carriers,
better than normal at equal sensation levels (Moore and however, tends to be better than normal in listeners with
Glasberg, 1988). One theory consistent with these results cochlear hearing loss, and the perceived depth of modu-
is that gap detection is limited to some extent by the in- lation appears to be related to the steepness of loudness
herent fluctuations in narrow-band noise, and this e¤ect growth (Moore, Wojtczak, and Vickers, 1996; Moore
may be accentuated by the loudness recruitment of some and Glasberg, 2001). This is quite di¤erent from ampli-
hearing-impaired listeners (e.g., Moore and Glasberg, tude-modulated noise stimuli, for which threshold does
1988). Sinusoids, having a smooth temporal envelope, not seem to be related to loudness growth (Hall et al.,
would not be subject to such a limitation. This leads to 1998). As in the gap detection paradigm, there are
the possibility that temporal resolution per se may not be marked di¤erences between the results obtained with
adversely a¤ected by cochlear hearing loss (e.g., Moore noise and tonal stimuli. Thus, it is possible that the rela-
and Glasberg, 1988). If gap detection is influenced by tion between loudness growth and intensive changes is
loudness recruitment, then one would expect a rela- di¤erent for sinusoidal and noise stimuli.
tionship between gap detection and intensity resolution. Some listeners with cochlear pathology have worse
Indeed, gap detection for sinusoids is correlated with in- than normal modulation detection using noise carriers,
Temporal Resolution 555
as do listeners with Ménière’s disease (Formby, 1987), Formby, C. (1987). Modulation threshold functions for chron-
eighth nerve tumors (Formby, 1986), and auditory ically impaired Ménière’s patients. Audiology, 26, 89–102.
neuropathy (Zeng et al., 1999). Interestingly, listeners Glasberg, B. R., and Moore, B. C. (1989). Psychoacoustic
with cochlear implants perform as well as normal- abilities of subjects with unilateral and bilateral cochlear
hearing impairments and their relationship to the ability to
hearing subjects on amplitude-modulation tasks (Shan-
understand speech. Scandinavian Audiology, Supplement,
non, 1992). 32, 1–25.
In summary, listeners with abnormal cochlear func- Grose, J. H., Eddins, D. A., and Hall, J. W. (1989). Gap de-
tion often exhibit reduced performance on gap and tection as a function of stimulus bandwidth with fixed high-
modulation detection tasks with noise but not sinusoidal frequency cuto¤ in normal-hearing and hearing-impaired
stimuli. Studies of temporal resolution using other ex- listeners. Journal of the Acoustical Society of America, 86,
perimental techniques have yielded results that are gen- 1747–1755.
erally consistent with those discussed here. These results Hall, J. W. III, and Grose, J. H. (1997). The relation between
are consistent with an interpretation that cochlear pa- gap detection, loudness, and loudness growth in noise-
thology may not lead to reduced temporal resolution per masked normal-hearing listeners. Journal of the Acoustical
Society of America, 101, 1044–1049.
se, but may lead to di‰culty perceiving stimuli with
Hall, J. W. III, Grose, J. H., Buss, E., and Hatch, D. R. (1998).
pronounced, random intensity fluctuations (Grose et al., Temporal analysis and stimulus fluctuation in listeners with
1989; Hall and Grose, 1997; Hall et al., 1998). Although normal and impaired hearing. Journal of Speech, Language,
many hearing-impaired listeners perform as well as and Hearing Research, 41, 340–354.
normal-hearing listeners on tasks involving temporal Irwin, R. J., Hinchcli¤, L. K., and Kemp, S. (1981). Temporal
resolution, especially when stimuli are presented at acuity in normal and hearing-impaired listeners. Audiology,
optimal levels and have relatively smooth temporal 20, 234–243.
envelopes, such listeners are likely to have di‰culty Moore, B. C., and Glasberg, C. J. (1988). Gap detection with
in natural listening environments with fluctuating sinusoids and noise in normal, impaired, and electrically
backgrounds. Thus, measures of gap and amplitude stimulated ears. Journal of the Acoustical Society of Amer-
ica, 83, 1093–1101.
modulation detection using noise stimuli might have
Moore, B. C., and Glasberg, C. J. (2001). Temporal modula-
promise as predictors of communication di‰culty in re- tion transfer functions obtained using sinusoidal carriers
alistic environments. with normally hearing and hearing-impaired listeners.
—David A. Eddins Journal of the Acoustical Society of America, 110, 1067–
1073.
References Moore, B. C., Peters, R. W., and Glasberg, B. R. (1992). De-
tection of temporal gaps in sinusoids by elderly subjects
Bacon, S. P., and Gleitman, R. M. (1992). Modulation detec- with and without hearing loss. Journal of the Acoustical
tion in subjects with relatively flat hearing losses. Journal of Society of America, 92, 1923–1932.
Speech and Hearing Research, 35, 642–653. Moore, B. C., Shailer, M. J., and Schooneveldt, G. P. (1992).
Bacon, S. P., and Viemeister, N. F. (1985). Temporal modula- Temporal modulation transfer functions for bandlimited
tion transfer functions in normal-hearing and hearing- noise in subjects with cochlear hearing loss. British Journal
impaired listeners. Audiology, 24, 117–134. of Audiology, 26, 229–237.
Buus, S., and Florentine, M. (1983). Gap detection in normal Moore, B. C., Wojtczak, J. M., and Vickers, D. A. (1996). Ef-
and impaired listeners: The e¤ect of level, and frequency. In fect of loudness recruitment on the perception of amplitude
A. Michelson (Ed.), Time resolution in auditory systems (pp. modulation. Journal of the Acoustical Society of America,
159–179). New York: Springer-Verlag. 100, 481–489.
Eddins, D. A., Hall, J. W., and Grose, J. H. (1992). The de- Penner, M. J. (1977). Detection of temporal gaps in noise as a
tection of temporal gaps as a function of frequency region measure of the decay of auditory sensation. Journal of the
and absolute noise bandwidth. Journal of the Acoustical Acoustical Society of America, 61, 552–557.
Society of America, 91, 1069–1077. Plomp, R. (1964). Rate of decay of auditory sensation. Journal
Eddins, D. A., and Manegold, R. A. (2001). Spectral integra- of the Acoustical Society of America, 36, 277–282.
tion in the detection of temporal gaps, amplitude modulation, Shailer, M. J., and Moore, B. C. (1983). Gap detection as a
increments, and decrements in bandlimited noise. Unpub- function of frequency, bandwidth, and level. Journal of the
lished manuscript. Acoustical Society of America, 74, 467–473.
Fitzgibbons, P. J., and Gordan-Salant, S. (1987). Temporal Shailer, M. J., and Moore, B. C. (1987). Gap detection and
gap resolution in listeners with high-frequency sensorineural the auditory filter: Phase e¤ects using sinusoidal stimuli.
hearing loss. Journal of the Acoustical Society of America, Journal of the Acoustical Society of America, 81, 1110–
81, 133–137. 1117.
Fitzgibbons, P. J., and Wightman, F. L. (1982). Gap detection Shannon, R. V. (1989). Detection of gaps in sinusoids and
in normal and hearing-impaired listeners. Journal of the pulse trains by patients with cochlear implants. Journal of
Acoustical Society of America, 72, 761–765. the Acoustical Society of America, 85, 2587–2592.
Florentine, M., and Buus, S. (1984). Temporal gap detection in Shannon, R. V. (1992). Temporal modulation transfer func-
sensorineural and simulated hearing impairments. Journal tions in patients with cochlear implants. Journal of the
of Speech and Hearing Research, 27, 449–455. Acoustical Society of America, 91, 2156–2164.
Formby, C. (1986). Modulation detection by patients with Zeng, F. G., Oba, S., Garde, S., Sininger, Y., and Starr, A.
eighth-nerve tumors. Journal of Speech and Hearing Re- (1999). Temporal and speech processing deficits in auditory
search, 29, 413–419. neuropathy. NeuroReport, 10, 3429–3435.
Further Readings but in its severe form it is far more common in adults
and the elderly (Davis and El Rafaie, 2000).
Derleth, R. P., Dau, T., and Kollmeier, B. (2001). Modeling What distinguishes mild from severe tinnitus is not
temporal resolution and compressive properties of the nor-
established, apart from variations in subjective ratings of
mal and impaired auditory system. Hearing Research, 159,
132–149. intrusiveness and loudness. In particular, in attempts to
Eddins, D. A., and Green, D. M. (1995). Temporal integration determine the handicap caused by tinnitus, it has not
and temporal resolution. In B. C. J. Moore (Ed.), Hearing been possible to make the determination using aspects of
(pp. 207–242). San Diego, CA: Academic Press. the tinnitus itself (e.g., loudness, character, etc.). More-
Glasberg, B. R., and Moore, B. C. J. (1992). E¤ects of enve- over, tinnitus has been notoriously di‰cult to measure
lope fluctuation on gap detection. Hearing Research, 64, objectively. However, psychological complaints are of
81–92. major importance in determining the severity of tinnitus
Moore, B. C. J. (1998). Cochlear hearing loss. London: Whurr. (Andersson, 2001). In line with the di‰culties associated
Plack, C. J., and Moore, B. C. J. (1991). Decrement detection with measuring tinnitus, no consensus has been reached
in normal and impaired ears. Journal of the Acoustical So-
ciety of America, 90, 3069–3076.
regarding its classification, and several schemes have
been proposed. Structured interviews and validated self-
report questionnaires are helpful when describing tinni-
tus.
Among the most influential theories on why tinnitus
Tinnitus causes annoyance is Hallam’s psychological model of
tinnitus (Hallam, Rachman, and Hinchcli¤e, 1984) and
Tinnitus is an auditory perceptual phenomenon that is Jastrebo¤ ’s (1990) neurophysiological model. The latter
defined as the conscious perception of internal noises model puts less emphasis on conscious mechanisms in-
without any outer auditory stimulation. Tinnitus may volved in tinnitus perception. Basically, Jastrebo¤ pres-
occur as a concomitant of practically all the dysfunctions ents a conditioning model in which the tinnitus signal is
that involve the human auditory system. Hence, dam- conditioned to aversive reactions such as anxiety and
age to the middle ear, the cochlea, cranial nerve VIII fear.
(audiovestibular), and pathways in the brain from coch-
lear nucleus to primary auditory cortex all are likely
Audiological Characteristics
candidates for explaining why tinnitus appears (Levine, Measurement of tinnitus involves subjective report and a
2001). A common distinction is between so-called objec- history of its features, such as loudness, character, fluc-
tive tinnitus (somatosounds) and subjective tinnitus. In tuations, and severity. Patients have described tinnitus
clinical settings, objective tinnitus represents a minority as tones, buzzing noises, and mixtures of buzzing and
of cases. Examples of conditions related to objective ringing. More complicated descriptions include metallic
tinnitus are spontaneous otoacoustic emissions, tensor sounds and multiple tones of varying frequencies.
tympani syndrome, and vascular lesions. Subjective tin- Since the 1930s, tinnitus has been measured by asking
nitus has been linked to sensorineural hearing loss the patient to compare the tinnitus with an external tone
caused by various deficits such as age-related hearing or combinations of tones. Tinnitus loudness can be pre-
loss (presbyacusis), noise exposure, acoustic neuroma, sented at hearing level (HL) or sensation level (SL), the
and Ménière’s disease (Levine, 2001), but also to other latter being the level of tinnitus above hearing threshold.
conditions such as temporomandibular joint dysfunc- Further, tinnitus loudness can be matched using the tin-
tion. Di¤erent neural mechanisms have been proposed, nitus frequency (for which hearing is often impaired) or
and tinnitus has been explained as the result of increased another frequency where hearing is normal (Henry and
neural activity in the form of increased burst firing, or as Meikle, 2000). Contralateral versus ipsilateral matching
a result of pathological synchronization of neural activ- is another choice. Determining the minimal masking
ity. Other suggested mechanisms are hypersensitivity and level is a way to quantify the intrusiveness of tinnitus by
cortical reorganization (Rauschecker, 1999). determining how loud a sound needs to be to mask the
tinnitus (Henry and Meikle, 2000). Pioneering work by
Prevalence and Categorization Feldmann (1971) revealed that tinnitus patients could
be categorized according to how tones of di¤erent fre-
Tinnitus is commonly a temporary sensation, and most
quencies masked the tinnitus (so-called masking curves).
people have experienced it. However, it may develop For example, one type of tinnitus was equally masked by
into a chronic condition that resists medical or surgical
tones of low and high frequency, whereas another type
treatment. Prevalence figures vary slightly, but at least
of tinnitus was more easily masked by low-frequency
10%–15% of the general population can be expected to
tones. Tinnitus can often be masked by white noise, at
have tinnitus. A large majority do not have severe tinni-
least temporarily (Henry and Meikle, 2000).
tus. Findings from epidemiological studies suggest that
about 1%–3% of the adult population has severe tinni-
Emotional and Cognitive Disturbances
tus, in the sense that it causes marked disruption of
everyday activities, mood changes, and often disrupted Tinnitus patients often report di‰culties with concen-
sleep patterns. Tinnitus has been reported in children, tration, such as during reading. Until recently, few
Tinnitus 557
attempts were made to measure tinnitus patients’ per- (e.g., nerve compression). However, when surgery is
formance on tests of cognitive functioning, but pre- called for, the e¤ects on tinnitus have been unclear. In
liminary results corroborate the self-report findings some patients tinnitus disappears, but in others it re-
(Andersson et al., 2000). mains unchanged or becomes worse (Hazell, 1990).
In its severe form, tinnitus is often associated with Alternative medicine approaches (such as Gingko
lowered mood and depression. There are only a few biloba and acupuncture) either have not been tested or,
studies that have endorsed structured psychiatric in- when trials have been conducted, have yielded disap-
terviews, and most of the available data are based pointing results (Davies, 2001).
on questionnaires. Suicide related to tinnitus is rare. The e¤ects of electrical stimulation have been inves-
Most cases reported had associated comorbid psychiat- tigated in two forms. The first is application of electric
ric disturbances (Lewis, Stephens, and McKenna, 1994). current via transcutaneous nerve stimulation, and the
Anxiety, and in particular anxious preoccupation with second is through cochlear implantation, in which elec-
somatic sensations, is an important aggravating factor trodes are inserted into the cochlea. The latter approach
related to distress caused by tinnitus (Newman, Whar- is most interesting, as cases have been reported in which
ton, and Jacobson, 1997). Stress is often mentioned as a tinnitus disappears while the implant is on and returns
negative factor for tinnitus, in particular the stress of when it is turned o¤ (Dauman, 2000).
major adverse life events. Finally, sleep problems are a There is a long history of attempts to treat tinnitus via
significant component of tinnitus patients’ complaints maskers and, more recently, white noise generators.
(McKenna, 2000). These are basically hearing aid-like devices that emit
noise of broadband or narrow-band character. Unfortu-
Tinnitus and the Brain nately, there are few controlled trials on the use of
masking devices or white noise generators. The studies
Researchers and clinicians have long suspected that tin- that do exist do not support the e‰cacy of masking, but
nitus involves certain areas of the brain, particularly clinical experience suggests that they help some people
those that subserve the perception and amplify the (Vernon and Meikle, 2000).
experience. Studies have been conducted on tinnitus More recently a treatment method called tinnitus
patients’ reaction times, brainstem audiometry, evoked retraining therapy (TRT) has been developed. TRT has
potentials, and magnetoencephalography. Tinnitus has two parts, one consisting of counseling in a directive
recently been studied using single photon emission com- format and the other part providing ‘‘sound enrichment’’
puted tomography, positron emission tomography (e.g., using white noise generators set at a level that does not
Mirz et al., 1999), and functional magnetic resonance cover tinnitus (Jastrebo¤ and Jastrebo¤, 2000).
imaging. Findings from brain imaging studies suggest Among the treatments aimed at reducing distress,
that tinnitus can be objectively measured, but are not cognitive-behavioral therapy (CBT) is the most re-
consistent. However, it is clear that tinnitus a¤ects searched alternative (Andersson, 2001). CBT is a rela-
brain areas related to hearing and processing of sounds, tively brief psychological treatment approach directed at
but also that some involvement of the brain’s atten- identifying and modifying maladaptive behaviors and
tional and emotional systems might be involved (e.g., the cognitions by means of behavior change and cognitive
amygdala). restructuring. The focus is on applying techniques such
as applied relaxation in real-life settings. There is evi-
Treatment dence that CBT can be e¤ective in alleviating the distress
caused by tinnitus, and also that it works in a self-help
There is a long history of attempts to cure tinnitus. Al- format presented via the Internet (Andersson et al.,
though there are ways to alleviate the su¤ering, surgical 2002).
and pharmacological interventions have been largely
unsuccessful (Dobie, 1999). A pharmacological agent —Gerhard Andersson
that reliably abolishes tinnitus for a short period is the
local anesthetic agent lidocaine (Davies, 2001). About References
60% of tinnitus su¤erers respond to lidocaine adminis- Andersson, G. (2001). The role of psychology in managing
tered intravenously, which in some cases totally abol- tinnitus: A cognitive behavioural approach. Seminars in
ishes tinnitus for a brief period. Because of side e¤ects Hearing, 22, 65–76.
and the lack of e¤ective oral analogues, lidocaine is not Andersson, G., Eriksson, J., Lundh, L.-G., and Lyttkens, L.
a viable treatment for tinnitus (Davies, 2001). (2000). Tinnitus and cognitive interference: A Stroop para-
One treatment alternative for selected patients with digm study. Journal of Speech, Hearing, and Language Re-
tinnitus is antidepressants. A few studies have found search, 43, 1168–1173.
Andersson, G., Strömgren, T., Ström, T., and Lyttkens, L.
positive results with respect to tinnitus annoyance,
(2002). Randomised controlled trial of Internet based cog-
whereas more modest results were found in another nitive behavior therapy for distress associated with tinnitus.
study. More studies are needed, in particular to investi- Psychosomatic Medicine, 64, 810–816.
gate the e¤ects of selective serotonin reuptake inhibitors. Dauman, R. (2000). Electrical stimulation for tinnitus sup-
Tinnitus is rarely the only indication for surgery un- pression. In R. S. Tyler (Ed.), Tinnitus handbook (pp. 377–
less clear objective findings can identify a causal agent 398). San Diego, CA: Singular/Thomson Learning.
Davies, W. E. (2001). Future prospects for the pharmacologi- Hazell, J. W. P., Wood, S. M., Cooper, H. R., Stephens,
cal treatment of tinnitus. Seminars in Hearing, 22, 89–99. S. D. G., Corcoran, A. L., Coles, R. R. A., et al. (1985). A
Davis, A., and El Rafie, E. A. (2000). Epidemiology of tin- clinical study of tinnitus maskers. British Journal of Audiol-
nitus. In R. S. Tyler (Ed.), Tinnitus handbook (pp. 1–23). ogy, 19, 65–146.
San Diego, CA: Singular/Thomson Learning. Henry, J. L., and Wilson, P. H. (2001). Psychological manage-
Dobie, R. A. (1999). A review of randomized clinical trials of ment of chronic tinnitus: A cognitive-behavioral approach.
tinnitus. Laryngoscope, 109, 1202–1211. Boston: Allyn and Bacon.
Feldmann, H. (1971). Homolateral and contralateral masking Lockwood, A. H., Salvi, R. J., Coad, M. L., Towsley, M. L.,
of tinnitus by noise-bands and pure tones. Audiology, 10, Wack, D. S., and Murphy, B. W. (1998). The functional
138–144. neuroanatomy of tinnitus: Evidence for limbic system links
Hallam, R. S., Rachman, S., and Hinchcli¤e, R. (1984). Psy- and neural plasticity. Neurology, 50, 114–120.
chological aspects of tinnitus. In S. Rachman (Ed.), Con- McCombe, A., Baguley, D., Coles, R., McKenna, L., Mc-
tributions to medical psychology (pp. 31–53). Oxford, U.K.: Kinney, C., and Windle-Taylor, P. (2001). Guidelines for
Pergamon Press. the grading of tinnitus severity: The results of a working
Hazell, J. W. P. (1990). Tinnitus: II. Surgical management of group commissioned by the British Association of Otolar-
conditions associated with tinnitus and somatosounds. yngologists, Head and Neck Surgeons. Clinical Otolaryn-
Journal of Otolaryngology, 19, 6–10. gology, 26, 388–393.
Henry, J. A., and Meikle, M. B. (2000). Psychoacoustic mea- McFadden, D. (1982). Tinnitus: Facts, theories, and treatments.
sures of tinnitus. Journal of the American Academy of Washington, DC: National Academy Press.
Audiology, 11, 138–155. Melcher, J. R., Sigalosky, I. S., Guinan, J. J., and Levine,
Jastrebo¤, P. J. (1990). Phantom auditory perception (tinnitus): R. A. (2000). Lateralized tinnitus studied with functional
Mechanisms of generation and perception. Neuroscience magnetic resonance imaging: Abnormal inferior colliculus
Research, 8, 221–254. activation. Journal of Neurophysiology, 83, 1058–1072.
Jastrebo¤, P. J., and Jastrebo¤, M. M. (2000). Tinnitus Noble, W. (2000). Self-reports about tinnitus and about coch-
retraining therapy (TRT) as a method for treatment of tin- lear implants. Ear and Hearing, 21, 50S–59S.
nitus and hyperacusis patients. Journal of the American Park, J., White, A. R., and Ernst, E. (2000). E‰cacy of acu-
Academy of Audiology, 11, 162–177. puncture as a treatment for tinnitus: A systematic review.
Levine, R. A. (2001). Diagnostic issues in tinnitus: A neuro- Archives of Otolaryngology–Head and Neck Surgery, 126,
otological perspective. Seminars in Hearing, 22, 23–36. 489–492.
Lewis, J. E., Stephens, S. D. G., and McKenna, L. (1994). Schulman, A. (1995). A final common pathway for tinnitus:
Tinnitus and suicide. Clinical Otolaryngology, 19, 50–54. The medial temporal lobe system. International Tinnitus
McKenna, L. (2000). Tinnitus and insomnia. In R. S. Tyler Journal, 1, 115–126.
(Ed.), Tinnitus handbook (pp. 59–84). San Diego, CA: Stephens, D. (2000). A history of tinnitus. In R. S. Tyler (Ed.),
Singular/Thomson Learning. Tinnitus handbook (pp. 437–448). San Diego, CA: Singular/
Mirz, F., Pedersen, C. B., Ishizu, K., Johannsen, P., Ovesen, Thomson Learning.
T., Sødkilde-Jørgensen, H., and Gjedde, A. (1999). Positron Tyler, R. S. (Ed.). (2000). Tinnitus handbook. San Diego, CA:
emission tomography of cortical centres of tinnitus. Hearing Singular/Thomson Learning.
Research, 134, 133–144.
Newman, C. W., Wharton, J. A., and Jacobson, G. P. (1997).
Self-focused and somatic attention in patients with tinnitus.
Journal of the American Academy of Audiology, 8, 143– Tympanometry
149.
Rauschecker, J. P. (1999). Auditory cortical plasticity: A com-
parison with other sensory systems. Trends in Neuroscience, Tympanometry is a measure of the acoustic admittance
22, 74–80. or ease with which acoustic energy flows into the middle
Vernon, J. A., and Meikle, M. B. (2000). Tinnitus masking. In ear transmission system as air pressure is varied in the
R. S. Tyler (Ed.), Tinnitus handbook (pp. 313–356). San ear canal. This measure is accomplished by sealing a
Diego, CA: Singular/Thomson Learning. small probe device into the ear canal. A speaker delivers
a probe signal, typically 226 Hz, into the ear canal, and
Further Readings a microphone measures the amplitude and phase of the
Andersson, G., and Lyttkens, L. (1999). A meta-analytic re- probe signal admitted into the middle ear system. The
view of psychological treatments for tinnitus. British Jour- acoustic admittance is determined by the combined
nal of Audiology, 33, 201–210. sti¤ness (or conversely, compliance), mass, and resis-
Andersson, G., Vretblad, P., Larsen, H.-C., and Lyttkens, L. tance of the eardrum and all middle ear structures. In the
(2001). Longitudinal follow-up of tinnitus complaints. presence of middle ear pathology, these admittance
Archives of Otolaryngology–Head and Neck Surgery, 127, characteristics are altered, and therefore the amplitude
175–179. and phase of the probe signal measured in the ear canal
Drew, S., and Davies, E. (2001). E¤ectiveness of Gingko biloba are also altered. In pathology such as middle ear e¤u-
in treating tinnitus: Double blind, placebo controlled trial.
sion, the eardrum is sti¤ened by fluid in the middle ear
British Medical Journal, 322, 1–6.
Giraud, A. L., Chéry-Croze, S., Fischer, G., Fischer, C., Vig- cavity, and only minimal acoustic energy from the probe
hetto, A., Grégoire, M.-C., et al. (1999). A selective imaging signal is admitted into the middle ear; acoustic admit-
of tinnitus. NeuroReport, 10, 1–5. tance in the plane of the eardrum is described as low.
Hallam, R. S. (1989). Living with tinnitus: Dealing with the In contrast, pathology such as ossicular discontinuity
ringing in your ears. Wellingborough, U.K.: Thorson’s. makes the ear less sti¤, so that most of the acoustic en-
Tympanometry 559
to establish objective criteria for medical referral. Four

commonly used calculations are depicted in Figure 2.
The first, acoustic equivalent volume (Vea ), is an esti-
mate of the ear canal volume between the probe device
and the eardrum. This estimate typically is made using
a 226-Hz probe signal and an ear canal pressure of
200 daPa. When the probe device is sealed in the ear
canal, the measured acoustic admittance reflects the
combined e¤ects of the ear canal and the middle ear.
Under extreme ear canal pressures, however, the ear-
drum theoretically becomes so sti¤ that acoustic admit-
tance into the middle ear decreases to 0 mmhos. The
admittance measured at extreme pressures then is at-
tributed solely to the ear canal volume. When a 226-Hz
probe signal is used, the acoustic admittance measured
at 200 daPa is equal to the volume of the ear canal. In
Figure 2, Vea equals 0.6 cm 3 . In children less than 7
Figure 1. Three patterns of tympanograms recorded using a years old, Vea ranges from 0.3 to 0.9 cm 3 (Margolis and
226 Hz probe signal. Type A is normal, type B is flat, and type Heller, 1987; Shanks et al., 1992). In adults, Vea averages
C has a negative tympanogram peak pressure. 1.3 cm 3 in women and 1.5 cm 3 in men (Wiley et al.,
1996). As a subsequent example will demonstrate, Vea is
useful in di¤erentiating between intact and perforated
ergy from the probe signal is admitted into the middle eardrums when a flat type B tympanogram is recorded.
ear system, and acoustic admittance is high. Peak compensated static acoustic admittance (Ytm ) is
In addition to the loudspeaker and microphone, the the amplitude of the tympanogram between the peak
probe system is connected to a pneumatic pump that and 200 daPa. This measure describes the acoustic ad-
adjusts ear canal pressure over a range from 600 to mittance of the middle ear transmission system compen-
þ400 daPa. The dekapascal (daPa) is the unit of pres- sated for or minus the e¤ects of the ear canal volume. In
sure that has replaced mm H2 O (ANSI S3.39-1987). The Figure 2, Ytm is 1.1 acoustic mmhos, calculated as peak
two units, however, are nearly interchangeable admittance (1.7 mmhos) minus ear canal admittance
(1 daPa ¼ 1.02 mm H2 O). (0.6 mmhos). Many instruments ‘‘baseline correct’’
Tympanometry became a routine clinical procedure at 200 daPa, so that Ytm can be read directly from the
following the landmark paper of Jerger (1970). Jerger y-axis. If the tympanogram in Figure 2 were baseline
identified three basic tympanogram shapes. A tympano- corrected, zero admittance would be shifted upward
gram is a graphic display of acoustic admittance mea- to correspond with the Vea at 200 daPa. If a middle
sured as a function of changing ear canal pressure. A ear problem produces abnormally high sti¤ness, the
normal type A tympanogram is shown in Figure 1. In-
troduction of extreme pressures into the sealed ear canal
sti¤ens the eardrum, and theoretically, all of the acoustic
energy from the probe signal is reflected at the surface
of the eardrum, and admittance reaches a minimum.
Acoustic admittance gradually increases to a maximum,
and the probe signal becomes most audible, when the
pressure in the ear canal equals the pressure in the mid-
dle ear cavity. When the eustachian tube is functioning
normally, atmospheric pressure of 0 daPa is maintained
in the middle ear cavity, and tympanogram peak pres-
sure (TPP) also is 0 daPa. The ear canal pressure pro-
ducing peak admittance, therefore, provides an estimate
of middle ear pressure. When the eardrum is retracted
and negative middle ear pressure exists, the peak of the
tympanograms shifts to a corresponding negative value.
This tympanogram pattern is designated type C in
Figure 1. The third tympanogram, designated type B,
has no discernible peak and is flat. This pattern is re-
corded from ears with middle ear e¤usion (MEE), per- Figure 2. Four calculations made on 226 Hz tympanograms:
forated eardrums, or patent pressure-equalization tubes acoustic equivalent volume (Vea , in cm 3 ), peak compensated
(PET). static acoustic admittance (Ytm , in acoustic mmhos), tympa-
Tympanogram shape also has been quantified in an nogram width (TW, in daPa), and tympanogram peak pressure
attempt to aid in the diagnosis of middle ear disease and (TPP, in daPa).
Table 1. Means and 90% Ranges for Vea , Ytm , TW, and TPP from Several Large-Scale Studies in Subjects 8 Weeks to 90 Years
Old with Normal Middle Ear Transmission Systems
Study Age (yr) N Statistic Vea (cm 3 ) Ytm (mmhos) TW (daPa) TPP (daPa)
Wiley et al. (1996) 48–90 2147 Mean 1.36 0.66 75 23
90% range 0.9–2.0 0.2–1.5 35–125 85 to 5
Margolis and 19–61 87 Mean 1.05 0.78 77 19
Heller (1987) 90% range 0.63–1.46 0.32–1.46 51–114 83 to 0
2.8–5.8 92 Mean 0.74 0.55 100 30
90% range 0.42–0.97 0.22–0.92 59–151 139 to 11
Nozza et al. 3–16 130 Mean 0.90 0.78 104 34
(1992, 1994) 90% range 0.60–1.35 0.40–1.39 60–168 207 to 15
Roush et al. 0.5–2.5 þ1636 Mean 0.45 148
(1995) 90% range 0.20–0.70 102–204
Shanks et al. 8 wk–7 yr 334 Mean 0.58
(1992) 90% range 0.3–0.9
Abbreviations: Vea , acoustic equivalent volume; Ytm , peak compensated static acoustic admittance; TW, tympanogram width;
TPP, tympanogram peak pressure.
amplitude of the tympanogram, or Ytm , will decrease. sure changes at rates of 200–600 daPa/s and correction
Conversely, if a middle ear problem decreases the sti¤- for ear canal volume at 200 daPa.
ness of the eardrum or middle ear, Ytm will increase. Ytm The remaining figures depict tympanometry findings
at 226 Hz normally increases slightly from infancy to for a variety of middle ear pathologies. The probe signal
adulthood, with a mean of 0.5 acoustic mmhos at 4 frequency most commonly used to measure the admit-
months to 0.7 acoustic mmhos in adulthood (Margolis tance properties of the middle ear is 226 Hz. Although
and Heller, 1987; Holte, Margolis, and Cavanaugh, this low-frequency probe signal was selected partly at
1991; Roush et al., 1995; Wiley et al., 1996). random during instrument development (Terkildsen and
Tympanogram width (TW), defined as the width in Scott Nielson, 1960), it remains the most commonly
daPa at one-half Ytm , is a measure of the broadness of a used probe signal. Acoustic admittance measurements at
tympanogram peak. In Figure 2, TW is 85 daPa. TW is low frequencies are dominated by the sti¤ness charac-
not highly correlated with Ytm , and therefore it provides teristics of the eardrum and middle ear transmission
supplemental information regarding middle ear func- system, whereas measurements made at high frequencies
tion (Koebsell and Margolis, 1986). TW has been most are dominated by mass characteristics. Although high-
useful in identifying children with middle ear e¤usion frequency probe signals of 660–1000 Hz are valuable
(MEE). In some cases of MEE, Ytm is normal but TW is in assessing the mass characteristics of the middle ear,
abnormally broad. Nozza et al. (1992, 1994) reported the tympanogram patterns that result at high frequencies
that a TW greater than 275 daPa was associated with a are more complex and have not enjoyed widespread
high sensitivity (81%) and specificity (82%) in identifying use. Only low-frequency tympanograms are presented in
MEE. subsequent examples, but cases where high-frequency
The fourth measure, TPP, provides an estimate of probe signals might be advantageous are pointed out.
middle ear pressure or indirect measure of eustachian Additional references on high-frequency tympanometry
tube function. Figure 2 shows a normal TPP of 10 daPa. are provided.
Not all individuals with negative middle ear pressure Figure 3 shows a series of tympanograms recorded
develop MEE. Results from school screening programs from a child recovering from a 3-month episode of otitis
showed that medical referral on the basis of TPP alone media with MEE. When first evaluated, the admittance
resulted in unacceptably high overreferral rates, and tympanogram was flat (type B), with a normal Vea of
therefore TPP is no longer used in referral criteria. A 0.45 mmhos. Sequential pure-tone audiograms showed
negative TPP in conjunction with a reduced Ytm is a air–bone gaps across all frequencies, ranging from 10 dB
much stronger indication of MEE and cause for medical to 55 dB, that were greatest at 250 and 4000 Hz and
referral (Feldman, 1976). smallest at 2000 Hz. Over time, the tympanogram
Table 1 shows means and 90% normal ranges for Vea , changed to a type C pattern. In early recovery, the
Ytm , TW, and TPP from several large-scale studies in tympanogram, shown by the heavy line, had a shal-
subjects ages 8 weeks to 90 years. These calculations low (Ytm ¼ 0.25 mmhos), broad peak (TW ¼ 200 daPa)
are significantly a¤ected by the procedures (e.g., rate with negative peak pressure (TPP ¼ 100 daPa). Air–
and direction of pressure changes and the pressure used bone gaps decreased to 10–25 dB and were largest at
to estimate Vea ) used to record the tympanogram 4000 Hz. This tympanogram pattern has been demon-
(Shanks and Wilson, 1986). The data presented in Table strated in human temporal bones injected with middle
1 were calculated from tympanograms recorded using ear fluid up to the level of the umbo, producing a mass
the most commonly used parameters, descending pres- loading e¤ect on the eardrum (Renvall, Liden, and
Tympanometry 561
recorded from an ear with a patent PET, and the top

tympanogram was recorded from an ear with a trau-
matic perforation from a Q-tip. Vea often is larger with a
traumatic eardrum perforation than with a perforation
associated with chronic middle ear disease and poorly
developed mastoid air-cell system (Andreasson, 1977).
In a child less than 7 years old, a volume greater than
1.0 cm 3 is indicative of a perforated eardrum, whereas in
adults the volume must exceed 2.5 cm 3 (Shanks et al.,
1992). Volumes exceeding these ranges clearly indicate a
perforated eardrum, but flat tympanograms with smaller
volumes do not necessarily rule out eardrum perforation.
A flat tympanogram with a normal Vea can also be re-
corded from an ear with eardrum perforation and cho-
lesteatoma filling the middle ear space and closing o¤ the
Figure 3. Type B and two type C acoustic admittance tympa- mastoid air-cell system. Case history and otoscopic ex-
nograms recorded using a 226 Hz probe signal from a child amination are very important in these cases. No consis-
during recovery from otitis media with e¤usion. tent pattern of hearing loss is associated with eardrum
perforation; air–bone gaps can be absent or as large as
50–70 dB if necrosis of the incus also occurs.
Bjorkman, 1975). The mass e¤ect is greatest at high fre- Figure 5 demonstrates that otosclerosis also is asso-
quencies, as reflected by large air–bone gaps at 4000 Hz, ciated with a variety of tympanogram shapes. Tympa-
and is accentuated when tympanometry is performed nograms vary from a normal type A pattern (shown in
using high-frequency probe signals such as 600–800 Hz. Fig. 1) to a low-admittance, sti¤ pattern (type As ) shown
Further resolution of the otitis media produced a type C by the lower tympanogram in Figure 5. A third pattern
tympanogram, increasing to normal Ytm (0.35 mmhos) frequently recorded in otosclerosis is a normal type A
with a TPP of 200 daPa. Small air–bone gaps of 15– pattern, but with a narrow tympanogram width (Shanks,
20 dB at this time were confined to the 250–1000 Hz 1984; Shahnaz and Polka, 1997). Pure-tone audiometry
range and were virtually closed at 4000 Hz, indicating in otosclerosis shows a sti¤ness tilt, with the largest
increased sti¤ness of the eardrum from the negative air–bone gaps at low frequencies and the smallest air–
middle ear pressure without the mass loading e¤ects of bone gap near 2000 Hz. Otosclerosis is virtually the
the middle ear fluid. The tympanogram gradually only middle ear pathology where significant air–bone
returned to a type A. This case study demonstrates a gaps are measured in conjunction with a normal type A
variety of tympanogram patterns associated with otitis tympanogram.
media. Rather than being a drawback, the various tym- Figure 6 shows two cases of type A tympanograms
panogram shapes help clinicians track the resolution of with abnormally high Ytm (2.5 mmhos). Normal tym-
MEE. panograms with deep peaks sometimes are designated
The American Speech-Language-Hearing Association type A d . The bottom tympanogram was recorded from
(1997) has developed guidelines for screening infants and
children for chronic middle ear disorders with the po-
tential for causing significant hearing loss or long-lasting
speech, language, and learning deficits. Medical referral
is advised for infants when Ytm is less than 0.2 mmhos or
TW is greater than 235 daPa, and for 1- to 5-year-olds
when Ytm is less than 0.3 mmhos or TW is greater than
200 daPa if these abnormal findings persist at a 6–8-
week rescreening. Immediate medical referral is recom-
mended for otalgia, otorrhea, or eardrum perforation
noted otoscopically or from a flat tympanogram with
Vea greater than 1.0 cm 3 . Screening guidelines are not
available for infants less than 7 months old. In this age
group, tympanogram shapes at 226 Hz are irregular and
di‰cult to interpret (Holte, Margolis, and Cavanaugh,
1991).
Figure 4 displays three type B tympanograms. The
bottom tympanogram was recorded from an ear with an Figure 4. Type B acoustic admittance tympanograms recorded
intact eardrum and MEE; Vea of 0.45 cm 3 is normal for using a 226 Hz probe signal from an ear with middle ear e¤u-
a child’s ear canal. The other two tympanograms also sion (bottom tympanogram), an ear with a patent pressure
are flat but Vea is 3.25 cm 3 in one case and greater than equalization tube (middle tympanogram), and an ear with a
5.0 cm 3 in the other. The middle tympanogram was traumatic eardrum perforation (top tympanogram).
whereas ossicular discontinuity results in a maximum

conductive hearing loss.
The preceding cases demonstrate that tympanometry
is most beneficial when used as one of a battery of tests
that also include case history, otoscopic examination,
and pure-tone audiometry. The cases also demonstrate
that each unique middle ear problem does not produce
one and only one tympanogram pattern. On the con-
trary, a single pathology can produce several di¤erent
tympanometry patterns, and conversely, a single tympa-
nogram pattern can result from several di¤erent middle
ear problems. When used with a battery of tests, how-
ever, the contribution from tympanometry can be unique
and informative.
See also middle ear assessment in the child.
Figure 5. Type A acoustic admittance tympanograms recorded
using a 226 Hz probe signal in two ears with surgically con- —Janet E. Shanks
firmed otosclerosis.
References
American National Standards Institute. (1987). Specifications
an ear with traumatic ossicular discontinuity; the audio- for instruments to measure aural acoustic impedance and ad-
gram showed a maximum conductive hearing loss with mittance (aural acoustic immittance) (ANSI S3.39-1987).
30–70 dB air–bone gaps. The top tympanogram was New York: Acoustical Society of America.
recorded from an ear with a monomeric eardrum result- American Speech-Language-Hearing Association Audiologic
ing from a healed perforation; the audiogram showed Assessment Panel. (1996, 1997). Guidelines for audiologic
screening. Rockville, MD: Author.
slight air–bone gaps at only 3000 and 4000 Hz. This
Andreasson, L. (1977). Correlation of tubal function and vol-
A d pattern also is typical of ears with tympanosclerotic ume of mastoid and middle ear space as related to otitis
plaques on the eardrum and status post stapedectomy. media. Acta Otolaryngologica, 83, 29–33.
These cases of high-admittance pathology are another Feldman, A. (1976). Tympanometry: Procedures, interpreta-
indication for high-frequency tympanometry. High- tions and variables. In A. S. Feldman and L. A. Wilbur
frequency tympanograms in ears with ossicular disconti- (Eds.), Acoustic impedance and admittance: The measure-
nuity typically exhibit broader, more undulating peaks ment of middle ear function (pp. 103–155). Baltimore: Wil-
than tympanograms recorded from ears with eardrum liams and Wilkins.
pathology. In cases of high Ytm , otoscopic examination Holte, L., Margolis, R., and Cavanaugh, R. (1991). Devel-
of the eardrum is crucial; high-admittance pathology of opmental changes in multifrequency tympanograms.
the eardrum can dominate or mask low-admittance pa-
Jerger, J. (1970). Clinical experience with impedance audi-
thology such as otosclerosis. Pure-tone audiometry also ometry. Archives of Otolaryngology, 92, 311–324.
is invaluable in these cases. Eardrum pathology alone Koebsell, K., and Margolis, R. (1986). Tympanometric gradi-
does not produce a significant conductive hearing loss, ent measured from normal preschool children. Audiology,
25, 149–157.
Margolis, R., and Heller, J. (1987). Screening tympanometry:
Criteria for medical referral. Audiology, 26, 197–208.
Nozza, R., Bluestone, C., Kardatze, D., and Bachman, R.
(1992). Towards the validation of aural acoustic immittance
measures for diagnosis of middle ear e¤usion in children.
Ear and Hearing, 13, 442–453.
Nozza, R., Bluestone, C., Kardatze, D., and Bachman, R.
(1994). Identification of middle ear e¤usion by aural acous-
tic immittance measures for diagnosis of middle ear e¤usion
in children. Ear and Hearing, 15, 310–323.
Renvall, U., Liden, G., and Bjorkman, G. (1975). Experimen-
tal tympanometry in human temporal bones. Scandinavian
Roush, J., Bryant, K., Mundy, M., Zeisel, S., and Roberts, J.
(1995). Developmental changes in static admittance and
tympanometric width in infants and toddlers. Journal of the
Shahnaz, N., and Polka, L. (1997). Standard and multi-
Figure 6. Type A d acoustic admittance tympanograms recorded frequency tympanometry in normal and otosclerotic ears.
using a 226 Hz probe signal in an ear with traumatic ossic- Ear and Hearing, 18, 326–341.
ular discontinuity (bottom tympanogram) and in an ear with a Shanks, J. (1984). Tympanometry. Ear and Hearing, 5, 268–
monomeric tympanic membrane (top tympanogram). 280.
Vestibular Rehabilitation 563
Shanks, J., Stelmachowicz, P., Beauchaine, K., and Schulte, L. pard and Telian, 1993). Knowledge of vestibular anat-
(1992). Equivalent ear canal volumes in children pre- and omy, physiology, pathologies involved, and an in-depth
post-tympanostomy tube insertion. Journal of Speech and understanding of how various interventions can a¤ect
Hearing Research, 35, 936–941. outcome is very important for e¤ective treatment of
Shanks, J., and Wilson, R. (1986). E¤ects of direction and rate
persons with vestibular disorders. Exercises to decrease
of ear-canal pressure changes on tympanometric measures.
Journal of Speech and Hearing Research, 29, 11–19. the risk of falling, improve balance and postural control,
Terkildsen, K., Scott Nielsen, S. (1960). An electroacoustic improve confidence, and decrease the subjective feelings
impedance measuring bridge for clinical use. Archives of of dizziness also seem to decrease a patient’s anxiety
Otolaryngology, 72, 339–346. (Jacob et al., 2000). Vestibular exercise programs have
Wiley, T., Cruickshanks, K., Nondahl, D., Tweed, T., Klein, been shown to enhance the speed and degree of recovery
R., and Klein, B. (1996). Tympanometric measures in older (Herdman et al., 1995; Horak et al., 1992; Krebs et al.,
adults. Journal of the American Academy of Audiology, 7, 1993; Strupp et al., 1998; Yardley et al., 1998).
260–268. Common conditions often referred for vestibular
physical and occupational therapy include benign par-
Further Readings oxysmal positional vertigo (BPPV) (Blakely, 1994;
Colletti, V. (1977). Multifrequency tympanometry. Audiology, Herdman et al., 1993; Lynn et al., 1995), bilateral vesti-
16, 278–287. bulopathy (Brown et al., 2001; Krebs et al., 1993; Telian
Fowler, C., and Shanks, J. (2002). Tympanometry. In J. Katz et al., 1991), endolymphatic hydrops, labyrinthine con-
(Ed.), Handbook of clinical audiology (pp. 175–204). Balti- cussion (Cowand et al., 1998; Fujino et al., 1996; Horak
more: Lippincott Williams and Wilkins. et al., 1992; Shepard et al., 1990; Shepard et al., 1993;
Liden, G., Peterson, J., and Bjorkman, G. (1970). Tympa- Smith-Wheelock et al., 1991), labyrinthitis (Cowand
nometry: A method for analysis of middle ear function. et al., 1998; Fujino et al., 1996; Shepard et al., 1990;
Acta Otolaryngologica, 263, 218–224. Shepard et al., 1993; Smith-Wheelock et al., 1991),
Lilly, D. (1984). Multiple frequency, multiple component tym-
Ménière’s disease (Cowand et al., 1998; Fujino et al.,
panometry: New approaches to an old diagnostic problem.
Ear and Hearing, 5, 300–308. 1996; Shepard et al., 1990; Shepard et al., 1993; Smith-
Margolis, R. (1981). Fundamentals of acoustic immittance. In Wheelock et al., 1991), perilymph fistula, and vestibular
G. Popelka (Ed.), Hearing assessment with the acoustic re- neuritis. Central diagnoses include cervicogenic dizzi-
flex (pp. 117–144). New York: Grune and Stratton. ness, brainstem hemorrhage (Cowand et al., 1998;
Margolis, R., Paul, S., Saly, G., Schachern, P., and Keefe, D. Horak et al., 1992; Shepard et al., 1990; Shepard et al.,
(2001). Wideband reflectance tympanometry in chinchillas 1995; Smith-Wheelock et al., 1991), posttraumatic anxi-
and humans. Journal of the Acoustical Society of America, ety symptoms, stroke/transient ischemic attacks (TIA),
110, 1453–1464. traumatic head injury (Cowand et al., 1998; Horak et al.,
Margolis, R., Van Camp, K., Wilson, R., and Creten, W. 1992; Shepard et al., 1990; Shepard et al., 1993), and
(1985). Multi-frequency tympanometry in normal ears.
migraine-related vestibulopathy (Cass et al., 1997;
Shanks, J., and Shelton, C. (1991). Basic principles and clinical Whitney et al., 2000). Psychiatric disorders that have
applications of tympanometry. Otolaryngology Clinics of been reported to manifest with a component of dizziness
North America, 24, 299–328. include panic disorders (Jacob et al., 2000), agoraphobia
Shanks, J., Wilson, R., and Cambron, N. (1993). Multiple fre- (Jacob et al., 2000), and hyperventilation syndrome. The
quency tympanometry: E¤ects of ear canal volume com- most common nonvestibular causes of dizziness are low
pensation on static acoustic admittance and estimates of blood pressure and medication-induced dizziness (Fur-
middle ear resonance. Journal of Speech and Hearing Re- man and Whitney, 2000).
search, 36, 178–185. Persons with vestibular disorder present with various
Vanhuyse, V., Creten, W., and Van Camp, K. (1975). On the complaints, and often report experiencing balance dys-
W-notching of tympanograms. Scandinavian Audiology, 4,
function, dizziness, vertigo, anxiety about their symp-
45–50.
Zwislocki, J. (1982). Normal function of the middle ear and its toms, space and motion complaints (symptoms elicited
measurement. Audiology, 21, 4–14. by a specific visual stimulus pattern (Furman and Cass,
1996), and fear of falling. They may describe visual dis-
turbances, dysequilibrium, and dizziness occurring while
Vestibular Rehabilitation they are at work, at home, or engaged in leisure activ-
ities. Common visual problems experienced include dif-
ficulty focusing while reading, ‘‘bouncing’’ of the visual
The concept of prescribing exercise for persons with world as they move (oscillopsia), impaired smooth pur-
dizziness was first described by Cooksey and Cawthorne suit, saccades, and vergence. Balance problems fre-
in the 1950s (Cawthorne, 1944; Cooksey, 1946). Today, quently noted include increased sway while standing, an
exercise for persons with vestibular disorders is consid- inability to stand still, walking with a wide-based gait,
ered to be the standard of care (Cowand et al., 1998; veering during walking, adduction or crossing their legs
Herdman, 1990; Herdman, 1992; Herdman et al., 1995; during gait, di‰culty walking in the dark or on uneven
Herdman and Whitney, 2000). Exercises are specifically surfaces, bumping into things, or falling.
prescribed that help the person with a vestibular disorder Tinnitus, di‰culty hearing, and aural fullness are re-
either compensate for or adapt to the impairment (She- lated cochlear signs reported by persons with vestibular
disorders. Descriptions of problems related to the head toms on a verbal analogue scale and indicate the duration
include dizziness, spinning, headache, pressure, neck of the symptoms (Norre and De Weerdt, 1980; Smith-
pain, a swimming sensation, and heaviness. Often, per- Wheelock et al., 1991). Monitoring the intensity and
sons with vestibular disorders report fatigue and di‰- duration of the symptoms over the length of treatment
culty concentrating. All of these problems contribute to can provide information on the recovery of the patient.
making vestibular disorders di‰cult to treat, as a¤ected Two aspects of postural control should be evaluated,
individuals may have multiple symptoms and frequently the ability to move the center of gravity within the base
have more than one diagnosis. of support and the ability to utilize available sensory in-
A physical therapy evaluation provides information formation for balance. The ability to move the center of
on impairments and functional deficits so that appropri- gravity within the base of support is determined by ask-
ate intervention can be determined. A thorough workup ing the person to perform tasks such as shifting his or her
by the physician and vestibular function tests help direct weight while standing and then reaching for objects. This
the physical therapy evaluation and intervention. The can be quantified by measuring how far the person can
patient history should include goals of treatment, pre- reach (i.e., the Functional Reach test [Duncan et al.,
morbid health, current and premorbid activity level, and 1990] or the multidirectional Reach test [Newton, 2001])
a description of the onset, frequency, duration, and se- or how long they can maintain a position (i.e., standing
verity of the dizziness and imbalance. Not all persons on one foot or in tandem). In addition, the clinician can
with vestibular disorders experience both dizziness and ask patients to twist their trunk, pick up objects from
imbalance. Identifying the positions or situations that di¤erent surfaces, or stand on a narrow base of support
exacerbate or relieve the symptoms can a¤ord valuable to determine how stable they are doing functional activ-
insight into the cause of the problem. Gaining an un- ities. Having the person with a vestibular disorder stand
derstanding of the magnitude of the functional deficits on high-density foam with the eyes open and then closed
is very important. The intensity and duration of symp- (Clinical Test of Sensory Interaction and Balance, or
toms, the degree to which symptoms impede activities of CTSIB) (Shumway-Cook and Horak, 1986) can help the
daily living, and how symptoms a¤ect social activities therapist determine the fall risk (Anacker and Di Fabio,
help to determine intervention. 1992), and also how well the patient uses the sensory
A thorough exploration of the individual’s history of information that he or she has available (Shumway-
falling can also provide insight into the physiology of the Cook and Horak, 1986). Scores on the CTSIB have been
condition and the necessary treatment (Herdman et al., shown to correlate with conditions 4 and 5 of computed
2000; Whitney, Hudak, and Marchetti, 2000). Not sim- dynamic posturography (Anacker and Di Fabio, 1992;
ply the number of falls but also the conditions of the fall, Weber and Cass, 1993).
the frequency of falling, and whether medical treatment Patients with vestibular disorders often describe di‰-
was necessary are all important in the assessment of the culty walking, especially under varying sensory condi-
person with a vestibular disorder. Fear of falling in tions such as walking with head turns, in dimmed or
individuals who have fallen may constrict their willing- absent lighting, or with movement in the environment.
ness to move (Tinetti and Powell, 1993). Assessment of the person’s gait during various func-
The patient’s medical and surgical history will a¤ect tional tasks and under di¤erent sensory conditions is
the prescription of an exercise program. Persons with crucial. Persons with a vestibular disorder are asked to
premorbid orthopedic and cardiac limitations need to be walk, walk at di¤erent speeds, walk over and around
carefully monitored to ensure that they are safe with the objects, and walk with various head movements in order
exercise program. Frail, older adults may need to be seen to determine how stable they are during ambulation
more frequently in order to ensure compliance and (Whitney et al., 2000a; Whitney and Herdman, 2000).
safety with their exercises. These tasks can be quantified using the time it takes to
Typically, the range of motion of the joints, muscle complete a task or by qualitatively describing the move-
strength, sensation, vision, motions that provoke symp- ments, as in the Dynamic Gait Index (Shumway-Cook
toms, balance, and gait are all determined before an ex- and Woollacott, 1995).
ercise program is started. Because of the influence of The goals of vestibular rehabilitation include de-
somatosensation on balance, it is important to assess creasing the risk of falling, improving gaze stability,
range of motion and sensation, particularly in the ankles improving the person’s dynamic and static postural
and cervical region. The visual assessment includes test- control, decreasing symptoms, and enhancing the indi-
ing the function of the ocular muscles, including sac- vidual’s ability to carry out activities of daily living and
cades and smooth pursuit, as well as the function of the to work. The achievement of these goals is through
vestibular ocular reflex. exercise and the practice of activities in a safe environ-
Quantification of the movements and positions that ment. Customized exercise programs are better than ex-
trigger symptoms of dizziness not only provides infor- ercise handouts provided without direction as to which
mation on the cause of the symptoms but may also help exercises are most important to perform (Shepard and
in selecting activities for treatment. Therapists commonly Telian, 1995).
ask patients to move into and out of supine and side-lying Vestibular rehabilitation intervention is prescribed
positions, and then have the patients rate their symp- individually for each patient. For patients with periph-
eral vestibular lesions, vestibular rehabilitation exercises with vestibular disorders should have an opportunity to
are thought to promote compensation or recalibration work with a knowledgeable physical or occupational
of the vestibular system, specifically the vestibulo-ocular therapist, because quality of life can be improved with
reflex (VOR). The system appears to recalibrate because vestibular rehabilitation.
an error signal is created from the slip of an image on
the retina (Robinson, 1976; Fetter and Zee, 1988). The —Susan L. Whitney and Diane M. Wrisley
use of eye and head movements as an exercise to change
the gain of the VOR results in a change in the inhibi-
References
tion of the activity in the vestibular nuclei, and conse- Anacker, S. L., and Di Fabio, R. P. (1992). Influence of sen-
quently in enhanced patient function. Recovery of the sory inputs on standing balance in community-dwelling
VOR is frequency specific. Stimulation of the VOR elders with a recent history of falling. Physical Therapy, 72,
through exercises must be performed at varying fre- 575–581.
quencies for maximal functional recovery (Godaux, Blakely, B. W. (1994). A randomized, controlled assessment of
the canalith repositioning maneuver. Otolaryngology–Head
Halleux, and Gobert, 1983; Lisberger, Miles, and Opti-
can, 1983). Bronstein, A. M., and Hood, J. D. (1986). The cervico-ocular
Activity after a lesion to the vestibular system is im- reflex in normal subjects and patients with absent vestibular
portant. Animals that moved freely after surgery had function. Brain Research, 373, 399–408.
faster functional recovery (Lacour, Roll, and Appaix, Brown, K. E., Whitney, S. L., Wrisley, D. M., and Furman,
1976). Patients with vestibular disorders had faster re- J. M. (2001). Physical therapy outcomes for persons with
covery and improved function when they increased their bilateral vestibular loss. Laryngoscope, 111, 1812–1817.
activity early after surgery (Herdman et al., 1995). Cass, S. P., Furman, J. M., Ankerstjerne, J. K., Balaban, C.,
Some persons with vestibular disorders have little or Yetiser, S., and Aydogan, B. (1997). Migraine-related ves-
no remaining vestibular function, owing to disease or tibulopathy. Annals of Otology Rinology and Laryngology,
106, 181–189.
ototoxicity. These persons must learn how to use re- Cawthorne, T. (1944). The physiological basis for head exer-
maining sensory function such as somotosensation and cises. Journal of the Chartered Society of Physiotherapy, 30,
vision. In addition, receptors in the neck can assist in 106–107.
stabilizing vision and posture, although in patients with Cooksey, F. S. (1946). Rehabilitation in Vestibular Injuries.
intact vestibular systems, the cervical ocular reflex con- Proceedings of the Royal Society of Medicine, 39, 273–278.
tributes little to gaze stability. The cervical ocular reflex Cowand, J. L., Wrisley, D. M., Walker, M., Strasnick, B., and
performs maximally at lower frequencies and is accen- Jacobson, J. T. (1998). E‰cacy of vestibular rehabilitation.
tuated in patients with bilateral vestibular loss (Kasai Otolaryngology–Head and Neck Surgery, 118, 49–54.
and Zee, 1978; Bronstein and Hood, 1986). Smooth Duncan, P. W., Weiner, D., Chandler, J., and Studenski, S.
pursuits and saccades can also assist in stabilizing vision (1990). Functional Reach: A new clinical measure of bal-
ance. Journal of Gerontology, 45, M192–197.
at slow speeds (Kasai and Zee, 1978; Segal and Katsar- Fetter, M., and Zee, D. S. (1988). Recovery from unilateral
kas, 1988; Leigh et al., 1994). Patients who have no labyrinthectomy in rhesus monkeys. Journal of Neurophys-
function in the vestibular system will never be able to iology, 59, 370–393.
walk in the dark, will have great di‰culty walking on Fujino, A., Tokumasu, K., Okamoto, M., Naganuma, H.,
uneven surfaces, and will never be able to read and walk Hoshino, I., Arai, M., and Yoneda, S. (1996). Vestibular
at the same time. Driving a car without any vestibular training for acute unilateral vestibular disturbances: Its e‰-
function is impossible because the visual field jumps cacy in comparison with antivertigo drug. Acta Otolaryn-
(oscillopsia), especially as the car goes over bumps. gology Suppl (Stockh), 524, 21–26.
Patients with balance disorders are taught to max- Furman, J. M., and Cass, S. P. (1996). Balance disorders: A
imize the sensory function that remains, to substitute for case-study approach. Philadelphia: F. A. Davis.
Furman, J. M., and Whitney, S. L. (2000). Central causes of
sensory loss, and to predetermine when they will have dizziness. Physical Therapy, 80, 179–187.
di‰culty with balance so that they can modify their be- Godaux, E., Halleux, J., and Gobert, C. (1983). Adaptive
havior. Exercises are prescribed to enhance the use of change of the vestibulo-ocular reflex in the cat: The e¤ects
vestibular, visual, and somatosensory inputs. Habitua- of a long term frequency-selective procedure. Experimental
tion exercises may be recommended for patients with Brain Research, 49, 28–34.
dizziness provoked by specific position changes. Herdman, S. J. (1990). Assessment and Treatment of balance
The outcomes of vestibular rehabilitation have in- disorders in the vestibular deficient patient. In S. J. Herd-
cluded a decrease in dizziness and vertigo, a decrease in man (Ed.), Vestibular rehabilitation (pp. 87–94). Philadel-
the number of falls, improved gait, decreased neck pain, phia: F. A. Davis.
improved VOR, greater balance confidence, decreased Herdman, S. J. (1992). Physical therapy management of ves-
tibular disorders in older patients. Physical Therapy Prac-
anxiety, improvements in activities of daily living, and tice, 1, 77–87.
improvements in the perceived disability. Generally, Herdman, S. J., Blatt, P., Schubert, M. C., and Tusa, R. J.
persons with peripheral vestibular disorders have a (2000). Falls in patients with vestibular deficits. American
better prognosis than those with central vestibular dis- Journal of Otology, 21, 847–851.
orders. Persons with both central and peripheral ves- Herdman, S. J., Clendaniel, R. A., Mattox, D. E., Holliday,
tibular disorders have a poorer prognosis. All persons M. J., and Niparko, J. K. (1995). Vestibular adaptation
exercises and recovery: Acute stage after acoustic neuroma Shumway-Cook, A., and Horak, F. B. (1986). Assessing the
resection. Otolaryngology–Head and Neck Surgery, 113, influence of sensory interaction on balance. Physical Ther-
77–87. apy, 66, 1548–1550.
Herdman, S. J., Tusa, R. J., Zee, D. S., Proctor, L. R., Shumway-Cook, A., and Woollacott, M. (1995). Motor con-
and Mattox, D. E. (1993). Single treatment approaches trol: Theory and practical applications (pp. 323–324). Balti-
to benign paroxysmal positional vertigo. Archives of more, MD: Williams and Wilkins.
Otolaryngology–Head and Neck Surgery, 119, 450–454. Smith-Wheelock, M., Shepard, N. T., and Telian, S. A. (1991).
Herdman, S. J., and Whitney, S. L. (2000). Treatment of ves- Long-term e¤ects for treatment of balance dysfunction:
tibular hypofunction. In S. J. Herdman (Ed.), Vestibular Utilizing a home exercise approach. Seminars in Hearing,
rehabilitation (pp. 387–423). Philadelphia: F. A. Davis. 12, 297–302.
Horak, F. B., Jones-Rycewicz, C., Black, F. O., and Shumway- Strupp, M., Arbusow, V., Maag, K. P., Gall, C., and Brandt,
Cook, A. (1992). E¤ects of vestibular rehabilitation on T. (1998). Vestibular exercises improve central vestibulo-
dizziness and imbalance. Otolaryngology–Head and Neck spinal compensation after vestibular neuritis. Neurology, 51,
Surgery, 106, 175–180. 838–844.
Jacob, R. G., Whitney, S. L., Detweiler-Shostak, G., and Fur- Telian, S. A., Shepard, N. T., Smith-Wheelock, M., and
man, J. M. (2000). E¤ect of vestibular rehabilitation in Hoberg, M. (1991). Bilateral vestibular paresis: Diagnosis
patients with panic disorder with agoraphobia and ves- and treatment. Otolaryngology–Head and Neck Surgery,
tibular dysfunction: A pilot study. Journal of Anxiety Dis- 104, 67–71.
orders, 15, 131–146. Tinetti, M. E., and Powell, L. E. (1993). Fear of falling and
Kasai, T., and Zee, D. S. (1978). Eye-head coordination in low self-e‰cacy: A cause of dependence in elderly persons.
labyrinthine-defective human beings. Brain Research, 144, Journal of Gerontology, 48, 35–38.
123–141. Weber, P. C., and Cass, S. P. (1993). Clinical assessment of
Krebs, D. E., Gill-Body, K. M., Riley, P. O., and Parker, S. W. postural stability. American Journal of Otology, 14, 566–569.
(1993). Double-blind placebo-controlled trial of rehabilita- Whitney, S. L., and Herdman, S. J. (2000). Physical therapy
tion for bilateral vestibular hypofunction: Preliminary re- assessment of vestibular hypofunction. In S. J. Herdman
port. Otolaryngology-Head and Neck Surgery, 109, 735– (Ed.), Vestibular Rehabilitation (pp. 333–372). Philadel-
741. phia: F. A. Davis.
Lacour, M., Roll, J. P., and Appaix, M. (1976). Modifications Whitney, S. L., Hudak, M. K., and Marchetti, G. F. (2000).
and development of spinal reflexes in the alert baboon The dynamic gait index relates to self-reported fall history
(Papio papio) following a unilateral vestibular neurectomy. in individuals with vestibular dysfunction. Journal of Ves-
Brain Research, 113, 255–269. tibular Research, 10, 99–105.
Leigh, R. J., Huebner, W. P., and Gordon, J. L. (1994). Sup- Whitney, S. L., Wrisley, D. M., Brown, K. E., and Furman,
plementation of the vestibulo-ocular reflex by visual fixation J. M. (2000). Physical therapy for migraine-related vesti-
and smooth pursuit. Journal of Vestibular Research, 4, 347– bulopathy and vestibular dysfunction with history of mi-
353. graine. Laryngoscope, 110, 1528–1534.
Lisberger, S. G., Miles, F. A., and Optican, L. M. (1983). Yardley, L., Beech, S., Zander, L., Evans, T., and Weinman, J.
Frequency-selective adaptation: Evidence for channels in (1998). A randomized controlled trial of exercise therapy for
the vestibulo-ocular reflex. Journal of Neuroscience, 3, dizziness and vertigo in primary care. British Journal of
1234–1244. General Practice, 48, 1136–1140.
Lynn, S., Pool, A., Rose, D., Brey, R., and Suman, V. (1995).
Randomized trial of the canalith repositioning procedure. Further Readings
Otolaryngology–Head and Neck Surgery, 113, 712–720.
Newton, R. A. (2001). Validity of the multi-directional reach Brown, K. E., Whitney, S. L., Wrisley, D. M., and Furman,
test: A practical measure for limits of stability in older J. M. (2001). Physical therapy outcomes for persons with
adults. Journals of Gerontology. Series A, Biological Sci- bilateral vestibular loss. Laryngoscope, 111, 1812–1817.
ences and Medical Sciences, 56A, M248–M252. Cass, S. P., Borello-France, D., and Furman, J. M. (1996).
Norre, M., and De Weerdt, W. (1980). Treatment of vertigo Functional outcome of vestibular rehabilitation in patients
based on habituation. 2. Technique and results of habitua- with abnormal sensory organization testing. American
tion training. Journal of Laryngology and Otology, 94, 689– Journal of Otolaryngology, 17, 581–594.
696. Cohen, H. (1994). Vestibular rehabilitation improves daily life
Robinson, D. A. (1976). Adaptive control of vestibulo-ocular function. American Journal of Occupational Therapy, 48,
reflex by the cerebellum. Journal of Neurophysiology, 39, 919–925.
954–969. Herdman, S. J., Clendaniel, R. A., Mattox, D. E., Holliday,
Segal, B. N., and Katsarkas, A. (1988). Long-term deficits M. J., and Niparko, J. K. (1995). Vestibular adaptation
of goal-directed vestibulo-ocular function following total exercises and recovery: Acute stage after acoustic neuroma
unilateral loss of peripheral vestibular function. Acta Oto- resection. Otolaryngology–Head and Neck Surgery, 113,
Laryngologica Supplement (Stockh), 106, 102–110. 77–87.
Shepard, N. T., and Telian, S. A. (1995). Programmatic ves- Horak, F. B., Jones-Rycewicz, C., Black, F. O., and Shumway-
tibular rehabilitation. Otolaryngology–Head and Neck Sur- Cook, A. (1992). E¤ects of vestibular rehabilitation on
gery, 112, 173–182. dizziness and imbalance. Otolaryngology–Head and Neck
Shepard, N. T., Telian, S. A., and Smith-Wheelock, M. (1990). Surgery, 106, 175–180.
Habituation and balance retraining therapy: A retrospective Karlberg, M., Magnusson, M., Malmstrom, E. M., Melander,
review. Neurology Clinics, 8, 459–475. A., and Moritz, U. (1996). Postural and symptomatic im-
Shepard, N. T., Telian, S. A., Smith-Wheelock, M., and Raj, provement after physiotherapy in patients with dizziness of
A. (1993). Vestibular and balance rehabilitation therapy. suspected cervical origin. Archives of Physical Medicine and
Annals of Otology, Rhinology, Laryngology, 102, 198–205. Rehabilitation, 77, 874–882.
Krebs, D. E., Gill-Body, K. M., Riley, P. O., and Parker, S. W. Smith-Wheelock, M., Shepard, N. T., and Telian, S. A. (1991).
(1993). Double-blind placebo-controlled trial of rehabilita- Long-term e¤ects for treatment of balance dysfunction:
tion for bilateral vestibular hypofunction: Preliminary re- Utilizing a home exercise approach. Seminars in Hearing,
port. Otolaryngology–Head and Neck Surgery, 109, 735–741. 12, 297–302.
Mruzek, M., Barin, K., Nichols, D. S., Burnett, C. N., and Strupp, M., Arbusow, V., Maag, K. P., Gall, C., and Brandt,
Welling, D. B. (1995). E¤ects of vestibular rehabilitation T. (1998). Vestibular exercises improve central vestibulo-
and social reinforcement on recovery following ablative spinal compensation after vestibular neuritis. Neurology, 51,
vestibular surgery. Laryngoscope, 105, 686–692. 838–844.
Norre, M., and De Weerdt, W. (1980). Treatment of vertigo Szturm, T., Ireland, D. J., and Lessing-Turner, M. (1994).
based on habituation: 2. Technique and results of habitua- Comparison of di¤erent exercise programs in the rehabili-
tion training. Journal of Laryngology and Otology, 94, 689– tation of patients with chronic peripheral vestibular dys-
696. function. Journal of Vestibular Research, 4, 461–479.
Shepard, N. T., and Telian, S. A. (1995). Programmatic ves- Whitney, S. L., Wrisley, D. M., Brown, K. E., and Furman,
tibular rehabilitation. Otolaryngology–Head and Neck Sur- J. M. (2000). Physical therapy for migraine-related vesti-
gery, 112, 173–182. bulopathy and vestibular-dysfunction with history of mi-
Shepard, N. T., Telian, S. A., and Smith-Wheelock, M. (1990). graine. Laryngoscope, 110, 1528–1534.
Habituation and balance retraining therapy: A retrospective Wrisley, D. M., Sparto, P. J., Whitney, S. L., and Furman,
review. Neurology Clinics, 8, 459–475. J. M. (2000). Cervicogenic dizziness: A review of diagnosis
Shepard, N. T., Telian, S. A., Smith-Wheelock, M., and Raj, and treatment. Journal of Orthopedic and Sports Physical
A. (1993). Vestibular and balance rehabilitation therapy. Therapy, 30, 755–766.
Annals of Otology, Rhinology, and Laryngology, 102, 198– Yardley, L., Beech, S., Zander, L., Evans, T., and Weinman, J.
205. (1998). A randomized controlled trial of exercise therapy
Shumway-Cook, A., and Horak, F. B. (1986). Assessing the for dizziness and vertigo in primary care. British Journal of
influence of sensory interaction on balance. Physical Ther- General Practice, 48, 1136–1140.
apy, 66, 1548–1550.
Contributors
General Editor Raymond D. Kent
University of Wisconsin-Madison, Madison, Wisconsin
Advisory Editors
Voice Disorders in Children Steven D. Gray (deceased)
University of Utah, Salt Lake City, Utah
Voice Disorders in Adults Robert E. Hillman
Massachusetts Eye and Ear Infirmary, Boston, Massachusetts
Speech Disorders in Children Lawrence D. Shriberg
University of Wisconsin-Madison, Madison, Wisconsin
Speech Disorders in Adults Joseph R. Du¤y
Mayo Clinic, Rochester, Minnesota
Language Disorders in Children Mabel L. Rice
University of Kansas, Lawrence, Kansas
Language Disorders in Adults David A. Swinney
University of California at San Diego, San Diego, California
and
Lewis P. Shapiro
San Diego State University, San Diego, California
Hearing Disorders in Children Fred H. Bess
Vanderbilt University, Nashville, Tennessee
Hearing Disorders in Adults Sandra Gordon-Salant
University of Maryland, College Park, Maryland
Contributing Authors
Faith W. Akin Pelagie M. Beeson
James H. Quillen VA Medical Center, Mountain Home, University of Arizona, Tuscon, Arizona
Tennessee agraphia
electronystagmography Gerald S. Berke
Jont B. Allen University of California at Los Angeles School of Medicine,
University of Illinois at Urbana-Champaign, Beckman Los Angeles, California
Institute, Urbana, Illinois laryngeal reinnervation procedures
amplitude compression in hearing aids Ken Bleile
Gerhard Andersson University of Northern Iowa, Cedar Falls, Iowa
Uppsala University, Uppsala, Sweden speech disorders in children: birth-related risk
tinnitus factors
Daniel H. Ashmead Joel H. Blumin
Vanderbilt University Medical Center, Nashville, Tennessee University of Pennsylvania School of Medicine, Philadelphia,
audition in children, development of Pennsylvania
Laura J. Ball laryngeal reinnervation procedures
University of Nebraska Medical Center, Omaha, Nebraska Sheila E. Blumstein
augmentative and alternative communication approaches Brown University, Providence, Rhode Island
in adults phonology and adult aphasia
augmentative and alternative communication approaches Carol A. Boliek
in children University of Alberta, Edmonton, Alberta
Martin J. Ball speech development in infants and young children with
University of Louisiana at Lafayette, Lafayette, Louisiana a tracheostomy
speech assessment, instrumental Louis D. Braida
Shari R. Baum Massachusetts Institute of Technology, Cambridge,
McGill University, Montreal, Quebec Massachusetts
prosodic deficits frequency compression
Kathryn A. Bayles Mitchell F. Brin
University of Arizona, Tuscon, Arizona Columbia University, New York, New York
dementia hypokinetic laryngeal movement disorders
570 Contributors
Bonnie Brinton segmentation of spoken language by normal adult

Brigham Young University, Provo, Utah listeners
pragmatics Barbara L. Davis
psychosocial problems associated with communicative University of Texas at Austin, Austin, Texas
disorders developmental apraxia of speech
Hiram Brownell Peter A. de Villiers
Boston College, Chestnut Hill, Massachusetts Smith College, Northampton, Massachusetts
right hemisphere language and communication functions language of the deaf: acquisition of english
in adults language of the deaf: sign language
Hugh W. Buckingham Allan O. Diefendorf
Louisiana State University, Baton Rouge, Louisiana Indiana University School of Medicine, Indianapolis,
phonological analysis of language disorders in aphasia Indiana
Eugene H. Buder pediatric audiology: the test battery approach
University of Memphis, Memphis, Tennessee Harvey Dillon
acoustic assessment of voice The National Acoustic Laboratories, Chatswood, Australia
Angela Burda hearing aids: prescriptive fitting
University of Northern Iowa, Cedar Falls, Iowa
Christine Dollaghan
speech disorders in children: birth-related risk factors
University of Pittsburgh, Pittsburgh, Pennsylvania
Thomas Campbell early recurrent otitis media and speech development
University of Pittsburgh and Children’s Hospital of
Patrick J. Doyle
Pittsburgh, Pittsburgh, Pennsylvania
VA Pittsburgh Healthcare System and University of
early recurrent otitis media and speech development
Pittsburgh, Pittsburgh, Pennsylvania
David Caplan apraxia of speech: nature and phenomenology
Massachusetts General Hospital, Boston, Massachusetts apraxia of speech: treatment
aphasic syndromes: connectionist models
Philip C. Doyle
Janina K. Casper University of Western Ontario, London, Ontario
SUNY-Upstate Medical Center, Syracuse, New York alaryngeal voice and speech rehabilitation
vocal hygiene laryngeal trauma and peripheral structural
Hugh W. Catts ablations
University of Kansas, Lawrence, Kansas voice rehabilitation after conservation laryngectomy
language impairment and reading disability Nina F. Dronkers
Michael R. Chial VA Northern California Health Care System, Martinez,
University of Wisconsin-Madison, Madison, Wisconsin California
hearing protection devices aphasia: the classical syndromes
Li-Rong Lilly Cheng Judy R. Dubno
San Diego State University, San Diego, California Medical University of South Carolina, Charleston, South
speech and language issues in children from asian-pacific Carolina
backgrounds presbyacusis
Melissa Cheslock Joseph R. Du¤y
Georgia State University, Atlanta, Georgia Mayo Clinic, Rochester, Minnesota
augmentative and alternative communication: general dysarthrias: characteristics and classification
issues Anh Duong
Chris Code University of Montreal, Montreal, Quebec
University of Exeter, Exeter, United Kingdom discourse impairments
aphasia treatment: psychosocial issues Tanya L. Eadie
Robyn M. Cox University of Western Ontario, London, Ontario
University of Memphis, Memphis, Tennessee alaryngeal voice and speech rehabilitation
hearing aid fitting: evaluation of outcomes David A. Eddins
Martha Crago SUNY-Bu¤alo, Bu¤alo, New York
McGill University, Montreal, Quebec temporal resolution
language impairment in children: cross-linguistic studies Mary Louise Edwards
Carl C. Crandell Syracuse University, Syracuse, New York
University of Florida, Gainesville, Florida phonetic transcriptions of children’s speech
classroom acoustics Wiltrud Fassbinder
Richard F. Curlee University of Pittsburgh, Pittsburgh, Pennsylvania
University of Arizona, Tuscon, Arizona right hemisphere language disorders
stuttering Heidi M. Feldman
Anne Cutler University of Pittsburgh School of Medicine, Pittsburgh,
Max Planck Institute for Psycholinguistics, Nijmegen, The Pennsylvania
Netherlands language development in children with focal lesions
Contributors 571
John A. Ferraro Helen M. Hanson

University of Kansas Medical Center, Kansas City, Kansas Massachusetts Institute of Technology, Cambridge,
electrocochleography Massachusetts
Marc E. Fey voice acoustics
University of Kansas Medical Center, Kansas City, Kansas M. N. Hegde
preschool language intervention California State University at Fresno, Fresno, California
W. Tecumseh Fitch speech disorders in children: behavioral approaches to
Harvard University, Cambridge, Massachusetts remediation
vocal production system: evolution Nancy Helm-Estabrooks
Cynthia G. Fowler Boston University School of Medicine, Boston, Massachusetts
University of Wisconsin-Madison, Madison, Wisconsin perseveration
auditory brainstem response in adults Janet Helminski
Cynthia Fox Midwestern University School of Physical Therapy, Downers
University of Colorado, Boulder, Colorado Grove, Illinois
hypokinetic laryngeal movement disorders ototoxic medications
Naama Friedmann Gregory Hickok
Tel Aviv University, Tel Aviv, Israel University of California at Irvine, Irvine, California
syntactic tree pruning auditory-motor interaction in speech and language
Martin Fujiki functional brain imaging
Brigham Young University, Provo, Utah Argye E. Hillis
pragmatics Johns Hopkins University School of Medicine, Baltimore,
psychosocial problems associated with communicative Maryland
disorders alexia
Fred Genesee Robert E. Hillman
McGill University, Montreal, Quebec Massachusetts Eye and Ear Infirmary and Massachusetts
bilingualism and language impairment General Hospital Institute of Health Professions, Boston,
LouAnn Gerken Massachusetts; Harvard Medical School, Cambridge,
University of Arizona, Tuscon, Arizona Massachusetts
linguistic aspects of child language impairment— aerodynamic assessment of vocal function
prosody Jacqueline J. Hinckley
Bruce Gerratt University of South Florida, Tampa, Florida
University of California at Los Angeles School of Medicine, communication disorders in adults: functional
Los Angeles, California approaches to aphasia
voice quality, perceptual evaluation Megan M. Hodge
Judith A. Gierut University of Alberta, Edmonton, Alberta
Indiana University, Bloomington, Indiana speech disorders in children: motor speech disorders of
speech assessment in children: descriptive linguistic known origin
methods Barbara Hodson
Lisa Go¤man Wichita State University, Wichita, Kansas
Purdue University, West Lafayette, Indiana phonological awareness intervention for children with
motor speech involvement in children expressive phonological impairments
Brian Goldstein Erica Ho¤
Temple University, Philadelphia, Pennsylvania Florida Atlantic University, Davie, Florida
speech issues in children from latino backgrounds poverty: effects on language
Michael P. Gorga Jeannette D. Hoit
Boys Town National Research Hospital, Omaha, Nebraska University of Arizona, Tuscon, Arizona
otoacoustic emissions in children ventilator-supported speech production
Yosef Grodzinsky Audrey L. Holland
McGill University, Montreal, Quebec; Tel Aviv University, University of Arizona, Tuscon, Arizona
Tel Aviv, Israel communication disorders in adults: functional
trace deletion hypothesis approaches to aphasia
Murray Grossman Linda J. Hood
University of Pennsylvania, Philadelphia, Pennsylvania Kresge Hearing Research Laboratory, New Orleans,
alzheimer’s disease Louisiana
Timothy C. Hain genetics and craniofacial anomalies
Northwestern University School of Medicine, Chicago, Illinois Aquiles Iglesias
ototoxic medications Temple University, Philadelphia, Pennsylvania
Joseph W. Hall language disorders in latino children
University of North Carolina at Chapel Hill, Chapel Hill, David Ingram
North Carolina Arizona State University, Tempe, Arizona
hearing loss and the masking-level difference speech disorders in children: cross-linguistic data
572 Contributors
Yves Joanette laryngeal movement disorders: treatment with

University of Montreal, Montreal, Quebec botulinum toxin
discourse impairments Robert C. Marshall
Joel C. Kahane University of Kentucky, Lexington, Kentucky
University of Memphis, Memphis, Tennessee speech disorders in adults, psychogenic
anatomy of the human larynx Julie J. Masterson
Richard C. Katz Southwestern Missouri State University, Springfield, Missouri
Carl T. Hayden VA Medical Center, Phoenix, Arizona; speech and language disorders in children: computer-
Arizona State University, Tuscon, Arizona based approaches
aphasia treatment: computer-aided rehabilitation Patricia McCarthy
Gitte Keidser Rush University and Rush-Presbyterian-St. Luke’s Medical
The National Acoustic Laboratories, Chatswood, Australia Center, Chicago, Illinois
hearing aids: prescriptive fitting functional hearing loss in children
Ray D. Kent Rebecca J. McCauley
University of Wisconsin-Madison, Madison, Wisconsin University of Vermont, Burlington, Vermont
instrumental assessment of children’s voice speech sound disorders in children: description and
Herman Kolk classification
Catholic University of Nijmegen, Nijmegen, The Netherlands Karla McGregor
agrammatism Northwestern University, Evanston, Illinois
Jody Kreiman semantics
University of California at Los Angeles School of Medicine, Malcolm R. McNeil
Los Angeles, California University of Pittsburgh, Pittsburgh, Pennsylvania
voice quality, perceptual evaluation of apraxia of speech: nature and phenomenology
Charissa R. Lansing apraxia of speech: treatment
University of Illinois at Champaign-Urbana, Champaign, attention and language
Illinois Lise Menn
speechreading training and visual tracking University of Colorado, Boulder, Colorado
Charles R. Larson aphasiology, comparative
Northwestern University, Evanston, Illinois Jon F. Miller
vocalization, neural mechanisms University of Wisconsin-Madison, Madison, Wisconsin
David P. Lau communication skills of people with down syndrome
Singapore General Hospital, Singapore John H. Mills
infectious diseases and inflammatory conditions of the Medical University of South Carolina, Charleston, South
larynx Carolina
Matti Lehtihalmes noise-induced hearing loss
University of Oulu, Oulu, Finland presbyacusis
aphasia, wernicke’s Murray D. Morrison
Laurence B. Leonard University of British Columbia, Vancouver, British Columbia
Purdue University, West Lafayette, Indiana infectious diseases and inflammatory conditions of the
specific language impairment in children larynx
Barbara A. Lewis Dave J. Muller
Rainbow Babies and Children’s Hospital and Case Western University College Su¤olk, Ipswich, United Kingdom
Reserve University, Cleveland, Ohio aphasia treatment: psychosocial issues
speech disorders: genetic transmission Bruce E. Murdoch
Marcia C. Linebarger University of Queensland, Brisbane, Australia
Moss Rehabilitation Research Institute and Psycholinguistic language disorders in adults: subcortical involvement
Technologies, Inc., Philadelphia, Pennsylvania Thomas Murry
reversability/mapping disorders Columbia University, New York, New York
Sue Ellen Linville assessment of functional impact of voice disorders
Marquette University, Milwaukee, Wisconsin voice therapy for neurological aging-related voice
voice disorders of aging disorders
Diane Frome Loeb Penelope S. Myers
University of Kansas, Lawrence, Kansas Mayo Clinic, Rochester, Minnesota
communication disorders in infants and toddlers aprosodia
Brenda L. Lonsbury-Martin Igor V. Náblek
University of Colorado, Denver, Colorado University of Tennessee, Knoxville, Tennessee
otoacoustic emissions temporal integration
Christy L. Ludlow Stephen T. Neely
National Institute of Neurological Disorders and Stroke and Boys Town National Research Hospital, Omaha, Nebraska
National Institutes of Health, Bethesda, Maryland otoacoustic emissions in children
Contributors 573
Arlene C. Neuman Sheila Pratt

CUNY-The Graduate Center, New York, New York University of Pittsburgh, Pittsburgh, Pennsylvania
hearing aids: sound quality auditory training
Marilyn A. Nippold speech disorders secondary to hearing impairment
University of Oregon, Eugene, Oregon acquired in adulthood
language disorders in school-age children: aspects of Adele Proctor
assessment University of Illinois at Urbana-Champaign, Champaign,
Robert J. Nozza Illinois
Temple University School of Medicine, Philadelphia, dialect, regional
Pennsylvania Jennie Pyers
middle ear assessment in the child University of California at Berkeley, Berkeley, California
language of the deaf: sign language
Janna B. Oetting
Louisiana State University, Baton Rouge, Louisiana Lorraine Olson Ramig
dialect speakers University of Colorado, Boulder, Colorado
dialect versus disorder hypokinetic laryngeal movement disorders
Jennifer Ogar Steven Z. Rapcsak
VA Northern California Health Care System, Martinez, University of Arizona, Tuscon, Arizona
California agraphia
aphasia: the classical syndromes Sean M. Redmond
Gloria Streit Olness University of Utah, Salt Lake City, Utah
University of Texas at Dallas, Dallas, Texas social development and language impairment
discourse Charlotte M. Reed
Robert F. Orliko¤ Massachusetts Institute of Technology, Cambridge,
CUNY-Hunter College, New York, New York Massachusetts
electroglottographic assessment of voice frequency compression
Mary Joe Osberger Joanne E. Roberts
Advanced Bionics Corporation, Valencia, California University of North Carolina at Chapel Hill, Chapel Hill,
cochlear implants North Carolina
cochlear implants in children otitis media: effects on children’s language
Johanne Paradis Margaret A. Rogers
University of Alberta, Edmonton, Alberta University of Washington, Seattle, Washington
language impairment in children: cross-linguistic aphasia, primary progressive
studies Mary Ann Romski
Rhea Paul Georgia State University, Atlanta, Georgia
Southern Connecticut State University and Yale Child Study augmentative and alternative communication: general
Center, New Haven, Connecticut issues
autism mental retardation
Diane Paul-Brown Clark A. Rosen
American Speech-Language-Hearing Association, Rockville, University of Pittsburgh, Pittsburgh, Pennsylvania
Maryland assessment of functional impact of voice disorders
orofacial myofunctional disorders in children John C. Rosenbek
Richard K. Peach College of Health Professions, Gainesville, Florida
Rush University and Rush-Presbyterian-St. Luke’s Medical mutism, neurogenic
Center, Chicago, Illinois Jackson Roush
aphasia, global University of North Carolina at Chapel Hill, Chapel Hill,
melodic intonation therapy North Carolina
Adrienne L. Perlman hearing loss screening: the school-age child
University of Illinois at Urbana-Champaign, Champaign, Nelson Roy
Illinois University of Utah, Salt Lake City, Utah
dysphagia, oral and pharyngeal functional voice disorders
John M. Pettit psychogenic voice disorders: direct therapy
Radford University, Radford, Virginia Dennis M. Ruscello
transsexualism and sex reassignment: speech differences West Virginia University, Morgantown, West Virginia
James O. Pickles phonological errors, residual
The University of Queensland, Brisbane, Australia Ron Scherer
physiological bases of hearing Bowling Green State University, Bowling Green, Ohio
Geo¤ Plant voice production: physics and physiology
The Hearing Rehabilitation Foundation, Somerville, Robert S. Schlauch
Massachusetts University of Minnesota, Minneapolis, Minnesota
speech tracking pseudohypacusis
574 Contributors
Carson T. Schütze Joseph Stemple

University of California at Los Angeles, Los Angeles, Institute for Voice Analysis and Rehabilitation, Dayton, Ohio
California voice therapy: holistic techniques
morphosyntax and syntax Kenneth N. Stevens
Rose A. Sevcik Massachusetts Institute of Technology, Cambridge,
Georgia State University, Atlanta, Georgia Massachusetts
augmentative and alternative communication: general voice acoustics
issues Ida J. Stockman
mental retardation Michigan State University, East Lansing, Michigan
Harry N. Seymour phonology: clinical issues in serving speakers of african-
University of Massachusetts at Amherst, Amherst, american vernacular english
Massachusetts Carol Stoel-Gammon
language disorders in african-american children University of Washington, Seattle, Washington
Janet E. Shanks mental retardation and speech in children
Department of Veterans A¤airs Medical Center, Long Beach, Edythe A. Strand
California; University of California at Irvine, Irvine, Mayo Clinic, Rochester, Minnesota
California dysarthrias: management
tympanometry
Kathy Strattman
Robert V. Shannon Wichita State University, Wichita, Kansas
House Ear Institute, Los Angeles, California phonological awareness intervention for children with
auditory brainstem implant expressive phonological impairments
Lewis P. Shapiro Gerald A. Studebaker
San Diego State University, San Diego, California University of Memphis, Memphis, Tennessee
argument structure: representation and processing speech perception indices
Elaine R. Silliman Johan Sundberg
University of Florida, Tampa, Florida KTH Voice Research Centre, Stockholm, Sweden
inclusion models for children with developmental the singing voice
disabilities
Anne Marie Tharpe
Yvonne S. Sininger Vanderbilt University Medical Center, Nashville, Tennessee
House Ear Institute, Los Angeles, California audition in children, development of hearing loss and
auditory neuropathy in children teratogenic drugs or chemicals
Bernadette Ska Jack E. Thomas
University of Montreal, Montreal, Quebec Mayo Clinic, Rochester, Minnesota
discourse impairments laryngectomy
Joseph J. Smaldino Connie A. Tompkins
University of Northern Iowa, Cedar Falls, Iowa University of Pittsburgh, Pittsburgh, Pennsylvania
classroom acoustics right hemisphere language disorders
Steven L. Small Robert G. Turner
University of Chicago, Chicago, Illinois Louisiana State University Health Sciences Center, New
aphasia treatment: pharmacological approaches Orleans, Louisiana
clinical decision analysis
Ann Bosma Smit
Kansas State University, Manhattan, Kansas Ann A. Tyler
speech sampling, articulation tests, and intelligibility in University of Nevada at Reno, Reno, Nevada
children with phonological errors speech disorders in children: speech-language
speech sampling, articulation tests, and intelligibility in approaches
children with residual errors
Richard S. Tyler
Pamela E. Souza University of Iowa, Iowa City, Iowa
University of Washington, Seattle, Washington cochlear implants in adults: candidacy
suprathreshold speech recognition Hanna K. Ulatowska
Charles Speaks University of Texas at Dallas, Dallas, Texas
University of Minnesota, Minneapolis, Minnesota discourse
dichotic listening Miodrag Velickovic
Joy Stackhouse The Mount Sinai Medical Center, New York
University of She‰eld, She‰eld, United Kingdom hypokinetic laryngeal movement disorders
speech disorders in children: a psycholinguistic Shelley L. Velleman
perspective University of Massachusetts at Amherst, Amherst,
Elaine T. Stathopoulos Massachusetts
SUNY-Bu¤alo, Bu¤alo, New York speech disorders in children; descriptive linguistic
voice disorders in children approaches
Contributors 575
Katherine Verdolini Jennifer Windsor

University of Pittsburgh, Pittsburgh, Pennsylvania University of Minnesota, Minneapolis, Minnesota
voice therapy for adults language disorders in school-age children: overview
voice therapy for professional voice users Shelley Witt
Steven F. Warren University of Iowa, Iowa City, Iowa
University of Kansas, Lawrence, Kansas cochlear implants in adults: candidacy
prelinguistic communication intervention for children Diane M. Wrisley
with developmental disabilities Oregon Health and Sciences University, Beaverton, Oregon
Ruth V. Watkins vestibular rehabilitation
University of Illinois at Urbana-Champaign, Champaign, Michael K. Wynne
Illinois Indiana University School of Medicine, Indianapolis,
language in children who stutter Indiana
Peter Watson pediatric audiology: the test battery approach
Case Western Reserve University, Cleveland, Ohio William S. Yacullo
voice therapy: breathing exercises Governors State University, University Park, Illinois
Susan Ellis Weismer masking
University of Wisconsin-Madison, Madison, Wisconsin J. Scott Yaruss
memory and processing capacity University of Pittsburgh, Pittsburgh, Pennsylvania
Nathan V. Welham speech disfluency and stuttering in children
University of Wisconsin-Madison, Madison, Wisconsin Mehmet Yavas
instrumental assessment of children’s voice Florida International University, Miami, Flordia
Robert T. Wertz bilingualism, speech issues in
VA Tennessee Valley Healthcare System, Nashville, Tennessee Paul J. Yoder
aphasia: the classical syndromes Vanderbilt University, Nashville, Tennessee
Susan L. Whitney prelinguistic communication intervention for children
University of Pittsburgh Medical Center, Pittsburgh, with developmental disabilities
Pennsylvania Christine Yoshinaga-Itano
vestibular rehabilitation University of Colorado, Boulder, Colorado
Judith E. Widen assessment of and intervention with children who are
University of Kansas Medical Center deaf or hard of hearing
otoacoustic emissions in children William A. Yost
Terry L. Wiley Loyola University Chicago, Chicago, Illinois
Arizona State University, Tempe, Arizona auditory scene analysis
pure-tone threshold assessment pitch perception
Name Index
A Alivisatos, B., 60 Aran, J. M., 519
Aase, D., 217 Alku, P., 6 Arboix, A., 126
Abassian, M., 258 Allen, D., 396 Arbusow, V., 563
Abbas, P., 417, 448, 449 Allen, D. R., 473 Archut, A., 483
Abbeduto, L., 140, 141, 353 Allen, G., 150 Arcia, E., 322
Abberton, E., 170 Allen, J., 117 Ardila, A., 250
Abbs, J. H., 127 Allen, Jont B., 413–420, 538, 539 Arehart, K. H., 515
Abdala, C., 424 Allen, P., 424 Arenberg, I. K., 464
Abdalla, F., 332 Allen, S., 280, 332 Arensberg, K., 159
Abel, E. L., 494 Allen, T., 336 Arkebauer, H. J., 68
Abelson, R. P., 300 Allison, R. S., 361 Arlinger, S., 488
Abkarian, G. G., 105 Almor, A., 240 Armstrong, N., 287
Acevedo, M. A., 211 Almqvist, B., 490 Arnce, K. D., 532
Ackley, R. S., 464 Almqvist, U., 490 Arnold, D. S., 359
Adair, J. S., 145 Alpert, C., 379 Arnold, G. E., 54, 91
Adam, G., 332 Alpin, D. Y., 532 Arnold, S. E., 241
Adamek, M., 440 Alsago¤, Z. S., 146 Arnos, K. S., 518
Adams, A. M., 349 Alsop, D. C., 241 Aronson, A. E., 27, 29, 30, 31, 38, 49, 50, 60,
Adams, C., 153, 313, 330, 396, 402 Altho¤, L. K., 532, 533 82, 93, 101, 127, 186, 187
Adams, H. P., 244, 252 Altman, M. E., 83 Arriaga, R. J., 370, 371
Adams, J., 529 Amaducci, L., 364 Arvedson, J. C., 201, 273
Adams, M., 336 Amatuzzi, M. G., 434 Asagi, H., 33
Adams, S., 101, 102, 105 Ambalavanar, R., 60 Ashbaugh, C. J., 455
Adelstein, D. J., 21 Ambrose, N. G., 180, 181, 220, 221, 333, 334 Ashbrook, E., 341
Adepoju, A. A., 439 Amir, N., 312 Ashford, J. R., 389
Adger, C. T., 321 Amoils, C., 33 Ashmead, Daniel H., 424–426
Adisman, I., 47 Anacker, S. L., 564 Asking, A., 261
Adlaf, E., 162, 400 Andersen, E. S., 240 Aslin, R., 342
Adolfsson, R., 261 Andersen, R., 276 Aslin, R. N., 394, 424, 425
A¤ord, R. J., 243 Anderson, B. J., 38, 39 Asplund, K., 261
Afjeh, A. A., 77 Anderson, D., 140, 341 Åström, M., 261
Aftonomos, L. B., 255 Anderson, F. H., 246 Atcheson, B., 476
Agarwal, M., 77 Anderson, H., 502, 503 Ateunis, L. J. C., 135
Aggarwal, S., 21 Anderson, J., 334, 494 Atiuja, A. S., 22
Agid, Y., 304 Anderson, J. D., 47 Atkinson, L., 162, 324, 400
Agnew, W. F., 428 Anderson, J. M., 275 Aurousseau, C., 519
Agudelo, D., 72 Anderson, K., 496 Austin, D., 135, 156, 183, 184, 218
Aguilar, C., 529 Anderson, L., 140 Austin, J. L., 301
Aguirre, G. K., 306 Anderson, M., 117 Auther, L. L., 389
Ahad, P., 60 Anderson, R., 211, 321 Au-Yeung, J., 335
Ahern, M. B., 104 Anderson, R. C., 397 Aviv, J. E., 38, 41, 43, 134
Ahlsén, E., 266, 267 Anderson, S. R., 175 Awan, S. N., 3
Ahlstrom, J. B., 529 Andersson, Gerhard, 556–557 Awde, J. D., 47
Ahmed, S. W., 293, 402 Andersson, K., 27, 29, 49, 50, 186, 187 Aydogan, B., 563
Ahroon, W. A., 509, 510 Andersson, U., 541 Ayslan, F., 47
Akahane-Yamada, R., 440 Andreasson, L., 561 Aziz, N. S., 518
Akeroyd, M. A., 307 Andrews, G., 181, 220, 221, 222 Aziz, Q., 132
Akin, Faith W., 467–471 Andrews, H. B., 29
Aksit, R., 39 Angelicheva, D., 434 B
Alajouanine, M. S., 243 Angell, A. L., 371 Bach, J. R., 226, 228
Alarcon, N. B., 246, 247, 248 Ankerstjerne, J. K., 563 Bachman, D. L., 258
Alavi, A., 241 Annett, M., 458, 460 Bachman, R. N., 506, 560
Alba, A. S., 228 Annino, D. J., 39 Backman, L., 258
Albanese, A., 364 Ansel, B., 277 Bacon, S. P., 548, 554
Albert, M. L., 258, 302, 303, 347, 361, 362 Antonelli, A., 490 Baddeley, A. D., 273, 349, 350, 395
Alberti, P. W., 444, 533 Aoki, K. R., 38, 39 Bader, C., 36, 37
Alberti, V., 33 Apel, K., 164, 165 Bader, C. R., 511
Alberts, M. J., 132 Aplin, D., 475, 476 Baer, T., 25, 80
Alcantara, J. I., 438, 541 Appaix, M., 565 Bahr, R., 309
Alcock, K., 185 Applebaum, M., 359 Bahramali, H., 552
Alderson, D., 34 Appuzzo, M., 337 Bailey, A., 115, 116
Alexander, G. C., 481 Aqttard, M. D., 123 Bailey, B. J., 10, 80
Alexander, M., 363, 364 Arai, M., 563 Bailey, H. A. T., 502
Alexander, M. P., 145, 243, 246, 249, 315, Aram, D. M., 122, 143, 162, 184, 218, 312, Bailey, S., 211
387 330, 396 Bain, B., 328
Alipour, F., 77 Aramany, M. A., 47 Bain, B. A., 214, 217
578 Name Index
Bakchine, S., 145 Bath, A. P., 519 Berg, A. O., 135

Baken, R. J., 3, 4, 9, 23, 77, 171 Bathgate, D., 146 Berg, K. M., 425
Baker, A., 293 Batin, R. R., 225 Berger, E. H., 497, 498, 499, 502
Baker, C., 384 Bauch, C. D., 431, 444 Berger, K., 475
Baker, E., 190–191 Bauer, R. M., 273, 275 Berger, R. A., 483
Baker, E., 368 Baum, H., 208 Bergman, B., 515, 533
Baker, K. K., 31 Baum, H. M., 527, 528 Bergman, J., 293
Baker, K. L., 31 Baum, Shari R., 109, 367, 368, 381–382, 389 Bergman, N. M., 515, 516
Baker, L., 161 Bauman-Waengler, J., 157 Berke, Gerald S., 41–43, 77, 80
Bakke, M. H., 488 Baumgaertner, A., 387, 389, 390, 391 Berkley, David, 419
Bakker, K., 164, 165 Bayles, Kathryn A., 145, 240, 291–293, 361, Berko-Gleason, G., 302
Balaban, C., 563 362 Berlin, C. I., 11, 458, 533
Balaban, M. R., 273 Bazin, B., 145, 248 Berlin, P., 348
Baldo, J., 250 Beach, W. A., 248 Berliner, K. I., 455
Baldwin, D. A., 395 Beasman, J., 308, 309 Berman, S., 135
Ball, E., 153, 154 Beatens-Beardsmore, H., 119 Bernard, B. S., 135, 136, 359
Ball, Laura, 110–114 Beattie, R. C., 549 Bernbaum, J., 194
Ball, Martin J., 150, 169–173, 367 Beauchaine, K., 559 Berndt, R. S., 231, 244, 384, 405
Ball, V., 541 Beauchaine, K. A., 527 Bernhardt, B., 164, 199, 214
Ballad, W. J., 488 Beauchaine, K. L., 515, 516, 533 Bernhardt, N., 231
Ballard, K. J., 105 Beauchamp, N., 239 Berni, M. C., 150
Baloh, R. W., 467, 469, 470, 471, 519 Beauvois, M. F., 238 Bernstein, L. E., 543, 544
Balota, D. A., 240 Beck, L. B., 487 Bernstein, R. S., 358
Balwally, A., 33 Becker, J. A., 241 Bernstein-Ellis, E., 283
Bambara, L. S., 376 Beckerman, R. C., 177 Bernstein Rattner, N., 181, 182
Bamberg, M. G. W., 300 Beckett, R. L., 7, 68 Bernthal, J. E., 68, 114, 156, 157, 192, 214,
Bance, M. L., 519 Beckford, N., 73 216, 218
Bandettini, P. A., 306 Bedi, G., 403 Berry, D., 4
Bandler, R., 60, 61 Bedore, L., 322 Berry, D. A., 77, 89, 90, 96
Banerjee, S., 434 Bedrosian, J., 113–114 Berry, G. A., 502
Bang, S., 240 Beech, S., 563 Berry, S. W., 455
Bankson, N. W., 114, 156, 192, 214, 218 Beeghly, M., 396 Berthier, M., 60
Banner, M. J., 226 Beeman, M., 387, 390 Bertrand, D., 511
Banzett, R. B., 227, 228 Beers, M., 197 Bertrand, J., 141
Barber, H. O., 470 Beery, Q. C., 47, 505 Bess, F. H., 135, 481, 495, 529
Bargones, J. Y., 424 Beeson, Pelagie M., 233–235, 273 Bester, S., 283
Barker, M. J., 455 Behar, A., 499 Bettison, S., 439
Barker, P., 239 Behrens, S. J., 109, 386 Beukelman, D. R., 68, 111, 112, 113, 126,
Barkley, R. A., 273 Behrmann, M., 235, 239 129, 131, 202, 203, 277, 278
Barkneier, J., 134 Beitchman, J. H., 162, 184, 324, 400, 402 Bever, T. G., 270, 384
Barlow, D. W., 518 Beland, Renee, 247, 363 Bhat, V. K., 551
Barlow, J., 345 Belin, P., 60 Biassou, N., 240, 241
Barnard, K., 370 Bell, K., 129, 202, 203 Bidoli, E., 45
Barnas, E., 178 Bell-Berti, F., 170 Bidus, K., 38
Barnes, J., 86 Bellefleur, P., 473 Biebl, W., 29, 50
Barney, H., 225 Bellman, S. C., 472 Bielamowicz, S., 38, 39
Baron, J.-C., 241 Bellugi, U., 141, 266, 339, 340, 341 Bier, D., 253
Baron-Cohen, S., 395 Benali, K., 241 Bilger, R. C., 458, 471
Baroudy, F., 33 Benasich, A. A., 184 Biller, H., 39
Barr, B., 475, 476 Ben David, J., 518 Biller, J., 244, 252
Barr, M., 483 Benguerel, A. P., 541 Billroth, C. A. T., 138
Barresi, B., 232 Benigni, L., 375 Binder, J. R., 525
Barrett, R., 147 Benjamin, B., 34 Binetti, G., 240
Barrette, J., 261 Benke, T., 30 Binger, C., 110
Barry, J. G., 421 Benkert, K. K., 148 Bird, A., 156, 157, 216
Bars, D. M., 532, 533 Bennett, D. N., 472 Bird, J., 153
Bartels, C., 260 Bennett, E. M., 182 Birdsall, T. G., 551
Barth, E., 54 Bennett, S., 122 Birger, M., 531
Bartke, S., 331, 354 Bennetto, L., 140 Bishop, D. V. M., 143, 153, 184, 185, 311,
Bartlett, C., 402 Benninger, M., 86 312, 324, 330, 340, 350, 371, 373, 395, 396,
Bartlett, F., 300 Benninger, M. J., 22 400, 402
Barton, S., 117 Benowitz, L. I., 387 Bishop, M. J., 177
Barwick, M., 400 Bensaid, M., 6 Bishop, S., 350
Basili, A. M., 12 Ben-Shlomo, Y., 145 Bishop-Leone, J. K., 139
Basinger, D., 370 Benson, D. F., 145, 246, 249, 250, 263, 264, Bjorkman, G., 560–561
Basso, A., 243–244, 250, 264, 368 292, 293 Blache, S. E., 205
Bastholt, L., 46 Benson, R. W., 548 Blachman, B., 153, 154
Bastiaanse, R., 231, 406 Bentin, S., 272 Black, F. O., 563
Bastian, R. W., 90 Bentler, R. A., 481 Black, J., 476
Bateman, B. D., 308 Benton, A., 111 Black, R., 153, 448
Bateman, H. E., 17 Beranek, L. L., 538, 539 Black, S., 239
Bates, E., 231, 252, 253, 266, 267, 273, 311, Berardelli, A., 31 Black, S. E., 246
312, 336, 370, 375, 376, 384, 397 Berent, G., 337 Blackshaw, A., 146
Name Index 579
Blackstone, S., 111 Botting, N., 281, 373 Brooks, P. L., 541
Blackwell, A., 267, 273, 384 Bottini, G., 387 Brooks, R. A., 241
Blackwell, B., 258 Boucher, V., 348 Brookshire, B., 396
Blackwell, K. E., 41, 42, 43 Bouhuys, A., 83 Brookshire, R. H., 258
Blackwell, W., 462, 463, 464 Bouman, M. A., 551 Brouillette, R. T., 177
Blager, F. B., 38 Bourland, C., 424 Broussole, E., 145, 248
Blair, R. L., 464 Boves, L., 9 Brouwer, W. H., 273
Blake, M., 388, 389 Bow, C. P., 421 Brown, D., 490
Blake, M. L., 391 Bowdler, D., 475, 476, 477 Brown, J., 59, 82, 101
Blakely, B. W., 563 Bowers, D., 108, 382, 387, 389 Brown, J. B., 30, 31
Blalock, P. D., 49, 92 Bowers, P. G., 329 Brown, J. R., 127
Blamey, P. J., 421, 541 Bowler, J. V., 292–293 Brown, J. S., 46
Blanch, P., 226 Boyeson, M. G., 257 Brown, J. W., 109
Bland-Stewart, L., 295, 297, 298, 318, 319 Boyle, M., 46 Brown, K. E., 563
Blanken, G., 243 Bozet, C., 436 Brown, M. L., 223–224
Blatt, P., 564 Braak, E., 241 Brown, R., 341, 356
Blaugrund, S. M., 80 Braak, H., 241 Brown, R. G., 293
Bliss, L. S., 325, 399 Bracewell, R. J., 303 Brown, R. L., 135, 136, 218
Blitzer, A., 38, 39, 41 Brackmann, D. E., 427, 428 Brown, S. F., 335
Bloch, C., 80 Bradbury, D., 210 Brown, W., 92
Block, R. M., 43 Bradford, A., 122, 142, 143, 219 Brown, W. S., 70, 82
Bloemenkamp, D., 36 Bradley, D. C., 384 Brownell, Hiram H., 302, 386–387, 390, 391
Bloise, C. G. R., 387 Bradley, J., 472 Brownell, W. E., 511
Blom, E. D., 10, 11, 139 Bradley, L., 329 Brownlee, E., 402
Blomstrand, C., 261 Bradley, R. C., 224 Brownlie, E. B., 162
Blonder, L. X., 108, 382, 389 Bradlow, A. R., 440 Bruas, P., 439, 440
Blonsky, E., 30, 126 Bradshaw, J. L., 31 Bruch, L., 31
Blood, L., 358 Bradvik, B., 108 Brummett, R. E., 518
Bloodstein, O., 180, 220, 221 Brady-Wood, S., 458 Brun, A., 241
Bloom, R. L., 36 Braida, Louis D., 419, 471–474 Bruner, J., 375
Bloomfield, L., 119 Brainerd, E. L., 56 Brunswick, N., 387
Blossom-Stach, C., 383 Brammer, M. J., 546 Bryans, B., 264
Bluestone, C. D., 135, 505, 506, 560 Brandt, T., 563 Bryant, K., 560
Blum, A., 390 Brant, L. J., 527 Bryant, P., 329
Blumin, Joel H., 41–43 Braun, A. R., 222 Bryant, S. L., 362
Blumstein, S., 459 Braun, W., 43 Bryson, S., 117
Blumstein, Sheila E., 231, 363, 364, 366–368 Bravo, G. A., 147 Brzustowicz, L., 402
Blythe, M. E., 500, 501 Bray, P., 512 Bub, D. N., 240
Boatman, D., 364 Bray, V., 483 Bucella, F., 6
Bocca, E., 490 Brazer, S. R., 132 Buchman, A. S., 252
Bode, D. L., 544 Bregman, A. S., 437, 438 Buchsbaum, B., 275, 276
Bode, J., 395 Brengelman, S. U., 309 Buck, R., 108
Boettcher, F. A., 510, 529 Brenner, C., 456 Buckingham, Hugh W., 363–365
Bohne, B. A., 510 Breslau, L. D., 240 Buckwalter, P., 162, 185, 328, 370, 399, 402
Boike, K. T., 488 Bretherton, I., 375 Buder, Eugene H., 3–6, 164, 213
Boliek, Carol A., 176–179 Brett, E., 313 Budin, C., 238
Boller, F., 368 Breuleux, A., 303 Budinger, T., 241
Bolter, J. D., 255 Brewer, D., 72, 73, 74 Budnick, A. S., 10
Boltezar, I. H., 36 Brewer, D. W., 88 Bulen, J. C., 515
Bolton, P., 115 Brey, R., 563 Bull, D., 425, 494
Bondi, A., 117 Bricker, D., 353 Bull, G. L., 224
Bone, R. C., 490–491 Brickner, M., 272 Buller, H. R., 518
Bonitati, C. M., 93 Bridgeman, E., 190 Bullmore, E. T., 546
Bonvillian, J., 341 Bridger, M. M., 27, 50 Bumfitt, S., 260, 261
Boone, D. R., 49, 81, 82, 83, 88, 89 Briess, Bertram, 86 Bunting, G., 72
Boongird, P., 109, 367, 382 Bright, P., 350 Buonaguro, A., 126
Boonklam, R., 109, 367, 382 Brin, Mitchell F., 30–31, 38, 39 Burchinal, M. R., 135, 136, 358, 359
Booth, D., 370 Brinton, Bonnie, 161–163, 325, 372–374, Burda, Angela, 194–195
Booth, J., 368 399, 400 Burgard, M., 108
Booth, J. R., 313 Briscoe, J., 350 Burger, M. C., 529
Boothroyd, A., 421, 440, 451, 456, 541, 543 Britt, L., 440 Burger, Z. R., 36
Borders, D., 296 Broad, R., 475 Burgess, C., 240
Borenstein, P., 261 Broadbent, G., 314 Burgues, J., 72
Borg, E., 509 Broca, P., 122, 249, 262 Burkard, R., 529
Bornstein, S. P., 532, 533 Brockman, S., 475 Burke, M. K., 89, 90, 93
Borod, J. C., 108 Brodnitz, F. E., 186 Burnes, E. M., 424
Bortolini, U., 197, 331, 332 Brodnitz, F. S., 50 Burns, A. F., 389
Bose, A., 347 Brody, D. B., 495 Burns, E. M., 509, 515
Bose, W., 34 Broeckx, J., 258 Burns, F., 299
Boshes, B., 30, 126 Broen, P. A., 184 Burns, P., 502
Boswell, A. E., 425 Bronstein, A. M., 565 Burns, W., 508
Bothe, A., 182 Brooks, D. N., 506 Burton, M., 476
Bottari, P., 331 Brooks, J., 186 Burton, M. W., 368
580 Name Index
Burton, W., 390 Carter, A., 345, 346 Chmiel, R., 472, 548
Bush, C. N., 150 Carter, E. A., 424 Chobor, K. L., 109
Bushell, C., 271 Carter, J., 92 Chodzko-Zajko, W., 73, 91, 92
Butcher, P., 28, 50 Cartwright, J., 108 Choi, D. C., 519
Butinar, D., 434 Caruso, A. J., 114, 142 Chollet, F., 275
Butler, E. C., 502 Carver, W. F., 417 Cholmain, C. N., 141
Butterfield, S., 394 Casali, J. G., 499 Chomsky, N., 175, 354, 405
Butters, N., 240 Casati, G., 368 Chrisostomo, E. A., 258
Butterworth, B., 30, 363, 367 Casby, M. W., 135, 136 Christensen, M., 148
Buus, S., 413, 551, 554 Case, I., 210 Christian, D., 321
Byard, P. J., 185 Case, J., 82 Christman, S., 364, 365
Byers, V. W., 472 Case, J. E., 186, 187, 188 Christopher, K. L., 38
Byl, N. N., 90 Case, J. L., 49, 50, 164 Christopherson, L., 528
Bylsma, F. W., 292 Caselli, M., 331, 332 Christy, J. A., 293
Byma, G., 403 Casey, D. E., 186, 187 Chrostowski, J., 258
Byng, S., 235, 260, 261, 385 Cashman, M., 464 Chuang, E. S., 6, 66, 67
Byrne, D., 483, 484, 485, 488 Casper, Janina K., 7, 9, 50, 54–56, 72, 73, Chumpelik, D., 105
Byrne, E. J., 292 74, 81, 82, 88, 137, 224 Church, M. W., 494
Cass, S. P., 563, 564 Churchill, A., 422
C Casselbrant, M. L., 359 Chyle, V., 138
Cain, D. H., 375 Castellano, A., 146 Chynoweth, J., 328
Cairns, P., 394 Catlin, J., 459 Cicchetti, D., 396
Cala, S. J., 7 Catts, Hugh W., 153, 163, 309, 324, 327, Cicone, M., 389
Calandrella, A. M., 375 329–330, 399 Cinotti, L., 145, 248
Calculator, S., 113–114 Cavallotti, G., 315 Cipriani, P., 331
Caldarella, P., 398, 399 Cavanaugh, R., 560, 561 Clahsen, H., 331–332, 354, 355
Calderon, R., 423 Cawthorne, T., 563 Clark, A. N. G., 258
Caldwell, E., 89, 90 Cercy, S. P., 292 Clark, G. M., 448, 449, 453, 541
Callen, D. E., 6 Cervellera, G., 490, 491 Clark, M., 201
Caltagirone, C., 389 Chabhar, M., 322 Clark, W. W., 508, 510
Calvert, G. A., 546 Chafe, W. L., 301 Clarke-Klein, S., 153, 190
Camaioni, L., 375 Chaiklin, J. B., 476, 531, 532 Clarkson, M. G., 425
Camarata, S., 114, 174, 403 Chaloner, J., 225 Clausen, R. P., 148
Campanelli, P., 475 Chaltraw, W. E., 518 Cleary, M., 337, 338
Campbell, L., 159 Chambers, J., 295, 296 Cleave, P. L., 206, 379, 403
Campbell, R., 546 Chambers, J. A., 501 Clegg, L., 137
Campbell, T. F., 273, 349, 350, 359, 370, 371, Champlin, C. A., 501 Clement, J., 83
403 Chan, A. S., 240 Clements, G., 345
Campbell, Thomas, 135–136 Chan, R. W., 56, 89, 90, 96 Clemie, S., 30
Campisi, P., 36 Chance, G., 425 Clendaniel, R. A., 563, 565
Cancelliere, A., 381, 382 Chandler, D., 434 Clercx, A., 436
Cancilliere, A. E. B., 108 Chandler, J., 564 Cleveland, S., 475, 476
Canlon, B. E., 509 Chandler, M. J., 376 Clift, A., 127
Cantekin, E. I., 505 Channell, R. W., 164 Clifton, C., 271
Cantrell, J., 298 Chantraine, Y., 303, 390 Clifton, R. K., 425, 426
Cantwell, D. P., 161 Chapell, R., 133 Close, L., 133
Caplan, David, 231–232, 241, 250, 262–265, Chapman, R. S., 288, 290, 324, 327, 396 Close, L. G., 41
363, 368, 384, 387, 405 Charles, D., 47 Clough, C. G., 293
Capone, N., 395 Charlip, W. S., 529 Cluytens, M., 394
Cappa, S. F., 240, 315, 316 Charron, S., 201, 203 Coates, V., 133
Caramazza, A., 231, 235, 236, 237, 238, 239, Chase, C., 359 Coats, A. C., 464
250, 316, 383, 384, 405 Chase, T. N., 241 Cochran, P., 164, 165
Carding, P., 49, 50 Chater, N., 394 Code, Chris, 260–261, 367
Cardoso-Martins, C., 396 Chatterjee, M., 428 Coelho, C. A., 303
Carey, A., 336, 337, 422 Chaundhuri, G., 133 Coerts, M. J., 225
Carey, S., 312 Cheesman, A. D., 139 Co¤ey-Corina, S. A., 313
Carhart, R., 451, 475, 490, 532, 548 Chen, S., 266, 268 Co¤man, D., 440
Cariski, D., 165 Chen, W., 306 Coggins, T., 289
Carlborg, B., 68 Cheng, A. K., 456 Cohen, D., 116, 117
Carlomagno, S., 235 Cheng, Li-Rong Lilly, 167–169 Cohen, H., 122
Carlyon, R. P., 551 Cherry, S. R., 305 Cohen, L. T., 448
Carney, A., 336 Chertkow, H., 240 Cohen, M., 38, 39, 543
Carney, E., 458, 459, 460 Cheslock, Melissa, 277–278 Cohen, M. M., 478
Caro-Martinez, L., 116 Cheyne, H. A., 37 Cohen, N. J., 400
Carpenter, L. J., 217 Chhetri, D. K., 43 Cohen, N. L., 455, 541
Carpenter, P. A., 272, 311, 313, 349, 350, Chial, Michael R., 497–499 Cohen, R., 208
384 Chiapello, D. A., 391 Cohen, R. A., 145, 272, 273
Carpenter, S., 246 Chiat, S., 116, 189, 190, 403 Cohen-McKenzie, W., 189
Carr, E. G., 373 Chijina, K., 92 Cohenour, R. C., 145
Carr, L., 201 Chilosi, A., 331 Cohn, E. S., 478
Carr, L. J., 313 Chin, S. B., 199 Cokely, C., 528
Carrell, T. D., 439, 440 Ching, T. Y. C., 472, 483 Colangelo, L. A., 133
Carstens, C., 186, 187 Chipchase, B. B., 330, 396 Colborn, D. K., 135, 136, 359
Name Index 581
Colburn, H. S., 491 Craig, H. K., 159, 298, 318, 319, 328, 373, Danhauer, J. L., 12
Cole, K., 379 399 Daniel, B., 45, 48, 129
Cole, L., 318 Crain, S., 231, 267 Daniel, R., 130
Cole, P., 159 Crain-Thoreson, C., 379 Daniel, S. E., 30
Coleman, R. F., 3, 80 Crampin, L., 157 Daniele, A., 240
Coleman, T., 297 Crandell, Carl C., 442–443 Daniels, S., 132
Coles, R. R., 512, 532 Cranen, B., 9 Danilo¤, R. G., 19, 156, 171, 216, 472
Coles, R. R. A., 500, 501, 503 Crary, M. A., 122, 143, 201 Danly, M., 367, 382
Collard, John, 538, 539 Crawford, A., 418 Dardarananda, R., 368
Collet, L., 512 Crawley, A., 132 Darley, F. L., 30, 31, 101, 104, 105, 127, 216,
Collier, A., 359 Crerar, M. A., 256 250, 256, 258
Collier, B., 114 Crevier, L., 78 Darwin, C. J., 438
Collins, A. A., 425 Crews, D., 97 Dau, T., 552
Collins, D. L., 306 Criner, G. J., 226 Dauman, R., 557
Collins, M. J., 105 Crippens, D., 257 Davalos, A., 132
Colombo, A., 235 Critchley, E. M., 30 Davidson, B. J., 273
Colsher, P. L., 109, 386, 389 Cronan, T., 370, 371 Davidson, M. J., 444
Coltheart, M., 234, 237, 238 Croot, K., 190–191, 240 Davidson, S. A., 487
Colton, R. H., 7, 8, 9, 50, 72, 73, 74, 81, 82, Cross, J., 200–201 Davies, W. E., 557
88, 137, 224 Cross, T. G., 422 Davis, A., 556
Comeau, L., 279, 280 Crosson, B., 273, 275, 315, 316 Davis, B., 120, 211, 286
Compton, W. C., 359 Crowe Hall, B. J., 157 Davis, Barbara L., 121–123, 142, 143
Conan, M., 261 Crowson, M., 395 Davis, D. L., 425
Cone-Wesson, B., 436, 494, 513, 515 Croxson, G., 34 Davis, G. A., 284
Conley, P., 240 Cruickshanks, K. J., 208, 527, 559, 560 Davis, H., 508, 536, 548
Conlin, K., 341 Crumley, R. L., 41, 42, 43 Davis, J. N., 258
Connell, P. J., 370, 371, 379 Crutcher, Diane, 290 Davis, K. R., 101, 264
Connolly, J. J., 398 Crutchley, A., 281, 373 Davis, L. A., 487
Constable, A., 190 Cruz, R., 361, 362 Davis, M., 205
Conti-Ramsden, G., 281, 372, 373, 378 Cruz, R. F., 240, 292 Davis, P. J., 60, 61
Conture, E. G., 181, 334, 335 Crystal, D., 213 Davis, R. I., 510
Cook, E. H., Jr., 117 Csepe, V., 368 Davis, S. B., 6
Cooksey, F. S., 563 Cucchiarini, C., 151 Davis, T., 515
Cooper, D., 134 Culatta, R., 202 Davis, W. E., 177
Cooper, G., 126 Cullen, J., 458 Davis-Penn, W., 472
Cooper, M., 82 Culp, D., 114 Dawson, D. V., 258
Cooper, W. E., 109, 367, 386, 389 Cumley, G., 114 Dawson, G., 114
Coppens, P., 252 Cummings, J. L., 292–293 Day, B. L., 31
Coppola, M., 342 Cummiskey, C., 515 Day, L., 350
Corbin-Lewis, K., 56, 86, 90, 96 Cunningham, J., 440 Deal, J. L., 104, 186, 250
Corcoran, R., 387 Cupples, L., 384 Dean, E. C., 154, 205, 256
Cord, M. T., 481 Curlee, Richard F., 181, 220–222, 333 Dean, M. R., 472, 473
Cornelisse, L., 483 Curry, E. T., 11 Dean, M. S., 502
Cornell, R., 6 Curtis, S., 258 Deary, I. J., 29
Cornell, S. H., 244 Curtiss, S., 184, 400, 402 Deas, A., 422
Cornell, T. L., 384 Cushing, E. M., 509 Debachy, B., 250
Corrin, J., 190 Cutler, Anne, 392–394 Deberdt, W., 258
Cort, R., 104 Cutler, N., 241 deBleser, R., 243, 264, 267
Cortese, E. E., 175 Cymerman, E., 370, 371 DeBodt, M. S., 6
Coslett, H. B., 382, 387 Cyrus, C. B., 38 de Boer, E., 526, 527
Costello, A. de L., 237 DeBonis, D., 327
Cotton, J., 222 D Debose, C. E., 318
Cotton, S., 483 Dacakis, G., 224 de Boysson-Bardis, B., 142
Coulter, D. C., 544 Dagenais, P. A., 157 de Bruin, M. D., 225
Coulter, D. K., 113, 337, 421, 422, 515 Dahm, J. D., 43 Debruyne, F., 73
Councill, C., 396 Dailey, A., 201 Dechongkit, S., 109, 367, 382
Countryman, S., 30–31, 89, 130 Dalalakis, J., 331–332 Decoster, W., 73
Coupal, J., 241 Dale, P., 336, 379, 397 DeCuypere, G., 224
Couropmitree, N. N., 527 Dale, P. A., 370 De Deyn, P. P., 258
Court, D., 122 Dale, P. S., 185, 371, 376, 379 DeFao, A., 130
Covell, W., 508, 510 Dallos, P., 417, 511, 524 De Filippo, C. L., 541
Cowan, L. A., 68 Dalmaire, C., 250 de Gelder, B., 393, 394
Cowan, N., 350–351 Dalton, C., 34 Dehaut, F., 384
Cowan, R. S. C., 541 Damasio, A. R., 241, 246, 249, 250, 263–264, Deidrich, W. M., 122
Cowin, K., 261 275 Dejarnette, G., 158
Cowland, J. L., 563 Damasio, H., 109, 244, 246, 264, 275, 367 Dejmal, A. E., 206
Cox, N., 180, 181, 221 D’Amato, B., 96 Dejonckere, P. H., 20, 36, 78
Cox, N. B., 77 D’Amato, C., 31 de Jong, J., 331
Cox, N. J., 185 D’Amico, B., 90 De Juan, M., 72
Cox, R., 483, 485 Damirjian, M., 60 DeKosky, S. T., 241
Cox, R. W., 525 Damste, P. H., 10 de Krom, G., 79
Cox, Robyn M., 480–481 Dancer, J., 535 Delaere, P., 73
Crago, Martha, 184, 280, 298, 331–332 Daneman, M., 350 de la Fuente, M. T., 211
582 Name Index
Delaney, B., 194 DiMattia, M. S., 89 Duckert, L. G., 518

Delaney, T., 350 Ding, D. L., 509 Duckworth, M., 150
de la Sayette, V., 241 Dinnsen, D., 345 Duclaux, R., 512
Delcelo, R., 33 Dinnsen, D. A., 174, 199 Dudley, H., 473
Delenre, P., 436 Dionne, G., 371 Du¤y, Joseph R., 50, 60, 101, 108, 126–128,
Deleyiannis, F. W., 38 Dirks, D. D., 501, 502, 549 130, 145, 246, 247, 388, 389
Delgutte, B., 417 Ditelberg, J. S., 56 Du¤y, R. J., 303
Delhorne, L. A., 541 Divenyi, P. L., 109 Dukette, D., 400
Dell, G., 364 Dix, R., 467 Dulon, D., 519
Deloche, G., 255 Dixon, R., 475 Dunca, P. W., 258
DeLong, E. R., 244 Dixon-Ernst, C., 499 Duncan, G. W., 101, 252, 264
Demorest, M. E., 543, 544 Dobie, R. A., 508, 509, 510, 557 Duncan, P. W., 564
Dempsey, J. J., 541 Docherty, G., 49, 50 Dune, P. E., 226
Demuth, K., 345, 394 Doctor, R. F., 224 Dunn, C., 198, 213
DeMyer, M., 117 Dodd, B., 120, 122, 142, 143, 189, 190, 191, Dunn, D. D., 508
Denenberg, L. J., 500, 501 218–219, 336 Dunn, J. W., 505
Dennis, M., 312, 313 Dodd, B. J., 141, 281 Dunwoody, G. W., 146
Dennis, R. E., 309 Dodd-Murphy, J., 495 Duong, Anh, 302–304
Den Os, E., 345 Dodds, B., 43 Durieux-Smith, A., 430
Denoylle, F., 478 Dodds, L., 417 Durkin, L., 39
de Paiva, A., 38, 39 Dodds, L. W., 509 Durlach, N. I., 419, 472, 473, 541
DePascalis, C., 459 Doerfler, L., 476 Durrand, V. M., 373
De-Pascalis, V., 552 Doershuk, C., 176, 179 Durrant, J. D., 430, 462
Deputy, P. N., 156 Doggett, D., 133 Durson, G., 92
Derebery, J., 434 Dogil, G., 102 Duyao, M. P., 427
Derlacki, E. L., 491 Doherty, T., 92 Dworkin, J. P., 45, 48, 105, 202
Derouesne, J., 238 Dolan, T., 424 Dye, L. M., 455
Desai, M., 208 Dollaghan, Christine A., 135–136, 350, 359, Dyer, J. J., 493, 494
Desautels, D. A., 226 370, 371, 403 Dykens, E., 115
Desgranges, B., 241 Dolly, J. O., 38, 39 Dykstra, K., 197
Desjardins, R. P., 47 Dommergues, J.-Y., 394 Dzul, A., 34
Desmond, D. W., 292–293 Don, M., 433
D’Esposito, M., 240, 241 Donahue-Kilburg, G., 285, 286 E
Destaerke, B., 224 Donaldson, G. S., 551 Eadie, Tanya L., 10–12
de Swart, B., 122 Donlan, C., 184, 185, 350, 400, 402 Eagles, E., 495–496
Detre, J., 241 Donnellan, A., 116 Eales, C., 261
Detweiler-Shostak, G., 563 Doolan, S. J., 157 Earle, A. S., 21
Deuchar, M., 279, 280 Döpke, S., 280 Easton, D. L., 206
Deutsch, S., 101 Dorkos, V. P., 472 Ebendal, T., 258
Dev, M. B., 425 Dorn, P. A., 513, 515 Eckert, R. C., 38
de Villiers, J. G., 337 Doupe, A. J., 275 Eddington, D. K., 449
de Villiers, Peter A., 336–338, 339–343 Dove, H., 56, 86, 90, 96 Eddins, David A., 553–555
DeVita, C., 241 Dow, R. S., 315 Eddy, W. F., 272, 311, 313
De Vito, G., 73 Downhill, J. R., Jr., 258 Edelman, S. W., 309
Devlin, J. T., 240 Downs, J. A., 47 Eden, G. F., 329
Devous, M. D., 258 Dowty, D., 270 Edgerton, B., 208
Dewart, H., 231 Doyle, M., 111 Edgerton, B. J., 427
De Weerdt, W., 564 Doyle, Patrick J., 101–103, 104–106 Edgerton, M. T., 224
Dewey, R. B., 126 Doyle, Philip C., 10–12, 45–48, 80–81 Edmundson, A., 143
DeWitt, K., 76, 77 Doyle, T. N., 532 Edwards, B., 481
Dhib-Jalbut, S., 433 Drachman, D., 240 Edwards, Brent, 420
Diamond, A., 142 Drager, K., 113 Edwards, J., 184, 350
Diamond, B., 133 Drake, A., 476 Edwards, Mary Louise, 150–152, 199, 205,
Diaz, E. M., 139 Draper, M., 7 213
DiChiro, G., 241 Dravins, C., 108 Edwards, S., 141
Dick, F., 231 Dreschler, W. A., 518 Edwards, V. T., 329, 402
Dick, J. P., 31 Dressler, W. U., 267 Egger, J., 29
Dickey, S. E., 216 Driscoll, C., 496 Ehrlich, J. S., 303
Dickson, K., 286 Driver, L. E., 177, 178 Eichen, E., 342
Diedrich, P., 91 Dromey, C., 31, 39 Eigsti, I.-M., 545
Diedrich, W., 216 Dromi, E., 332 Eilers, R. E., 151, 370, 425
Diedrich, W. M., 10, 11 Dronkers, Nina F., 60, 102, 231, 249–251, Eimas, P., 425
Diefendorf, Allan O., 520–522 252, 253, 367 Eisele, J. A., 312, 396
Diez, S., 72 Drucker, D. G., 66 Eisenberg, H. M., 396
Di Fabio, R. P., 564 Drummond, S. S., 367 Eisenberg, L. S., 427, 455
Diggs, C. C., 68 Druz, W. S., 473 Eisner, M. P., 137
DiGiovanni, D., 502 Duara, R., 241 Ekelman, B. L., 184, 312
Dikeman, K. J., 228 Dubno, Judy R., 510, 527–530, 549 Elbard, H., 283
Dikkes, P., 145 DuBois, A. B., 47 Elbaum, B., 309
DiLavore, P. C., 117 Dubois, B., 304 Elberling, C., 488
Dilger, J. P., 417 Dubois, S., 296 Elbert, M., 198, 199, 205, 214
Dillon, Harvey, 419, 420, 481, 482–485, Duchan, J., 116, 372 Eldert, E., 548
488 Duchek, J. M., 240 Eldredge, L., 494
Name Index 583
Eley, T., 185 Fahn, H., 73 Fischel-Ghodsian, N., 518

Eley, T. C., 371 Fahn, S., 30, 38 Fischer, H. B., 126
Elias, A., 28, 50 Fairbairn, A. F., 292 Fischer, N. D., 176, 177
Eliason, M. J., 163 Fairbanks, G., 78, 472 Fischer, S., 342
Elidan, J., 469 Fakler, B., 511 Fischler, I., 272, 273
Elkins, E., 208 Falk, R. E., 518 Fish, S., 280
Elkins, E. F., 527, 528 Fant, G., 6, 63, 64, 76, 225 Fisher, E., 73
Elkonin, D. B., 154 Farmer, A., 171, 173 Fisher, H. B., 10, 30
Elliot, L. L., 548 Farmer, M., 399 Fisher, K. V., 55
Elliott, M., 546 Farrabola, M., 243–244 Fisher, L., 338, 449, 455
Elliott, M. R., 307 Farren, D. C., 377 Fisher, S., 402
Ellis, A. W., 233, 256 Farr-Whitman, M., 296 Fisher, S. E., 185
Ellis Weismer, S., 287, 327, 350, 403 Farwell, C. B., 174 Fisher, S. G., 10
Elman, R., 261, 283, 284 Fassbender, L., 116 Fitch, J., 20
Elpern, B. S., 502 Fassbinder, Wiltrud, 388–391 Fitch, J. L., 164, 165
El Rafie, E. A., 556 Fastl, H., 3, 551 Fitch, W. Tecumseh, 56–58
Emami, A. J., 38 Fattu, J. M., 57 Fitzgerald, G., 469
Emanuel, D. C., 358 Faust, M., 231 Fitzgerald, T. S., 515
Emerich, K., 73 Fausti, S. A., 518 Fitzgibbons, P. J., 554
Emery, O. B., 240 Fawcett, A. J., 329 Fitzpatrick, P. M., 232
Emery, P., 361, 362 Fawcett, D. W., 523 Flanagan, J. L., 3, 71, 75, 77
Emmorey, K., 382 Fayad, J. N., 427, 448 Flashman, L. A., 241
Emmory, K. D., 109, 340 Fazio, B. B., 370, 371 Flax, J. D., 184, 258
Emonds, J. E., 354 Feagans, L., 359 Fletcher, Harvey, 413, 414, 415, 416, 417,
Enderby, P., 126, 258, 378 Federo¤, J. P., 108 418, 538, 539
Endicott, J. E., 529 Fedio, P., 241 Fletcher, J. M., 184, 329, 396
Engel, J. A. M., 135 Feeney, D. M., 257, 258 Fletcher, P., 185
Engen, E., 337, 343 Feinmesser, M., 429 Fletcher, P. C., 387
Engen, T., 337 Feldkeller, R., 551 Fleurant, J., 348
Engmann, D., 198 Feldman, A., 560 Flevaris-Phillips, C., 208
Epstein, L., 494 Feldman, Heidi M., 135, 136, 311–313, 359, Flexer, C., 442
Epstein, R., 27, 50 370, 371, 396 Flexner, S. B., 223
Eran, O., 456 Feldman, L., 109 Flipsen, P., 135, 136
Erber, N., 439 Feldmann, H., 556 Flipsen, P., Jr., 218
Erber, N. P., 543 Felici, F., 73 Flock, A., 509
Erdman, S., 208 Felsenfeld, S., 184, 185 Flock, S., 337
Erdman, S. A., 440 Fennell, A., 89 Florance, C. L., 104
Erhard, P., 275 Fenson, J., 252, 253, 312 Florentine, M., 413, 551, 554
Erickson, J. G., 322 Fenson, L., 336, 370, 371, 376, 397 Flory, J., 241
Erickson, R. J., 273 Fenwick, K. D., 425 Flowers, C. R., 381, 387, 389
Eriksdotter-Jonhagen, M., 258 Ferguson, C. A., 174 Flowers, R., 33
Eriksson, J., 557 Ferguson, N. A., 49 Flynn, F., 293
Erkinjuntti, T., 240, 292–293 Ferguson, R., 464, 465 Fobel, S., 552
Erre, J. P., 519 Ferketic, M., 283 Fodor, J. A., 270
Erriondo Korostola, L., 267 Ferlito, A., 33 Fogassi, L., 276
Erting, C., 342 Fernandes, M. A., 438, 490 Folger, W. N., 127
Ertmer, D., 456 Fernandez-Villa, J., 145, 146 Folsom, R. C., 424, 513, 515, 521
Esclamado, R., 43 Ferrand, C. T., 36 Folstein, M., 240
Escobar, M. D., 184, 324 Ferraro, John A., 461–465 Fombonne, E., 117
Espesser, R., 439, 440 Ferrier, E., 205 Footo, M. M., 136
Estabrooks, N., 315 Ferro, J. M., 243, 252 Forbes, M. M., 244
Esteban, E., 495 Ferry, P. C., 122 Ford, C. F., 49
Estill, J., 96 Fes, S., 50 Ford, C. N., 29, 38, 50, 92
Estivill, X., 478 Festen, J. M., 487 Ford, C. S., 308
Etard, O., 276 Fetter, M., 565 Forman, A. D., 139
Ettema, S., 134 Fex, F., 50 Formby, C., 551, 555
Evans, A. C., 60, 275 Fey, Marc E., 175, 206, 309, 327, 329, 330, Forner, L. L., 143
Evans, C. A., 306 373, 378–380, 403 Forrest, K., 123
Evans, F. J., 38 Fey, S. H., 325 Forrest, S., 277
Evans, J., 328, 350, 399 Fichtner, C. G., 108 Fortune, T. W., 488
Evans, J. L., 164 Figueroa, R. A., 321 Foster, J., 546
Evans, J. S., 211 Figura, F., 73 Foster, N. L., 31, 241
Evans, T., 563 Fikret-Pasa, S., 483 Foster, P. K., 512
Everitt, W. L., 472 Fillenbaum, G. G., 292 Foundas, A., 132
Ewing-Cobbs, L., 396 Finch, C., 73, 92 Fourcin, A., 170
Eyles, L., 295, 299 Finestone, H., 133 Fournier, J., 476
Finitzo-Hieber, T., 359, 442 Fournier, M. A., 462, 463, 464
F Fink, R. B., 232 Fowler, Cynthia G., 429–433
Fabre, P., 24 Finkelstein, S., 101, 264 Fowler, E., 415
Fabry, L. B., 436 Finlay, C., 299 Fowler, S. M., 178
Facer, G. W., 436 Finlayson, A., 283 Fowler, T. W., 478
Fagundes, D., 297 Finley, C. C., 449 Fox, A., 219
Fahey, P. F., 417 Fischel, J. E., 359 Fox, Cynthia, 30–31, 89
584 Name Index
Fox, K. E., 518 G Gersten, R., 309

Fox, P. T., 222 Gabreels, F., 122, 202 Gerstenberger, D. L., 260
Fozard, J. L., 527 Gabrielsson, A., 480, 487, 488 Gertner, B. L., 399
Frackowiak, R. S., 275, 317 Gacek, M. R., 529 Gescheider, G. A., 425
Fradis, A., 273 Gaddlie, A., 244 Geschwind, N., 250, 263, 264, 275, 315, 460
Fradis, M., 518 Gagliardi, M., 491 Gesi, A., 543
Fragassi, N. A., 146 Gagné, J.-P., 543, 544 Gfeller, K., 440
France, T., 289 Gagnon, D., 363–364 Gherini, S., 519
Franceschi, S., 45 Gainotti, G., 240, 389 Giangreco, M. F., 307, 308, 309
Francis, H. W., 456 Galaburda, A., 364 Gianotti, G., 186, 364
Francis, T., 73 Galalgher, J. J., 322 Gibb, A. G., 518
Francois, C., 348 Galambos, R., 508 Gibbard, D., 379
Frank, T., 532 Galasko, D., 292 Gibbon, F. E., 143, 157, 170–171, 189
Frankel, S. H., 77 Gall, C., 563 Gibson, D. M., 541
Franklin, D. J., 515 Gallagher, H. L., 387 Gibson, W. P. R., 464
Franks, J. R., 487, 499 Gallagher, T. M., 372, 373 Giedd, J., 57
Fraser, S., 315 Gallese, V., 276 Gierut, Judith A., 157, 174–175, 191, 198,
Frates, R. C., Jr., 177 Gallup, G. G., 387 199, 214
Frattali, C., 283 Galt, Rogers H., 539 Gilbert, D. G., 398
Frauenfelder, U. H., 394 Galvin, J. E., 30 Gilbert, H. R., 24
Frazer Sissom, D. E., 57 Galvin, K. L., 541 Gilbert, L. M., 493
Frazier, L., 271, 384 Gandour, J., 109, 174, 364, 367, 368, 381, 382 Gildersleeve, C., 120
Frederiksen, Carl, H., 303 Ganguley, B., 243 Gildersleeve-Neumann, C., 211
Freebairn, L., 184 Gans, C., 57 Gill, C., 34
Freed, D. B., 243 Gans, D., 521, 522 Gillam, R. B., 153, 286, 309, 324, 350–351,
Freedman-Stern, R., 302 Gans, K. D., 521, 522 380
Freeman, B., 114 Gantz, B., 337, 338 Gillberg, C., 117
Freeman, M., 188 Gantz, B. J., 449, 455, 456 Gill-Body, K. M., 563
Freeman, M. H., 153 Garada, B., 241 Gillenwater, J. M., 424
Freese, P. R., 162, 402 Garcia, L. J., 261, 348 Gillespie, M., 38
Freilinger, J. J., 156, 157, 216 Garcia-Milian, R., 33 Gillis, R. E., 47
French, N., 419 Garde, S., 434, 555 Gillon, G. T., 154, 155
French, N. R., 538, 539 Gardner, G., 22 Gilman, S., 31
Fresnel-Elbaz, E., 78 Gardner, H., 243, 387, 389 Gilmore, R., 275
Frey, R. H., 518 Gardner, Mark B., 415, 416, 417 Gilpin, S., 261
Fria, T. J., 505 Garlitz, S., 84 Gilroy, J., 490
Friederici, A. D., 231, 267, 271, 384 Garn-Nunn, P., 297 Gimpel, G. A., 398
Friedl, W., 29 Garnsey, S. M., 384 Ginis, J., 481
Friedland, R., 241 Garrard, P., 240 Giraud, K., 439, 440
Friedman, E., 475, 476 Garrett, M., 270, 363 Girolametto, L., 287
Friedman, O., 387 Garrett, M. F., 384 Giroux, F., 387
Friedman, R., 240, 436 Garron, D. C., 252 Giurgea, C. E., 258
Friedman, R. A., 519 Gasparini, P., 478 Giustolisi, L., 240
Friedman, S. A., 433 Gatehouse, S., 480 Givens-Ackerman, J., 181
Friedmann, Naama, 231, 232, 355, 405–407 Gates, G. A., 518, 527 Gjedde, A., 275, 557
Friedrich, G., 29 Gathercole, S. E., 349, 350, 395 Glasberg, B. R., 554
Friel-Patti, S., 135, 136, 218, 309, 359, 373, Gau‰n, J., 6, 76, 78 Glasberg, C. J., 554
440 Gaulin, C., 349, 350 Glattke, T. J., 515, 533
Fristoe, M., 151, 216 Gavish, B., 83 Glaze, L. E., 36, 82, 85, 89, 90, 93, 225
Friston, K. J., 275, 317 Gazdag, G. E., 376 Gleason, J. B., 231
Frith, C. D., 387 Ge, X., 463 Gleitman, R. M., 554
Frith, U., 387 Gearing, M., 292 Glicklich, R. E., 22
Frölich, M., 6 Gee, K., 307 Glogowska, M., 378, 379
Fromkin, V. A., 221, 384 Geers, A. E., 337, 338, 421, 456 Glosser, G., 240, 241, 303
Fromm, D., 264, 283 Geier, L., 455 Glovsky, R. M., 22
Fronataro-Clerici, L., 178 Gelabert-Gonzalez, M., 145, 146 Gnosspelius, J., 541
Frost, B. J., 541 Gelfand, S. A., 531, 532 Gobert, C., 565
Frost, J. A., 525 Gelfer, M. P., 78, 224 Gobl, C., 6
Frost, L., 117 Gendeh, B. S., 518 Godaux, E., 565
Frutiger, S. A., 241 Genesee, Fred, 279–281 Godefroy, O., 250
Fry, T. L., 176, 177 Gengel, R. W., 508, 551 Godwin-Austen, R. B., 292
Fryauf-Bertschy, H., 337, 338, 453, 455 Gentil, Michèle, 169, 170 Goehl, H., 208
Fu, Q. J., 473 Gerber, S. E., 494 Go¤man, Lisa, 142–144, 346
Fuino, A., 563 Gerber, S. J., 458 Gold, M., 145
Fujiki, Martin, 161–163, 325, 372–374, 399, Gerdeman, B. K., 82, 93, 225 Goldberg, A. I., 226
400 Gerken, G. M., 551 Goldblum, M.-C., 267
Fukuda, S., 331 Gerken, LouAnn, 344–346 Golding, J., 359
Fukui, N., 354 Gerkin, K. P., 494 Goldin-Meadows, S., 342
Fuleihan, N., 897 German, D. J., 325 Goldman, M., 83
Fulton, R., 257 Gernsbacher, M. A., 231, 240 Goldman, M. D., 7
Funkenstein, H., 101, 264 Gerratt, Bruce R., 5, 31, 41, 42, 43, 77, 78– Goldman, P., 458
Furman, J. M., 563 79, 80 Goldman, R., 151, 216
Fus, L., 178 Gerritsma, E. J., 29 Goldman, R. E., 364
Name Index 585
Goldsmith, J. A., 175 Green, R. C., 164 Haarmann, H. J., 231

Goldstein, Brian, 210–212, 321–322 Green, S. E., 148 Habib, M., 439, 440
Goldstein, D. P., 487, 502 Greenberg, H. J., 544 Hack, N., 271
Goldstein, G., 116 Greenberg, M., 370 Hacki, T., 36
Goldstein, H., 191 Greenberg, M. T., 423 Hadar, U., 275
Goldstein, K., 263 Greene, M. C. L., 78, 83 Hadley, P. A., 373, 399
Goldstein, L. B., 258 Greene-Finestone, L., 133 Haendiges, A. N., 384
Göllner, S., 331, 354 Greenfield, D., 117 Hagberg, E. N., 483
Gonnerman, L. M., 240 Greenlee, R. G., 258 Hage, A., 334
Gonzalez, A., 211, 257 Greenspan, B. S., 246 Hagen, P., 72, 73
Gonzalez, M. M., 211 Greenspan, S., 117 Hagerman, B., 487
Gonzalez, V., 73 Greenwood, K. M., 95 Hagerman, R. J., 140, 141
Gonzalez-Huix, F., 132 Greindl, M. G., 258 Haggard, M. P., 307, 438
Gonzalez-Rothi, L. J., 145, 273 Gres, C., 439, 440 Hagiwara, H., 406
Goode, R., 145 Greven, A. J., 225 Hahne, A., 271
Goode, S., 117 Grewal, N., 148 Haig, A. R., 552
Goodglass, D., 311 Grice, H. P., 301 Hain, Timothy C., 518–520
Goodglass, H., 231, 238, 249, 264, 275, 302, Griepentrog, G. J., 425 Haji, T., 80
367, 368, 381, 384, 405, 459 Grievink, E. H., 135, 136, 359 Haley, K. L., 425
Goodman, M., 396 Gri‰n, R., 387 Hall, D., 546
Goodyear, B., 306 Grillone, G. A., 39 Hall, D. A., 307
Gopher, D., 272 Grimgy, N., 7 Hall, Joe L., 419
Gopnik, M., 184, 355 Grimm, D., 549 Hall, Joseph W., 489–492, 515, 554, 555
Gordon, B., 271, 364 Grimm, H., 331 Hall, Joseph W., III, 424, 425, 430, 438, 492,
Gordon, E., 552 Grimshaw, J., 270 533, 554, 555
Gordon, J., 382 Grober, E., 240 Hall, N. E., 162, 184
Gordon, J. L., 565 Grodzinsky, Yosef, 231, 232, 271, 354, 355, Hall, P., 330
Gordon, W. P., 347 384, 405, 406, 407–409 Hall, P. K., 121, 122, 143, 184
Gordon-Salant, S., 527, 554 Groenhout, C., 43 Hall, S. M., 122
Gore, C. H., 224 Grolman, W., 138 Hallam, R. S., 556
Gore-Hickman, P., 72, 73 Gröne, B., 171 Halle, J. W., 154
Gorelick, P. B., 107 Grose, J. H., 424, 425, 490, 491, 507, 554, Halle, M., 356
Gorga, Michael P., 513, 515–517, 533 555 Hallet, M., 31
Gorlin, R. J., 478 Grosjean, F., 120 Hallet, P. E., 544
Gornbein, J., 293 Grossi, D., 146 Halleux, J., 565
Gosy, M., 368 Grossman, A., 83 Halliday, M. A. K., 300
Gottlieb, J., 364 Grossman, E., 83 Hallmayer, J., 434
Gough, P. B., 330 Grossman, Murray, 240–241 Hallowell, B., 164
Gould, J., 209 Grossman, R., 111 Hallpike, C. S., 467, 469
Gould, W. J., 64, 80 Grossmann, M., 231 Halpin, C. F., 424, 434
Goulding, P. J., 246, 247 Gruber, F. A., 215, 216, 217 Halvorson, K., 132
Goulet, P., 303, 388, 389, 390 Grugan, P. K., 240 Halwes, T., 459, 460
Goumnerova, L. C., 146 Grunwell, P., 120, 189, 198 Hamaker, R. C., 139
Gow, D. W., Jr., 368 Grywalski, C., 21, 22, 86 Hamalainen, M., 305
Gracco, V. L., 105 Gubbay, S. S., 126 Hamdan, A. L., 89
Grade, C., 258 Gudmundsen, G. I., 500, 501 Hamdy, S., 132
Grady, C., 241 Guénard, D., 304 Hamernik, R. P., 509, 510
Gra¤-Radford, N. R., 109, 246, 386, 389 Guest, C. M., 179 Hammarberg, B., 78
Grafman, J., 304 Guidotti, M., 315 Hammeke, T. A., 525
Graham, J., 476 Guillaume, S., 348 Han, J., 83
Gramss, T., 6 Guiod, P., 208, 209 Han, Y., 39
Granet, R. B., 186, 187 Guitar, B., 130 Hand, L., 156, 157, 214, 216
Grant, D., 544 Guitart, J., 321 Handler, S. D., 178
Grant, F., 484 Guitjens, S. K., 518 Haney, K. R., 184
Gratton, D. G., 47, 48 Gunther, V., 49, 50 Hanin, L., 451
Gratton, M. A., 529 Gunzburger, D., 225 Hankey, B. F., 137
Gravel, J. S., 359, 425 Gupta, S. R., 258 Hanna, J. E., 231
Graves, M. C., 43 Gur, R. C., 146, 382 Hannequin, D., 388, 389
Gray, C., 118 Gur, R. E., 146, 382 Hans, S., 20
Gray, C. D., 423 Gussenbauer, C., 138 Hansch, E. C., 293
Gray, G. A., 480 Gustafson, L., 241 Hansel, S., 108
Gray, J., 30 Gutierrez-Clellen, V. F., 281, 321, 322 Hansen, H. S., 46
Gray, L., 132 Gutmann, M., 111 Hansen, L., 292
Gray, S., 86 Guttman, N., 473 Hanson, D., 31, 34, 55, 77
Gray, S. D., 56, 90, 96 Guyard, H., 256 Hanson, D. G., 80
Green, C. L., 136 Guyatt, G., 520 Hanson, Helen M., 6, 63–67
Green, D. C., 43 Guyer, B., 194 Hanson, M., 147, 148
Green, D. M., 453, 551 Guyette, T. W., 122 Hanson, M. J., 322
Green, E., 231, 302, 368 Guzmán, B., 321 Hanson, R., 34
Green, H. M., 148 Hanson, S., 72
Green, L., 295, 297, 298 H Hanson, W. R., 246, 316
Green, L. J., 318, 319, 320 Haak, N., 297 Hansson, K., 331
Green, R., 36 Haaparanta, M., 293 Happé, F., 387, 390
586 Name Index
Harbers, H. M., 154, 395 Heller, J., 559, 560 Hinson, J., Jr., 177
Hardcastle, W. J., 150, 157, 170–171 Heller, J. W., 506 Hirano, M., 17, 19, 24, 50, 76, 78, 91, 92
Hardy, J. C., 68 Hellgren, J., 488 Hirayama, K., 252
Har-El, G., 33 Hellman, S., 488 Hirosaku, S., 91
Hari, R., 305 Hellstrom, L. I., 529 Hirose, H., 91
Harmon, R. L., 257 Helm, N., 347 Hirsch, J. E., 507
Harris, A., 552 Helm-Estabrooks, Nancy, 232, 244, 258, 303, Hirsch, L., 402
Harris, D., 476 347, 361–362 Hirsh, I. J., 439, 451, 489, 548
Harris, E. H., 104 Helminski, Janet, 518–520 Hitselberger, William, 427, 428
Harris, F. P., 515, 516 Helmsley, G., 260 Hixon, T. J., 7, 9, 36, 37, 68, 82–83, 129
Harris, J. C., 397 Helms-Tillery, A. D., 275 Hnath, T., 451
Harris, M., 181, 220, 221 Hendershot, G. E., 529 Ho, A. K., 31
Harris, S., 85 Henderson, D., 509, 510 Höß, M., 96
Harrison, D. M., 177 Henderson, F., 359 Hoasjoe, D. R., 80
Harrison, E., 182 Henderson, F. W., 136 Hoban, E., 337
Harrison, L., 315 Henderson, J. L., 425 Hoberg, M., 563
Harrison, M. J. G., 252 Hendriks, J. J. T., 135 Hobson, R., 116
Harrison, R. V., 434 Heninger, G., 460 Hochberg, I., 541
Harrison, W. A., 515 Henricksson, N. G., 470 Hodak, J. A., 391
Harshman, R., 459 Henry, A., 299 Hodge, Megan M., 200–203
Hart, Betty, 290, 370, 371 Henry, J. A., 556 Hodges, J. R., 240, 247, 248
Hart, C. H., 162, 399, 400 Henry, S., 132 Hodson, Barbara W., 151, 153–155, 190,
Hart, J., 364 Henschel, C. R., 389 199, 204, 205, 211, 214
Hart, M. N., 246 Henson, A. M., 424 Hoehn, M., 89, 130
Hart, R. P., 248 Henson, D., 226 Hoekstra, T., 354
Hartl, D. M., 20 Herdman, S. J., 563, 564, 565 Hoen, B., 261
Hartman, C., 258 Herer, G. R., 476, 515 Ho¤, Erika, 369–371
Hartman, D. E., 127 Herholz, K., 253 Ho¤er, B., 258
Hartmann, M., 438 Hermann, B., 434 Ho¤-Ginsberg, E., 371
Hartung, J. P., 336, 376, 397 Hermann, M., 243, 260 Ho¤man, H., 20
Harvey, I., 359 Hernandez, R., 73 Ho¤man, H. T., 139
Hasan, R., 300 Hernandez, T. D., 257 Ho¤man, K., 151
Haskill, A., 206 Herrero, R., 45 Ho¤man, P., 216
Hassan, S. J., 20 Hersh, D., 261 Ho¤man, P. R., 205, 216, 379
Hassanein, R. S., 464 Hershberger, S., 298 Ho¤man-Williamson, M., 194
Hatano, G., 394 Herzel, H., 4, 57, 77 Ho¤meister, R., 341, 343
Hatch, D. R., 554, 555 Hesketh, L. J., 288, 290, 328, 350, 396 Ho¤ner, E., 275
Hausler, R., 491 Hess, M. M., 89, 90, 96 Hofstede, B. T. M., 231
Hausler, U., 61 Hesselink, J., 395 Hogan, C. A., 471
Hawk, S., 210 Hewitt, G., 58 Hogben, J. H., 329, 402
Hawkins, D. B., 480, 488 Hewlett, N., 189 Hogikyan, N. D., 22
Hawkins, J. E., 508 Heylen, L., 6 Hohenstein, C., 30
Hawksahw, M., 4, 92, 93, 94 Heyman, A., 292 Hoit, Jeannette D., 7, 83, 226–228
Hawley, J., 129 Hickin, J., 261 Holas, M., 132
Haxby, J. V., 241 Hickok, Gregory, 275–276, 305–307, 368 Holcomb, P. J., 272
Hayashi, T., 258 Hicks, B., 419 Holden, L. K., 451–453
Hayden, D., 202 Hicks, B. L., 472, 473 Holden, T. A., 451–453
Hayden, D. C., 104 Hicks, D., 43 Holland, Audrey L., 244, 246, 253, 264, 273,
Hayden, M., 458 Hicks, J. L., 122 283–284, 312, 347, 396
Hayes, C. S., 502 Hickson, L., 419 Holland, R., 34
Hayes, D., 521 Hieber, S. J., 455 Holland, W., 158
Haynes, W., 159, 297 Hiel, H., 519 Holliday, M. J., 563, 565
Hazell, J. W. P., 557 Hietanen, M., 293 Hollien, H., 36, 92
Hazenberg, G. J., 246 Higbee, L. M., 325, 399 Holling, L. C., 257
He, D., 417 Hilari, K., 260 Hollis, W., 395
Head, H., 263 Hildebrandt, N., 231 Holloman, A. L., 367
Hearts, S., 297 Hildesheimer, M., 208, 209 Holm, A., 120, 191, 219
Heath, S. M., 329, 402 Hill, B. P., 176, 179 Holman, B. L., 241
Hecox, K. E., 436 Hill, E., 402 Holmberg, E. B., 8, 9, 67, 68, 88
Heeneman, H., 80 Hill, E. L., 142, 143 Holmes, A., 440, 495
Heeschen, C., 406 Hill, M. A., 293 Holmes, C. J., 306
Hegberg-Borenstein, E., 261 Hill, N. S., 226 Holstege, G., 61
Hegde, M. N., 167, 192–193 Hill, R., 73 Holt, J., 421, 422
Heiki, R., 130 Hillel, A., 33 Holtas, S., 108
Heilman, K. M., 108, 145, 382, 387, 389 Hillenbrand, J., 3 Holte, L., 560, 561
Heimbuch, R., 221 Hillis, Argye E., 234, 235, 236–230, 364 Holtzapple, P., 104
Heinz, M. G., 551 Hillman, Robert E., 6–9, 10, 37, 67, 68, 72, Honda, K., 76
Heisey, D., 29, 49, 50, 93 88 Hong, W. K., 10
Heiss, W. D., 253 Hillyard, S. A., 272, 273 Honrubia, V., 467, 469, 470, 471
Heitmuller, S., 36 Hinchcli¤, L. K., 554 Hood, J., 184
Helasvuo, M.-L., 253 Hinchcli¤e, R., 556 Hood, J. D., 501, 502, 565
Helenius, P., 275 Hinckley, Jacqueline J., 260, 283–284 Hood, Linda J., 477–479, 533
Hellbom, A., 252 Hinojosa, R., 519 Hoodin, R., 24
Name Index 587
Hooper, J., 345 Hutchings, M. E., 491 Jakielski, K. J., 122, 142, 143
Hooper, S. R., 135, 136, 359 Hutchins, B., 133 Jakobson, Roman, 363
Hoover, B., 515, 516 Hutchins, G., 364, 381 Jakubowicz, C., 267, 331, 354
Hoover, C., 73 Hutchison-Clutter, M., 551 James, A., 481
Hoover, W. A., 330 Huttenlocher, J., 370, 371 James, C., 350
Hopper, T., 293 Hyde, M., 302, 533 James, H., 272
Horak, F. B., 563, 564 Hyde, M. L., 444, 464 Jamieson, D., 424, 483
Horiguchi, S., 170 Hyde, M. R., 231 Janosky, J. E., 312, 359, 370, 371, 396
Horii, Y., 11, 130 Hykin, J. L., 546 Jarema, G., 267
Horner, J., 132, 133, 244 Hyman, B. T., 241, 246 Jastrebo¤, M. M., 557
Horodezky, N. B., 400 Hynd, G. W., 324 Jastrebo¤, P. J., 556, 557
Horohov, J., 298 Jauhiainen, T., 368
Horowitz, J., 122 I Jausalaitis, G., 111
Horsley, I., 49, 50 Iansek, T., 31 Je¤erson, L. S., 177
Horsley, J. S., 41 Iavarone, A., 235 Jeng, Patricia S., 419
Horvath, B., 296 Iddings, D. T., 186 Jenkings, W., 439, 440
Horwitz, B., 241 Iglesia, G., 132 Jenkins, M. A., 145
Hoshino, I., 563 Iglesias, Aquiles, 210, 321–322 Jenkins, W., 552
Hosie, J. A., 423 Ijsseldijk, F. J., 544 Jenkins, W. M., 90
Hosoi, H., 476 Ilmonieli, R., 305 Jensema, C. J., 421
Hospers, W., 258 Im, N., 400 Jensen, J. K., 425
Hotz, G., 361 Indefrey, P., 364 Jerabek, P., 222
Hough, M. S., 387 Ingels, K., 359 Jerger, J., 472, 476, 490, 505, 521, 548, 559
House, A. O., 29 Ingham, J. C., 205, 222 Jesteadt, W., 515, 527
House, J., 150 Ingham, R. J., 222 Jewett, D. L., 429
House, J. W., 519 Inglis, A. L., 184, 384 Jiang, J. J., 55, 88
House, William F., 427, 455 Ingram, David, 175, 189, 196–197, 198, 204 Joanette, Yves, 109, 302–304, 387, 388, 389,
Houtgast, T., 487, 539 Ingvar, D. H., 108 390, 391
Hoversten, G., 475 Ireland, C., 261 Johannsen, H. S., 334
Hovre, C., 217 Irigaray, L., 302 Johannsen, P., 557
Howard, D., 34, 317, 350 Irish, J. C., 39 Johannsen-Horbach, H., 243
Howard, M. T., 487 Irwin, R. J., 554 Johansson, B., 473
Howell, J., 154, 205 Isaacs, E., 311, 313 Johansson, B. B., 258
Howell, P., 335 Isaacson, T., 325 Johns, D. F., 105
Howie, P., 221 Isensee, C., 253 Johnson, A., 21, 86, 297
Howlin, P., 116, 117, 162 Ishii, T., 433 Johnson, B. M., 201, 202
Hoyer, N., 224 Ishikawa, T., 130 Johnson, C. J., 162, 324, 400
Hristova, A., 434 Ishiyama, A., 519 Johnson, D. L., 508, 509
Hsieh, L., 364, 381 Ishizaka, K., 3, 75, 77 Johnson, E. K., 394
Hsueh, K. D., 510 Ishizu, K., 557 Johnson, G., 328
Hu, B. H., 509 Ishpekova, B., 434 Johnson, K. A., 241
Huber, S. J., 293 Israelsson, B., 487 Johnson, R., 342
Huber, W., 232, 243, 244, 253, 258 Isshiki, N., 7, 8, 9, 64, 77, 83, 130 Johnson, R. P., 46
Hudak, M. K., 564 Ito, M. R., 77 Johnson, S. A., 240
Hudson, L., 246 Izdebski, K., 42, 43 Johnson, S. C., 241
Hudson, P., 286 Johnson, V., 299
Huebner, W. P., 565 J Johnson-Morris, J. E., 422
Hug, G., 21 Jabeen, S., 292 Johnston, J., 396, 403
Hughes, A. J., 30 Jackson, C., 54 Johnston, J. R., 324
Hughes, D., 327 Jackson, C. A., 246 Johnston, P., 439, 440
Hughes, D. L., 379 Jackson, C. L., 54 Johnston, R. G., 148
Hughes, E., 240, 241 Jackson, D. W., 423 Jones, C., 543
Hughes, J. W., 551 Jackson, H. H., 263 Jones, E. V., 385
Hughes, L., 458 Jackson, H. J., 308 Jones, H. A., 376
Hughes, M. T., 309 Jackson, J., 299 Jones, J. L., 257
Humes, L. E., 481, 483, 508, 509, 528 Jackson, K. S., 41, 42, 43 Jones, M., 328
Humpel, C., 258 Jackson, S., 298 Jones, R. O., 176, 177
Humphrey, D., 34 Jackson, S. C., 359 Jones, S. L., 494
Humphreys, G. W., 317 Jackson-Menaldi, C., 34 Jones, T., 257
Humphreys, J. M., 205 Jacob, R. G., 563 Jones-Rycewicz, C., 563
Humphries, C., 275, 276 Jacobs, B. J., 232 Jongman, A., 440
Humphries, T., 340 Jacobs, D. H., 145 Jonkees, L., 491
Hung, D. L., 268 Jacobs, F., 518 Jonsson, K. E., 490
Hunt, C. E., 177 Jacobson, B., 86 Jordan, L. S., 121, 122, 143, 184
Hunter, L. L., 462, 463, 464 Jacobson, B. H., 21 Jordan, N., 370
Hurst, J. A., 185 Jacobson, G., 21, 86 Jorgenson, Christine, 224
Hurtig, H. I., 231 Jacobson, G. P., 433, 470, 557 Josephson, J., 541
Hurtig, R. R., 472 Jacobson, J., 430 Juan-Garau, M., 280
Husain, K., 518, 519 Jacobson, J. T., 493, 494, 563 Jung, M. D., 515
Husarek, S., 449 Jaeger, J. J., 266 Jung, T. T., 519
Hussain, D., 515, 516 Jaeger, R. P., 472 Jürgens, U., 59, 60, 61
Hussey, J. D., 177 Jaeschke, R., 520 Jusczyk, P. W., 394, 424, 425
Hustad, K., 111 Jagust, W. J., 240 Just, M. A., 272, 311, 313, 349, 384
588 Name Index
K Kelley, P. M., 478 Kirchner, J. A., 15

Kaga, K., 252 Kello, C., 271 Kirchner, L., 20
Kagan, A., 284 Kelly, A. S., 440 Kirk, K., 208, 337, 338
Kahan, T., 252 Kelly, D., 297 Kirk, K. I., 455, 456
Kahana, E., 126 Kelly, D. J., 395 Kirkpatrick, C., 241
Kahane, J., 73 Kelly, E., 180, 221 Kirkpatrick, S. W., 439
Kahane, Joel C., 13–19, 91 Kelsay, D., 337, 338, 453, 455 Kirshner, H. S., 252
Kahneman, D., 272 Kelso, J. A. S., 83, 102 Kirsten, E., 458
Kahrilas, P., 34 Kemmerer, D. L., 266 Kirzinger, A., 60
Kail, M., 267 Kemp, D. T., 511, 512, 515, 516 Kison-Rabin, L., 208, 209
Kail, R., 402 Kemp, S. S., 312, 396, 554 Kistler, D., 288, 290
Kaiser, A., 353, 379 Kempler, D., 240, 246, 387 Kistler, D. J., 424, 453
Kakita, Y., 17, 19, 26 Kempton, J. B., 518 Kitagawa, H., 258
Kalaydjieva, L., 434 Kennedy, M., 111, 129, 315, 390 Kitzing, P., 68
Kalberer, A., 527 Kenstowicz, M., 174 Klaben, B., 85
Kalehne, P., 334 Kent, J. F., 47, 126, 127, 129 Klap, P., 38, 39
Kalinyak, M., 104 Kent, Ray D., 3, 9, 35–37, 47, 105, 114, 126, Klatt, D. H., 58, 65, 66
Kalinyak-Flizar, M. M., 104 127, 129, 143, 150, 164, 201, 213, 221, 264, Klatt, L. C., 65, 66
Kallen, D., 186, 187 333, 367, 420 Klein, B., 208, 559, 560
Kalman, T. P., 186, 187 Kercsmar, C., 176, 179 Klein, B. E., 527
Kalmanson, J., 231 Kerivan, J. E., 502 Klein, H. B., 175
Kalokow, D. N., 548 Kern, L., 194 Klein, J. O., 135, 358
Kamel, P., 34 Kernodle, D., 33 Klein, L. L., 527
Kamen, R. S., 179 Kertesz, A., 101, 108, 243, 244, 246, 247, Klein, M. D., 143
Kamhi, A. G., 327, 329 248, 249, 250, 253, 264, 363, 381, 382 Klein, R., 208, 527, 559, 560
Kaminski, J. R., 515, 516, 533 Kertoy, M. K., 135, 136, 179 Klima, E. S., 266
Kamphorst, W., 246 Kessler, D. K., 449 Klin, A., 116, 117
Kane, A., 239 Kessler, J., 253 Kline, S. N., 46
Kane, J. M., 532 Kessler, K. S., 455 Klingholz, F., 3
Kanner, Leo, 115, 117 Keyes, L., 322 Klingner, J. K., 309
Kaplan, C. A., 330, 396 Keyserlingk, A. G., 244 Klippi, A., 253
Kaplan, E., 231, 249, 264, 381 Khalak, H. G., 266 Klono¤, P. S., 391
Kaplan, G., 208 Khan, L., 214 Kluin, K. J., 31
Kaplan, R. F., 145 Khan, L. M. L., 151 Knapp, B., 96
Kaplan, S., 133 Khinda, V., 148 Knepper, L. E., 252
Karbe, H., 248, 253 Khoshbin, S., 31 Knight, R. T., 249
Karchmer, M. A., 421 Khunadorn, F., 109, 367, 382 Knock, T. R., 105
Kardatzke, D., 506, 560 Kiang, N., 417 Knutson, J. F., 449
Karlsson, H. B., 218 Kidd, K., 221 Knuutila, J., 305
Kartush, J. M., 470 Kienle, M. J., 541 Kobayashi, K., 396
Kasai, T., 565 Kieper, R. W., 499 Kobayashi, N., 91
Kaslon, K., 178 Kiessling, J., 483 Kobler, J. B., 7
Kassubek, J., 275 Kieszak, S. M., 495 Koch, D. B., 447, 448, 455
Kaste, M., 293 Kijewski, M. F., 241 Koch, E. G., 424
Katcher, M. L., 135, 136 Kileny, P. R., 448, 455 Koch, M. A., 136
Kates, J. M., 488 Kilford, L., 30 Koch, M. E., 456
Katirji, M. B., 43 Killackey, J., 271 Koch, U., 243
Katsarkas, A., 565 Killion, Mead C., 420, 483, 488, 500, 501 Kochkin, S., 481
Katsuki, J., 458 Killon, M., 419 Koda, J., 38
Katz, L., 329 Kim, D., 418 Koebsell, K. A., 506, 560
Katz, P., 73 Kim, D. O., 515 Koenig, P., 240, 241
Katz, Richard C., 164, 254–256 Kim, E., 292 Koenigsknecht, R., 159
Katzman, R., 240 Kim, H. J., 478 Ko¤, E., 108
Kaufman, D., 208 Kim, K., 376, 440 Kohlrausch, A., 552
Kavanagh, J. F., 135 Kim, S.-G., 306 Kohn, Susan E., 363, 364, 367
Kawamura, M., 252 Kimberley, B. P., 418 Koike, Y., 37
Kay, J., 234 Kimura, D., 458 Koivuselkä-Sallinen, P., 253
Kay-Raining Bird, E., 327 King, A. S., 57 Kolinsky, R., 394
Kayser, H., 211, 322 King, C., 337, 440 Kolk, H. H. J., 384, 385, 406
Kazandjian, M. S., 228 King, J., 349 Kolk, Herman, 231–232, 333
Kean, M.-L., 266 King, J. M., 248 Kollmeier, B., 552
Keefe, D. H., 515, 516 Kingbury, B., 413 Kong, J., 177
Keefe, K. A., 396 Kinney, H. C., 145 Kong, N., 518
Kegl, J., 342 Kinsbourne, M., 237 Konigsmark, B. W., 478
Kehoe, M., 174, 346 Kinstler, D. B., 532 Konings, P. N. M., 43
Kei, J., 496 Kintsch, W., 300, 303 Konkle, D. F., 502
Keidser, Gitte, 482–485, 488 Kinzl, H., 49, 50 Konrad, H. R., 467
Keilman, A., 36, 37 Kinzl, J., 29, 50 Kontiola, P., 240
Keith, R. L., 80, 138, 258 Kirazli, Y., 39 Korein, J., 145
Kelemen, G., 57 Kirchner, D. M., 390 Kosary, C., 137
Keller, E., 364 Kirchner, H. L., 95 Kotby, N., 85
Keller, T. A., 272, 311, 313 Kirchner, H. S., 390 Koufman, J. A., 49, 55, 72, 92
Kelley, J. P., 57 Kirchner, J., 208 Kovac, C., 319
Name Index 589
Kozma-Spytek, L., 488 Lansing, Charissa R., 543–546 Leijon, A., 487
Krakow, R., 170 Lansing, R. W., 83, 84 Leiner, A. L., 315
Kramer, L., 386 Lansky, S. B., 20 Leiner, H. C., 315
Kramer, S. E., 481 Lantsch, M., 145 Leiser, R. P., 490
Krashen, S., 459 Lanza, E., 280 Lemke, J., 20
Kraus, D. S., 10 Lanzillo, R., 146 Lemke, J. H., 93, 95
Kraus, N., 433, 434, 439, 440 Lapagarette, L., 280 Lemme, M. L., 104
Krause, K., 217 LaPlante, M. P., 529 Lenneberg, E. H., 311
Krebs, D. E., 563 LaPointe, L. L., 102, 104, 105, 130, 273 Lennox, G., 292
Kreiman, Jody, 5, 43, 78–79 Laroche, C., 261 LeNormand, M.-T., 122
Kreindler, A., 273 Larrivee, L. S., 153, 327 Lentzner, H., 208
Kreisler, A., 250 Larsen, J., 249, 250 Leon, X., 72
Kricos, P., 440, 543 Larson, Charles R., 59–61 Leonard, C., 91, 392, 402
Krieg, C. S., 518 Larson, S. A., 140 Leonard, C. L., 387
Krikhaar, E., 345 LaSalle, L. R., 181 Leonard, L., 322, 331, 332, 337, 349, 373
Krishman, G., 462 Laska, A. C., 252 Leonard, Laurence B., 175, 319, 354, 372,
Kronenberg, J., 531 Lasker, J., 113 373, 402–403
Kros, C., 418 Lass, N. J., 14, 15, 18 Leonard, R. J., 47
Krueger, D., 297 Lassman, F., 487 Leonardis, L., 434
Krueger, W. W., 532, 533 Latham, S., 496 Lerman, J., 549
Kruse, E., 6 Lau, David P., 32–34 Lessac, A., 89, 90
Kryter, K. D., 539 Lau, H., 132 Lessass, A., 93
Kubovy, M., 438 Lau, W. K., 253 Lesser, R., 234
Kuchta, J., 427 Lauder, E., 139 Letz, R., 164
Kucinschi, B. R., 76, 77 Laughlin, S. A., 315, 347 Levelt, W. J. M., 142, 275
Kuehn, D., 130 Laurence, R., 419 Leventhal, B. L., 117
Kuelen, J. E., 318 Laurent, B., 145, 248 Levi, J., 33
Kuhl, D. E., 316 Laursen, B., 371 Levin, H., 111, 396
Kuhl, P. K., 54, 275, 425 LaValley, B., 146 Levine, B. A., 271
Kuhlemeier, K., 133 Lavender, R. W., 532 Levine, D. N., 387
Kuhn, G. M., 459 Laver, J., 169 Levine, H., 495–496
Kuhn, H.-J., 57 Laver, L., 78 Levine, H. L., 315
Kuijper, E. J., 518 Law, J., 378 Levine, R. A., 556
Kuk, F. K., 488 Law, W. A., 257 Levine, S., 226, 370, 371
Kumagami, H., 464 Lawson, D. T., 449 Levine, S. M., 462, 463, 464
Kumin, L., 141, 396 Lazarus, L. W., 258 Levita, E., 126, 244
Kurita, S., 91 Lazenby, B. B., 476, 532 Levitsky, W., 460
Kurs-Lasky, M., 135, 359, 370, 371 Leadholm, B., 322 Levitt, Harry, 419, 420, 451, 472, 473, 476,
Kurtin, P., 34 Leahy, P., 141 488, 541, 544
Kusche, C. A., 423 Leanderson, R., 76 Levy, J., 394
Kutas, M., 272, 273 Lear, P. M., 509 Levy, Y., 312
Kuypers, H. G. H. M., 60, 61 Leavitt, L., 288 Lewin, J. S., 139
Kwak, N., 140 Lebacq, J., 36 Lewis, Barbara A., 135, 156, 183–185, 218
Kwiatowski, J., 122, 135, 136, 143, 151, 156, Lebrun, Y., 145 Lewis, J., 34
206, 214, 215, 217, 218 Le Conteur, A., 115, 117 Lewis, J. E., 557
Kyle, R., 34 Lecours, A. R., 250, 252, 264 Lewis, K., 206
Leddy, M., 93, 288 Lewis, N., 151, 214
L Leder, S., 208 Lewy, Frederich H., 292
Laaksonen, R., 240 Lederberg, A., 371 Lexell, J., 92
Laasko, M., 253 Lederer, S. H., 175 Lhermitte, F., 252, 264
Labov, W., 296, 300 Lee, A., 116 Li, P., 266
Laci, P., 34 Lee, C. S., 519 Li, Y. L., 533
Lacour, M., 565 Lee, E., 298 Liberman, I. Y., 329
Ladaille, P., 60 Lee, F. S., 528, 529 Liberman, M., 417
Ladefoged, P., 7, 64, 150 Lee, H., 73 Liberman, M. C., 434, 508, 509
Lahey, M., 184, 350 Lee, J. H., 140, 240 Lichtenstein, M. J., 529
Lai, C. S. L., 185 Lee, L., 86, 90, 95 Lichtheim, L., 249, 263, 264, 314
Lai, Z., 141 Lee, P., 43 Liddell, S., 339, 340, 342
Laine, M., 253 Lee, S., 36 Lidén, G., 490, 502, 503, 560–561
Laka, I., 267 Lee, S. P., 306 Lidz, C. S., 322
Lakin, K. C., 140 Lee, T. D., 96, 105, 106 Lieberman, P., 3, 9, 58
Lambert, P. R., 462, 464 Lee, V. M., 30 Lieh-Lai, M., 179
Lambert, S., 179 Leeper, H. A., 47, 49, 80, 368 Liem, K. F., 56
Lambrecht, L., 117 Lees, A. J., 30, 145 Light, J., 110, 112, 113, 114
Lancee, W., 402 Legris, G., 176, 179 Ligon, J., 55
Landeau, B., 241 Lehiste, I., 451 Liles, B. Z., 303
Lane, C., 415, 416 Lehman, M. T., 388, 389, 390, 391 Liljencrants, J., 6, 63, 64, 76
Lane, G. I., 502 Lehman-Blake, M. T., 389, 390, 391 Lillo-Martin, D., 339, 340, 341
Lane, H., 208, 209 Lehmann, C., 267 Lilly, D. J., 462, 463, 464
Lang, A., 39 Lehtihalmes, Matti, 252–253 Lim, H. H., 61
Lang, P. J., 273 Lei, S. F., 510 Lim. M., 73
Langdon, H. W., 168 Leigh, R. J., 565 Lin, E., 55
Langmore, S., 133 Leiguarda, R. C., 60, 108, 258 Lin, H.-B., 544
590 Name Index
Lin, J., 469 Lovaas, O., 117 MacSweeney, M., 546

Lin, Q. G., 6, 63, 64, 76 Lovallo, W., 108 MacWhinney, B., 267, 312, 313
Lincks, J., 47 Love, R. J., 200, 201, 202, 203 Madassu, R., 20
Lindamood, P. C., 154 Loverso, F. L., 255, 256 Maddieson, I., 58, 393
Lindamood, P. D., 154 Lovett, M., 312 Madison, L. D., 511
Lindblom, B., 111 Lowder, M. W., 449, 453 Madureira, S., 252
Lindeboom, J., 246 Lowe, J., 292 Maestas, A. G., 322
Lindeman, R. C., 59 Lowe, R. J., 157 Maez, L., 211
Lindemann, K., 434 Lowe-Bell, S., 458 Magliulo, G., 491
Linden, P., 133 Lu, C., 73 Magnúsdóttir, S., 267, 361
Linder, T. W., 286 Lu, J. F., 21 Magnusson, E., 197
Lindgrin, F., 499 Lucas, C., 296 Mahendra, N., 240, 292
Lindkvist, A., 487 Lucente, F., 33 Mahieu, H. F., 43
Linebarger, Marcia C., 231, 271, 383–385 Luchsinger, R., 54, 91 Mahoney, G., 202, 203
Linell, S., 261 Luchtman, M., 427 Mahshie, J. J., 25
Ling, A. R., 243 Luckasson, R., 113 Mair, A., 508
Ling, D., 473 Luckau, J. M., 150 Maisog, J. M., 222
Lingam, V. R., 258 Luckersson, R., 352 Maison, S. F., 509
Linklater, K., 96 Ludlow, C., 130 Make, B. J., 226
Linthicum, F. H., 448 Ludlow, Christy L., 38–39, 60 Makielski, K., 476
Linville, Sue Ellen, 72–74 Ludman, C. N., 546 Malmquist, C., 502
Lippke, B. A., 216 Ludy, C. A., 250 Maltan, A. A., 427
Lippmann, R., 419 Luebke, A. E., 512 Mamourian, A. C., 241
Lippmann, R. P., 472 Lukatela, K., 231, 267 Mancl, L. R., 424
Lisberger, S. G., 565 Luna, C. E., 551 Mandel, E., 496
Lishman, W. A., 186 Lunatto, K. E., 542 Mandel, E. M., 359
List, H., 197 Lundgren, K., 361 Mandell, A. M., 246
List, M. A., 20 Lundh, L.-G., 557 Mandler, J. M., 300
Litin, E. M., 29 Lundh, P., 485 Manegold, R. A., 554
Litovsky, R. Y., 426 Lunner, T., 488 Maniglia, A. J., 43
Littlejohns, D., 28 Luoko, L. J., 151 Mankikar, G. D., 258
Liu, V., 73 Luria, A. R., 101, 145 Mann, D. M. A., 246, 247
Liu, X., 478 Luschei, E. S., 31 Mann, D. P., 211
Lively, S. E., 440 Luthe, L., 61 Mann, S., 370
Lleó, C., 174, 345 Lutz, G., 108 Manni, J. J., 135
Lloyd, P., 370 Lutzer, V. D., 325 Manning, W. H., 222
Lluschei, E. S., 31 Luxford, W. M., 433, 448, 519 Manoukian, J. J., 36
Lo, W., 427 Luzzatti, C., 267 Mansbach, A. L., 436
Loban, W., 324 Lybarger, S., 485 Mansella, J., 208, 209
Locke, J., 196 Lyerly, O. G., 252 Månsson, A.-C., 267
Locke, J. L., 175, 218 Lynch, E., 322 Mansson, H., 220, 221
Lockwood, A. H., 266 Lynn, G. E., 490 Maragos, N. E., 130
Lockyer, L., 117 Lynn, S., 563 Marais, J., 519
Loeb, Diane Frome, 285–287, 309 Lyon, G. R., 307 Maran, A. G. D., 57
Loewenson, R. B., 293 Lyons, F., 72, 73 Marantz, A., 356
Lof, G. L., 151 Lyregaard, P. E., 483, 485 Marassa, L. K., 544
Lofgren, R., 34 Lyttkens, L., 557 Marcell, M., 289
Löfqvist, A., 25, 68 Lyxell, B., 541 Marchetti, G. F., 564
Logan, K. J., 182, 335 Marchman, V. A., 312, 396
Logan, S. A., 529 M Marcus, G. F., 356
Logemann, J. A., 30, 31, 126, 133, 138 Ma, E. P., 22 Marean, G. C., 424, 425
Löhle, E., 96 Ma, L. J., 167 Marentette, P., 340, 341
Lohmeier, H. L., 178 Maag, K. P., 563 Mares-Perlman, J. A., 208, 527
Loizou, P. C., 447 Maassen, B., 122, 202 Margolis, R. H., 462, 463, 464, 506, 507, 559,
Lomas, J., 283 MacArdle, B. M., 472 560, 561
Lombard, L. E., 20 Maccaluso-Haynes, S., 302 Marie, P., 249, 263
Lombardino, L. J., 324, 402 MacCollin, M. M., 427 Marin, O. S. M., 231, 384
Long, S. H., 164, 206, 213, 379, 403 MacDonald, J. W., 543 Marineau, C., 427
Longacre, R. E., 301 MacDonald, M. C., 240, 270–271 Marion, M. H., 38, 39
Longstreth, D., 105 Maceri, D., 73 Marion, M. S., 519
Lonigan, C. J., 359 MacGahan, J. A., 10 Mark, V. W., 244
Lonsbury-Martin, Brenda L., 511–513, 515 Macken, M. A., 150 Marler, J., 350–351
Loovis, C. F., 458, 481 Mackenzie, I. R. A., 246 Marquardt, T. M., 101–102, 122, 142, 143
Lopez-Pousa, A., 72 Mackenzie, J., 472 Marquis, J., 395
Lorch, M. P., 108 MacKenzie, M., 54 Marrugat, J., 132
Lord, C., 116, 117 Mackersi, C., 488 Marsden, C. D., 31, 293
Lorente de Nó, R., 417 Mackey, W., 119 Marsh, E. E., III, 252
Lorenz, R. R., 43 Macklis, J. D., 258 Marshall, J., 252
Loscher, J., 496 Maclaren, R., 299 Marshall, J. C., 237, 238
Lotter, V., 117 MacLarnon, A., 58 Marshall, L., 548
Lotz, W. K., 36, 37 MacLeilage, P. F., 143 Marshall, N., 104
Lou, G., 60 MacLennan, D. L., 258 Marshall, Robert C., 186–188, 243
Lounasmass, O. V., 305 MacNeilage, P., 364 Marslen-Wilson, W. D., 393
Name Index 591
Martin, A., 20 McCormick, M., 190 Meisel, J. M., 280

Martin, D., 20 McCoy, M. J., 515 Mellet, E., 276
Martin, F. N., 500, 501, 502 McCreery, D., 427, 428 Mellits, D., 273
Martin, G. F., 80 McCulloch, T. M., 139 Mellon, N., 337, 338
Martin, G. K., 515 McDade, H., 298 Mellon, N. K., 456
Martin, N., 232, 364 McDaniel, K., 293 Melnick, W., 495–496, 508
Martin, R., 364 McDaniel, K. D., 246 Meltzo¤, A. N., 543
Martin, R. E., 347 McDermott, F. B., 244 Menapace, C., 427
Martin, S., 297, 478 McDermott, H., 449 Mencher, L. S., 494
Martin, W. E., 293 McDermott, H. J., 472, 473 Mendelsohn, J., 34
Martinez, A. L., 104, 105 McDonald, E., 159, 216 Mengert, I., 210
Martino, G., 390, 391 McDonald, J., 295, 298 Mengler, E. D., 329, 402
Marti-Vilata, J. L., 126 McDonald, S., 387 Menn, Lisa, 231, 265–268, 384, 406
Marttila, R. J., 293 McDonald-Bell, C., 476 Menna, R., 400
Marvin, C., 113 McDougle, C., 117 Menyuk, P., 359
Marx, R. E., 46 McFadden, D., 417, 418 Mepham, G., 475, 476, 477
Masaaki, B., 464 McFadden, D. M., 489 Merel, P., 427
Mason, J. L., 541 McFadden, S. L., 509 Merino, B., 321
Mason, R. M., 17 McFadden, T. U., 153, 380 Merrell, K. W., 398, 399
Mason, S. M., 532 McFarlane, S. C., 49, 81, 82, 83, 88, 89 Merry, C. J., 499
Massaro, D., 543, 544 McGarr, N., 336 Mervis, C. A., 396
Massey, E. W., 133 McGarr, N. S., 25 Mervis, C. B., 141, 396
Massey, K., 36 McGee, T. J., 439, 440 Merzenich, M. M., 90, 439, 440, 448, 552
Masson, V., 256 McGillivray, L., 327 Mesaros, P., 116
Master, D. R., 186 McGlone, R., 92 Mesibov, G., 117
Masters, J., 400 McGowan, J. S., 178–179 Mesulam, M.-M., 107, 246, 247, 248, 386
Masterson, Julie J., 164–165, 213 McGregor, G., 309 Metter, E. J., 246, 316, 527
Masuda, Y., 33 McGregor, Karla K., 297, 332, 395–397 Metz, S., 177
Masullo, C., 389 McGrory, J. J., 29, 50 Metzler, J., 80
Matheny, N., 206 McGue, M., 184 Meunier, F. A., 38, 39
Mathieson, L., 78, 83 McGuire, P. K., 546 Meyer, A. S., 275
Mathy, P., 111 McGurk, H., 543 Meyer, Bjerkan, K., 331
Mathy-Laikko, P., 113 McHorney, C. A., 21 Meyer, E., 60, 275
Matsumoto, N., 533 McKee, L., 325, 399 Meyer, S. E., 491
Matsumoto, T., 91 McKeith, I. G., 292 Meyer, T. A., 456
Matthews, J., 418 McKenna, L., 557 Meyers, J. L., 232
Matthews, L. J., 528, 529 McKhann, G., 201, 240 Meza, P., 21
Matthews, S., 280 McKinley, A. M., 507 Miceli, G., 231, 383, 389
Matthies, M., 208, 209 McKinley, R. L., 498 Michaelis, D., 6
Mattis, S., 329 McKinney, N. P., 64 Michalewski, H., 434
Mattox, D. E., 563, 565 McLean, J., 376 Michelow, D., 387
Matz, G. J., 518 McLeod, S., 190–191 Michie, P. T., 552
Matzelt, D., 17 McMicken, B., 42 Mickanin, J., 241
Mauk, C., 341 McMillan, J., 91 Middlebrooks, J. C., 453
Mauner, G., 384 McNally, D. N., 46 Middleton, A., 478
Maves, M., 508 McNamara, P., 208 Mikulis, D., 132
Maw, M. A., 478 McNamara, P. M., 527, 528 Milanti, F., 133
Maw, R., 359 McNeil, M., 459 Milbrath, R., 84
Mawhood, L., 162 McNeil, Malcolm, R., 101–103, 104–106, Milenkovic, P., 4, 36, 273
Mayberry, R., 341, 342 127, 264, 272–273, 367 Miles, F. A., 565
Mayer, J., 102 McNutt, L., 505 Millar, J., 449
Mayeux, R., 293 McPherson, B., 496 Millard, T., 163
Mayhood, L., 116 McPherson, D., 433 Miller, A. D., 61
Mayne, A. M., 336, 337, 422 McQuarter, R. T., 379 Miller, B. A., 137
Mayr-Graft, A., 49, 50 McQueen, J. M., 393, 394 Miller, C., 49, 50, 402
Mazor, M., 472 McRae, K., 270 Miller, C. L., 425
Mazzocchi, F., 243 McRay, L. B., 164, 165 Miller, H., 122
Mazzuchi, A., 231 McReynolds, L. L., 15, 18 Miller, J., 159, 164, 322, 325, 483
McAdams, S., 438 McReynolds, L. V., 198 Miller, J. D., 508, 510
McArthur, G. M., 329, 402 McSweeney, J., 116, 117 Miller, Jon F., 140, 288–291, 397
McBurney, D. H., 246 McSweeny, J. L., 135, 156, 184, 218, 219 Miller, R., 133, 312
McCabe, P., 243, 250 McTear, M., 372 Miller, R. H., 17
McCandless, G. A., 483 Mead, J., 7, 83 Miller, S., 194, 402, 403, 552
McCarthy, D., 370 Means, E. D., 433 Miller, S. A., 179
McCarthy, Patricia, 475–477 Meares, R. A., 552 Miller, S. L., 439, 440
McCarthy, R. A., 275 Mecklinger, A., 271 Miller-Jones, D., 319
McCartney, K., 371 Mediavilla, S. J., 462, 463, 464 Millet, B., 6, 78
McCarton, C. M., 358 Medley, L. P., 359, 360 Mills, A. W., 453
McCathren, R. B., 375 Mega, M. S., 145 Mills, D., 448, 519
McCauley, R., 114 Mehl, A. L., 337, 421, 515 Mills, D. L, 313
McCauley, Rebecca J., 218–219 Mehler, J., 394 Mills, H., 133
McClaskey, C. L., 440 Meier, R., 341 Mills, John H., 508–510, 527–530
McConkie, G. W., 544, 545, 546 Meikle, M. B., 556, 557 Mills, R. H., 256
592 Name Index
Milner, B., 313, 458 Morgan, A., 258 Musselman, C., 422, 423
Milner, P., 208 Morgan, A. R., 244 Muter, V., 311
Milosky, L. M., 325 Morgan, C. T., 508 Myers, Penelope S., 107–109, 388, 389, 390,
Milroy, J., 294–295, 296 Morgan, D., 92 391
Milstein, C., 6, 84 Morgan, D. E., 548 Myers, R. H., 527
Milutinovic, Z., 29, 49 Morgan, M., 162, 399, 400 Myres, W. A., 455, 541
Mimouni, Z., 267 Morgan, W. J., 7, 83
Mimpen, A. M., 451 Morgon, A., 512 N
Mindham, R. H. S., 293 Morison Olson, L., 532 Nabelek, A., 442
Mineau, S. M., 532 Moritz, C. H., 241 Nábélek, Igor V., 550–552
Miner, P. B., 89 Morley, G. K., 258 Nadeau, S. E., 145, 273, 275, 315, 316, 364,
Minshew, N., 116 Morley, M. E., 122 366, 367
Mirenda, P., 111, 113, 116, 277, 278 Morrell, C. H., 527 Nadol, J. B., Jr., 434
Mirra, S. S., 292 Morris, C., 518, 519 Naeser, M. A., 243, 244, 315, 347, 367
Mirus, G., 341 Morris, H. R., 145 Naeve-Velguth, S., 275
Mirz, F., 557 Morris, J., 231 Nagafuchi, M., 472
Mishkin, M., 313 Morris, P. L., 258 Naganauma, H., 563
Mistura, K., 38 Morris, S. E., 143 Nagarajan, S., 403
Mitchell, M., 133 Morris, S. R., 214 Nagel, N., 271
Mitchell, P., 202, 203 Morrisette, M. L., 123 Nagy, V. T., 255, 256
Mitchum, C. C., 384 Morrison, C. E., 31 Nagy, W. E., 397
Miyake, A., 349, 384 Morrison, D., 241 Naidoo, S. V., 488
Miyamoto, R. T., 337, 455, 541 Morrison, M., 34, 38, 72, 73, 187 Nalipour, R., 267
Mlcoch, A. G., 126, 258 Morrison, M. D., 29, 50, 93 Nance, W. E., 478
Moats, L. C., 324 Morrisset, C., 370 Napp, A. P., 507
Mobley, J. P., 427 Morrongiello, B. A., 424, 425 Narayanan, S., 36
Moeller, M. P., 421, 515 Morse, P. A., 425 Naremore, R. C., 370, 371
Mo¤at, D. A., 464 Mortimer, J. A., 293 Nash, E. A., 38, 39
Mogford, K., 336, 340 Morton, K., 171 Nash, L., 331, 354
Mohr, J. P., 41, 101, 249, 264 Morton, N. E., 477 Nash, M., 33
Mohrad-Krohn, G. H., 107 Moscicki, E. K., 208, 527, 528 Nasri, S., 43
Molenberghs, G., 6 Moser, D. J., 145 Nass, R., 312
Molgo, J., 38, 39 Moss, A. J., 529 Nasseri, S. S., 130
Molins, A., 132 Moss, H. F., 248 Natale, A., 491
Moller, A. R., 429 Moulard, G., 255 Nath, U., 145
Molnar, C., 418 Moulin, A., 512 Nation, J., 330
Molnar, M., 368 Mount, K. H., 224 Natividad, M., 43
Monaco, A., 402 Moutier, F., 263 Naujokat, C., 243, 244
Monaco, P., 185 Mowrer, D. E., 157 Naunton, R. F., 38, 39, 502
Mongeau, L., 77 Moya, K. L., 387 Navon, D., 272
Monjure, J., 205 Mross, E. F., 303 Neary, D., 146, 246, 247
Monnot, M., 108 Mueller, H., 419 Nebes, R. D., 240
Montequin, D. W., 89, 90, 96 Mueller, H. G., 480 Neebe, E. C., 135, 136, 358, 359
Montgomery, A., 208, 543 Mueller, R. F., 478 Needleman, H., 371
Montgomery, A. A., 440, 487 Muller, Dave J., 260–261 Neely, J. H., 273
Montgomery, C. R., 425 Muller, E. M., 70 Neely, Stephen T., 415, 418, 513, 515–517,
Montgomery, J., 349, 350 Müller, S., 253 533
Montgomery, W., 34 Müller-Preuss, P., 59, 60 Ne¤, D. L., 425
Montgomery, W. M., 7 Mulligan, M., 101, 102 Negus, V. E., 57, 58
Montgomery, W. W., 22 Mullins, J. B., 177 Nehammer, A., 478
Montoya, D., 515 Mulrow, C. D., 529 Neils, J., 184
Moody, D. B., 57 Mundy, M., 359, 560 Neilson, M. D., 221, 222
Moog, J. S., 421 Munhall, K., 545 Neilson, P. D., 221
Moore, B., 419 Munoz, D. G., 246, 247 Neisser, A., 361
Moore, B. C., 551, 554 Munoz, N., 45 Nelson, D. A., 551
Moore, B. C. J., 438, 527 Munson, W. A., 413, 414, 415, 416, 417 Nelson, E., 33
Moore, C., 317 Murata, T., 476 Nelson, J. R., 473
Moore, C. A., 143 Murawski, W. W., 309 Nelson, K., 343, 515
Moore, C. J., 317 Murdoch, Bruce E., 123, 200–201, 202, 243, Nelson, K. E., 405
Moore, D. R., 491 250, 314–317 Nelson, L. K., 165
Moore, J. K., 427, 428 Murphy, B. W., 266 Nelson, M., 202
Moore, J. M., 425 Murphy, J., 336 Nelson, N., 297
Moore, P., 15, 69, 240, 241 Murphy, P. J., 6 Nelson, R., 436
Moore, R. W., 389 Murray, J. C., 350 Nelson, R. A., 427, 428, 518
Moore, W. H., Jr., 169 Murray, K. T., 449 Nespoulous, J. L., 303, 363
Moossy, J., 246 Murray, L. L., 273 Netsell, R. W., 36, 37, 46, 48, 122, 129, 130
Morais, J., 394 Murray, T., 82 Nettelbladt, U., 197
Morales-Front, A., 210 Murray, V., 252 Neubauer, J., 57
Moran, M., 159, 297 Murrell, J. R., 427 Neuman, Arlene, C., 487–488
Moraschini, S., 250 Murry, Thomas, 20–23, 89, 91–94 Neumann, H. J., 174
Morata, T. C., 508 Muscatello, M., 491 Neville, B., 201
Morell, R., 478 Müsch, H., 413, 539 Neville, H. J., 313
Morest, D. K., 509 Musiek, F. E., 439, 532, 533 Newall, P., 485
Name Index 593
Newby, H., 475 Obler, L. K., 231, 266, 267, 302, 303, 406 Owens. E., 542
Newman, Arlene, 419 O’Brien, C., 89, 130 Oxenham, A. J., 551
Newman, C. W., 21, 86, 470, 557 O’Brien, J., 292 Oyer, H. J., 543
Newman, I., 258 O’Brien, K. P., 391 Ozanne, A. E., 123, 200–201, 219
Newman, J. D., 59, 60 O’Brien, M., 370, 402 Ozdamar, O., 433
Newman, R., 395 O’Connell, B., 375
Newport, E., 341, 342 O’Connor, M. J., 146 P
Newton, R. A., 564 Odell, K. H., 273 Paatsch, L. E., 421
Ng, J., 55 Odom, R. D., 425 Pabon, J. P. H., 3
Niccum, N., 458, 459, 460 Oetinger, R., 551 Pachana, N. A., 389
Nı́ Chasaide, A., 6 Oetting, Janna B., 294–296, 297–299, 395 Packard, J., 109
Nichol, H., 29, 50 Ogar, Jennifer, 249–251 Padden, C., 340, 342
Nicholas, J., 456 Ogawa, S., 306 Paden, E. P., 154, 181, 199, 204, 205, 333
Nicholas, L. E., 258 Ogawa, T., 33 Padovani, A., 240
Nicholas, M., 108, 303 O’Grady, G. M., 515 Pae, S., 395
Nicholas, M. L., 244 O’Grady, L., 339, 340, 341 Pafitis, I. A., 515
Nichols, F., 261 Oh, A. K., 519 Pakiam, A., 39
Nicholson, R. I., 329 Ohala, D., 345, 346 Palamara, J., 448
Nicklay, C. K. H., 243 Ohala, J., 76, 363, 364 Palmer, A. R., 307
Nicol, T. G., 440 O’Hara, M., 396 Palmer, C., 6
Nicoladis, E., 279, 280 Ohde, R. N., 156, 425 Palmer, C. V., 488
Nicolai, J., 145 Ohlemiller, K. K., 509 Palmer, E. P., 473
Nielsen, D. W., 509 Ohlms, L. A., 515 Palmer, P., 134
Niemann, K., 243, 244 Ohrlich, B., 515, 516 Palmer, P. M., 66
Niemi, J., 253, 266 Okamaoto, M., 563 Paltan-Ortiz, J. D., 246
Nienhuys, T. G., 422 Okamura, H., 130 Palumbo, C. L., 24, 243
Nigro, G., 367–368 O’Leary, M., 427 Panagos, J. M., 206
Nikolova, A., 434 Oliver, B., 185, 371 Paniello, R. C., 43
Nilsson, G., 502, 503 Oliver, D., 511 Panigrahy, A., 145
Nilsson, M., 451 Oller, D. K., 151, 164, 370, 425 Pannbacker, M., 50
Nilsson, R., 490 Olley, J., 114 Papagno, C., 243, 315, 316
Ninio, A., 372 Olness, Gloria Streit, 300–301 Paparello, J., 515
Niparko, J. K., 337, 338, 456, 563, 565 Olsen, K., 34 Papathanassious, D., 276
Nippold, Marilyn A., 324–325, 334 Olsen, L., 481 Pappas, J. J., 502
Nishida, H., 464 Olsen, N., 133 Paquier, P., 252
Nishimura, C., 350 Olsen, W., 490 Paradis, Johanne, 279, 280, 298, 331–332, 348
Nishizaki, K., 33 Olsen, W. O., 431, 548 Paradis, Michel, 266, 267, 268
Niskar, A. S., 495 Olsho, L. W., 424 Paradise, J. L., 135, 136, 359, 370, 371
Nittrouer, S., 425 Olson, L., 258 Parent, T. C., 472
Nixon, C. W., 498, 499 Olsson, J. E., 532, 533 Park, Y. S., 519
Nixon, S., 108 Olswang, L. B., 286, 328 Parker, A., 34
Noble, W., 481 On, A. Y., 39 Parker, C. A., 487
No¤singer, D., 490 Onada, K., 33 Parker, R. A., 495
Noguchi, A., 476 O’Neill, J. J., 543 Parker, S. W., 563
Nondahl, D., 559, 560 Onishi, K., 240, 241 Parkin, A. J., 237
Norbury, C. F., 350 Ono, Y., 243 Parkinson, A., 485
Nordlund, B., 491 Onslow, M., 182, 192, 222 Parkinson, A. J., 455
Norre, M., 564 Opie, M., 252, 253, 266 Parkinson, W. S., 449
Norris, D., 275, 393, 394 Optican, L. M., 565 Parnes, P., 114
Norris, J. A., 205, 379 Orbelo, D. M., 108 Parr, S., 261
North, T., 184, 185, 350, 402 Orchik, D. J., 505 Parry, D. M., 427
Northern, J. L., 15, 18, 480, 524 Orchik, J. G., 463 Parsons, C. L., 205
Northrop, C., 434 Orefice, G., 146 Pasanen, E. G., 489
Northrop, G., 224 Orgogozo, J. M., 258 Pascoe, D., 483
Norton, B. L., 529 Orlando, M., 529 Pascoe, M., 190
Norton, J., 117 Orliko¤, Robert F., 3, 4, 10, 14, 23–26, 92 Pashek, G. V., 244, 253
Norton, S. J., 513, 515 Orlowski, J. P., 176, 179 Pashler, H. E., 273
Notari-Syverson, A., 379 O’Rourke, T., 340 Pasquier, F., 292–293
Nowak, M., 198, 199 Osberger, Mary Joe, 336, 338, 447–449, 454– Passingham, R., 185
Nozza, Robert J., 424, 496, 504–507, 560 456, 541 Pasternak, J., 434
Nunez, P. L., 305 Osborn, A. H., 478 Pate, D. C., 271
Núñez-Cedeño, R., 210 Osborne, A. G., Jr., 308 Paterson, M., 336
Nurmi, E., 293 Osen, K. K., 427, 428 Patrick, J., 448
Nuss, D., 72, 73 Osman-Sagi, J., 368 Patronas, N. J., 241
Nuss, R. C., 37 Osterling, J., 114 Pattee, G., 111
Nuttal, A. L., 14 Ostyn, F., 73 Patten, B. M., 17
Nuutila, A., 368 Otake, T., 394 Patterson, C., 133
Otis, B., 439 Patterson, J., 433
O Otto, A., 145 Patterson, J. H., 509
Oates, J., 95 Otto, S. A., 427, 428 Patterson, K. E., 235, 237, 238, 240, 247,
Oates, J. M., 224 Otto, S. R., 551 248, 317
Oba, S., 434, 555 Ouhalla, J., 405 Patterson, R., 526
Ober, B. A., 240 Ovesen, T., 557 Patterson, R. D., 552
594 Name Index
Paul, R., 190–191 Petit-Taboue, M. C., 241 Poplack, W., 295

Paul, P. V., 336, 423 Petock, L., 240 Popovic, M., 434
Paul, Rhea, 115–118, 286, 328 Petrides, M., 60 Popple, J., 191
Paul-Brown, Diane, 147–149 Petrill, S. A., 371 Porazo, J., 541
Paulesu, E., 387 Petrova, J., 434 Porch, B., 258
Paulk, C., 299 Petry, M., 273 Porter, L. S., 532
Pauloski, B. R., 11, 138 Pettit, John, 223–225 Portillo, F., 427, 428
Paulson, G. W., 293 Pettito, L., 340, 341 Portin, R., 293
Paulus, P. G., 157 Pezzini, A., 240 Posen, M. P., 473
Pavlovic, C., 483 Pfanner, L., 331 Poser, S., 145
Pawlas, A. A., 30–31, 89 Pfau, E. M., 29 Posner, J., 145
Payan, C., 122 Pfister, D. G., 10 Posner, M. I., 273
Payen, M., 544 Phelan, J. G., 532 Postma, A., 333
Payer, F., 241 Phelps, M. E., 305, 316 Postma, G. N., 92
Payne, A. C., 371 Phillips, B., 111 Potamianos, A., 36
Payne, K., 319, 321 Phillips, C., 356 Poth, E. A., 490
Peach, Richard K., 243–244, 271, 347–348 Phillips, D. S., 243 Potisuk, S., 109
Pearce, P. S., 287 Picard, N., 59, 145 Potter, C. R., 24
Pearl, G., 261 Picarillo, J. F., 20 Potter, H. H., 387
Pearlmutter, N. J., 270–271 Pichora-Fuller, M. K., 541 Pou, A., 33
Pearson, B. W., 138 Pickard, L., 283 Poulsen, T., 551
Pearson, J., 508 Pickett, J., 442 Poussaint, T. Y., 146
Pearson, J. D., 527 Pickles, A., 115, 117 Powe, N. R., 456
Pedersen, C. B., 557 Pickles, James O., 417, 522–525 Powell, A. L., 293
Pedrazzi, M. E., 148 Pickup, B., 89, 95 Powell, L. E., 564
Peers, I., 370 Picton, T. W., 430, 533 Powell, R., 402
Pell, M. D., 109, 381, 382, 387, 389 Pike, B., 60 Powell, T., 151, 199
Pelland-Blais, E., 36 Pilgrim, L., 335 Praamstra, P., 275
Pembrey, M., 402 Pillon, B., 304 Pracy, J., 475, 476, 477
Peña, E., 322 Pillsbury, H. C., 176, 177, 490, 491 Prather, P., 271
Pena-Brooks, A., 192 Pinango, M. M., 253 Pratt, L., 208
Penfield, W., 458, 460 Pincus, D., 390 Pratt, R., 61
Penner, M. J., 512, 551, 554 Pincus, J. H., 187 Pratt, Sheila, 207–209, 439–440
Penner, S., 458 Pinheiro, S. B., 324 Preat, S., 258
Pennington, B. F., 140 Pinker, S., 266, 355 Precoda, K., 77
Penny, J. B., 31 Pipp-Siegel, S., 422 Preece, J. P., 453
Pentland, B., 30 Pirozzolo, F. J., 293 Prelock, P. A., 309
Perez, K., 31 Piskorski, P., 515 Preminger, J. E., 487, 488, 544
Perez, L. L., 147 Pisoni, D. B., 337, 338, 424, 425, 440, 458 Prescott, T. E., 255, 256
Pérez-Vidal, C., 280 Pitcairn, D. N., 370 Pressman, L., 422
Perkell, J. S., 7, 8, 9, 67, 68, 88, 208, 209 Pitcairn, T., 30 Preves, D. A., 419, 488
Perkins, L., 297 Pizzamiglio, L., 459 Prezant, T. R., 518
Perkins, W. H., 221, 333 Plant, Geo¤., 541–542 Price, C., 364
Perlman, Adrienne, L., 132–134 Plante, A., 153 Price, C. J., 317
Perozzi, J. A., 281 Plante, E., 184 Price, D., 240
Perrin, A., 38, 39 Plassman, B. L., 83, 84 Price, T. R., 108
Perrine, S. L., 214, 217 Plaut, D., 236 Price, T. S., 185, 371
Perris, E. E., 425 Plomin, R., 185, 371 Priede, V. M., 500, 501, 503
Perry, D. W., 60 Plomp, R., 419, 451, 527, 551, 554 Prieto, T., 525
Perry, E. K., 292 Plum, F., 145 Prieve, B. A., 515
Perry, R. H., 292 Poburka, B., 93 Primeau, M., 108
Pesetsky, D., 270 Pocock, L. C., 539 Prince, A., 175
Pessin, M. S., 101, 264 Podoshin, L., 518 Pring, T., 261
Peters, C., 159 Poeck, K., 243, 258 Prins, D., 222
Peters, G., 57 Poeppel, D., 275, 276, 368 Prins, J. M., 518
Peters, J., 515, 516, 533 Poewe, W., 30 Prins, R. S., 244
Peters, J. E., 36, 37 Pogacar, S., 246 Prinz, M., 174
Peters, J. F., 433 Pohjasvaara, T., 293 Prinz, P., 343
Peters, R. W., 490, 554 Poizner, H., 266 Prinz, W., 96
Peters, S. A. F., 135, 136, 359 Polanyi, L., 301 Pritchett, B., 270
Peters, T., 378 Polk, M., 246, 248, 253 Prizant, B. M., 116, 117, 118, 373
Peters, T. J., 359 Polka, L., 561 Probst, R., 516
Peterson, A., 436 Pollack, I. F., 146 Proctor, Adele, 124–126, 159
Peterson, B., 292, 455 Pollock, J. W., 406 Proctor, D. F., 83
Peterson, C., 390 Pollock, K. E., 143, 150 Proctor, L. R., 563
Peterson, F. E., 451 Polloway, E., 113 Propp, V., 300
Peterson, G., 225 Polster, M. R., 367 Propst, M., 258
Peterson, H. W., 29 Ponglorpisit, S., 109, 367, 382 Prosek, R. A., 208, 440, 487, 543
Peterson, J. D., 150 Ponton, C., 439 Prosen, C. A., 57
Peterson, N., 7 Pool, A., 563 Proud, G. O., 472
Peterson, R. C., 246, 247 Pope, M. L., 455 Prutting, C. A., 390
Pethick, S. J., 336, 370, 371 Popelka, G., 483 Pulaski, P. D., 246
Pethnick, S., 376 Poplack, S., 321 Punch, J. L., 487
Name Index 595
Purdy, S. C., 440, 507 Redner, J., 515, 516 Risi, S., 117
Putnam, A., 83 Reed, C., 82, 84, 419 Risley, Todd, 290, 370, 371
Pye, C., 197 Reed, C. G., 12 Ritter-Sterr, C., 20
Pyers, Jennie, 339–343 Reed, Charlotte M., 471–474, 541 Rizer, F. M., 80
Pyman, B. C., 448 Reed, N., 433 Rizzi, L., 354, 355
Rees, R., 190, 473 Rizzo, M., 273
Q Reichle, J., 112, 113, 278 Rizzolatti, G., 276
Qi, Y., 6 Reichman, W., 293 Robb, S., 311
Quaranta, A., 490, 491 Reid, J., 205 Robbins, A., 337
Quart, E., 179 Reid, R. R., 309 Robbins, A. M., 455, 456, 541
Quay, S., 279, 280 Reilly, J., 252, 253, 336, 340, 341 Robbins, J., 11, 12
Quer, M., 72 Reilly, J. S., 312, 376 Robbins, K. T., 20
Qui, W. W., 531 Reilly, R., 395 Robbins, L., 337, 338
Quigley, S., 337 Reilly, S., 201 Roberts, J., 298, 560
Quill, K., 117 Reinmuth, O. M., 246 Roberts, Joanne E., 135, 136, 358–360
Quiniou, R., 256 Reinvang, I., 109 Roberts, K., 135
Quinn, M., 159 Reiss, S., 113 Roberts, L., 458, 460
Quinn, R., 322 Remacle, M., 6, 78 Robertson, S., 130
Remy, P., 348 Robertson, S. B., 287
R Renaud, A., 303 Robin, D. A., 101, 104, 105, 109, 121, 122,
Rabinowitz, W., 419 Renshaw, J. J., 455 143, 184, 273
Rabinowitz, W. M., 449, 472 Renvall, U., 560–561 Robinette, M. S., 444, 515
Rachman, S., 556 Repp, B., 458, 459 Robinson, A. J., 109, 250
Rademaker, A. W., 138 Resch, J. A., 293 Robinson, D. A., 565
Radford, A., 337, 355 Rescorla, L., 286, 330, 370 Robinson, D. W., 508
Radkowski, D., 475, 476 Restropo, M. A., 321, 328 Robinson, I., 256
Rafal, C., 296 Reuck, J. D., 258 Robinson, K., 208
Ra‰n, M. J. M., 548 Revoile, S. G., 488 Robinson, K. M., 240, 241
Raggio, M., 542 Rey, R., 439, 440 Robinson, L., 33
Rahilly, Joan, 150, 169, 172 Reynolds, C., 80 Robinson, L. A., 325, 373, 399
Raj, A., 563 Reynolds, E. G., 548 Robinson, R. G., 108, 258
Rajput, A. H., 293 Reznick, D., 336 Robinson, R. J., 184
Ramadan, N. M., 21 Reznick, J. S., 370 Robles, L., 523
Ramage, A., 361, 362 Reznick, S., 376, 397 Rochford, G., 252
Ramandan, H., 33 Rhee, C. K., 519 Rochon, E., 231, 241
Ramig, Lorraine Olson, 30–31, 50, 72, 73, Rhee, J., 240, 241 Rockette, H. E., 135
85, 89, 91, 92, 93, 95, 130 Rhew, K., 39 Rockman, B. K., 205
Ramig, P. R., 182 Rhode, W., 417 Rodriquez, B., 286
Rammage, L. A., 29, 38, 50, 93, 187 Rhodes, M. C., 507 Roeltgen, D. P., 234
Ramsden, R. T., 464 Ribaupierre, Y., 511 Rogers, J., 475, 477
Ramsey, C., 342 Ricart, W., 132 Rogers, M., 104, 105
Ramsing, S., 261 Riccio, C. A., 324 Rogers, Margaret A., 102, 245–248
Rance, G., 436 Rice, D. A., 57 Rogers, R., 34
Rane, R. L., 483 Rice, M. L., 184, 298, 318, 331, 373, 380, Rogers, S. N., 46, 117
Raney, R. A., 38 395, 399, 400, 403 Rogers-Warren, A. K., 379
Rankovic, C. M., 538, 539 Rich, N., 419 Rogin, C., 481
Rao, S. M., 525 Richardson, F., 208 Roland, T., 448
Rapcsak, Steven Z., 233–235 Richman, L. C., 163 Roll, J. P., 565
Raphael, B., 258 Richmond, A. B., 518 Rollin, W. J., 186
Rapin, I., 396 Richter, B., 96 Rolnick, M., 38
Rapoport, S., 241 Richter, J., 73 Rolsenbek, J. C., 102
Rapp, B. C., 237 Rickard, J. C., 10 Romaine, S., 321
Rappaport, B. Z., 518 Rickford, J., 295, 296 Romani, C., 231, 237
Rasmussen, T., 313, 458 Rickford, R., 295 Romano, M. N., 429
Ratner, N. B., 221, 333, 334 Riddoch, M. J., 237 Romero, M. M., 147
Ratner, R., 375 Ridley, M., 33 Rommel, D., 334
Ratusnik, D. L., 159, 224 Riedel, K., 368 Romski, Mary Ann, 277–278, 352–353
Rauchegger, H., 29, 50 Riedmann, C., 145 Rondal, J., 141
Rauschecker, J. P., 525, 556 Riege, W. H., 246, 316 Ronnberg, J., 541
Raven, R., 28, 50 Rieks, D., 463 Rontal, M., 38
Ravid, R., 246 Ries, A. G., 137 Roop, T. A., 178
Ravida, A., 206 Riesz, R., 416 Roper, S., 275
Raymond, T. H., 472 Riggs, L. C., 518 Rosa, M., 159
Rayner, K., 544 Rigrodsky, S., 361, 362 Rose, D., 563
Read, C., 36, 164 Rijntjes, M., 253 Rose, D. M., 55
Realpe-Bonilla, T., 402 Riley, P. O., 563 Rose, F. C., 126
Reby, D., 57, 58 Rinaldo, A., 33 Rose, S. B., 367
Records, M., 221 Ringdahl, A., 487 Rosen, C. L., 177
Records, N. L., 162, 370, 402 Ringel, R. L., 73, 91, 92 Rosen, Clark A., 20–23, 92, 93
Redfern, B., 249, 250 Rinne, J. O., 293 Rosen, D., 73
Redford, B., 258 Rinne, U. K., 293 Rosen, D. C., 92, 93, 94
Reding, M., 132 Rintelmann, W. F., 501, 531, 532, 533 Rosen, I., 108
Redmond, Sean M., 398–400 Riquet, M., 20 Rosen, S., 541
596 Name Index
Rosenbaum, M. L., 43 Sabol, J., 86 Schecter, G. L., 80

Rosenbek, John C., 47, 104, 105, 126, 129, Sacco, K., 312, 313 Scheglo¤, E. A., 300, 301
130, 145–146, 165 Sachs, M., 417 Scheibel, A., 91
Rosenberg, S., 353 Sackett, D. L., 520 Scheibel, M., 91
Rosenbloom, L., 373 Sacks, H., 301 Scheidt, P. C., 135
Rosenfield, D. B., 17 Sade, J., 57 Schein, M., 83
Rosenthal, S., 114 Sadeghi, N., 36 Scheinberg, J., 472
Rosner, B. A., 358, 359 Sadek, A., 241 Schelfhout, S., 217
Rosner, J., 155 Saenz, T. I., 168 Scheltens, P., 244, 246
Ross, D., 170 Sa¤ran, E. M., 231, 232, 271, 364, 384, 385 Schenckman, B. N., 487
Ross, E., 381 Sa¤ran, J. R., 425 Schenone, P., 387
Ross, E. D., 107, 108, 386, 389 Sailor, W., 307 Scherer, Ron, 72, 75–77
Ross, K. B., 253 Saint-Hilaire, M. H., 39 Scherr, C., 543
Ross, M., 472, 483, 549 Sakai, K., 258 Schick, B., 340, 341
Ross, R., 184, 402 Sakuda, M., 68 Schi¤, N. D., 145
Ross, S., 296, 299 Salamy, A., 494 Schi¤rin, D., 301
Rossing, T., 525 Saleh, M., 4 Schilder, A. G. M., 135, 136, 359
Rossman, R., 464 Salmelin, R., 275 Schildermans, F., 258
Rothenberg, M., 6, 8, 25, 65, 68, 77 Salmon, D., 292 Schilling-Estes, N., 294, 296
Rothi, L. J., 273 Salmon, D. P., 240 Schlauch, Robert S., 531–533
Rothwell, J., 132 Salmon, S. J., 224 Schlömicher-Their, J., 96
Rothwell, J. C., 31 Saltzman, D., 205 Schloss, M. D., 36
Rouleau, G. A., 427 Saltzman, E. L., 83 Schmidek, M., 327
Roulstone, S., 378 Samawi, H., 66 Schmidt, A. R., 515
Rounsley, C. A., 223–224 Samero¤, A. J., 376 Schmidt, R. A., 96, 101, 104, 105, 106, 529
Rouselle, M., 255 Samlan, R., 93 Schmiodt, J., 132
Roush, Jackson, 135, 136, 359, 495–497, 507, Samson, D., 458 Schmitt, F. A., 241
560 Sanchez, M. L. C., 281 Schneider, B. A., 424
Rovers, M. M., 359 Sanchez, S., 532 Schneider, C., 86
Rowson, V., 475, 476 Sandage, M., 89 Schneider, E., 73
Roy, C., 375 Sander, J., 476 Schneider, R., 57
Roy, E., 364 Sanders, F. W., 501 Schneider, S. L., 232
Roy, Nelson, 6, 27–29, 38, 49–51, 56, 86, 89, Sanders, H. G., Jr., 226 Schoentgen, J., 6
90, 93, 96 Sanders, I., 39 Schoeny, Z., 490
Roy, P., 260 Sandoval, K. T., 206 Schofield, K. J., 224
Royster, J. D., 499 Sandson, J., 361, 362 Scholes, R., 108
Rozdilsky, B., 293 Sano, M., 292–293 Schondel, L., 493
Rozzini, L., 240 Sanson, M., 427 Schön Ybarra, M., 57
Rubel, E. W., 424 Santo-Pietro, M. J., 361, 362 Schooling, R. L., 214
Ruben, R. J., 358 Santos-Sacchi, J., 417 Schoonveldt, G. P., 554
Rubens, A. B., 235 Sanyal, M., 359 Schouwenberg, P. F., 138
Rubens, R., 224 Sapienza, C. M., 28, 70, 83 Schraeder, P., 159
Rubenstein, A., 440 Sapin, L. R., 246 Schreiner, C., 439, 440
Rubenstein, J. T., 449 Sapir, S., 28, 30–31, 60, 89, 95, 186 Schreuder, R., 122, 202
Rubin, C., 495 Sarant, J. Z., 421, 541 Schubert, M., 483
Rubin, H. R., 456 Sarno, Martha Taylor, 126, 244, 283 Schubert, M. C., 564
Rubin, N., 246, 247 Sartori, G., 306 Schuckers, G. H., 216
Rubow, R. T., 105 Sarubbi, M., 481 Schuknecht, H. F., 479, 493, 529
Ru¤ Noll, K., 331 Sasaki, C., 258 Schuler, A. L., 117, 118, 373
Ruggero, M. A., 419, 523 Sasanuma, S., 258 Schulman, R. H., 433
Ruottinen, H., 293 Satalo¤, R. T., 4, 73, 92, 93, 94 Schulte, L. E., 36, 37, 515, 516, 559
Rupp, D., 113 Satlin, A., 241 Schultz, K. I., 473
Ruscello, Dennis M., 156–157, 202 Satz, P., 311 Schulz, G., 130, 222
Russell, A., 95 Saunders, J. W., 532 Schulz, R., 388, 391
Russo, K. D., 271 Saunders, K., 478 Schum, D. J., 528
Ruth, R. A., 462, 464 Savov, A., 434 Schumm, J. S. S., 309
Rutka, J. A., 519 Saxman, J. H., 205 Schutte, H., 9
Rutter, J. L., 427 Saxon, K., 86 Schütze, Carson T., 354–356
Rutter, M., 115, 116, 117, 162 Saykin, A. J., 241 Schwander, T. J., 487
Rvachew, S., 136, 165, 198, 199 Scabini, D., 389 Schwann, S. A., 531, 532
Ryalls, J., 109 Scarborough, H. S., 329, 330, 402 Schwartz, D., 208, 485
Ryan, A. F., 490–491, 524 Scarpa, P., 387 Schwartz, D. M., 440, 487
Ryan, W., 459 Schaefer, C., 130 Schwartz, M., 252, 363–364
Rybak, L. P., 518, 519 Schaefer, S. D., 45 Schwartz, M. F., 231, 232, 271, 364, 384,
Rydell, P., 118 Schalen, L., 27, 29, 49, 50, 186, 187 385
Ryding, E., 108 Schallert, T., 257 Schwartz, S., 327
Ryner, P., 297 Schalock, R., 113 Scott, B. L., 541
Schank, R. C., 300 Scott, C. M., 324
S Schankweiler, D., 231 Scott, M. A., 396
Sabbadini, L., 332 Schatz, K., 133 Scott, R. M., 146
Sabe, L., 258 Schauer, C. A., 273 Scott Nielsen, S., 560
Saben, C. B., 205 Schaumann, H., 117 Searl, J. P., 89
Sabo, D. L., 496, 505 Schechter, M. A., 518 Searle, J., 301
Name Index 597
Sebastian, E., 322 Shiba, K., 61 Small, L. H., 150

Sebastián-Gallés, N., 393 Shih, W. J., 241 Small, Steven L., 257–259
Secord, W. A., 217, 325 Shillcock, R., 394 Smit, Ann Bosma, 156, 157, 213–217
Sedey, A. L., 421, 422, 515 Shindo, M., 33, 72, 252 Smith, A., 142, 180, 221
Sedley, A., 336, 337 Shine, R. E., 217 Smith, B., 211
Seewald, R., 483 Shinn, P., 367 Smith, B. E., 11
Segal, B. N., 565 Shinwari, D., 76, 77 Smith, B. K., 438
Seguı́, J., 394 Shipp, D. B., 449 Smith, B. L., 143
Seidenberg, M. S., 240, 270–271 Shipp, T., 17 Smith, C., 324
Seidl, A., 240 Shiromoto, O., 50 Smith, C. G., 135
Selbie, S., 222 Shonotou, H., 252 Smith, C. H., 133
Seligman, P. M., 449 Shore, C., 375 Smith, E., 20, 95, 370, 402
Selinger, M., 255, 256 Shosan, J. B., 531 Smith, E. E., 240
Sell, M. A., 399 Shriberg, L. D., 116, 117, 122, 123, 135, 136, Smith, F. H., 224
Selman, J. W., 216 143, 150, 151, 156, 157, 183, 184, 206, 213, Smith, F. W., 508
Senghas, A., 342 214, 215, 216, 217, 218, 219 Smith, J., 217
Sercarz, J. A., 41, 42, 43 Shriner, T. H., 472 Smith, J. C., 423
Sereno, J. A., 440 Shteiman, M., 332 Smith, K., 113, 363, 364
Serge, B., 248 Shumway-Cook, A., 563, 564 Smith, L. B., 143
Seron, X., 255 Shuster, L. I., 157 Smith, L. L., 462, 463, 464
Serpanos, Y. C., 425 Shuttleworth, E. C., 246, 293 Smith, M. E., 31, 281
Sessions, R. B., 17 Sidtis, J. J., 108, 109, 381, 382 Smith, N. V., 174
Sethuraman, G., 22 Siebens, A. A., 227 Smith, P., 258
Settle, S., 159 Siegel, B., 116 Smith, P. J., 38
Seung, H., 327 Si¤ert, J., 146 Smith, S., 90, 94, 96, 241, 448, 490
Sevcik, Rose, A., 277–278, 352–353 Sigafoos, J., 278 Smith, S. C., 260
Sevush, S., 186 Silbergleit, A., 86 Smith, T., 298
Seymour, C., 159 Silliman, Elaine R., 307–309 Smitheran, J. R., 9, 36, 37, 68
Seymour, H., 159, 295, 297, 298, 299 Sills, A. W., 467 Smitherman, G., 296
Seymour, Harry N., 318–320, 371 Sills, P. S., 47 Smith-Wheelock, M., 563
Shaheen, M. K., 551 Silman, S., 531 Smolensky, P., 175
Shahin, H., 478 Silva, A. C., 306 Smurzynski, J., 515
Shailer, M. J., 554 Silveri, M. C., 231, 240 Smyth, V., 496
Shalev, R., 312 Silverman, F. H., 157 Snell, M., 113
Shallice, T., 233, 236, 237, 238 Silverman, S., 182 Snidecor, J. C., 11
Shallop, J. K., 436 Silverman, S. R., 451, 536, 548 Snow, C. E., 370, 371, 372
Shammi, P., 387 Sima, A., 31 Snow, D., 150
Shanaz, N., 561 Simeonsson, R., 114 Snow, E., 244
Shane, H. C., 105 Simmerman, C. L., 174 Snowden, J. S., 146, 246, 247
Shangold, L., 8 Simmons, F. B., 447 Snowling, M. J., 190, 324, 330, 396
Shanks, Janet E., 558–562 Simmons, J. O., 159 Snyder, C. R. R., 273
Shankweiler, D. P., 267, 329, 458 Simmons, N. N., 105 Snyder, E. Y., 258
Shannon, J. S., 376 Simmons-Mackie, N., 261 Snyder, J. M., 500, 501
Shannon, Robert V., 427–428, 433, 446, 473, Simon, B. M., 178–179 Snyder, L., 422
551, 554, 555 Simon, D., 155, 201 Snyder-McLean, L., 376
Shapiro, B. E., 367, 382 Simon, H., 472 So, L. K. H., 281
Shapiro, J. K., 249 Simon, M., 56, 86, 90, 96 Sobe, T., 478
Shapiro, Lewis P., 232, 257, 269–271 Simono¤, E., 115, 185 Sobecks, J. L., 55
Shapiro, T., 162 Simons, R. F., 273 Soder, A. L., 216
Shapiro, W. H., 541 Simpson, G. B., 359 Sodkilde-Jorgensen, H., 557
Sharara, A. I., 89 Singer, L. T., 176, 179 Sohmer, H., 429
Sharf, D. J., 156 Singer, M. I., 10, 11, 139 Soler, S., 132
Sharma, T., 440 Singer, W., 305 Soli, S. D., 451
Sharon, V. M., 21 Singh, B., 33 Solma, R. T., 439
Shattuck-Hufnagel, S., 363 Singh, S., 458 Solomon, J., 271
Shaw, E. A. G., 453 Sininger, Yvonne S., 433–436, 513, 515, 533, Solomon, N., 84, 89, 201, 203
Shaw, G., 418 555 Somani, S., 518, 519
Shaw, G. Y., 89 Sinnott, J. M., 425 Sommers, M. S., 57
Shaywitz, B. A., 184 Siqueland, E., 425 Sommers, R. I., 101
Shaywitz, S. E., 184, 329 Sirigu, A., 304 Soni, N., 33
Shea, J. J., 463 Sjogren, H., 480, 487, 488 Sonies, B., 134
Shea, S. A., 227 Ska, Bernadette, 247, 302–304, 390 Sonnenberg, E. A., 373
Sheena, D., 545 Skinner, B. F., 192 Sonninen, A., 17
Shekar, C., 167 Skinner, M. W., 449, 451–453, 483 Sorbi, S., 364
Shelton, N. W., 10 Sklare, D. A., 500, 501 Sorensen, K., 43
Shelton, R. L., 218 Skrtic, T. M., 307, 309 Sorkin, E. M., 258
Shenault, G. K., 240 Slama, A., 127 Soto-Faraco, S., 393
Shengguang, Y. S., 531 Slansky, B. L., 273 Souhrada, J. F., 38
Shepard, N. T., 563, 564 Slater, S., 194 Southwood, H., 105
Sheperd, J. C., 391 Slawinski, E. B., 136 Souza, Pamela E., 481, 488, 548–549
Shepherd, R., 448 Sloan, S., 43 Sparks, R. W., 105, 347
Sherbourne, C. D., 21 Slobin, D. I., 266, 322, 331, 341, 384 Speaks, Charles, 458–460, 548
Shewan, C. M., 368 Smaldino, Joseph J., 442–443 Spearman, C. B., 226
598 Name Index
Specht, L., 46 Stephens, M. I., 156 Sulton, L. D., 133

Spector, C. C., 325 Stephens, S. D. G., 557 Suman, V., 563
Speedie, L., 382, 387 Stern, M. B., 231 Summerfield, A. Q., 307, 449, 543, 546
Speelman, P., 518 Stern, R., 261 Summers, C., 325
Speigel, J. R., 92 Sterzi, R., 387 Sun, X., 515
Speiker, S., 370 Steuver, J. W., 498 Sundaram, M., 241
Spence, M. M., 440 Stevens, Kenneth N., 6, 63–67, 548 Sundberg, Johan, 6, 51–54, 76
Spencer, J. C., 325, 373, 399 Stevens, L. J., 399 Supalla, S., 342
Spencer, L., 337, 456 Stevens, S. S., 526 Surr, R. K., 481
Spencer, L. E., 224 Steward, F., 157 Susser, R., 36
Spens, K.-E., 541 Stewart, C., 38 Sussman, H., 101–102, 122
Sperry, E., 7 Stewart, C. F., 38 Sussman, N. M., 146
Sperry, R. W., 458 Stewart, K., 476 Suthers, R. A., 57
Spetner, N. B., 424 Stiassny-Eder, D., 244 Sutton, D., 59
Spiegel, J., 73 Stiles, J., 312, 396 Sutton, R. L., 257
Spiegel, J. R., 92, 93, 94 Stillman, R. C., 240 Suzuki, Ryuji, 420
Spirduso, W., 73 Stockman, Ida J., 158–160, 297, 298, 319 Svirsky, M. A., 208, 209, 337, 456
Spiro, M., 483 Stoel, C. M., 150 Swanson, H. L., 309, 349
Spitz, R., 184 Stoel-Gammon, Carol, 140–141, 150, 151, Swanson, S., 114
Spitzer, J., 133, 208 174, 198, 213, 214, 346 Sweeney, A., 478
Spivey-Knowlton, M. J., 270 Stokes, K. D., 439 Sweet, L. H., 108
Splaingard, M. L., 133, 179 Stokes, P. D., 123 Swigart, E., 473
Spoendlin, H., 434 Stokes, Z., 518 Swindell, C. S., 244, 264
Spraggs, P., 34, 476 Stolk, R. J., 43 Swinney, D., 271
Springarn, A., 494 Stone, Maureen, 169, 170, 171, 172 Szatmari, P., 116
Springer, L., 232 Stone, R. E., 14, 139
Square, P. A., 101, 105, 143, 202, 347, 364 Stone, W., 116 T
Square-Storer, P., 104 Stool, S. E., 135, 505 Taeschner, T., 280
Squire, L. R., 316 Storey, L., 484, 485 Tager-Flusberg, H., 116, 117, 327, 395
Squire, S., 38 Storkel, H., 102 Tait, C., 496
Sreissguth, A., 195 Stothard, S. E., 324, 330, 396 Tait, D. M., 456
St. Louis, K. O., 202 Stover, J. L., 515 Tait, M. E., 232
Staab, W., 419 Stover, L. T., 515 Taitelbaum, R., 208, 209
Stack, B., 33 Straatman, H. M., 359 Takahashi, H., 37
Stackhouse, Joy, 122, 189–191 Strand, C., 143 Takahashi, N., 252
Stadian, E. M., 240 Strand, E., 114, 129, 202, 203, 390 Takefuta, Y., 473
Stadlender, S., 231 Strand, E. A., 142 Talkin, D., 23
Sta¤el, J. G., 490 Strand, K., 121 Tallal, P., 184, 273, 350, 395, 400, 402, 403,
Sta¤ord, M., 458 Strasnick, B., 493, 494, 563 439, 440, 552
Stager, S., 222 Strassman, B. K., 423 Tan, C. K., 61
Stagner, B. B., 512 Strattman, Kathy, 153–155 Tanaka, S., 64, 92
Stainback, S., 308 Strauss, E., 311 Tanaka, Y., 60
Stainback, W., 308 Strawbridge, W., 208 Tanenhaus, M. K., 270, 271, 384
Staller, S. J., 427, 449, 462 Stredler-Brown, A., 421 Tange, R. A., 518
Stanovich, K. E., 153 Street, C. K., 324 Tangle, D., 153
Stapells, D. R., 358, 430 Strick, P. L., 59, 145 Tannen, D., 301
Stapleton, B., 296, 299 Strik, H., 6 Tanner, Jr., 551
Stark, E., 475, 476 Stringer, A. Y., 108 Tanner, D. C., 260
Stark, R. E., 142, 273 Strobino, D., 194 Tanner, K., 56
Starkstein, S. E., 60, 108, 258 Ström, T., 557 Tappe, K., 133
Starkweather, C. W., 181 Strome, M., 43 Tarayi, A., 33
Starr, A., 433, 434, 533, 555 Strömgren, T., 557 Tarbell, N. J., 146
Stathopoulos, Elaine T., 67–71, 82, 83 Strong, M., 343 Tardif, T., 371
Statlender, S., 364, 367–368 Strong, W. J., 473 Tarone, E., 158
Stebbins, W. C., 57 Stroop, J. R., 272 Tarquinio, N., 425
Steele, R., 117 Strube, H. W., 6 Tartter, V. C., 367–368, 459
Steele, R. D., 255 Strupp, J. P., 275 Tasko, S. M., 6, 29, 50
Steeneken, H. J. M., 539 Strupp, M., 563 Tatemichi, T. K., 293
Ste¤ens, M. L., 370 Stubbe, E., 120, 211 Tatham, M., 171
Stegen, K., 83 Stucker, F. J., 531 Tavy, D. L. J., 145
Stein, J., 43 Studdert-Kennedy, M., 368, 425, 458 Taylor, B., 194
Stein, J. F., 329 Studebaker, Gerald A., 480, 500, 501, 502, Taylor, H. G., 184
Stein, L., 433, 434 503, 538–540 Taylor, J. R., 248
Stein, R., 178 Studenski, S., 564 Taylor, M., 20, 95, 96
Steinberg, J., 413, 415, 416, 429 Stuebing, K. K., 329 Taylor, O., 159, 319, 321
Steinberg, J. C., 538, 539 Sturm, J., 113 Taylor, P. C., 490
Steinecke, I., 77 Stuss, D. T., 145, 249, 387 Taylor, S., 458
Stell, P. M., 57 Stypulkowski, P. H., 448, 462 Taylor, W., 508
Stelmachowicz, P. G., 527, 559 Subtelny, J. D., 68 Teagle, H. F. B., 455
Stemmer, B., 303, 387, 391 Sulkava, R., 240 Teasell, R., 133
Stemple, Joseph C., 49, 56, 82, 85–87, 90, 93, Sullivan, J., 451 Tecca, J. E., 487
96, 225 Sullivan, K., 327 Teele, D. W., 358, 359
Stephens, I., 216 Sullivan, M., 132, 261 Tees, R., 439, 440
Name Index 599
Teicher, R., 459 Tommasi, M., 145, 248 V

Telian, S. A., 455, 563, 564 Tomoeda, C. K., 240, 292, 293 Vaissiere, J., 20
Temple, C., 355 Tompkins, Connie A., 381, 387, 388–391 Vaivre-Douret, L., 122
Templer, J. W., 177 Tompson, C., 458 Valck, C. D., 83
Templin, M. C., 216 Tong, Y., 448, 449 Valdes, G., 321
Tench, P., 150 Tonkovich, J. D., 347 Valdes-Perez, R., 312
Terada, S., 38, 39 Tonnaer, J. A. D. M., 43 Valdez-Menchaca, M., 359
Terkildsen, K., 560 Tonokawa, L. L., 455 Valk, J., 246
Terrell, F., 159 Topbas, S., 197 Vallance, D. D., 400
Terrell, S., 159 Topol, D., 342 Vallar, G., 315
Testa, H. J., 246, 247 Toppelberg, C. O., 162 van Bon, W. H. J., 135, 136, 359
Testa, J. A., 108 Torgeson, J., 329 Van Borsel, J., 224
Tewfik, T. L., 36 Toriello, H. V., 478 van Buuren, R. A., 487
Thach, B. T., 177 Toriyama, M., 433 Vance, M., 190
Thal, D. J., 312, 370, 376, 396, 397 Torloni, H., 33 VanDaele, D., 134
Thal, E., 336 Tostevin, P., 34 van den Bercken, J. H. L., 135, 136
Tharpe, Anne Marie, 424–426, 493–494 Toth, A., 157 van den Broek, P., 359
Thedinger, B., 462, 463, 464 Tough, J., 370 van den Honert, C., 448
Thelander, M., 261 Touretzky, D., 364 Van der Lely, H. K. J., 231, 270, 350, 396,
Thelen, E., 143 Toussaint, L., 258 403
Theodoros, D. G., 201, 202 Towne, R., 122 van der Lugt, A., 394
Theolke, W. M., 68 Trabucchi, M., 240 Van der Merwe, A., 102, 104
Thibodeau, L. M., 440 Tranel, D., 109, 244, 246, 252, 263–264 Van Derveer, N. J., 459
Thiel, A., 253 Trauner, D., 311, 312, 395, 396 van der Veer, R., 491
Thielemeir, M., 455 Trautwein, P., 436 van der Werf, S., 311
Thielke, H., 135, 136 Travis, L. E., 161 van der Wilt, G., 359
Thomas, B. E., 244 Traynor, C., 126 Van de Sandt-Koenderman, M., 255
Thomas, J., 519 Treaba, C., 448 van Dijk, T. A., 300, 301, 303
Thomas, Jack E., 137–139 Trehub, S. E., 424, 425 Van Dongen, H. R., 252
Thompson, C., 283, 458 Tremblay, K., 434, 439, 440 Van Eeckhout, P., 348
Thompson, C. K., 104, 232, 406 Trofatter, J. A., 427 van Gelderen, E., 354
Thompson, D., 132, 315 Trojanowski, J. Q., 30 van Grunsven, M. F., 231, 232, 384
Thompson, G., 487 Trosset, M. W., 240 Van Hattum, R. J., 68
Thompson, L., 130 Trost-Cardamone, J. E., 252 vanHoek, K., 339, 340, 341
Thompson, L. A., 185 Trueswell, J. C., 271, 384 van Hoesen, G. W., 241
Thompson-Porter, C., 298 Truong, D. D., 38 Van Hosesel, R. M., 453
Thomson, J., 133 Trybus, R. J., 421 Vanier, M., 250
Thomson, J. E., 41 Tseng, C.-H., 273 van Kleeck, A., 153, 167, 286, 322, 380
Thomson, R. G., 145 Tsuta, Y., 476 Van Lancker, D. R., 108, 109, 243, 381, 382,
Thomson, V., 422 Tucci, D., 337, 338 387, 389
Thoonen, G., 122, 202 Tucker, G., 187 van Lith-Bijl, J. T., 43
Thordardottir, E. T., 281 Tucker, H. M., 43 Van Norman, R., 147, 148
Thorn, J. J., 46 Tucker, P. E., 543 Van Pelt, F., 38
Thorne, P. R., 440 Tuller, B., 83, 102 van Putten, M. J. A. M., 145
Thornton, A. R., 434, 548 Tulley, M. R., 529 Van Reenen, J., 147
Thorpe, L. A., 424 Turkstra, L. S., 145 Van Riper, C., 216
Thráinsson, H., 267 Turner, C. W., 471, 472 Van Tassel, D. J., 487, 488
Thron, A., 243, 244 Turner, Robert G., 444–447 van Turennout, M., 364
Throneburg, R. N., 181 Turnev, I., 434 Van Valin, R. D., 266
Thulborn, K. R., 272, 311, 313 Turrentine, G. A., 509 Van Vleymen, B., 258
Thyme, K., 90, 94, 96 Tusa, R. J., 563, 564 van Zomeren, A. H., 273
Tian, B., 525 Tweed, T. S., 208, 527, 559, 560 van Zon, M., 394
Tibbils, R., 464 Tybor, A., 33 van Zonneveld, R., 231, 406
Tice, R., 111, 126, 131 Tye-Murray, N., 337, 439, 541, 542 Varchevker, A., 261
Ti¤anny, W. R., 472 Tykocinski, M., 448 Varga, M., 222
Tillman, T., 442, 451, 548 Tyler, Ann A., 204–206 Vargha-Khadem, F., 185, 311, 313, 402
Timcke, R., 69 Tyler, L. K., 240, 248 Varley, R. A., 102
Timko, M. L., 387 Tyler, R., 337, 338 Varma, A., 146
Timler, G., 286 Tyler, Richard S., 449, 450–454, 455, 488, Vasilyeva, M., 370, 371
Tinetti, M. E., 564 490, 541 Vataja, R., 293
Tippett, D. C., 227 Tzeng, O. J. L., 266, 268 Vatikiotis-Bateson, E., 545
Titze, I. R., 3, 4, 6, 23, 25, 55, 57, 64, 70, 71, Vaubel, A. S. W., 531
75, 76, 77, 85, 88, 89, 90, 96 U Vaughan, C., 88
Tobey, E. A., 45, 337, 338, 456 Uemura, T., 476 Vaughn, S., 309
Tobin, Y., 208, 209 Ulatowska, Hanna K., 300–301, 302 Vaughn, T., 72
Todd, C. M., 400 Ullrich, K., 549 Vaughn-Cooke, F., 158
Tohkura, Y., 440 Umberger, F., 147, 148 Velez, R., 529
Tokumasu, K., 563 Umezaki, T., 61 Velickovic, Miodrag, 30–31
Tolbert, L., 206 Umilta, C., 306 Velleman, Shelley L., 121, 122, 123, 144,
Toledo, S. W., 56 Unwin, D. H., 258 198–199
Tomblin, B., 328 Uppenkamp, S., 552 Velmans, M. L., 473
Tomblin, J. B., 162, 184, 185, 218, 219, 309, Urbano, R., 370 Veltman, C. J., 321
329, 330, 337, 350, 370, 399, 402, 456 Utman, J., 231 Vennard, W., 76
600 Name Index
Ventry, I. M., 476, 528, 531, 532, 535 Walsh, R., 475, 476, 477 Weinstein, B. E., 528
Ventura, S., 194 Walsh, R. M., 519 Weinstein, E. A., 252
Verdolini, Katherine, 50, 55, 66, 85, 88–90, Walters, H., 402 Weintraub, S., 246, 247, 302, 386
95–97 Walton, J. P., 529 Weinzapfel, B., 381
Verdolini-Marston, K., 88, 89, 90, 93 Waltzman, S. B., 455, 541 Weisenberger, J. M., 542
Verdonck de Leeuw, I. M., 79 Wambaugh, J., 101, 102, 104, 105 Weishampel, D. B., 57
Verneil, A., 41, 42, 43 Wang, J., 39 Weismer, G., 47, 68, 83, 101, 102, 126, 129
Vernon, J. A., 557 Wang, M. D., 471 Weismer, Susan Ellis, 281, 349–351
Vernon, M. W., 258 Wang, P., 141 Weiss, A. L., 319
Vernon-Feegans, L., 358, 360 Wang, X., 403, 525 Weiss, L., 33
Veuillet, E., 512 Wang, Y., 440 Weiss, M. J., 425
Vicari, S., 311 Wapner, W., 243, 389 Weiss, M. S., 12
Vick, J. C., 209 Warburton, E. A., 317 Weiss, S., 433
Vickers, D. A., 551, 554 Ward, P., 31 Weiss Doyal, A., 302
Viemeister, N. F., 551, 552, 554 Ward, W. D., 508, 509, 534 Weissenborn, J., 267
Vignati, A., 459 Ward-Lonergan, J., 244 Weitzman, E., 287
Vignolo, L. A., 243, 315 Ware, J. E., Jr., 21 Weizman, Z. O., 371
Vihman, M. M., 122, 142 Waring, M., 427, 428 Welham, Nathan V., 35–37
Vijayan, A., 367 Warita, H., 258 Wellens, W., 73
Vilkman, E., 6 Warren, D. W., 47 Wellesch, C. W., 243
Villa, L., 33 Warren, P., 393 Wellington, W., 191
Villchur, E., 419, 420 Warren, Steven F., 287, 375–377, 379 Wellman, B., 210
Villkman, E., 95 Warren, V. G., 549 Wellman, L., 200, 201, 202, 203
Villringer, A., 305 Warrick, P., 39 Wells, B., 189, 190
Vingolo, L. A., 368 Warrington, E. K., 237 Wells, J., 150
Violani, C., 389 Wartofsky, L., 73 Welsh, L. W., 531
Vlietinck, R., 258 Washington, J. A., 159, 298, 318, 319, 328, Welsh-Bohmer, K. A., 292
Vogel, D., 92 373, 399 Wendt, K. A., 518
Vogel, S. A., 329 Washington, P., 211 Wepman, J. M., 11
Vogel, V., 147 Wasowicz, J., 164 Werker, J., 439, 440
Vogelsberg, T. T., 309 Watanabe, Y., 91 Werner, L. A., 424, 425
Vohr, B. R., 515 Waters, D., 190, 205 Wernicke, Carl, 249, 250, 252, 262–263, 275,
Volkman, J., 526 Waters, G. S., 231–232, 241, 363, 384, 387 314
Volkmar, F., 116, 117 Watkins, K., 402 Wertz, Robert T., 102, 104, 105, 249–251,
Vollmer, K., 102 Watkins, K. E., 185 253, 255, 256, 389
Volpe, B. T., 109 Watkins, Ruth V., 318, 333–335, 395 West, C., 472
Volterra, V., 280, 375 Watson, B. C., 179 West, J. E., 104
von Arbin, M., 252 Watson, C. S., 508, 510, 551 West, R. A., 59
von Cramon, D., 59, 60, 145, 146 Watson, I., 121 Westby, C., 167
von Euler, C., 76 Watson, J., 211 Westerhouse, K., 449
von Leden, H., 15, 69 Watson, P. C., 240 Westerman, S. T., 493
Vorperian, H. K., 36 Watson, P. J., 7 Weston, A. D., 156, 214, 217
Vosteen, K. H., 17 Watson, Peter, 82–84 Wetherby, A. M., 117, 118, 373, 375
Voyvodic, J., 313 Watson, R. T., 108 Wetzel, K., 165
Vrabec, D., 33 Watts, M. T., 187 Wexler, E., 460
Vreugde, S., 478 Waugh, P., 33 Wexler, K., 184, 298, 331, 355, 403
Vroomen, J., 393, 394 Waxman, J., 34 Weymuller, E. A., 20
Wazen, J. J., 464 Whalen, T. A., 425
W Weaver, M., 257 Wharry, R. E., 439
Wachtel, J., 130 Webb, W. G., 200, 202, 252 Wharton, J. A., 557
Wagenaar, E., 244 Weber, A., 394 Wheeler, D., 364
Wagner, M. T., 246 Weber, P. C., 564 Whitaker, H. A., 311, 312, 313
Wagner, R. K., 329 Weber, R. S., 137, 138 White, A., 29
Wahl, P., 157 Weber-Luxenburger, G., 253 White, M. W., 448
Wahrborg, P., 261 Webster, J. W., 208, 209 White-Devine, T., 240, 241
Wakefield, G. H., 551, 552 Webster, P., 153 Whitehead, M. L., 515
Walden, B., 208, 543, 544 Weddington, G. T., 318 Whitehouse, D., 397
Walden, B. E., 440, 481, 487 Weeks, S., 289 Whitehurst, G. J., 359, 371
Waldhauer, Fred, 419 Wegel, R., 415, 416, 418 Whiteman, B. C., 359
Waldstein, R., 208 Wegel, R. L., 502 Whiteside, S. P., 102
Waldstein, R. S., 275 Wei, L., 120, 281 Whitford, L. A., 449, 541
Wales, R. J., 421 Weidehaas, K., 145 Whitney, Susan L., 563–565
Waletzky, J., 300 Weijts, M., 385 Whitteridge, P., 7
Walhagen, M., 208 Weikert, M., 96 Whitworth, C., 518, 519
Walker, M., 563 Weil, D., 478 Wichter, M. D., 252
Walker, V. G., 375 Weiller, C., 253 Widen, Judith E., 513, 515–517
Walker-Batson, D., 258 Weinberg, B., 11 Wieder, S., 117
Wallace, I. F., 136, 358, 359, 360 Weiner, D., 564 Wiegel-Crump, C. A., 312
Wallach, H., 159 Weiner, F. F., 205 Wieneke, G. H., 36
Wallen, V., 186 Weinman, J., 563 Wigg, N. R., 396
Wallesch, C.-W., 243, 260, 315, 316 Weinreich, U., 119 Wiggins, R. D., 260
Walsh, M., 88 Weinrich, V., 56 Wightman, F. L., 424, 453, 554
Walsh, M. J., 10 Weinstein, B., 21 Wigney, D., 485
Name Index 601
Wiig, E., 325 Woodworth, G. G., 455 Young, J. L., 89

Wiigs, M., 287 Woolf, N. K., 490–491 Young, L. D., 147
Wijnen, F., 345 Woollacott, M., 564 Young, L. R., 545
Wilber, L. A., 488, 500, 501, 535 Workinger, M., 201 Younger, B. A., 425
Wilbur, R., 340, 341 Worrall, L., 284 Youngstrom, K. A., 10, 11
Wilcox, D. L., 159 Worsley, J., 261 Yuan, S. Y., 146
Wilcox, K., 156 Worth, J. H., 68 Yueh, B., 481
Wilcox, K. A., 214, 380 Worth, S., 472 Yuen, K., 33
Wilcox, M. J., 284, 375, 376 Worthington, D. W., 433 Yukizane, K., 91
Wilcox, S. A., 478 Wozniak, J., 208, 209 Yule, G., 364
Wiley, Terry L., 208, 527, 534–537, 559, 560 Wright, D., 419
Wilkinson, K., 277 Wright, G., 470 Z
Wilkinson, L. C., 308, 309 Wrisley, Diane M., 563–565 Zadunaisky-Ehrlich, S., 332
Wilks, J., 359 Wu, P., 494 Zago, L., 276
Williams, A. L., 198 Wulf, G., 96 Zahir, Z. M., 518
Williams, D., 297 Wulfeck, B., 231, 252, 253, 267, 395 Zajac, D. J., 36, 37, 170, 178
Williams, M., 136 Wundt, W. M., 272 Zalla, T., 304
Williams, M. J., 507 Wuyts, F. L., 6 Zamarripa, F., 222
Williams, R. S., 246 Wyatt, A., 388, 391 Zamuner, T., 345
Williams, S. C. R., 546 Wyatt, R., 456 Zander, L., 563
Williston, J. S., 429 Wyatt, T. A., 299, 318, 319, 322 Zandipour, M., 209
Willmes, K., 258 Wyckes, J., 258 Zargi, M., 36
Wilshire, C. E., 275, 364 Wyke, B. D., 15 Zatorre, R. J., 60, 275
Wilson, B., 162, 237, 324, 337, 338, 400, 402 Wynn, C., 295, 299 Zecker, S. G., 440
Wilson, B. S., 447, 449, 451 Wynne, Michael K., 520–522 Zee, D. S., 563, 565
Wilson, D. L., 135, 156, 184, 218 Zeisel, S., 560
Wilson, E., 133 X Zeisel, S. A., 135, 136, 358, 359, 360
Wilson, F. B., 78 Xu, L., 478 Zeitels, S., 7, 72
Wilson, J., 297, 299, 311 Xue, S., 132 Zelante, L., 478
Wilson, J. A., 29 Zelazo, P. R., 425
Wilson, K., 129 Y Zelefsky, M. J., 10
Wilson, L. G., 186 Yacullo, William S., 500–503 Zemlin, W. R., 472
Wilson, R., 560 Yairi, E., 180, 181, 220, 221, 333, 334 Zeng, F.-G., 434, 555
Wilson, W., 297 Yajima, Y., 61 Zenker, A., 84
Wilson, W. H., 58 Yamada, R. A., 440 Zenker, W., 84
Wilson, W. R., 425 Yamada, T., 440 Zenner, P., 133
Windsor, Jennifer, 324, 326–328 Yamaguchi, H., 89 Zentella, A. C., 321
Winemiller, D. R., 258 Yamashite, T., 38 Zerr, I., 145
Wing, L., 114 Yanagihara, N., 3 Zhang, C., 73, 77
Wingate, M., 333 Yano, S., 545 Zhang, S. P., 60, 61
Winholtz, W. S., 3 Yantis, P. A., 535 Zhang, W. R., 258
Winitz, H., 216 Yardley, L., 563 Zhang, X., 162, 309, 328, 329, 330, 350, 370,
Winner, E., 387, 390 Yarter, B. H., 141 399, 402
Winter, M., 434 Yaruss, J. Scott, 180–182, 335 Zhao, C., 77
Wise, R., 275 Yasuda, K., 243 Zhao, W., 77
Wise, R. J., 317 Yates, A. J., 246 Zheng, J., 511
Witt, Shelley A., 440, 450–454 Yates, C., 170 Zheng, X., 509
Wityk, R., 239 Yavas, Mehmet, 119–121 Zheng, Y., 61
Woestijne, K. V. D., 83 Ye, M., 43 Zidar, J., 434
Wohl, C., 283 Yeatman, J., 28 Zielhuis, G., 359
Woisard, V., 78 Yeend, I., 485 Ziev, M. S. R., 286
Wojtczak, J. M., 554 Yetiser, S., 563 Zilber, N., 126
Wolaver, A., 449 Yip, V., 280 Zilbovicius, M., 348
Wolf, G. T., 10 Yiu, E. M., 22 Zimlichman, R., 83
Wolf, M., 329, 531 Ylikoski, R., 293 Zimmerman-Phillips, S., 455, 456
Wolfe, V., 6, 20 Ylvisaker, M., 327 Zoccolotti, P., 389
Wolfe, V. I., 224 Yoder, D. E., 15, 18 Zoghabib, C., 283
Wolford, R. D., 449 Yoder, Paul J., 287, 375–377, 379 Zucman, C., 427
Wolfram, W., 158, 159, 294, 295, 296, 297, Yonclas, D. G., 375 Zurek, P. M., 511
321 Yoneda, S., 563 Zurif, E., 270
Wolters, E. C., 246 Yoon, C., 258 Zurif, E. B., 253, 271, 383, 384, 405
Wong, D., 77, 364, 381 York, J., 278 Zurif, E. G., 231
Wong, F. S., 80 Yorkston, K. M., 111, 113, 126, 129, 130, Zwicker, E., 3
Woo, P., 8, 34, 72, 73, 74, 88, 92 131, 202, 203 Zwislocki, J. J., 551
Wood, F. B., 329 Yoshida, M., 476 Zwitserlood, P., 393
Wood, L. A., 165 Yoshinaga-Itano, Christine, 336, 337, 421– Zwolan, T. A., 448, 455
Wood, M. H., 329 423, 515
Wood, N. W., 145 Yost, William A., 437–439, 509, 525–527
Wood, R., 38, 179 Yotsukura, Y., 91
Woodard, J. L., 164 Younes, A., 897
Woods, B., 312 Young, A., 162, 324, 400
Woodson, G., 89 Young, A. B., 31
Woodworth, G., 337, 338, 449 Young, E. C., 104
Subject Index
A ADSD (adductor spasmodic dysphonia), Allographic conversion, 234
AAA (American Academy of Audiology), on botulinum toxin for, 38–39 Allographic disorders, 235
screening for hearing loss, 495 Adults Allophonic variation, 174
AAC. See Augmentative and alternative auditory brainstem response in, 429–433 Alouatta spp, vocal production system in, 57,
communication (AAC) communication disorders in, 283–284 58
Aachener Aphasie Test (AAT), 253 voice therapy for, 88–90 Alport syndrome, hearing loss in, 479
AAE. See African-American English (AAE) Aerodynamic assessment ALS (amyotrophic lateral sclerosis)
AAVE (African American Vernacular of children’s voice, 36–37, 67–71 augmentative and alternative
English). See African-American English instrumentation for, 170 communication approaches in, 111
(AAE) of vocal function, 7–9 dysarthria due to, 126
Abdomen, in speech production, 83 Aerometry, in speech assessment, 170 Alstrom disease, hearing loss in, 479
Abdominal pneumobelts, speech production A¤ect, flat, 108 Alternating glottal airflow, 69
with, 226 A¤ective prosody, 108, 381 in children, 69–70
Abductor spasmodic dysphonia (ABSD), African-American children, language sound pressure level and, 69–70
botulinum toxin for, 38 disorders in, 318–320 Alternating motion rates (AMRs), in
ABI (auditory brainstem implant), 427– African-American English (AAE) dysarthria, 126
429 camouflaged forms in, 296 Alveolus, defects of, 46
ABR. See Auditory brainstem response contrastive patterns in, 295, 298–299 Alzheimer’s disease (AD)
(ABR) language acquisition milestones for, 318– agrammatism in, 231, 240
Absolute gradient, of tympanogram, 506 319 communication disorders in, 291–292
Abulic state, 145 phonologic issues with, 158–160 language impairments in, 240–241
Academics, language impairment and, 399 African American Vernacular English AM (akinetic mutism), 145
Accent Therapy, 90, 94 (AAVE). See African-American English American Academy of Audiology (AAA), on
ACDs (augmentative communication (AAE) screening for hearing loss, 495
devices), for aphasia, 255 Aging American National Standards Institute
ACG (anterior cingulate gyrus), in hearing dysfunction related to, 527–530 (ANSI)
vocalization, 59–61 masking-level di¤erence with, 490 on articulation index, 539
Acoustic admittance, 558–562 noise-induced hearing loss and, 510 on calibration of instruments, 496
in children, 504–505 voice disorders of, 72–74 on hearing protectors, 498, 499
in pseudohypacusis, 532 voice therapy for, 91–94 on noise-induced hearing loss, 508
Acoustical signal intensity, 413 Agrammatism, 231–232 American Sign Language (ASL), 339–343,
Acoustic assessment, of voice, 3–6 comparative aphasiology of, 265–268 423
Acoustic equivalent volume, 559, 560 ‘‘least pronunciation e¤ort’’ approach to, American Speech-Language-Hearing
Acoustic frequency perturbation, as measure 265–266 Association (ASHA)
of voice quality, 79 reversibility/mapping disorders in, 383–384 on audiometric symbols, 502
Acoustic immittance, 558–562 trace deletion hypothesis of, 231, 271, 407– on dialects, 297
in children, 504–505 409 FACS of, 283
in pseudohypacusis, 532 tree pruning hypothesis of, 405–407 on inclusion, 308
Acoustic injury thresholds, 508, 509 Agraphia, 233–235 on language disorders in African-American
Acoustic reflectometer, 507 AI (articulation index), 419, 538–540 children, 318
Acoustic reflex, 509, 521, 522 Aids, in augmentative and alternative on screening for hearing loss, 495, 561
Acoustics communication, 277 Aminoglycosides, ototoxicity of, 493, 518
of classroom, 442–443 Air-bone gaps, 502–503, 537 Amplitude compression, in hearing aids,
of voice, 63–67 Airflow 413–420
Acoustic source, during phonation, 63–67 alternating glottal, 69 and loudness, 413–416
Acoustic studies, of children’s voice, 36 average oral, 68 and masking, 416–417
Acoustic trauma, 510 measurement of, 7–8 multiband, 418–419
Acquired language disorders, semantic in children, 36–37, 67–70 physiology of, 417–418
development with, 396 Airflow open quotient, 68 and sound quality, 488
Actinomycosis, laryngeal, 33 in children, 69, 70 Amplitude modulation detection, 553, 554–
Action potential (AP), auditory nerve, 461, Air pressure 555
463, 464 intraoral, 68 AMRs (alternating motion rates), in
Activated words, 393 measurement of, 8, 9 dysarthria, 126
Actors, voice therapy for, 96–97 in children, 36–37, 68 Amyloidosis, laryngeal involvement in, 34
AD. See Alzheimer’s disease (AD) subglottal, 68 Amyotrophic lateral sclerosis (ALS)
Adaptation theory, of agrammatism, 232 Air volumes, measurement of, 7 augmentative and alternative
Adductor spasmodic dysphonia (ADSD), Airway resistance, laryngeal, 68 communication approaches in, 111
botulinum toxin for, 38–39 in children, 70–71 dysarthria due to, 126
Adjectives, in aphasic language, 266 and sound pressure level, 70–71 AN (auditory neuropathy), in children, 433–
Adjuncts, in argument structure, 270 Akinetic mutism (AM), 145 436
Admittance, 558–562 Alaryngeal voice, 10–12, 138–139 Analgesics, ototoxicity of, 519
in children, 504–505 Alcohol abuse, during pregnancy, 194–195 Anarthric mutism, 145–146
in pseudohypacusis, 532 hearing loss due to, 494 Anomia, in primary progressive aphasia, 247,
Adolescence, language disorders in, 327 Alexia, 236–239 248
Adoption studies, of speech disorders, Aliasing, 472 Anomic aphasia, 250
185 Allergies, laryngitis due to, 34 Anosognosia, in Wernicke’s aphasia, 252
604 Subject Index
Ansa cervicalis, hemilaryngeal reinnervation Apraxia of speech (AOS) Audiovisual Speech Feature Test, 452, 453
with, 43 developmental Auditory brainstem implant (ABI), 427–428,
ANSI. See American National Standards augmentative and alternative 429
Institute (ANSI) communication approaches with, 114 Auditory brainstem response (ABR)
Antecedent control, of target behaviors, 192 diagnostic criteria for, 121–123 in adults, 429–433
Anterior cingulate gyrus (ACG), in vs. motor speech involvement of unknown in auditory neuropathy, 433–434, 435
vocalization, 59–60, 61 origin, 142 electrocochleography of, 464, 465
Anterior digastric muscle, 16, 17 of known origin, 200–203 in functional hearing loss, 476
Antibiotics, ototoxicity of, 518–519 mutism in, 145 in pediatric test battery, 521, 522
Antidepressants nature and phenomenology of, 101–103 in pseudohypacusis, 533
for stroke, 257 and phonological paraphasia, 364–365 Auditory development, 424–426
for tinnitus, 557 treatment for, 104–106 Auditory inertia, 554
Antioxidants, in noise protection, 519 Apraxic agraphia, 235 Auditory-motor interaction, in speech and
AOS. See Apraxia of speech (AOS) Aprosodia, 107–109 language, 275–276
AP (action potential), auditory nerve, 461, Arabic, specific language impairment in, 332 Auditory nerve action potential, 461, 463,
463, 464 Arcuate fasciculus, in aphasia, 264 464
Apallic state, 145 Argument structure, 269–271 Auditory nervous system, processing in, 524
Aperiodicity source models, 5 Arthritis, rheumatoid, laryngeal involvement Auditory neuropathy (AN), in children, 433–
Apert syndrome, hearing loss in, 479 in, 34 436
Aphasia Articulation, instrumentation for assessment Auditory processing approach, 439
adjectives in, 266 of, 170–171 Auditory scene analysis, 437–439
agrammatic, 231–232, 383–384 Articulation disorders Auditory sensitivity, in infants, 424
trace deletion hypothesis of, 407–409 in aphasia, 367 Auditory streaming, 438
tree pruning hypothesis of, 405–407 description and classification of, 219 Auditory threshold, 430
anomic, 250 management of, 130 Auditory training, 208, 439–440
Broca’s vs. phonological disorders, 196 Augmentative and alternative communication
agrammatism in, 231–232, 408–409 Articulation index (AI), 419, 538–540 (AAC) approaches
connectionist models of, 262, 263 Articulation testing, 538 in adults, 110–112
global vs., 243 of children assessment for, 278
language deficits in, 249 with phonological errors, 214 in children, 112–114, 277–278
mapping deficit in, 271 with residual errors, 216–217 components of, 277
melodic intonation therapy for, 347–348 Articulation theory, 538, 539 defined, 277
phonological errors in, 367 Artificial laryngeal speech, 12 general issues with, 277–278
classical syndromes of, 249–251 Artificial larynges, 138 goal of, 278
computer-aided rehabilitation for, 254–256 Arytenoid cartilages, 14 indications for, 277
conduction, 250, 263, 275 in voice acoustics, 64 for mental retardation, 353
phonological errors in, 367 ASD (autistic spectrum disorders), pragmatic and speech development, 278
fluent vs. nonfluent, 247–248 impairment in, 373 Augmentative communication devices
functional approaches to, 283–284 ASHA. See American Speech-Language- (ACDs), for aphasia, 255
global, 243–244, 249 Hearing Association (ASHA) Autism, 115–118
jargon, 252 Asian-Pacific American children, speech and augmentative and alternative
mixed, 247 language issues in, 167–169 communication approaches with, 114
mutism in, 145 ASL (American Sign Language), 339–343, comorbidities with, 116
pharmacological approaches to, 257–259 423 diagnostic criteria for, 116–117
phonological analysis of language disorders Aspiration noise, 65 genetics of, 116
in, 363–365, 366–368 Aspiration pneumonia, due to dysphagia, incidence and prevalence of, 117
primary progressive, 245–248 132, 133 intelligence in, 115
prosodic disorders in, 367 Aspirin, ototoxicity of, 519 semantic development with, 395–397
psychosocial issues with, 260–261 Assertion, language impairment and, 399– and speech in children, 140–141
speech perception in, 367–368 400 treatment for, 117–118
speech production in, 366–367 Assessment of Phonological Processes– Autistic spectrum disorders (ASD), pragmatic
striatocapsular, 315 Revised (APP-R), 214 impairment in, 373
subcortical, 314–315 Assimilatory processes, 175
thalamic, 315 Ataxia, Friedreich’s, electromyography of, B
transcortical 170 Background noise, in classroom, 442
motor, 249–250, 263 Ataxic dysarthria, 127, 128 Back telemetry, in cochlear implant, 448
sensory, 250, 263 Attack, and sound quality of hearing aid, 488 Bacterial laryngitis, 33
Wernicke’s, 252–253 Attention Balance disorders, vestibular rehabilitation
agrammatism in, 231, 409 coordinated, 375 for, 565
argument structure in, 271 joint, 375 Baltimore, Maryland, dialect of, 125–126
connectionist models of, 262–263 and language, 272–273 ‘‘Bamboo nodes,’’ 34
global vs., 243 Attentional dyslexia, 237 Band articulation, 539
language deficits in, 250 Audibility, index of, 419 Bare-stem forms, 266
phonological errors in, 367 Audibility measures, 480 Basal ganglia, in language disorders, 315,
Aphasic syndromes, connectionist models of, Audiogram, 535 316, 317
262–265 in auditory neuropathy, 435 Baselines, 192
Aphasiology, comparative, 265–268 Audiometer, 534 Baserates, 192
Aphonia, functional calibration of, 496 Bats, vocal production system in, 57, 58
direct therapy for, 49–51 Audiometric zero, 534 BBTOP (Bernthal-Bankson Test of
etiology of, 27–29 Audiometry, 534–537 Phonology), 214
Appalachian dialect, 125 behavioral observation, 521 BDAE (Boston Diagnostic Aphasia
APP-R (Assessment of Phonological suggestion, 476 Examination), 253
Processes–Revised), 214 visual reinforcement, 521 ‘‘Be,’’ copula vs. auxiliary, 295, 297
Subject Index 605
Beaver Dam study, presbyacusis in, 527 Broca’s aphasia Children

Behavioral approaches agrammatism in, 231–232, 408–409 African-American, language disorders in,
to dysarthria, 130–131 connectionist models of, 262, 263 318–320
to speech disorders in children, 192–193 global vs., 243 Asian-Pacific American, speech and
Behavioral disorders, with communicative language deficits in, 249 language issues in, 167–169
disorders, 161–163 mapping deficit in, 271 auditory development in, 424–426
Behavioral observation audiometry (BOA), melodic intonation therapy for, 347–348 auditory neuropathy in, 433–436
521, 522 phonological errors in, 367 cochlear implants in, 337–338, 454–456
Bekesy audiometry, 476 Broca’s area, 525 communication disorders in, 285–287
Belle indi¤erence, la, 186 in apraxia of speech, 101, 102 computer-based approaches to speech and
Benign paroxysmal positioning vertigo in connectionist model, 263, 264 language disorders in, 164–165
(BPPV), 467 in language development, 311 deaf
Bernthal-Bankson Test of Phonology in vocalization, 60 assessment and intervention for, 421–423
(BBTOP), 214 Bromocriptine, for aphasia, 258 language acquisition by, 336–338
Beta-hemolytic streptococcus, laryngitis due Bruhn’s method, 543 developmental disabilities in, inclusion
to, 33 BTX-A (botulinum toxin type A), for models for, 307–309
Bilingualism laryngeal movement disorders, 38–39 with focal lesions, language development in,
code mixing in, 280 Buccal pumping, 56 311–313
dominance in, 280 ‘‘Burp speech,’’ 138 functional hearing loss in, 475–477
and language impairment, 279–281 language and stuttering in, 333–335
in Latino children, 321 C language development in
speech development and, 211–212 CADL-2 (Communicative Activities of Daily with focal lesions, 311–313
normal development of, 279–280 Living), 283 with hearing loss, 422
phonology in, 279 Cajun English, 295, 296 otitis media and, 358–360
pragmatics in, 280 Caloric test, 469–471 plateaus in, 421
speech issues in, 119–121 Camouflaged forms, 296 language disorders in
syntax in, 280 Candidate words, 393 assessment of, 324–325, 327–328
transfer (interference) e¤ects in, 280 Candidiasis, laryngeal, 33 cross-linguistic studies of, 331–332
vocabulary in, 279–280 Canonical babbling morphosyntax and syntax with, 354–356
Binaural squelch e¤ect, 453 in deaf children, 336, 340–341 overview of, 326–328
Biofeedback, in voice therapy, 90 with mental retardation, 141 and reading disability, 329–330
Biological risk, 286 Canonical linking rules, 270 risk factors for, 327
Birds, vocal production system in, 57 Case marking, in children with language Latino
Birth-related risk factors, for speech disorders, 354 language disorders in, 321–322
disorders, 194–195 CAT (computer-assisted treatment), for speech issues in, 210–212
Birth weight, and speech disorders, 194 aphasia, 255 middle ear assessment in, 504–507
Black English. See African-American English Cat(s), purring by, 57 motor speech involvement in, 142–144
(AAE) Catecholamines, in aphasia, 257, 258, 259 orofacial myofunctional disorders in, 147–
Blending tasks, in phonological awareness, Categorization, in phonological awareness, 149
154–155 154 otoacoustic emissions in, 515–516
Blocks, in children, 180 Caudate nucleus, in language disorders, 315 phonological awareness intervention for,
Blood oxygen level dependent (BOLD) CBF (cerebral blood flow), regional, 305–306 153–155
signal, 306 CBT (cognitive-behavioral therapy), for with phonological errors, speech sampling,
BOA (behavioral observation audiometry), tinnitus, 557 articulation tests, and intelligibility in,
521, 522 CDA (clinical decision analysis), 444–447 213–214
Body configuration, for speech production, CDI (Communicative Development prosody in, 344–346
83 Inventory), 370 with residual errors, speech sampling,
Bone conduction vibrator, 534, 535 Ceiling Rate, 541 articulation tests, and intelligibility in,
Boston Diagnostic Aphasia Examination Cent(s), 526 215–217
(BDAE), 253 Central agraphia syndromes, 234–235 screening for hearing loss in, 495–497, 561
Botulinum toxin type A (BTX-A), for Central masking, 502 specific language impairment in (see Specific
laryngeal movement disorders, 38–39 Cerebellar mutism, 146 language impairment [SLI])
Bounding theory, in children with language Cerebellum, in language disorders, 315 speech assessment in, descriptive linguistic
disorders, 354 Cerebral blood flow (CBF), regional, 305– methods for, 174–175
Boyle’s law, 68 306 speech development in, with tracheostomy,
BPPV (benign paroxysmal positioning Cerebral palsy, augmentative and alternative 176–179
vertigo), 467 communication approaches in, 113 speech disfluency and stuttering in, 180–
Brain injury Cerebrovascular accident (CVA) 182
focal, and language development, 311–313 antidepressants for, 257 speech disorders in
traumatic aphasia due to (see Aphasia) behavioral approaches to, 192–193
augmentative and alternative augmentative and alternative communi- birth-related risk factors for, 194–195
communication approaches in, 111 cation approaches in, 111–112 cross-linguistic data on, 196–197
dysarthria due to, 126 dementia due to, 292–293 description and classification of, 218–219
mutism in, 146 depression after, 258 descriptive linguistic approaches to, 198–
Brainstem reflexes, in auditory neuropathy, dysarthria due to, 126 199
436 dysphagia due to, 132–133 motor, of known origin, 200–203
Brainstem stroke, augmentative and global aphasia due to, 243–244 psycholinguistic perspective on, 189–191
alternative communication approaches in, Cervus elaphus, vocal production system in, speech-language approaches to, 204–206
111–112 58 speech in
Breathing, during singing, 51–52 CETI (Communication E¤ectiveness Index), augmentative and alternative communi-
Breathing exercises, in voice therapy, 82– 283 cation approaches to, 112–114, 277–278
84 Chemotherapy agents, ototoxicity of, 512, mental retardation and, 140–141
Briess Exercises, 86 518 phonetic transcription of, 150–152
606 Subject Index
Children (continued) Coherence, of discourse, 300 Continuous interleaved sampling (CIS),

test battery approach in, 520–522 Cohesion, textual, 300–301, 303 449
voice disorders in, instrumental assessment Cohesive ties, 300 Continuous perseveration, 362
of, 35–37, 67–71 Coma vigil, 145 Continuous positive airway pressure, for
Chlordiazepoxide, for aphasia, 258 Commenting, 375 resonance problems, 130
‘‘Chokers,’’ 97 Communication Continuous reinforcement schedule, 192
Chromosomal abnormalities, hearing loss due defined, 277 Contralateral masking, during hearing test,
to, 477–478 functions, 283 500–503
CI (contact index), 25 right hemisphere in, 386–387 Contrastive patterns, 295, 297–299
Cicatricial pemphigoid, laryngeal Communication disorders Conversation, supported, 284
involvement in, 34 in adults, 283–284 Conversational coaching, 284
Cigarette smoking, and voice disorders in in infants and toddlers, 285–287 Conversational discourse
elderly, 73 Communication E¤ectiveness Index (CETI), analysis of, 300–301
Circumlaryngeal massage, 49, 93 283 of children with focal lesions, 312
CIS (continuous interleaved sampling), 449 Communication skills, in Down syndrome, impairments of, 302–304
Cisplatin, ototoxicity of, 512, 518 288–291 levels of processing of, 302–304
Classroom acoustics, 442–443 Communicative Activities of Daily Living with right hemisphere lesions, 387, 390–
Classroom-based service delivery, 308 (CADL-2), 283 391
Cleft palate, sociobehavioral consequences of, Communicative Development Inventory Conversion deafness, 475–477
163 (CDI), 370 Conversion disorders, 28, 186
Clinical decision analysis (CDA), 444–447 Community-based service delivery, 308 Cooperative principle, 301
Clinical Test of Sensory Interaction and Comparative aphasiology, 265–268 Coordinated attention, 375
Balance (CTSIB), 564 Competing words, 393 Corpus striatum, in language disorders,
Cloze procedure, 379 Complementizer Phrase (CP), of deaf child, 315
CL/PSA (computerized language and 337 Correction rejection rate (CR), 445
phonological sample analysis), 164–165 Complete recruitment, 415 Corrective feedback, 193
CM (cochlear microphonic) component, Compliance, language impairment and, 399 Cortical analysis, in physiology of hearing,
461 Comprehensibility, with dysarthria, 131 525
in auditory neuropathy, 434–436 Compression, amplitude. See Amplitude Corticosporin otic solution, ototoxicity of,
Coaching, conversational, 284 compression 518
Coarticulatory e¤ects, 393 Compression ratio, and sound quality of COT (computer-only treatment), for aphasia,
Cocaine, hearing loss due to, 494 hearing aid, 488 255
Cochlea, physiology of, 523–524 Computer-aided rehabilitation, for aphasia, ‘‘Cover symbols,’’ 150
Cochlear amplifier, 418 254–256 CP (Complementizer Phrase), of deaf child,
Cochlear duct, physiology of, 523 Computer-assisted treatment (CAT), for 337
Cochlear fluids, physiology of, 522–523 aphasia, 255 CQ (contact quotient), 25–26
Cochlear hearing loss Computer-based approaches, to speech and Craniofacial anomalies, hearing loss with,
auditory brainstem response in, 432 language disorders in children, 164– 477–479
masking-level di¤erence with, 490–491 165 Craniofacial dysostosis, hearing loss in, 479
Cochlear implant(s), 447–449 Computerized Assessment of Intelligibility Cricoarytenoid joint, 14–15
in adults, candidacy for, 449, 450–454 in Dysarthric Speakers, 126 Cricoarytenoid muscles, 16, 18, 19
analog vs. pulsatile stimulation in, 448 Computerized language and phonological Cricoid cartilage, 14
for auditory neuropathy, 436 sample analysis (CL/PSA), 164–165 Cricopharyngeal muscle, in alaryngeal voice,
in children, 337–338, 454–456 Computer-only treatment (COT), for 11
coding strategy for, 448–449 aphasia, 255 Cricothyroid joint, 14
components of, 447–448 Concept center, 264 Cricothyroid (CT) muscle
defined, 447 Concord, in children with language disorders, anatomy of, 16, 17, 18, 19
function of, 447 354 in voice production, 75, 76, 77
history of, 447 Conduction aphasia, 250, 263, 275 Criterion, for test, 446
monaural vs. binaural, 451, 452 phonological errors in, 367 Criterion-referenced assessments, of language
monopolar vs. bipolar stimulation in, 448 Conductive hearing loss disorders in school-age children, 327
multi-electrode arrays in, 448 audiogram of, 536, 537 Critical band(s), 413
speech comprehension after, 449 auditory brainstem response in, 431 Critical band masking, 416, 539
speech recovery after, 208–209 masking-level di¤erence with, 491–492 Cross-hearing, 500, 501
Cochlear inner hair cells. See Inner hair cells Confidential voice technique, 93 Cross-linguistic data
(IHCs) Connected Discourse Tracking, 541–542 on language impairment in children, 331–
Cochlear microphonic (CM) component, Connectionist models, of aphasic syndromes, 332
461 262–265, 364 on speech disorders in children, 196–197
in auditory neuropathy, 434–436 Conservation laryngectomy, voice Cross-spectral processing, 438
Cochlear nonlinearity, 417–418 rehabilitation after, 80–81 Crouzon syndrome, hearing loss in, 479
Cochlear outer hair cells. See Outer hair cells Consistent deviant disorder, 219 CTSIB (Clinical Test of Sensory Interaction
(OHCs) Consonant cluster reduction, 345–346 and Balance), 564
Cochlear summating potential, 461 Consonant production, 143 CVA. See Cerebrovascular accident (CVA)
Cochleotoxicity, 518, 519 Constrained plasticity, 311 Cx26 mutations, hearing loss due to, 478
Cockayne syndrome, hearing loss in, 479 Consultative model of service delivery, 308 Cycles approach, 199, 205
Coda deletion, 345–346 Contact index (CI), 25
Code mixing, bilingual, 280 Contact phase, 24, 25 D
Cognitive-behavioral therapy (CBT), for Contact quotient (CQ), 25–26 DAS. See Developmental apraxia of speech
tinnitus, 557 Contextual systemic approach, for (DAS)
Cognitive disturbances, with tinnitus, 556– speechreading, 543 Deafness
557 Contextual Test of Articulation, 217 in children, assessment and intervention for,
Cognitive-linguistic approach, to speech Continuity thesis, of paraphasia, 364 421–423
disorders in children, 205 Continuous Discourse Tracking, 541–542 conversion, 475–477
Subject Index 607
language acquisition with Dichotic listening, 458–460 mutism in, 145–146

for English, 336–338 Digastric muscles, 16, 17 in Parkinson’s disease, 30–31, 126
sign, 339–343 Dinosaurs, vocal production system in, 58 spastic, 127, 130
Michel, 479 Directional preponderance (DP), 470–471 temporary mutism followed by, 146
Mondini, 479 Direct selection, in augmentative and unilateral upper motor neuron, 127
pure word, 252, 263 alternative communication, 277 Dysgraphia, 233–235
Scheibe, 479 Direct service delivery, 308 Dyslexia
Decision matrix, 444 Direct voice therapy acquired, 238
Declarative, 375 for functional dysphonia, 49–51 attentional, 237
Deep agraphia, 234, 235 for neurological aging-related voice deep, 238, 239
Deep alexia, 238, 239 disorders, 93–94 neglect, 237
Deep dyslexia, 238, 239 Discourse phonological, 238
Deer, vocal production system in, 58 analysis of, 300–301 surface, 238
Deficits of linguistic competence, 273 of children with focal lesions, 312 Dysphagia, oral and pharyngeal, 132–134
Dehydration, vocal hygiene for, 55, 89 impairments of, 302–304 Dysphonia(s)
Deictic terms, in autism, 116 levels of processing of, 302–304 functional
Delayed phonological acquisition, 219 right hemisphere in, 387, 390–391 direct therapy for, 49–51
Deletion, in phonological awareness, 155 Displaced arguments, 270 etiology of, 27–29
Dementia, 291–293 Displays, in augmentative and alternative muscular tension
due to Alzheimer’s disease, 291–292 communication, 277 botulinum toxin for, 38
due to Lewy body disease, 292 Distinctive features therapy, 198 voice handicap assessment in, 22
mutism in, 146 Distortion(s), as residual phonological errors, paradoxical breathing dysphonia, botulinum
Parkinson’s, 293 156, 157 toxin for, 38
vascular, 292–293 Distortion product otoacoustic emissions spasmodic
Depression (DPOAEs), 511, 512–513 acoustic assessment of, 4–5
speech disorders with, 186 in children, 515, 516 botulinum toxin for, 38–39
after stroke, 258, 260–261 Distributed Morphology, 356 laryngeal reinnervation for, 43
with tinnitus, 557 Distribution, of sounds, 174 Dyspraxia. See Developmental apraxia of
Descriptive linguistic approaches, for speech Diuretics, ototoxicity of, 519 speech (DAS)
disorders in children, 174–175, 198–199 Dix-Hallpike test, 467
Determination of stimulability, 214, 217 Dizziness, vestibular rehabilitation for, 519, E
Determiner Phrase (DP), of deaf child, 337 563–565 Ear, physiology of, 522–525
Developmental apraxia of speech (DAS) Dodd’s classification system, 219 Ear advantage, 458–460
augmentative and alternative Doerfler-Stewart test, 476 Ear canal volume, 505
communication approaches with, 114 Dopamine, in aphasia, 257, 258 Eardrops, ototoxicity of, 518–519
diagnostic criteria for, 121–123 Down syndrome Eardrum perforation, tympanometry of, 561,
vs. motor speech involvement of unknown communication skills in, 288–291 562
origin, 142 hearing loss in, 288, 289, 478 Early specialization, 311
Developmental articulatory dyspraxia. See intelligibility in, 288 Early Speech Perception Monosyllable Word
Developmental apraxia of speech (DAS) memory deficits in, 289 test, 456
Developmental delay, risk for, 286 motor limitations in, 289 Ear-monitoring task, 458
Developmental disabilities semantic development with, 396 Earmu¤s, 498, 499
inclusion models for, 307–309 and speech, 140, 141 Earphones, 500–501, 534, 535
prelinguistic communication intervention visual deficits in, 289 Earplugs, 497–498, 499
for, 375–377 DP (directional preponderance), 470–471 E-A-RTONE 3A earphones, 501
Developmental language disorders, semantic DP (Determiner Phrase), of deaf child, 337 Ebonics. See African-American English
development with, 395–397 DPOAEs (distortion product otoacoustic (AAE)
Developmental phonological disorders, 156 emissions), 511, 512–513 Echolalia, in autism, 116
Developmental Sentence Score (DSS), 298 in children, 515, 516 ECochG (electrocochleography), 461–465
and stuttering, 334 Drill and practice exercises, for aphasia, 254– Edema, Reinke’s, and voice disorders in
Developmental verbal dyspraxia. See 255 elderly, 72–73
Developmental apraxia of speech (DAS) Drug abuse, during pregnancy, 194–195 EEG (electroencephalography), 305
Dextro-amphetamine, for aphasia, 257, 258 Drug-related hearing loss, 493–494 E‰cency (EF), 447
Diacritics, in phonetic transcription of Drug treatment, for aphasia, 257–259 EGG (electroglottography), 23–26
children’s speech, 151 DSM-IV-TR (Diagnostic and Statistical of children’s voice, 27
Diagnostic and Statistical Manual of Mental Manual of Mental Disorders-IV-TR) Eight-speaker localization test, 453
Disorders-IV-TR (DSM-IV-TR) classification, 219 Elderly
classification, 219 DSS (Developmental Sentence Score), 298 hearing dysfunction in, 527–530
Dialect(s) and stuttering, 334 masking-level di¤erence with, 490
African-American (see African-American Dual task paradigm, 272 noise-induced hearing loss and, 510
English [AAE]) Duck-billed dinosaur, vocal production voice disorders of, 72–74
defined, 294 system in, 58 voice therapy for, 91–94
di¤erentiation of, 295–296 Dutch, speech disorders in English vs., 197 Electrical stimulation, for tinnitus, 557
vs. disorder, 297–299 Dysarthria(s) Electrocochleography (ECochG), 461–465
factors a¤ecting use of, 296 ataxic, 127, 128 Electrode(s)
of Latino children, 321 characteristics and classification of, 126– in cochlear implant, 448
regional, 124–126 128 tympanic membrane, 462
of Spanish, 210 in children, 201 Electroencephalography (EEG), 305
standardness (acceptability) of, 294–295 flaccid, 127, 128 Electroglottography (EGG), 23–26
Dialect continuum, 125 hyperkinetic, 127 of children’s voice, 27
Dialect speakers, 294–296 hypokinetic, 127, 128 Electrolaryngography, 23–26
Diaphragm, in speech production, 83 management of, 129–131 Electrolarynx, 12, 138
Diazepam, for aphasia, 257 mixed, 127 Electromagnetic articulography (EMA), 171
608 Subject Index
Electromyographic biofeedback, in voice Expressive phonological impairments (EPIs), Frequency compression, 471–474
therapy, 90 in children, phonological awareness Frequency importance function, 539
Electromyography (EMG) intervention for, 153–155 Frequency lowering, 471–474
intramuscular, of dysphagia, 134 Extended IPA (extIPA), 150, 151 Frequency resolution, in auditory
in speech assessment, 169–170 External auditory meatus, in noise-induced development, 424
Electronystagmography (ENG), 467–471 hearing loss, 509 Frequency response, and sound quality of
Electro-oculography (EOG), 467 External frame function, 17 hearing aid, 487
Electropalatography (EPG), 143, 170–171 Extratympanic electrocochleography (ET Frequency shifting, 472
ELH (endolymphatic hydrops), ECochG), 462 Frequency transposition, 473
electrocochleography of, 463–465 Extrinsic laryngeal muscles, 15–17 Frequency warping, 473–474
Elkonin cards, 154 Eye-monitoring studies, of speechreading, Fricative consonants, 143
EMA (electromagnetic articulography), 171 544–546 Friedreich’s ataxia, electromyography of,
Embedded sentences, in agrammatism, 406 170
EMG (electromyography) F Frogs, vocal production system in, 56–57
intramuscular, of dysphagia, 134 f0 . See Fundamental frequency ( f0 ) FR (fixed ratio) schedule, 192
in speech assessment, 169–170 FACS (Functional Assessment of Functional aphonia, 27–29
EMG (electromyographic) biofeedback, in Communication Skills), 283 Functional approaches, to aphasia, 283–284
voice therapy, 90 Fading, 193 Functional assessment, 283
Emotional e¤ects Failure of fixation suppression, 471 Functional Assessment of Communication
of aphasia, 260–261 False alarm rate (FA), 444–445 Skills (FACS), 283
of tinnitus, 556–557 False negative rate, 444, 445 Functional brain imaging, 305–307
Emotional processing deficits, with right False positive rate, 444–445 after early injury, 313
hemisphere lesions, 389 False vocal folds, in phonation, 77 of subcortical involvement, 317
Emotional prosody, 108, 381 Familial aggregation, of speech disorders, Functional communication, 283
Emphatic stress, in aprosodia, 109 184 Functional Communication Profile, 283
Employment, after stroke, 261 Family-centered service delivery model, 285, Functional communication skills,
Endolymphatic hydrops (ELH), 286–287 development of, 312
electrocochleography of, 463–465 Family pedigree Functional dysphonia
Endoscopy for speech disorders, 184 direct therapy for, 49–51
of dysphagia, 133–134 for stuttering, 221 etiology of, 27–29
in speech assessment, 170 Farsi, comparative aphasiology of, 267 hypertensive, 73
ENG (electronystagmography), 467–471 Feedback Functional hearing loss (FHL), 531–533
English for apraxia of speech, 105–106 in children, 475–477
acquisition by deaf child of, 336–338 corrective, 193 Functional impact, of voice disorders, 20–23
African-American (see African-American Fetal alcohol syndrome, 195, 494 Functional magnetic resonance imaging
English [AAE]) FHL (functional hearing loss), 531–533 (fMRI), 306–307
Cajun, 295, 296 in children, 475–477 after early brain injury, 313
manually coded, 342 Filler-gaps, 270 of subcortical involvement, 317
Southern White, 297, 298–299 Film techniques, for speechreading, 543 Functional overlay, 475
Standard American, 318–320 Finger counting, 105 Functional Reach Test, 564
Environmental humidification, in vocal Finger spelling, 342 Functional residual capacity (FRC), during
hygiene, 55 Fixation suppression, failure of, 471 singing, 52
Environmental Protection Agency (EPA), Fixed ratio (FR) schedule, 192 Functional voice disorders, 27–29
498, 499 Flaccid dysarthria, 127, 128 Fundamental frequency ( f0 ), 3–5, 69, 75–76
Environmental risk, 286 Flat a¤ect, 108 in children, 36, 70, 71
EOAEs (evoked otoacoustic emissions), 496 Floor of mouth, defects of, 46 with hearing loss, 208
EOG (electro-oculography), 467 Flow phonation, 52 pitch of missing, 438
EPG (electropalatography), 143, 170–171 fMRI (functional magnetic resonance sex di¤erences in, 36
Epiglottis, 14 imaging), 306–307 of transsexuals, 224–225
EPIs (expressive phonological impairments), after early brain injury, 313 Fungal laryngitis, 33
in children, phonological awareness of subcortical involvement, 317 Furosemide, ototoxicity of, 519
intervention for, 153–155 Focal lesions, language development with,
Equipotentiality, 311 311–313 G
Equivalence classes, in auditory development, Focused stimulation, 379–380 Gain
425 Foot, 345 as function of frequency, in prescriptive
Equivalent volume, 505 Formant frequencies fitting of hearing aid, 482
ER-3A earphones, 501 evolution of, 57 primary and secondary, 28, 187
ERP (event-related potential) design, 306 during singing, 51, 53–54 in pseudohypacusis, 531
Esophageal speech, 10, 11, 138 Fragile X syndrome, and speech, 140, 141 Gait assessment, for vestibular disorders,
Esophageal voice, 138 Frame, 303 564
Established risk, 286 Framingham Heart Study, presbyacusis in, g-aminobutyric acid (GABA), in aphasia,
ET ECochG (extratympanic electro- 527, 528 257, 258
cochleography), 462 FRC (functional residual capacity), during Gastroesophageal reflux disease (GERD)
Ethacrynic acid, ototoxicity of, 519 singing, 52 laryngitis due to, 34
Ethyl alcohol, hearing loss due to, 494 French, speech disorders in English vs., 196 vocal hygiene for, 55–56
Event-related potential (ERP) design, 306 Frequency and voice disorders in elderly, 73
Evoked otoacoustic emissions (EOAEs), 496 fundamental, 3–5, 69, 75–76 Gaze stabilization exercises, 520
Exaggerated hearing loss, 531–533 in children, 36, 70, 71 Gaze tests, 467–469
in children, 475–477 with hearing loss, 208 Gender marking, in children with language
Exercise, and laryngeal performance, 73 pitch of missing, 438 disorders, 354
Expert service delivery model, 285, 286 sex di¤erences in, 36 Gender reassignment, and speech di¤erences,
Expert systems, for aphasia, 255 of transsexuals, 224–225 223–225
Expressive aprosodia, 108 place coding of, 523 Generic small talk, 113
Subject Index 609
Genetic transmission Hearing Handicap for the Elderly instrument, Hyperfunctional phonation, 52
of hearing loss, 477–479 528 Hyperkinetic dysarthria, 127
of speech disorders, 183–185 Hearing level (HL) dial, 534 Hyperkinetic mutism, 145
of stuttering, 221 Hearing loss Hyperlexia, 397
Geniohyoid muscle, 16 acquired in adulthood, speech disorders due Hyperphonia, in dysarthria, 130
Genotype, 478 to, 207–209 Hypertensive dysphonia, and voice disorders
Gentamicin, ototoxicity of, 518, 519 aging-related, 527–530 in elderly, 73
GERD. See Gastroesophageal reflux disease masking-level di¤erence with, 490 Hyperthyroidism, and voice disorders in
(GERD) noise-induced hearing loss and, 510 elderly, 73
GJB2 mutations, hearing loss due to, 478 due to auditory neuropathy, 433–436 Hypoadduction, voice therapy for, 88, 89
Glide, in voice exercises, 86–87 in children, assessment and intervention for, Hypoarousal, with right hemisphere lesions,
Globus pallidus, in language disorders, 315 421–423 389
Glottal acoustic source, 63–67 cochlear Hypofunctional phonation, 52
Glottal adduction, physics and physiology of, auditory brainstem response in, 432 Hypoglossal nerve, laryngeal reinnervation
75, 76 masking-level di¤erence with, 490–491 with, 43
Glottal adduction force, during singing, 52 conductive Hypokinetic dysarthria, 127, 128
Glottal airflow, 76, 77 audiogram of, 536, 537 Hypokinetic laryngeal movement disorders,
alternating, 69 auditory brainstem response in, 431 30–31
in children, 69–70 masking-level di¤erence with, 491–492 Hypophonia
sound pressure level and, 69–70 in Down syndrome, 288, 289, 478 in dysarthria, 130
Glottal chink, 66 drug- or chemical-related, 493–494 in parkinsonism, 31
Glottal flow, Liljencrants-Fant model of, 5 due to early recurrent otitis media, 135, Hypothyroidism, and voice disorders in
Glottal incompetence, in parkinsonism, 31 358–360 elderly, 73
Glottal open quotient, 23 functional (nonorganic, psychogenic, Hypsingnathus monstrosus, vocal production
Glottal-to-noise ratio, 6 exaggerated), 531–533 system in, 57, 58
Glottal vibration, 65 in children, 475–477
Glottal volume velocity, 63, 64 vs. hearing handicap, 536–537 I
Glottal volume velocity waveform, 8 hereditary, 477–479 IA (interaural attenuation), 500–501
Glottis, 15 high-frequency, frequency compression for, ICCs (interagency coordinating councils),
pressure profiles within, 76 471–474 285–286
in vocal production, 57 language development with, 422 ICF (International Classification of
Glottograms, 23 and masking-level di¤erence, 489–492 Functioning, Disability and Health), 283
Glottographic waveforms, 23 mixed, audiogram of, 536, 537 Iconic communication, 277
Goldenhar syndrome, hearing loss in, 479 noise-induced, 496, 508–510 IDEA (Individuals with Disabilities
Goldman-Fristoe Test of Articulation, 216 pure-tone, in auditory neuropathy, 433, 434 Education Act), 194, 307–308
Grammatical development, socioeconomic retrocochlear, auditory brainstem response IEP (Individualized Educational Plan), 308
status and, 370 in, 432 IFSP (Individual Family Service Plan), 287
Granulomatosis, Wegener’s, laryngeal screening for IHAFF procedure, for prescriptive fitting of
involvement in, 34 in newborns, 421–422 hearing aid, 483, 484
Grapheme-to-phoneme conversion (GPC), in school-age child, 495–497 IHC (inner hair cell) function, in auditory
236 sensorineural neuropathy, 434
Graphemic bu¤er, 233–234 audiogram of, 536, 537 IHCs (inner hair cells), 418
Graphemic bu¤er agraphia, 235 signal-to-noise ratio with, 442 in noise-induced hearing loss, 509
GRBAS protocol, 78 speech development with, 422 physiology of, 523, 524
Grief, due to aphasia, 260 syndromes associated with, 478–479 Imaginative play, delays in, in autism, 117
Growth factors, for aphasia, 258 Hearing protection devices (HPDs), 497–499 Imaging, functional brain, 305–307
Hearing tests, 534–537 after early brain injury, 313
H Hemifacial microsomia, hearing loss in, 479 of subcortical involvement, 317
Half-octave shift e¤ect, 417 Hemilaryngeal reinnervation, with ansa Immittance, 504–505
Hammerhead bat, vocal production system cervicalis, 43 in pseudohypacusis, 532
in, 57, 58 Hemophilus influenzae, laryngitis due to, 33 Impaired neuromuscular execution, writing
Hand tapping, in melodic intonation therapy, Herpes simplex infection, vocal cord paralysis disorders due to, 235
348 due to, 33 Imperatives, 375
Hard palate, defects of, 46 Herpes zoster infection, vocal cord paralysis Inclusion models, for developmental
Head shadow e¤ect, 453 due to, 33 disabilities, 307–309
Hearing, 411–567 High-frequency hearing loss, frequency Income, e¤ect on language of, 369–371
cross-, 500, 501 compression for, 471–474 Inconsistency, assessment of, 217
physiologic bases of, 522–525 Hit rate (HT), 444, 445, 446 Inconsistent disorder, 219
spatial Holistic techniques, of voice therapy, 85–87 Independent analyses, 213
in auditory development, 425–426 Home talk, 113 Index of audibility, 419
evaluation of, 453 Hood method, for masking, 502 Index of laterality, 459
visual, 543 Howler monkeys, vocal production system in, Index of Productive Syntax (IPSyn), 298
Hearing aid(s) 57, 58 Indicating, 375
amplitude compression in, 413–420 HPDs (hearing protection devices), 497–499 Individual Family Service Plan (IFSP), 287
and loudness, 413–416 HT (hit rate), 444, 445, 446 Individualized Educational Plan (IEP), 308
and masking, 416–417 Human papillomavirus (HPV), and laryngeal Individuals with Disabilities Education Act
multiband, 418–419 papillomatosis, 33 (IDEA), 194, 307–308
physiology of, 417–418 Humidification, in vocal hygiene, 55 Infants
auditory training with, 208, 440 Hunter-Hurler syndrome, hearing loss in, auditory development in, 424–426
evaluation of outcomes of, 480–481 479 communication disorders in, 285–287
prescriptive fitting of, 482–485 Hydration, in vocal hygiene, 55, 89 high-frequency tympanometry in, 507
sound quality with, 487–488 Hyoid bone, 14 with tracheostomy, speech development in,
Hearing handicap, vs. hearing loss, 536–537 Hyperadduction, voice therapy for, 88 176–179
610 Subject Index
Infant-Toddler Meaningful Auditory IPA (International Phonetic Alphabet), 150, and reading disability, 329–330
Integration scale, 455 151 right hemisphere, 388–391
Infectious diseases, of larynx, 32–34 IPSyn (Index of Productive Syntax), 298 social development and, 398–400
Inferior laryngeal nerve, 17 ISO (International Organization for subcortical involvement in, 314–317
Inferior parietal lobe, in aphasia, 263–264 Standardization), on noise-induced Laryngeal air sacs, 57
Inflammatory processes, of larynx, 34 hearing loss, 508 Laryngeal airway pressure, in children, 36–37
in elderly, 73 Isotretinoin, hearing loss due to, 493 Laryngeal airway resistance, 68
Inflection(s), in agrammatism, 406 Italian, speech disorders in English vs., 196, in children, 70–71
Inflectional Phrase (IP), of deaf child, 337 197 and sound pressure level, 70–71
Infrahyoid muscles, 15–17 Laryngeal cancer, 137–138
Inheritance patterns, of hearing loss, 478 J Laryngeal cavity, 15
Inner hair cell (IHC) function, in auditory Jargonagraphia, in Wernicke’s aphasia, 252 Laryngeal dystonia, dysarthria due to, 130
neuropathy, 434 Jargon aphasia, 252 Laryngeal movement disorders
Inner hair cells (IHCs), 418 Jena method, 543 botulinum toxin for, 38–39
in noise-induced hearing loss, 509 Jervell and Lange-Nielsen syndrome, hearing hypokinetic, 30–31
physiology of, 523, 524 loss in, 479 Laryngeal muscle(s), 15–17, 18, 19
Insert earphones, 501 Jitter, 3–5 Laryngeal muscle exercises, 86–87
Inspiratory checking, for dysarthria, 130 Joint attention, 375 Laryngeal muscle weakness, in post-polio
Insu¿ation, in esophageal speech, 11 syndrome, 33
Insula, precentral gyrus of, in apraxia of K Laryngeal nerves, 17
speech, 102 Kanamycin, ototoxicity of, 518, 519 reinnervation of, 41–43
Integral stimulation and phonetic placement, Kay Elemetrics nasometer, 170 Laryngeal papillomatosis, 33
104 Kay Elemetrics Sonography, 171 Laryngeal paralysis
Intellectual disability, augmentative and Khan-Lewis Phonological Analysis (KLPA), due to herpes simplex and herpes zoster
alternative communication approaches 214 infections, 33
with, 113–114 K’iche’, speech disorders in English vs., 197 reinnervation for, 42–43
Intelligibility Kinzie’s method, 543 Laryngeal prominence, 14
of African-American Vernacular English, Klebsiella scleromatis, laryngeal infection Laryngeal reinnervation, 41–43
158–159 with, 33 Laryngeal reposturing techniques, 49–50
of children Klippel-Feil syndrome, hearing loss in, 479 Laryngeal transplantation, 43
deaf, 336 ‘‘Knoll’’ glide, 86–87 Laryngeal trauma, 34, 45
with phonological errors, 214–215 Knowledge of performance (KP), for apraxia inflammatory processes due to, 34
with residual errors, 217 of speech, 105 and voice disorders in elderly, 73
of dialect, 124–125 Knowledge of results (KR), for apraxia of Laryngeal tuberculosis, 33
in Down syndrome, 288 speech, 105 Laryngeal tumors, viral infections and, 33
with dysarthria, 129–131 KTH Speech Tracking Procedure, 541 Laryngectomy
with hearing aid, 480 partial or conservation, 80–81
index of, 538, 539 L speech rehabilitation after, 10–12
Intelligibility drills, for articular problems, La belle indi¤erence, 186 total, 137–139
130 Lacunar stroke, dysarthria due to, 126 Laryngitis
Intensity Language, 229–410 allergic, 34
cycle-to-cycle perturbations of, 3–5 agrammatism of, 231–232 bacterial, 33
of voiced sounds, 76 attention and, 272–273 chronic, 34
Intensity resolution, in auditory development, auditory-motor interaction in, 275–276 fungal, 33
425 of deaf child sicca, 73
Intentional communication, 375 English as, 336–338 Laryngopharyngeal reflux, vocal hygiene for,
Interagency coordinating councils (ICCs), sign, 339–343 55–56, 89
285–286 poverty e¤ects on, 369–371 Laryngotracheitis, viral, 32–33
Interarytenoid muscles, 16, 18, 19 right hemisphere in, 386–387 Larynx(ges)
Interaural attenuation (IA), 500–501 sign, 339–343, 423 aging impact on, 91–92
Interference e¤ects, in bilingualism, 280 and stuttering, 333–335 anatomy of, 13–19
Interjections, in children, 180 Language acquisition, in African-American cartilaginous skeleton in, 14–15
Intermittent voice breaks, botulinum toxin English, 318–319 developmental, 36
for, 38 Language delay, vs. language deviance, 327 regional relationships in, 14
International Classification of Functioning, Language development artificial (electro-), 12, 138
Disability and Health (ICF), 283 with focal lesions, 311–313 evolution of, 57
International Organization for with hearing loss, 422 functions of, 13–14, 41
Standardization (ISO), on noise-induced otitis media and, 358–360 of infants and young children, 177
hearing loss, 508 plateaus in, 421 infectious diseases and inflammatory
International Phonetic Alphabet (IPA), 150, Language deviance, vs. language delay, 327 conditions of, 32–34
151 Language disorders lowering of
Intervertebral canal, evolution of expansion in adolescence, 327 evolution of, 58
of, 58 in aphasia, phonological analysis of, 363– in singers, 53
Intramuscular electromyography, of 365, 366–368 reinnervation of, 41–43
dysphagia, 134 bilingualism and, 279–281 sexual dimorphism of, 36
Intrinsic laryngeal muscles, 16, 17, 18, 19 in children vocal folds of, 17–19
Inuktitut, specific language impairment in, African-American, 318–320 Lateral cricoarytenoid muscle, 16, 18, 19
332 assessment of, 324–325, 327–328 Laterality, index of, 459
Inverse filtering, 8 computer-based approaches to, 164–165 Lateral superior olivary nucleus, 524
Inverse square law, 443 cross-linguistic studies of, 331–332 Latino children
Iowa Medical Consonant Test, 453 Latino, 321–322 language disorders in, 321–322
IP (Inflectional Phrase), of deaf child, morphosyntax and syntax with, 354–356 speech issues in, 210–212
337 overview of, 326–328 LBD (Lewy body disease), 292
Subject Index 611
LDL (loudness discomfort level), in Loudness level, 413 Medical Outcomes Study (MOS), 21
prescriptive fitting of hearing aid, 482 Loudness normalization, in prescriptive MEE (middle ear e¤usion)
Learning disorders, in school-age children, fitting of hearing aid, 483 and speech development, 135–136
327 Loudness recruitment, 413, 415–416 tympanography for, 505, 506, 507
‘‘Least pronunciation e¤ort’’ approach, to Loudness units (LU), 413 MEG (magnetoencephalography), 305
agrammatism, 265–266 Loudness view, 419 Melodic intonation therapy (MIT), 105, 347–
Least restrictive environment, 308 LSA (Linguistic Society of America), on 348
Lee-Silverman Voice Treatment (LSVT), 89, African-American English, 318 Mel scale, 526
93–94, 130 LSVT (Lee-Silverman Voice Treatment), 89, Memory, working
Left ear advantage (LEA), 458–460 93–94, 130 in school-age children, 328
Left hemisphere, in language development, LU (loudness units), 413 and semantic development, 395–396
311, 312–313 Lungs, in vocal production, 56–57 in specific language impairment, 349–351
Left hemisphere damage (LHD), prosodic Lung volume, for speech production, 83–84 Memory deficits
deficits due to, 381–382 Lupus erythematosus, systemic, laryngeal in Alzheimer’s disease, 291
Left superior temporal gyrus, during speech involvement in, 34 in Down syndrome, 289
production, 275–276 Lx, 24 and semantic development, 395–396
Leprosy, laryngeal infection with, 33 in specific language impairment, 349–351
Lessac-Madsen Resonant Voice Therapy M Ménière’s disease (MD),
(LMRVT), 89–90 MacArthur Communicative Development electrocochleography of, 463–465
Lewy bodies, in parkinsonism, 30 Inventory, 370 Mental retardation
Lewy body disease (LBD), 292 Macro-speech act, 301 defined, 352
Lexical agraphia, 234 Macrostructures, 300 and language, 352–353
Lexical decision, 237 Magnetic resonance imaging (MRI) and speech
Lexical metaphor, right hemisphere in, 387 functional, 306–307 augmentative and alternative
Lexical Neighborhood test, 456 after early brain injury, 313 communication approaches to, 113–114
Lexical perseverations, 362 of subcortical involvement, 317 in children, 140–141
Lexical representations, 189, 190 in speech assessment, 171 Merger, in syntax framework, 269
Lexical selection model, 316 Magnetoencephalography (MEG), 305 Metaphon, 154, 205
Lexical semantic processing, with right Mainstreaming, 308 Metaphonology, 153
hemisphere lesions, 387, 389–390 Malingering, 475, 476, 477 Metaphor, right hemisphere in, 387, 389–390
Lexical-semantic spelling route, 233 Malnutrition, due to dysphagia, 132–133 Meter, 345–346
Lexicon, 189 Mandible, defects of, 46–47 Methylphenidate, for aphasia, 258
in children with language disorders, 354 Manipulation, in phonological awareness, Metronomic pacing, 105
LHD (left hemisphere damage), prosodic 155 MFDR (maximum flow declination rate), 23,
deficits due to, 381–382 Manually coded English (MCE), 342 69, 76
Lidocaine, for tinnitus, 557 Mapping deficits, 383–385 in children, 69, 70
Lifestyle factors, and voice disorders in in aphasia, 271 Michel deafness, 479
elderly, 73 Mapping hypothesis, of agrammatism, 231 Microphone, in cochlear implant, 447–448
Liljencrants-Fant (LF) model, of glottal flow, Mapping therapy, 385 Microstructures, 300
5 Markedness, 174, 199 ‘‘Microworlds,’’ for aphasia, 255
Limbic system, in vocalization, 59–61 Masked threshold, 416 Middle ear, physiology of, 522–523
Linear phonology, 175 Maskers, for tinnitus, 557 Middle ear assessment, in child, 504–507
Linguistic competence, deficits of, 273 Masking Middle ear e¤usion (MEE)
Linguistic conditions, 295 amplitude compression and, 416–417 and speech development, 135–136
Linguistic constraints, 295 central, 502 tympanography for, 505, 506, 507
Linguistic prosody, 108, 381–382 critical band, 416, 539 Midmasking level, 503
Linguistic Society of America (LSA), on downward spread of, 417 Milieu interventions, 379
African-American English, 318 during hearing test, 500–503, 534–535 Minimal Competency Core (MCC) analysis,
Lip, defects of, 46 upward spread of, 416–417 298, 319
Lipreading, 543–546 Masking-level di¤erence (MLD), hearing loss Minimal-contact phase, 24, 25
Liquid consonants, 143 and, 489–492 Minimal contrast drills, for articular
Listening, dichotic, 458–460 Massage, circumlaryngeal, 49, 93 problems, 130
Listening/reading span task, in specific Maxilla, defects of, 46, 47 Minimalist syntax, in children with language
language impairment, 350 Maximally intrusive interventions, for disorders, 354–355
Literalness, with right hemisphere lesions, preschoolers, 378–379 Minimally intrusive interventions, for
387 Maximal pair therapy, 199 preschoolers, 379
LMRVT (Lessac-Madsen Resonant Voice Maximum flow declination rate (MFDR), 23, Minimal pairs contrasts
Therapy), 89–90 69, 76 for apraxia of speech, 104
Localization, evaluation of, 453 in children, 69, 70 assessment of, 174
Logorrhoea Maximum masking level, 503 for speech disorders in children, 198–199,
in primary progressive aphasia, 247 MCC (Minimal Competency Core) analysis, 205
in Wernicke’s aphasia, 252 298, 319 Minimum flow, 69
Lombard test, 476 McDonald Deep Test of Articulation, 216 Minimum masking level, 502–503
Loop diuretics, ototoxicity of, 519 MCE (manually coded English), 342 Miss rate (MS), 444, 445, 446
Loudness MCL (most comfortable level), in MIT (melodic intonation therapy), 105, 347–
amplitude compression and, 413–416 prescriptive fitting of hearing aid, 482 348
in auditory development, 425 MD (Ménière’s disease), Mixed hearing loss, audiogram of, 536, 537
after conservation laryngectomy, 81 electrocochleography of, 463–465 MLD (masking-level di¤erence), hearing loss
factors leading to, 77 Mean length of utterance (MLU), 298 and, 489–492
Loudness additivity, 413–415 and stuttering, 334 MLU (mean length of utterance), 298
Loudness discomfort level (LDL), in Mechanical ventilation and stuttering, 334
prescriptive fitting of hearing aid, 482 speech development with, 176–179 Möbius syndrome, hearing loss in, 479
Loudness growth rate, 415–416 speed production with, 226–228 Modeling, 192–193
612 Subject Index
Modiolus, 524 NAL-R procedure, for prescriptive fitting of Number of di¤erent words (NDW), and
Modulation transfer function (MTF), 539 hearing aid, 483 stuttering, 334
Molecular genetic analysis, of speech NAL-SSPL procedure, for prescriptive fitting Nystagmus
disorders, 185 of hearing aid, 484–485 evaluation of, 467–471
Mondini deafness, 479 Naming, in Alzheimer’s disease, 240 gaze-evoked, 467
Monkeys, vocal production system in, 57, 58 Narrative discourse positional, 469
Morphemes, in children with language analysis of, 300–301 slow-component velocity of, 470
disorders, 354 of children with focal lesions, 312 spontaneous, 469
Morphology disorders, in school-age impairments of, 302–304
children, 324–325 levels of processing of, 302–304 O
Morphosyntax, in children with language with right hemisphere lesions, 387, 390–391 OAEs. See Otoacoustic emissions (OAEs)
disorders, 354–356 Narrative structure, 300–301 Oblique interarytenoid muscle, 16, 18
Morphosyntax intervention, 206 Narrow-band spectrogram, 171 Occupational Safety and Health
MOS (Medical Outcomes Study), 21 Nasometry, 170, 170 Administration (OSHA)
Most comfortable level (MCL), in National Hearing Conservation Association on hearing protection devices, 498, 499
prescriptive fitting of hearing aid, 482 (NHCA), 499 on noise-induced hearing loss, 508
Motor aphasia, transcortical, 249–250, 263 National Institute of Occupational Safety and Octave, 526
Motor learning, in apraxia of speech, 105– Health (NIOSH), 499 Ocular motor tests, 467
106 NDW (number of di¤erent words), and OHC (outer hair cell) compression, 413, 414,
Motor limitations, in Down syndrome, 289 stuttering, 334 418
Motor planning, in apraxia of speech, 102 Negative pressure ventilation OHC (outer hair cell) function, in auditory
Motor programming, in apraxia of speech, in infants and young children, 177 neuropathy, 433
102 speech production with, 226 OHCs (outer hair cells)
Motor speech disorder(s) Neglect dyslexia, 237 in noise-induced hearing loss, 509
of known origin, in children, 200–203 Neologisms, in aphasia, 363–364 physiology of, 523, 524
pure, 263 Neologistic output, 247 OME. See Otitis media with e¤usion (OME)
Motor speech involvement, in children, 142– Neomycin, ototoxicity of, 518, 519 Omohyoid muscle, 16, 17
144 Neonates. See Newborns Ontogenetic specialization, 311
Mouthing words, after total laryngectomy, Neural mechanisms, of vocalization, 59–61 Optimality theory, 175
138 Neurofibromatosis type 2 (NF2), auditory Optokinetic (OPK) system, 467
MRI. See Magnetic resonance imaging brainstem implant with, 427–428, 429 Organ of Corti, in noise-induced hearing loss,
(MRI) Neurogenic mutism, 145–146 509, 510
MS (miss rate), 444, 445, 446 Neurolinguistic analysis, of language Orofacial myofunctional disorders, in
MTF (modulation transfer function), 539 disorders in aphasia, 363–364 children, 147–149
Mucopolysaccharidosis, hearing loss in, 479 Neurological disorders Orthographic input lexicon
Multiband compression, 418–419 aging-related, voice therapy for, 91–94 in agraphia, 234
Multidirectional Reach test, 564 masking-level di¤erence with, 489–490 in alexia, 236, 237
Multiple activation, 393 Neutralization, 174 Oscillopsia, 563
Multisyllabic Neighborhood test, 456 Newborns OSHA (Occupational Safety and Health
Muscular tension dysphonia (MTD) hearing screening for, 421–422, 561 Administration)
botulinum toxin for, 38 high-frequency tympanometry in, 507 on hearing protection devices, 498, 499
voice handicap assessment in, 22 NF2 (neurofibromatosis type 2), auditory on noise-induced hearing loss, 508
Music, pitch in, 52–53, 525–526 brainstem implant with, 427–428, 429 Ossicular discontinuity, tympanometry of,
Music perception, in auditory development, NHCA (National Hearing Conservation 562
425 Association), 499 Osteogenesis imperfecta, hearing loss in, 479
Mutism NHR (noise-to-harmonics ratio), 3 Osteoradionecrosis, 46
akinetic, 145 Nicaraguan Sign Language, 342 Otitis media, early recurrent
anarthric, 145–146 NIOSH (National Institute of Occupational and language development, 358–360
in aphasia, 145 Safety and Health), 499 and speech development, 135–136
in apraxia, 145 Nitchie’s method, 543 Otitis media with e¤usion (OME)
in autism, 116 No ear advantage (NoEA), 458–460 hearing loss due to, 495
cerebellar, 146 n of m strategy, 449 and language development, 358–360
in dementia, 146 Noise-induced hearing loss, 496, 508–510 and speech development, 135–136
in dysarthria, 145–146 Noise notch, 519 tympanometry of, 560–561
hyperkinetic, 145 Noise Reduction Rating (NRR), 498–499 Otoacoustic emissions (OAEs)
neurogenic, 145–146 Noise-to-harmonics ratio (NHR), 3 in children, 515–516
after surgery, 146 Noncontrastive patterns, 295, 297–298 clinical applications of, 511–513
in traumatic brain injury, 146 Nonlinear phonology, 175, 199, 214 in pseudohypacusis, 532–533
Mutual intelligibility, of dialect, 124–125 Nonliteral language, with right hemisphere Otopalatal-digital syndrome, hearing loss in,
Myasthenia gravis, dysarthria in, 130 lesions, 387 479
Mycobacterial infections, of larynx, 33 Nonorganic hearing loss, 531–533 Otosclerosis, tympanometry of, 561, 562
Mycobacterium leprae, laryngeal infection in children, 475–477 Otoscopy, 504
with, 33 Nonsteroidal anti-inflammatory drugs Ototoxic medications, 493–494, 518–520
Mylohyoid muscle, 16, 17 (NSAIDs), ototoxicity of, 519 Outer ear, physiology of, 522–523
Myoelastic-aerodynamic theory, of Nonsymbolic communication, 277 Outer hair cell (OHC) compression, 413, 414,
phonation, 57 Nonverbal processing deficits, with right 418
Myofunctional disorders, orofacial, in hemisphere lesions, 389 Outer hair cell (OHC) function, in auditory
children, 147–149 Nonword repetition, in specific language neuropathy, 433
impairment, 349–350 Outer hair cells (OHCs)
N NRR (Noise Reduction Rating), 498–499 in noise-induced hearing loss, 509
NA (nucleus ambiguus), in vocalization, 61 Nucleus ambiguus (NA), in vocalization, 61 physiology of, 523, 524
NAL-NL1 procedure, for prescriptive fitting Nucleus retroambiguus (NRA), in Output limiting, and sound quality of hearing
of hearing aid, 483 vocalization, 61 aid, 487–488
Subject Index 613
Overmasking, 501 Peripheral laryngeal nerve damage, in elderly, developmental, 156

Oxidative stress, in noise-induced hearing 72 genetic transmission of, 183–185
loss, 509 Peripheral neuropathy, with auditory in school-age children, 324
neuropathy, 434 Phonological dyslexia, 238
P Peripheral paralysis, in elderly, 72 Phonological errors
PACE treatment, 284 Peripheral structural changes in aphasia, 366–368
PAG (periaqueductal gray), in vocalization, due to surgical ablation, 45–48 children with, speech sampling, articulation
60–61 due to trauma, 45 tests, and intelligibility in, 213–214
Paget disease, hearing loss in, 479 Perisylvian cortex, in vocalization, 60, 61 residual, 156–157
Palatal lift, for resonance problems, 130 Perseveration, 361–362 Phonological manipulation, 155
Paradoxical breathing dysphonia, botulinum Pervasive developmental disorders (PDDs), Phonological process analysis, 214
toxin for, 38 116 Phonological processing, 153
Paragrammatism augmentative and alternative and reading disability, 329
in Alzheimer’s disease, 240 communication approaches with, 114 Phonological process therapy, 199
in Wernicke’s aphasia, 252 semantic development with, 395–397 Phonological sensitivity, 153
Paralexias, semantic, 237 PET (positron emission tomography), 305– Phonologic issues, with speakers of African-
Paraphasias, phonemic, 275, 363–365 306 American Vernacular English, 158–160
Parasaurolophus, vocal production system in, of stuttering, 222 Phonologic lexicon
58 of subcortical involvement, 317 in agraphia, 234
Parietal lobe, inferior, in aphasia, 263–264 PET (pressure equalization tube), in alexia, 236, 237–238
Parkinson’s disease (PD) tympanometry of, 561 Phonology
agrammatism in, 231 Pharmacological approaches, to aphasia, in bilingualism, 279
dementia in, 293 257–259 linear and nonlinear, 175, 199
dysarthria in, 30–31, 126 Pharyngoesophageal (PEO) segment, 10–11 Phonology testing, of children
hypokinetic laryngeal movement disorders Pharynx, of infants and young children, 177 with phonological errors, 214
in, 30–31 Phenotype, 478 with residual errors, 217
Parsing, vs. mapping, 384–385 Phenotype definitions, for speech disorders, Photo Articulation Test–Third Edition, 216
Part-word repetitions, in children, 180 183–184 Phrasal metaphor, right hemisphere in, 387
Parvocellular reticular formation, in Phon(s), 413 Phrase repetitions, in children, 180
vocalization, 61 Phonation. See also Voice Phrenic nerve, laryngeal reinnervation with,
PCC (Percentage of Consonants Correct), evolution of, 56–58 43
214 flow, 52 Phrenic nerve pacers, speech production with,
PCC-R (Percentage of Consonants Correct– hyperfunctional (pressed), 52 226
Revised) measure, 136 hypofunctional, 52 Pierre Robin syndrome, hearing loss in, 479
PD. See Parkinson’s disease (PD) physics and physiology of, 75–77 Pinna, physiology of, 522
PD (prevalence), 446, 447 Phonation threshold pressure (PTP), Piracetam, for aphasia, 258
PDC (probability distribution curve), 445– hydration and, 55 Pitch
446 Phonatory adductory range, 75 in aprosodia, 109
PDDs (pervasive developmental disorders), Phonatory threshold pressure, 75 in auditory development, 425
116 Phoneme-grapheme conversion, 233, 234 complex, 526–527
augmentative and alternative Phonemic awareness, 153 factors leading to, 77
communication approaches with, 114 Phonemic carryover perseveration, 362 of missing fundamental, 438
semantic development with, 395–397 Phonemic contrast, with hearing loss, 208 musical, 52–53, 525–526
Peak admittance, 505 Phonemic development, with mental perception of, 525–527
Peak clipping, and sound quality of hearing retardation, 141 Pitch chroma, 526
aid, 488 Phonemic inventory, 174, 213–214 Pitch control, 75–76
Peak compensated acoustic admittance, 505 Phonemic paraphasias, 275, 363–365 Pitch height, 526
Peak compensated static acoustic admittance, Phonetic derivation, 104 Pitch range, 53–54
559–560 Phonetic inventory, 174 Pitch scales, 526
Pediatric patients. See Children of Spanish, 210 PIW (Percentage of Intelligible Words), 214,
Peer relations, with language impairment, Phonetic movement, organization and 217
399 sequencing of, 143 Place coding, of frequency, 523
Pemphigoid, cicatricial, laryngeal Phonetic placement, integral stimulation and, Plasticity, constrained, 311
involvement in, 34 104 Plateau, 502
Pendred syndrome, hearing loss in, 479 Phonetic transcription, of speech in children, Playback, slow, 472
Pennsylvania, western, dialect of, 126 150–152 Plethysmography, in speech assessment, 170
PEO (pharyngoesophageal) segment, 10–11 Phonological acquisition, delayed, 219 PMT (prelinguistic milieu teaching), 376–377
Percentage of Consonants Correct (PCC), Phonological agraphia, 234–235 POGO II procedure, for prescriptive fitting of
214 Phonological alexia, 238 hearing aid, 485
Percentage of Consonants Correct–Revised Phonological analysis, of language disorders Point vowels, 58
(PCC-R) measure, 136 in aphasia, 363–365 Polychondritis, relapsing, laryngeal
Percentage of Intelligible Words (PIW), 214, Phonological awareness, in children, 190, 324 involvement in, 34
217 Phonological awareness intervention, for Polypoid degeneration, and voice disorders in
Perceptual evaluation, of voice quality, 78– children with expressive phonological elderly, 72–73
79 impairments, 153–155 Positional nystagmus, 469
Performers, voice therapy for, 96–97 Phonological breakdown, in aphasia, 363, Positive pressure ventilation
Performing environments, toxic substances 364 in infants and young children, 177
in, 96 Phonological development, of Latino speech production with, 226, 227
Periaqueductal gray (PAG), in vocalization, children, 210–212 Positive reinforcement, 192
60–61 Phonological disorders Positron emission tomography (PET), 305–
Period, cycle-to-cycle perturbations of, 3–5 vs. articulation disorders, 196 306
Periodicity, as reference, 3–5 in bilingual Latino children, 211–212 of stuttering, 222
Peripheral agraphia syndromes, 234, 235 description and classification of, 219 of subcortical involvement, 317
614 Subject Index
Postcentral gyrus, in apraxia of speech, 101, Psychogenic hearing loss, 531–533 Rehabilitation
102 in children, 475–477 computer-aided, for aphasia, 254–256
Posterior cricoarytenoid muscle, 16, 18, 19 Psychogenic speech disorders (PSDs), 186– after laryngectomy, 10–12
Posterior digastric muscle, 16, 17 188 partial or conservation, 80–81
Posterior probabilities, 446–447 Psychogenic voice disorders vestibular, 519, 563–565
Post-polio syndrome, laryngeal muscle direct therapy for, 49–51 Reinforcement, positive, 192
weakness in, 33 etiology of, 27–29 Reinforcement schedules, 192
Postural control, evaluation of, 564 Psycholinguistic perspective, on speech Reinke’s edema, and voice disorders in
Posture, and speech production, 83 disorders in children, 189–191 elderly, 72–73
Poverty, e¤ect on language of, 369–371 Psychosocial issues Reinke’s space, 17
PPA (primary progressive aphasia), 245–248 with aphasia, 260–261 Reinnervation, of larynx, 41–43
Pragmatic development, in school-age with Asian-Pacific American children, 168 Relational analyses, 175, 213–214
children, 325 with communicative disorders, 161–163 Relative gradient, of tympanogram, 506
Pragmatic impairment, 372–374 PTP (phonation threshold pressure), Release time, and sound quality of hearing
Pragmatics, 372–374 hydration and, 55 aid, 488
in bilingualism, 280 ‘‘Pull-out’’ services, 308 Repetitive strain injury, 95
defined, 372 Pure motor speech disorder, 263 Requesting, 375
Precentral gyrus, of insula, in apraxia of Pure-tone average (PTA), 536 Residual errors, 156–157
speech, 102 Pure-tone hearing loss, in auditory children with, speech sampling, articulation
Pregnancy, substance abusing during, 194– neuropathy, 433, 434 tests, and intelligibility in, 215–217
195 Pure-tone threshold assessment, 534–537 description and classification of, 218
Prelinguistic communication intervention, for for pseudohypacusis, 532 Resonance, during singing, 51, 53–54
developmental disabilities, 375–377 Pure word deafness, 252, 263 Resonance problems, behavioral treatment
Prelinguistic milieu teaching (PMT), 376–377 Purring, 57 of, 130
Presbyacusis, 527–530, 528, 529 Push-pull technique, 89 Resonant voice, 52, 89, 93
masking-level di¤erence with, 490 Putamen, in language disorders, 315 Respiratory support, in dysarthria, 129–130
Preschool Intelligibility Measure, 214 Respirodeglutometry (RDG), of dysphagia,
Preschool language intervention, 378–380 Q 134
Preschool period, language assessment with Quality index, 538 Respitrace, 170
hearing loss in, 423 Quality theory, 539 Response/release/semantic feedback model,
Pressed phonation, 52 Question formation, in agrammatism, 406 315–316
Pressure equalization tube (PET), Quiet, breathy voice approach, 88, 89 Responsive small group (RSG), 376–377
tympanometry of, 561 Quinine derivatives, ototoxicity of, 519 Reticular formation (RF), in vocalization, 61
Prevalence (PD), 446, 447 Retrocochlear hearing loss, auditory
Primary gain, 28, 187 R brainstem response in, 432
Primary progressive aphasia (PPA), 245–248 Radiation therapy Reverberation, in classroom, 442
Priming, in Alzheimer’s disease, 240 for head and neck cancer, 46 Reverberation time (RT), in classroom, 442
Print-to-sound conversion, 236 with total laryngectomy, 137–138 Reversibility disorders, 383–385
Probabilities, posterior, 446–447 Rate control, for dysarthria, 130–131 Revisions, in children, 180
Probability distribution curve (PDC), 445– rCBF (regional cerebral blood flow), 305–306 RF (reticular formation), in vocalization, 61
446 RDG (respirodeglutometry), of dysphagia, RHD (right hemisphere damage)
Proband, for speech disorders, 184 134 aprosodia due to, 108–109, 381–382
Proboscis monkeys, vocal production system REA (right ear advantage), 458–460 emotional and nonverbal processing deficits
in, 57 Reaction time, of children with focal lesions, with, 389
Professional voice users, voice therapy for, 312 Rheumatoid arthritis, laryngeal involvement
95–97 Reactive oxygen species, in noise-induced in, 34
Program-of-action perseverations, 362 hearing loss, 509 Rhythmicity, motor speech involvement in,
Progressive approximation, 104 Reading 143
Prolongations, sound, 180 acquired impairment of, 236–239 Rhythmic structure, 394
Prolonged speech, 105 semantic development and, 396–397 Rib cage, in speech production, 83
Prompting, 193 Reading disability, language impairment and, Right ear advantage (REA), 458–460
Proportion of Blocked Words, 541 329–330 Right hemisphere
Prosodic contour, 386 Real-ear attenuation-at-threshold (REAT), in discourse and conversation, 387, 390–391
Prosodic disorders, 381–382 498, 499 in language and communication, 386–387
in aphasia, 367, 368 Real ear probe microphone, 480 in language development, 311, 312–313
Prosodic movement, organization and Recall test, for dichotic listening, 458 in lexical and phrasal metaphor, 387
sequencing of, 143 Recasting, 379 in lexical semantic processing, 387, 389–390
Prosody, 344–346 Receiver, in cochlear implant, 448 in prosody, 108–109, 381–382, 386–387,
defined, 107, 344, 386 Receiver operating characteristic (ROC) 389
functions of, 381, 386 curve, 446 Right hemisphere damage (RHD)
and melodic intonation therapy, 347–348 Receptive aprosodia, 108 aprosodia due to, 108–109, 381–382
meter in, 345–346 Recruitment, 413, 415–416 emotional and nonverbal processing deficits
right hemisphere in, 108–109, 381–382, Recurrent laryngeal nerve (RLN), 17 with, 389
386–387, 389 reinnervation of, 41–43 Right hemisphere language disorders, 388–
syllable shape in, 344–346 Recurrent perseveration, 362 391
Prosthetics, for surgical defects, 47 Recurrent utterances, in global aphasia, 243 Rima glottidis, 15
Protoimperatives, 375 Red deer, vocal production system in, 58 RIOT protocol, for Asian-Pacific American
PSDs (psychogenic speech disorders), 186– Reduced syntax therapy, 232 children, 167–168
188 Referencing, 375 Risk, for developmental delay, 286
Pseudohypacusis, 531–533 Reflectometry, 507 RLN (recurrent laryngeal nerve), 17
in children, 475–477 Regional cerebral blood flow (rCBF), 305– reinnervation of, 41–43
Psychiatric disorders, with communicative 306 ROC (receiver operating characteristic)
disorders, 161–163 Regional dialect, 124–126 curve, 446
Subject Index 615
Rocking beds, speech production with, 226 Semantic perseverations, 362 SOAEs (spontaneous otoacoustic emissions),
Rothenberg’s procedure, 68 Semantic pragmatic deficit syndrome, 373 511, 512
RSG (responsive small group), 376–377 Semantic processing, 395 Social development, and language
RT (reverberation time), in classroom, 442 Semantics, defined, 395 impairment, 398–400
Rule-governed alternations, 174–175 Semitone, 526 Social reintegration, with aphasia, 261
Sensitivity, 444, 445 Sociobehavioral consequences, of
S Sensorineural hearing loss (SNHL) communicative disorders, 162–163
Saccadic system, 467, 468 audiogram of, 536, 537 Socioeconomic status (SES), e¤ect on
SAE (Standard American English), African- signal-to-noise ratio with, 442 language of, 369–371
American English vs., 318–320 Sensory aphasia, transcortical, 250, 263 Soft palate, defects of, 46
SAM (sinusoidal amplitude modulation) Sensory cortex, in aphasia, 275 Somatosounds, 556
detection, 553 Sensory di¤erentiation exercises, 90 Sones, 413
Sarcoidosis, laryngeal involvement in, 34 Sensory implant, 208–209 Sonority, principle of, 364
Saturation sound pressure level (SSPL), 483– Sensory palsy, laryngeal reinnervation for, Sound(s), distribution of, 174
484 43 Sound articulation, 539
‘‘Say-It-And-Move-It’’ task, 154 Sentence Intelligibility Test, 126 Sound localization, in auditory development,
Sca¤olded narrative approach, 205 Sentence-picture matching test, with right 425–426
Scalae, physiology of, 523 hemisphere lesions, 387 Sound pressure, 63, 64
Scales, musical, 526 Service delivery models, 308 Sound pressure level (SPL)
Scanning, in augmentative and alternative SES (socioeconomic status), e¤ect on and alternating glottal airflow, 69–70
communication, 277 language of, 369–371 of children, 68
S-CAT (Second Contextual Articulation Sex di¤erences, in fundamental frequency, defined, 69
Tests), 217 36 and laryngeal airway resistance, 70–71
Scenarios, 284 Sex reassignment, and speech di¤erences, Sound prolongations, 180
Scheibe deafness, 479 223–225 Sound spectrography, in speech assessment,
Schema, 303 SF-36, 21 171–173
School-age children SFOAEs (stimulus frequency otoacoustic SP (summating potential), 461, 463, 464
hearing loss in emissions), 511, 512 Spanish-speaking children, speech issues in,
language assessment with, 423 in children, 515 210–212
screening for, 495–497 Shadowing, 502 Spasmodic dysphonia
language disorders in SHAPE (Smit-Hand Articulation and acoustic assessment of, 4–5
assessment of, 324–325 Phonology Evaluation), 214, 216 botulinum toxin for, 38–39
overview of, 326–328 Shaping, 193 laryngeal reinnervation for, 43
risk factors for, 327 Shimmer, 3–5 Spastic dysarthria, 127, 130
School talk, 113 Signal-to-noise ratio (S/N), in classroom, Spatial hearing
Schwannoma, vestibular 442 in auditory development, 425–426
auditory brainstem implant with, 427–428, Signed English, 342 evaluation of, 453
429 Signed Exact English, 342 Spatial location, of sound source, 524
auditory brainstem response with, 432 Sign language, 339–343, 423 Spatially separated sources, sounds from,
Scleroma, larynx in, 33 Silent speech, after total laryngectomy, 138 438–439
Screening, for hearing loss Simulations, for aphasia, 255 Speaker and listener skills, socioeconomic
in newborns, 421–422, 561 Simultagnosia, 237 status and, 370
in school-age child, 495–497, 561 Singers Speaker-listener distance, in classroom, 443
Script(s), 284, 303 elderly, 92 Speaker localization test, 453
Script knowledge, 300 voice handicap assessment in, 22 Speaking valve, pediatric, 178
SCV (slow-component velocity), of voice therapy for, 96–97 Special education, 307, 308
nystagmus, 470 Singer’s formant, 53, 54 Specialization
Secondary gain, 28, 187 Singing, voice in, 51–54 early, 311
Second Contextual Articulation Tests Sinusoidal amplitude modulation (SAM) ontogenetic, 311
(S-CAT), 217 detection, 553 Specificity, 445
Seeing Essential English, 342 Situational level, of discourse, 303–304 Specific language impairment (SLI), 402–
Segmental inventory, 213–214 Situation model, 300 403
Segmentation, of spoken language, 392–394 SLI. See Specific language impairment (SLI) in African-American children, 318
SELD (slow expressive language Slips-of-the-tongue, 364 clinical markers for, 403
development), 286 SLN (superior laryngeal nerve), 17 criteria for, 402
Self directed inference, with right hemisphere reinnervation of, 41–43 cross-linguistic studies of, 331–332
lesions, 389 Slow-component velocity (SCV), of defined, 402
Self-esteem, with aphasia, 261 nystagmus, 470 early intervention services for, 286
Self-help groups, for aphasia, 261 Slow expressive language development genetic basis for, 402
Self-management, with language impairment, (SELD), 286 language di‰culties with, 402–403
399 Slow playback, 472 memory deficits in, 349–351
Semantic development, 395 SMA (supplemental motor area), in motor speech involvement in, 142–144
with acquired language disorders, 396 vocalization, 59, 60, 61 neuroanatomy of, 402
with developmental language disorders, Smit-Hand Articulation and Phonology pragmatic impairment in, 373
395–397 Evaluation (SHAPE), 214, 216 prevalence of, 402
with Down syndrome, 396 Smitheran and Hixon technique, 68 in school-age children, 326–327
memory deficits and, 395–396 Smoking, and voice disorders in elderly, 73 evaluation of, 324–325
and reading, 396–397 Smooth pursuit system, 467, 468 semantic development with, 395–397
in school-age children, 325 S/N (signal-to-noise ratio), in classroom, social development with, 399–400
Semantic impairment, in Alzheimer’s disease, 442 socioeconomic status and, 370–371
240 SNHL (sensorineural hearing loss) Spectral component ratios, 6
Semantic level, of discourse, 303 audiogram of, 536, 537 Spectral contrast, enhancement of, 524
Semantic paralexias, 237 signal-to-noise ratio with, 442 Spectrographic analysis, 171–173
616 Subject Index
Speech, 99–228 Speech rehabilitation, after laryngectomy, aphasia due to (see Aphasia)
apraxia of (see Apraxia of speech [AOS]) 10–12 augmentative and alternative
artificial laryngeal, 12 partial or conservation, 80–81 communication approaches in, 111–112
auditory-motor interaction in, 275–276 Speech sampling, for children dementia due to, 292–293
bilingual, 119–121 with phonological errors, 213–214 depression after, 258, 260–261
developmental apraxia of with residual errors, 216 dysarthria due to, 126
augmentative and alternative Speech-song, 348 dysphagia due to, 132–133
communication approaches with, 114 Speech sound disorders employment after, 261
diagnostic criteria for, 121–123 description and classification of, 218–219 global aphasia due to, 243–244
vs. motor speech involvement of unknown genetic transmission of, 183–185 Structural level, of discourse, 303, 304
origin, 142 Speech synthesizer, in evaluation of voice Stuck-in-set perseveration, 361–362
esophageal, 10, 11, 138 quality, 79 Stuttering
instrumental assessment of, 169–173 Speech Tracking, 541–542 in adults, 221–222
mental retardation and, 140–141 Speech transmission index, 539 in children, 180–182
phonetic transcription of, 150–152 Speed quotient, 68–69 language and, 333–335
prolonged, 105 Spelling, in agraphia, 233–235 epidemiology of, 220–221
silent, after total laryngectomy, 138 SPL. See Sound pressure level (SPL) Stylohyoid muscle, 16, 17
spontaneous, in Alzheimer’s disease, 240 Spoken language, segmentation of, 392–394 Subcortical involvement, in language
tracheoesophageal, 10, 11–12 Spondee thresholds, in pseudohypacusis, 532 disorders, 314–317
Speech act, 301 Spontaneous nystagmus, 469 Subglottal pressure, 64, 68, 76
Speech assessment Spontaneous otoacoustic emissions (SOAEs), in children, 69
descriptive linguistic methods for, 174–175 511, 512 during singing, 51–52
with peripheral structure defects, 47–48 Spontaneous speech, in Alzheimer’s disease, Substance abuse, during pregnancy, 194–
Speech delay, 218 240 195
Speech development Sprechgusang, 348 Substantia nigra, in language disorders, 315
early recurrent otitis media and, 135–136 Squamous cell carcinoma, of peripheral Subthalamic nucleus, in language disorders,
with hearing loss, 422 structures of speech system, 45–46 315
of Latino children, 210–212 SSPL (saturation sound pressure level), 483– Successive approximations, 193
with tracheostomy, 176–179 484 Suggestion audiometry, 476
Speech disfluency, in children, 180–182 Standard American English (SAE), African- Summating potential (SP), 461, 463, 464
Speech disorders American English vs., 318–320 Superior laryngeal nerve (SLN), 17
in children Stapedius muscle, physiology of, 523 reinnervation of, 41–43
behavioral approaches to, 192–193 Staphylococcus aureus, laryngitis due to, 33 Superior temporal gyrus, during speech
birth-related risk factors for, 194–195 Static admittance, 505 production, 275–276
computer-based approaches to, 164–165 Stem cell transplantation, for aphasia, 258 Superstructure, 300
cross-linguistic data on, 196–197 Stenger test, 476, 532 Supine position, breathing exercises in, 83
description and classification of, 218–219 Stereocilia, physiology of, 523 Supplemental motor area (SMA), in
descriptive linguistic approaches to, 198– Stereotypic patterns of behavior, in autism, vocalization, 59, 60, 61
199 117 Supported conversation, 284
motor, of known origin, 200–203 Sternocleidomastoid muscle, 17 Suppression threshold, 416
psycholinguistic perspective on, 189–191 Sternohyoid muscle, 16, 17 Supra-aural earphones, 500–501
speech-language approaches to, 204–206 Sternothyroid muscle, 16, 17 Suprahyoid muscles, 15–17
genetic transmission of, 183–185 Stevens’ law, 413 Suprathreshold speech recognition, 548–549
due to hearing impairment, 207–209 Stimulability, determination of, 214, 217 Surface agraphia, 234
psychogenic, 186–188 Stimulation activities, for aphasia, 254 Surface alexia, 238
sociobehavioral consequences of, 162–163 Stimulus, in electrocochleography, 462–463 Surface dyslexia, 238
Speech Disorders Classification System, 135, Stimulus frequency otoacoustic emissions Surface level, of discourse, 303
218 (SFOAEs), 511, 512 Surface prompts, 105
Speech Enhancer, 165 in children, 515 Surgery, mutism after, 146
Speech features, evaluation of, 452, 453 Stimulus onset, 438 Surgical ablation, peripheral structural
Speech imaging, 171 Stop consonants, 143, 198 changes due to, 45–48
Speech intelligibility index, 538, 539 Story schema, 300 Swallowing disorder, 132–134
Speech issues, in Latino children, 210–212 Story-Telling Probes of Articulation Swedish, speech disorders in English vs., 197
Speech-language approaches, to speech Competence, 217 Syllable shape, 344–345, 346
disorders in children, 204–206 Strategies, in augmentative and alternative Syllable shape processes, 175
Speech misarticulations, in orofacial communication, 277 Symbol(s), in augmentative and alternative
myofunctional disorders, 148 Strength training, for articular problems, communication, 277
Speech perception 130 Symbolic communication, 277
in aphasia, 367–368 Streptococcus pneumoniae, laryngitis due to, Symptom incongruity, 28
in auditory development, 425 33 Symptom psychogenicity, 28
in auditory neuropathy, 434 Streptomycin, ototoxicity of, 518, 519 Symptom reversibility, 28
indices of, 538–540 Stress Syntactic bootstrapping, 396
Speech processor, in cochlear implant, 448 agrammatism with, 231 Syntactic tree, 405–407
Speech production and speech disorders, 186 Syntax
in agrammatism, 405–407 Stressed syllables, 394 argument structure in, 269–271
in aphasia, 366–367 Stress pattern, 345 in bilingualism, 280
Speech Production-Perception Task, 175 Stress rhythm, 394 in children
Speechreading, 543–546 Striatocapsular structures, in language deaf, 337
Speech recognition disorders, 315 with focal lesions, 312
in classroom, 442–443 Stroboscopic endoscopy, in speech with language disorders, 354–356
presbyacusis and, 528–529 assessment, 170 Syntax disorders, in school-age children,
suprathreshold, 548–549 Stroke 324–325
Speech recognition systems, 165 antidepressants for, 257 Syphilis, laryngeal involvement with, 33–34
Subject Index 617
Syrinx, 57 Tinnitus, 556–557 Tree truncation hypothesis, of agrammatism,

Systematic Analysis of Language Transcripts, vs. pseudohypacusis, 532 232
325 Tinnitus retraining therapy (TRT), 557 Tremor, vocal, 94
Systemic lupus erythematosus, laryngeal TM. See Tympanic membrane (TM) botulinum toxin for, 38–39
involvement in, 34 TMFD (temporary mutism followed by Treponema pallidum, laryngeal infection with,
dysarthria), 146 33–34
T Toddlers, communication disorders in, 285– TRT (tinnitus retraining therapy), 557
Tactaid, 541 287 True negative rate, 445
Tactile aids, 541 Tone quality, in voice exercises, 87 True positive rate, 444, 445
TA (thyroarytenoid) muscle Tongue, defects of, 46, 47 TSD (theory of signal detection), 458
anatomy of, 16, 18 Tongue thrust, in orofacial myofunctional TT ECochG (transtympanic
in vocal production, 75–76, 77 disorders, 147, 148 electrocochleography), 462, 464
Target behaviors, antecedent control of, 192 TPH (tree pruning hypothesis), 405–407 Tuberculosis, laryngeal, 33
TBI. See Traumatic brain injury (TBI) TPP (tympanogram peak pressure), 559, Tumors, of peripheral structures of speech
TDH (trace deletion hypothesis), 231, 271, 560 system, 45–48
407–409 Trace deletion hypothesis (TDH), 231, 271, Turkish, speech disorders in English vs., 197
Teachers, voice therapy for, 95 407–409 Tutorials, for aphasia, 255
Telegraphic speech, in primary progressive Trachea, of infants and young children, 177 TW (tympanogram width), 506, 559, 560
aphasia, 248 Tracheoesophageal puncture, 139 TWA (time-weighted average) sound pressure
Telephony, 539 Tracheoesophageal (TE) speech, 10, 11–12 levels, 498
Temperature-sensitive auditory neuropathy, Tracheostoma, 11 Twin studies
434 Tracheostomy of nonword repetition, 350
Templin-Darley Tests of Articulation, 216 speech development with, 176–179 of speech disorders, 185
Temporal coding, of sound frequency, 524 speech production with, 226–227 of stuttering, 221
Temporal gap detection, 553, 554 Tracheostomy tube, for infants and young Two-ear recognition task, 458, 459
Temporal integration (TI), 550–552 children, 177, 178 Two-tone suppression (2TS), 417
Temporal integration curves (TICs), 550 Tracking Tympanic membrane (TM)
Temporal modulation, 438 Speech, 541–542 electrocochleogram recorded from, 461–
Temporal processing, in auditory visual, 543–546 465
development, 424–425 Tracking Rate, 541 monomeric, tympanometry of, 562
Temporal resolution, 553–555 Trait theory, of functional dysphonia, 28– Tympanic membrane (TM) electrode, 462
Temporal transitions, 524 29 Tympanogram(s)
Temporary mutism followed by dysarthria Transactional model of development, 376 normal, 505
(TMFD), 146 Transcortical aphasia pattern classification of, 505–506, 559
Tense, in children with language disorders, motor, 249–250, 263 qualitative analysis of, 506
355–356 sensory, 250, 263 shapes of, 559
Tensor tympani, physiology of, 523 Transfer e¤ects, in bilingualism, 280 Tympanogram peak pressure (TPP), 559,
TEOAEs (transient evoked otoacoustic Transfer function, 538 560
emissions), 511, 512–513 Transgendered individuals, and speech Tympanogram width (TW), 506, 559, 560
in auditory neuropathy, 435 di¤erences, 223–225 Tympanometric screening instruments,
in children, 515, 516 Transglottal airflow, during singing, 52 calibration of, 496
Teratogens, hearing loss due to, 493–494 Transglottal pressure, 76 Tympanometry, 558–562
TE (tracheoesophageal) speech, 10, 11–12 Transient evoked otoacoustic emissions in child, 504–507
Test battery approach, in pediatric audiology, (TEOAEs), 511, 512–513 high-frequency, in infants, 507
520–522 in auditory neuropathy, 435
Test-retest approach, for African-American in children, 515, 516 U
children, 319 Translaryngeal pressure, for speech Ultrasonography
Textual cohesion, 300–301, 303 production, 83 of dysphagia, 134
Thalamus, in language disorders, 315, 316, Transmission link, in cochlear implant, 448 in speech assessment, 171, 172
317 Transmitter, in cochlear implant, 448 Undermasking, 501
Thalidomide, hearing loss due to, 493 Transplantation, laryngeal, 43 Underspecification theory, 363
Thematic roles, in syntax framework, 270 Transsexualism, and speech di¤erences, 223– Unilateral upper motor neuron dysarthria,
Theory of mind, 387, 390, 423 225 127
Theory of signal detection (TSD), 458 Transtympanic electrocochleography Upper motor neuron dysarthria, unilateral,
Theta marking, 270 (TT ECochG), 462, 464 127
Threshold shift, 502 Transverse interarytenoid muscle, 16, 18 Usher syndrome, hearing loss in, 478
Thyroarytenoid (TA) muscle Trauma
anatomy of, 16, 18 acoustic, 510 V
in vocal production, 75–76, 77 laryngeal, 34, 45 Vagus nerve, 17, 41
Thyrohyoid muscle, 16, 17 in elderly, 73 Vancomycin, ototoxicity of, 518, 519
Thyroid angle, 14 inflammatory processes due to, 34 VAPP (Voice Activity and Participation
Thyroid cartilage, 14 to peripheral structures, 45 Profile), 22
Thyroid disorders, and voice disorders in Traumatic brain injury (TBI) Variable forms, 295
elderly, 73 augmentative and alternative Vascular dementia, 292–293
Thyromuscularis fibers, 18 communication approaches in, 111 VC (vital capacity), and speech production,
Thyroplasty, for dysarthria, 130 dysarthria due to, 126 83, 84
Thyrovocalis fibers, 18 mutism in, 146 Velotrace, 170
TI (temporal integration), 550–552 Traumatic midbrain syndrome, 146 Velum, defects of, 46
TICs (temporal integration curves), 550 Traumatic ossicular discontinuity, Ventilator-supported speech production, 226–
Tidal breathing, during singing, 52 tympanometry of, 562 228
Time-compressed slow playback, 472 Treacher Collins syndrome, hearing loss in, Verb, argument structure of, 269–271
Time-weighted average (TWA) sound 479 Verbal working memory, in school-age
pressure levels, 498 Tree pruning hypothesis (TPH), 405–407 children, 328
618 Subject Index
Vernacular English, African-American. See Vocal production system WDRC (wide dynamic range compression),
African-American English (AAE) breathing in, 82–84 413
Vertigo evolution of, 56–58 Weber’s law, 416
benign paroxysmal positioning, 467 physics and physiology of, 75–77 Wegener’s granulomatosis, laryngeal
vestibular rehabilitation for, 519, 563–565 Vocal tract filter function, 76 involvement in, 34
Vestibular rehabilitation, 519, 563–565 Vocal tremor, 94 Wernicke’s aphasia, 252–253
Vestibular schwannoma botulinum toxin for, 38–39 agrammatism in, 231, 409
auditory brainstem implant with, 427–428, Vocoding, 473 argument structure in, 271
429 Voice, 1–98 connectionist models of, 262–263
auditory brainstem response with, 432 acoustic assessment of, 3–6 global vs., 243
Vestibulo-ocular reflex (VOR), 565 alaryngeal, 10–12, 138–139 language deficits in, 250
Vestibulotoxicity, 518, 519–520 in children, instrumental assessment of, 35– phonological errors in, 367
VFE (Vocal Function Exercises), 85–87, 90 37 Wernicke’s area, 525
VHI (Voice Handicap Index), 21–22 confidential, 93 in connectionist model, 263, 264
Vibrato, 52 electroglottographic assessment of, 23–26 in language development, 311
Vibratory cycle, 24, 25 esophageal, 138 in speech production, 275
Vibrotactile aid, 541 resonant, 52, 89, 93 Western Aphasia Battery (WAB), 253
Vibrotactile stimulation, 105 in singing, 51–54 Whispering, after total laryngectomy, 138
Videofluorography, in speech assessment, Voice acoustics, 63–67 White noise generators, for tinnitus, 557
171 Voice Activity and Participation Profile Whole-word carryover, 362
Videofluoroscopy, of dysphagia, 133 (VAPP), 22 Whole-word repetitions, in children, 180
Video-nystagmography (VNG), 467 Voice breaks Wide-band spectrogram, 171–173
Viral laryngotracheitis, 32–33 intermittent, botulinum toxin for, 38 Wide dynamic range compression (WDRC),
Viscosity, hydration and, 55 in voice exercises, 86–87 413
Visemes, 543 Voice conservation, for professional voice Williams syndrome, and speech, 141
Visual deficits, in Down syndrome, 289 users, 96 Wilson Voice Profile System, 78
Visual hearing, 543 Voice disorders Word deafness, pure, 252, 263
Visual reinforcement audiometry (VRA), 521, of aging, 72–74 Word use, in autism, 116
522 assessment of functional impact of, 20–23 Working memory
Visual speech perception, 543–546 in children, instrumental assessment of, 35– in school-age children, 328
Visual tracking, 543–546 37, 67–71 and semantic development, 395–396
Vital capacity (VC), and speech production, functional (psychogenic) in specific language impairment, 349–351
83, 84 direct therapy for, 49–51 Writing disorders, 233–235
VNG (video-nystagmography), 467 etiology of, 27–29
Vocabulary in parkinsonism, 30–31 X
in bilingualism, 279–280 prevalence of, 20 Xerostomia, due to radiotherapy, 46
of deaf child, 336–337 psychogenic, 27–29 X-ray microbeam imaging, in speech
Vocal cord paralysis Voice Handicap Index (VHI), 21–22 assessment, 171
due to herpes simplex and herpes zoster Voice handicap measures, 20–23
infections, 33 Voice Outcome Survey (VOS), 22 Y
reinnervation for, 42–43 Voice production system Yawn-sign phonation, 88, 89
Vocal education, for neurological aging- evolution of, 56–58 Yes/no target-monitoring task, 458–459
related voice disorders, 93 physics and physiology of, 75–77
Vocal fold(s) Voice quality Z
anatomy of, 17–19 after conservation laryngectomy, 81 Zero-crossing-rate division, 473
in children, 71 factors leading to, 77
developmental anatomy and physiology of, perceptual evaluation of, 78–79
36 Voice rehabilitation, after laryngectomy, 10–
electroglottography of, 23–26 12
false, in phonation, 77 partial or conservation, 80–81
injuries to, 55 Voice source, during singing, 51, 52–53
medialization of, for dysarthria, 130 Voice therapy
during phonation, 75, 76, 77 for adults, 88–90
in voice acoustics, 64 breathing exercises in, 82–84
in voice therapy, 89 for functional dysphonia, 49–51
Vocal fold adduction, and voice therapy, 88– holistic, 85–87
89 for neurological aging-related voice
Vocal fold elongation abnormalities, voice disorders, 91–94
therapy for, 89 for professional voice users, 95–97
Vocal fold mucosa, hydration of, 55 for transsexuals, 224, 225
Vocal fold oscillation, during phonation, 76 Volume velocity, 63, 64
Vocal fold paralysis, voice handicap Volume-velocity waveform, 63, 65
assessment in, 22 VOR (vestibulo-ocular reflex), 565
Vocal fold vibration, modal, glottal volume VOS (Voice Outcome Survey), 22
velocity for, 63 Vowel(s), point, 58
Vocal function, aerodynamic assessment of, Vowel production, 143
7–9 Vowel prolongation, in dysarthria, 126
Vocal Function Exercises (VFE), 85–87, 90 VRA (visual reinforcement audiometry), 521,
Vocal hygiene, 54–56 522
Vocalis muscle, 19
Vocalization, neural mechanisms of, 59–61 W
Vocal ligament, 17–19 Waardenburg syndrome, hearing loss in, 479
Vocal performers, 96–97 WAB (Western Aphasia Battery), 253

Encyclopedia of Communication Disorders

Enviado por

Dados do documento

Direitos autorais

Formatos disponíveis

Compartilhar este documento

Compartilhar ou incorporar documento

Opções de compartilhamento

Você considera este documento útil?

Este conteúdo é inapropriado?

Direitos autorais:

Formatos disponíveis

Encyclopedia of Communication Disorders

Enviado por

Direitos autorais:

Formatos disponíveis

The MIT Encyclopedia of

Edited by Raymond D. Kent

Library of Congress Cataloging-in-Publication Data

The MIT encyclopedia of communication disorders / edited by Raymond D. Kent.

Stuttering 220 Lingustic Aspects of Child Language Impairment—

Noise-Induced Hearing Loss 508 Speechreading Training and Visual Tracking

Figure 1. Instrumentation and resulting signals for

the cough reﬂex, assists in dislodging material from the

Regional Anatomical Relationships. The larynx is

Cartilaginous Skeleton. The larynx is composed of ﬁve

The laryngeal cavity is conventionally divided into

Table 1. Morphological Characteristics of the Suprahyoid and Infrahyoid Muscles

Table 2. Morphological Characteristics of the Intrinsic Laryngeal Muscles

biomechanical outcomes of the actions of the intrinsic

Figure 5. Schematic of the layered

Further Readings relationship of communication ability to global quality-

Summary daily activities. Journal of Speech, Language, and Hearing

electroglottography (EGG), has emerged as the most

Using multiple glottographic techniques, Baer,

intonation, voicing, and ﬂuency characteristics. In fact,

Pfau, E. M. (1975). Psychologische Untersuchungsergegnisse Hypokinetic Laryngeal Movement

Electroglottographic Studies of Children’s Voice. Al- References

abductor spasmodic dysphonia. Otolaryngology–Head and Further Readings

Sapienza, C. M., Walton, S., and Murry, T. (2000). Adductor

Laryngeal Reinnervation Procedures

The human larynx is a neuromuscularly complex organ

the thyroid cartilage. A large inferiorly based window is

References van Lith-Bijl, J. T., Stolk, R. J., Tonnaer, J. A. D. M.,

Figure 3. Spectra of the vowel [u] as spoken and sung by a

Vocal hygiene has been part of the voice treatment liter-

Figure 1. Examples of unusual vocal

Figure 1. The lateral and mesial surface of the human brain.

Figure 1. A, Idealized waveform of

Figure 2. Schema showing how the

Figure 3. A, Derivative Us0 (t) of the

spectrum (Hanson, 1997). The three consequences just

Hanson, H. M., and Stevens, K. N. (2002). A quasiarticulatory

and closing instants on the waveform are taken at a

Figure 2. A, Airﬂow open quotient as a function of age and sex.

amplitude of vibration when they increase their SPL,

Figure 5. Fundamental frequency as a function of age, sex, and

vocal function at age 14 in boys. It is not merely coinci-

sity values in the valleys of the resonant structure (Titze,

(Zhang et al., 2001). The turbulence and vorticities of

References Voice Rehabilitation After

is. Hixon (1975) provides a useful parameterization of E¤ects of Posture

polyps, nonspeciﬁc phonotraumatic changes, paralysis, References

References broader than and encompasses that of developmental

Variability. Children with motor speech disorders

Ball, M. J., Rahilly, J., and Tench, P. (1996). The phonetic

Severity Considerations. A common practice in the

cated that the two subjects were able to acquire correct

Further Readings Vaughn-Cooke, F. (1987). Are black and white vernaculars

Speech and Language Issues in Table 1. Cultural Di¤erences Between Asian-Paciﬁc

movements of the folds. This technology is often coupled

the respiratory and laryngeal muscles, muscle groups in

Speech Assessment in Children: guage. In linguistic terminology, these gaps would be

Descriptive linguistic methods have long been used in

residual errors. These two subtypes may have di¤erent

Sex as a Risk Factor. A robust ﬁnding in studies of fa-

Diagnosis valuable diagnostic information is gleaned from the

facets of unintelligibility in an individual child can

Further Readings Table 1. Conditions and Factors Present at Birth or Shortly

preschool, Spanish-speaking children accurately produce phonological systems of bilingual (English-Spanish)