Escolar Documentos
Profissional Documentos
Cultura Documentos
MMPI-2
Critical Items, Content Scales, and
Subscales
Critical Items
The empirical keying or contrasted-groups methodology used by Hathaway in developing
the eight basic clinical scales of the MMPI represented a significant innovation in
personality inventory construction. Previous inventories relied on intuitive judgment to
produce items related to a trait of interest and to combine them with other items thought
to be related to the same trait to compose a scale. The items and scales thus composed
tended to be highly face valid (i.e. the items tended to be obviously related to the trait to
be measured). Unfortunately, they were often unsatisfactorily related to external criteria
and were overly vulnerable to the test-taking attitudes of the examinee that would bias
responses and potentially distort the results of testing.
The new approach required only that items selected for inclusion differentiate
between normal and abnormal criterion groups empirically. The authors of the MMPI
gave little consideration to the content of individual items and were concerned only that
the breadth of coverage for the original item pool was sufficient to sample a wide array of
attitudes and behaviors. The justification for the new empirical strategy was provided in
the classic manifesto of Meehl (1945a), The Dynamics of Structured Personality Tests.
In the decades following the release of the MMPI, clinical needs and advances in
psychometric theory and methods have seen a return to attaching greater importance to
test item content. The dust-bowl empiricist rationale for proscribing the examination
of item content in the clinic was succinctly stated by Meehl (1945b): The scoring does
not assume a valid self-rating to have been given (p. 147). But studies demonstrating
comparable validities for personality assessment instruments developed under both
rational and empirical strategies (Hase & Goldberg, 1967), and the appearance of
sophisticated positions defending the importance of test item content (Goldberg &
Slovic, 1967; Jackson, 1971), were instrumental in moderating Meehls (1945) earlier
position (Meehl, 1971, 1972). These developments were predated, however, by the needs
of clinicians to explore the MMPI as part of and prelude to acquiring proficiency in its use
and the pressing demands of clinical work to understand patients under investigation.
Caldwell (1991) has pointed out that the basic elements of the MMPI that are subject
to interpretation may be located on a gradient of increasing obviousness or transparency,
with the (a) subtle components of the clinical scales and certain additional scales (e.g. K
and, perhaps, Mf), (b) certain highly subtle empirically derived scales (e.g. MACR and
O-H), and (c) the basic clinical scales having subtle components corresponding to three
initial steps of this gradient. Intermediate steps would include empirically derived scales
Interpreting the Content of the MMPI-2 393
more generally, including the more obvious of the basic clinical scales. Still more obvious
would be the components of the basic clinical scales and scales whose developmental
methodology emphasized internal consistency standards, such as the Wiggins, the
MMPI-2 content scales, and the RC scales. The top-most steps in this gradient would
include scales composed of items with unusually low endorsement rates, such as the
F and FB scales, and the so-called critical items. It should not be surprising that the
growth in the appreciation of test item content and its potential importance in the clinic
should have originated with the distinction to which Wiener and Harmon (Wiener,
1948; Wiener & Harmon, 1946) called attention, between subtle and obvious content,
and critical items.
Initially, clinicians sought access to the patients responses to certain items thought
to be clearly indicative of psychopathological disturbance. Grayson (1951) gathered a
set of 38 items for use with a VA population. Endorsement of any of these in the course
of psychodiagnostic evaluation was considered sufficient to warrant more detailed
clinical inquiry into the content area of the item, regardless of whether the MMPI profile
appeared pathological. In some cases, these items were considered pathognomonic of a
condition, such as delusions or suicidal ideation. In others, the item would serve as a red
flag or a stop sign, forcing an interruption of the diagnostic process in order to explore the
patients grounds for its endorsement. These thus came to be called stop or critical items.
The history of critical items actually goes back to the Woodworth (1920) Personal Data
Sheet, which included 10 neurotic tendency items. These were considered indicators of
neurosis, regardless of the remainder of the individuals responses.
Although the Grayson items were widely adopted (Gravitz, 1968), particularly after
being reprinted in An MMPI Handbook (Dahlstrom & Welsh, 1960), alert practitioners
soon discovered that this group of items was overwhelmingly redundant with the F
and Sc scales. Eighty-five percent (32/38) of these items were scored on one or both of
these, 40 percent (16 items) on the F scale alone (Koss, 1979). Moreover, for 92 percent
(35/38) of the items, the keyed response was True. Thus, however salient the content of
the individual items might be as a springboard for investigation within the interview,
it became clear that the Grayson items placed undue stress on psychoticism, to the
exclusion of other problem areas; that these items, like the F scale itself, were sensitive to
gross deviancy and to a set on the patients part to exaggerate or conceal psychological
complaints; and that they were also vulnerable to an acquiescent response style, the
inclination to mark items True, regardless of their content.
Another set of critical items was selected by Caldwell (1969). Seeking a broader
range of content, Caldwell chose 68 items, distributed among nine categories. As a set,
the Caldwell critical items covered a wider range of problem areas than the Grayson
items. Nevertheless, although not quite so dominated by psychoticism, with 56 percent
(38/68) of the items overlapping F and Sc (20 items, or 33 percent, on F alone), these
items remain relatively saturated with this source of variance. These shortcomings
were in addition to the greatest concern of all: that the Grayson and Caldwell critical
items, with their origins in the rational-intuitive processes of their creators, had
no demonstrated empirical relationships with the symptoms and complaints they
enunciated. Critiques by Greene (1980) and R. G. Evans (1984) noted the occurrence
of the Grayson and Caldwell critical items in normal groups, and other contradictory
evidence for their validity.
394 Interpreting the Content of the MMPI-2
Seeking to address the external validity of critical items, Lachar and Wrobel (1979)
investigated the empirical correlates of a large number of items, including those appearing
in the Grayson and Caldwell lists, nominated by clinicians as relevant to 14 common
problem areas for psychiatric in- and outpatients. Lachar and Wrobel ultimately settled
on 111 items, distributed into 11 content areas, of which 99 achieved significant (.05)
correlations with counterpart information on problems recorded in patient files; the
remaining 12 items achieved acceptable correlations with closely related criteria.
An alternative approach was taken by Koss and Butcher (1973; Koss, Butcher, &
Hoffmann, 1976) to ensure empirical correlates for critical items by identifying six
crisis situations, each marked by a set of behaviors or complaints exhibited by patients
Table 7.1 A comparison of the content contained within three sets of critical items
LacharWrobel Caldwell KossButcher
Content No. of Content No. of Content category No. of
category items category Items items
(MMPI/ (MMPI/ (MMPI/
MMPI-2) MMPI-2) MMPI-2)
Psychological discomfort
Anxiety and 11/11 Acute anxiety 9/17
tension
Depression and 16/16 Distress and 11/11 Depressed- 25/22
worry depression suicidal ideation
Sleep disturbance 6/6 Suicidal 5/5
thoughts
Reality distortion
Deviant beliefs 15/15 Ideas of 10/1 Persecutory 12/11
reference, ideas
persecution,
and delusions
Deviant thinking 11/10 Peculiar 9/9 Mental 3/11
and experience experience and confusion
hallucinations
Characterological adjustment
Substance abuse 4/3 Alcohol and 4/3 Situational 15/7
drugs stress due to
alcoholism
Antisocial 9/9 Authority 5/5
attitude problems
Family conflict 4/4 Family discord 7/7
Problematic 4/4 Threatened 3/5
anger assault
Sexual concern 8/6 Sexual 7/6
and deviation difficulties
Somatic 23/23 Somatic 10/10
symptoms concerns
Interpreting the Content of the MMPI-2 395
at the time of their admission to the hospital. After defining the crisis group, Koss and
Butcher asked clinicians to identify MMPI items that corresponded to the behaviors
and complaints characteristic of each of the groups. Nominated items were then cross-
validated on newly admitted patients, with non-crisis psychiatric patients serving as
controls.
Despite the different methods used to develop the LacharWrobel, Caldwell, and
KossButcher critical item sets, Table 7.1 suggests a high degree of similarity in the
content of the items for each set, as well as roughly comparable areas of coverage. Each
critical item list references distress and dysphoria, cognitive disruption, psychotic
ideation, and substance abuse. (See Table A6, pp. 569576 in Friedman et al., 2001, for
the MMPI-2 items and scoring direction for the KossButcher, LacharWrobel, and
Nichols critical item lists.)
A fourth list was developed by Nichols (1989) when he found many of the Lachar
Wrobel critical items categories insufficiently homogeneous and those of Caldwell and
of Koss and Butcher, too restricted. For example, in his consultations for a neurologist,
he wanted to be able to specify the kinds of somatic complaints endorsed more precisely,
both in terms of their specific content and in terms of the proportion of items endorsed
within a specific content area, such as motor difficulties or genitourinary complaints.
He was also discontented with the inclusion of items in categories implicating psychotic
mentation that might reflect only unusual culturally based beliefs and experiences. For
example, the item Evil spirits possess me at times, although frequently scored on scales
highly saturated with psychoticism (e.g. F, Pa, BIZ, PSYC, and RC6) and potentially
endorsed as an acknowledgment of the kind of hallucinatory or other anomalous
experience common to psychotic states, may also be endorsed at relatively high
frequency by members of certain religious sects or immigrants from countries wherein
a belief in spirits, evil and otherwise, is more common than in the United States. Nichols
therefore devised a new set of items based on both rational and statistical considerations.
Categories were initially selected from large-scale item factor analyses on a very large
Midwestern psychiatric sample. These categories were then refined by subdividing many
of the somatic factors into more discrete classes of symptoms by eliminating categories
that essentially duplicated the content of normed scales, such as fears and phobias, and
by examining patterns of item overlap among content scales and the Caldwell, Koss
Butcher, and LacharWrobel lists. Because of their established validity characteristics,
virtually all of the items in the KossButcher and LacharWrobel sets were retained
on the Nichols Critical Item List (NCIL). More than any of the alternative critical item
lists, the NCIL permits a more specific assessment of both the range and the intensity of
symptomatic expressions, particularly within the health/somatic/neurological area. The
NCIL for the MMPI-2 contains 217 items spread over four major classes and 23 specific
item clusters.
Of the two chief controversies surrounding the use of critical items, one is conceptual,
the other statistical. The conceptual issue is whether inventory items should be
considered behavior samples or behavioral signs. According to Koss (1979), the earliest
inventories viewed item responses as veridical self-reports or samples of behavior
that could stand in lieu of actual interview or observational data, thereby providing a
more efficient basis for clinical description. It was in part in reaction to this view, and
the disappointing performance of previous inventories guided by it, that Hathaway
396 Interpreting the Content of the MMPI-2
chose to adopt an empirical approach to the composition on his scales. Abandoning
the assignment of items to scales on the basis of judgments a priori, Hathaway left
between-groups differences in endorsement frequency to identify each item as a sign of
the criterion group, its significance to be determined by further investigation. Meehls
(1945b) enunciation of the empirical rationale emphasized the range of understandings
various people might bring to test items and stressed the fact that a given statement was
endorsed over the content of the statement itself. In his words, the empirical approach
consists simply in the explicit denial that we accept a self-rating as a feeble surrogate
for a behavior sample, and substitutes the assertion that a self-rating constitutes an
intrinsically interesting and significant bit of verbal behavior, the non-test correlates
of which must be discovered by empirical means.
(Meehl, 1945a, p. 297)
Content Scales
As highly homogeneous collections of items with similar content, the MMPI-2 content
scales also provide a means by which the patient can communicate with the clinician.
Because the most immediate access to the symptomatic behavior and concerns of
the patient is through scales having a strong thematic character, content scales are
designed in a way that allows them to respond directly to aspects of the examinees self-
presentation on the MMPI-2. As Wiggins, Goldberg, and Applebaum (1971) noted, the
view that the MMPI constitutes an opportunity for communication between S [subject]
and the tester has much to commend it; not the least of which is the likelihood that this
is the frame of reference adopted by the S himself (p. 403).