Escolar Documentos
Profissional Documentos
Cultura Documentos
][University of Nebraska]
On: 19 August 2010
Access details: Access Details: [subscription number 917350222]
Publisher Psychology Press
Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office: Mortimer House, 37-
41 Mortimer Street, London W1T 3JH, UK
To cite this Article Spain, Seth M. , Miner, Andrew G. , Kroonenberg, Pieter M. and Drasgow, Fritz(2010) 'Job Performance
as Multivariate Dynamic Criteria: Experience Sampling and Multiway Component Analysis', Multivariate Behavioral
Research, 45: 4, 599 — 626
To link to this Article: DOI: 10.1080/00273171.2010.498286
URL: http://dx.doi.org/10.1080/00273171.2010.498286
This article may be used for research, teaching and private study purposes. Any substantial or
systematic reproduction, re-distribution, re-selling, loan or sub-licensing, systematic supply or
distribution in any form to anyone is expressly forbidden.
The publisher does not give any warranty express or implied or make any representation that the contents
will be complete or accurate or up to date. The accuracy of any instructions, formulae and drug doses
should be independently verified with primary sources. The publisher shall not be liable for any loss,
actions, claims, proceedings, demand or costs or damages whatsoever or howsoever caused arising directly
or indirectly in connection with or arising out of the use of this material.
Multivariate Behavioral Research, 45:599–626, 2010
Copyright © Taylor & Francis Group, LLC
ISSN: 0027-3171 print/1532-7906 online
DOI: 10.1080/00273171.2010.498286
Andrew G. Miner
Target Corporation
Pieter M. Kroonenberg
Leiden University
Fritz Drasgow
University of Illinois, Urbana-Champaign
Questions about the dynamic processes that drive behavior at work have been
the focus of increasing attention in recent years. Models describing behavior
at work and research on momentary behavior indicate that substantial variation
exists within individuals. This article examines the rationale behind this body
of work and explores a method of analyzing momentary work behavior using
experience sampling methods. The article also examines a previously unused set of
methods for analyzing data produced by experience sampling. These methods are
known collectively as multiway component analysis. Two archetypal techniques of
multimode factor analysis, the Parallel factor analysis and the Tucker3 models, are
used to analyze data from Miner, Glomb, and Hulin’s (2010) experience sampling
study of work behavior. The efficacy of these techniques for analyzing experience
sampling data is discussed as are the substantive multimode component models
obtained.
599
600 SPAIN, MINER, KROONENBERG, DRASGOW
Historically, measures of job performance have been of great interest for orga-
nizational scientists attempting to establish the validity of selection systems
(Austin & Villanova, 1992). For instance, if a city uses a cognitive ability
measure to select its police officers, applied psychologists want to validate
those cognitive ability scores against a criterion such as the new officers’ arrest
records where arrest records serve as a measure of job performance. Validity
for predicting a criterion is commonly evaluated via the correlation between
the predictor measure and criterion. For the police example, this would be the
correlation between the cognitive ability scores and the arrest record.
Downloaded By: [Spain, Seth M.][University of Nebraska] At: 21:30 19 August 2010
There are three sources of meaningful variance in job performance scores (Ghis-
elli, 1956): (a) static dimensionality, the ordinary factorial representation of
performance; (b) dynamic dimensionality, temporal factors influencing the per-
formance domain; and (c) individual dimensionality, variability in the type of
performance across persons in the same job. We consider each of these sources
in turn, beginning with individual dimensionality. For similar distinctions in a
more general sense, see Cattell (1952; refer to Figure 1).
Ghiselli (1956) described individual dimensionality as the way that individu-
als assigned to the same job within an organization perform that specific job in
qualitatively different ways, not merely with different proficiency. Dimensions
in the individuals mode differentiate types of employees. For example, two
salespersons may provide the same economic benefit to the organization, but
one contributes by directly making sales, whereas the other contributes by cre-
ating goodwill, encouraging customers to make purchases throughout the store
(Ghiselli, 1956, p. 4). Such individual difference dimensions of performance
could be important in a variety of situations. If the organization derives the same
economic benefit from different employees in different ways, these differences
should be reflected in selection and reward systems.
Static dimensionality refers to the latent structure of the variables measuring
job performance. Historically, the study of job performance was characterized
by a search for the “ultimate criterion,” a comprehensive index of performance.
It has been pointed out that this is an inappropriate way to conceptualize
performance (e.g., Dunnette, 1963; Ghiselli, 1956). There is evidence across
many jobs that overall job performance can consist of as many as eight dimen-
sions (Campbell, 1994; Campbell et al., 1990). At minimum, job performance
researchers should consider both task performance, the technical core of the
job, and contextual performance, the social and nontechnical contributions an
individual makes at work (Borman & Motowidlo, 1997). These dimensions
have been found to independently contribute to supervisors’ perceptions of their
602 SPAIN, MINER, KROONENBERG, DRASGOW
Downloaded By: [Spain, Seth M.][University of Nebraska] At: 21:30 19 August 2010
subordinates’ overall job performance (Motowidlo & Van Scotter, 1994). For
instance, consider two salespeople who have equal sales. One is known to be
a loner, whereas the other gives advice and assistance to coworkers. The latter
will be viewed as the superior performer due to the social contributions this
salesperson makes.
It may be necessary to have a single measure of job performance, for example,
when establishing the predictive validity of a selection battery (Hulin, 1982).
One might then construct some composite of scores on various performance
dimensions. The weighting scheme chosen to combine different dimensions of
job performance has been found to greatly impact those validity estimates in
Monte Carlo simulations (Murphy & Shiarella, 1997). These authors found that
34% of the variance in validity estimates over simulated samples was accounted
for by the performance dimension weighting scheme.
Finally, the dynamic nature of performance criteria is important to consider
for employee selection. There are well-documented changes in the predictive
relationships between selection instruments and performance measures (Alvarez
& Hulin, 1972). In an academic example, SAT scores are most predictive of
first-semester college grade point average (GPA), and the correlations between
SAT and GPA decrease after that (e.g., Humphreys, 1976). These changes in
MULTIVARIATE DYNAMIC CRITERIA 603
predictive validity have been found to be virtually ubiquitous (Hulin et al., 1990).
The fact that predictive validity changes over time is usually called validity
degradation and is the primary approach to the study of dynamic criteria, though
there are other approaches (e.g., Austin, Humphreys, & Hulin, 1989; Barrett,
Caldwell, & Alexander, 1985). This work is summarized in meta-analyses by
Hulin et al. and Keil and Cortina (2001), which show that validity degradation
is pervasive and occurs with all manner of ability predictors.
If performance changes over time, it would useful to find predictors of the
change itself. Personality measures have been used in addition to cognitive
Downloaded By: [Spain, Seth M.][University of Nebraska] At: 21:30 19 August 2010
moods on overall job satisfaction. Dalal, Lam, Weiss, Welch, and Hulin (2009)
investigated the dimensionality of organizational citizenship and counterpro-
ductive behaviors and their relationships to state affect and performance. Miner,
Glomb, and Hulin (2005, Miner & Glomb, 2010) found general support that indi-
viduals modify their work behaviors in response to mood changes. These studies
demonstrate that a substantial portion of the variance in organizationally relevant
variables is within-person, underscoring the need for longitudinal studies.
Experience sampling data are three-mode (Persons Variables Occasions)
and are frequently analyzed with multilevel models with the occasions mode
nested within persons. This article addresses whether such data may be examined
using multiway analysis, as illustrated in the next section. Situations where
these analyses would be useful include examination of whether the structures
of counterproductive and citizenship behaviors change over time, or, even more
fitting for multiway methods, research on whether different groups of employees
display different patterns of change in these structures. Multiway models are a
natural way of addressing such questions but may prove difficult to implement
with experience sampling data. For instance, in the data presented here, the
measurement occasions occurred randomly within 2-hr windows. Therefore,
Participant 1’s Time 1 is not strictly simultaneous with Participant 2’s but should
be basically equivalent. One goal of this article is to serve as a methodological
exercise in an attempt to judge whether this assumption of near simultaneity is
tenable. This concern is further addressed in the Discussion section.
MULTIWAY ANALYSES
When viewed in the way specified by Ghiselli (1956), the performance domain
is defined by a three-way data relation box (Cattell, 1952). This box is defined
by its three modes: individuals, variables, and occasions. This box is referred to
as an array, a generalization of the matrix concept (see Figure 1). In Figure 1,
participants, variables, and occasions are all both modes and ways. The front
side of the array in Figure 1, the participants by variables “slice” is the matrix
we usually handle in psychological data. Each measurement occasion in a
MULTIVARIATE DYNAMIC CRITERIA 605
profiles. Cattell’s (1944) idea was to measure the same individuals on the same
variables across two systematically different occasions, say before and after an
experimental manipulation. If one recovered the same factors at both occasions
then this common orientation of the factors or these parallel proportion profiles
could, according to Cattell (1944), only occur if the factors represented real
psychological constructs.
The Parafac model (Harshman, 1970) is essentially both a model formulation
and computational solution of this principle applied to an arbitrary number of
conditions or occasions. Moreover, Harshman and Carroll and Chang (1970)
showed that the parallel proportional profile principle could be extended to the
n-way case. The Parafac model is mathematically identical to the canonical
decomposition model in multidimensional scaling (Carroll & Chang, 1970). In
this article, we refer only to the former name. The Parafac model is presented
in matrix notation as
Xk D ADk B0 C Ek ; (1)
where Xk is the kth frontal slice of the data array X (matrices are boldfaced and
multiway arrays boldfaced and underlined). A is the coefficient matrix for the
first mode; B0 is the transpose of the coefficient matrix of the second mode; Dk
is a diagonal matrix containing the kth row of the third-mode coefficient matrix
along its diagonal, that is, dssk D cks. Ek is the kth frontal slice of residual array
E. In sum notation this becomes
xijk D †s dssk Œais bjs C eijk D †s csk Œais bjs C eijk ; (2)
which shows that for the kth slice of the data array the dssk -coefficients provide
weights for the coefficients of the other two modes. They are the “proportion-
ality” constants that capture the systematic variation between slices of the data
array (cf. Harshman & Lundy, 1984a). A dssk (csk ) coefficient defines the relative
importance of component s to occasion k. It should be noted that the model is
in fact proportional with respect to all its modes.
It is important to distinguish two types of variation: system variation, pertain-
ing to the system as a whole, and object variation, which pertains to variation
606 SPAIN, MINER, KROONENBERG, DRASGOW
of specific (groups of) objects, that is, part of the system but not the system as a
whole. For the Parafac model to be applicable sufficient system variation should
be present in the multiway data. System variation involves a conceptualization
of components that lie “in the system” under examination such that the same
instances of the components affect all of the objects; there should be parallel
proportional profiles. The object variation model involves a conceptualization
of components as having an instance of the component “in” each object under
study, such that the components are properties of (groups of) individuals under
study. The system model implies synchronous variation in component influences
Downloaded By: [Spain, Seth M.][University of Nebraska] At: 21:30 19 August 2010
across levels of the third mode (e.g., occasions), whereas the object model does
not imply such synchronous variation (cf. Harshman & Lundy, 1984a, pp. 130–
133). The basic Parafac model has no provision for modeling object variation.
Therefore, if substantial object variation is present, a more complicated model
such as the Tucker3 should be used.
Tucker (1964, 1966) formally introduced the three-mode factor analytic model
for three-way data as an extension to the well-known two-mode model for two-
way data. This model was later named after him by Kroonenberg and de Leeuw
(1980). The Tucker family of models can account for both system and object
variability. Of these models the Tucker3 is the most complex and it can also be
shown to be more complex than the Parafac model. The sum notation for the
model is
Here, aip represents the coefficient of the i th individual on the pth A-mode
factor, whereas bjq represents the coefficient of the j th variable on the qth
variable factor, and ckr represents the coefficient of the kth timepoint on the r th
time factor. Finally gpqr represents the element of the core array that links the
pth, qth, and r th factors in the A-, B-, and C-modes, respectively, with each
other. It can also be interpreted as a representation of the interaction between the
factors of the three modes (for further details see Kroonenberg, 2008, Sec. 4.8).
Thus the columns of the coefficient matrices A, B, and C are the components
that may be represented as vectors in graphs displaying the component spaces.
For convenience, we often speak of component scores for participants, compo-
nent loadings for variables, and component scores or weights for timepoints.
Only the variable loadings can, however, be understood as in the two-way
case, that is, as variable-component correlations, if the variables have been
centered and normalized in a specific way (see the following). In most cases, the
interpretation of the component scores proceeds in the same way as component
scores in two-way component analyses where the components are in normalized
coordinates, that is, have unit sums of squares. Another way of thinking about
the component matrices is in terms of “idealized” instances of the real objects
MULTIVARIATE DYNAMIC CRITERIA 607
in that mode, that is, ideal individuals, ideal (or latent) variables, and ideal
measurement occasions or trends (Kroonenberg, 2008, Sec. 9.4; Tucker, 1966).
The core array is what allows Tucker models to account for object variability
where the Parafac model cannot. So whereas Component 1 in the individuals
mode must correspond to Component 1 in the variables and time modes in
a Parafac analysis, the same does not apply to a Tucker3 model. The core
allows three-way interactions between the components in different modes and
for different numbers of components in each mode. A number of ways of
interpreting the core are possible. One may view the core array as “idealized
Downloaded By: [Spain, Seth M.][University of Nebraska] At: 21:30 19 August 2010
data” such that each element represents the score on the ideal variable by
the ideal person at the ideal measurement occasion corresponding to that core
element (cf. Tucker, 1966). As the scalar notation makes clear (Equation 3),
one can also think of the core elements as regression weights for predicting the
original data using the three-way interaction of the components (Kroonenberg,
2008, pp. 225–228). Thus, core elements give the importance of a combination
of mode components for reproducing the original data.
RESEARCH QUESTIONS
Inn, Hulin, and Tucker (1972) offered perhaps the only multivariate dynamic
criterion study using multiway analysis. They fit a Tucker3 model to data from a
sample of 184 airline reservation agents measured five times on 11 performance
variables. These authors found a 4 3 3 solution, demonstrating multidi-
mensionality in each mode, consistent with Ghiselli’s (1956) three sources
conjecture. Given the dearth of previous research, the current study is largely
exploratory and we propose few a priori hypotheses. The major expectation of
this study is that multiple components will be extracted in each mode of the data,
consistent with Ghiselli and with Inn et al.. Given previous work on affect cycles
in experience sampling designs, it is likely that at least one of the time compo-
nents will show cyclic trends in loadings (Weiss et al., 1999). These loadings are
analogically similar to variable loadings on growth parameters in latent growth
curve analysis (e.g., Chan, 1998), though they are both exploratory in character
and expected to display cyclical rather than curvilinear patterns. Further, we
expect that we should be able to recover three B-mode components for the ob-
jective, behavioral, and self-rated performance variables discussed in the Method
section. We have no hypotheses regarding individuals-mode components.
This article is primarily a methodological exploration. We want to know if
it is possible to fit well-behaved multiway models to experience sampling data.
We expect to be able to do so, and the only remaining question is whether the
Parafac model will be sufficient. The Parafac model is a constrained Tucker3
model, which itself can be seen a constrained two-way principal components
608 SPAIN, MINER, KROONENBERG, DRASGOW
systematic variation redundantly or simply model noise (Smilde, Bro, & Geladi,
2004, pp. 154–155). If a degenerate Parafac solution is obtained, it usually
indicates the presence of Tucker structure or object variability.
METHOD
average call hold time in the 30-min window in which the signal occurred.
values for the missing-data locations (Smilde et al., 2004). Though this data set
is not large enough to allow for cross-validation, the stability of the results were
examined using several random starts.
RESULTS
Both Parafac and Tucker3 models were fit to the data. Analyses were performed
using the 3WayPack suite of programs (Kroonenberg, 1994). Multiway models
Downloaded By: [Spain, Seth M.][University of Nebraska] At: 21:30 19 August 2010
were compared based on variance accounted for (VAF) measures. The VAF of
the model was divided by number of factors estimated in order to compare the
model’s explanatory power with its parsimony (Timmerman & Kiers, 2000).
Similar indices were discussed in Kroonenberg and Oort (2003; Kroonenberg,
2008, pp. 179ff.). These are not formal fit indices but aid the researcher’s decision
making regarding which models are good candidates for interpretation.
Component models assume that variables are ratio-scaled ones (Harshman
& Lundy, 1984a). However, psychological measurements mostly have interval
properties. Centering will change the data into ratio-scale deviations from the
mean so that they can be handled in a component analysis. Harshman and
Lundy (1984b) and Kroonenberg (2008, chap. 6) discuss appropriate types of
centering for three-way data, and centering across participants per variable-
timepoint combination is their recommended procedure for the present data.
Our data were accordingly centered across the participant mode, such that
the mean for each variable at each measurement occasion was subtracted out
of each measurement. This is the most appropriate centering strategy because
the starting point of the study is arbitrary. An accompanying advantage of this
centering is that the matrix of variable-occasion means can be seen as the score
of the average person so that the deviation scores can be interpreted with respect
to this average person. Additionally, the three objective measures (call handle
time, call wait time, and call hold time) were corrected for nonnormality by
taking their natural logarithms. Finally, the variables were size-normalized over
all participants-timepoint combinations so that the values of the variables were
comparable as standard scores across variables and remained comparable per
variable across all occasions. To be precise, the scores analyzed with the three-
mode models were
where a dot replaces the index over which the sum was taken. A final advantage
of this form of standardization is that given proper scaling of the output (see
the following) the component loadings for the variables can be interpreted as
variable-component correlations (Harshman & Lundy, 1984a).
MULTIVARIATE DYNAMIC CRITERIA 611
Descriptive Statistics
Presenting even descriptive statistics for the entire data array would be cumber-
some. Thus descriptive statistics for the A-mode matricization are presented in
Table 1. Matricization is the unfolding of the three-mode data array into a two-
way data matrix extended along one of the modes of data classification. Given
an i j k data array, the A-mode matricization is of order ik j (cf. Kiers,
2000), where the participant mode (k) is the slower running mode such that the
rows of the matrix consist of the 1st participant’s 40 measurement occasions
followed by the 2nd participant’s 40 measurement occasions, and so on. Table 1
Downloaded By: [Spain, Seth M.][University of Nebraska] At: 21:30 19 August 2010
presents the means, standard deviations, and correlations for the variables in
the A-mode matricization. This correlation matrix can be interpreted as usual,
though it conflates within- and between-person covariances.
Means Analysis
Prior to three-way analysis, we explored the mean time trends for the variables,
that is, trends for means that were removed before the three-way analyses.
Figures 2 through 4 present the time trends for seemingly similar variables.
Figure 2 displays the self-rated variables: average handle time, average quality
of service, and overall service. These variables display very similar time trends.
Figure 3 shows the trends for the behavioral variables: the focal performance
composite, withdrawal behaviors, citizenship behaviors, neutral behaviors, and
location. Again, these display highly similar trends. The objective measures,
average handle time, average wait time, and average call duration, do not show
such a clear pattern.
Table 2 displays the component loadings for a varimax-rotated PCA of the
time series for these variables averaged over participants. Two components were
recovered: the self-rated variables loaded strongly on one, with location, focal
performance, withdrawal, and neutral behaviors loading strongly on the other.
Log-average handle time and citizenship performance also showed reasonable
loadings on this component. The call duration and call wait time variables did
not load strongly on either component.
Three-Way Analysis
Based on several initial solutions, we decided to eliminate 1 participant from
the three-way analyses. This participant had much too high an influence on the
solutions. After this, the preferred models are 2 2 1, 3 2 2, 4 2 2,
and 4 2 4 Tucker models and a two-component Parafac model. The two-
component Parafac model was a rotation of the 2 2 2 Tucker model (see
Table 3).
Downloaded By: [Spain, Seth M.][University of Nebraska] At: 21:30 19 August 2010
612
TABLE 1
Descriptive Statistics for the A-Mode Matricization of ESM Data Array
M SD 1 2 3 4 5 7 8 9 11 10 6
Note. Call time variables were log-transformed prior to analysis. Correlations in boldface are statistically significant at the .05 level.
MULTIVARIATE DYNAMIC CRITERIA 613
Downloaded By: [Spain, Seth M.][University of Nebraska] At: 21:30 19 August 2010
FIGURE 2 Centered average time series for self-rated customer service variables. Corre-
lations among time series: 0.3–0.6.
FIGURE 3 Centered average time series for behavioral variables. Correlations between .6
and .9 except with Citizenship, which range from .2 to .4.
of the data used for the local regressions (Cleveland & Devlin, 1988). The
first component has been rotated to optimal constancy. The second component
is orthogonal to the first one. There is no clear periodicity (i.e., performance
cycles) in these time components. However, to uncover such periodicity would
require that the measurement occasions were spaced evenly and truly taken at
the same time for all participants. We constructed joint biplots to investigate
the relationships between individuals and variables for the two time components
(e.g., Kroonenberg, 2008, Appendix B). Variables are displayed as vectors in the
two-dimensional space, and individuals projecting highly on a variable vector
scored highly on it; individuals plotted near each other have similar profiles.
Figure 6 is the joint biplot of individuals and variables for Time Component
1 (variable coordinates presented in Table 4). Because Time Component 1
is largely constant, this biplot represents individual differences in how the
participants carried out their job duties over time. For instance, we can see how
Participants 19, 20, 23, 46, and 50 had high scores on call wait time but low
MULTIVARIATE DYNAMIC CRITERIA 615
Downloaded By: [Spain, Seth M.][University of Nebraska] At: 21:30 19 August 2010
FIGURE 4 Uncentered average time series for logged call time variables (objective
performance). Correlations between .0 and .3. Note. The trends for these variables are
rather dissimilar. Therefore they are displayed uncentered so as to show their unique trends
more clearly.
scores on call duration and handle time. On the other hand, Participants 25, 40,
and 48 scored highly on call duration and handle time but low on call wait time.
Similarly, Participants 20, 23, and 27 had high scores on citizenship whereas Par-
ticipants 3 and 9 had very low scores on citizenship. The objective call time mea-
sures and citizenship are the most important variables for this time component.
Figure 7 displays the biplot for the second time component (variable co-
ordinates are also presented in Table 4). This component showed much more
pronounced fluctuations than Component 1 and an increasing trend toward the
end of the study. This suggests that the importance of the relationships displayed
in Figure 7 became more important over the course of the study. This increase
of importance is especially due to handling time and citizenship but also with-
drawal, focal performance, focal location, call duration, and quality of service.
TABLE 2
Varimax-Rotated Component Matrix for Time Trends
of Variable Averages
Component
Variable Name 2 1
TABLE 3
Fit Information on Parafac and Tucker Models
Overall
Proportional Fit Proportional Fit Components
Completed Data
Valid Completed
Model Data Data 1 2 3
Note. Because the presence of a large amount of missing data, the fit of the completed data
of an analysis with fewer components may have a better completed fit than a model with more
components. PF D Parafac; T3 D Tucker3.
616
MULTIVARIATE DYNAMIC CRITERIA 617
Downloaded By: [Spain, Seth M.][University of Nebraska] At: 21:30 19 August 2010
Participants with positive scores, such as 30, show increasing trends over time.
Those with negative scores, like 3 and 9, show decreases.
To aid interpretation of the individual components, we examined the cor-
relations of these components with individual difference measures (e.g., Van
Mechelen & Kiers, 1999). The first component correlated positively with meta-
mood clarity and negatively with overall job withdrawal (r D :30, p D :04 and
r D :32, p D :03, respectively). The second component correlated with meta-
618 SPAIN, MINER, KROONENBERG, DRASGOW
Downloaded By: [Spain, Seth M.][University of Nebraska] At: 21:30 19 August 2010
FIGURE 6 Joint biplot for participants and variables for the first time component.
TABLE 4
Variable Coordinates for Joint Biplots
Time Time
Component 1 Component 2
Component 1 2 1 2
Note. Variable ordering has been changed to illustrate influence. Loadings in boldface are
considered substantial enough to define the component.
MULTIVARIATE DYNAMIC CRITERIA 619
Downloaded By: [Spain, Seth M.][University of Nebraska] At: 21:30 19 August 2010
FIGURE 7 Joint biplot for participants and variables for the second time component.
mood repair positively (r D :29, p D :05) and marginally positively with overall
citizenship performance (r D :27, p D :07) and marginally negatively with over-
all work withdrawal (r D :25, p D :10). The third component correlated with
overall citizenship performance (r D :38, p D :01) and marginally negatively
with satisfaction with supervisor ( :23, p D :10). The first component has
perhaps the clearest interpretation as these individuals report great awareness of
their moods and high overall job withdrawal: these employees are likely very
in tune with their emotions and unlikely to quit. They are probably having a
relatively positive work experience relative to the average employee. The second
component also has a reasonably clear interpretation. Given that individuals are
able to intentionally repair their moods and report high levels of citizenship
performance and low work withdrawal, these are likely individuals who can
deal with negative emotions effectively and remain engaged with their work.
The third component may again reflect some form of engagement. Employees
scoring high on this component report high citizenship in spite of lower ratings
on supervisory satisfaction.
The variables-mode component loadings are displayed in Table 5. The two
components cleanly separate the objective and behavioral measures, and the
self-reported measures do not load substantially on either component. With the
objective measures, call duration and handle time have the opposite sign as call
wait time. Citizenship performance has the highest loading for the behavioral
measures, with focal performance, location, and withdrawal all loading between
.31 and .34. Interestingly, all of these loadings, even withdrawal, are positive.
It is also interesting that the more direct self-reported measures of performance
do not systematically load on these components.
The Tucker core is shown in Table 6. The combination of the first components
in all modes has the largest core element and accounts for the most variance.
The most substantial elements following this are the combination of the second
participant and the behavioral variables component with each of time compo-
nents (g221 and g222 , respectively) accounting for 3% and 2% of the original
variance, respectively.
620 SPAIN, MINER, KROONENBERG, DRASGOW
TABLE 5
Component Loadings for B Mode (Variables)
TABLE 6
Tucker Core and Variance Explained for Each Three-Way
Combination of Components
Time Component 1
Variance
Objective Behavioral Explained
Time Component 2
Variance
Objective Behavioral Explained
DISCUSSION
fully equivalent timepoints across individuals or if the study took place over
organizationally meaningful epochs, such as a new product rollout or layoff
announcement, the Parafac model may have proven more useful in describing
the data.
This theme bears some greater consideration. Often with longitudinal and
experience sampling studies, in particular, researchers wade into the stream of
events with no real care as to when they do so. Put simply, Time 1 often is
not really Time 1 but an arbitrary starting point for the study. Likewise, the
studies often end at an equally arbitrary point in time. Multiway thinking can
help encourage researchers to treat time as an important facet in the design of
studies and sample measurement opportunities appropriately.
There are three major limitations to this study. The first is that the data were not
collected for the purpose of a structural analysis. Therefore, the amount of shared
variance among the indicator variables may not adequately cover the latent job-
performance variables. Although this does not preclude the analyses described
in this article, data collected with such analyses in mind would undoubtedly
provide cleaner results. Second, there was a substantial amount of missing data.
However, only Participant 16 appeared to show undue influence on the model
solutions, and this participant was removed from the analyses reported here.
Third, the sample involved only two groups of employees from one organization,
limiting the external validity of the results. Further study of the job performance
domain and its three sources of variance should be examined across a sample
from the population of jobs. This requires collecting experience sampling and
other longitudinal data in many organizations with many different types of
employees. One possibility is applying these techniques to large samples of
jobs that are appraised on multiple performance indicators at fixed intervals,
such as in the military. Such formal performance evaluations at multiple points
in time would address questions about the normative development of job per-
formance.
However, without examining other studies of performance and its determi-
nants, the underlying dynamics in studies with longer measurement epochs
will never be fully understood (cf. Beal et al., 2005). Data to answer these
sorts of questions should be collected using multiple methods and examined
using multiple analytic procedures. Such a strategy will allow for a scientific
understanding of the dynamic interaction of individual and workplace attributes
in the study of organizational behavior.
CONCLUSION
The major performance theories in applied psychology indicate that the criterion
space is a multidimensional one. However, these theories are static; they are
largely agnostic as to the dynamic nature of performance and its determinants.
It is well established that the relationship between measures of predictors of
MULTIVARIATE DYNAMIC CRITERIA 623
job performance and performance appraisal measures degrades over time. This
situation suggests the possibility of dynamism on both sides of the performance
prediction model. In order to understand the dynamic relationship between
two variables, we must first understand the processes underlying each of the
variables.
The results of this study provide further evidence that organizational scientists
must attend to dynamic processes. Further, these results echo the exhortations of
Ghiselli (1956) and Dalal and Hulin (2008) to more broadly consider the criterion
space when designing personnel interventions. It is important that these between-
Downloaded By: [Spain, Seth M.][University of Nebraska] At: 21:30 19 August 2010
ACKNOWLEDGMENTS
Portions of this research were presented at the 2007 Annual Meeting of the
Academy of Management, Philadelphia, PA. This article is based on the first
author’s master’s thesis. We thank Chuck Hulin, Sungjin Hong, and Theresa
Glomb for their assistance and for comments on earlier drafts of this article.
REFERENCES
Alvarez, K. M., & Hulin, C. L. (1972). Two explanations of temporal changes in ability-skill
relationships: A literature review and theoretical analysis. Human Factors, 14, 295–308.
Austin, J. T., Humphreys, L. G., & Hulin, C. L. (1989). Another view of dynamic criteria: A critical
reanalysis of Barrett, Caldwell, and Alexander. Personnel Psychology, 42, 583–596.
Austin, J. T., & Villanova, P. (1992). The criterion problem: 1917–1992. Journal of Applied Psy-
chology, 77, 70–88.
Barrett, G. V., Caldwell, M. S., & Alexander, R. A. (1985). The concept of dynamic criteria: A
critical reanalysis. Personnel Psychology, 38, 41–56.
Beal, D. J., Weiss, H. M., Barros, E., & MacDermid, S. M. (2005). An episodic process model of
affective influences on performance. Journal of Applied Psychology, 90, 1054–1068.
Borman, W. C. (1991). Job behavior, performance, and effectiveness. In M. Dunnette & L. Hough
(Eds.) Handbook of industrial/organizational psychology (Vol. 2, pp. 271–326). Palo Alto, CA:
Consulting Psychologists Press.
Borman, W. C., & Motowidlo, S. J. (1997). Task performance and contextual performance: The
meaning for personnel selection research. Human Performance, 10, 99–109.
Campbell, J. P. (1991). Modeling the performance prediction problem. In M. Dunnette & L. Hough
(Eds.), Handbook of industrial/organizational psychology (Vol. 2, pp. 687–732). Palo Alto, CA:
Consulting Psychologists Press.
624 SPAIN, MINER, KROONENBERG, DRASGOW
Campbell, J. P. (1994). Alternative models of job performance and their implications for selection
and classification. In M. G. Rumsey & C. B. Walker (Eds.), Personnel selection and classification
(pp. 33–51). Hillsdale, NJ: Erlbaum.
Campbell, J. P., McHenry, J. J., & Wise, L. L. (1990). Modeling job performance in a population
of jobs. Personnel Psychology, 43, 313–333.
Carroll, J. D., & Chang, J.-J. (1970). Analysis of individual differences in multidimensional scaling
via an n-way generalization of “Eckart-Young” decomposition. Psychometrika, 35, 283–319.
Cattell, R. B. (1944). “Parallel proportional profiles” and other principles for determining the choice
of factors by rotation. Psychometrika, 9, 267–283.
Cattell, R. B (1952). The three basic factor analytic research designs: Their interrelations and
derivatives. Psychological Bulletin, 49, 499–520.
Downloaded By: [Spain, Seth M.][University of Nebraska] At: 21:30 19 August 2010
Chan, D. (1998). The conceptualization and analysis of change over time: An integrative approach
incorporating longitudinal mean and covariance structures analysis (LMACs) and multiple indi-
cator latent growth model (MLGM). Organizational Research Methods, 14, 421–483.
Cleveland, W. S., & Devlin, S. J. (1988). Locally weighted regression: An approach to regression
analysis by local fitting. Journal of the American Statistical Association, 83, 596–610.
Csikszentmihalyi, M., & Larson, R. W. (1987). Validity and reliability of the experience sampling
method. Journal of Nervous and Mental Disease, 175, 526–536.
Dalal, R. S., & Hulin, C. L. (2008). Motivation in organizations: Criteria and dynamics. In R. Kanfer,
G. Chen, & R. Pritchard (Eds.), Work motivation: Past, present, and future. (pp. 63–100). New
York: Routledge.
Dalal, R. S., Lam, H., Weiss, H. M., Welch, E. R., & Hulin, C. L. (2009). A within-person ap-
proach to work behavior and performance: Concurrent and lagged citizenship-counterproductivity
associations, and dynamic relationships with affect and overall job performance. Academy of
Management Journal, 52, 1051–1066.
Dunnette, M. D. (1963). A note on the criterion. Journal of Applied Psychology, 47, 251–254.
Ghiselli, E. E. (1956). Dimensional problems of criteria. Journal of Applied Psychology, 40,
1–4.
Goldberg, L. R. (2001). International Personality Item Pool: A scientific collaboratory for the
development of advanced measures of personality and other individual differences. Available
from http://ipip.ori.org/ipip/ipip.html
Harshman, R. A. (1970). Foundations of the Parafac procedure: Models and conditions for an “ex-
planatory” multimodal factor analysis. UCLA Working Papers in Linguistics, 16, 1–84. (University
Microfilms, Ann Arbor, MI, No. 10,085)
Harshman, R. A., & Lundy, M. E. (1984a). The PARAFAC model for three-way factor analysis and
multidimensional scaling. In H. G. Law, C. W. Snyder, J. A. Hattie, & R. P. McDonald (Eds.),
Research methods for multimode analysis (pp. 122–215). New York: Praeger.
Harshman, R. A., & Lundy, M. E. (1984b). Data preprocessing and the extended PARAFAC model.
In H. G. Law, C. W. Snyder, J. A. Hattie, & R. P. McDonald (Eds.), Research methods for
multimode analysis (pp. 216–284). New York: Praeger.
Hulin, C. L. (1982). Some reflections on general performance dimensions and halo rating error.
Journal of Applied Psychology, 67, 165–170.
Hulin, C. L., Henry, R. A., & Noon, S. L. (1990). Adding a dimension: Time as a factor in the
generalizability of predictive relationships. Psychological Bulletin, 107, 328–340.
Humphreys, L. G. (1976). The phenomena are ubiquitous-but the investigator must look. Journal of
Educational Psychology, 68, 521.
Inn, A., Hulin, C. L., & Tucker, L. R. (1972). Three sources of criterion variance: Static dimen-
sionality, dynamic dimensionality, and individual dimensionality. Organizational Behavior and
Human Performance, 8, 58–83.
Keil, C. T., & Cortina, J. M. (2001). Degradation of validity over time: A test and extension of
Ackerman’s model. Psychological Bulletin, 127, 673–697.
MULTIVARIATE DYNAMIC CRITERIA 625
Kiers, H. A. L. (1991). Hierarchical relations among three-way methods. Psychometrika, 56, 449–
470.
Kiers, H. A. L. (2000). Toward a standardized notation and terminology in multiway analysis.
Journal of Chemometrics, 14, 105–122.
Kroonenberg, P. M. (1994). The TUCKALS line: A suite of programs for the analysis of three-way
data. Computational Statistics and Data Analysis, 18, 73–96.
Kroonenberg, P. M. (2008). Applied multiway data analysis. Hoboken, NJ: Wiley.
Kroonenberg, P. M., & de Leeuw, J. (1980). Principal component analysis of three-mode data by
means of alternating least squares algorithms. Psychometrika, 45, 69–97.
Kroonenberg, P. M., & Oort, F. J. (2003). Three-mode analysis of multimode covariance matrices.
British Journal of Mathematical and Statistical Psychology, 56, 305–335.
Downloaded By: [Spain, Seth M.][University of Nebraska] At: 21:30 19 August 2010
Larson, R., & Csikszentmihalyi, M. (1983). The experience sampling method. In H. T. Reis (Ed.),
Naturalistic approaches to studying social interaction (pp. 41–56). San Francisco: Jossey-Bass.
Miner, A. G., Glomb, T., & Hulin, C. L. (2005). Experience sampling mood and its correlates at
work. Journal of Occupational and Organizational Psychology, 78, 171–193.
Miner, A. G., Glomb, T., & Hulin, C. L. (2010). State mood, task performance, and behavior at
work: A within–persons approach. Organizational Behavior and Human Decision Processes, 112,
43–57.
Motowidlo, S. J., & Van Scotter, J. R. (1994). Evidence that task performance should be distinguished
from contextual performance. Journal of Applied Psychology, 79, 475–480.
Murphy, K. R. (1989). Is the relationship between cognitive ability and job performance stable over
time? Human Performance, 2, 183–200.
Murphy, K. R., & Shiarella, A. H. (1997). Implications of the multidimensional nature of job
performance for the validity of selection tests: Multivariate frameworks for studying test validity.
Personnel Psychology, 50, 823–854.
Salovey, P., Mayer, J. D., Goldman, S. L., Turvey, C., & Palfai, T. P. (1995). Emotional attention,
clarity, and repair: Exploring emotional intelligence using the Trait Meta-Mood Scale. In J. W.
Pennebaker (Ed.), Emotion, disclosure, and health (pp. 125–154). Washington, DC: American
Psychological Association.
Smilde, A., Bro, R., & Geladi, P. (2004). Multiway analysis: Applications in the chemical sciences.
Hoboken, NJ: Wiley.
Smith, P. D. (1976). Behaviors, results, and organizational effectiveness: The problem of criteria. In
M. Dunnette (Ed.), Handbook of industrial and organization psychology (pp. 745–775). Chicago:
Rand McNally.
Smith, P. C., Kendall, L., & Hulin, C. L. (1969). The measurement of satisfaction in work and
retirement. Chicago: Rand McNally.
Thoresen, C. J., Bradley, J. C., Bliese, P. D., & Thoresen, J. D. (2004). The big five personality
traits and individual job performance growth trajectories in maintenance and transitional job
stages. Journal of Applied Psychology, 89, 835–853.
Timmerman, M. E., & Kiers, H. A. L. (2000). Three-mode principal components analysis: Choosing
the number of components and sensitivity to local optima. British Journal of Mathematical and
Statistical Psychology, 53, 1–16.
Tucker, L. R. (1964). The extension of factor analysis to three-dimensional matrices. In H. Gullikson
& N. Frederikson (Eds.), Contributions to mathematical psychology (pp. 110–127). New York:
Holt, Rinehart, & Winston.
Tucker, L. R. (1966). Some mathematical notes on three-mode factor analysis. Psychometrika, 31,
279–311.
Weiss, H. M., & Cropanzano, R. (1996). Affective events theory: A theoretical discussion of the
structure, causes and consequences of affective experiences at work. In B. M. Staw & L. L.
Cummings (Eds.), Research in organizational behavior (Vol. 18, pp. 1–74). Greenwich, CT: JAI
Press.
626 SPAIN, MINER, KROONENBERG, DRASGOW
Weiss, H. M., Nicholas, J. P., & Daus, C. S. (1999). An examination of the joint effects of affective
experiences and job satisfaction and variations in affective experiences over time. Organizational
Behavior and Human Decision Processes, 78, 1–24.
Van Mechelen, I., & Kiers, H. A. L. (1999). Individual differences in anxiety response to stressful
situations: A three-way component analysis model. European Journal of Personality, 13, 409–
428.
Zyphur, M. J., Bradley, J. C., Landis, R. S., & Thoresen, C. J. (2008). The effects of cognitive
ability and conscientiousness on performance over time: A censored latent growth model. Human
Performance, 21, 1–27.
Downloaded By: [Spain, Seth M.][University of Nebraska] At: 21:30 19 August 2010