Escolar Documentos
Profissional Documentos
Cultura Documentos
Next chapter
1. Feel for the Data
Production
Sales
Finance
Servicing
Maintenance
Personnel
Public Relations
Accounting
Frequencies for other
Gender
variables
Men Women
79.9% 20.1%
Example:
2, 3, 4, 6, 7, 8,Mode?
8, 8, 9, 9, 10, 10, 10,
10
Median: The score associated with the 50th
percentile
Scale of measurement:
Ordinal, interval or ratio
DO NOT use with nominal data
Methods of determination:
N = even
List scores from low to high
Median is the Mean of
N and N+1
2 2
N = odd
List scores from low to high
Median = N+1
2
Mean: Arithmetic average
Most common measure of central
tendency
Scale of measurement:
Interval or ratio
An instructor recorded the average number
of absences for his students in one semester.
For a random sample the data are:
2 4 2 0 40 2 4 3 6
Mean:
0 2 2 2 3 4 4 6 40
20
Measures of Dispersion
The measure of dispersion is also unique to
nominal and interval data.
Two sets of data might have the same
mean, but the dispersions could be
different.
For example, if Company A sold 30, 40, and
50 units of a product during the months of
April, May, and June, respectively, and
Company B sold 10, 40, and 70 units during
the same period, the average units sold per
month by both companies is the same40
unitsbut the variability or the dispersion in
the latter company is larger.
Measures of Dispersion
The three measurements of dispersion
connected with the mean are the
range, the variance, and the standard
deviation.
Range. Range refers to the extreme values
in a set of observations.
The range is between 30 and 50 for
Company A (a dispersion of 20 units), while
the range is between 10 and 70 units (a
dispersion of 60 units) for Company B.
Another more useful measure of dispersion
is the variance.
The Range
The range is defined as the
difference between the largest score
in the set of data and the smallest
score in the set of data, XL - XS
What is the range of the following
data:
4 8 1 6 6 2 9 3 6 9
The largest score (XL) is 9; the
smallest score (XS) is 1; the range is
X -X =9-1=8 23
When To Use the Range
The range is used when
you have ordinal data or
you are presenting your results to people
with little or no knowledge of statistics
The range is rarely used in scientific
work as it is fairly insensitive
It depends on only two scores in the set of
data, XL and XS
Two very different sets of data can have
the same range:
1 1 1 1 9 vs 1 3 5 7 9
24
Measures of Dispersion
Variance. The variance is calculated by
subtracting the mean from each of the
observations in the data set, taking the
square of this difference, and dividing the
total of these by the number of
observations.
In the above example, the variance for
each of the two companies is:
(30 40)2 + (40 40)2 + (50 40)2 /3
Variance for Company A = 66.7
(10 40)2 + (40 40)2 + (70 40)2 /3
Variance for Company B = 600
As we can see, the variance is much
larger in Company B than Company
A. It makes it more difficult for the
manager of Company B to estimate
how much goods to stock than it is
for the manager of Company A.
Thus, variance gives an indication of
how dispersed the data in a data set
are.
What Does the Variance Formula
Mean?
Variance is the mean of the squared
deviation scores
The larger the variance is, the more
the scores deviate, on average, away
from the mean
The smaller the variance is, the less
the scores deviate, on average, from
the mean
27
Measures of Dispersion
Standard Deviation. The standard
deviation, which is another measure of
dispersion for interval and ratio scaled
data, offers an index of the spread of a
distribution or the variability in the data.
It is a very commonly used measure of
dispersion, and is simply the square
root of the variance. In the case of the
above
two companies, the standard deviation for
Companies A and B would be 66.7 and
600 or 8.167 and 24.495, respectively.
Standard Deviation
Definition
Measures of dispersion are
descriptive statistics that describe
how similar a set of scores are to
each other
The more similar the scores are to each
other, the lower the measure of
dispersion will be
The less similar the scores are to each
other, the higher the measure of
dispersion will be
In general, the more spread out a 30
Measures of Dispersion
Which of the
distributions of 125
scores has the 100
larger dispersion? 75
50
25
The upper 0
distribution has 1 2 3 4 5 6 7 8 9 10
125
more dispersion 100
because the 75
50
scores are more 25
0
spread out 1 2 3 4 5 6 7 8 9 10
That is, they are less 31
similar to each other
Standard Deviation-
Variance
Standard deviation = variance
32
The relations between Means
and Deviations
1. Coefficient of Variation: is defined as
the ratio of thestandard deviationto the
mean.
The coefficient of variation should be
computed only for data measured on a
ratio scale, as these are measurements
that can only take non-negative values.
The coefficient of variation may not have
any meaning for data on aninterval scale
Skewness
The skewness for anormal distributionis zero,
and any symmetric data should have a
skewness near zero. Negative values for the
skewness indicate data that are skewed left
and positive values for the skewness indicate
data that are skewed right. By skewed left, we
mean that the left tail is long relative to the
right tail. Similarly, skewed right means that
the right tail is long relative to the left tail. If
the data are multi-modal, then this may affect
the sign of the skewness
Skewness
S.K
=
S.K
=
Example
No Yes All
48
Correlations: Measuring and
Describing Relationships
51
Correlations: Measuring and
Describing Relationships
52
Discussion the result of
the analysis SPSS
T test )/( ):/(
Anove )/( 3
correlation
Regression
)( ) (
.
:
multiple Linear
Regression "
" "
" " " " "
" :
Correlation and Regression
...
r 1 . 1
1
1-
.
.
.
:
) : (Pearson
) : (Spearman
)
( ) 8
5 8 5(
.
) : (Kandell,s tau
.
) : (Phi
) (
) (.
) : (Cramers
) ( )
)
Correlations
Pearson Correlation 1 .959**
Sig. (2-tailed) . .000
N 10 10
Pearson Correlation .959** 1
Sig. (2-tailed) .000 .
N 10 10
**. Correlation is significant at the 0.01 level
(2-tailed).
tailed Significance = 0.000.2
r 0.959
=0.5
Correlations
Pearson Correlation 1 .959** .780** .833**
Sig. (2-tailed) . .000 .008 .003
N 10 10 10 10
Pearson Correlation .959** 1 .746* .811**
Sig. (2-tailed) .000 . .013 .004
N 10 10 10 10
Pearson Correlation .780** .746* 1 .890**
Sig. (2-tailed) .008 .013 . .001
N 10 10 10 10
Pearson Correlation .833** .811** .890** 1
Sig. (2-tailed) .003 .004 .001 .
N 10 10 10 10
**. Correlation is significant at the 0.01 level (2-tailed).
*. Correlation is significant at the 0.05 level (2-tailed).
0.01
0.05
0.949
0.949
Correlations
Pearson Correlation 1.000 .949
.949 1.000
)Sig. (1-tailed . .000
.000 .
N 10 10
10 10
0.88
N Correlation Sig.
Pair Current Salary &
474 .880 .000
1 Beginning Salary
T-Test
: )(
:
: "
ANOVA ) Analysis
( 0f Variance ) ( Fisher
) ( Ttest
:
T
test
:
) ( ) (
.
) (
Independent Samples Test
Multivariate Analysis of Variance
) ( One Way Manova
) ( Two Way Manova
Two Way
Analysis of Variance
.
.
Two Way ANOVA
) (
) (.
:
- ) (main effect
.
- ) (main effect
.
- ) (Interaction
Two Way Analysis of Variance
"
0.05
" "
"
"
"
"
.
Between-Subjects Factors
Value Label N
Gender f Female 216
m Male 258
Employment 1 Clerical 363
Category 2 Custodial 27
3
Manager 84
N
Descriptive Statistics
2.Testing Goodness
of Data
3. Hypothesis Testing
Checking the Reliability of Measures:
Cronbachs Alpha
The inter item consistency reliability or
the Cronbachs alpha reliability
coefficients of the five independent and
dependent variables were obtained.
They were all above .80. A sample of the
result obtained for Cronbachs alpha test
for the dependent variable, Intention to
Leave, together with instructions on how
it is obtained is, shown in:
Reliability Coefficients 6 items
Alpha = .8172
The result indicates that the Cronbachs alpha
for the six item Intention to Leave measure is .
82. The closer the reliability coefficient gets to
1.0, the better. In general, reliabilities less
than .60 are considered to be poor, those in
the .70 range, acceptable, and those over .80
good. Cronbachs alpha for the other four
independent variables ranged from .81 to .85.
Thus, the internal consistency reliability of the
measures used in this study can be
considered to be good.