Escolar Documentos
Profissional Documentos
Cultura Documentos
Cross-Tabulation
• While a frequency distribution describes one variable at a
time, a cross-tabulation describes two or more variables
simultaneously.
Gender
Row
Internet Usage Male Female Total
Light (1) 5 10 15
Heavy (2) 10 5 15
Column Total 15 15
Two Variables Cross-Tabulation
• Since two variables have been cross-classified, percentages
could be computed either columnwise, based on column
totals (Table 15.4), or rowwise, based on row totals (Table
15.5).
Internet Usage
No 68% 79%
No 50% 50%
No 35% 35%
Income
Eat Frequently in Fast- Low High
Food Restaurants
Family size Family size
Small Large Small Large
Yes 65% 65% 65% 65%
No 35% 35% 35% 35%
Column totals 100% 100% 100% 100%
Number of respondents 250 250 250 250
Statistics Associated with
Cross-Tabulation Chi-Square
• To determine whether a systematic association exists, the
probability of obtaining a value of chi-square as large or larger
than the one calculated from the cross-tabulation is estimated.
• An important characteristic of the chi-square statistic is the
number of degrees of freedom (df) associated with it. That is, df =
(r - 1) x (c -1).
• The null hypothesis (H0) of no association between the two
variables will be rejected only when the calculated value of the
test statistic is greater than the critical value of the chi-square
distribution with the appropriate degrees of freedom, as shown in
Figure 15.8.
Chi-square Distribution
Fig. 15.8
Do Not Reject
H0
Reject H0
2
Critical
Value
Statistics Associated with
Cross-Tabulation Chi-Square
2 to test the
• The chi-square statistic ( ) is used
statistical significance of the observed association in
a cross-tabulation.
• The expected frequency for each cell can be
calculated by using a simple formula:
nrnc
fe = n
15 X 15 = 7.50 15 X 15 = 7.50
30 30
15 X 15 15 X 15
= 7.50 = 7.50
30 30
2
Then the value of is calculated as follows:
2 = S (fo - fe)2
fe
all
cells
Statistics Associated with
Cross-Tabulation Chi-Square
= 3.333
Statistics Associated with
Cross-Tabulation Chi-Square
• The chi-square distribution is a skewed distribution whose shape
depends solely on the number of degrees of freedom. As the
number of degrees of freedom increases, the chi-square
distribution becomes more symmetrical.
• Table 3 in the Statistical Appendix contains upper-tail areas of the
chi-square distribution for different degrees of freedom. For 1
degree of freedom the probability of exceeding a chi-square value
of 3.841 is 0.05.
• For the cross-tabulation given in Table 15.3, there are (2-1) x (2-1)
= 1 degree of freedom. The calculated chi-square statistic had a
value of 3.333. Since this is less than the critical value of 3.841,
the null hypothesis of no association can not be rejected
indicating that the association is not statistically significant at the
0.05 level.
Example
Is the type of beverage ordered with lunch at a restaurant
independent of the age of the customer? A random sample of
309 lunch customers was taken and the results are as follows.
Using alpha = 0.01, test to determine if the two factors are
related.
AGE COFFEE/TEA SOFT DRINK OTHER
21-34 26 95 18 139
35-55 41 40 20 101
OVER 55 24 13 32 69
91 148 70 309