Escolar Documentos
Profissional Documentos
Cultura Documentos
We can enter the data into SPSSPc by typing it directly into the data editor, or by cutting
and pasting:
Next, by clicking on ‘Variable View’, we can apply variable and value labels where
appropriate:
Assuming, for now, that if a relationship exists between the two variables, it is linear in
nature, we can generate a simple Scatterplot (or Scatter Diagram) for the data. This is
accomplished with the command sequence:
Which yields the following (editable) scatterplot:
12000
10000
8000
Sales Revenue of Store
6000
4000
2000
0
0 1000 2000 3000 4000 5000 6000 7000
We can generate a simple straight line equation from the output resulting when using the
Enter Command in regression:
Which yields:
Variables Variables
Model Entered Removed Method
1 Square
Footage
a
of . Enter
Store
a. All requested variables entered.
b. Dependent Variable: Sales Revenue of Store
M ode l Summary
ANOVAb
Sum of
Model Squares df Mean Square F Sig.
1 Regression 1.06E+08 1 106208119.7 121.009 .000 a
Residual 10532255 12 877687.937
Total 1.17E+08 13
a. Predictors: (Constant), Square Footage of Store
b. Dependent Variable: Sales Revenue of Store
SS
R
SS SS
T E
b0
Coefficientsa
Standardi
zed
Unstandardized Coefficien
Coefficients ts 95% Confidence Interval for B
Model B Std. Error Beta t Sig. Lower Bound Upper Bound
1 (Constant) 901.247 513.023 1.757 .104 -216.534 2019.027
Square Footage of Store 1.686 .153 .954 11.000 .000 1.352 2.020
a. Dependent Variable: Sales Revenue of Store
b1
^
So then Yi = 901.247 + 1.686X (noting that no direct interpretation of the Y
intercept at 0 Square Footage is possible, so
that the intercept represents the portion of
the annual sales varying due to factors other
than store size)
and where
SST = SSR (regression sum of squares) + SSE (error sum of squares)
4 .75
3
.50
.25
Frequency
Of course, the assessment of normality by visually scanning the data leaves some
statisticians unsettled; so I usually add an appropriate test of normality conducted on the
data:
Click Here
Results in data automatically added to the data file:
Then, simply produce the requisite scatterplot as before:
2000
1000
0
Unstandardized Residual
-1000
-2000
1000 2000 3000 4000 5000 6000
1.0
Studentized Residual
.5
0.0
-.5
-1.0
-1.5
-2.0
-2.5
1000 2000 3000 4000 5000 6000
Note the equivalence of results between the two plots. Statistically speaking, the X values
and Residuals may be inferred to be 0.00. We can infer this using the correlation utility in
SPSSPc, which tests the null hypothesis that the Pearson rho for the population is equal to
0.00:
Corre lations
Square Unstanda
Footage rdized Studentize
of Store Residual d Residual
Square Footage of Store Pearson Correlation 1.000 .000 .015
Sig. (2-tailed) . 1.000 .959
N 14 14 14
Unstandardized Residual Pearson Correlation .000 1.000 .999**
Sig. (2-tailed) 1.000 . .000
N 14 14 14
Studentized Residual Pearson Correlation .015 .999** 1.000
Sig. (2-tailed) .959 .000 .
N 14 14 14
**. Correlation is significant at the 0.01 level (2-tailed).
It should be noted that the distribution of the data also suggest that an assumption of
linearity is also reasonable at this point.
M ode l Summaryb
Change Statistics
Adjusted Std. Error of R Square Durbin-W
Model R R Square R Square the Estimate Change F Change df1 df2 Sig. F Change atson
1 .954a .910 .902 936.8500 .910 121.009 1 12 .000 2.446
a. Predictors: (Constant), Square Footage of Store
b. Dependent Variable: Sales Revenue of Store
Standardi
zed
Unstandardized Coefficien
Coefficients ts 95% Confidence Interval for B
Model B Std. Error Beta t Sig. Lower Bound Upper Bound
1 (Constant) 901.247 513.023 1.757 .104 -216.534 2019.027
Square Footage of Store 1.686 .153 .954 11.000 .000 1.352 2.020
a. Dependent Variable: Sales Revenue of Store
ANOVAb
Sum of
Model Squares df Mean Square F Sig.
1 Regression 1.06E+08 1 106208119.7 121.009 .000 a
Residual 10532255 12 877687.937
Total 1.17E+08 13
a. Predictors: (Constant), Square Footage of Store
b. Dependent Variable: Sales Revenue of Store
noting that t2, as expected, equals F; and the p-values are therefore equal. Note that SPSS
also provides the confidence interval associated with the slope.
Finally, SPSS allows you to calculate and store both Confidence and Prediction Limits
for the observed data. After you generate the scatterplot, left double-click on the chart;
this will take you to the chart editor:
Next:
Then:
Click on ‘Fit Options’
Regression Analysis for Site Selection
Scatterplot of Data Including Confidence & Prediction Limits
12000
10000
8000
Sales Revenue of Store
6000
4000