Escolar Documentos
Profissional Documentos
Cultura Documentos
If both of the following are true, then residual plots help determine
the function g.
Example:
True model: three predictors.
We fit a model with two.
Residual plots are potentially non-random.
We cant use residual plots to tell us what part of the model has
been misspecified.
+
Example 1: Function g DNE
p = # of predictors
0 5 10 15 20 25 30 35
Years of Experience
+
Marginal Model Plots:
Assessing Mean
Data Model
70
Salary
50 4060
0 5 10 15 20 25 30 35
Years of Experience
+
Marginal Model Plots:
Multiple Predictors
If we have two predictor variables, we wish to compare
models (1) and (2) below.
16 18 20 22 24 26 28
Food
Food
18 20 22 24 26 12 14 16 18 20 22 24 26
Service Decor
+
Marginal Model Plots:
Multiple Predictors
Since the two fits in the following plot differ markedly,
we conclude that the model below is not a valid model
for the data.
Data Model
16 18 20 22 24 26 28
Food
20 30 40 50 60
Cost
+
Transformations (Box Cox:
Approach 1)
yjPowerTransformationstoMultinormality
Est.PowerStd.Err.WaldLowerBoundWaldUpperBound
MET1.02680.17150.69081.3629
ECAB1.23730.68150.09852.5731
YOUNG0.46531.35592.19233.1229
OLD1.90890.80890.32353.4943
Likelihoodratiotestsabouttransformationparameters
LRTdfpval
LRtest,lambda=(0000)63.74116944.738432e13
LRtest,lambda=(1111)1.45282648.349635e01
+
Transformations: State
Spending
Step 2: Transform Y, given predictors
lm.1<lm(EX~MET+ECAB+YOUNG+OLD+WEST)
tranmod<powerTransform(lm.1,family="yjPower")
summary(tranmod)
Est.PowerStd.Err.WaldLowerBoundWaldUpperBound
Y10.16680.58290.97571.3094
Likelihoodratiotestsabouttransformationparameters
LRTdfpval
LRtest,lambda=(0)0.0813801510.7754357
LRtest,lambda=(1)2.1012440610.1471793
+
Transformations: Using Logs for %
Effects
Coefficients:
EstimateStd.ErrortvaluePr(>|t|)
(Intercept)0.447300.351381.2730.206
log(Weekday)1.061330.0284837.270<2e16***
Tabloid0.531370.068007.8141.26e11***
Goal:
Find out whether X2 adds anything to the
model after X1 has already been added.
Idea:
Were interested in the following Final
Model:
39
+
Bridges
40
+
Summary
41
+
Summary
42
+
Omitted Variables
Two cases:
1 = 0 and/or 2 = 0: The omitted variable has no effect on the
regression model that has only x.
1 0 and 2 0: The omitted variable does have an effect on
the model that has only x.
Ex: Y and x could appear highly correlated even when 1 = 0.
Ex: Y and x could appear strongly negatively associated even
when 1 > 0.
+
Omitted Variables