Escolar Documentos
Profissional Documentos
Cultura Documentos
sav
Problem:
Hypothesis:
H1: Effect on Household income of Age, Level of education, Years with current
employer and Years at current address.
Variables:
Dependent Variables:
Household income
Independent Variables:
Age in years
Level of education
For Analysis of this problem we are using liner regression because in this
data dependent variable is one and its also a scale variable and data simple
size is > 30,
After analyzing the scatter plot, its shows the pattern, its mean
heteroscedasticity is present in the model and we dont have further
independent variables to include in model to remove heteroscedasticity.
Its misspecified model, selected independent variables does not having the
capability of explaining the target variables.
Now we are reporting our model by mentioning that this model will partially
explain our target variables while major portion of target variables is still
unexplained.
Multicollinearity
Coefficientsa
Years at current
.015 .169 .003 .086 .932 .637 1.569
address
After removing all the illness now we can report model of regression
Liner Regression Model Reporting:
1. Hypothesis Testing
ANOVAa
Since Sig. value in anova table is <0.05, which refers to reject HO and it
conclude that Household income is affected by Age, Level of education, Years
with current employer and Years at current address is true.
2. Model Summary
Model Summaryb
R (Strength of relationship)
Adjusted R-Square
It tells you how wrong the regression model is on average using the units of
the response variable. Smaller values are better because it indicates that the
observations are closer to the fitted line.
3. Model Construction
Coefficientsa
Years at current
.015 .169 .003 .086 .932 .637 1.569
address
Expected Model
Final Model
Years with current employer is formed most effective 59% on our target
variable Household income followed by Level of education (31%), Age in
years (14%) and Years at current address (0.3%)