Escolar Documentos
Profissional Documentos
Cultura Documentos
Seonghoon Kim
(courtesy of Bryant Kim at Cornell)
School of Economics
Singapore Management University
August, 2015
1 / 35
Introduction
I
What is econometrics?
I
I
I
2 / 35
Introduction
Example - A study published in Nature (Rauscher, Shaw and Ky,
1993) suggested that listening to Mozart for 10-15 minutes could
temporarily raise your IQ by 8 or 9 points. In fact, shortly after the
study was published and reported in the popular press, the U.S.
state of Georgia began handing out classical-music CDs to the
parents of all infants, and there were similar but less official
programs in Colorado, Florida and elsewhere.
3 / 35
4 / 35
Introduction
There are two types of causal relationships commonly assessed in
the policy world:
I
Other?
5 / 35
Introduction
I
Other?
6 / 35
Causality
Causality: A specific action leads to a specific, measurable
consequence
I We are often interested in assessing the magnitude of the
causal effect that a certain factor (X) has on an outcome
(Y)
I
I
Examples:
I
I
Causality
I
Causality
I
9 / 35
Causality
I
10 / 35
Validity
Validity represents a set of criteria by which the credibility of
research may be judged.
I
11 / 35
Validity
Internal Validity
A study has strong internal validity if it estimates the causal
effect of interest for the population represented by our sample.
I
I
I
12 / 35
Validity
External Validity
A study has strong external validity if its findings can be
generalized to other settings (i.e. people, time periods, locations,
age groups, etc.)
I
Random Assignment
Random Sampling
13 / 35
Randomized Experiments
I
14 / 35
Randomized Experiments
I
15 / 35
Randomized Experiments
I
Randomized Experiment
I
16 / 35
1980s
Four-year study
Examined the effect of class size in grades K-3
Experimental Data
I
18 / 35
Experimental Data
Mean Background Characteristics
Free Lunch(%)
Male(%)
Black(%)
+
Treatment
47.2
51.5
31.1
Control
48.5
51.3
32.5
Difference
-1.3
0.2
-1.4
P-Value+
0.325
0.883
0.302
: Corresponds to a Z test where the null hypothesis is that the proportions in the treatment and control
19 / 35
Experimental Data
I
20 / 35
Experimental Data
Estimation
. tab small, sum(tscorek)
small class |
Summary of tscorek
in K |
Mean
Std. Dev.
Freq.
------------+-----------------------------------0 |
918.20133
72.214225
4048
1 |
931.94189
76.358633
1738
------------+-----------------------------------Total |
922.32872
73.746597
5786
Question: What is the difference in average test scores between
small and regular classes?
21 / 35
Experimental Data
Hypothesis Testing
H0 : T = C
HA : T 6= C
YT YC
t=
SE (YT YC )
Where SE (YT YC )is the standard error of YT YC
s
(YT )2 (YC )2
SE (YT YC ) =
+
nT
nC
t=
YT YC
931.94 918.20
13.74
q
=
=
= 6.38
2
2
2.15
SE (YT YC )
(76.36)
(72.21)
1738 + 4048
22 / 35
Experimental Data
QUESTION: Is this difference in mean test scores statistically
significant at the 5% level?
23 / 35
Experimental Data
Conclusions from analysis of STAR Experiment (Kindergarten Result)
24 / 35
Observational Data
Observational Data for the Class Size and Student Outcome
Question: California Test Score Data Set
I
25 / 35
Observational Data
Look at the data in a scatterplot:
26 / 35
Observational Data
How do we address the question of whether students do
better in small classes, using these data?
I
Divide the school districts into those with small classes (STR
less than 20) and those with larger classes (STR more than
20).
27 / 35
Observational Data
Estimation:
. tab small, sum(testscr)
|
Summary of testscr
small |
Mean
Std. Dev.
Freq.
------------+-----------------------------------0 |
649.97885
17.853364
182
1 |
657.35126
19.358012
238
------------+-----------------------------------Total |
654.15655
19.053348
420
QUESTION: What is the difference in average test scores
between small and regular classes?
28 / 35
Observational Data
I
Ysmall Ylarge
7.4
=
= 4.05
1.8
SE (Ysmall Ylarge )
Observational Data
Compare characteristics of treatment and control groups:
Free Lunch(%)
English as a second language(%)
Average Income(thousand $)
+
Small
41.6
12.5
16.3
Regular
48.7
20.0
14.0
Difference
-7.1
-7.5
2.3
P-Value+
0.001
0.001
0.001
: Corresponds to a Z test (t test) where the null hypothesis is that the proportions (means) in the
30 / 35
Observational Data
Conclusions from analysis of non-experimental data:
I
31 / 35
Observational Data
I
32 / 35
33 / 35
34 / 35
Differences-in-differences
Fixed effects
Instrumental variables
Regression discontinuity
35 / 35