Escolar Documentos
Profissional Documentos
Cultura Documentos
Confidence intervals
Introduction
The difference between the present and the new commercial may be a
result of chance.
The sample mean is a random variable (if you pick a different sample
you will probably get a different sample mean).
Statistical confidence
Example 6.5:
In a sample of 453 large firms the mean monthly premium paid
for the health insurance is
405 dollars. We assume that is
known to be 112.
Statistical confidence
is approximately , 5.27
The 69-95-99.7 rule says that the probability is
about 0.95 that is within 10.54 of .
The 68-95-99.7 rule
3 2
+ + 2 + 3
Statistical confidence
To say that lies within 10.54 of is the same as saying that is
within 10.54 of ! (because distances are symmetrical)
Statistical confidence
We are 95% confident that is between 394 and 416.
Be sure you understand that in reality could be outside this interval!
(Although this only happens in 5% of all samples)
We cannot know whether our sample is one of the 95% or one of the
unlucky 5%. The only thing we can say is: we arrived at the conclusion
that is between 394 and 416 by a process that gives correct results
95% of the time.
10
Confidence intervals
A confidence interval always has the form
estimate margin of error
where the margin of error m shows the precision of the estimate.
11
Confidence intervals
A confidence interval is an
interval that contains the true
parameter in C percent of the
samples.
Figure 6.5: Twenty-five samples from the
same population gave these 95%
confidence intervals.
12
13
501.94 1.96
.
/0
500.7 , 503.2
Since 500 is not included in the C.I., we are 95% confident that differs
from 500. So we can reject the hypothesis that the mean fill of all bottles
is 500ml.
Inferential analysis can be done either with confidence intervals or
significance tests (next lesson).
14
15
,
*
*
,
Note: in practice is usually unknown, but if n is large then the sample standard
deviation s will be close to the unknown population standard deviation . So
substituting s for in the formula will result in an approximately correct minimum
sample size n.
Exercise 1
In a sample of 50 Belgian employees, the mean
gross monthly wage is 2400 euro. The standard
deviation in the population is known to be
1000 euro.
*
,
End solutions:
a. (2219,2581)
b. (2123,2677)
c. (2312,2488)
d. 1536 cases
16
.
Exercise 2
*
17
0.49 (49% of the respondents in the sample say they will vote
Obama)
Assume that standard deviation is
0.50.
Estimate the margin of error in this poll for a 95% confidence level.
End solution:
the margin of error in this poll is approximately 2 percentage points.