Escolar Documentos
Profissional Documentos
Cultura Documentos
8.1 CONFIDENCEINTERVALESTIMATION
FORTHE MEAN (o KNOWN)
In Section 7 .4, you used the Central Limit Theorem and knowledge of the population distribu-
tion to determine the percentageof sample means that fall within certain distancesof the pop-
ulation mean. For instance,in the cereal-fill example used throughout Chapter 7 (seeExample
7 '6 on page 268), 95o/oof all samplemeansare between362.12and 373.88grams.This
state-
ment is basedon deductivereasoning.However, incluctivereasoning is what you need here.
You need inductivereasoningbecause,in statisticalinference,you use the resultsofa
sin-
gle sample to draw conclusionsabout the population, not vice versa.Supposethat in
the cereal-
fill example, you wish to estimate the unknown population mean, using the information
from
o n l y a s a m p l e .T h u s , r a t h e rt h a n t a k e p + ( 1 . 9 6 l ( o l 1 r nf ) t o f i n d t h e u p p e ra n d l o w e r l i m i t s
around p, as in Section 7 .4, you substitute the sample mean, X- for the unknown and
, I use
X t ( 1 . 9 6 ) ( o l ( . ' l n ) a s a n i n t e r v a l t o e s t i m a t et h e u n k n o w n p . A l t h o u g h i n p r a c t i c e y o u
select a single sample of size n and compute the mean, X, in order to understandthe full
meaning of the interval estimate,you need to examine a hypotheticalset of all possible
sam-
plesofr values.
Supposethat a sample of n : 25 boxes has a mean of 362.3 grams. The interval developed
t o e s t i m a t ep i s 3 6 2 . 3I ( l . 9 6 X l 5 ) l ( ^ 1 2 5 ) ,o r 3 6 2 . 3+ 5 . g 9 .T h e e s t i r n a t e
of p is
356.42(u(368.18
FIGURE
8.1
Confidenceinterval
estimatesfor five
differentsamples of
n = 25 taken from a
p o p u l a t i o nw h e r e
p=368ando:15
362.12 368 373.88
I
Xt = 362.s
356.42 362.3 368.18
I =sos.s
JOJ.bZ 375.38
x. =:oo
360
Xa= 362.12
356.24 362.12 398
Xu= eZa.eA
+ (1.s6)(t
36s.s sybl2s)
or 369.5+ 5.88.Theestimate
is
363.62SuS375.38
354.12(p(365.88
356.24(uS368.00
368.00ap!319.76
362.3
t( 1.e6Xls) /( J25)
362.3+ 5.88
356.42S u( 368.18
where Z: the value correspondingto a cumulative area of 1 - al2 from the standardized
normal distribution (that is, an upper-tail probability of ul2).
The value ofZ neededfor constructinga confidenceinterval is calledthe critical value for
the distribution.95o/o confidencecorrespondsto an o, value of 0.05.The critical Zvalue corre-
spondingto a cumulativeareaof 0.9750 is L96 becausethere is 0.025 in the uppertail of the
distributionand the cumulativearealessthanZ: 1.96is 0.975.
There is a differentcritical value for eachlevel ofconfidence. I - cr.A level ofconfidence
of 95ohleadsto a Z value of | .96 (seeFigure 8.2). 99% confidencecorrespondsto an cr value
of 0.01.The Z value is approximately2.58 becausethe upper-tailareais 0.005 and the cumula-
t i v e a r e al e s st h a nZ : 2 . 5 8 i s 0 . 9 9 5( s e eF i c u r e8 . 3 ) .
FIGURE
8.2
N o r m a cl u r v e f o r
d e t e r m i n i n gt h e
Z v a l u en e e d e d
for 95% confidence
'p, X
-1.96 0 +1.96 Z
FIGURE
8.3
N o r m a cl u r v ef o r
d e t e r m i n r ntgh e
Z v a l u en e e d e d
for99% confidence
x
+ 2 . 5 8Z
288 CHAPTEREIGHT ConfidencelntervalEstimation
Xxz*=ro.essr(1.e6)g
4n {100
= 10.998
+ 0.00392
Sp ( 11.00192
10.99408
Xtz*=lo.eest(2.58)g
4n {100
= 1 0 . 9 91
80 . 0 0 5 1 6
(p ( 11.00316
10.99284
8.2 CONFIDENCEINTERVALESTIMATION
FORTHE MEAN (o UNKNOWN)
Just as the mean of the population, p, is usually unknown, you rarely know the actual standard
deviation of the population,o. Therefore,you often need to constructa confidenceinterval
estimateof p, using only the sample statistics X and S.
Student'st Distribution
At the beginning of the twentieth century, William S. Gosset, a statistician for Guinness
Breweries in Ireland (see reference3) wanted to make inferencesabout the mean when o was
unknown. BecauseGuinnessemployeeswere not permitted to publish researchwork under
their own names,Gossetadoptedthe pseudonym"Student." The distribution that he developed
is known as Student's r distribution and is commonly referred to as the t distribution.
If the random variableX is normally distributed"then the following statistic has a t distri-
bution with n - I degrees of freedom:
Y -tt
s
,FN
This expressionhas the same form as the Z statistic in Equatron(7 .4) on page 266, except that
S is used to estimatethe unknown o. The concept of degreeso.f'.freedom is discussedfurther on
pages 291-292.
FIGURE
8.4
Standardized - S t a n d a r d i z endo r m a ld i s t r i b u t i o n
n o r m a ld i s t r i b u t i o n - f distribution
a n d t d i s t r i b u t i o nf o r for 5 degrees
5 degrees of freedom of freedom
The degreesof freedom, n - 1, aredirectly related to the sample size,n. As the samplesize
and degreesof freedom increase,S becomesa better estimateof o, and the r distribution grad-
ually approachesthe standardizednormal distribution, until the two are virtually identical.
With a sample size of about 120 or more, S estimateso precisely enough that there is little dif-
ferencebetweenthe t and Z distributions.
As statedearlier, the t distribution assumesthat the random variableX is normally distrib-
uted. In practice, howeveq as long as the sample size is large enough and the population is not
very skewed, you can use the I distribution to estimate the population mean when o is
unknown. When dealing with a small sample size and a skewedpopulation distribution,the
validity of the confidence interval is a concern.To assessthe assumptionof normality, you can
8.2: Confidence for theMean(o Unknown) 291
IntervalE,stimation
evaluate the shape of the sample data by using a histogram, stem-and-leafdisplay, box-and-
whisker plot, or normal probabilityplot.
You find the critical values of r for the appropriatedegreesof freedom from the table of the
r distribution (seeTable E.3). The columns of the table representthe area in the upper tail of the
r distribution. The rows of the table representthe degreesof freedom.The cells of the table rep-
resentthe particular / value for each specific degreeof freedom. For example, with 99 degrees
of freedom, if you want 95% confidence, you find the appropriatevalue of l, as shown in Table
8. L The 95% confidencelevel meansthat2.5ohof the values(an areaof 0.025)are in eachtail
of the distribution. Looking in the column for an upper-tail area of 0.025 and in the row corre-
sponding to 99 degreesof freedom gives you a critical value for t of 1.9842.Becauser is a sym-
metrical distribution with a mean of 0, if the upper-tail value is +1.9842, the value for the
lower-tail area (lower 0.025) is - l .9842.A I value of -1.9842 means that the probability that t
is lessthan -1.9842 is 0.025,or2.5oh(seeFigure 8.5).Note that for a 95% confidenceinterval,
you will always use an upper-tail area of 0.025. Similarly, for a99o/oconfidence interval, use
0 . 0 0 5 ,f o r 9 8 % ou s e0 . 0 1 ,9 0 % u s e0 . 0 5 ,a n d 8 0 % u s e0 . 1 0 .
T A B L E8 . 1 Upper-TailAreas
Determining the Critical Degreesof Freedom ?{ .10 .05 .01 .005
Valuefrom the t Table
foran Areaof 0.025 I 1.0000 3.0777 6 . 3 1 3 8 1 2 . 31.8207 63.6574
in EachTailwith 99 2 0 . 8 1 6 5 1 . 8 8 s 6 2.9200 4 7 6.9646 9.9248
Degrees of Freedom J 0.1649 1 . 6 3 1 7 2.3s34 3 . 824 4.s407 5.8409
4 0.7407 1.s332 2 . 1 l38 2 . 164 3.1469 4.604r
5 0.1267 1 . 4 1 s 9 2 . 0 1 s 0 2 706 3.3.649 4.0.322
FIGURE
8.5
t d i s t r i b u t i o nw i t h 9 9
degreesof freedom
+1.9842
In order to compute 52, you first need to know X . Therefore, only r - I of the samplevalues
are free to vary. This means that you have n - 1 degreesof freedom. For example, suppose a
sample of five values has a mean of 20. How many values do you need to know beforeyoucan
determine the remainder of the values?The fact that n: 5 and X = 20 also tells you that
il
Tr =too
.L"t
because
n
\v
i=l ;
Thus, when you know four of the values, the fifth one is not free to vary becausethe sum must
add to 100. For example, if four of the values are 18, 24, 19, and 16, the fifth value must be 23
so that the sum equals100.
INTERVAL
CONFIDENCE FORTHEMEAN(o UNKNOWN)
V +t..,
s
G
where /,_, is the critical value of the r distribution, with n - I degreesof freedom for an
areaof al2 in the upper tail.
To illustrate the application of the confidence interval estimate for the mean when the
standarddeviation, o, is unknown, recall the Saxon Home Improvement Company Using
Statisticsscenariopresentedon page 284.You wanted to estimatethe mean dollar amount
listed on the salesinvoices for the month. You select a sample of 100 sales invoices from the
population of salesinvoices during the month, and the sample mean of the 100 salesinvoicesrs
$110.27,with a sample standarddeviationof $28.95.For95o/oconfidence,the critical value
from the / distribution(as shown in Table 8.1) is L9842. Using Equation(8.2),
it. 'S
^ lln I T
1n
= | t0.2+ 28.95
7 ( 1 . s 8 4) 2
{100
= 1 1 0 . 2+7 5 . 7 4
$ 1 0 4 . 5( 3u < $ 1 1 6 . 0 1
FIGURE8.6 f-A'*--q
j 1 rEstimate
for the MeanSalesInvoiceI
MicrosoftExcel m-
1--_
worksheet to compute j- Q Data
a confidence interval ;4ffi
estimate for the mean 5ft
s a l e si n v o i c ea m o u n t 6 t S a m n l eS i z e
for the SaxonHome 7 ConfidenceLevel
lmprovement Company B
9 l n t e r m e d i aCt ae l c u l a t i o n s
1 0 S t a n d a rEdn o ro f t h eM e a n :84/SaRT{86}
s of Freedrlm =85-1
12 | Value =Trilv(l - 87,811)
57443 = 8 1 2" 8 1 0
SeeSectlon EB.2to create
thrs. ConfidenceInterval
:85 - 813
:85 * 813
Thus, with 95% confidence,you concludethat the mean amount of all the salesinvoicesis
between$ 104.53and $ I 16.01. The 95% confidencelevel indicatesthat if you selectedall possi-
ble samplesof 100 (something that is never done in practice), 95o/oof the intervals developed
would include the populationmean somewherewithin the interval.The validity of this confi-
denceinterval estimatedependson the assumptionof normality for the distributionof the amount
of the salesinvoices.With a sampleof 100,the normality assumptionis not overly restrictive,and
the useof the t distribution is likely appropriate.Example 8.3 further illustrateshow you construct
the confidenceinterval for a mean when the population standarddeviation is unklown.
E X A M P L E8 . 3 ESTIMATING
THEMEANFORCEREOUIRED
TO BREAKELECTRIC
INSULATORS
A manufacturingcompanyproduceselectric insulators.If the insulatorsbreak when in use,a
short circuit is likely. To test the strength ofthe insulators,you carry out destructivetesting to
determine how rnuch,lbrzc is required to break the insulators.You measureforce by observing
how many poundsare appliedto the insulatorbefore it breaks.Table 8.2 lists 30 valuesfrom
this experiment, which are located in the file [![!fQ Construct a 95o/oconfidence interval
estimatefor the population mean force required to break the insulator.
T A B L E8 . 2
1,870 1,128 1,656 I,610 1,634 t,784 1,522 1,696 1,592 t,662
F o r c e( i n P o u n d s ) 1 , 8 6 6 1 , 7 6 4 1 , 7 3 4 1 . 6 6 2 1 , 7 3 4 1 , 7 7 4 1 , 5 5 0 | . t 5 6 t . 76 2 r , 8 6 6
R e q u i r e dt o B r e a k 1 , 8 2 0 t , 7 4 4 1 , 7 8 8 1 , 6 8 8 1 , 8 1 0 1 , 7 5 2 1 , 6 8 0 I. 8 1 0 t . 6 5 2 t , 7 3 6
theInsulator
FIGURE8.7
MicrosoftExcel
confidence interval
estimate for the mean
amountof force
requiredto break
electricinsulators q_ ___-_,,___
L
I i Intermediale
Calculations
=BIISORT{85)
-86-l
-Tlilv(l - 87, 811)
llnlervalHalfWidlh -812 " 810
SeeSection EB.2to create
this.
-85 - 813
-85 . 813
294 CHAPTEREIGHT Confidence
Intcrval
Estin.ration
each tail, with 29 degreesof freedom.From Table 8.3, you seethat t),\:2.0452. Thus, usi
, - 3 0 ,a n dt , o : 2 . 0 4 5 2 ,
X = 1 , 1 2 3 . 4S, : 8 9 . 5 5 n
x + t, t
s
G
-t)7 89'55
= | 4 -+ 't-) "o r-s ") r'
'60
= 1 . 1 2 3 . 4+ 3 3 . 4 4
1,689.96{uS1.756.84
p r o b a b i l i t yp l o t f o r
1600
the amount of force
ranr riraA tn hrorlz 1400
e l e c t r i ci n s u l a t o r s
1200
! rooo
800
200
---#
-?.5 -1.5 .1 .0.5 0 0.5 1 2 2.5
Z Value
a. Constructa 95ohconfidenceinterval for the population pilation stagein which the policy pagesare generatedand
meanbouncedcheckfee. sentto thebank for delivery.The ability to deliverapproved
b. Interpretthe intervalconstructedin (a) policiesto customersin a timely manneris criticalto the
profitabilityof this serviceto the bank.During a periodof
8.19 The datain the file @ft[! representthe total fat,
one month,a randomsampleof 27 approvedpolicieswas
in gramsper serving,for a sampleof 20 chickensand-
selected,and the total processingtime, in days,wasas
wichesfrom fast-foodchains.The dataareasfollows:
shownbelowandstoredin the file@,
7 8 4 5 1620202419 30 t3 19 t6 64 28 28 3t 90 60 56 3l s6 22 l8
23 30 25 t9 29 29 30 30 40 s6
"FastFood:AddingHealthto theMenu," 45 48 t7 t7 lt 91 92 63 50 sl 69 16 17
Extractedfrom
Source:
ConsumerReports,
September2004,pp.28-31. a. Constructa 95ohconfidenceinterval estimateof the
a. Constructa95o/oconfidenceintervalfor the population meanprocessing time.
meantotal fat, in gramsper serving. b. What assumptionmust you make about the population
in (a).
b. Interpretthe intervalconstructed distributionin (a)?
c. Do you think that the assumptionmadein (b) is seri-
8.20 Oneof the majormeasures of the qualityof service ouslyviolated?Explain.
providedby any organizationis the speedwith which it
responds to customercomplaints.A largefamily-helddepart- 8.22 The data in the file [l$$@f!represent the bat-
ment storeselling furniture and flooring, including carpet, tery life (in shots)for three-pixeldigitalcameras:
hadundergone a majorexpansionin thepastseveralyears.In 300 180 85 170 380 460 260 35 380 t20 rr0 240
particular,the flooring departmenthad expandedfrom 2 Source: Extractedfrom "Cameras: More Featuresin the Mix,"
installationcrewsto an installationsupervisoqa measurer, ConsumerReports,July 2005,pp. 14 18.
and l5 installationcrews.Lastyear,therewere50 complaints a. Constructa 95o/oconfidenceinterval for the population
concemingcarpetinstallation.The following data,in the file meanbatterylife (in shots).
@EE, representthenumberof daysbetweenthereceipt b. What assumptiondo you needto makeaboutthe popu-
of a complaintandtheresolutionof the complaint: lationofinterestto constructthe intervalin (a)?
27 152 2 t23 81 74 21 c. Giventhe datapresented, do you think the assumption
54 5 35r37 3l
neededin (a) is valid?Explain.
ll t9 t26 ll0 110 2961359431265
8.23 One operationof a mill is to cut piecesof steelinto
t2 4 16s 32 29 2 8 2 9 2 6 2 s 114 13
partsthat are usedlater in the frame for front seatsin an
13105274 52 30 22 36 26 20 23 automobile.The steelis cut with a diamondsawandrequires
33 68 the resultingpartsto be within +0.005inch of the length
specifiedby the automobilecompany.The measurement
a. Constructa 95o/oconfidenceinterval estimateof the
reportedfrom a sampleof 100steelparts(andstoredin the
meannumberof daysbetweenthe receiptof a complaint
file @ is the difference,in inches,betweenthe actual
andtheresolutionof the complaint.
lengthof the steelpart, asmeasuredby a lasermeasurement
b. What assumptionmustyou makeaboutthe population
device,andthe specifiedlengthofthe steelpart. For exam-
distributionin (a)?
ple, the first observation,-0.002, represents a steelpartthat
c. Do you think that the assumptionmadein (b) is seri-
is 0.002inch shorterthanthe specifiedlength.
ouslyviolated?Explain.
a. Constructa 95% confidenceintervalestimateof the
d. What effect might your conclusionin (c) haveon the
meandifferencebetweenthe actuallengthof the steel
validityof theresultsin (a)?
part andthe specifiedlengthofthe steelpart.
8.21 In New York State,savingsbanksare permittedto b. What assumptionmust you make aboutthe population
sell a form of life insurancecalledsavingsbank life insur- distributionin (a)?
ance(SBLI). The approvalprocessconsistsof underwrit- c. Do you think that the assumptionmadein (b) is seri-
ing, which includesa reviewof the application,a medical ouslyviolated?Explain.
informationbureaucheck,possiblerequestsfor additional d. Comparethe conclusionsreachedin (a) with thoseof
medicalinformationandmedicalexams,anda policy com- Problem2.23on page53.
point estirnatefor n is the sampleproportion,p - Xln, where ir is the sarrple size and X is the
in the sarnplehaving the characteristicof interest.E,quation(8.3)defines the
nurnberof iter.t.rs
confidenceinterval estir.nate for the populationproportion.
coNFTDENCE
TNTERVAL
ESTTMATE
FOXTXSjROPORTTON
,+ 7.lP(t- Pt
IT
or
p(t - p) p(I - p)
p-Z <n< p+z (8.3)
n n
r = s a m p l es i z e
p(t- p)
rto n*7
il
an
CS ( 0 r. 0 x 0 . e 0 )
=0.10t(1.96)
ith 100
)nt
= 0 . 1 0t ( 1 . 9 6 ) ( 0 . 0 3 )
he
ral : 0 .l 0 + 0 . 0 5 8 8
3nt
0.0412<n(0.1588
m-
hat
Therefore,you havc95% confidencethatbetwecn4.12%and 15.88%of all thesalesinvoices
the
containerrors.Figure8.10showsa MicrosoftExcelworksheet
forthesedata.
,eel
FIGURE
8.10
1
Microsoft Excel 2
worksheet to construct 3
ac o n f i d e n cien t e r v a l ,
eri-
estimate for the 5
5
proportion of sales
:of 7
invoices that contain I
errors I =85/84
10 =NORMSTNV(I - 86)2)
11 S l a n d a r dE n o r o f t h e =sQRr(Be'(1- Bey84l
N lntewalHalfWidth =ABS(810'811)
IJ
, are 1i
15 *89 - 812
sris- =89'812
15
The
298 IntervalEstimation
CHAPTEREIGHT Confidence
E X A M P L E8 . 4 PRINTED
ESTIMATINGTHE PROPORTIONOF NONCONFORMINGNEWSPAPERS
A largenewspaper wantsto estimatethe proportionof newspapers printedthathavea noncon-
improper page setup. missing pages,or duplicate 8.
forming attribute,suchas excessive ruboff,
VC:
pages.A randomsampleof 200 newspapers from
is selected all the newspapersprintedduring
nonconformance. Constructand far
a singleday.For this sampleof 200,35 containsometype of
printed during the day (E
interfret a 90o/oconfidenceinterval for the proportion of newspapers
of
thathavea nonconformingattribute.
Jot
SOLUTION UsingEquation(8.3), a.
?5
- ::- = 0.175. and with a90oh level of confidenceZ =
'D b.
200
8.2
p(t - p) sm
p!z the
n
Ap
= 0 . 1 7 t5 ( .r ..6.4- s. [)o
{ -.;r: 7 s ) ( 0 . 8 2 s )
use
a.i
I
= 0.175r (1.645X0.0269)
l
t75 !0.0442 I
b.i
0 . 1 3 0 83 n 3 0 . 2 1 9 2
I
I
You concludewith 90% confidencethat between13.08%and2l.92ohof the newspapers I
printedon that dayhavesometype of nonconformance. c..
b.
collection to ensure that the confidence interval is narrow enough to be useful in making
decisions. Determining the proper sample size is a complicated procedure,subject to the
constraints of budget, time, and the amount of acceptable sampling error. In the Saxon
Home Improvement example, you want to estimate the mean dollar amount of the sales
invoices,you must determine in advancehow large a sampling error to allow in estimating
the population mean. You must also determine in advancethe level of confidence (that is,
90%,95%, or 99oh)to use in estimatingthe population pa'rameter.
o
+ z _T
X
"\ln
l ln this context, some The amount added to or subtractedfrom X is equal to half the width of the interval. This
statisticiansrefer to e as quantity representsthe amountof imprecisionin the estimatethat results from sampling error.
r.l
the "margin of error." T h e s a m p l i n ge r r o r . ' e . i s d e f i n e da s
o
_T
e=Z
1n
Solving for r gives the sample size needed to construct the appropriate confidence interval
'Appropriate"
estimatefor the mean. means that the resulting interval will have an acceptable
amount of sampling error.
SAMPLESIZEDETERMINATION
FOR THE MEAN
The samplesize,n, is equalto the productof the Z valuesquaredandthe variance,
o, squared,dividedby the squareof the samplingerroq e,
22o2
n-""}:-
I
(8.4)
e
FIG
2YouuseZ insteadof t To determine the sample size, you must know three factors: Micr
won
because,to determine the
1. The desired confidence level, which determinesthe value of Z, the critical value from the dete
criticalvalue of t, you need
s t a n d a r d i z endo r m a ld i s t r i b u t i o n 2
for e
to know the samplesize,but saleJ
you do not know it yet. For 2. The acceptablesampling error, e
for t
most studies, the sample size 3. The standarddeviation,o lmpr
neededis large enough that
l=
the standardizednormal In some business-to-businessrelationships that require estimation of important parame-
distributionis a good ters, legal contracts specify acceptablelevels of sampling error and the confidence level lG
h
approximation of the required. For companiesin the food or drug sectors,governmentregulationsoften specify See I
t distribution. sampling errors and confidence levels. In general, however, it is usually not easy to specify this.
8 . 4 :D c t c r n r i n i S
n sa r r o lS
c izc 301
the two factorsneededto determinethe samplesize.How can you determinethe level of corr-
fidence and sarnplingerror?Typically,thesequestionsare answeredonly by the subjectmat-
ter expert(that is, the individual most farniliar with the variablesunder study).Although 95%
is the most common confidencelevel used,if rnore confidenceis desired"then 99%,rnight be
more appropriate;if less confidenceis deemedacceptable,then 90%,uright be used.For the
samplingerror, you shouldthink not of how much samplingerror you would like to have(you
really do not want any error) but of how much you'can toleratewhen drawing conclusions
from the data.
In additionto specifyingthe confidencelevel and the sarnplingerror,you needan estimate
of the standarddeviation.Unfortunately,you rarely know the populationstandarddeviation,o.
In some instances,you can estimatethe standarddeviationfrom past data.In other situations,
you can make an educatedguessby taking into accountthe rangeand distributionofthe vari-
able. For example,if you assumea normal distribution,the rangeis approximatelyequalto 6o
(that is, +3o aroundthe mean) so that you estimateo as the rangedivided by 6. If you cannot
estintateo in this way,you can conducta small-scalestudy and estimatethe standarddeviatiorr
f r o r nt h e r e s u l t i u gd a l a .
To explore how to determinethe samplesize neededfor estirnatingthe populationmean,
consideragainthe audit at SaxortHome lmprovement.In Section8.2, yoLrselecteda sampleof
100 salesinvoicesand constructeda 95o/oconfidenceintervalestimateof the populationrnearr
salesinvoice amount.How was this samplesize detennined?Should you have selecteda dif-
ferentsamplesize'/
Supposethat, after consultationwith company officials, you determinethat a sarnpling
e r r o r o f n o m o r e t h a n + $ 5 i s d e s i r e d ,a l o n g w i t h 9 5 o l c o n f i d e n c e .P a s td a t a i n d i c a t et h a t
t h e s t a n d a r dd e v i a t i o no f t h e s a l e sa r n o u n ti s a p p r o x i m a t e l y$ 2 5 . T h u s , e : $ 5 , o - $ 2 5 , a n d
Z : | . 9 6 ( f o r 9 5 o hc o n f i d e n c e ) .U s i n g E q u a t i o n( 8 . 4 ) ,
22o2 ( I . 9 6) 2( 2 5 ) 2
e- (5)"
= 96.04
FIGUR8
E. 1 1
MicrosoftExcel
worksheetfor I
determiningsamplesize 2
3
torestimatingthe mean 4
salesinvoiceamount q
forthe SaxonHome 6
improvement Company 7
Le-
t-r ct
I -NORTS[{V(l- 86}21
t--t 10 -((89'B4)/85r2
/el ul 11
fv SeeSectionEB.4to create tz
;; -ROUr{DUP(810,
0}
ifv mls. IJ
302 CHAPTER EIGHT Confidence Interval Estimatron
z2o2 (1.e6)2(loo)2
'n- = -
e2 Q'2
o
-T
e=Z
"ln
Z2n(l - n)
,= (8.s)
",
8.4:Determining
Sample
Size 303
1. The desiredconfidencelevel, which determinesthe value of Z, the critical value from the
standardizednormal distribution
2. The acceptablesamplingerror,e
3. The populationproportion,n
In practice,selectingthesequantitiesrequiressomeplanning.Onceyou determinethe
desiredlevel of confidence,you can find the appropriateZ valuefrom the standardized normal
distribution.The samplingerror,e, indicatesthe amountof error that you arewilling to tolerate
in estimatingthe populationproportion.The third quantity,n, is actuallythe populationparu-
meterthat you want to estimate!Thus,how do you statea valuefor what you aretaking a sam-
ple in orderto determine?
Hereyou havetwo alternatives.In many situations,you may havepastinformationor rele-
vant experiencethat providean educatedestimateof n. Or, if you do not havepastinformation
or relevantexperience,you cantry to providea value for a that would neverunderestimatethe
samplesize needed.Referring to Equation (8.5), you can seethat the quantity n(l - ru)
appearsin the numerator.Thus,you needto determinethe value of n that will makethe quan-
tity n(l - n) as largeas possible.When n : 0.5, the productn(l - n) achievesits maximum
result.To showthis, severalvaluesof n, alongwith the accompanyingproductsof a(l - n), are
as follows
Z2n(l - n)
"2
( 1.e6) 2( 0.1sxO.s5)
(0.07)"
= 99.96
Becausethe generalrule is to round the samplesize up to the next whole integerto slightly
oversatisfiithe criteria,a samplesizeof 100is needed.Thus,the samplesizeneededto satisf,i
the requirementsof the company,basedon the estimatedproportion,desiredconfidencelevel,
and samplingerror,is equalto the samplesizetakenon page297.The actualconfidenceinter-
val is narrowerthan requiredbecausethe sampleproportionis 0.10,while 0.l5 wasusedfor n
in Equation(8.5).Figure8.12showsa Microsoft Excel worksheetfor determiningsamplesize.
304 CHAPTEREIGHT ConfidenceIntervalEstimation
FIGURE8.12 A B
or thc Proportlon of InInor Salcr Invrlecl
MicrosoftExcel
worksheetfor Dd
determining sample 4 I|md. ofTru. ProDofdon 0.rl
unlllno Enor
sizefor estimatingthe 6 funcr Lrvrl
proportionof sales 7
invoiceswith errors I lnlermedietoCalculalions
I ! Value 1 -lloRxsfirv(l - Bs]n]
for the SaxonHome t0 CelculatedSamolaSize -(89^2. Bf. (1 - B4D/85^2
lmprovementCompany tl
t-l t3 r Slr. -ROUI{DUP(810,0)
I\N7T
I-ll
E
(1.645)',(0.50X0.s0)
(0.05)2
= 270.6
+0.02of the populationproportion,how largea sample d. You havebeenaskedto updatethe resultsofthis study,
sizeis required? Determinethe samplesize necessary to estimatethe
e. Discussthe effectsof changingthe desiredconfidence populationproportionsin (a)through(c) to within +0.02
level and the acceptable
samplingerror on samplesize with 95o/o
confidence.
requirements.
8.49 A studyof 658 CEOsconductedby the Conference
8.48 A poll of 1,286youngadult cell phoneuserswas Board reportedthat 250 statedthat their company'sgreat-
conductedin March 2006.Thesecell phoneusers,aged est concern was sustainedand steadytop-line growth
78-29,wereactivelyengagedin multipleusesof their cell ("CEOs' GreatestConcerns,"USA TodaySnapshots, May
phones.The datasuggestthat 707 took still pictureswith 8 , 2 0 0 6p, . 1 D ) .
their phones,604 playedgames,and 360 usedthe Internet a. Constructa 95ohconfidenceintervalfor the proportion
(Extracted from "Poll: CellphonesAre Annoying but of CEOs whosegreatestconcernwas sustained and
Invaluable,"usatoday.com, April 3,2006). Constructa steadytop-linegrowth.
95%confidenceintervalestimateof thepopulationpropor- in (a).
b. Interpretthe intervalconstructed
tion of youngadultsthatusedtheir cell phoneto c. To conducta follow-upstudyto estimatethe population
a. takestill pictures. proportionof CEOs whosegreatestconcernwassus-
b. play games. tainedand steadytop-linegrowthto within +0.01with
c. usethe Internet. 95% confidence,how manyCEOswouldyou survey?
AUDITING
Auditing is the collectionand evaluationof evidenceaboutinformationrelatingto an
economicentity,suchas a solebusinessproprietor,a partnership,a corporation,or a
governmentagency,in orderto determineand report on how well the information
t,
iorrespondsto establishedcriteria.
Estimatingthe PopulationTotalAmount
In auditing applications,you are often more interestedin developing estimatesof the popula-
tion total amount than the population mean. Equation (8.6) shows how to estimate a popula-
tion total amount.
ryX = (5,000X$l
10.27)= $551,350
,c N-n too
NX + N(tntl:- =55r,350
t (s,000x1.e
84D?J-/t'999
'
"ln N-1 ./tOO1 5,000- I
= 551,350+ 28,721.295(0.99005)
= 551,350
+ 28,436
$522,914< Population
total < $579,786
8.13
TolalAmounl of All SalosInvoicet
Excel a
E X A M P L E8 . 7 DEVELOPINGA CONFIDENCEINTERVALESTIMATE
FOR THE POPULATIONTOTAL
An auditoris facedwith a populationof 1,000vouchersandwantsto estimatethe total value
the populationof vouchers.A sampleof 50 vouchersis selected,
with the followingresults:
Meanvoucheramount(X ) = $1.076.39
Standarddeviation(S) =$2'73.62
7try = (1,000)(1,07
6.39)= $1,076,390
( 1.000x I ( 1.000x2.0
1,076.39) ogol2J3fr
= 1,076,390
+ 77,762.878
(0.97517)
= 1,076,390
+ 75,832
< Population
$1,000,558 total < $1,152,222
DifferenceEstimation
An auditorusesdifferenceestimationwhenhe or shebelievesthaterrorsexistin a setof
and he or shewantsto estimatethe magnitudeof the errorsbasedonly on a sample.The
lowing stepsareusedin differenceestimation:
l . Determinethe samplesizerequired.
, Calculatethe differencesbetweenthe valuesreachedduring the audit and the original
uesrecorded.The differencein valuei, denotedD,, is equalto 0 if the auditorfindsthat
original value is correct,is a positivevaluewhen the auditedvalueis largerthanthe
nal value,and is negativewhenthe auditedvalue is smallerthanthe original value.
3. Computethe meandifferencein the sample,D,by dividing the total differenceby
samplesize,as shownin Equation(8.8).
MEAN DIFFERENCE
2o,
D-t=t (8.8)
n
whereD, : Audited value- Original value
8.5: Applicationsof ConfidenceInterval Estimationin Auditing 309
STANDARD
DEVIATION
OF THE DIFFERENCE
Sr-^ :.,
L(ui-Dr
i=l
SD= (8.e)
n-l
INTERVAL
CONFIDENCE E STIMATE
FORTHETOTALDIFFERENCE
.s^ffi;
N D + N ( t 'n ) l # " , l - (s.10)
4 n \ lN - l
$9.03 $1.41 $17.32 $8.30 $s.21 $ r 0.80 $6.22 $5.63 $4.97 $7.43 52.99 54.63
i,,
-eo
D-t=t ='" =0.90
n 100
numerator,there and3
differences.Each
last88
areequal n
0.eF -D)'
\tn,
;-l
SD=
,-l
Sn = 2'752
(s,000)(0.90)
r (s,000x1
.s84D2+
= 4 , 5 0 0+ 2 , 7 0 2 . 9 1
FIGURE8.14
otal DlfforcnceIn Acfual lnd Enter.d
MicrosoftExcel
worksheetfor the tota
differencebetweenthe
invoiceamountsfound
duringauditand the
amountsenteredinto
the accountingsystem -SUl{olficrencod).t!!AAl
-80/85
for the SaxonHome -Ba'B!0
lmprovement Company -SARTGl6)
-saRT{(Bl -85}/(8r - ll}
-F4 - 812'8lrySQRT(Fq
-B!t - |
-Tll{\t(l - 86, 815}
-815'Bll
See Section E8.7 to create
this.
- 8 1 1. 8 1 7
-B1l + 817
E X A M P L E8 . 8 DIFFERENCE
EST]MATION
Returningto Example8.7 on page308,supposethat 14vouchersin the sampleof 50 vouchers
containerrors.The valuesof the 14 errorsare listedbelow and storedin the file !@@.
Observethat two differencesarenesative:
tr,
: ?i =
6e0.34
=13.8068
D-'='
n50
-
)j = to, D)'
I
SD=
n-l
) 2( 3 8 . 9 7- 1 3 . 8 0 6 8 +
(75.4r- 13.8068+ ) 2' . ' + ( 0 - 1 3 . 8 0 6 8 ) 2
= 37.427
-]
I
j
ferencein the population as follows:
( 1,000x ) lg
+(1,000)(2.0096
l 3.8068)
= 13,806.8
+ 10,372.4
< Totaldifference< 524,179.20
$3,434.40
ONE-SIDED
CONFIDENCE
INTERVAL
FORA PROPORTION
- p+Z p(t - p)
Upperbound (8.11)
n
where Z : the value corresponding to a cumulative area of ( I - cr) from the standardized
normal distribution (that is, a right-hand tailprobability of a).
= 0 . 0 5+ 1 . 6 4 5 ( 0 . 0 1 0 9 X 0=. 908. )0 5+ 0 . 0 1 7 6= 0 . 0 6 7 6
312 CHAPTEREIGHT ConfidenceIntervalEstimation
E X A M P L E8 . 9 ESTIMATING
THE RATEOF NONCOMPLIANCE
A largeelectronics
firm writesI millionchecksayear.An internalcontrolpolicyfor the
pany is that the authorizationto sign eachcheckis grantedonly after an invoicehasbeen
tialed by an accountspayablesupervisor.The company'stolerableexceptionrate for this
trol is 4%. If control deviationsare found in 8 of the 400 invoicessampled,what should
auditordo?To solvethis. usea95Yolevelofconfidence.
SOLUTION Theauditorconstructs a95Yoone-sidedconfidenceintervalfor the
invoicesin noncomplianceand comparesthis to the tolerableexceptionrate. Using
(8.11) on page3 I l, p : 81400: 0.02,andZ : 1.645for 95oloconfidence,
-
= 0.02+ 1.6450.02(l 0.02)
= 0.02+ 1.645(0.007X0.9998)
= 0.02+ 0.01l5 = 0.0315
Theauditorconcludes
with95%confidence
thattherateof noncompliance
is lessthan3.
Becausethis is less than the tolerableexceptionrate, the auditor concludesthat the i
controlcomplianceis adequate.In otherwords,the auditoris more thang5ohconfident
rateof noncompliance is lessthan4%.
8.52 If p: 0.04,n : 300,and N: 5,000,calculatethe ll0 362 246 85 510 208 173 425 316 r79
upper bound for a one-sidedconfidenceinterval estimate
of the populationproportion,7t,using the following levels Constructa 90ohconfidenceinterval estimateof the total
ofconfidence: family dental expensesfor all employeesin the preceding
a.90o/o b. 95% c. 99o/o yeat.
8.6: ConfidenceIntervalEstimationand Ethical Issues 3 l3
A branchofa chainoflaree electronics storesis con- Sample Historical Audited Sample Historical Audited
ing an end-of-monthinventory of the merchandisein Number Cost ($) Value ($) Number Cost ($) Value ($)
8.6 ESTIMATION
INTERVAL
CONFIDENCE AND ETHICAL
ISSUES
Ethicalissuesrelatingto the selectionof samplesandthe inferences that accompany themcan
occur in severalways.The major ethical issuerelatesto whetherconfidenceinterval estimates
areprovidedalongwith the samplestatistics.To providea samplestatisticwithout alsoinclud-
ing the confidenceinterval limits (typically set at 95o/o),thesamplesizeused,and an interpre-
tation of the meaningof the confidenceintervalin termsthat a laypersoncanunderstandraises
ethicalissues.Failureto includea confidenceintervalestimatemight misleadthe userof the
resultsinto thinking that the point estimateis all that is neededto predictthe populationchar-
acteristicwith certainty.Thus,it is importantthat you indicatethe intervalestimatein a promi-
nent place in any written communication,along with a simple explanationof the meaningof
the confidenceinterval.In addition,you shouldhighlightthe samplesize.
Oneof the mostcommonareaswhereethicalissuesconcerningestimationoccursis in the
publicationof the resultsof politicalpolls.Often,theresultsof the polls arehighlightedon the
314 IntervalEstimatron
CHAPTEREIGHT Confidence
front page of the newspapeq and the sampling error involved along with the methodology used
is printed on the page where the article is typically continued,often in the middle of the news-
paper.To ensurean ethical presentationofstatistical results,the confidence levels,samplesize,
and confidence limits should be made availablefor all surveysand other statisticalstudies.
8.7 o (CD-ROMTopiclESTIMATION
AND SAMPLESlzE
DETERMINATIONFOR FINITEPOPULATIONS
In this section,confidence intervals are developedand the sample size is determinedfor situa-
tions in which sampling is done without replacement from a finite population. For further dis-
cussion,seeE@EEEEUon the StudentCD-ROM that accompaniesthis book.
To determinewhat equationto usefor a particularsitu- . Ifyou havea numericalvariable,do you know thepopu-
ation,you needto askseveralquestions: lationstandarddeviation?If you do, usethe normaldis-
. Are you developinga confidenceinterval or are you tribution.If you do not,usethe t distribution.
determiningsamplesize?
. Do you havea numericalvariableor do you havea cate- The next four chaptersdevelopa hypothesis-testing
gorical variable? approachto makingdecisionsaboutpopulationparameten.
Confidence Interval for the Mean (o Known) Confidence Interval for the Mean (o Unknown)
o
X + Z -T=
,s
\ln T,
X -'* , <X+z* (8.1)
s.s
X-tn-t--r<tt<X+tn-'#
\ln tln .ln
Chapter
ReviewProblems 315
Confidence
Interval Estimatefor the Proportion SampleSizeDeterminationfor the Mean
L.
22o2
P = zl!!!:-!)
\n
ll
-- ----=-
e' (8.4)
beverages. HFCS is cheaperthan sugarand is about75% Supposethat the market researcherwants to take
sweeterthansucrose.Someresearchers think that HFCSis anothersurveyin a differentcity.Answerthesequestions:
linkedto the growingobesityproblemin the UnitedStates c. What samplesize is requiredto be 95o/oconfidentof
(Extracted from P. Lempert, "War of the Sugars," estimatingthe populationmeanto within *2 hoursand
Progressive Grocer,April 15, 2006,p.20). The following assumes that the populationstandarddeviationis equal
consumerviews are from a nationwidesurvevof 1.1l4 to 5 hours?
responses: d. What samplesize is neededto be 95o/oconfidentof
Viewson HFCS being within +0.035of the populationproportionwho
watchthe eveningnewson at least3 weeknightsif no
Yes No previousestimateis available?
e. Basedon (c) and (d), what samplesizeshouldthemar-
Are you concernedaboutconsumingHFCS? 80o/o20o/o ket researcher selectif a singlesurveyis beingcon-
Do you think HFCSshouldbe banned
ducted?
in foodsoldto schools? 88% 12%
Do you think HFCSshouldbe banned 8.71 The real estateassessorfor a county government
inallfoods? 56% 44% wants to study various characteristicsof single-family
Constructa95o/oconfidenceintervalestimateof the popu- housesin the county. A random sample of 70 houses
lationproportionof peoplewho revealsthe following
. Heatedareaof the houses(in squarefeet): X = 1,759,
a. areconcerned aboutconsumingHFCS.
b. think HFCSshouldbe bannedin food soldto schools. ^s:380.
. 42 houseshavecentralair-conditioning.
c. think HFCSshouldbe bannedin all foods.
d. You are in chargeof a follow-up survey.Determinethe a. Constructa 99o/oconfidenceinterval estimateof the
samplesizenecessary to estimatethe proportionsin (a) populationmeanheatedareaof thehouses.
(c)
through to within +0.02 with 95oloconfidence. b. Constructa 95o/oconfidenceinterval estimateof the
populationproportionof housesthat havecentralair-
8.69 StarwoodHotels conducteda surveyof 401 top conditioning.
who play golf (Extractedfrom D. Jones,"Many
executives
CEOs Bend the Rules (of Golf)," USA Today,June26, 8.72 Thepersonneldirectorof a largecorporationwishes
2002).Among the resultswerethe following: to study absenteeism amongclerical workersat the corpo
. 329cheatat golf. ration'scentraloffice during the year.A randomsampleof
. 329 hateotherswho cheatat golf. 25 clericalworkersrevealsthe following:
. 289believebusiness andgolfbehaviorparallel. . Absenteeism: X = 9.7 days,S: 4.0days.
. 80 wouldlet a clientwin to getbusiness. . l2 clericalworkerswereabsentmorethan l0 days.
. 40 would call in sick to play golf. a. Constructa 95o/oconfidenceinterval estimateof t
Constructa 95o/oconfidenceintervalestimatefor eachof meannumberof absencesfor clerical workersduri
thesequestions.Basedon theseresults,what conclusions the year.
canyou reachaboutCEOs'attitudestowardgolfl b. Construct a 95o/oconfidence interval estimate of the
populationproportionof clericalworkersabsent
8.70 A marketresearcherfor a consumerelectronics than 10 daysduring the year.
companywantsto studythe televisionviewing habitsof Suppose thatthepersonneldirectoralsowishesto take
residentsof a particular area.A random sampleof 40 survey a branchoffice.Answerthesequestions:
in
respondents is selected,and eachrespondentis instructed c. What sample size is neededto have 95% confidencei
to keepa detailedrecordof all televisionviewingin a par- estimatingthe populationmeanto within +1.5days
ticularweek.The resultsareasfollows: the populationstandarddeviationis 4.5 days?
. Viewingtime per week: X = 15.3hours,.t: 3.8
d. What samplesizeis neededto have90% confidence
hours. estimatingthepopulationproportionto within +0.075
. 27 respondents watchthe eveningnewson at least3 no previousestimateis available?
weeknights. e. Basedon (c) and(d), whatsamplesizeis neededif a
a. Constructa 95o/oconfidenceinterval estimatefor the gle surveyis beingconducted?
meanamountof televisionwatchedper week in this
city. 8.73 The marketresearchdirectorfor Dotfy's
b. Constructa 95ohconfidenceinterval estimatefor the Store wants to study women's spendingon
populationproportionwho watchthe eveningnewson A surveyof thestore'screditcardholdersis designed
in
at least3 weeknightsper week. to estimatethe proportionof womenwho purchase
ReviewProblems 317
Chapter
primarily from Dotty's DepartmentStore and the a. Constructa 95ohconfidenceinterval estimateof the
yearlyamountthatwomenspendon cosmetics.A pre- populationmeanamountspentin the pet supplystore.
survevfoundthat the standarddeviationof the amount b. Constructa 90ohconfidenceinterval estimateof the
spendon cosmeticsin a yearis approximately$ 18. population proportion of customerswho own only a
samplesizeis neededto have99o/oconfidenceof cat.
imatingthe populationmeanto within +$5? The branchmanagerof anotheroutlet (Store2) wishes
samplesize is neededto have 90% confidence
,-What to conducta similar surveyin his store.The managerdoes
estimatingthe population proportion to within not have accessto the information generatedby the man-
.045? agerof Storel. Answerthe followingquestions:
on the resultsin (a) and (b), how many of the c. What samplesizeis neededto have95% confidenceof
's credit card holders should be sampled? estimatingthe populationmean amount spentin his
storeto within +$1.50if the standarddeviationis $10?
d. What samplesize is neededto have90% confidenceof
The branch managerof a nationwide bookstore
estimatingthe populationproportion of customerswho
wantsto study characteristicsof her store'scus-
own only a cat to within +0.045?
Shedecidesto focus on two variables:the amount
e. Basedon your answersto (c) and (d), how large a sam-
spentby customersand whetherthe customers
ple shouldthe managertake?
considerpurchasingeducationalDVDs relating to
preparationexams,such as the GMAT, GRE, or 8.76 The owner of a restaurantthat servescontinental
. Theresultsfrom a sampleof 70 customersare as food wantsto study characteristics of his customers.He
decidesto focus on two variables:the amountof money
Amountspent:X = $28.52,,S:$11.39. spentby customersand whethercustomersorderdessert.
28 customersstatedthat they would considerpur- The results from a sample of 60 customersare as
chasinsthe educationalDVDs. follows:
a 95%oconfidenceinterval estimateof the . Amountspent:X = $38.54,S=$7.26.
meanamountspentin the bookstore. . l8 customerspurchaseddessert.
a 90% confidenceinterval estimateof the a. Constructa 95o/oconfidenceinterval estimateof the
Lonproportionof customerswho would consider population mean amount spent per customerin the
ine educationalDVDs. restaurant.
that the branchmanaserof anotherstorein the b. Constructa 90ohconfidenceinterval estimateof the
wantsto conduct a similar survey in his store. population proportion of customerswho purchase
thefollowing questions: dessert.
samplesize is neededto have95% confidenceof The ownerof a competingrestaurantwantsto conducta
ing the populationmean amount spent in his similar surveyin her restaurant.This owner doesnot have
to within +$2 if the standarddeviationis assumed accessto the information of the owner of the first restau-
$10? rant.Answerthe following questions:
samplesizeis neededto have90% confidenceof c. What samplesizeis neededto have95% confidenceof
ing the populationproportionwho would con- estimatingthe populationmean amount spentin her
purchasingthe educational DVDs to within restaurantto within +$1.50,assumingthat the standard
,| deviationis $8?
on your answersto (c) and (d), how large a sam- d. What samplesize is neededto have90% confidenceof
shouldthe managertake? estimatingthe populationproportion of customerswho
purchasedessertto within +0.04?
The branchmanagerof an outlet (Store l) of a
e. Basedon your answersto (c) and (d), how large a sam-
chainof pet supply storeswantsto study char-
ple shouldthe ownertake?
icsof her customers.In particular,shedecidesto
twovariables:the amountof moneyspentby cus- 8.77 The manufacturerof "Ice Melt" claimsits product
andwhetherthe customersown only one dog, only will melt snow and ice at temperaturesas low as 0o
ormorethanonedog and/orcat.The resultsfrom a Fahrenheit.A representativefor a large chain of hardware
of 70 customersare as follows: storesis interestedin testingthis claim. The chain pur-
t of moneyspent:X = $21.34,S:59.22. chasesa largeshipmentof 5-poundbagsfor distribution.
customers own only a dog. The representativewants to know with 95% confidence,
customersown only a cat. within +0.05,what proportionof bagsof Ice Melt perform
mersown more than one dos and/or cat. thejob as claimedby the manufacturer.
318 EIGHTConfidence
CHAPTER Interval
Estimation
a. How many bagsdoesthe representative needto test? amountthat the auditor determinedshouldhavebeen
What assumptionshouldbe madeconcerningthe popu- reimbursed.
lation proportion?(This is called destructivetesting;
8.80 A home furnishingsstore that sells bedroomfurni-
that is, the productbeing testedis destroyedby the test
ture is conductingan end-of-monthinventory of the beds
andis thenunavailable to be sold.)
(mattress,bed spring, and frame) in stock.An auditorfor
b. The representative tests50 bags,and42 ofthem do the
the store wants to estimatethe meanvalue of the bedsin
job as claimed.Constructa 95o/oconfidenceinterval
stock at that time. Shewantsto have99% confidencethat
estimatefor the populationproportion that will do the
her estimateof the meanvalueis correctto within +$100.
job asclaimed.
On the basisof pastexperience,sheestimatesthat the stan-
c. How can the representativeuse the results of (b) to
darddeviationofthe valueofa bed is $200.
determinewhetherto sell the Ice Melt product?
a. What samplesizeshouldsheselect?
8.78 An auditorneedsto estimatethe percentage of timesa b. Using the samplesize selectedin (a), an audit wascon-
companyfails to follow an internalcontrolprocedure.A sam- ducte4 with the following results:
ple of 50 from a populationof 1,000itemsis selecte4andin
7 instances, the internalcontrolprocedurewasnot followed. X = $1,654.27 S = $184.62
a. Constructa90% one-sided confidenceintervalestimate Constructa 99o/oconfidenceinterval estimateof the total
of thepopulationproportionof itemsin which the inter- value of the bedsin stock at the end of the month if there
nal controlprocedurewasnot followed. were258 bedsin stock.
b. If the tolerableexceptionrate is 0.15, what shouldthe
auditorconclude? 8.81 A quality characteristicof interestfor a tea-bag-
filling processis the weight of the tea in the individual
8.79 An auditorfor a governmentagencyneedsto evalu- bags.In this example,the labelweight on the packageindi-
atepaymentsfor doctors'officevisitspaid by Medicarein catesthat the meanamountis 5.5 gramsof tea in a bag.If
a particularzip code during the month of June.A total of the bags are underfille4 two problemsarise.First, cus-
25,056visitsoccurredduringJunein this area.The auditor tomersmay not be able to brew the tea to be as strongas
wantsto estimatethe total amountpaid by Medicareto they wish. Second,the companymay be in violation of the
within +$5 with 95% confidence.On the basisof past truth-in-labelinglaws. On the other hand, if the mean
experience,she believesthat the standarddeviation is amountof tea in a bag exceedsthe label weight, the com-
approximately$30. pany is giving awayproduct. Getting an exact amountof
a. What samplesizeshouldsheselect? tea in a bag is problematicbecauseof variationin the tem-
Using the samplesizeselectedin (a), an audit is conducted peratureand humidity insidethe factory,differencesin the
with the following results. densityof the tea, and the extremelyfast filling operation
Amountof Reimbursement of the machine(approximately170bagsper minute).The
following dataarethe weights,in grams,of a sampleof 50
X = $93.70 ,l = $34.55 tea bagsproducedin one hour by a singlemachine(the
dataarestoredin the file S$!!@:
In l2 of the office visits, an incorrectamountof reimburse-
ment was provided.For the 12 office visits in which there Weight of TeaBags, in Grams
was an incorrect reimbursement,the differencesbetween 5.65 s.44 5.42 5.40 s.53 5.34 5.54 s.4s 5.52 5.41
the amountreimbursedand the amountthat the auditor
determinedshouldhavebeen reimbursedwere as follows 5.57 5.40 5.53 5.54 5.s5 5.62 5.56 5.46 5.44 5.5Ii
(andare storedin the file@!@ 5.47 5.40 s.47 5.61 5.53 5.32 5.67 5.29 5.49 5.55
$17 $2s $14 -$10 $20 $40 $3s $30 $28 $22 $15 $5 5.77 5.57 5.42 5.58 5.58 5.50 5.32 5.50 5.53
b. Constructa 90% confidenceintervalestimateof the pop- 5.61 5.45 5.44 5.25 5.56 s.63 5.50 5.57 5.67
ulationproportionof reimbursementsthat containerrors.
a. Constructa 99o/oconfidenceinterval estimateof
c. Constructa 95o/oconfidenceinterval estimateof the
populationmeanweight of the teabags.
populationmeanreimbursementper office visit.
b. Is the companymeetingthe requirementsetforth on
d. Constructa 95%oconfidenceinterval estimateof the
labelthatthe meanamountof teain a bag is 5.5
populationtotal amountof reimbursements for this geo-
graphicareain June. 8.82 A manufacturingcompanyproducessteel
e. Constructa 95%oconfidenceinterval estimateof the for electricalequipment.The main componentpart of
total differencebetweenthe amountreimbursedand the housingis a steeltroughthat is madeout of a l4-gauge
Chapter
ReviewProblems 3I9
8.M 8.4298.460 8.412 8.420 8.410 8.40s 8.323 8.420 Report Writing Exercises
8.4478.4058.439 8.411 8.427 8.420 8.498 8.409 8.85 Referringto the resultsin Problem8.82on page318
concerningthe width of a steeltrough, write a report that
Constructa 95o/oconfidenceinterval estimateof the summarizesyour conclusions.
meanwidth of the troughs.
Interpretthe intervaldevelopedin (a). Team Proiect
The manufacturerof Boston and Vermontasphalt 8.86 Refer to the team project on page 73 (see the
know that productweight is a major factor in the Eil@@@ file). Constructall appropriateconfidence
's perceptionof quality. The last stageof the interval estimatesof the populationcharacteristicsof low-
line packagesthe shinglesbeforethey areplaced risk, average-risk,and high-risk mutual funds. Include
woodenpallets.Once a pallet is full (a pallet for most theseestimatesin a reportto the vice presidentfor research
holdsl6 squaresof shingles),it is weighed,andthe at the financial investmentservice.
t is recorded.The data file [@ contains
weight(in pounds)from a sampleof 368 pallets of Student Survey Database
shinglesand330palletsof Vermontshingles. 8.87 Problem1.27on page 15 describesa surveyof 50
Forthe Boston shingles,constructa 95ohconfidence undergraduate students(seethe file GEEEEffiE!E$.
intervalestimateof the meanweight. a. For thesedata,for eachvariable,constructa95%oconfi-
Forthe Vermont shingles,constructa 95Voconfidence denceinterval estimateof the populationcharacteristic.
intervalestimateof the meanweight. b. Write a reportthat summarizesyour conclusions.
Evaluatewhetherthe assumptionneededfor (a) and (b)
8.88 Problem1.27on page15 describes a surveyof 50
hasbeenseriouslyviolated.
undergraduate students(seethe fileEEE@).
Basedon theresultsof(a) and(b), whatconclusions can
a. Selecta sampleof 50 undergraduate studentsat your
you reachconcerningthe meanweight of the Boston
schooland conducta similar surveyfor thosestudents.
andVermontshingles?
b. For the data collected in (a), repeat (a) and (b) of
I The manufacturerof Boston and Vermont asphalt Problem8.87.
providesits customerswith a 20-yearwarranty on c. Comparethe resultsof (b) to thoseof Problem8.87.
ll
of its products.To determinewhethera shinglewill last
8.89 Problem1.28on page 15 describesa surveyof 50
5 longasthewarrantyperiod"acceleratedlifetestingis con-
MBA students(seethe fileEEElffiElldH).
at the manufacturingplant. Accelerated-lifetesting
fs a. For thesedata,for eachvariable,constructa95o/oconfi-
r'
)6
it would be subjectto in a
the shingleto the stresses
of normal usevia a laboratoryexperimentthat takes
denceinterval estimateof the populationcharacteristic.
b. Write a reportthat summarizesyour conclusions.
a few minutesto conduct.In this test, a shingleis
h" scrapedwith a brush for a short period oftime, 8.90 Problem1.28on page 15 describesa surveyof 50
I
andthe shingle granules removed by the brushing are MBA students(seethe file!@@l!@).
hr weighed(in grams).Shinglesthat experiencelow amounts a. Selecta sampleof 50 graduatestudentsin your MBA
br ofgranulelossareexpectedto lastlongerin normalusethan programandconducta similar surveyfor thosestudents.
gs shingles thatexperience high amountsof granuleloss.In this b. For the data collected in (a), repeat (a) and (b) of
he situation,a shingleshouldexperienceno more than 0.8 Problem8.89.
rcl gramsof granulelossif it is expectedto lastthe lengh of the c. Comparethe resultsof (b) to thoseof Problem8.89.
320 CHAPTER EIGHT ConfidenceIntervalEstimatron
Managingthe SpringvilleHerald
The marketing departmenthas been consideringways to The group agreed to use a random-digit dialing
increasethe number of new subscriptionsand increasethe method to poll 500 local householdsby telephone.Using
rate of retention among customerswho agreed to a trial this approach,the last four digits of a telephonenumberare
subscription. Following the suggestion of Assistant randomly selected to go with an area code and exchange
Manager Lauren Alfonso, the department staff designed a (the first 6 digits of a l0-digit telephonenumber). Only
survey to help determine various characteristicsof readers those pairs of area codes and exchangesthat were for the
of the newspaperwho were not home-delivery subscribers. Springville city areawere used for this survey.
The surveyconsistsof the following 10 questions: Of the 500 households selected"94 householdseither
refusedto participate,could not be contactedafter repeated
1. Do you or a memberofyour householdeverpurchase attempts, or representedtelephone numbers that were not
the Springville Hera ld? in service.The summary results are as follows:
(l)Yes (2)No
[ f t h e r e s p o n d e n ta n s w e r s n o , t h e i n t e r v i e w i s Households
That PurchasetheSpringvilleHerald Frequency
terminated.] Yes 352
2. Do you receive the Springville Herald via home No JA
delivery?
(l)Yes (2)No Householdswith Home Delivery Frequency
[f no,skipto question 4.]
Yes 136
3. Do you receivetheSpringvilleHerald:
( I ) Monday-Saturday(2) Sundayonly (3) Everyday No 216
[f everyday,skip to question9.] Type of Home Delivery Subscription Frequency
4. How often during the Monday-Saturdayperiod do
you purchasethe SpringvilleHerald? Monday-Saturday l8
(l) Everyday (2) Mostdays (3) Occasionally or never Sundayonly 25
5 . How often do you purchasetheSpringvilleHerald on 7 daysa week 93
Sundays?
PurchaseBehavior of Nonsubscribers
( l) EverySunday (2) 2-3 Sundaysper month
for Monday-Saturday Editions Frequency
(3) No morethanoncea month
6 . Whereareyou mostlikely to purchasetheSpringville Every day 78
Herald? Most days 95
(l) Convenience store (2) Newsstand/candy store Occasionally
or never 43
(3) Vendingmachine (4) Supermarket (5) Other
7 . Woufdyou considersubscribingto the Springville PurchaseBehavior of Nonsubscribers
for Sunday Editions Frequency
Heraldfor a trial periodif a discountwereoffered?
(l) Yes (2)No Every Sunday 138
[f no, skipto question 9.] 2-3 Sundaysa month 54
8 . The Springville Herald currently costs $0.50 No more than oncea month )4
Monday-Saturday and $1.50on Sunday, for a totalof
$4.50perweek.Howmuchwouldyoubewillingto pay Nonsubscribers'PurchaseLocation Frequency
perweekto gethomedeliveryfor a 90-daytnal period?
Convenience store 74
9 . Do you read a daily newspaperother than the Newsstand/candy store 95
SpringvilleHerald? Vendingmachine 21
(l)Yes (2)No
Supermarket 13
t 0 .As an incentivefor long-termsubscribers, the news- Otherlocations IJ
paperis considering the possibilityofoflering a card
thatwouldprovidediscounts at certainrestaurants in Would Consider Trial Subscription
the Springvilleareato all subscribers who pay in if Offered a Discount Frequency
advancefor six monthsof homedelivery.Wouldyou
Yes +0
wantto get sucha cardunderthe termsof this offer?
( l ) Y e s ( 2 )N o
No 170
Rcf-crenccs 321
Yes 66
No 286
Web Case
1. Cochran,W. G., Sumplingkc'hnique.r,3rd ed. (NewYork: 4. Larsen,R. L., and M. L. Marx, An Introductionto Mqth-
Wiley, 1977). ematical Statistic'sand lts Applic'ations,4th ed. (Upper
2. Fisher, R. A., and F. Yates, Sraristical Tables fiir SaddleRiver,NJ: PrenticeHall, 2006).
Biological, Agricultural and Medic'al Researc'h,5thed. 5. Microsoft Excel 2007 (Redmond,WA: Microsoft Corp.,
(Edinburgh:Oliver & Boyd, 1957). 2007).
3. Kirk, R. E., ed., Statistical Issues:A Reader /br the 6. Snedecor,G. W., and W G. Cochran"StatisticulMethods,
BehavinraI Sc'ienc'es(Beln-ront,CA: Wadsworth, 1972). 7th ed. (Arrres,IA: Iowa StateUniversityPress,1980).
322 EXCELcoMpANIoN to chaDter
8
E8.1 COMPUTINGTHE CONFIDENCE If you know the sarnplesize and sample mean of your
INTERVALESTIMATEFOR THE sample, click Sample Statistics Known and enter those
values.Otherwise,click Sample Statistics Unknown and
MEAN (o KNOWN)
enter the cell range of your sample as the Sample Cell
You compute the confidence interval estimatefor the mean Range.
(o known) either by using the PHStat2 Estimate for the
Mean, sigma known procedureor by making entries in the
g@@EEworkbook.
Usingthe CIE_SKWorksheet
Open to the CIE_SK worksheet of the
Usinq PHStat2 Estimate
workbook. This worksheet uses the NORMSINV(P<E
for tFe Mean, Sigma Known
and CONFIDENCE(l-conJidence level,population stan-
SelectPHStat ) Confidence Intervals ) Estimate for dard deviution, sample siee) functions to compute the
the Mean, sigma known. ln the Estimate for the Mean, Z value and interval half-width for the Example 8.1 mean
sigma known dialog box (shown below), enter values for paper length problem on page 288. To adapt this worksheet
the Population Standard Deviation and the Confidence to other problems, enter the appropriatepopulation stan-
Level. Click one of the input optionsand make the required dard deviation, sample mean, sample size, and confidence
entries.Enter a title as the Title and click OK. level values in the tinted cells 84 throush 87 and entera
n e w t i t l e i n c e l lA l .
I
Using the CIE_Pworksheet
II
Dd6 Data
Poprlatim $ardard DcviaHon: t-- of Trrr ProporHml
Estimatc
Sanphg Errsn
CmfirJenceLenrd:
r- 5anphrgError:
Cmf*Jence
Lwd:
lgs*qe
G*pnt Ofrims O-tp.t O$iors
I
E8.5 COMPUTINGTHE SAMPLESIZE E8.6 COMPUTINGTHE CONFIDENCE (
NEEDEDFOR ESTIMATING
THE INTERVALESTIMATEFORTHE
PROPORTION POPULATIONTOTAL
Y o u c o r n p u t et h e c o n f i d e n c e i n t e r v a l e s t i m a t ef o r t h e
You compute the sample size needed for estimating the
population total either by using the PHStat2 Estimatefor
proportion either by using the PHStat2 Determinationfor
the Population Total procedure or by making entries in the
the Proportion procedure or by making entries in the
EIEEIE[IE*orkbook.
@workbook.
and the Confidence Level. Enter a title as the Title and Click one of the inputoptionsandmakethe required I
click OK. entries. !
E8.7: Computingthe ConfidenceIntervalEstimatefor theTotal Difference 325