Escolar Documentos
Profissional Documentos
Cultura Documentos
Distributions
@ Oxford Cereals
USINGSTATISTICS
FIGURE
7.1
Types
of samples Typesof SamplesUsed
__"_""
1____
I I
N o n p r o b a b i l i tS
yamples ProbabilitySamples
r:
..,. li
,, ".,,"-..- ,'-,: lr "L'-'*-''-"".-"''''*=
='"-
-+ ii :;: -i =:S'
Judgment Ouota C h u n k C o n v e n i e n c e S i m p l e Systematic Stratified Cluster
Sample Sample Sample Sample Random Sample Sample Sample
Sample
S i mp l e R a n d o mSam ples
f the In a simple random sample, every itern from a frame has the same chanceof selectionas
every other item. ln addition,every sampleof a fixed size has the samechanceof selectionas
every other sample of that size. Simple random sampling is the most elementary random sam-
s that pling technique.It forms the basisfor the other random samplingtechniques.
maps. With simple random sampling, you use r?to representthe san,ple size and lV to represent
)scer- the frame size.You number every item in the frame from I to N. The chancethat you will select
posite any particular member of the frame on the first selectionis l/N.
You select sampleswith replacementor without replacement.Sampling with replace-
r c ' 1. 1 , ment meansthat after you selectan item, you return it to the frame, where it has the sarneprob-
ability of being selectedagain.hnaginethat you have a fishbowl containing.V businesscards.
254 CHAPTER SEVEN Sampling and Sampling Distributions
To selectthe sirnple random sample, you choose an arbitrary starting point from the table
of random numbers. One method you can use is to close your eyes and strike the table of ran-
dom numberswith a pencil. Supposeyou used this procedureand you selectedrow 06, column
05, of Table 7.1 (which is extractedfrom TableE.l) as the startingpoint. Although you can go
in any direction, in this example,you read the table from left to right, in sequencesof three dig-
its, without skipping.
T A B L E7 . 1 Column
Usinga Tableof 00000 00001 11r11 tttt2 ) ' r ) r ) ) ) ) ) 7 33333 33334
Random Numbers Row 12345 67890 12345 67890 12345 67890 12345 67890
0l 49280 88924 35779 00283 81163 072',75 89863 02348
02 61870 41657 07468 08612 98083 97349 20775 4s091
03 43898 6s923 2s078 86129 78496 97653 91550 08078
04 62993 93912 30454 84s98 56095 20664 12872 64647
05 33850 58555 51438 85507 71865 79488 76783 31708
Begin 06 97340 03364 88472 04334 63919 36394 l 10e5 92470
sefection 07 70543 29776 10087 10072 s5980 64688 68239 20461
(row 06, 08 89382 93809 00796 95945 34101 81277 66090 88872
'72142
column5) 09 37818 67140 50785 22380 16703 53362 44940
10 60430 22834 14130 96593 23298 56203 926',71 1s92s
rl 82975 661s8 84731 19436 ss190 69229 28661 r367s
12 39087 71938 403s5 s4324 08401 26299 49420 59208
13 ss700 24s86 93247 32s96 I 1865 63397 44251 43189
'7s912
14 l4ts6 23991 78643 83832 32768 18928 s7010
15 32166 53251 70654 92827 6349t 04233 33825 69662
l6 23236 73751 31888 81718 06546 83246 47651 04877
17 45794 26926 15130 82455 78305 55058 52551 47182
18 09893 20505 14225 68514 46427 56788 96297 78822
19 s4382 74598 91499 t4523 68479 27686 46162 83554
20 94750 89923 37089 20048 80336 94598 26940 36858
21 10297 34135 53140 33340 42050 82341 44104 82949
22 85157 47954 3297e 26s15 57600 40881 122s0 13142
23 l l 100 02340 12860 74691 96644 89439 28107 25815
24 36871 50775 30592 57r43 17381 68856 25853 35041
2s 23913 48357 63308 16090 51690 54607 72407 55538
Source: Partialll, extractcd.froniThe Rand Corporation. A Million Random Digits with 100,000Norrnal Deviates
(Glencoe, lL: The Free Press, 1955.1 and displayed in TableE.l in Appendi.r E.
SystematicSamples
In a systematicsample,you partition the N items in the frame into n groupsof /citems,
A/
Stratified Samples
In a stratified sample,you first subdividethe N items in the frame into separatesubpopula-
tions, or strata. A stratais defined by somecommoncharacteristic,suchas genderor yearin
school.You selecta simple randomsamplewithin eachof the strataand combinethe results
from the separatesimplerandomsamples.Stratifiedsamplingis moreefficient thaneithersim-
ple randomsamplingor systematicsamplingbecauseyou are ensuredof the representation of
items acrossthe entire population.The homogeneityof items within each stratumprovides
greaterprecisionin the estimatesof underlyingpopulationparameters.
E X A MP L E 7 . 2 SELECTINGA STRATIFIEDSAMPLE
A companywantsto selecta sampleof 32 full-time workersfrom a populationof 800 full-time
employeesin orderto estimateexpenditures from a company-sponsored dentalplan. Of the
full-time employees,25Yoaremanagersand,75Yo are nonmanagerial workers.How do you
selectthe stratified samplein order for the sampleto representthe correctpercentageof man-
agersand nonmanagerialworkers?
SOLUTION If you assumean 80% responserate,you needto send40 surveysto get the nec-
essary32 responses.The frame consistsof a listing of the namesand email addressesof all
N: 800 full-time employeesincludedin the companypersonnelfiles. Because25o/o of the full-
time employeesare managers,you first separatethe population frame into two strata:a sub-
populationlisting of all 200 managerial-levelpersonneland a separatesubpopulationlisting of
all 600 full-time nonmanagerialworkers.Becausethe first stratumconsistsof a listing of 200
managers,you assignthree-digitcode numbersfrom 001 to 200. Becausethe secondstratum
7.1: Typesof SurveySamplingMethods 257
C l u ste rS a mp les
In a cluster sample,you divide the N items in the frame into severalclustersso that eachclus-
ter is representativeof the entire population. Clusters are naturally occurring designations,
suchas counties,electiondistricts,city blocks,households,or salesterritories.You then take a
random sampleof one or more clustersand study all items in each selectedcluster.If clusters
are large,a probability-basedsampletaken from a singleclusteris all that is needed.
Clustersamplingis often more cost-effectivethan simple random sampling,particr-rlarly if
the population is spread over a wide geographic region. However, cluster sampling often
requires a larger sample size to produce results as precise as those frorn sirnple random sam-
pling or stratifiedsampling.A detaileddiscussionof systematicsampling,stratifiedsampling,
and cluster sampling procedurescan be found in reference L
oq
6
). o
assDesignation and thereforeaccommodates 200 students.It is college
(D
-a policy to fully integratestudentsby genderand class
r6 Jr. Sr. Total designation on eachfloor of eachdormitory.If theregis-
^\
p1 ^)5_ +
a
b.J 500 480 2,200 trar is able to compile a frame througha listing of all
O (rl
studentoccupantson eachfloor within eachdormitory,
:2 * o --.1 400 380 1,800
53-ro whattypeof sampleshouldyou take?Discuss.
a)2 900 860 4,000
probabilitysampleof n: 7.8 Prenumbered salesinvoicesarekeptin a salesjournal.
----.vsultsfrom the sampleto the The invoicesarenumberedfrom 0001to 5,000.
ol'tfiiT;:iflll'i.ni,ou,,, a. Beginningin row 16,column l, and proceeding hori-
m rl",r,un zontallyin TableE.1, selecta simplerandomsampleof
alphabetical listing of the namesof ltt ,r,l: 4,000regis- 50 invoicenumbers.
teredfull+ime students, what type of samplecouldyou b. Selecta systematicsampleof 50 invoicenumbers.Use
take?Discuss. the random numbersin row 20, columns5-7, as the
b. What is the advantageof selectinga simple random startingpoint for your selection.
samplein (a)? c. Are the invoicesselectedin (a) the sameas those
c. What is the advantage of selectinga systematicsample selectedin (b)?Why or why not?
in (a)?
d. If the frameavailablefrom theregistrartfiles is a listing 7.9 Supposethat 5,000salesinvoicesare sepa-
of the namesof all N: 4,000registeredfull-time stu- rated into four strata. Stratum 1 contains50
dentscompiledfrom eight separatealphabeticallists, invoices,stratum2 contains500 invoices,stra-
basedon the genderand classdesignationbreakdowns tum 3 contains1,000invoices,and stratum4 contains
shownin the classdesignation table,what type of sam- 3,450invoices.A sampleof 500 salesinvoicesis needed.
ple shouldyou take?Discuss. a. Whattype of samplingshouldyou do?Why?
e. Supposethat eachof the N : 4,000registeredfull-time b. Explain how you would carry out the samplingaccord-
students livedin oneof the20 campusdormitories.Each ing to the methodstatedin (a).
dormitorycontainsfour floors,with 50 bedsper floor, c. Why is the samplingin (a) not simplerandomsampling?
Survey Error
Even when surveysuse random probability sampling methods,they are subjectto potential
errors.Thereare four typesofsurvey errors:
r Coverageerror
r Nonresponseerror
r Samplingerror
r Measurementerror
Good survey researchdesignattemptsto reduceor minimize thesevarious types of survey
error. often at considerablecost.
Ethicallssues
Ethical considerationsarise with respectto the four types ofpotential errors that can
when designingsurveysthat useprobability samples:coverageerror, nonresponse error,
pling error,andmeasurement error.Coverageerror canresultin selectionbiasand
ethical issueif particulargroupsor individuals arepurposelyexcludedfrom the frameso
the surveyresultsare more favorableto the survey'ssponsor.Nonresponseerror can
nonresponsebias andbecomesan ethica\ issueif the sponsorknowingly designsthe
that particular grcups or individuals are less likely than othersto respond.Sampling
becomes an ethical issue if the findings are purposely presented without reference to
size and margin of error so that tfte sponsor can promote a viewpoint that might ofherwise
truly insignificant.Measurementerror becomesan ethicalissuein oneof threeways:(1) a
vey sponsorchoosesleadingquestionsthat guidethe responsesin a particulardirection;(2) an
interviewer,throughmannerismsand tone,purposelycreatesa halo effector otherwiseguides
the responsesin a particulardirection;or (3) a respondentwillfully providesfalseinformation.
Ethical issuesalso arisewhen the resultsof nonprobabilitysamplesare usedto form conclu-
sionsaboutthe entire population.When you use a nonprobabilitysamplingmethod,you needto
explainthe samplingproceduresandstatethat the resultscannotbe generalizedbeyondthe sample.
7.3: Sampling
Distribution 261
7.3 SAMPLINGDISTRIBUTIONS
In many applications,you want to make statisticalinferencesthat use statisticscalculatedfrom
samples to estimate the values of population parameters.In the next two sections,you will
learn about how the sample mean (the statistic) is used to estimatea population mean (a para-
meter) and how the sample proportion (the statistic) is used to estimatethe population propor-
tion (a parameter). Your main concern when making a statistical inference is drawing conclu-
sions about a population, not about a sample. For example, a political pollster is interestedin
the sample results only as a way of estimating the actual proportion of the votes that each can-
didate will receive from the population of voters. Likewise, as plant operations manager for
Oxford Cereals,you are only interestedin using the sample mean calculatedfrom a sample of
cerealboxes for estimatingthe mean weight containedin a population of boxes.
In practice, you select a single random sample of a predeterminedsize from the popula-
tion. The items included in the samole are determined throush the use of a random number
262 CHAPTERSEVEN Sampling Distributions
andSampling
generator,such as a table of random numbers (see Section 7.1 and Table E.l) orby usmg
Microsoft Excel (seepages281 282).
Hypothetically,to use the sample statisticto estimatethe population parameter,you should
examine every possible sample of a given size that could occur.A sampling distribution is the
distribution of the results if you actually selectedall possible samples.
FIGURE7.2
Numberof errorsmade
by a populationof four
administrativeassistants
25
N u m b e ro f E r r o r s
When you have the data from a population, you compute the mean by using Equation(7.1),
POPULATION MEAN
dividedby thepopulation
meanis thesumof thevaluesin thepopulation
Thepopulation
size.1y'.
\i r.
L,t
rt-- (7.1)
7.4: Sampling
Distribution
of theMean 263
POPUIATIOI!STANDARDDEVIATION
ltxi -r)2
i=l
(7.2'
3+2+l+4 =2.5errors
It=
O
and
- - - l.lz vrlvlJ
r4
1l
E 7.3 Administrative
Sample Assistants SampleOutcomes SampleMean
15Samples of
= 2 Administrative I Ann,Ann 3,3 Fr=3
nts from a
onofN=4
2 Ann, Bob 3,2 X2=2.5
iveAssistants Ann, Carla 3 ,I xr:2
Samplingwr'th 4 Ann, Dave 3,4 Xa = 3.5
) Bob,Ann 213 Xt:z.s
6 Bob, Bob 2,2 Xa=2
Bob, Carla 2,1 X1 = 1.5
8 Bob, Dave 2,4 x-s:3
9 Carla, Ann 1,3 Xs =2
l0 Carla,Bob 1,2 &o = 1.5
ll Carla,Carla l, I Xrr=l
t2 Carla, Dave 1,4 xtr:2's
l3 Dave,Ann 413 Xs=3.5
l4 Dave,Bob 4,2 Xu:3
l5 Dave,Carla 4,1 X6=2,5
l6 Dave, Dave 4,4 Yrc:4
ILx :2.5
FIGURE
7.3
S a m p l i n gd i s t r i b u t i o n
of the mean,based
o n a l l p o s s i b l es a m p l e s
c o n t a i n i n gt w o
a d mi n i s t r a t i v ea s s i s t a n t s
Source: Data are from
Table 7.3.
z5
M e a n N u m b e ro f E r r o r s
The value of the standard deviation of all possible sample means, called the standard
error of the mean, expresseshow the sample means vary from sample to sample. Equation
(7.3) defines the standard error of the mean when sampling with replacement or without
replacement(seepage 254) fuom large or infinite populations.
o sa
o t = -T (7.3) nu
^'ln
be
sa
Therefore, as the sample size increases,the standarderror of the mean decreasesby a fac- the
tor equal to the squareroot of the sample size.
You can also use Equation (7.3) as an approximation of the standard error of themean
when the sample is selectedwithout replacement if the sample contains less than 5% of the
entire population. Example 7.3 computes the standard error of the mean for such a situa-
tion. (See the section 7.6.pdf file on the StudentCD-ROM that accompaniesthis bookfor
the case in which more than 5% of the population is contained in a sample selectedwithout
replacement.)
Distribution
7.4: Sampling of theMean 265
o 15 15
vv--- --_-J
n 425 )
"l
Samplingfrom NormallyDistributedPopulations
Now that the conceptof a samplingdistributionhasbeenintroducedandthe standarderrorof
themeanhasbeendefine{ whatdistributionwill the samplemean,X. follow?If you aresam-
pling from a populationthat is normallydistributedwith mean,[^t,and standarddeviation,o,
regardless of the samplesize,r, the samplingdistributionof the meanis normallydistribute4
with mean,pt : p, andstandarderrorof themean,ot.
In the simplestcase,if you takesamplesof sizen: 1, eachpossiblesamplemeanis a sin-
gle valuefrom thepopulationbecause
2r' v
V -_- i = I
A _ ni _ -y
n I
7.4
FIGURE
Samplinq distributions
of the m-eanfrom 500
samplesof sizesn : 1,
2, 4',B, 16, and32
selectedfrom a normal
population
l
Howcanyoudeterminetheprobabilitythatthesampleof25boxeswillhaveameanbelow
365grams?Fromtheno,,natdistribution(Section6.2),youknowthatyoucanfindthearea
Zvahes"
;;ffi valueX by convertingto standardized
""y
X -p
/,=
o
IntheexamplesinSection6.2,youstudiedhowanysinglevalve,xdiffersfromthemean'Now'm likelihood
^"u", xi ^ayou Tant to determinethe
this example,the value involved is a sample x'
(7.4),io find the zvalte,you substitute for 1tx
X
thata samplemeanis uelow365.In Equaiion
for p, and o7 for o'
L=--
X -vx _ X - * (7.4)
oN o
4n
7.4: SamplingDistributionof theMean 267
42s
The areacorrespondingto Z: -1.00 in TableE.2 is 0.1587.Therefore,15.87%of all the pos-
sible samplesof 25 boxeshavea samplemeanbelow 365 grams.
The precedingstatementis not the sameas sayingthat a certainpercentageof individual
boxeswill havelessthan 365 gramsof cereal.You computethat percentageas follows:
x -P - 365-368 j
7 = = = -0.20
o1515
MPLE7.4 THEEFFECT
OF SAMPLESIZEn ON THECOMPUTATION
OF o;
Howis thestandard
errorofthemeanaffectedby increasing
thesamplesizefrom25to 100boxes?
SOLUTfONIf r: 100boxes, (7.3)onpage264:
thenusingEquation
o
-T 15 15
6N= -----l.J
.{n r/too lo
7=x-ItN - 3 6 5 _ 3 6 8= - 3 = _ 2 . 0 0
oy 15 1.5
ffi
FromTableE.2,thearealessthanZ: -2.00 is 0.0228.Therefore,2.28%of the samplesof 100
boxeshavemeansbelow 365 grams,as comparedwith15.87% for samplesof 25 boxes.
268 CHAPTER SEVEN Samplingand SamplingDistributions
Sometimesyou needto find the intervalthat containsa fixed proportionof the sample
means.You needto determinea distancebelow and abovethe populationmeancontaininga
specificareaof the normalcurve.FromEquation(7.4)on page266,
X -1t
Z_
o
T,
Solvingfor X resultsin Equation(7.5).
l5
x ,' = 3 6 8+ ( - 1 . 9 6 )- + = 3 6 8- 5 . 8 8= 3 6 2 . 1 2
^l't<
V--
t5
Y r = 3 6 8 + ( l . 9 6 t- : = = 3 6 8 + 5 . 8 8 = 3 7 3 . 8 8
42s
Therefore, 95o/oof all sample means based on samples of 25 boxes are between 362.12 and
3 7 3 . 8 8s r a m s .
Samplingfrom Non-NormallyDistributedPopulations-
The Central Limit Theorem
Thus far in this section,only the samplingdistributionof the meanfor a normallydistributed
populationhasbeenconsidered. However,in manyinstances, eitheryou know thatthe popula-
tion is not normallydistributedor it is unrealisticto assumethatthepopulationis normallydis-
tributed.An imporlanttheoremin statistics, theCentralLimit Theorem,dealswith thissituation.
THE CENTRALLIMITTHEOREM
The Central Limit Theorem statesthat asthe samplesize(that is, the numberof valuesin
eachsample)gelslarge enough,the samplingdistributionof the meanis approximately
normallydistributed.This is true regardlessof the shapeof the distributionof the individual
valuesin the population.
7.4:Sampling
Distribution
of theMean 269
7.5
distributionof
meanfor different
,ulations for samples
n = 2 , 5 , a n d3 0
Valuesof X
Valuesof X Valuesof X
VISUAL EXPLORATIONSExploringSamplingDistributions
.lsirjl
loolr q.td Wndod Yirudtxpbrltiffi leh AdohePDF -8r:
Nunba of tfnoN 1';;-- -:l
l*
pertaly: :J *--') t 9,E-)lil U * ) r o o ' / c- u -
I_
lr ' ralv I
/ !l ===:jJ $ %':d3;13 !3:F _-.1r-A.= J
*l
f-*,"
4 Threes
5 Fous
6 Fives
7 Sixes
I Sevens
9 Eights
10 Nines
11 Tens
i
I
i
Fves Sixes Sevens Eiahts Nines Tens ElevensTwelves
i
-*l
izo
i'1'., r\.t*",t\TwoDi(e/ lrl r l, f. i' ' i
Ready
_.t
7.4:Sampling
Distribution
of theMean 271
7.5 SAMPLINGDISTRIBUTION
OF THEPROPORTION
Considera categoricalvariablethat hasonly two categories,suchasthe customerprefersyour
brandor the customerprefersthe competitor'sbrand.Of interestis the proportionof items
belongingto one of the categories-for example,the proportionof customersthat prefersyour
brand.The populationproportion,represented by n, is the proportionof itemsin the entirepop-
ulation with the characteristicof interest.The sampleproportion,represented by p, is the pro-
portion of items in the samplewith the characteristicof interest.The sampleproportion,a sta-
tistic, is usedto estimatethe populationproportion,a parameter.To calculatethe sample
proportion,you assignthe two possibleoutcomesscoresof I or 0 to representthe presenceor
absenceof the characteristic.You then sum all the I and 0 scoresand divide by n, the sample
size.For example,if, in a sampleof five customers,threepreferredyour brandandtwo did not,
you havethree ls andtwo 0s. Summingthe three ls andtwo 0s anddividing by the samplesize
of 5 givesyou a sampleproportionof 0.60.
SAMPLEPROPORTION
X Numberof itemshavingthecharacteristic
of interest
-r \r.v,
n Samplesize
the characteristic, you assignhalf a scoreof I and assignthe other half a scoreof 0, and p is
equal to 0.5. Ifnone ofthe individualspossesses the characteristic,
you assigneacha scoreof
0 , a n d p i s e q u a lt o 0 .
While the samplemean, X, is an unbiasedestimatorof the populationmean,p, the statis-
tic p is an unbiasedestimatorof the populationproportion,fi. By analogyto the samplingdis-
tribution of the mean,the standard error of the proportion, o,,, is given in Equation(7.7).
STANDARD
ERROR
OF THEPROPORTION
n(l - n)
(7.7\
n
(7.8)
't
- 0.40 - 0.r 0
0.30 - 0.r 0
(0.40)(q{0) 6n 0.0346
200 ! ioo
= -2.89
Using Table E.2, the area under the normal curve less than -2.89 is 0.0019. Therefore,the
probabilitythat the sampleproportionis lessthan 0.30 is 0.0019-a highly unlikely event.This
rleans that if the true proportion of successes
in the populationis 0.40, less than one-fifth of
l% of the samplesof n - 200 would be expectedto have sampleproportionsof lessthan 0.30.
274 CHAPTERSEVEN Sampling
andSampling
Distributions
forecastas the winner of the election.If you selecta ran- 50 TiVo owners.
dom sampleof 100 voters,what is the probabilitythat a a. What is the probabilitythat morethanhalf the peoplei
candidatewill be forecastasthe winner when the sample watched or recordedFriends?
a. thetruepercentage of her voteis 50.1%? b. What is the probability that lessthan 25ohof the
b. the true percentageofher vote is 609io? in the samplewatchedor recordedFriends?
c. the true percentageof her vote is 49oh(and she will c. If a randomsampleof size 500 is taken,how does
actuallylosethe election)? changeyour answersto (a) and (b)?
d. If the samplesize is increasedto 400, what are your
7.34 According to Gallup's annual poll on
answersto (a) through(c)? Discuss.
finances,while mostU.S.workersreportedliving
7.31 You plan to conducta marketingexperi- ably now,manyexpecteda downturnin their lifestyle
mentin which studentsareto tasteoneof two dif- they stop working. Approximately half said they
ferentbrandsof soft drink. Their task is to cor- enoughmoneyto live comfortablynow and expectedto
rectly identify the brandtasted.You selecta randomsample so in the future (J. M. Jones,"Only Half of Non-Reti
of 200 studentsand assumethat the studentshaveno abil- Expectto be Comfortablein Retirement,"TheGallup
Summarv 275
May 2,2006).If you selecta randomsam- a. what is the probability that the samplehasbetween25%
of 200U.S.workers, and3}o/owho do not intendto work for pay at all?
whatis theprobabilitythat the samplewill havebetween b. If a current sampleof 400 Americansages50 to 70
45%and55% who saythey haveenoughmoneyto live employedfull time or part time has 35o/owho do not
comfortablynow and expectto do so in the future? intend to work for pay at all, what can you infer about
theprobabilityis 90% that the samplepercentage will thepopulationestimateof 29%?Explain.
becontainedwithin what symmetricallimits of the pop- c. If a current sampleof 100 Americansages50 to 70
ulationpercentage? employedfull time or part time has 35% who do not
theprobabilityis 95% that the samplepercentagewill intend to work for pay at all, what can you infer about
i be containedwithin what symmetricallimits of the pop- the populationestimateof 29oh?Explain.
ulationpercentage? d. Explain the differencein the resultsin (b) and (c).
Accordingto the NationalRestaurantAssociation, 7.37 The IRS discontinuedrandom audits in 1988.
of fine-dinins restaurantshave instituted policies Instead,the IRS conductsaudits on returnsdeemedques-
ictingthe useof cell phones("BusinessBullelinl' The tionableby its DiscriminantFunctionSystem(DFS), a
StreetJournal,June1,2000,p.Al). If you selecta complicatedand highly secretivecomputerizedanalysis
samoleof 100fine-dininerestaurants. system.In an attemptto reducethe proportion of "no-
whatis theprobabilitythat the samplehasbetween15% change"audits(that is, auditsthat uncoverthat no addi-
and25%that haveestablished policiesrestrictingcell tional taxesare due), the IRS only auditsreturnsthe DFS
phoneuse? scoresas highly questionable. The proportion of no-
theprobabilityis 90% that the samplepercentagewill changeaudits has risen over the years and is currently
becontainedwithin what symmetricallimits of the pop- approximately0.25 (T. Herman,"UnhappyReturns:IRS
ulationpercentage? Moves to Bring Back RandomAudits," The Wall Street
theprobabilityis 95% that the samplepercentagewill Journal,June20, 2002,p. A1). Supposethat you selecta
becontainedwithin what symmetricallimits of the pop- randomsampleof 100audits.What is the probabilitythat
ulationpercentage? the samplehas
Suppose that in January2007,you selecteda random a. between24ohand26o/ono-changeaudits?
sampleof 100fine-dining restaurantsand found that 3 I b. between20o/oand307ono-changeaudits?
hadpoliciesrestrictingthe use of cell phones.Do you c. more than30%ono-changeaudits?
thinkthatthe populationpercentagehaschanged?
7.38 Referringto Problemi.37, theIRS announced thatit
.36 An article(P.Kitchen, "RetirementPlan:To Keep plannedto resumetotally randomauditsin 2002. Suppose
ing,"Newsday,September24,2003) discussedthe that you selecta random sampleof 200 totally random
irementplans of Americansages50 to 70 who were auditsand Ihat 90ohof all the returnsfiled would result in
full time or part time. Twenty-ninepercentof the no-changeaudits.What is theprobabilitythatthe samplehas
saidthat they did not intendto work for pay at a. between89o/oand9lo/ono-changeaudits?
If you selecta randomsampleof 400 Americansages b. between85% and 95o/ono-changeaudits?
to70 employedfull time or part time, c. more lhan95o/ono-chanseaudits?
ffire@
I A S S T S
7.40 Why does the standard error of the mean
T
I decreaseas the sample size,n, increases'J
7.43 Under what circumstancesdoes the sampling distri-
bution of the proportion approximately follow the normal
distribution'l
7.41 Why doesthe samplingdistributionof the meanfol-
low a normaldistributionfor a largeenoughsamplesize, 7.44 What is the difference between probability and non-
eventhoughthepopulationmaynotbenormallydistributed? probability sampling?
Chapter
Review
Problems 277
65 to 84, recorda length of2; iffrom 85 to 94, record a 3. If a random digit between 0 and 6 is selected consider
length of 3; if from 95 to 98, recorda length of 4;if 99, the ball white; if a random digit is a 7, 8, or 9, consider
record a length of 5. the ball red.
S e l e c ts a m p l e so f n : 2 , n : 5 , a n dn : 1 0 .C o m p u t et h e S e l e c ts a m p l e so f r - 1 0 ,n : 2 5 , a n d n - 5 0 d i g i t s .I n
mean for each sample. For example, if a sample of size 2 each sample,count the number of white balls and compute
resultsin the random numbers 18 and 46, thesewould cor- the proportion of white balls in the sample. If each student
respondto lengths of0 and 1, respectively,producing a in the class selectsfive different samples for each sample
sample mean of 0.5. If each student selects five different size, a frequency distribution of the proportion of white
samplesfor each sample size, a frequency distribution of balls (for each sample size) can be developedfrom the
the sample means (for each sample size) can be developed results of the entire class.What conclusionscan you reach
from the resultsof the entire class.What conclusionscan about the sampling distribution of the proportion as the
you reach concerning the sampling distribution of the samplesize is increased?
mean as the sample size is increased?
7 . 6 5 ( C l a s s P r o j e c t ) S u p p o s et h a t s t e p 3 o f P r o b l e m
7.64 (Class Project) Using Table8.1, simulatethe selec- 7.64 usesthe following rule: "If a random digit between0
tion of different-coloredballs frorn a bowl as follows: and 8 is selected,consider the ball to be white; if a ran-
1. Start in the row correspondingto the day of the month in d o m d i g i t o f 9 i s s e l e c t e d c, o n s i d e rt h e b a l l t o b e r e d . "
which you were born. C o m p a r e a n d c o n t r a s tt h e r e s u l t s i n t h i s p r o b l e m a n d
2. Selectone-disitnurnbers. t h o s ei n P r o b l e m7 . 6 4 .
Managingthe SpringvilleHerald
Continuingits quality improvement effort first describedin sured.Assumingthat the distributionhas not
the Chapter6 "Managing the Springville Heruld" case,the changedfrom what it wasin the pastyear,what is
productiondepartmentof the newspaperhas been monitor- the probabilitythat the mean blacknessof the
ing the blacknessof the newspaperprint. As before, black- spotsis
nessis measuredon a standardscale in which the target a. lessthan1.0?
valueis 1.0.Data collectedover the past year indicatethat b. between0.95and 1.0'?
theblacknessis normally distributed,with a mean of 1.005 c. between 1.0and1.05?
anda standarddeviationof0.10. d. lessthan0.95or greaterthan1.05?
e. Supposethat the mean blacknessof today's
EXERCISE sampleof 25 spotsis 0.952.Whatconclusion
SH7.1 Each day,25 spots on the first newspaperprinted canyou makeaboutthe blacknessof the news-
are chosen,and the blacknessof the spotsis mea- paperbasedon this result?Explain.
Web Case
Dats
Nr.snber
d Svr$esr
r**'*-*'*'
Srrpls siear I
Data DbtribuHonO$ims
Sanple $ner tr LFrifsnn
[-
i-' Gersrstelictd rardsn rsl$6ers f' 5tardvdaed Nsrmd
: f Csscrde
tt
s krtvahesfromrarqe '..J
**;
vdre*CEllRa*rpr
*.*prt O$ons
itr f**t ceilconta*ns
lab*l Ttlar i**-**-*
i-"$stqram
OutgrtO$ions
Tftle: I ueb
""1
*ryqr If you want to use the Discrete option, first open to a
worksheet that contains a table of X and P(,X) values and
then select this procedure.Then select Discrete and enter
E7,2 CREATINGSIMULATED that table range as the X and P(X) Values Cell Range.
SAMPLING DISTRIBUTIONS
Youcreatesimulated sampling distributions by first using the
Using ToolPakRandom Number
ToolPakRandom Number Generation procedure to create a
Generation
worksheetof all the random samples.Then you add formulas
tocompute the samplemeansand other appropriatemeasures SelectTools ) Data Analysis. From the list that appearsin
foreachsample.You can also use the PHStat2 Sampling the Data Analysis dialog box, select Random Number
282 EXCELcoMPANIoNto chapter
7
Generation and click OK. In the Random Number To createa histogramfrom the setof samplemeansfor
Generationdialog box (shownbelow), enterthe number your simulation,entera formulathat usesthe AVERAGE
of samplesas the Number of Variables and enterthe function in a row below the cell rangethat containsthe
samplesizeof eachsampleas the Number of Random samplescreatedby the procedure. Thenusethe techniques
Numbers. Select the type of distribution from the for creatingfrequencydistributionsand histogramsdis-
Distribution drop-downlist and make entries in the cussedin the ExcelCompanionto Chapter2 to createyour
Parameters (The contentsof this area
area,as necessary. histosram.
vary accordingto the distributionchosen.)Click New
WorksheetPly andthenclick OK.
EXAMPLE 100 Samplesof SampleSize 30
from a Uniformly Distributed Population
t*rr6cr of lr|lrblas: g-Tffi Basic Excel SelectToolsI Data Analysis.Fromthelist
that appearsin the Data Analysis dialog box, select
l$mbcr of R{dom ilmlels: f c..d I
Random Number Generation and click OK. In the
Q|6ffi.*bnr thf; THdp-l RandomNumber Generationdialog box (shownat left),
Parameteri enter100 as the Number of Variablesandenter30 asthe
B$erGcrt 0 sdt Number of Random Numbers.SelectUniform fromthe
Distribution drop-downlist, click New WorksheetPly,
andthenclick OK.