Você está na página 1de 8

1 Please use a table to provide a concept mapping that appropriately categorizes and summarizes all key concepts of

Chapter 10.

Inference
About Means
and
Proportions
with Two
Populations
Inferences • μ1 = mean of population 1
About the • μ2 = mean of population 2
Difference • Sigma1 and Sigma2: two population standard deviations
Between Two • n1 and n2: sample size of a simple random sample selected from population 1 and 2,
Population respectively (independent simple random samples).
Means Based • Interval Estimation of μ1 - μ2
on • Point estimate of the difference between two population means: X1−X2
Independent • Standard error of X1−X2: Sigma(X1−X2)= pow((Sigma1**2)/n1+
Samples: ( Sigma2**2/n2),0.5)
Sigma1 and • Interval estimate: X1−X2 +- margin of error
Sigma2 ▪ X1−X2 +- z(alpha/2)* pow((Sigma1**2)/n1+ ( Sigma2**2/n2),0.5)
Known • Z= (X1−X2-D0)/ pow((Sigma1**2)/n1+ ( Sigma2**2/n2),0.5). In many cases,
D0 =0
Inferences
About the • Interval Estimation of μ1 - μ2
Difference • Point estimate of the difference between two population means: X1−X2
Between Two • df = pow((Sigma1**2)/n1+ ( Sigma2**2/n2),2)/ ((Sigma1**2)/n1)**2*(1/n1-1)
Population + (Sigma2**2)/n2)**2*(1/n2-1))
Means: • Standard error of X1−X2:
Sigma1 and • Interval estimate: X1−X2 +- t(alpha/2)* pow((Sigma1**2)/n1+
Sigma2 ( Sigma2**2/n2),0.5)
Unknown • t= (X1−X2-D0)/ pow((Sigma1**2)/n1+ ( Sigma2**2/n2),0.5). In many cases,
D0 =0
Inferences • p1: proportion for population 1
About the • p2: proportion for population 2
Difference • p1bar: sample proportion for a simple random sample from population 1
Between Two • p2bar: sample proportion for a simple random sample from population 2
Population • Interval Estimation of p1-p2
Proportions • Point estimator of the difference between two population proportion :p1bar-p2bar
• Standard error of p1bar-p2bar :pow((p1(1-p1))/n1 + p2(1-p2))/n2 ),0.5)
• Margin of error = z(alpha/2)* :pow((p1(1-p1))/n1 + p2(1-p2))/n2 ),0.5)
• Interval estimate: p1bar-p2bar+- z(alpha/2)* :pow((p1(1-p1))/n1 + p2(1-p2))/n2 ),0.5)

1 Please use a table to provide a concept mapping that appropriately categorizes and summarizes all key concepts of
Chapter 11.

Inference About
Population
Variances
Sampling • The sample variance
Distribution of • s**2=(summation(X1-Xbar)**2)/n-1
(n- • Whenever a simple random sample of size n is selected from a normal population,
1)s**2/Sigma**2 the sampling distribution of (n-1)*s**2/Sigma**2 has a chi-square
distribution with n-1 degrees of freedom
• Interval Estimation:
• (n-1)*s**2/chi(alpha/2)**2< =Sigma**2<=(n-1)*s**2/chi(1-alpha/2)**2
• Test statistics for hypothesis tests about a population variance: chi**2=(n-1)*
s**2/sigma**2
Inferences About • Sampling distribution of s1**2/s2**2 when sigma1**2= sigma2**2
Two Population • Whenever independent simple random samples of size n1 and n2 are selected from
Variances two normal population with equal variances, the sampling distribution of
s1**2/s2**2 has an F distribution with n1-1 degrees of freedom for the
numerator and n2-1 degrees of freedom for the denominator; s1**2 is the
sample variance for random sample of n1 items from population1. and s2**2
is the sample variance for the random sample of n2 items from population2.
• Test statistics for hypothesis tests about population variances with sigma1**2=
sigma2**2
• F= s1**2/s2**2

1 Assume “from scipy.stats import norm, t, f, chi2” has been executed in your Jupyter notebook. Use one line of Python
code to answer the following questions.
a The variance in drug weights is critical in the pharmaceutical industry. For a specific drug, with
weights measured in grams, a sample of 18 units provided a sample variance 0.36. Construct a
90% confidence interval estimate of the population variance for the weight of this drug. Construct
a 90% confidence interval estimate of the population standard deviation. Test whether the
population variance for the weight of this drug is less than 0.5 (alpha = 0.05)

90% confidence interval estimate of the population variance for the weight of this drug:

sample_var=0.36;n=18;CL=0.9;alpha=0.05
CL_Lower=(n-1)* sample_var/chi2.ppf((1+CL)/2,n-1)
CL_Upper=(n-1)* sample_var/chi2.ppf((1-CL)/2,n-1)
print(CL_Lower, CL_Upper)
= [0.22,0.70]

90% confidence interval estimate of the population standard deviation:


[CL_Lower**0.5, CL_Upper**0.5]=[0.47,0.84]

Hypothesis testing for H0: variance>=0.5, Ha: H0: variance<0.5


P_value=chi2.cdf((n-1)* sample_var/0.5,n-1)
Print(P_value)
P_value= 0.214
As P_value >0.05 so we accept null hypothesis and therefore we conclude that the population variance for the
weight of this drug is greater than or equal to 0.5

a Investors commonly use the standard deviation of the monthly percentage return for a mutual fund as a
measure of the risk for the fund; in such cases, a fund that has a larger standard deviation is
considered more risky than a fund with a lower standard deviation. The standard deviation for the
American Century Equity Growth fund and the standard deviation for the Fidelity Growth
Discovery fund were recently reported to be 15.0% and 18.9%, respectively. Assume that each of
these standard deviations is based on a sample of 60 months of returns. Do the sample results
support the conclusion that the Fidelity fund has a larger population variance than the American
Century fund? Which fund is more risky?

s1=0.15;s2=0.189;n1=60;n2=60;alpha=0.05

H0: Sigma1**2>=Sigma2**2
Ha: Sigma1**2<Sigma2**2

F= s1**2/s2**2
print(F)
F= 0.629

P_value= f.cdf(F,n1-1,n2-1)
print(P_value)
P_value= 0.039

For 95% confidence:


As P_value<0.05 so we reject null hypothesis and we conclude that the standard deviation for the American
Century Equity Growth fund is less than the standard deviation for the Fidelity Growth Discovery fund so
American Century Equity Growth fund is less risky

For 99% confidence:


As P_value>0.01 so we accept null hypothesis and we conclude that the standard deviation for the American
Century Equity Growth fund is more than the standard deviation for the Fidelity Growth Discovery fund so
American Century Equity Growth fund is more risky

1 Par, Inc., is a major manufacturer of golf equipment. Management believes that Par’s market share could be
increased with the introduction of a cut-resistant, longer-lasting golf ball. There- fore, the research group
at Par has been investigating a new golf ball coating designed to resist cuts and provide a more durable
ball. The tests with the coating have been promising. One of the researchers voiced concern about the
effect of the new coating on driving distances. Par would like the new cut-resistant ball to offer driving
distances comparable to those of the current-model golf ball. To compare the driving distances for the
two balls, 40 balls of both the new and current models were subjected to distance tests. The testing was
performed with a mechanical hitting machine so that any difference between the mean distances for the
two models could be attributed to a difference in the two models. The results of the tests, with distances
measured to the nearest yard, follow. These data are below. Complete the following.
a Formulate and present the rationale for a hypothesis test that Par could use to compare the driving
distances of the current and new golf balls.

From the given information, μ1 is the population means driving distance for the current golf ball and μ2 is the
population means driving distance for the new golf ball.
H0: μ1 = μ2
Hα: μ1 is not equal to μ2
Test statistics: Two sample t-test

a Analyze the data to provide the hypothesis testing conclusion. What is the p-value for your test? What
is your recommendation for Par, Inc.?

def se_for_two_means(s1,s2,n1,n2):
return pow(s1*s1/n1 +s2*s2/n2,0.5)
x1=270.28;s1=8.76;n1=40;x2=267.50;s2=9.90;n2=40
SE=se_for_two_means(s1,s2,n1,n2)
t_value=(270.28-267.50)/SE
t_value=1.33
p_value= 2*norm.cdf(-1.33)= 0.18
As p_value>0.05 so we accept null hypothesis
So we conclude that population means driving distances of the current and new golf balls are equal

a Provide descriptive statistical summaries of the data for each model.


For first model :
mean=270.28
standard deviation=8.76
Variance= 76.68
Count=40
For second model:
mean=267.50
Standard deviation=9.90
Variance= 97.94
Count=40

a What is the 95% confidence interval for the population mean driving distance of each model, and what
is the 95% confidence interval for the difference between the means of the two populations?

For current:

Since there 20 data values in the sample so degree of freedom is


df=20-1=19
mean=270.28
standard deviation=8.76
t_critical = t.ppf(1-0.05/2,19)= 2.09
95% confidence interval for current model:
[270.28-2.09*8.76/pow(40,0.5),270.28+2.09*8.76/pow(40,0.5)]= [267.38, 273.17]

For new:
Since there 20 data values in the sample so degree of freedom is
df=20-1=19
mean=267.50
Standard deviation=9.90
t_critical = t.ppf(1-0.05/2,19)= 2.09
95% confidence interval for new model:
[267.50-2.09*9.9/pow(40,0.5),267.50+2.09*9.9/pow(40,0.5)]= [264.22, 270.77]

def se_for_two_means(s1,s2,n1,n2):
return pow(s1*s1/n1 +s2*s2/n2,0.5)
x1=270.28;s1=8.76;n1=40;x2=267.50;s2=9.90;n2=40

SE=se_for_two_means(s1,s2,n1,n2)
df=19
t=t.ppf(1-alpha/2,df)
MOE=t*SE

So required confidence interval is:-


[x1-x2-MOE,x1-x2+MOE]= [-1.38, 6.94]

a Do you see a need for larger sample sizes and more testing with the golf balls? Discuss

Yes, we need for larger sample sizes and more testing with the golf balls

Você também pode gostar