Você está na página 1de 12

ROSE ANN A.

CAYABYAB

MSC 6 TF 1:00-2:30

BSED/FILIPINO

PROF. MOGUER

1)Normal Distributions

a function that represents the distribution of many random variables as a


symmetrical bell-shaped graph. It is sometimes called the "bell curve," It is also
called the "Gaussian curve" after the mathematician Karl Friedrich Gauss.
The curve forms a bell shape and the midpoint is where the mean, median, and
mode can be found, The curve is symmetric,The curve is spread out in a
particular way

2) Why Study Normal Distribution?


Many data distributions resemble a normal curve
The sampling distribution of any distribution is a normal distribution provided the
sample size is large enough
Used in estimation and inferential statistics (e.g.,hypothesis testing)
3) Properties of a Normal Distribution

Properties of a Normal Curve

A normal distribution is a continuous probability distribution for a random variable x. The


graph of a normal distribution is called the normal curve, which has all of the following
properties:
1. The mean, median, and mode are equal.
2. The normal curve is bell-shaped and is symmetric about the mean.
3. The total area under the curve is equal to one.
4. The normal curve approaches, but never touches, the x-axis.
5. Between and + the graph is concave down and elsewhere the graph is
concave up. The points at which the graph changes concavity are called inflection
points.
The Standard Normal Distribution

The Standard Normal Distribution

Properties of the Standard Normal Distribution

1. The cumulative area is close to 0 for z-scores close to z = -3.5.


2. The cumulative area increases as the z-score increases.
3. The cumulative area for z = 0 is 1
4. The cumulative area is close to 1 for z-scores close to z = 3.5.
Properties of a Standardized Normal
Curve: Z-Scores
Convert the raw scores to z-scores
Every distribution of z-scores:
Has a mean equal to 0, a median equal to 0, and a mode equal to 0
Has a standard deviation equal to 1
Has a shape that is the same as the underlying distribution of raw scores
4) Formulas under the normal distribution

Z Score Formulas

The basic z score formula for a sample is: z = (x ) /


For example, lets say you have a test score of 190. The test has a mean () of 150 and
a standard deviation () of 25. Assuming a normal distribution, your z score would be:
z = (x ) /
= 190 150 / 25 = 1.6.
The z score tells you how many standard deviations from the mean your score is. In this
example, your score is 1.6 standard deviations above the mean.

You may also see the z score formula shown to the left. This is exactly the same
formula as z = x / , except that xx(the sample mean) is used instead of (the

population mean) and s (the sample standard deviation) is used instead of (the
population standard deviation). However, the steps for solving it are exactly the same.
Z Score Formula: Standard Error of the Mean
When you have multiple samples and want to describe the standard deviation of those
sample means (the standard error), you would use this z score formula:
z = (x ) / ( / n)
This z-score will tell you how many standard errors there are between the sample mean
and the population mean
Sample problem: In general, the mean height of women is 65 with a standard deviation
of 3.5. What is the probability of finding a random sample of 50 women with a mean
height of 70, assuming the heights are normally distributed?
z = (x ) / ( / n)
= (70 65) / (3.5/50) = 5 / 0.495 = 10.1
The key here is that were dealing with a sampling distribution of means, so we know we
have to include the standard error in the formula. We also know that 99% of values fall
within 3 standard deviations from the mean in a normal probability distribution (see 68
95 99.7 rule). Therefore, theres less than 1% probability that any sample of women will
have a mean height of 70.

the t score formula is:

Where
xx= sample mean
0 = population mean
s = sample standard deviation
n = sample size
If you have only one item in your sample, the square root in the denominator
becomes 1. This means the formula becomes:

In simple terms, the larger the t score, the larger the difference is between the
groups you are testing. Its influenced by many factors including:

How many items are in your sample.


The means of your sample.
The mean of the population from which your sample is drawn.
The standard deviation of your sample.
5) five sample of normal Distribution .
1)X is a normally normally distributed variable with mean = 30 and standard
deviation = 4. Find
a) P(x < 40)
b) P(x > 21)
c) P(30 < x < 35)
answer:
Note: What is meant here by area is the area under the standard normal curve.
a) For x = 40, the z-value z = (40 - 30) / 4 = 2.5
Hence P(x < 40) = P(z < 2.5) = [area to the left of 2.5] = 0.9938
b) For x = 21, z = (21 - 30) / 4 = -2.25
Hence P(x > 21) = P(z > -2.25) = [total area] - [area to the left of -2.25]
= 1 - 0.0122 = 0.9878
c) For x = 30 , z = (30 - 30) / 4 = 0 and for x = 35, z = (35 - 30) / 4 = 1.25
Hence P(30 < x < 35) = P(0 < z < 1.25) = [area to the left of z = 1.25] - [area to the left of
0]
= 0.8944 - 0.5 = 0.3944
2) A radar unit is used to measure speeds of cars on a motorway. The speeds are
normally distributed with a mean of 90 km/hr and a standard deviation of 10
km/hr. What is the probability that a car picked at random is travelling at more
than 100 km/hr?
Answer:
Let x be the random variable that represents the speed of cars. x has = 90 and = 10.
We have to find the probability that x is higher than 100 or P(x > 100)

For x = 100 , z = (100 - 90) / 10 = 1


P(x > 90) = P(z >, 1) = [total area] - [area to the left of z = 1]
= 1 - 0.8413 = 0.1587
The probability that a car selected at a random has a speed greater than 100 km/hr is
equal to 0.1587

3) For a certain type of computers, the length of time bewteen charges of the
battery is normally distributed with a mean of 50 hours and a standard deviation
of 15 hours. John owns one of these computers and wants to know the
probability that the length of time will be between 50 and 70 hours.
Answer:
Let x be the random variable that represents the length of time. It has a mean of 50 and
a standard deviation of 15. We have to find the probability that x is between 50 and 70
or P( 50< x < 70)
For x = 50 , z = (50 - 50) / 15 = 0
For x = 70 , z = (70 - 50) / 15 = 1.33 (rounded to 2 decimal places)
P( 50< x < 70) = P( 0< z < 1.33) = [area to the left of z = 1.33] - [area to the left of z = 0]
= 0.9082 - 0.5 = 0.4082
The probability that John's computer has a length of time between 50 and 70 hours is
equal to 0.4082.
4) Entry to a certain University is determined by a national test. The scores on
this test are normally distributed with a mean of 500 and a standard deviation of
100. Tom wants to be admitted to this university and he knows that he must score
better than at least 70% of the students who took the test. Tom takes the test and
scores 585. Will he be admitted to this university?
Answer:
Let x be the random variable that represents the scores. x is normally ditsributed with a
mean of 500 and a standard deviation of 100. The total area under the normal curve
represents the total number of students who took the test. If we multiply the values of
the areas under the curve by 100, we obtain percentages.

For x = 585 , z = (585 - 500) / 100 = 0.85


The proportion P of students who scored below 585 is given by
P = [area to the left of z = 0.85] = 0.8023 = 80.23%
Tom scored better than 80.23% of the students who took the test and he will be
admitted to this University.
5) The length of similar components produced by a company are approximated
by a normal distribution model with a mean of 5 cm and a standard deviation of
0.02 cm. If a component is chosen at random
a) what is the probability that the length of this component is between 4.98 and
5.02 cm?
b) what is the probability that the length of this component is between 4.96 and
5.04 cm?
Answer:
a) P(4.98 < x < 5.02) = P(-1 < z < 1)
= 0.6826
b) P(4.96 < x < 5.04) = P(-2 < z < 2)
= 0.9544
6) A stanine (standard nine)
What is a Stanine Score?
A stanine (standard nine) score is a way to scale scores on a nine-point scale. It can
be used to convert any test score to a single-digit score. Like z-scores and t-scores,
stanines are a way to assign a number to a member of a group, relative to all members
in that group. However, while z-scores and t-scores can be expressed with decimals like
1.2 or 3.25, stanines are always positive whole numbers from 0 to 9.
Stanines are also similar to normal distributions. You can think of these scores as a bell
curve that has been sliced up into 9 pieces. These pieces are numbered 1 through 9,
starting at the left hand section. However, where a standard normal distribution has
a mean of 0 and a standard deviation of 1, stanines have a mean of 5 and a standard
deviation of 2.

Image: UH.edu

What do Stanines mean?


A person with a score of 9 is in the top 4% of the scorers, while a person with a score of
1 is in the bottom 4%. These types of scores allow you to easily tell if a score is below
the mean (a score of 5) or above the mean.
How to Convert a Score to a Stanine
Step 1: Rank the scores from lowest to highest.
Step 2: Assign a stanine score to your scores from Step 1:
Stanine score

Percentage of scores

Bottom 4%

Next bottom 7%

Next bottom 12%

Next Bottom 17%

Middle 20%

Next top 17%

Next top 12%

Next top 7%

Top 4%

The mean lies in the middle of the fifth stanine, cutting the center 20% into two parts.
Loss of Information
Stanines are a very simple way of categorizing items into top, middle and bottom
percentages. This simplicity means that its a very imprecise way to measure anything.
Everyone in the same stanine receives the same score. For example, a person at the
bottom of the 5th is almost 20 percentage points below the person at the top of the 5th.
These differences are what is called loss of information.

7) The percentile rank

The percentile rank of a score is the percentage of scores in its frequency distribution
that are equal to or lower than it. For example, a test score that is greater than or equal
to 75% of the scores of people taking the test is said to be at the 75th percentile, where
75 is the percentile rank. In educational measurement, a range of percentile ranks, often
appearing on a score report, that shows the range within which the test takers true
percentile rank probably occurs. The true value refers to the rank the test taker would
obtain if there were no random errors involved in the testing process.

Percentile ranks are commonly used to clarify the interpretation of scores on


standardized tests. For the test theory, the percentile rank of a raw score is interpreted
as the percentages of examinees in the norm group who scored at or below the score of
interest.
Percentile ranks are not on an equal-interval scale; that is, the difference between any
two scores is not the same between any other two scores whose difference in percentile
ranks is the same. For example, 50 25 = 25 is not the same distance as 60 35 = 25
because of the bell-curve shape of the distribution. Some percentile ranks are closer to
some than others. Percentile rank 30 is closer on the bell curve to 40 than it is to 20.
The mathematical formula is
{\displaystyle {\frac {{\text{c}}_{\ell }+0.5f_{i}}{N}}\times 100\%} \frac{\text{c}_\ell + 0.5
f_i }{N} \times 100\%
where c is the count of all scores less than the score of interest, i is the frequency of
the score of interest, and N is the number of examinees in the sample. If the distribution
is normally distributed, the percentile rank can be inferred from the standard score.
8) What is a Z Test?

A Z-test is a type of hypothesis test.


Hypothesis testing is just a way for you to figure out if results from a test are valid or
repeatable. For example, if someone said they had found a new drug that cures cancer,
you would want to be sure it was probably true. A hypothesis test will tell you if its
probably true, or probably not true. A Z test, is used when your data is
approximately normally distributed.

When you can run a Z Test.


Several different types of tests are used in statistics (i.e. f test, chi square test, t test).
You would use a Z test if:

Your sample size is greater than 30. Otherwise, use a t test.


Data points should be independent from each other. In other words, one data
point isnt related or doesnt affect another data point.
Your data should be normally distributed. However, for large sample sizes (over
30) this doesnt always matter.
Your data should be randomly selected from a population, where each item has
an equal chance of being selected.
Sample sizes should be equal if at all possible.

How do I run a Z Test?


Running a Z test on your data requires five steps:
1.
2.
3.
4.
5.

State the null hypothesis and alternate hypothesis.


Choose an alpha level.
Find the critical value of z in a z table.
Calculate the z test statistic (see below).
Compare the test statistic to the critical z value and decide if you should support
or reject the null hypothesis.

You could perform all these steps by hand. For example, you could find a critical value
by hand, or calculate a z value by hand. For a step by step example, see:

One Sample Z Test.

Or for another example, watch the following video:

You could also use technology, for example:

Two sample z test in Excel.


Find a critical z value on the TI 83.
Find a critical value on the TI 89 (left-tail).

Two Proportion Z-Test

This tests for a difference in proportions. A two proportion z-test allows you to compare
two proportions to see if they are the same.

The null hypothesis (H0) for the test is that the proportions are the same.
The alternate hypothesis (H1) is that the proportions are not the same.

Sample question: lets say youre testing two flu drugs A and B. Drug A works on 41
people out of a sample of 195. Drug B works on 351 people in a sample of 605. Are the
two drugs comparable? Use a 5% alpha level.
Step 1: Find the two proportions:
P1 = 41/195 = 0.21 (thats 21%)
P2 = 351/605 = 0.58 (thats 58%).
Set these numbers aside for a moment.
Step 2: Find the overall sample proportion. The numerator will be the total number of
positive results for the two samples and the denominator is the total number of people
in the two samples.
p = (41 + 351) / (195 + 605) = 0.49.
Set this number aside for a moment.
Step 3: Insert the numbers from Step 1 and Step 2 into the test statistic formula:

Solving the formula, we get:


Z = 8.99
We need to find out if the z-score falls into the rejection region.
Step 4: Find the z-score associated with /2. Ill use the following table of known values:

The z-score associated with a 5% alpha level / 2 is 1.96.


Step 5: Compare the calculated z-score from Step 3 with the table z-score from Step 4.
If the calculated z-score is larger, you can reject the null hypothesis.
8.99 > 1.96, so we can reject the null hypothesis.

9) Statistical Significance (T-Test)


Statistical Significance (T-Test)
Statwing represents t-test results as distribution curves.
Assuming there is a large enough sample size, the difference between these two
groups is probably statistically significant.
Note: The below discusses statistical significance for the unranked Independent
Samples T-Test, the most common form of T-Test.
Definition
A t-tests statistical significance indicates whether or not the difference between two
groups averages most likely reflects a real difference in the population from which the
groups were sampled.

Example
Lets say youre interested in whether the average New Yorker spends more than the
average Kansan per month on movies.
You ask a sample of 3 people from each state about their movie spending. You might
observe a difference in those averages (like $14 for the average Kansan and $18 for the
average New Yorker). But that difference is not statistically significant; it could easily just
be random luck of which 3 people you randomly sampled that makes one group appear
to spend more money than the other. If instead you ask 300 New Yorkers and 300
Kansans and still see a big difference, that difference is less likely to be caused by the
sample being unrepresentative.
Note that if you asked 300,000 New Yorkers and 300,000 Kansans, the result would
likely be statistically significant even if the difference between the group was only a
penny. The t-tests effect size complements its statistical significance, describing the
magnitude of the difference, whether or not the difference is statistically significant.
Definition
A statistically significant t-test result is one in which a difference between two groups is
unlikely to have occurred because the sample happened to be atypical. Statistical
significance is determined by the size of the difference between the group averages, the
sample size, and the standard deviations of the groups. For practical purposes
statistical significance suggests that the two larger populations from which we sample
are actually different.

Você também pode gostar