Escolar Documentos
Profissional Documentos
Cultura Documentos
2009
CHAPTER 1:
INTRODUCTION
1.1 DEFINITION, NATURES AND IMPORTANCE OF STATISTICS
The word statistics is derived from the Latin word status meaning state. Early uses of
statistics involved compilation of data and graphs describing various aspects of the state or
country. The word statistics has two basic meanings. We sometimes use this word when
referring to actual numbers derived from data and the other refers to statistics as a method of
analysis.
DEFINITION: Statistics a collection of quantitative data, such as statistics of crimes, statistics
on enrollment, statistics on unemployment, and the like. Statistics is also a science, which deals
with the collection, presentation analysis, and interpretation of quantitative data.
Statistics involves much more than the simple collection, tabulation and summarizing of
data. Statistics is also a tool that helps us develop general and meaningful conclusions that go
beyond the original data. The following are some examples of the uses of statistics:
Surveys
Consumer Preference
Experiments
Sampling
Economics
NATURES OF STATISTICS
DESCRIPTIVE STATISTICS
It deals with the methods of organizing, summarizing and presenting a mass of data so
as to yield meaningful information.
INFERENTIAL STATISTICS
It deals with making generalizations about a body of data where only a part of it is
examined. This comprises methods concerned with the analysis of a subset of data
leading to predictions or inferences about the entire set of data.
Example: Determine whether the following statements use the area of descriptive statistics or
statistical inference.
1. A bowler wants to find his bowling average for the past 12 games.
2. A manager would like to predict based on previous years sales, the sales performance
of a company for the next five years.
2009
3. A politician would like to estimate, based on an opinion poll, his chance for winning in
the upcoming senatorial election.
4. A teacher wishes to determine the percentage of students who passed the examination.
5. A student wishes to determine the average monthly expenditures on school supplies for
the past five months.
6. A basketball player wants to estimate his chance of winning the most valuable player
(MVP) award based on his current season averages and the averages of his opponents.
Answer:
1.
2.
3.
4.
5.
6.
___________________
___________________
___________________
___________________
___________________
___________________
Exercise 1.1
Direction: Classify the following statements as belonging to the area of descriptive statistics or
statistical inference. Write DS for descriptive statistics and write IS for inferential statistics on
the space provided.
_________________1. Yesterdays records show that five (5) employees were absent due to
Dengue fever.
_________________2. If the present trend continues, architects will construct more
contemporary homes than colonials in the next 5 years.
_________________3. In a certain city, arsonists deliberately set 3% of all fires reported last
year.
_________________4. At least 30% of all new homes being built today are of a contemporary
design.
_________________5. As a result of a recent poll, most Filipinos are in favor of finding work
employment abroad.
_________________6. Philippines Gross Domestic Product (GDP) grows by 4.6% in 2002, 1.4
percentage higher than its 3.2% performance in 2001.
_________________7. The average grade of 10 students in English is 89.46%.
_________________8. Based from the present sales trend, it is expected that after two years,
this years sales will be doubled.
2009
_________________9. All four provinces of ARMM are among the 10 poorest provinces in the
Philippines for 2002.
_________________10. During the period 1996 to 2002, unemployment rates among women
were consistently higher compared to men except in 1999 and 2000.
1.2 DEFINITION OF SOME BASIC STATISTICAL TERMS
The following are terms commonly used in Statistics:
1. Population is the set of all individuals or entities under consideration or study. It may be a finite
or infinite collection of objects, events, or individuals, with specified class or characteristics
under consideration.
Example 1. The researcher would like to determine the average age of patients infected with
dengue fever for the month of June at Medical Center Imus.
Population: the set of all patients with dengue fever.
Example 2. The researcher would like to determine the number of BM students in CvSU-CBE.
Population: the set of all students in CvSU-CBE.
2. Variable is a characteristic of interest measurable on each and every individual in the
population, denoted by any capital letter in the English alphabet.
Types of Variable
Qualitative Variable consists of categories or attributes, which have non-numerical
characteristics.
Example: classification, year level, sex and subjects enrolled
Quantitative Variable consists of numbers representing counts or measurements.
Variable for population 1: A = age
Variable for population 2: S = sex
3.
4.
5.
6.
2009
Census Survey, most often simply referred to as census, is conducted by gathering information
from the entire population.
Sampling Survey, most often simply referred to as survey, is conducted by gathering
information only from part of the population.
Exercise 1.2.1
Direction: Identify the population, variable of interest, and type of variable in the following:
1. The dean of CIH would like to determine the average weekly allowance of BS HRM
students.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
2. The registrar of DLSU-D would like to conduct a survey on the preferred courses of 4 th
year high school students in Cavite.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
3. The dean of the CBA would like to know the number of students who are smoking.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
4. A survey by a group of students entitled Dress Code will be conducted to first year
students to determine the fashion preferences of these students.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
5. Information will be collected to new voters for 2004 election to identify their opinion
regarding politics in the Philippines.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
2009
6. From all students registered this semester, the Mathematics Department would like to
know how many students like mathematics.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
7. A study to be conducted by NGO would determine the Filipinos awareness about the
war against IRAQ.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
8. A group of students taking Statistics conducted a study on the effect of boy-girl
relationship to the academic performance of the students.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
9. Some parents would like to determine whether Counter Strike is good or bad to the
behavior of their children.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
10. The head librarian would like to identify the book/s commonly read by DLSU-D students.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
11. A statistics teacher, Ms. Nacion, would like to determine whether the number of
students in a class at the start of the semester can determine the number of failures.
Population:______________________________________________________________
Variable: _______________________________________________________________
Type of variable:_________________________________________________________
2009
Exercise 1.2.2
Direction: Identify each statement as having discrete or continuous data. Write D for discrete
and C for continuous.
_____________1. Among the 1,500,000 microcomputer chips made by Motocolla, 2 are found
to be defective.
_____________2. Yesterdays records show that 25 students were absent.
_____________3. Radar on EDSA indicated that the driver was going 150 kph when ticketed for
speeding.
_____________4. The amount of time that a taxi driver spends yielding to individual
pedestrians each year is 2.367 seconds.
_____________5. Upon completion of a diet and exercise program, Elmer weighed 12.37 lbs.
less than when he started the program.
Exercise 1.2.3
Direction: Identify which of the following quantitative data would be presented by a discrete
variable or a continuous variable. Write DV for discrete variable and write CV for continuous
variable.
_____________1. Number of students
_____________2. Time (in minutes) to finish an exam
_____________3. Distance (in km.) of school from place of residence
_____________4. Length (in cm) of fish caught
_____________5. Width of the newest brand of cellular phone
_____________6. Percentage increase in enrolment this year
_____________7. Number of enrollees
_____________8. Monthly income of 100 randomly selected persons at KADIWA Market
_____________9. Sum of points in tossing a pair of dice
_____________10. Lifetime (in years) of televisions produced by ZONY
2009
2009
CHAPTER 2:
2009
2009
TYPES OF DATA
1. Primary Data- are information collected from an original source of data, which is first-hand in
nature. Examples are data collected from interviews and surveys.
2. Secondary Data- are information collected from published or unpublished sources like books,
newspapers, and thesis.
FOUR IMPORTANT POINTS TO CONSIDER WHEN COLLECTING DATA
1. If measurements of some characteristics from people (such as height) are being obtained, better
results will be achieved if the researcher does the measuring instead of asking the respondent
for the value.
2. The method of data collection used may expedite or delay the process. Avoid a medium that
would produce low response rates.
3. Ensure that the sample size is large enough for the required purposes.
4. Ensure that the method used to collect data actually results in a sample that is representative of
the population.
METHODS OF DATA COLLECTION
1. Direct or Interview Method
The direct or interview method of data collection use at least two persons (an interviewer and
interviewee/s) exchanging information. This method will give us precise and consistent
information because clarifications can be made. Also, questions not fully understood by the
respondent, the interviewer could repeat the question until it suits the interviewees level.
However, this method is time consuming, expensive and has limited field coverage.
2. Indirect or Questionnaire Method
This is a method where written answers are given to prepared questions. This method requires
less time and is inexpensive since the questionnaires can be mailed or hand-carrried. Also, this
will give a respondent a sense of freedom in honestly answering the questions because of
anonymity.
3. Registration Method
This is a method enforced by certain laws.
4. Observation Method
This is a method, which observes the behavior of individuals or organizations in the study. This is
also used when the respondents cannot read nor write.
5. Experiment Method
This method is used when the objective of the study is to determine the cause and effect of
certain phenomena or event.
10
2009
Exercise 2.1
A. Answer the following questions:
1. What are the advantages and disadvantages of open questions?
2. What are the advantages and disadvantages of closed questions?
3. Choose the letter that corresponds to an unbiased option.
A. Anyone is free to smoke cigarettes if he wants to.
__________ Agree
_________Disagree
B. No sane person will burn money on cigarettes.
__________ Agree
_________Disagree
C. Smoking may be dangerous to your health.
__________ Agree
_________Disagree
4. Change the following questions to make it simpler and clearer.
A. What is your opinion about beauty contest being demeaning to womanhood and
against the vaunted equality of the sexes?
B. What is your mass measure in metric units?
5. Determine which of the following questions is best to ask about someones health?
A. Are you a weakling?
B. Are you bursting over with health?
C. Can you be an advertisement for a health club?
D. Are you in good health?
6. To find out how much someone likes music, which question is best to ask?
A. Do you go for the present trend-all noise and sound?
B. How much time do you spend listening to the radio?
C. Do you prefer popular music, jazz, classics, or rock?
D. Do you listen to boring classical tunes?
B. From the situations given below, identify the possible errors or difficulties that may be
encountered during data collection.
1. To research recognition of a certain brand of deodorant, you plan to conduct a
telephone survey of 1000 consumers in the Philippines. What is wrong with using
telephone directories as the population from which the sample is drawn?
2. A group of college students conduct a survey in an attempt to determine the typical
annual salary of the schools alumni. Would alumni with very low salaries be likely to
respond? How would this affect the result? Identify one other factor that might affect
the result.
3. Assume that you are hired by a company to collect data on the waist sizes of college
students. Why is it better to obtain direct measurements than to ask people sizes of
their waists?
11
2009
12
2009
stratum is taken.
Formula:
Where:
13
2009
Example: Given is the enrollment data of CvSU-CBE for first semester, SY 2009-2010.
PROGRAM
EA
PA
BSIT
420
BSCS
210
BSBM
300
BSE
40
ABPS
40
BENT
25
BSHRM
400
TOTAL(N)
1435
Select a sample of 200 students using Equal and Proportional Allocation
3. Systematic Random Sampling
Systematic sampling with a random start is a method of selecting a sample by taking every
unit from an ordered population, the first unit being selected at random. K is called the
sampling interval and the reciprocal is the sampling fraction.
Formula:
Where:
N = population
n = sample size
Non-Probability Sampling
Non-probability sampling does not involve random selection of samples. It does not necessarily
mean, however, that non-probability samples are not representative of the population. With nonprobability samples, the population may or may not be represented well, and it will often difficult to
know how well the population has been represented.
TWO TYPES OF NON-PROBABILITY SAMPLING
1. ACCIDENTAL, HAPHAZARD OR CONVENIENCE SAMPLING
This is one of the most common methods of sampling, which is primarily based on the
convenience of the researcher. Most common examples are interviews conducted frequently by
television news programs to get quick (although non-representative) reading of public opinion.
2. PURPOSIVE SAMPLING
In this type of sampling technique, samples are taken with a purpose in mind. Usually, one or
more specific predefined sought.
Purposive sampling can be very useful for situations where a target sample needs to be
reached quickly and where sampling for proportionality is not the primary concern. With a
purposive sample, it is likely to get opinions of target populations that are more readily accessible.
All of the methods that follow can be considered subcategories of purposive sampling
methods.
14
2009
Exercise 2.2
Identify the type of sampling used in the following statements.
1. An engineer selects every 50th cell phone from the assembly line for careful testing and
analysis.
__________________________________________________________________________
2. A reporter writes the name of each senator on a separate card, shuffles the cards, and
then draws five names.
__________________________________________________________________________
3. The dean at CvSU-CBE surveys all students from each of the 12 randomly selected classes.
__________________________________________________________________________
4. A reporter obtains sample data from readers who decide to mail in a questionnaire printed
in the latest issue.
__________________________________________________________________________
5. A mathematics professor selects 18 men and 18 women from each of the four classes.
__________________________________________________________________________
6. In conducting research for a psychology course, a student of CvSU-CBE interviews students
who are leaving the CBE building.
__________________________________________________________________________
15
2009
7. A reporter obtains numbered listing of the 1000 companies with the highest stock market
values, uses a computer to generate 20 random numbers between 1 and 1000, and then
interviews the chief executive officers of the companies corresponding to these numbers.
__________________________________________________________________________
8. A medical student at DLS-UMC interviews all diabetic patients in each of 15 randomly
selected hospitals in the country.
__________________________________________________________________________
9. A researcher interviews every 45th patients in the list of in-patients.
__________________________________________________________________________
10. A student interviews school principals and classroom teachers about the implementation
of the 2002 Basic Education Curriculum.
__________________________________________________________________________
2.3 Methods of Data Presentation
Methods of Presenting Data:
1. Textual Method- a narrative description of the data gathered.
2. Tabular Method- a systematic arrangement of information into columns and rows.
3. Graphical Method- an illustrative description of the data.
2.3.1 The Frequency Distribution Table (FDT)
An FDT is a statistical table showing the frequency or number of observations contained in
each of the defined classes or categories.
Parts of a Statistical Table
1. Table Heading- includes the table number and the title of the table.
2. Body- main part of the table that contains the information or figures.
3. Stubs or classes- classification or categories describing the data and usually found at the
left most side of the table.
4. Caption- designations or identifications of the information contained in a column,
usually found at the top most of the column.
16
2009
TABLE HEADING
CAPTION
STUBS/CLASSES
Perception of
Leadership Behavior
Strongly Favorable
Favorable
Slightly Favorable
Slightly Unfavorable
Unfavorable
Strongly Unfavorable
Frequency
10
11
12
14
22
31
TOTAL
BODY
100
TYPES OF FDT:
1. Qualitative or Categorical FDT is a frequency distribution table where the data are
grouped according to some qualitative characteristics; data are grouped into non
numerical categories.
Example:
TABLE 2: Frequency Distribution of the Gender Respondents of a Survey
Gender of Respondents
Frequency
Male
38
Female
62
TOTAL
100
2. Quantitative FDT is a frequency distribution table where the data are grouped according
to some numerical or quantitative characteristics.
17
2009
Example:
WEIGHT (in kilogram)
Frequency
7-9
10-12
13-15
16-18
2
8
14
19
19-21
TOTAL
50
18
2009
2. Class Mark (CM) midpoint of the class interval where the observations tend to cluster
about.
15
23
27
33
38
44
52
10
20
25
31
35
15
24
30
33
40
45
53
42
50
57
13
21
20
25
31
34
42
45
55
26
32
36
43
51 57
19
2009
It is accurate.
It is clear.
It is simple.
It has a good appearance.
20
2. The following
deficiencies:
136
125
135
137
126
2009
are the body weights (in grams) of 50 rats used in a study of vitamin
92
119
126
120
118
115
115
127
95
100
118
101
103
146
113
121
129
110
126
106
137
87
126
119
125
132
108
118
119
117
120
110
82
105
102
104
133
104
132
146
129
124
113
95
148
Construct the FDT of the given data set and write a brief report about it.
3. The following are the number of customers a restaurant served for lunch on 60
weekdays:
50
64
55
51
60
41
71
53
63
64
49
59
66
45
61
57
65
62
58
65
55
61
60
55
53
57
58
66
53
56
64
46
59
49
64
60
58
64
42
47
59
62
56
63
61
68
57
51
61
51
60
59
67
52
52
58
64
43
60
62
Construct the FDT of the given data set and write a brief report about it.
4. Construct a graph for the given FDT and write a brief interpretation.
The 2002 Purchases by A Car Rental Agency
CAR MAKER
NUMBER OF PURCHASES
1. Chevrolet Cavalier
45
2. Ford Mustang
30
3. Ford Taurus
60
4. Pontiac Grand Am
15
5. Toyota Camry
30
CHAPTER 3:
indicate the summation of, thus we can write the sum of the
observations as:
The numbers 1 and n are called the lower and upper limits of summation, respectively.
Example: Write out the following in full, that is, without summation signs:
21
2009
Rules on Summation
Rule # 1: The summation notation is distributive over addition.
22
2009
Examples:
A. Use the rules on summation to write out the expansion of the given expression:
C. Given:
Find the value of the following expressions:
Exercise 3.1
Evaluate the following:
1. Given:
23
2009
2. Given:
3. Given:
3.2
Definition: A measure of central tendency is any single value that is used to identify the
center of the data or the typical value. It is often referred to as the average.
3.2.1 The Arithmetic Mean
The most common average and sometimes simply referred to as the mean
The sum of all the values of the observations divided by the number of
observations
Denoted by a Greek letter (mu) for population
the sample mean, used to estimate the population mean , is computed as:
Examples:
24
2009
1.
The numbers of employees at 5 different stores are 4, 8, 10, 12, and 6. Find the mean
number of employees for the 5 stores.
Solution:
2.
Scores in Algebra for the first long quiz for a sample of 10 students are as follows: 84,
75, 90, 98, 88, 79, 95, 86, 93, and 89.
Solution:
If N is even:
Example: Find the median of the given data set: 75, 75, 67, 71, 72
Solution:
25
2009
It locates the point where the observation values occur with the greatest
density.
It does not always exist, and if it does, it may not be unique. A data set is said
to be unimodal if there is only one mode, bimodal if there are two modes,
trimodal if there are three modes, and so on.
It is not affected by extreme values.
It can be used for qualitative as well as quantitative data.
Examples:
Identify the mode(s) of the following data sets.
Data Set 1.
2
Blue
White Yellow
Red
Green
Blue
Orange
Data Set 2.
2
Data Set 3.
1
Data Set 4.
Red
Blue
Exercise 3.2
1.
The grades of a student on seven examinations were 85, 96, 72, 89, 95, 82 and 85.
Find the students average grade.
2.
The salaries of 4 employees were P12,000, P10,000, P15,000 and P18,000. What is
the average salary?
3.
Out of 100 numbers, 20 were 5s, 40 were 4s, 35 were 6s and 5 were 2s. Find the
mean.
4.
Find the median of the set of numbers:
5
3
7
3
8
2
1
5. Find the median of the set of numbers:
11
25
18
79
12
13
26
2009
6. A student received grades of 89, 72, 92 and 35. What is the mode of the given grades?
7. The reaction times of an individual to certain stimuli were measured by a psychologist to
be 0.23, 0.52, 0.25, 0.52, 0.26, 0.25, 0.39, and 0.22 seconds. Determine the modes of
the given reaction times.
8. The numbers of incorrect answers on a true-false test for 15 students were recorded as
follows:
2, 1, 3, 0, 1, 3, 6, 0, 3, 3, 5, 2, 1, 4, 2. Find the mean, median and mode.
9. The following are the response times in seconds of a smoke alarm after the release of
smoke from a fixed source: 12, 9, 11, 7, 9, 14, 6,10. Find the mode.
10. A bridge is designed to carry a maximum load of 150,000 pounds. Is the bridge
overloaded if it is carrying 18 vehicles having a mean weight of 4,630 pounds?
11. For three rounds of golf Peter scores 88, 79, and 82. What fourth-round score would he
need to reduce his mean score to 81 for all rounds?
12. The average IQ of 10 students in Math is 114. If 9 of the students have IQ scores of 101,
118, 128, 106, 115, 99, 118, 109 and 125. What must be the other IQ?
10
10
27
, read as
2009
10
10
, read as first
Example: From the given set of scores in a quiz, find the 4th decile or
3
11
12
18
19
Solution: Since the data is already arranged from lowest to highest then we may proceed in
finding the 4th decile.
3
11
12
18
19
3.3.3 Quartiles
Quartiles are values that divide the array into 4 equal parts. Thus,
quartile, is the value below which 25% of the values fall,..,
value which 75% of the values fall.
28
, read as first
2009
Example: From the given set of scores in a quiz, find the 3rd quartile or
3
11
12
18
19
Solution: Since the data is already arranged in ascending order, then we may proceed in finding
the 3rd quartile.
3
11
12
18
19
Exercise 3.3
1.
2.
CHAPTER 4:
MEASURES OF DISPERSION
Measures of Dispersion indicate the extent to which individual items in a series are
scattered about an average. It is used to determine the extent of the scatter so that steps may
be taken to control the existing variation. It is also used as measure of reliability of the average
value.
General Classifications of Measures of Dispersion
1. Measures of Absolute Dispersion
2. Measures of Relative Dispersion
4.1 Measures of Absolute Dispersion
29
2009
The measures of absolute dispersion are expressed in the units of the original observations. They
cannot be used to compare variations of two data sets when the averages of these data sets differ a lot
in value or when the observations differ in units of measurement.
4.1.1 The Range
The range of a set of measurement s is the difference between the largest and smallest values.
Example:
The IQ scores of 5 members of the Morales family are 108, 112, 127, 116, and 113. Find the range.
4.1.2
, where
And the population standard deviation can be estimated by the sample standard deviation s,
where
Example:
A sample of 5 households showed the following number of household members: 3, 8, 5, 4, and
4. Find the standard deviation.
4.2 Measures of Relative Dispersion
30
2009
Measures of Relative Dispersion are unitless and are used when one wishes to compare the
scatter of one distribution with another distribution.
4.2.1 Coefficient of Variation
The coefficient of variation, CV, is the ratio of the standard deviation to the mean and is
usually expressed in percentage. It is computed as
Exercise 4.1
1.
2.
3.
4.
5.
CHAPTER 5:
PROBABILITY
5.1 Random Experiments, Sample Spaces, Events
31
2009
32
2009
ways.
Example:
1. How many sample points are in the sample space when a pair of dice is tossed once?
2. Miguel is going to watch a basketball game at the Araneta Coliseum. The Coliseum has
four gates where he can enter. If he cannot pass through the same gate twice, in how
ways can he enter and leave the Coliseum?
3. How many different ways are there to arrange the letters in the word LOGIC?
5.2.2 Permutation
A permutation is an arrangement of all parts of a set of objects.
Linear Permutation
The number of permutations of n distinct objects is n! (read as n factorial).
33
2009
Example: Two raffle tickets are drawn from 20 for the first and second prizes. Find the number
of sample points in the sample space S.
Circular Permutation
The number of permutations of n distinct objects arranged in a circle is (n-1)!.
Example: In how many ways can 5 different plants be planted in a circle?
Permutation of Things Not All Different
The number of distinct permutations of n objects of which
second kind,
, of a
of
kind is
Example: How many distinct permutations can be made from the letters of the word
Mississippi?
5.2.3 Combination
A combination is a selection of r objects from n without regard to order.
Theorem: The number of combinations of n distinct objects taken r at a time is
34
2009
2.
In how many ways can 5 people be seated on a sofa, if there are only three seats
available?
3.
In how many ways can 6 children be seated in a round table?
4.
An organization sponsored a raffle where they were to sell all 500 tickets sequentially
numbered from 001 to 500. How many of the tickets
a. Has the same three digits
b. Ends in 3
c. Contain the digits 1, 2, 3 (not necessarily in that order)
5.
The geographical distribution of the hometown of 80 students of CvSU-CBE is given
as: 50 from Luzon, 10 from Visayas, and 20 from Mindanao. How many ways can three
students be selected at random such that
a. All of them come from any three places
b. No student comes from Luzon and Mindanao
6.
In how many ways can 10 individuals be selected from 25 individuals?
7.
If a student can answer any 6 questions from an exam with 10 questions, how many
ways can he answer the exam?
8.
If there are 9 horses in a race, in how many ways can they finish first, second, and
third?
9.
How many ways can 10 students line up in a food counter?
10. How many choices do we have if we are going to bet in the Lotto 6/42 draw?
11. In how many ways can 6 boys and 7 girls be seated in a row of 13 chairs? What if the
boys and the girls must alternate?
12. If in an examination consisting of 24 questions a student may omit 6, in how many
ways can he select the problems he will answer?
13. How many three-digit numbers greater than 300 can be formed from 0, 1, 2, 3, 4, 5
and 6, if each digit can be used only once?
14. How many sample points or elements are there when simultaneously a coin is tossed
once, two dice are thrown and a card is selected at random from an ordinary deck of 52
cards?
15. In how many ways can a window dresser display four shirts in a circular
arrangement?
16. How many distinct permutations can be made from the word COMMUNICATION?
17. How many ways can a manager select the top three employees based on
performance from 8 of his employees?
18. In how many ways can 10 students be split into 2 groups containing 4 and 6 students,
respectively?
19. In how many ways can the 5 starting positions in a basketball team be filled with 8
men who can play any position?
35
2009
20.
21.
22.
How many distinct permutations can be made from the letters of the word REACT?
In how many ways can a jack, a queen and a king be chosen from a deck of 52 cards?
How many ways are there to select 3 candidates from 8 equally qualified recent
graduates for openings in Intel, Philippines?
23. In how many ways can the letters in the word REPUBLIC be arranged if 2 letters are
used at a time?
5.3 Basic Concept and Properties of Probability
The probability of an event A, denoted by P(A), is the chance that the event will occur.
It is the sum of the probabilities of mutually exclusive outcomes that constitute the event. It
must satisfy the following properties:
for any event A
Example: A coin is tossed twice. What is the probability that at least one head occurs?
Theorem: If an experiment can result in any one of N different equally likely outcomes and if
exactly n of these outcomes corresponds to event A, then the probability of event A is:
Example: If a card is drawn from an ordinary deck, find the probability that it is a heart.
Solution: Let H be the event of obtaining a heart
N= 52 cards
n= number of hearts
Therefore, the probability that the card is heart, is .
Theorems on Probabilities of Events
Theorem: The Additive Rule
If A and B are two events, then
Example:
1. The probability that a student passes History is , and the probability that he passes English is .
If the probability of passing at least one course is , what is the probability that he will pass both
courses?
Solution:
36
2009
2. What is the probability of drawing a red or a face card if a card is selected from an ordinary deck
of 52 cards?
Solution:
Corollary 1:
If A and B are mutually exclusive events, then
Corollary 2:
If
Theorem:
If A and are complementary events, then
Exercises:
1. What is the probability of getting a total of 7 or 11 when a pair of dice is tossed?
37
2009
2. What is the probability of drawing an Ace or a King, if a card is randomly chosen from a deck of
52 cards?
3. The probability of passing History is , what is the probability of failing the subject?
Conditional Probability
Definition: The probability of an event B occurring when it is known that some event A has occurred is
called a conditional probability. It is defined by:
Example: A card is drawn from a standard deck. Suppose we are told that the card picked is spade. What
is the probability that the card drawn is the ace of spades?
Solution:
Thus,
*events are said to be independent since they do not copy from each other.
Therefore, the probability that Jack and Rose will both correctly answer the toughest question is .
38
2009
Exercise 5.3
1.
2.
3.
4.
5.
6.
7.
8.
If two dice are thrown and the sum of dots are noted, what is the probability that the
sum is:
a. 5
b. 11 or 8
c. At least 10
d. 5 and 6
e. Even?
If a card is drawn at random from a deck of 52 cards, what is the probability that the
card is :
a. Red
b. Even
c. Face card
d. Red or face
e. Red and face?
An organization sponsored a raffle where they were to sell all 500 tickets sequentially
numbered from 001 to 500. Find the probability that the winning number
a. Has the same three digits
b. Ends in 3
c. Contain the digits 1,2,3 (not necessarily in that order)
The geographical distribution of hometown of 80 students of CvSU-CBE is given as: 50
from Luzon, 10 from Visayas, and 20 from Mindanao. Suppose three students are
selected. Find the probability that
a. They all come from any of the three places
b. No student comes from Luzon and Mindanao
A poker hand consists of 5 cards, the order is conventionally disregarded from a wellshuffled deck of 52 cards. What is the probability of having
a. 3 Kings and 2 Queens
b. 4 red cards and 1 black card
c. 4 aces and a Jack
The probability that a student will pass Statistics is 0.50 and the probability that he
will pass English is 0.80. The probability that he will pass both is 0.60. What is the
probability that the student will pass at least one of the two subjects?
What is the probability that an even number or a 3 will appear when a fair die is
tossed?
The probability that it will rain tomorrow is 0.73. What is the probability that it will be
sunny tomorrow?
39
2009
9.
A box contains 6 red and 4 black balls. Two balls from the box are drawn one at a
time without replacement. What is the probability that the second ball is red if it is
known that the first is red?
10. The balls used in selecting the numbers for bingo carry the numbers 1,2,3,75. If one
of the balls is selected at random, what is the probability that it is a number less than or
equal to 15?
11. If three of twenty tires are defective and four of them are randomly chosen for
inspection, what is the probability that one of the defective tires will be included?
12. If an airlines records show that 468 of 600 of its jets from San Francisco to Phoenix
arrived on time, what is the probability that any one of the airlines jets from San
Francisco to Phoenix will not arrive on time?
13. Harry Potter is faced with the problem of opening a safe with 10 buttons numbered
from 0 to 9 without using his magic. The safe can be opened by pressing three buttons,
not necessarily distinct, in correct order.
a. What is the probability that Harry will hit the right combination?
b. Realizing that the probability of getting the right combination is too small Harry
thought of a way of swinging the odds in his favor. He pulverized the lead from a
pencil and blew the powder onto the buttons, revealing three buttons that have
been pressed many times before. What is the probability that Harry will be able to
open the safe in a single trial?
14. A bowl contains 15 red beads, 30 white beads, 20 blue beads, and 7 black beads. If
one of the beads is drawn at random, what are the probabilities that it will be
a. Red beads
b. White or blue beads
c. Black beads
d. Neither white nor black
15. If the probabilities are respectively, 0.92, 0.33 and 0.29 that a person vacationing in
Boracay, will visit Palawan, or both, what is the probability that a person vacationing
there will visit at least one of the two areas?
16. The probabilities that a student will get an A or a B or a C in a Math course is 0.09,
0.15, and 0.53. What is the probability that a student will get a grade lower than C?
17. If the probability of passing Statistics is 0.65, what is the probability of failing the
subject?
18. If a coin is tossed twice, what is the probability of getting a head on the second toss if
the outcome on the first toss is also a head?
19. A jeepney driver buys spare part X and Y for his vehicle at a bargain shop. The
probability that X is defective is 0.10 and the probability that Y is defective is 0.05.
Determine the following probabilities:
40
2009
CHAPTER 6:
PROBABILITY DISTRIBUTION
6.1 Concept of a Random Variable
A function whose value is a real number determined by each element in the sample space is
called a random variable. It is a quantity resulting from an experiment that, by chance, can assume
different values.
The term random variable is used to describe the value that corresponds to the outcome from a
given experiment.
A capital letter is used to denote a random variable.
A random variable may be a continuous random variable or a discrete random variable.
Example:
1. A coin is tossed three times. List down the elements of the sample space. List down the possible
values of a random variable Y, the number of heads that fall.
Solution:
S=
41
2009
2. Two balls are drawn in succession without replacement from an urn containing 4 red balls and 3
black balls. List down the elements of the sample space and possible values of a random variable
X, the number of red balls.
Solution:
3. Let W be a random variable giving the number of heads minus the number of tails in three
tosses of a coin. List the elements of a sample space for the three tosses of the coin and to each
sample point assign a value for the random variable W.
Solution:
S=
If a sample space contains a finite number of possibilities or an unending sequence with as many
elements as there are whole numbers, it is called a discrete sample space. A random variable defined
over a discrete sample space is called discrete random variable.
If a sample space contains an infinite number of possibilities equal to the number of points on a
line segment, it is called continuous sample space. A random variable defined over a continuous sample
space is called continuous random variable.
42
2009
Definition: The table or formula listing of all the possible values that a random variable can take on,
along with the associated probabilities, is called discrete probability distribution.
Note: The probabilities associated with all possible values of a discrete random variable must sum to 1.
Example:
1. A coin is tossed three times. List down the elements of the sample space. List down the possible
values of a random variable Y, the number of heads that fall.
Solution:
S=
Note:
2. Let W be a random variable giving the number of heads minus the number of tails in three
tosses of a coin. List the elements of a sample space for the three tosses of the coin and to each
sample point assign a value for the random variable W.
Solution:
S=
The discrete probability distribution of the random variable W is:
43
2009
Note:
Exercise 6.1
1.
Construct the probability distribution of the sum of the numbers when a pair of dice
is tossed.
b(x; n, p)
n
x
pxqn x ,
for x=0,1,2,.,n.
SAMPLE PROBLEM:
44
2009
1. Find the probability of obtaining exactly three 2s if an ordinary die is tossed 5 times.
Solution:
From the given problem, n = 5, x = 3,
b(3;5; ) = 5C3
0.032
Therefore, the probability of obtaining exactly three 2s when a die is tossed 5 times is
0.032.
2. The probability that a patient recovers from a rare blood disease is 0.4. If 15 people are
known to have contracted this disease, what is the probability that (a) at least 10 survive
;(b) from 3-8 survive; and (c) exactly 5 survive?
Given:
Solution:
a.
= 0.034
Therefore, the probability that at least 10 people will survive is 0.034.
b.
POISSON DISTRIBUTION
Experiments yielding numerical values of a random variable X, the number of outcomes
occurring during a given time interval or in a specified region, are often called Poisson
experiments. The given time interval may be of any length, such as a minute, a day, a week, a
month, or even a year. Hence a Poisson experiment might generate observations for the
random variable X representing the number of telephone calls per hour received by an office,
45
2009
or the number of days school is closed due to snow during winter. The specified region could be
a line segment, an area, a volume, or perhaps a piece of material. In this case X might represent
the number of typing errors per page, or the number of bacteria in a given culture.
A Poisson experiment is one that possesses the following properties:
1. The number of outcomes occurring in one time interval or specified region is independent of
the number that occur in any other disjoint time interval or region of space.
2. The probability that a single outcome will occur during a very short time interval or in a small
region is proportional to the length of the time interval or the size of the region and does
not depend on the number of outcomes occurring outside this time interval or region.
3. The probability that more than one outcome will occur in such a short time interval or fall in
such a small region is negligible.
The number X of outcomes occurring in a Poisson experiment is called a Poisson
random variable and its probability distribution is called the Poisson distribution. Since its
probabilities depend only on , the average number of outcomes occurring in the given time
interval or specified region, we shall denote them by a symbol p(x; ) .
DEFINITION:
POISSON DISTRIBUTION
The probability distribution of the Poisson random variable X, representing the number
of outcomes occurring in a given time interval or specified region, is
p x;
e x
x!
for x = 0,1,2,..,
Where is the average number of outcomes occurring in the given time interval or specified
region and e=2.71828.
EXAMPLE:
1. The average number of days school is closed due to snow during the winter in a certain
city in the eastern part of United States is 4. What is the probability that the schools in
this city will close for 6 days during a winter?
Solution:
46
2009
Given:
Thus, the probability that the schools in the United States will close in 6 days due to
winter, is 0.10.
2. The average number of field mice per acre in a 5-acre wheat field is estimated to be 10.
Find the probability that a given acre contains 12 mice.
Solution:
Given:
Therefore, the probability that a 5-acre wheat field contains 12 mice is 0.095.
Exercise 6.1.1
1. A baseball players batting average is 0.250. What is the probability that he gets exactly
1 hit in his next 5 times at bat?
2. A multiple-choice quiz has 15 questions, each with 4 possible answers of which only 1 is
the correct answer. What is the probability that sheer guesswork yields from 5 -10
correct answers?
3. The probability that a patient recovers from a delicate heart operation is 0.9. What is
the probability that exactly 5 of the next 7 patients having this operation survive?
4. A study conducted at George Washington University and the National Institute of Health
examined national attitudes about tranquilizers. The study revealed that approximately
70% believe that tranquilizers dont really cure anything, they, they just cover up the
real trouble. According to this study, what is the probability that at least 3 of the next 5
people selected at random will be that tranquilizers actually do cure the problem rather
than just cover it up?
47
2009
5. On the average a certain intersection results in 3 traffic accidents per month. What is
the probability that in any given month at this intersection (a) exactly 5 accidents will
occur? (b) less than 3 accidents will occur?
6. A secretary makes 2 errors per page on the average. What is the probability that on the
next page she makes (a) 4 or more errors? (b) no error?
7. A certain area of the eastern United States is, on the average, hit by 6 hurricanes a year.
Find the probability that in a given year this area will be hit by (a) fewer than 4
hurricanes; (b) anywhere from 6 to 8 hurricanes.
6.2
-3
-2
-1
for
and
for
constants
and
48
where
2009
2. The normal curve approaches the horizontal axis asymptotically as we proceed in either
direction away from the mean.
3. The total area under the curve and above the horizontal axis is equal to 1.
DEFINITION: The distribution of a normal random variable with mean zero and standard
deviation equal to 1is called a standard normal distribution.
If
corresponding values:
Therefore,
Examples:
1.
Let Z be a standard
normal random variable. That is,
b.
49
2009
c.
d.
2.
Let Z be a standard
normal random variable. That is
a.
b.
c.
50
2009
3.
Let X be a normal
random variable with
a.
c.
Solution:
Given:
a.
X is a
normal random variable
Therefore, the
b.
Therefore, the
c.
51
2009
Therefore, the
4.
What
is
the
What
score
If
654
students
took the examination, then how many students got a score below 60?
Solution: Given: =84, =12
a.
Therefore, the probability of an individual obtaining a score of 100 or above on this test
is 0.0918 or 9.18%.
b.
In notation form,
the statement is equivalent to:
52
2009
Therefore, the score that includes 50% of those who took the exam is 84.
c. Given: =84, =12, N= 654
The number of students who got a score lower than 60 is equal to the product of the
probability and the total number of students.
Exercise 6.2
1. Let Z be a standard normal variable. Find the following probabilities:
a.
b.
c.
d.
2. Given a normal distribution with = 82 and
a value
a. Less than 78
53
2009
b. More than 90
c. Between 75 and 80
3. The mean weight of 500 male students at a certain college is 151 pounds. And the
standard deviation is 15 pounds. Assume that the weights are normally distributed.
a. How many students weigh between 120 and 155 pounds?
b. What is the probability that a randomly selected male student weighs less than 128
pounds?
CHAPTER 7:
ESTIMATION
Objectives:
At the end of this chapter, the students are expected to:
1.
Demonstrate
knowledge and understanding of the basic concepts of estimation;
2.
Compute
and
interpret the estimates of the mean, variance and standard deviation; and
3.
Relate knowledge
of estimation to real life situations.
54
2009
Parameter
Statistic
Population mean ()
Population Standard Deviation ()
Interval Estimation of the Mean for a Single Population
Confidence Interval for , is known
If
is the mean of a random sample of size n from a population with known variance
confidence interval for is given by
Note:
55
2009
For small samples selected from nonnormal populations, we cannot expect our degree of
confidence to be accurate. However, for small samples of size
, regardless of the shape
of most population, sampling theory guarantees good results.
To compute a
is known. Since
Example:
A survey of the delivery time of 100 orders worth P20,000 from WILLIAMS PIZZA
yielded a mean of 55 minutes with a standard deviation of 12 minutes. Assuming that the
delivery time follow a normal distribution, construct a 95% confidence interval for the true
mean.
Solution:
Given:
minutes,
= 5%
we obtained:
Conclusion: The WILLIAMS PIZZA is 95% confident that the true mean delivery time is between
52.648 minutes and 57.352 minutes.
not exceed
Example:
56
2009
The heights of a random sample of 50 college students showed a mean of 174.5 cm and
a standard deviation of 6.9 cm. What can we assert with 98% confidence about the possible size
of our error if we estimate the mean height of all college students to be 174.5?
Solution:
Given: = 174.5 cm, = 6.9 cm, n= 50 students,
= 2%
Conclusion: We can therefore conclude that we are 98% confident that the sample mean differs
from the true mean height by 2.27 cm.
Example:
The monthly wage of new employees at a certain broadcasting company is said to follow
a normal distribution with a standard deviation of P1,000. How large sample would be needed
to be 99% confident that the sample mean will be within P300 of the true mean.
Solution:
Given:
= 1%
by substitution:
57
2009
Conclusion: Therefore we can conclude that the sample size should be 74 employees to be 99%
confident that the sample mean will be within P300 of the true mean wage.
is unknown
and s are the mean and standard deviation respectively, of a random sample of size
where
degrees of freedom.
Solution:
Given:
by substitution:
we obtained:
58
= 1%
2009
Conclusion: Therefore we can conclude that we are 99% confident that the true average nicotine
content of a certain brand of cigarette is within 3.2818 milligrams and 3.9182 milligrams.
Exercise 7.
1. An electrical firm manufactures light bulbs that have a length of life that is
approximately normally distributed, with a standard deviation of 40 hours. If a random
sample of 30 bulbs has an average life of 780 hours, find a 96% confidence interval for
the population mean of all bulbs produced by this firm. How large a sample is needed if
we wish to be 96% confident that our sample mean will be within 10 hours of the true
mean?
2. The contents of 7 similar containers of sulfuric acid are 9.8, 10.2, 10.4, 9.8, 10.0, 10.2
and 9.6 liters. Find a 95% confidence interval for the mean content of all such
containers, assuming an approximate normal distribution for container contents.
3. A random sample of 100 PUJ (Public utility jeep) shows that a jeepney is driven on the
average 24,500 km per year, with a standard deviation of 3,900 km.
a. Construct a 99% confidence interval for the average number of kilometer a jeepney
is driven annually.
b. What can we assert with 99% confidence about the possible size of our error if we
estimate the average number of km driven by jeepney drivers to be 23,500 km per
year?
4. Suppose that the time allotted for commercials on a primetime TV program is known to
have a normal distribution with a standard deviation of 1.5 minutes. A study of 35
showings gave an average commercial time of 10 minutes. Compute for the maximum
error. Construct a 95% confidence interval for the true mean.
5. A random sample of 12 female students in a certain dorm showed an average weekly
expenditure of P750 for snack foods, with a standard deviation of P175. Construct a 90%
confidence interval for the average amount spent each week on snack foods by female
students living in this dormitory, assuming the expenditures to be approximately
normally distributed.
6. The mean and standard deviation for the quality grade point averages of a random
sample of 28 college seniors are calculated to be 2.6 and 0.3 respectively. Find the 95%
confidence interval for the mean of the entire senior class. How large a sample is
required if we want to be 95% confident that our estimate of is not off by more than
0.05?
59
2009
7. To estimate the average serving time at a fast food restaurant, a consultant noted the
time taken by 40 counter servers to complete a standard order (consisting of 2 burgers,
2 large fries and 2 drinks). The servers averaged 78.4 seconds with a standard deviation
of 13.2 seconds to complete the orders. What can the consultant assert with 95%
confidence about the maximum error if he uses
seconds as an estimate of the
true average time required to complete this standard order?
8. A company surveyed 4400 college graduates about the lengths of time required to earn
their bachelors degrees. The mean is 5.15 years, and the standard deviation is 1.68
years. Based on these sample data, construct the 99% confidence interval for the mean
time required by all college graduates.
9. In a time-use study, 20 randomly selected managers were found to spend an average of
2.4 hours each day on paperwork. The standard deviation of the 20 observations is 1.30
hours. Construct a 95% confidence interval for the mean time spent on paperwork by
managers.
10. In a study of physical attractiveness and mental disorders 231 subjects were rated for
attractiveness, and the resulting sample mean and standard deviation are 3.94 and 0.75,
respectively. Determine the sample size necessary to estimate the sample mean,
assuming you want a 95% confidence and a margin of error of 0.05.
11. The number of incorrect answers on a true-false test for a sample of 15 students was
recorded as follows: 2, 1, 3, 0, 1, 3, 6, 0, 3, 3, 5, 2, 1, 4, 2. Estimate the variance.
12. In a study of the use of hypnosis to relieve pain, sensory ratings were measured for 16
subjects, with the results given below. Use these sample data to estimate the mean.
8.8
6.2
7.7
7.4
6.4
6.1
6.8
9.8
8.3
11.9 8.5
5.2
6.1
11.3 6.0
10.6
60