Escolar Documentos
Profissional Documentos
Cultura Documentos
1-1
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1-2
What is Statistics?
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1
10/17/2015
1-3
When you have completed this chapter, you will be able to:
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1-4
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
2
10/17/2015
1-5
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1-6
What is Meant by Statistics?
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
3
10/17/2015
1-7
Example of Statistics
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
collecting
organizing
What
presenting data
is drawing inferences
Meant from a sample of
information
by about an entire population
Statistics? as well as
predicting and
developing policy analysis
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
4
10/17/2015
1-9
Question: What Statistics?
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1 - 10
Question: What Statistics?
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
5
10/17/2015
1 - 11
Question: What Statistics?
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1 - 12
in everyday life
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
6
10/17/2015
Marketers Investors
Accountants Economists
Sports people
Consumers
Hospitals Statisticians
Quality Controllers
Educators Politicians Physicians
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
Weather
Forecasters
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
7
10/17/2015
1 - 15
Types of
Statistics
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
Types of Statistics 1 - 16
Descriptive Inferential
8
10/17/2015
Descriptive Inferential
A. A Gallup poll found that 83%
of the people in a survey knew
which country won the gold
medal in Men’s Hockey in 2002.
B. The accounting department of
a firm will select a sample of
invoices to check for accuracy of
all the invoices of the company.
C. Wine tasters sip a few drops
of wine to make a decision
with respect to all the wine
waiting to be released for sale.
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1 - 18
The Method of
Experimentation
9
10/17/2015
1 - 19
1 - 20
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
10
10/17/2015
1 - 21
A population is a
collection of
all possible individuals,
objects,
or
measurements of interest
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1 - 22
From
Take a
11
10/17/2015
1 - 23
Take a
Measurement
for each one
Record
in the sample
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1 - 24
12
10/17/2015
1 - 25
1 - 26
Tables
… are an efficient method of displaying data
and depicting data accurately.
e.g.
13
10/17/2015
1 - 27
Line
Pie
Bar
More on these in chapter 2…
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1 - 28
More in chapter 8 …
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
14
10/17/2015
1 - 29
Example of Using Sample
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1 - 30
Example of Using Sample
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
15
10/17/2015
1 - 31
Example of Using Sample
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1 - 32
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
16
10/17/2015
1 - 33
Question: What Decision?
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1 - 34
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
17
10/17/2015
Types of Data 1 - 35
A Variable
a characteristic
of
a population or sample
that is of interest to us
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
Types of Data 1 - 36
Variables
Qualitative Quantitative
Categorical Numerical
Observations Observations
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
18
10/17/2015
Variables 1 - 37
Qualitative – or Attribute
Country of Birth
U.K.
Eye Colour
Germany
Blue
Gender Taiwan
Brown
Male China
Hazel
Female India
Green
Japan
Red
Russia
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
Variables 1 - 38
Quantitative – Numeric
Minutes to end Number of
of Class Two-Door
55 Garages Number of
Number of Satisfied
45 in a Street Maple Leafs Fans
Children in
30
5 a Family 10
20 0
0 1 30 20
2 40 30
3 ... 40
4 …
…
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
19
10/17/2015
Variables 1 - 39
Characteristics
… can only assume certain values
Discrete and
there are usually “gaps” between values
e.g. - Number of bedrooms in a house
- Number of hammers sold (1,2,3,…etc)
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
Variables 1 - 40
20
10/17/2015
1 - 42
Summary of Types of Variables
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1 - 43
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
21
10/17/2015
1 - 44
Nominal
Ordinal
Interval
Ratio
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1 - 45
Nominal
Data can only be classified into categories or counted
and cannot be arranged in any particular order
Example M & Ms
Category: Candy
Classification: By Colour only
(No natural order)
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
22
10/17/2015
1 - 46
Nominal
Example M & Ms
Mutually Exclusive:
…where an individual, object, or measurement is
included in ONLY ONE CATEGORY
Exhaustive:
…where each individual, object, or measurement
MUST APPEAR in one of the categories
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1 - 47
Ordinal
…involves data arranged in some order,
but
the differences between data values
cannot be determined or are meaningless!
Example During a taste test of 4 soft drinks:
Mello Yello was ranked number……..…. 1.
Sprite number……………………………. 2.
Seven Up number..…………………..…... 3.
Orange Crush number ….……………….4.
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
23
10/17/2015
1 - 48
Interval
…similar to the Ordinal Level,
with the additional property
that meaningful amounts of differences between data
values can be determined.
There is no natural zero point
Example
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1 - 49
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
24
10/17/2015
1 - 50
Ratio
…the Interval Level with an inherent zero
starting point.
Differences and ratios are meaningful
for this level of measurement.
Examples
Monthly income of surgeons
1 - 51
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
25
10/17/2015
1 - 52
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
Sources of Statistical 1 - 53
Information
Published Data
Statistical Abstracts
Weather
Sports
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
26
10/17/2015
Sources of Statistical 1 - 54
Information
www.bps.go.id
www.bankofcanada.ca
Government of
Internet Canada & Provinces www.gc.ca
www.theweathernetwork.com
www.mcgrawhill.ca/college/lind
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
Sources of Statistical 1 - 56
Information
Commissioned surveys:
27
10/17/2015
1 - 57
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
Caution 1 - 58
For Example
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
28
10/17/2015
Caution 1 - 59
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1 - 60
Chart 1-11A
1986 - 1999
3200
3000
2800
2600
2400
2200
2000
0 1 2 3 4 5 6 7 8 9 10 11 12
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
29
10/17/2015
1 - 61
Chart 1-11B
1986 -1999
3000
2500
2000
1500
1000
500
0
7 8 9 10 11 12
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1 - 62
Chart 1-11C
1986 -1999
3200
3000
2800
2600
2400
2200
2000
7 8 9 10 11 12
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
30
2-1
f requency Distributions
Describing Data
Graphic Presentations
Copyright©©2003
Copyright 2004by
byThe
TheMcGraw-Hill
McGraw-HillCompanies,
Companies,Inc.
Inc.All rights reserved. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
2-2
When you have completed this chapter, you will be able to:
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
1
2-3
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
2-4
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
2
2-5
A Frequency Distribution is a
grouping of data into
non-overlapping classes
(mutually exclusive)…
showing the
number of observations
in each category
or class.
2-6
3
2-7
2-8
There are five steps
that can be used to
Construct a Frequency Distribution:
4
2-9
Decide how many classes you wish to use
Rule of Thumb:
For most data sets, you would want
between 3 and 12 classes!
Use the 2 to the K rule.
Choose k so that 2 raised to the power of k is greater
than the number of data points (n) or 30.
In this 2k = 30 students
case…
25 = 32, so use k = about 5 classes
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
2 - 10
Determine the class width
5
Set up the individual class limits 2 - 11
2 - 12
Tally the items into the classes
6
2 - 13
Count the number of items in each class
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
2 - 14
Using different limits
…will give you a different distribution, e.g.
Hours Studying x Frequency f
7.5 to under 12.5 1
12.5 to under 17.5 12
17.5 to under 22.5 10
30
22.5 to under 27.5 5
27.5 to under 32.5 1
32.5 to under 37.5 1
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
7
2 - 19
Relative Frequency
Distribution
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
8
2 - 21
Using different limits
2 - 22
Stem-and-leaf Displays
A statistical technique for displaying a set of data.
Each numerical value is divided into two parts:
1. the leading digits become the stem and
2. the trailing digits become the leaf.
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
9
2 - 23
Stem-and-leaf Displays
2 - 24
Stem-and-leaf Displays
10
2 - 25
Stem-and-leaf Displays
86, 79, 92, 84, 69, 88,
91, 83, 96, 78, 82, 85.
Split Stem Leaf
Now, list the remaining
669 6 9 ‘leaf’ scores!
778 7 8 9
882 8 2 3 4 5 6 8
991 9 1 2 6
All 12 Scores
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
2 - 26
11
2 - 29
Graphic
Presentation of a
Frequency
Distribution
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
Graphic Presentation of a 2 - 30
Frequency Distribution
Histograms
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
12
Graphic Presentation of a 2 - 31
Frequency Distribution
A Histogram
is a graph in which the
classes are marked on
the horizontal axis and
Frequency
the class frequencies on
the vertical axis
The class frequencies
are represented by the
heights of the bars and
the bars are drawn Class
adjacent to each other.
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
Graphic Presentation of a 2 - 32
Frequency Distribution
Histogram
14
Hours Studying x f
12
10.0 to under 15 7 10
15.0 to under 20 12 8
20.0 to under 25 7 6
25.0 to under 30 3
4
30.0 to under 35 1
2
0 10 15 20 25 30 35
Hours spent studying
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
13
Graphic Presentation of a 2 - 33
Frequency Distribution
14
A frequency polygon
12
consists of line segments 10
connecting the points 8
formed by 6
4
the class midpoint and
2
the class frequency. 0
distribution 35
30
is used to determine how 25
many or what proportion 20
15
of the data values are 10
below or above 5
0
a certain value. 10 15 20 25 30 35
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
2 - 34
Making a
Histogram
in Excel
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
14
Using 2 - 35
See
Click on HISTOGRAM
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
Using 2 - 36
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
15
Using 2 - 37
Using 2 - 38
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
16
Using 2 - 39
17
Cumulative Frequency Distribution 2 - 41
For Hours Studying
Cumulative
Hours Studying xf Hours Studying f
10.0 to under 15 7 under 15 7
15.0 to under 20 12 under 20 19
20.0 to under 25 7 under 25 26
25.0 to under 30 3 under 30 29
30.0 to under 35 1 under 35 30
Graph…..
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
Hours Studying
35
Cumulative 30
f 25
under 15 7 20
under 20 19 15
under 25 26 10
under 30 29 5
under 35 30
0 10 15 20 25 30 35
Hours spent studying
Notice that the limits are the plotted points.
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
18
2 - 43
Pie
Bar
Line
… used primarily for Qualitative Data
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
2 - 44
Pie
…is useful for displaying a
Relative Frequency Distribution
A circle is divided
proportionally to the
relative frequency
and portions of the circle
are allocated for the
different groups.
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
19
2 - 45
Pie
200 runners were asked to indicate their
favourite type of running shoe.
Type # of runners selecting:
Nike 92
Adidas 49
Reebok 37
Asics 13
Other 9
Draw a pie chart based on this information.
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
2 - 46
Pie
Relative Frequency Distribution
for the running shoes
Type # % Asics
Reebok Other
Nike 92 46.0 6.5%
Adidas 49 24.5 18.5% 4.5%
Reebok 37 18.5
Asics 13 6.5 Adidas 24.5% 46.0% Nike
Other 9 4.5
200 100
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
20
2 - 47
Pie
Type # %
Nike 92 46.0
Adidas 49 24.5
Reebok 37 18.5
Asics 13 6.5
Other 9 4.5
200 100
Bar 2 - 48
Examples of…
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
21
Bar 2 - 49
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
Bar 2 - 50
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
22
Bar 2 - 51
Other formats…
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
Canadian City
Employment
Rate
Bar 2 - 52
Victoria 57.7
Vancouver 61.4
Edmonton 67.1
Employment Rate in Canadian Cities
70
Winnipeg 66.7
68
Saskatoon 63.7
66
% employment
Regina 67.4
64
Thunder Bay 61.0
62
London 63.3
60
Kitchener 66.0
58
Hamilton 63.2
56
Toronto 65.1
54
Quebec 59.7
52
Sherbrooke 59.2
Montreal 60.4
Halifax 60.5
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
23
Canadian City
Employment
Rate
Bar 2 - 53
Victoria 57.7
Vancouver 61.4
Edmonton 67.1
Employment Rate in Canadian Cities
Winnipeg 66.7
70
- by Province
Saskatoon 63.7
68
Regina 67.4
% employment
66
Thunder Bay 61.0 64
London 63.3 62
Kitchener 66.0 60
Hamilton 63.2 58
Toronto 65.1 56
54
Quebec 59.7
52
Sherbrooke 59.2
Montreal 60.4
Halifax 60.5
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
Bar 2 - 54
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
24
Clustered Bar 2 - 55
100
80
60
40
20
0
Stacked Bar 2 - 56
100
80 % males
60
40 % females
20
0
1987-88 1997-98
Copyright © 2004 by The McGraw-Hill Companies, Inc. Data Source: Statistics Canada
Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
25
2 - 57
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
2 - 58
Shapes
of Moda
Clas
ls
Histograms
26
2 - 59
…a balanced effect!
Both ‘balanced’ or
‘have symmetry’
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
2 - 60
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
27
Modal
2 - 61
Class
A modal class is the one with the
largest number of observations
Modal
2 - 62
Class
biModal
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
28
2 - 63
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
Line 2 - 64
3-D In combination
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
29
Line 2 - 65
Time Plot
M oMn ot h
n th
l yly SSt te
e ee ll PPro
r oddu uc tio
c t ino n
8.5
7.5
6.5
5.5
Mo n th J F MAM J J A S O N D J F MAM J J A S O N D J F MAM J J A S O
Line 2 - 66
66
64
62
60
58
56
54
52
Preparing a Line Chart for this type of data is not overly useful!
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
30
Line 2 - 67
Line 2 - 68
0.2
0.5
0.1
0.0 0.0
0 10 20 30 40 50 0 10 20 30 40 50
Sales Sales
Copyright © 2004 by The McGraw-Hill Companies, Inc. Dr. Said Mirza Pahlevi – STMIK Pascasarjana Nusa Mandiri
31
30/10/2015
3-1
3-2
When you have completed this chapter, you will be able to:
1. Calculate the arithmetic mean, the weighted mean, the median,
the mode, and the geometric mean of a given data set.
1
30/10/2015
Central Tendency
arithmetic mean median mode
weighted mean geometric mean
It is unique
2
30/10/2015
Formula x
m =
N
m … is the population mean
(pronounced mu)
N … is the total number of observations
x … is a particular value
S … indicates the operation of adding
(sigma)
3-6
Terminology
Parameter
…is a measurable characteristic of a
Population
Statistic
…is a measurable characteristic of a
Sample
3
30/10/2015
Formula x
m =
N
The Kiers family
owns four cars. Find the mean
The following is mileage for the cars.
the current mileage
on each of the four
cars: 56000 + 23000 + 42000 + 73000
= 4
56,000 23,000
42,000 73,000 = 48 500
Formula x
x =
n
x …is the sample mean (read “x bar”)
n … is the number of sample observations
x … is a particular value
S … indicates the operation of adding
(sigma)
4
30/10/2015
3-9
Formula x =
x =
14 + 15 + 17 + 16 + 15
5
n
= 77 / 5 = 15.4
The average bonus given last year was $15 400
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
3 - 10
Properties of an
Arithmetic Mean
…Every set of interval-level and ratio-
level data has a mean
… All the values are included in
computing the mean
…A set of data has a unique mean
5
30/10/2015
3 - 11
Arithmetic Mean
as a Balance Point
Illustrate the mean of the values 3, 8 and 4.
= 15 / 3 =5
3 - 12
The mean is affected by unusually
large or small data values
6
30/10/2015
3 - 13
Determining
5
the Mean
in Excel
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Using 3 - 14
See
Click on DATA
ANALYSIS
See…
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
7
30/10/2015
Using 3 - 15
See
Using 3 - 16
INPUT NEEDS
See
A3:A42
See…
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
8
30/10/2015
Using 3 - 17
See Solution
Alternate solution…
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Using 3 - 18
CLICK ON
See…
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
9
30/10/2015
Using 3 - 19
See…
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Using 3 - 20
See
See…
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
10
30/10/2015
Using 3 - 21
See
3 - 22
Weighted Mean
The weighted mean of a set of numbers
x1, x2, ... xn,
with corresponding weights w1, w2, ...,wn,
is computed from the following formula:
w1 x1 + w2 x2 + ... + wn xn
mw =
w1 + w2 + w3 + ... + wn xn
11
30/10/2015
3 - 23
Weighted Mean
3 - 24
12
30/10/2015
3 - 25
The Median
3 - 26
The Median
13
30/10/2015
3 - 27
3 - 28
Properties of the Median
14
30/10/2015
3 - 29
The Mode
The Mode
is the value of the observation
that appears most frequently used
3 - 30
EXAMPLE
15
30/10/2015
3 - 31
Disadvantages
3 - 32
Geometric Mean
The Geometric Mean (GM) of a
set of n numbers is defined as the
nth root of the product of the n numbers.
The geometric mean is used to average
percents, indexes, and relatives.
The formula is:
GM = n ( x 1 )( x 2 )( x 3 ). .. ( xn )
16
30/10/2015
3 - 33
Geometric Mean
The interest rate on three bonds was
5, 21, and 4 percent
The Geometric Mean is:
GM = 3 ( 5 )( 21 )( 4 ) = 7 . 49
The arithmetic mean is (5+21+4)/3 =10.0
The GM gives a more conservative profit figure
because it is not heavily weighted
by the rate of 21percent
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
3 - 34
Geometric Mean
continued…
Another use of the geometric mean is to determine the
percent increase in sales,
production or other business or economic series
from one time period to another.
The formula is:
17
30/10/2015
3 - 35
Geometric Mean
continued…
835 , 000
GM = 8 1 = . 0127
755 , 000
i.e. the Geometric Mean rate of increase is 1.27%.
3 - 36
EXAMPLE
18
30/10/2015
3 - 37
Determining the
Median, Mode or
Geometric Mean
in Excel
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
3 - 38
Using
Click
DATA ANALYSIS
See…
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
19
30/10/2015
Using 3 - 39
Highlight
DESCRIPTIVE
STATISTICS
INPUT NEEDS
SUMMARY
STATISTICS
See SOLUTION
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Using 3 - 40
Solution
The
geometric mean
doesn’t show up in
summary
statistics!
Alternate solution…
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
20
30/10/2015
Using 3 - 41
CLICK ON
See…
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Using 3 - 42
SCROLL DOWN
to
STATISTICAL
See…
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
21
30/10/2015
Using 3 - 43
HIGHLIGHT
MEDIAN
IN RIGHT MENU
See
OR…
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Using 3 - 44
HIGHLIGHT
GEOMETRIC MEAN
See IN RIGHT MENU
OR…
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
22
30/10/2015
Using 3 - 45
HIGHLIGHT
MODE
IN RIGHT MENU
See
See…
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Using 3 - 46
23
30/10/2015
3 - 48
24
30/10/2015
Using 3 - 49
3 - 50
The Mean
of Grouped Data
25
30/10/2015
3 - 51
The Mean
of Grouped Data
3 - 52
26
30/10/2015
3 - 53
The Mean
of Grouped Data
The Mean x = S fM
3 - 54
of Grouped Data n
Continued…
Movies Frequency Class (f)(M)
Showing f Midpoint (M)
1 to under 3 1 2 2
3 to under 5 2 4 8
5 to under 7 3 6 18
7 to under 9 1 8 8
9 to under 11 3 10 30
Total 10 66
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
27
30/10/2015
The Mean 3 - 55
of Grouped Data
Continued…
Movies Frequency Class (f)(M)
Showing f Midpoint (M)
Total 10 66
Formula
X= SfM
n
= 66 = 6.6
10
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
The Mean 3 - 56
of Grouped Data
Determine the average student study time
Hours Frequency Class (f)(M)
Studying f Midpoint (M)
10 to under 15 5 12.5 62.5
15 to under 20 12 17.5 210
28
30/10/2015
3 - 57
Symmetric Distribution
zero skewness
mode = median = mean
3 - 58
Right Skewed Distribution
Mean and Median are to the right of the Mode
Positively skewed
Mode<
Median<
Mean
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
29
30/10/2015
3 - 59
Left Skewed Distribution
Mean and Median are to the left of the Mode
Negatively skewed
< Mode
< Median
Mean
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
3 - 60
30
30/10/2015
4-1
4-2
When you have completed this chapter, you will be able to:
1
30/10/2015
4-3
4-4
Measures of Dispersion
• Range
• Mean Deviation
• Variance
• Standard Deviation
2
30/10/2015
4-5
Why Dispersion-1?
4-6
Why Dispersion-2?
3
30/10/2015
4-7
Terminology
Range
…is the difference between the
largest and the smallest value.
4-8
Terminology
Mean Deviation
…is the arithmetic mean of the absolute values of
the deviations from the arithmetic mean.
S x - m For
MD = population
N
All values are used in the calculation.
It is not unduly influenced by large or small values.
4
30/10/2015
4-9
For Samples
4 - 10
5
30/10/2015
4 - 11
4 - 12
Terminology
Variance
…is the arithmetic mean of the
squared deviations
from the arithmetic mean.
6
30/10/2015
4 - 13
Computing the Variance
2 S( x - m )2
s =
N
S( x - x )2
variance and
hence n-1
2
s =
n -1
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
4 - 14
The ages of the Dunn family are:
2, 18, 34, 42
m=
x = 96 = 24
N 4
2 S( x - m )2 (2 - 24 )2 ... (42 - 24 )2
s = =
N 4
944
=
4
= 236
7
30/10/2015
4 - 15
Population Standard Deviation
s = s2
= 236 = 15.36
4 - 16
EXAMPLE
The hourly wages earned by a sample of five
students are: $7, $5, $11, $8, $6.
Find the mean, variance, and Standard Deviation.
x =
x = 37
= 7.40
n 5
s= s2 = 5.29 = 2.30
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
8
30/10/2015
4 - 17
The Mean
of Grouped Data
The Mean x = S fM
4 - 18
of Grouped Data n
Continued…
Movies Frequency Class (f)(M)
Showing f Midpoint (M)
1 to under 3 1 2 2
3 to under 5 2 4 8
5 to under 7 3 6 18
7 to under 9 1 8 8
9 to under 11 3 10 30
Total 10 66
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
9
30/10/2015
The Mean
x = S fM
4 - 19
of Grouped Data n
Continued…
Movies Frequency Class (f)(M)
Showing f Midpoint (M)
Total 10 66
Formula x = S fM
n
Now: Compute the
variance and = 66 = 6.6
standard deviation. 10
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
4 - 20
Standard Deviation
for Grouped Data
10
30/10/2015
4 - 21
Sample Variance
for Grouped Data
5 to under 7 3 6 18 108
7 to under 9 1 8 8 64
9 to under 11 3 10 30 300
Total 10 66 508
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Sample Variance 4 - 22
2 ( S fM )2
S fM -
= n
s2 n-1
662
= 508 - 10 The standard
9 deviation is
The variance is = 8.04 8.04 = 2.8
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
11
30/10/2015
4 - 23
Example
4 - 24
12
30/10/2015
4 - 25
Interpretation and Uses
of the Standard Deviation
Chebyshev’s Theorem:
For any set of observations,
the minimum proportion of the values
that lie within k standard deviations
of the mean is at least:
1
Formula 1 -
k2
where k2 is any constant greater than 1
4 - 26
How well
does this
data set
fit
Chebychev’s
Theorem?
13
30/10/2015
4 - 27
Solution (continued)
Using
Step 1
Determine the mean and Mean = $319
standard deviation of the sample SD = $101.78
Step 2
Input k =2 1- 1
into Chebyshev’s theorem 22 = 1 – ¼ = 3/4
Step 3
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
4 - 28
Solution (continued)
Step 3
Using the mean and SD, Mean = $319
find the range of data values SD = $101.78
within 2 SD of the mean
( x - 2S, x + 2S) = 319 - (2)101.78, 319 +2(101.78)
= (115.44, 522.56)
Now, go back to the sample data,
and see what proportion of the values fall between
115.44 and 522.5656
Proportion
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
14
30/10/2015
4 - 29
Solution (continued)
Proportion of the values
that fall
between 115.44 and 522.56
We find that
48-50
or 96%
of the data
values are in
this range
– certainly
at least 75%
as the theorem
suggests!
4 - 30
Interpretation and Uses of the
Standard Deviation
Empirical Rule:
For any symmetrical, bell-shaped distribution:
…About 68% of the observations
will lie within 1s of the mean
…About 95% of the observations will
lie within 2s of the mean
…Virtually all the observations
will be within 3s of the mean
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
15
30/10/2015
Bell-Shaped Curve 4 - 31
m-3s m m 3s
m-2s m2s
m-1s m1s
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
4 - 32
How well
does this
data set
fit the
Empirical
Rule?
16
30/10/2015
Solution 4 - 33
4 - 34
Skewness
…is the measurement of the
lack of symmetry
of the distribution
17
30/10/2015
4 - 35
Skewness
SK1 = 3 ( Mean - Median )
SD
Following are the earnings per share for a sample of 15
software companies for the year 2000. The earnings
per share are arranged from smallest to largest.
$0.09 0.13 0.41 0.51 1.12 1.20 1.49 3.18
3.50 6.36 7.83 8.92 10.13 12.99 16.40
Find the
Mean = 4.95 SK = 3(4.95-3.18)/5.22
coefficient 1
Median = 3.18
of = 1.017
SD = 5.22
skewness.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
4 - 36
Positively Skewed Distribution
Mean and Median are to the right of the Mode
Mode<
Median<
Mean
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
18
30/10/2015
4 - 37
Negatively Skewed Distribution
Mean and Median are to the left of the Mode
< Mode
< Median
Mean
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
4 - 38
Percentile
19
30/10/2015
4 - 39
Example of Percentile
sort
First Quartile
Third Quartile
4 - 40
Interquartile
Range
…is the distance between the third quartile
Q3 and the first quartile Q1.
This distance
will include the middle 50 percent of the
observations.
Interquartile Range = Q3 - Q1
20
30/10/2015
4 - 41
Example
For a set of observations the
third quartile is 24 and the first quartile is 10.
What is the interquartile range?
4 - 42
Box Plots
21
30/10/2015
4 - 43
Example
Based on a sample of 20 deliveries, Buddy’s
Pizza determined the following information.
The…minimum delivery time was 13minutes
…the maximum 30 minutes
The…first quartile was 15 minutes
…the median 18 minutes, and
… the third quartile 22 minutes
Develop a box plot for the delivery times.
Solution 4 - 44
22
30/10/2015
4 - 45
4 - 46
Why?
Stock A: 7 6 8 5 7 3
Stock B: 15 -10 18 10 -5 8
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
23
30/10/2015
4 - 47
Stock A: 7 6 8 5 7 3
Mean = 36/6 = 6
Stock B: 15 -10 18 10 -5 8
Mean = 36/6 = 6
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
4 - 48
Stock A: 7 6 8 5 7 3
8–3=5
Stock B: 15 -10 18 10 -5 8
18 – ( -10) = 28
Therefore, Stock B is riskier.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
24
30/10/2015
4 - 49
Relative Dispersion
The coefficient of variation
is the ratio of the standard deviation to the
arithmetic mean, expressed as a percentage:
s
CV = x (100%)
A standard deviation of 10 may be perceived as
large when the mean value is 100,
but only
moderately large
when the mean value is 500!
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
4 - 50
Example
Rates of return over the past 6 years for
two mutual funds are shown below.
Fund A: 8.3, -6.0, 18.9, -5.7, 23.6, 20
Fund B: 12, -4.8, 6.4, 10.2, 25.3, 1.4
25
30/10/2015
4 - 51
4 - 52
26
30/10/2015
4 - 53
4 - 54
Solution s
CV = x (100%)
Fund A: CV = 13.19 / 9.85 = 1.34
27
11/23/2015
5-1
A Survey of
Concepts
5-2
When you have completed this chapter, you will be able to:
1
11/23/2015
5-3
Descriptive Inferential
2
11/23/2015
5-5
Terminology
Probability
…is a measure of the
likelihood that an event in the future will happen!
5-6
Terminology
Random Experiment
…is a process
repetitive in nature
the outcome of any trial is uncertain
well-defined set of possible outcomes
each outcome has a probability
associated with it
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
3
11/23/2015
5-7
Terminology
…is a particular result of a
random experiment.
... is the collection or set of all
the possible outcomes of a
random experiment.
…is the collection of one or more
outcomes of an experiment.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
5-8
Example
4
11/23/2015
Subjective
…probability is based on whatever information is available
Objective
Classical Probability Empirical Probability
… is based on the … applies when the number
assumption that the of times the event happens
outcomes of an experiment is divided by the number of
are equally likely observations
S ubjective
Probability
5 - 10
5
11/23/2015
CPlassical
robability
5 - 11
E Pmpirical
robability
5 - 12
6
11/23/2015
E Pmpirical
robability
5 - 13
5 - 14
Example
7
11/23/2015
5 - 15
What do you think?
5 - 16
Random Experiment
1 2 3 4 5 6
An event is the occurrence of an even number
i.e. we collect the outcomes 2, 4, and 6.
8
11/23/2015
5 - 17
Tree Diagrams
5 - 18
Tree Diagrams
Origin First Second Expressed as:
Flip Flip
P(HH)= 0.25
H HH
H P(HT)= 0.25
T HT Simple Events
H TH
P(TH)= 0.25
T
T TT P(TT)= 0.25
1.00
New
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
9
11/23/2015
5 - 19
Tree Diagrams
Origin Appetizer Entrée Dessert
Pie
Menu Ice Cream
Beef
Appetizer:
Soup or Turkey
Soup Pie
Juice
Entrée: Fish Ice Cream
Beef Beef Pie
Turkey Ice Cream
Fish Turkey Pie
Dessert: Juice
Ice Cream
Pie Fish
Pie
Ice Cream
Ice Cream
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
5 - 20
Tree Diagrams
How many complete dinners are there?
5 - 12
T re e D ia g ra m s
O r ig in A p p e t iz e r E n tr é e D essert
P ie
M enu Ice Cream
Beef
P ie
A p p e t iz e r :
S oup or T urkey Ice Cream
S oup P ie
J u ic e
E ntré e : F is h Ice Cream
Beef
T ur ke y
F is h
Beef
T urk ey
P ie
Ice Cream
P ie
12
J u ic e
D e s s e r t: Ice Cream
P ie F is h
P ie
Ice C ream
Ice Cream
10
11/23/2015
5 - 21
Tree Diagrams
5 - 12
How many
Tree dinners
Diagramsinclude beef?
5 - 22
Tree Diagrams
What is the probability that a complete dinner will
include…
Tree Diagrams
5 - 12 Juice?
6/12
Origin Appetizer Entrée Dessert
Pie
Menu Ice Cream Turkey?
Beef
Pie
Appetizer:
Soup or Soup
Turkey Ice Cream 4/12
Juice Pie
Entrée: Fish Ice Cream
Beef Pie
Beef
Turkey
Pie
Ice Cream Both beef
Fish Turkey
Juice and soup?
Dessert: Ice Cream
Pie Fish
Pie
See next
Ice Cream
Ice Cream
2/12 slide…
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
11
11/23/2015
5 - 23
M * N Rule
If one thing can be done in M ways,
and if after this is done, something else
can be done in N ways,
then both things can be done in a
total of M*N different ways in that stated order!
Legend: Appetizer Entrée Dessert
Refer back to tree diagram example:
# different meals = 2 * 3 * 2 = 12
# meals with beef = 2 * 1 * 2 = 4
# meals with juice = 1 * 3 * 2 = 6
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
5 - 24
3 * 2 * 5 = 30
12
11/23/2015
5 - 25
robability
What is the probability of
drawing a red Ace
from a deck of well-shuffled cards?
5 - 26
robability
Deck = 52 Cards
4 Suits
Key steps
Using robabilityAnalysis
1. Determine….the Outcomes that Meet Our
Condition
2. List….all Possible Outcomes
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
13
11/23/2015
5 - 27
robability
4 Suits
4 Suits
x
Clubs Diamonds Hearts Spades 13 cards
13 cards in each
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
5 - 28
robability
Deck = 52 Cards
4 Suits (13 cards in each)
Scenarios
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
14
11/23/2015
5 - 29
Deck = 52 Cards
4 Suits (13 cards in each)
Scenarios
Condition Outcomes 4
1. Draw an Ace
All Possible Outcomes 52
2. Draw a Black Ace Condition Outcomes 2
All Possible Outcomes 52
3. Draw a Red Card Condition Outcomes 26
All Possible Outcomes 52
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
5 - 30
Deck = 52 Cards
4 Suits (13 cards in each)
Scenarios
15
11/23/2015
Deck = 52 Cards 5 - 31
= 4/52
= 4/52
P( Jack or King) = 4/52 + 4/52 = 8/52
Deck = 52 Cards 5 - 32
16
11/23/2015
Deck = 52 Cards 5 - 33
Deck = 52 Cards 5 - 34
17
11/23/2015
5 - 35
Deck = 52 Cards 5 - 36
P(King|Black ) = 2/26
This is called a CONDITIONAL probability
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Alternate solution
18
11/23/2015
Deck = 52 Cards 5 - 37
Deck = 52 Cards 5 - 38
Formula
P (Both)
P(A|B) = P(Both) = P(Given) P (A|B)
P(Given)
P(King of Clubs|Black ) = (1/52) / (26/52)
= (1/52) * (52/26)
= 1/26
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
19
11/23/2015
Deck = 52 Cards 5 - 39
5 - 40
20
11/23/2015
5 - 41
Reading
Probabilities
from a Table
(Contigency Table)
Reading Probabilities 5 - 42
from a Table
A survey of undergraduate students in the School of
Business Management at Eton College revealed the
following regarding the gender and majors of the students:
Gender Accounting International HR TOTAL
Male 150 150 50 350
Female 175 160 65 400
325 310 115 750
What is the Probability of selecting a female student?
400/750 = 53.33%
More
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
21
11/23/2015
Reading Probabilities 5 - 43
from a Table
What is the Probability of selecting a Human
Resources or International major?
Gender Accounting International HR TOTAL
Male 150 150 50 350
Female 175 160 65 400
325 310 115 750
Reading Probabilities 5 - 44
from a Table
What is the Probability of selecting a Female
or International major?
22
11/23/2015
Reading Probabilities 5 - 45
from a Table
What is the Probability of selecting a Female
Accounting student?
Gender Accounting International HR TOTAL
Male 150 150 50 350
Female 175 160 65 400
325 310 115 750
More
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Reading Probabilities 5 - 46
from a Table
What is the Probability of selecting a Female,
given that the person selected
is an International major?
Gender Accounting International HR TOTAL
Male 150 150 50 350
Female 175 160 65 400
325 310 115 750
23
11/23/2015
Reading Probabilities 5 - 47
from a Table
What is the Probability of selecting a Female,
given that the person selected
is an International major?
Formula P(A|B) = P(Both)
P(Given)
P(F|I) = P(F and I) / P(I)
= (160/750) / (310/750)
= 160/310
= 51.6%
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Reading Probabilities 5 - 48
from a Table
What is the Probability of selecting an
International major, given that the person
selected is a Female?
Gender Accounting International HR TOTAL
Male 150 150 50 350
Female 175 160 65 400
325 310 115 750
24
11/23/2015
Reading Probabilities 5 - 49
from a Table
Terminology 5 - 50
Independent Events
Events are independent if the occurrence of
one event does not affect the probability of the other
Each flip is
independent of the other! Find the probability
Flip once of flipping
Flip twice
2 Heads in a row
P(2H) = .5*.5
Consider the random
= .25 or 25%
experiment of flipping a coin twice.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
25
11/23/2015
Terminology 5 - 51
Independent Events
Draw three cards with replacement
i.e., draw one card,
look at it,
put it back,
and repeat twice more.
Each draw is independent of the other
Find the probability of drawing 3 Queens in a row:
P(3Q) = 4/52 * 4/52 *4/52 = 0.00046 = most unlikely!
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Dependent Events 5 - 52
Consider 2 events:
Drawing a RED card from a deck of cards
Drawing a HEART from a deck of cards
26
11/23/2015
5 - 53
ayes’
heorem
5 - 54
ayes’
heorem
Formula
P(A1 ) P(B|A1 )
P(A1|B) =
P(A1 ) P(B|A1)+ P(A2 )P(B|A2 )
Example
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
27
11/23/2015
5 - 55
ayes’
heorem
Duff Cola Company recently received several
complaints that their bottles are under-filled.
A complaint was received today but the production
manager is unable to identify which of the two
Springfield plants (A1 or A2) filled this bottle.
What is the probability that the under-
filled bottle came from plant A1?
5 - 56
ayes’
heorem
What is the probability that the under-
filled bottle came from plant A1?
% of Total Production % of Underfilled Bottles
A1 55 3
A2 45 4
List the P(plant A1) = .55
1
Probabilities given P(plant A2) = .45
Input values into P(Underfilled - A1) = .03
2 formula and compute P(Underfilled - A2) = .04
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
28
11/23/2015
5 - 57
ayes’
heorem
What is the probability that the under-
filled bottle came from plant A?
List the P(plant A1) = .55
1
Probabilities given P(plant A2) = .45
29
8/01/2016
6-1 6-2
When you have completed this chapter, you will be able to:
1. Define the terms probability distribution and
random variable.
2. Distinguish between discrete and
Discrete Probability Distributions continuous random variables.
3. Calculate the mean, variance, and standard deviation of
a discrete probability distribution.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
1
8/01/2016
6-3 6-4
Introduction Introduction
• Chapter 2 – 4 explained the descriptive statistics • Chapter 5 examined something that would probably
– Describing raw data by organizing it into a frequency happen
distribution and portraying the distribution in tables, – We note that this facet of statistics is called statistical
graphs, and charts inference.
– Computing a measure of location—such as the – The objective is to make inferences (statements) about a
arithmetic mean, median, or mode—to locate a typical population based on a number of observations, called a
value near the center of the distribution sample, selected from the population
– The range and the standard deviation are used to
describe the spread in the data
• Probability is a value between 0 and 1 inclusive, and
can be combined using rules of addition and
• These chapters focus on describing something that multiplication
has already happened.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
2
8/01/2016
6-5 6-6
Introduction
Terminology
• Chapter 6 explains a probability distribution Random Variable
that gives the entire range of values that can …is a numerical value determined by
occur based on an experiment the (random) outcome of an experiment.
• A probability distribution is similar to a
Probability Distribution
relative frequency distribution.
…is the listing of all possible outcomes
• However, instead of describing the past, it of an experiment
describes a likely future event. and the corresponding probability.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
3
8/01/2016
Discrete Continuous
Discrete Continuous Students in a class Examples
Under this distribution Under this distribution Distance driven by an
executive to get to work
the random variable the random variable
has a has an Number of children The length of time of a
countable number infinite number in a family particular phone call
of possible outcomes of possible outcomes Mortgage Loan The length of
Number of Mortgages time of an
Resulting of Resulting of some
counting type of measurement
approved in a month afternoon nap!
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
4
8/01/2016
6-9 6 - 10
Distinguishing features
of a
Consider a random experiment in which
Discrete Distribution:
a coin is tossed three times
The sum of the probabilities of the various
outcomes is 1.00 Let x be the number of Heads
5
8/01/2016
6
8/01/2016
6 - 13 6 - 14
Probability Distribution
Consider a Probability Distribution
random
experiment in # of
x P(x)
which a coin is Outcomes
tossed three times. 0 1 1/8
Determine the 1 3 3/8
probability
distribution. 2 3 3/8
What is the
3 1 1/8
probability of
tossing 2 heads in 8 8/8 = 1
3 flips?
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
7
8/01/2016
8
8/01/2016
Variance of a Discrete 6 - 17
Variance of a Discrete 6 - 18
9
8/01/2016
6 - 19
Computing the m 6 - 20
10
8/01/2016
Computing the s 2 6 - 21 6 - 22
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
11
8/01/2016
6 - 23 Binomial Probability 6 - 24
Terminology Distribution
Bernoulli Trial The experiment consists of n Bernoulli or
…is a random experiment in which the binomial trials
number of possible outcomes The outcomes are classified into
is precisely two! one of two mutually exclusive categories,
…such as success or failure
For Example…
The probability of success
stays the same for each trial
Heads Tails
The trials are independent
Right Wrong
Interested in the number of successes
Course Grade… A B or C or D or…!
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
12
8/01/2016
Binomial Probability 6 - 25 6 - 26
Distribution Combination
Let…
n be the number of trials
x be the number of observed successes
π be the probability of success on each trial
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
13
8/01/2016
x (n-x)
6 - 27
P( x) =n Cx π ( 1 - π) 6 - 28
The Department of Labour reports that DATA: 20% Unemployed & Sample of 10
20% of the workforce
aged between 15 and 19 years is unemployed.
Exactly three are unemployed
P ( 3) = 10C 3 (. 20 )3 (1 - . 20 )10-3
From a sample of 10 workers in this age group, = (120)(.0080)(.2097)
calculate the following probabilities:
= .2013 or 20.13%
Exactly three are unemployed
At least three are unemployed
At least three are unemployed
P(x 3) = 10 C3 (.20)3 (.80)10-3 +... +10C10 (.20)10 (.80)0
None are unemployed
At least one is unemployed = .2013 + .0881 + ... + .000
= .322 or 32.2%
Calculations More…
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
14
8/01/2016
x (n-x)
P( x) =n Cx π ( 1 - π) 6 - 29 Binomial Probability 6 - 30
Distribution
DATA: 20% Unemployed & Sample of 10 n = 10
Table 6.2 Probability
None are unemployed X 0.05 0.10 0.20 0.30 0.40 0.50 0.60 0.70 0.80 0.90 0.95
P ( 0) = 10C 0 (. 20 )0 (1 - . 20 )10-0 0 0.599 0.349 0.107 0.028 0.006 0.001 0.000 0.000 0.000 0.000 0.000
1 0.315 0.387 0.268 0.121 0.040 0.010 0.002 0.000 0.000 0.000 0.000
= (1)(1) (.1074) 2 0.075 0.194 0.302 0.233 0.121 0.044 0.011 0.000 0.000 0.000 0.000
3 0.010 0.057 0.201 0.267 0.215 0.117 0.042 0.009 0.001 0.000 0.000
= .1074 or 10.74% 4 0.001 0.011 0.088 0.200 0.251 0.205 0.111 0.037 0.006 0.000 0.000
At least one is unemployed 5 0.000 0.001 0.026 0.103 0.201 0.246 0.201 0.103 0.026 0.001 0.000
6 0.000 0.000 0.006 0.037 0.111 0.205 0.251 0.200 0.088 0.011 0.001
…same as “all the time”(100%) except when 7 0.000 0.000 0.001 0.009 0.042 0.117 0.215 0.267 0.201 0.057 0.010
“none are unemployed” 8 0.000 0.001 0.000 0.001 0.011 0.044 0.121 0.233 0.302 0.194 0.075
9 0.000 0.000 0.000 0.000 0.002 0.010 0.040 0.121 0.268 0.387 0.315
i.e. P ( 1) = 1 – P(0) = 1 - .1074 = .8926 or 89.26% 10 0.000 0.001 0.000 0.000 0.000 0.001 0.006 0.028 0.107 0.349 0.599
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
15
8/01/2016
Distribution
Using Appendix A
n = 10
From text Appendix A Probability Binomial Probability Distribution
X 0.05 0.10 0.20 0.30 0.40 0.50 0.60 0.70 0.80 0.90 0.95 DATA: 20% Unemployed & Sample of 10
0 0.599 0.349 0.107 0.028 0.006 0.001 0.000 0.000 0.000 0.000 0.000
1 Where
0.315 represents
0.387 0.268 X the
0.121 0.040 ‘Number Unemployed’
0.010 0.002 0.000 0.000 0.000 0.000 Exactly three are unemployed
2 0.075 0.194 0.302 0.233 0.121 0.044 0.011 0.000 0.000 0.000 0.000
3 Represents
0.010 the0.267 0.215
0.057 0.201 ‘Probability’ 0.117 0.042 0.009 0.001 0.000 0.000 X 0.05 0.10 0.20 0.30 …0.80 0.90 0.95
4 0.001 0.011 0.088 0.200 0.251 0.205 0.111 0.037 0.006 0.000 0.000
0 0.599 0.349 0.107 0.028 …0.000 0.000 0.000
5 Explanations
0.000 0.001 0.026 0.103 0.201 0.246 0.201 0.103 0.026 0.001 0.000 1 0.315 0.387 0.268 0.121 …0.000 0.000 0.000
6 0.000 0.000 0.006 0.037 0.111 0.205 0.251 0.200 0.088 0.011 0.001 2 0.075 0.194 0.302 0.233 …0.000 0.000 0.000
7 0.000 0.000 0.001 0.009 0.042 0.117 0.215 0.267 0.201 0.057 0.010 3 0.010 0.057 0.201
0.201 0.267 …0.001 0.000 0.000
8
9
0.000Represents
0.001 0.000the
0.001 0.011 0.044 0.121 0.233
‘ PROBABILITY’ when0.302
0.000 0.000 0.000 0.000 0.002 0.010 0.040 0.121 0.268
n = 100.194
0.387
0.075
0.315
4 0.001 0.011 0.088 0.200 …0.006 0.000 0.000
10
= .201 or 20.1%
0.000 0.001 0.000 0.000 0.000 0.001 0.006 0.028
Using0.107 0.349 0.599
Appendix A
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
16
8/01/2016
Using Appendix A
Using Appendix A
Binomial Probability Distribution
Binomial Probability Distribution
DATA: 20% Unemployed & Sample of 10
DATA: 20% Unemployed & Sample of 10
At least three are unemployed
At least three are unemployed X 0.05 0.10 0.20 To account for the ‘at least 3 unemployed’,
Alternate Reasoning: 0 0.599 0.349 0.107 we must TOTAL the percentages from
1 0.315 0.387 0.268 3 to 10, inclusively
If ‘at least three are unemployed’ 2 0.075 0.194 0.302
0.201
it follows that 3 0.010 0.057 0.201
0.088
4 0.001 0.011 0.088
‘at most seven are employed!’ 0.026
5 0.000 0.001 0.026
0.006
You can turn this into a problem of 6 0.000 0.000 0.006
0.001
= 0.322 or 32.2%
7 0.000 0.000 0.001
80% employment if you wish 0.000
8 0.000 0.000 0.000
0.000
9 0.000 0.000 0.000
0.000
10 0.000 0.000 0.000
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
17
8/01/2016
18
8/01/2016
2
Meanm and Variance s 6 - 37 6 - 38
of a
Binomial Probability Distribution
The Ontario
Formula m = nπ
Department of = 10(.20)
Labour reports that = 2.0
20% of the workforce Poisson Probability
aged between
15 and 19 years is
Formula s2= nπ(1 - π) Distribution
unemployed. = 10(.20)(.80)
From a sample of 10 = 1.60
workers in this age
group, calculate: Therefore, the
andm 2
s standard deviation is 1.6 = 1.3
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
19
8/01/2016
6 - 39 6 - 40
Introduction Poisson probability distribution
• When there is a large number of trials, but a • Describes the number of times some event
small probability of success, binomial occurs during a specified interval.
calculation becomes impractical – The interval may be time, distance, area, or
– Example: Number of deaths from horse kicks in volume
the Army in different years • It is often referred to as the “law of
improbable events,” meaning that the
probability, π, of a particular event’s
happening is quite small
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
20
8/01/2016
6 - 41 Poisson Probability 6 - 42
Applications Distribution
The Binomial Distribution becomes
• It is used as a model to describe: more skewed to the right
– the distribution of errors in data entry
Positive
– the number of scratches and other imperfections
in newly painted car panels
– the number of defective parts in outgoing as the Probability of success become smaller.
shipments The limiting form of the Binomial Distribution
– the number of customers waiting to be served at where the probability of success
a restaurant or waiting to get into an attraction
at Disney World π is small and n is large is called the
Poisson Probability Distribution
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
21
8/01/2016
Distribution Distribution
Poisson Probability Distribution
can be described mathematically using the formula: The mean number of successes… m
mx e -m can be determined in binomial
nπ
P( x) = x!
situations by…
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
22
8/01/2016
6 - 45 Poisson Probability 6 - 46
Example Distribution
Probability of an accident in a year is 0.00024. So in a Poisson with µ =2.4
town of 10,000 people, the mean number of person
involved in accident:
µ= nπ
= 10,000 x 0.00024 = 2.4
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
23
31/08/2018
THE
Normal
PROBABILITY
DISTRIBUTION
When you have completed this chapter, you will be able to:
1
31/08/2018
Recall…
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
2
31/08/2018
Variables
Quantitative … can be classified as either
Discrete or
Numerical Continuous
Observations
Characteristics
Continuous … can assume any value
within a specified range!
e.g. - Pressure in a tire
- Weight of a beef chop
- Height of students in a class
Also Recall that…
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
3
31/08/2018
Continuous Probability
Distributions
a b x
Data:
Heights of adult Canadian males…
a) n = 50, class interval = 2
b) n = 500, class interval = 1
c) n = 5000, class interval = 0.4
d) Probability function
See Histograms
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
4
31/08/2018
5
31/08/2018
6
31/08/2018
+
Theoretically, curve extends to infinity
Mean, median, and mode are equal
7
31/08/2018
The Standard
Normal Probability Distribution
… is a normal distribution
with a mean of 0 and a standard deviation of 1.
Also called the Z Distribution
Formula z (X µ)
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
8
31/08/2018
The Standard
Normal Probability Distribution
Total Area under the curve is 100% or 1
The Standard
Normal Probability Distribution
Formula z (X µ)
The bi-monthly
starting salaries of
recent MBA z $3,300 $3,000
graduates $300
follows the normal
distribution with a z = 1.00
mean of $3,000
A z-value of 1 indicates that
and a standard
deviation of $300. the value of $3,300 is
What is the one standard deviation
z-value for a above the mean of $3,000.
salary of $3,300?
Show curve
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
9
31/08/2018
$3300
z
$3000
z
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
The Standard
Normal Probability Distribution
Formula z (X µ)
The bi-monthly
starting salaries of
recent MBA $2,550 $3,000 z
graduates follows $300
the normal Z = -1.50
distribution with a
mean of $3,000 A z-value of –1.50 indicates that
and a standard the value of $2,550 is
deviation of $300. one and a half (1.5)
What is the standard deviations
z-value for a salary
below the mean of $3,000.
of $2,550?
Show curve
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
10
31/08/2018
The Standard
Normal Probability Distribution
A z-value of –1.50 indicates that the value of
$2,550 is one and a half standard deviations
below the mean of $3,000.
$2550
z
$3000
z
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
The Standard
Normal Probability Distribution
… there is an infinite number of normal
distributions, but only one table
… each distribution is determined by
&
…work with & to calculate normal
probabilities, by figuring out
… how many standard deviations is x
away from the mean?
11
31/08/2018
If…
= 50 = 8
+
0 20 40 50 60 80 100
40 is 10 units(-10)away from 50 40 is –1.25 away from 50
If 1 standard deviation () is 8 units, then 10 units must be 1.25 SD
12
31/08/2018
The Standard
Normal Probability Distribution
the Z Distribution
40 – 50
z – 10
8 8
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
The Standard
Normal Probability Distribution
the Z Distribution
- z-value + z-value
Left of mean Right of mean
0 20 40 50 60 80 100
Z-scale 0
The mean µ, of a standard normal distribution is 0, and
Standard Deviation is 1.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
13
31/08/2018
The Standard
Normal Probability Distribution
Calculate P(50 < X < 60)
If… Transferring each
= 50 = 8 value into z-scores,
z (X µ)
0 20 40 50 60 80 100
0 1.25
Z1= (60-50)/8 = 1.25
P(50 < X < 60) = P(0 < z < 1.25) z-table
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
14
31/08/2018
The Standard
Normal Probability Distribution
the Z Distribution
… 0.3944 is the area between the mean and a positive z score 1.25
0.3944
0 20 40 50 60 80 100
0 1.25
Therefore, the Probability of 50<X<60 is 39.44%
The Standard
Normal Probability Distribution
Calculate P(40 < X < 60)
If… Transferring each
= 50 = 8 value into z-scores,
z (X µ)
0 20 40 50 60 80 100
-1.25 0 1.25
Z1= (40-50)/8 = -1.25 Z2= (60-50)/8 = 1.25
…and
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
15
31/08/2018
The Standard
Normal Probability Distribution
… because both sides are symmetrical, the left side
(the area between the mean and a negative z score -1.25)
must have the same area………….. 0.3944.
0 20 40 50 60 80 100
-1.25 0 1.25
The Standard
Normal Probability Distribution
the Z Distribution
Calculate P(40 X 60)
If…
= 50 = 8
0 20 40 50 60 80 100
P(40 < X < 60) = P(40 X 60)
…since P(X= 40 or 60) is zero
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
16
31/08/2018
1 Locate Area on
the normal curve
0 1.00
Be sure to always sketch the curve, insert the
given values and shade the required area.
17
31/08/2018
Locate Area on
1
the normal curve
18
31/08/2018
1 Locate Area on
the normal curve
19
31/08/2018
1 Locate Area on
the normal curve
-2.22 0 2.22
The Area between –2.22<Z<0 is 0.4868
20
31/08/2018
1 Locate Area on
the normal curve
-2.96 0 2.96
The Area between –2.96<Z<0 is 0.4985
Therefore, the Probability of –2.96<Z<0 is 49.85%
1 Locate Area on
the normal curve
-1.65 0 1.65
The Area (a1) between –1.65<Z<0 is 0.4505
Add together
The Area (a2) between 0<Z<1.65 is 0.4505
21
31/08/2018
Locate Area on
1
the normal curve
Look up – 2.00
2 then 1.00 in Table
-2.00 0 1.00
The Area (a1) between –2.00<Z<0 is 0.4772
Add together
The Area (a2) between 0<Z<1.00 is 0.3413
Therefore, the required Total Area is 0.8185
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
1 Locate Area on
the normal curve
-0.44 0 1.96
The Area (a1) between –0.44<Z<0 is 0.1700
Add together
The Area (a2) between 0<Z<1.96 is 0.4750
22
31/08/2018
The Standard
Normal Probability Distribution
Total Area under the curve is 100% or 1
23
31/08/2018
1 Locate Area on
the normal curve
2 a1
Look up 1.96 in Table Area = 0.5
3 Adjust as needed 0 1.96
1 Locate Area on
the normal curve
Area = 0.5
2 Look up 1.64 in Table
a1
3 Adjust as needed -1.64 0
24
31/08/2018
1 Locate Area on
the normal curve
Area = 0.5
2 Look up 0.95 in Table a1
1 Locate Area on
the normal curve
Area = 0.5
2 Look up 0.95 in Table
a1
25
31/08/2018
1 Locate Area on
the normal curve
A
Area = 0.5
2 Look up 1.00 in Table a1
3 Adjust as needed
0 1.00
1 Locate Area on
the normal curve a1 Find
0 1.96
3 Adjust as needed 2.58
The Area (a1) between 0<Z<1.96 is 0.4750
Subtract
The Area (a2) between 0<Z<2.58 is 0.4951
26
31/08/2018
1 Locate Area on
the normal curve
a1
2 Look up 0.55 then 1.96 in Table
a2
0.55
3 Adjust as needed 1.96
27
31/08/2018
Between:
68.26%
95.44%
99.74%
-1 -1
-2 -2
-3 -3
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
28
31/08/2018
robability that…
a person from Newmarket, selected at random,
will use between 80 and 88 litres per day, is…?
(X 80 - 80 0.00
z1 10 80 88
(X 88 - 80
z1 10 0.80 Z-scale 0.0 0.80
Z 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07
0.00 0.0000 0.0040 0.0080 0.0120 0.0160 0.0199 0.0239 0.0279
0.10 0.0398 0.0438 0.0478 0.0517 0.0557 0.0597 0.0636 0.0675
0.20 0.0793 0.0832 0.0871 0.0910 0.0948 0.0987 0.1026 0.1064
0.30 0.1179 0.1217 0.1255 0.1293 0.1331 0.1368 0.1406 0.1443
0.40 0.1554 0.1591 0.1628 0.1664 0.1700 0.1736 0.1772 0.1808
0.50 0.1915 0.1950 0.1985 0.2019 0.2054 0.2088 0.2123 0.2157
0.60 0.2291 0.2257 0.2324 0.2357 0.2389 0.2422 0.2454 0.2486
0.70 0.2580 0.2611 0.2642 0.2673 0.2704 0.2734 0.2764 0.2794
0.80 0.2881 0.2910 0.2939 0.2967 0.2995 0.3023 0.3051 0.3078
0.90 0.3159 0.3186 0.3212 0.3238 0.3264 0.3289 0.3315 0.3340
1.00 0.3413 0.3438 0.3461 0.3485 0.3508 0.3531 0.3554 0.3577
1.10 0.3643 0.3665 0.3686 0.3708 0.3729 0.3749 0.3770 0.3790
1.20 0.3849 0.3869 0.3888 0.3907 0.3925 0.3944 0.3962 0.3980
1.30 0.4032 0.4049 0.4066 0.4082 0.4099 0.4115 0.4131 0.4147
robability that…
a person from Newmarket, selected at random,
will use between 80 and 88 litres per day, is…?
29
31/08/2018
robability that…
a person from Newmarket, selected at random,
will use between 76 and 92 litres per day, is…?
(X 76 - 80
z1 10 0.40 76 92
(X 92 - 80 Z-scale -.4 1.20
z1 10 1.20 Z
0.00
0.10
0.00
0.0000
0.0398
0.01
0.0040
0.0438
0.02
0.0080
0.0478
0.03
0.0120
0.0517
0.04
0.0160
0.0557
0.05
0.0199
0.0597
0.06
0.0239
0.0636
0.07
0.0279
0.0675
0.20 0.0793 0.0832 0.0871 0.0910 0.0948 0.0987 0.1026 0.1064
0.30 0.1179 0.1217 0.1255 0.1293 0.1331 0.1368 0.1406 0.1443
0.40 0.1554 0.1591 0.1628 0.1664 0.1700 0.1736 0.1772 0.1808
0.50 0.1915 0.1950 0.1985 0.2019 0.2054 0.2088 0.2123 0.2157
0.60 0.2291 0.2257 0.2324 0.2357 0.2389 0.2422 0.2454 0.2486
0.70 0.2580 0.2611 0.2642 0.2673 0.2704 0.2734 0.2764 0.2794
0.80 0.2881 0.2910 0.2939 0.2967 0.2995 0.3023 0.3051 0.3078
0.90 0.3159 0.3186 0.3212 0.3238 0.3264 0.3289 0.3315 0.3340
1.00 0.3413 0.3438 0.3461 0.3485 0.3508 0.3531 0.3554 0.3577
1.10 0.3643 0.3665 0.3686 0.3708 0.3729 0.3749 0.3770 0.3790
1.20 0.3849 0.3869 0.3888 0.3907 0.3925 0.3944 0.3962 0.3980
1.30 0.4032 0.4049 0.4066 0.4082 0.4099 0.4115 0.4131 0.4147
robability that…
a person from Newmarket, selected at random,
will use between 76 and 92 litres per day, is…?
a1 =.1554
a2 =.3849
Add together A =.5403
The area under a normal curve between a z-value of 76 and
a z-value of 92 is 0.5403, therefore, we conclude that 54.03% of
the residents will use between 76 to 92 litres per day!
30
31/08/2018
31
31/08/2018
Using
the Normal Curve
to Approximate
the Binomial Distribution
32
31/08/2018
Binomial Experiment
33
31/08/2018
Example
Solution 1
34
31/08/2018
35
31/08/2018
36
31/08/2018
37
31/08/2018
38
8- 1
Sampling
Methods
&
Central Limit Theorem
8- 2
When you have completed this chapter, you will be able to:
1. Explain under what conditions sampling is the
proper way to learn something about a population.
2. Describe methods for selecting a sample.
3. Define and construct a sampling distribution
of the sample mean.
4. Explain the central limit theorem.
5. Use the central limit theorem to find probabilities of
selecting possible sample means from
a specified population.
1
8- 3
KEY 1.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
8- 4
2
8- 5
KEY 1.
If the proper
method for selecting
the sample is
NOT MADE … the SAMPLE
will not be truly
representative of the
TOTAL Population!
8- 6
3
8- 7
Techniques
with Replacement without Replacement
Each data unit in the Each data unit in the
population is allowed to population is allowed to
appear in the sample appear in the sample
more than once no more than once
Methods
8- 8
4
8- 9
8- 10
5
8- 11
Example
8- 12
6
8- 13
Example
We might study the advertising expenditures for the 352
largest companies in the United States.
8- 14
7
8- 15
8- 16
Example
8
8- 17
Selected Cluster
Suppose you divided the state into 12 primary units, then
selected at random four regions—2, 7, 4, and 12—and
concentrated your efforts in these primary units.
8- 18
Sampling Error
9
8- 19
Terminology
“Sampling error” … is the difference between
a sample statistic
and its
corresponding population
parameter
“Sampling distribution … is a probability distribution
of the sample mean” consisting of
all possible sample means
of a given sample size
selected from a population
Example
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
8- 20
Example
10
8- 21
Solution
8- 22
Solution
What is the
sampling error?
11
8- 23
Solution
8- 24
Solution
12
Solution 8- 25
8- 26
13
The law firm of Hoya and Associates has five partners.
8- 27
8- 28
14
8- 29
8- 30
15
8- 31
Example …continued
Mean
24 Organize the sample means
26 into a Sampling Distribution
24
Sample Frequency Relative frequency
22 Mean Probability
28
26 22 1 1/10
24 24 4 4/10
28
26 3 3/10
26
24 28 2 2/10
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
8- 32
Example …continued
24 4 10
26 3 = 25.2
28 2
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
16
8- 33
Example …continued
Partner Hours
Dunn 22
22 26 30 26 22
Hardy 26
5
Kiers 30
Malinowski 26 = 25.2
Tillman 22
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
8- 34
17
8- 35
8- 36
18
8- 37
Related Concepts
8- 38
19
8- 39
Example
8- 40
Solution
20
8- 41
Solution
8- 42
21
8- 43
8- 44
22
8- 45
8- 46
Z-Value
sampling error
23
8- 47
Example
8- 48
24
8- 49
Solution
8- 50
Point Estimates
A point estimate is one value ( a single point)
that is used to estimate a population parameter
sample mean
sample standard deviation
sample variance
sample proportion
More
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
25
8- 51
Point Estimates
Population follows… Population does NOT follow…
the normal distribution the normal distribution
The sampling distribution If the sample is of at least 30
of the sample means also follows observations, the sample WILL
the normal distribution follow the normal distribution
Probability of a sample mean Probability of a sample mean
falling within a particular region, falling within a particular region,
use:
Z= X use:
Z= X
n s n
26
Using the Sampling Distribution 8- 53
nswer…
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
X
1 Formula z
s n
320 330 a1
= 0.79
80 40
320 330
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
27
Using the Sampling Distribution 8- 55
2 Look up 0.79
in Table
Required Area = a1
a1 =0.2852
0.2852 + .5 = 0.7852
320 330
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
8- 56
END
28
Estimation and
Confidence Intervals
Chapter 9
McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
Learning Objectives
LO1 Define a point estimate.
LO2 Define level of confidence.
LO3 Compute a confidence interval for the population
mean when the population standard deviation is
known.
LO4 Compute a confidence interval for a population mean
when the population standard deviation is unknown.
LO5 Compute a confidence interval for a population
proportion.
9-2
1
LO1 Define a point estimate.
Sampling
Why Use Sampling?
1. To contact the entire population is too time consuming.
2. The cost of studying all the items in the population is often too expensive.
3. The sample results are usually adequate.
4. Certain tests are destructive.
5. Checking all the items is physically impossible.
9-3
Estimates
Point Estimate versus Confidence Interval Estimate
• A point estimate is a single value (point) derived from a sample and used
to estimate a population value.
• A confidence interval estimate is a range of values constructed from
sample data so that the population parameter is likely to occur within that
range at a specified probability. The specified probability is called the level
of confidence.
What are the factors that determine the width of a confidence interval?
1. The sample size, n.
2. The variability in the population, usually σ estimated by s.
3. The desired level of confidence.
9-4
2
LO2
9-5
3
LO3 Compute a confidence interval for the population
mean when the population standard deviation is known.
x sample mean
z z - value for a particular confidence level
σ the population standard deviation
n the number of observations in the sample
1. The width of the interval is determined by the level of confidence and the
size of the standard error of the mean.
2. The standard error is affected by two values:
- Standard deviation
- Number of observations in the sample
9-7
Example
4
Answer 1
Answer 2
5
How do we interpret these
results?
Suppose we select many samples of 256 store
managers, perhaps several hundred
For each sample, we compute the mean and then
construct a 95 percent confidence interval, such as we
did above
We could expect about 95 percent of these confidence
intervals to contain the population mean
About 5 percent of the intervals would not contain the
population mean annual income, which is µ
6
LO4 Compute a confidence interval for the population mean
when the population standard deviation is not known.
Examples:
The Dean of the Business College wants to estimate the mean number
of hours full-time students work at paying jobs each week. He selects a
sample of 30 students, contacts each student and asks them how many
hours they worked last week.
9-13
Characteristics of t-Distribution
1. It is, like the z distribution, a continuous distribution
9-14
7
When? 9 - 16
NO YES
Population standard
n 30 or more?
deviation known?
NO YES NO YES
Use a
nonparametric Use the z Use the t Use the z
test distribution distribution distribution
(see Ch16)
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
8
LO4
• df degree of freedom
• 1 or 2 tail
9-17
Example
9
10
Answer
The endpoints of the confidence interval are 0.256 and 0.384
If we repeated this study 200 times, calculating the 95%
confidence interval with each sample’s mean and the
standard deviation
190 of the intervals would include the population mean
Ten of the intervals would not include the population mean
This is the effect of sampling error
A further interpretation is to conclude that the population
mean is in this interval
The manufacturer can be reasonably sure (95 % confident)
that the mean remaining tread depth is between 0.256 and
0.384 inches
Because the value of 0.30 is in this interval, it is possible that
the mean of the population is 0.30
11