Você está na página 1de 23

E370

5/13/17
Sampling, Classifying
and Graphing Data to
get information.
Week 2, Part 3:
Graphing
Quantitative
histogram

frequency polygon

ogive

stem-and-leaf plot or display

Summary tables expand into


graphs
Column charts with specifications
Classes are mutually exclusive and
collectively exhaustive.
Classes are always in numerical order.
No classes are skipped, even if they are
empty.
Columns touch.

Themost commonly used chart for


numerical data
Weak Points
The look of the data is easily influenced by
the classes selected.
An easy graph with which to lie.

Histograms
Minutes Commuting to work in New York

42
39

34 34

22
20
18
15

11
10
7

3
2 2
1

0
2.5 7.5 12.5 17.5 22.5 27.5 32.5 37.5 42.5 47.5 52.5 57.5 62.5 67.5 72.5 77.5

Minutes
A line version of the histogram
A plot of class midpoints and frequencies
A closed chart
Classes are mutually exclusive and
collectively exhaustive
Classes are always in numerical order
No classes are skipped, even if they are
empty
Strengths
Best way to represent several variables on
the same axes.
Weak Points
Based on a histogram, so same problems.

The Frequency Polygon


Minutes Commuting to Work in New York
45

40

35

F 30
r
e 25
q
u 20
e
n 15
c
y
10

Minutes
Minutes Commuting to work in LA and New York
100

90

80

70
F
r 60
e
q 50
u
e 40
n
30
c
y 20

10

Minutes

LA NY Combined
Minutes Commuting to Work in LA, NY and Combined
100

90

80

F 70
r
e 60
q
50
u
e
40
n
c 30
y
20

10

0
2.5 7.5 12.5 17.5 22.5 27.5 32.5 37.5 42.5 47.5 52.5 57.5 62.5 67.5 72.5 77.5

Minutes

LA NY Combined
Agraph of a cumulative
frequency
Can be absolute or relative
cumulative
Non-decreasing graph
Plot of upper class limits and
frequencies
Always begins at (0,0)

The Ogive
Minutes Commuting to Work in New York

250
C
u
m
u 200
l
a
t
i
v
150
e

F
r 100
e
q
u
e
n 50
c
y

0
0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80

Minutes
Auseful alternative to the
histogram for appropriate data
sets.
Each observation is divided into a
stem and a leaf.
A stem is a subset of the values based on
leading digits.
A leaf is a list of trailing digits that have the
stem in common.
For example, consider observations
19, 27, 28, 32
The stems would be the leading digit in each
value, 1, 2 & 3
Stem-n-leaf plot
The leaves would be the trailing digit in
each value, 9, 7, 8 & 2.
The stems are listed vertically from smallest
to largest.
The leaves are listed horizontally, in numerical
order, to the right of the appropriate stem,
one value for every observation, even if values
repeat.
Observations Stem Leaves
242 24 2 4 5 5
244 25
245 26
245 27

Good for relatively small data sets (n < 150)


with not a lot of detail in the digits.

Organize a stem-n-leaf plot


19 0
20 6
21 1
22 9
23 0 1 4 5 5
24 2 4 5 5 5 5 7 7 8 8 8 9
25 1 2 2 3 4 4 5 7 7 9
26 0 0 1 1 2 2 3 3 3 4 5 6 8
27 1 2 3 4 4 5 8
28 0 1 2 3 3 3 5 6 6 7 8 9 9 9
29 0 0 1 2 2 2 3 5 5 6 8 9 9
30 1 2 2 4 8 8 8 9
31 1 2 4 8
32 2 4 4

Stem-n-leaf plot
33
34
0
9
1 3

35 0 5
Qualitative
bar or column chart
Frequencies are represented as
bars or columns, one for each
class
When differences between
classes is small, bars enable one
to better see the differences
Not a histogramcolumns do
NOT touch

And qualitative graphs . . .


WorldOil Reserves
Billionsof barrels
0 50 100 150 200 250 300
Angola
Canada
China
Iran
Iraq
Kazakhstan
Kuwait
Libya
Mexico
Nigeria
Russia
Saudia Arabia
United Arab Emirates
United States
Venezuela
Other
pie chart
Simple, easily understood
Classes are slices, size of slice is frequency
Excellent for relative comparisons among
categories.
Greatly enhanced if color can be used to
distinguish categories.
If there is a large number of categories &/
frequencies are small, it may not be
informative.
If only black & white patterns are available
categories are difficult to distinguish.

Pie Charts
WorldOil Reserves
1.9%
1.4%
1.9% 1.7% Saudia Arabia
2.5% Canada
2.7%
Iran
3.0% 19.5%
Other
Iraq
5.1% United Arab Emirates
Kuwait
5.6% Venezuela
13.3% Russia
7.2% Libya
Nigeria
Mexico
7.2%
9.9% Kazakhstan
Angola
8.3%
9.0% United States
China
Pareto Diagram or chart
A column chart for categorical data.
Columns (categories) are ordered
from highest frequency to lowest
Sorts the important few from the
trivial many.
Only useful on non-numerical data.
If there are lots of categories, it
rapidly pinpoints the important
groups.

Pareto Diagram
WorldOil Reserves
300

B
250
i
l b
l a 200
i r
o r 150
n e
s l 100
s
o 50
f

Country

Você também pode gostar