Escolar Documentos
Profissional Documentos
Cultura Documentos
Section one
Paul Bottomley
Bottomleypa@cf.ac.uk
Silver, pp.24-26, 35-38, 45-48, 50-68
The Nature of Statistics
A statistic is anything we calculate from a set of data.
How much did you spend last time you went supermarket shopping
Ratio
(to the nearest )? ___47___
Cardiff 10K Road Race
Imagine that you enter the Cardiff
10 km. You get your race number, train
hard and complete the course. Yippee
but you wont beat Mo Farah!
What type of data are each of the
following variables?
What statistical operations are OK?
14
12
10
Modal
Frequency
8
6 category
4
2
0
#1 #2 #3 #4
Sound System
Price of TVs
16 9
14
Price of TVs 8
Typical value:
7
12
Typical value: 6
10
400 - 500
Frequency
Frequency
8 250 - 500 5
4
6
3
4 2
2 1
0 0
250 500 750 1000 1250 100 300 500 700 900 1100 1300
_
The sample mean is denoted X and pronounced x bar
In mathematical notation, for n data points, mean is
_
X
X 1 X 2 X 3 ... X n
X
n n
Central Tendency: Mean Cont.
Lets return to B&0 prices. B&O
Mean = (800 + 891 ++ 757) / 8 = 8251 / 8 800
= 1031.38 891
Typical B&O costs about 1031 (in 1996)! 1295
Remember the units when telling the story. 1451
Adv: It is easily understood and unique. 580
Calculation based on all data, so no 1192
information wasted. 1285
Disadv: may give strange results (2.4 kids) 757
Only appropriate for interval/ratio data. Sum = 8251
Can be distorted by outlying values.
Central Tendency: The Median
The median (Md) is the middle value when the data
are placed in ascending order.
50% of the data values are smaller than the median.
50% of the data values are larger than the median.
With n data points, the median position is (n + 1)/2.