Escolar Documentos
Profissional Documentos
Cultura Documentos
(SUMMARY)
Observation
Observation
In a group of crickets, small ones seem to avoid large ones
Imagine that you are collecting samples (i.e. a number of individuals) from a population of little ball creatures - Critterus sphericales Little ball creatures come in 3 sizes:
Small =
Medium =
Large =
The real population (all the little ball creatures that exist)
Your samples
Each sample is a representation of the population BUT No single sample can be expected to accurately represent the whole population
Thrown quadrat??
13
4 6
7
5 2 9 14 11
10
12 15
Pseudoreplication
Sample size = 10
Sample size = 1
TYPES OF
DATA
RATIO DATA - constant size interval - a zero point with some reality
e.g. Temperature
Ordinal Scale
- ranked data
Nominal Scale
The kind of data you are dealing with is one determining factor in the kind of statistical test you will use.
The real population (all the little ball creatures that exist)
The real population (all the little ball creatures that exist)
Your samples
The real population (all the little ball creatures that exist)
m = SXi N
m - population mean
Your samples
X = SXi n
X = SXi n
X = SXi n
X = SXi n
X = SXi n
Sample mean
X = SXi n
X = SXi n
X = SXi n
Median =
+ 2
Median = X(n+1)
2
Measures of Dispersion Why worry about this?? -because not all populations are created equal
Distribution of values in the populations are clearly different BUT means and medians are the same
Measures of Dispersion 1. Range - difference between the highest and lowest values
Remember little ball creatures and the five samples
Range =
Measures of Dispersion 2. Mean Deviation X is a measure of central tendency Take difference between each measure and the mean
Xi - X
BUT
SXi - X = 0
So this is not useful as it stands
Measures of Dispersion 2. Mean Deviation (contd) But if you take the absolute value -get a measure of disperson
S |Xi - X|
and
S |Xi - X|
n
= mean deviation
Measures of Dispersion 3. Variance -eliminate the sign from deviation from mean Square the difference
(Xi - X)2
And if you add up the squared differences - get the sum of squares
S(Xi - X)2
Measures of Dispersion 3. Variance (contd) Sum of squares can be considered at both the population and sample level Population Sample
SS = S(Xi -
m)2
ss = S(Xi - X)2
Measures of Dispersion 3. Variance (contd) If you divide by the population or sample size - get the mean squared deviation or VARIANCE Population Sample
s2
= S(Xi N
m)2
Population variance
Measures of Dispersion 3. Variance (contd) Note something about the sample variance
Degrees of freedom or df or n
Measures of Dispersion 4. Standard Deviation - just the square root of the variance Population Sample
s = S(Xi N
m)2
Standard Deviation - very useful Most data in any population are within one standard deviation of the mean
NORMAL DISTRIBUTION
Discrete
Continuous
Other distributions
> 2 categories
Discrete
Continuous
Other distributions
> 2 categories
Central Limit Theorem Any continuous variable influenced by numerous random factors will show a normal distribution.
Data points that would be affected by a large number of random (=unpredictable) events physical activity age genes
Blood pressure
smoking diet
stress
So, for comparison between them, we need to standardize their presentation in some way
Example of a z-score calculation The mean grade on the Biometrics midterm is 78.4 and the standard deviation is 6.8. You got a 59.7 on the exam. What is your z-score? Z=X- s Z = 59.7 - 78.4 = -2.75 6.8
If you look at the formula for z-scores: z = value of a random variable - mean standard deviation
Each standard deviation away from the mean defines a certain area of the normal curve