Escolar Documentos
Profissional Documentos
Cultura Documentos
BIOSTATISTICS
------ Lovitt
Origin and development of
statistics in Medical Research
In 1929 a huge paper on application of
statistics was published in Physiology
Journal by Dunn.
In 1937, 15 articles on statistical methods
by Austin Bradford Hill, were published in
book form.
In 1948, a RCT of Streptomycin for
pulmonary tb., was published in which
Bradford Hill has a key influence.
Then the growth of Statistics in Medicine
from 1952 was a 8-fold increase by 1982.
C.R. Rao
Douglas Altman Ronald Fisher Karl Pearson
Gauss -
BIOSTATISICS
(1) Statistics arising out of biological
sciences, particularly from the fields of
Medicine and public health.
(2) The methods used in dealing with
statistics in the fields of medicine, biology
and public health for planning,
conducting and analyzing data which
arise in investigations of these branches.
Reasons to know about
biostatistics:
Medicine is becoming increasingly
quantitative.
The planning, conduct and interpretation
of much of medical research are
becoming increasingly reliant on the
statistical methodology.
Statistics pervades the medical literature.
Example: Evaluation of Penicillin (treatment
A) vs Penicillin & Chloramphenicol
(treatment B) for treating bacterial
pneumonia in children< 2 yrs.
Data Colllection
Inferential Statistiscs
Descriptive Statistics
Data Presentation
Estimation Hypothesis Univariate analysis
Measures of Location
Tabulation Testing
Measures of Dispersion
Diagrams Ponit estimate Multivariate analysis
Measures of Skewness &
Graphs Inteval estimate
Kurtosis
Main sources for the collection of data:-
1. Experiments
2. Surveys
3. Records
Principles followed for the presentation of
statistical data:
QUALITATIVE DATA
DISCRETE QUANTITATIVE
CONTINOUS QUANTITATIVE
Qualitative data:-
In such data there is no notion of magnitude or
size of the characteristic or attribute as the
same cannot be measured
There is only one variable i.e., frequency
They are classified by counting the individuals
having the same characteristic and not by
measurement
This data are DISCRETE in nature such as no.
of deaths in different years, population of
different towns and so on.
QUALITATIVE
Nominal
Example: Sex ( M, F)
Exam result (P, F)
Blood Group (A,B, O or AB)
Color of Eyes (blue, green,
brown, black)
ORDINAL
Example:
Response to treatment
(poor, fair, good)
Severity of disease
(mild, moderate, severe)
Income status (low, middle,
high)
Quantitative data:-
QUANTITATIVE (CONTINOUS)
Number of Children
Hb
CONTINUOUS DATA
DISCRETE DATA
Interval scale :
Data is placed in meaningful intervals and order. The unit of
measurement are arbitrary.
Data Colllection
Inferential Statistiscs
Descriptive Statistics
Data Presentation
Estimation Hypothesis Univariate analysis
Measures of Location
Tabulation Testing
Measures of Dispersion
Diagrams Ponit estimate Multivariate analysis
Measures of Skewness &
Graphs Inteval estimate
Kurtosis
Frequency Distributions
<9.0 0 2 2
9.0 9.9 1 3 4
10.0 10.9 3 5 8
11.0 11.9 6 8 14
12.0 12.9 10 6 16
13.0 13.9 5 4 9
14.0 14.9 3 2 5
15.0 15.9 2 0 2
Total 30 30 60
Elements of a Table
Ideal table should have Number
Title
Column headings
Foot-notes
Number Table number for identification in a report
Discrete data
--- Bar charts (one or two groups)
Continuous data
--- Histogram
--- Frequency polygon (curve)
--- Stem-and leaf plot
--- Box-and-whisker plot
Example data
68 63 42 27 30 36 28 32
79 27 22 28 24 25 44 65
43 25 74 51 36 42 28 31
28 25 45 12 57 51 12 32
49 38 42 27 31 50 38 21
16 24 64 47 23 22 43 27
49 28 23 19 11 52 46 31
30 43 49 12
Histogram
20
Frequency
10
20
Frequency
10
68 63 42 27 30 36 28 32
79 27 22 28 24 25 44 65
43 25 74 51 36 42 28 31
28 25 45 12 57 51 12 32
49 38 42 27 31 50 38 21
16 24 64 47 23 22 43 27
49 28 23 19 11 52 46 31
30 43 49 12
Stem and leaf plot
Stem-and-leaf of Age N = 60
Leaf Unit = 1.0
6 1 122269
19 2 1223344555777788888
(11) 3 00111226688
13 4 2223334567999
5 5 01127
4 6 3458
2 7 49
Box plot
80
70
60
50
Age
40
30
20
10
Descriptive statistics report:
Boxplot
- minimum score
- maximum score
- lower quartile
- upper quartile
- median
- mean
15 12 12
9 8 Frequency in the Y axis
10
5 and categories of variable
0 in the X axis
Smo Alc Chol DM HTN No F-H
Exer The bars should be of equal
Risk factor
width and no touching the
other bars
The distribution of risk factor among cases with
Cardio vascular Diseases
HIV cases enrolment in
USA by gender
Bar chart
12
Enrollment (hundred)
10
8
6
Men
4 Women
2
0
1986 1987 1988 1989 1990 1991 1992
Year
HIV cases Enrollment
in USA by gender
Stocked bar chart
18
16
Enrollment (Thousands)
14
12
10
8 Women
6 Men
4
2
0
1986 1987 1988 1989 1990 1991 1992
Year
Graphic Presentation of
Data
the frequency polygon
(quantitative data)
the histogram
(quantitative data)