Escolar Documentos
Profissional Documentos
Cultura Documentos
Learning Objectives
One of the most common problems in the conduct of an experiment involves missing
observations where it is not possible to obtain valid measurements on some of the
experimental units. Missing observations arise in an experiment (not as a result of
treatment) when plants are destroyed, animals die, errors are made in recording/
transcription, or recorded data are lost.
Two major difficulties arise when missing data occurs, namely, loss of information and
non-applicability of the standard analysis of variance.
It is important to stress that the definition of missing values and the suggested remedies
specifically exclude the case where recordings are not made because of a treatment effect.
1. Lost samples.
3. Improper treatment.
4. Due to weed competition and/or herbicide phytotoxicity when the experiment is not to
evaluate different weed control treatments.
6. Character whose measurement depends on the existence of some “yield” such as 100-
grain weight and panicle length, % infection (in case of detached leaf), number of
days to insect adulthood (insect died) root/shoot ratio (seedling did not germinate or
seedling died).
7. Outlying data are usually recognized after the data has been transcribed and recorded.
Data may be considered outlying if their values are too extreme to be considered
within the possible range of the normal behavior of the experimental materials.
Common errors resulting in outlying data are misread observations, incorrect
transcription, and improper application of the sampling techniques or misuse of
measuring instruments.
Remedial Measures
Missing data can be estimated if they occur in less than 40% of the total number of
replications of a particular treatment. Otherwise, missing data are non-estimable and the
treatments involved should be deleted. In handling missing data, determine first whether
each is a legitimate case of missing data, i.e., if it is not treatment-related.
a. For estimable missing data (< 40% of the number of replications of a given
treatment are missing):
i) Compute the appropriate estimate of each missing data value and run ANOVA
on the augmented data set (with proper adjustment of degrees of freedom and
error mean square).
b. For non-estimable missing data (> 40% of the number of replications of a given
treatment are missing).
ii) Multi-factor experiment. The level in which the missing observation occurred
should be deleted from one of the factors. The choice of which factor to
delete will depend on the major interest of the researcher.
a. < 40% of the number of treatments: Handle each case accordingly as discussed
above.
b. > 40% of the number of treatments: Delete the affected replication from the
analysis.
where
t = no. of treatments
R = total of row with missing data
C = total of column with missing data
T = total of treatment with missing data
G = grand total of all observed values
rM + bT − P
Split-Plot X=
( b − 1)(r − 1)
where
b = level of subplot factor
r = no. of replications
M = total of the specific main plot with missing data
T = total of the treatment combination with the missing data
tM + cT − P
Split-Split Plot X=
(c − 1)(r − 1)
where
c = level of sub-subplot factor
r = no. of replications
M = total of the specific subplot with missing data
T = total of treatment with missing data
P = total of all subplots with the same treatments as that of the missing data
a (bT − P) + r (aH + bV − B) − bL + S
Strip Plot X=
(a − 1)(b − 1)(r − 1)
where
a = level of horizontal factor
b = level of vertical factor
r = no. of replications data
T = total of treatment with missing data
P = total of the specific level of horizontal factor with missing data
H = total of the specific level of horizontal strip with missing data
V = total of the specific level of vertical strip with missing data
= 5,265 kg / ha
The missing value is replaced by X = 5,265 kg/ha and all sums of squares in the
analysis of variance are then computed as usual. However, the total and error d.f. are
reduced by 1, the number of missing values.
The correction factor for bias is subtracted from the treatment sum of squares and the
total sums of squares, i.e.,
[R − ( t − 1)X]2
Bias =
t ( t − 1)
[26,453 − (6 − 1)(5,265)] 2
=
6(6 − 1)
= 546
Source of DF SS MS Computed
Variation F
Replication 3 2,188,739 729,580
Treatment 5 1,139,955 227,991 2.07ns
Error 14 1,540,726 110,052
Total 22 4,869,420
Gomez, K.A. and Gomez, A.A. (1984). Statistical Procedures for Agricultural Research.
2nd. ed. John Wiley and Sons, Inc. New York.
Ostle, B. and Mensing, R.W. (1975). Statistical Research. 3rd ed. The Iowa University
Press. Ames, Iowa.
Snedecor, G.W. and Cochran, W.G. (1967). Statistical Methods. 6th ed. The Iowa State
University Press. Ames, Iowa.