Escolar Documentos
Profissional Documentos
Cultura Documentos
If we only observe the second table, we notice that B has completed more than A on both days
yet, the percentage of B is less when calculated on an overall level with the same sample size.
This easily misleads us into making different wrong assumptions.
The above example is a classic phenomenon of Simpsons Paradox.
Why is it important?
The practicality of Simpsons paradox surfaces when the data possess the dilemma of which part
of data must be used. Whether it should be the aggregate data or the partitioned one.
Even if we do take the partitioned data into, the unequal sample size can give misleading
inferences like in the example above.
Without aggregating the data, we could have easily concluded that B always performed better
than A just looking at the percentages and forget to appreciate the improving in A from Day 1 to
Day to and hence the overall completion.
The practical significance! Kidney stone Treatment
During one of the medical advancement, doctors were comparing the success rate of old kidney
stone treatment and the new treatment. The new treatment which was non-surgical were found to
be more successful than the old one. But a deeper investigation revealed that the new treatment
was mostly operated on patients with small kidney stones.
Hence, the doctors had to reevaluate the results taking the size of the stone as a crucial variable
and look at the partitioned data to conclude and avoid Simpsons paradox.