Escolar Documentos
Profissional Documentos
Cultura Documentos
net/epid168/exams/
d. There were 290 deaths due to all causes among the exposed
group and 983 deaths due to all causes among the unexposed
group. What measure of effect could be calculated to strictly
compare all-cause mortality between the exposed and the
unexposed group. (2 pts)
8. The issue of classification of disease is fundamental to
epidemiological investigations. The degree that we correctly
separate cases of disease from non cases can be quantified in terms
of specificity and sensitivity. The issue of correct classification is
important in research involving cerebrovascular disease (stroke).
Generally speaking there are two kinds of strokes, ischemic (blood
flow is restricted to brain tissue because of blocked artery in or
leading to the brain) and he morrhagic (a vessel in the brain
ruptures causing bleeding in the brain). These two pathologic
processes are quite different.
Background information:
A panel of experts reviewed the medical records of 525 patients
discharged from the hospital with diagnosis codes indicative of a
stroke (ICD 430-438). The panel classified strokes as either
ischemic or not ischemic. Assume the diagnos is reached by the
panel is the most accurate classification possible. Of the 525 cases,
325 had a discharge diagnosis code for ischemic stroke (ICD code
434). Of these 325 patients, 85 were determined by the panel not
to be ischemic strokes. All but 20 o f the patients with discharge
#
person
s
#
accident
s
293
100
974
27
300
Never
1106
15
8293
145
Total
6559
335
8693
153
Heavy
Moderate
#
person
s
#
accident
s
4479
Rate*
Boulder, Colorado
Rat
e*
Do not
develop a
stroke
Person
years (PY)
45
5,955
43,200
INACTIVE
135
13,865
100,800
Total
180
19,820
144,000
Physical
activity
level
ACTIVE
Incidence
per 1,000
PY
Explain:
Explain:
13.
Suppose that in 1998 researchers hypothesized that
communication ability and skill in young adulthood was
related to Alzheimers Disease. To test this they evaluated
hand written essays completed by a group of 350 nuns joining
a single religious sect in 1930. By careful review of these
writing samples, the researchers categorized all 350 as either
Low error
profile
# of
Death
s
Year of
Death
Alzheimers
Disease
1985
1985
Alzheimers
Disease
1990
1990
Alzheimers
Disease
1995
1995
Heart Disease
1980
Heart Disease
10
1980
Heart Disease
10
1990
Heart Disease
15
1995
Other
20
1960
Other
25
1960
Other
10
1970
Other
30
1970
# of
Deaths
Year of
Death
Alzheimers
Disease
1980
Alzheimers
Disease
Alzheimers
Disease
Alzheimers
Disease
Cause of Death
Cause of Death
d. Using data from this study compute an odds ratio for the
association of a high error communication profile with death
from Alzheimers disease. Show a clearly labeled 2x2 table. (2
pts)
Answer Guide
1. a. Manifestational criteria: disease definition and classification
based on observable characteristics, such as symptoms, signs,
history, labloratory findings, response to treatment, prognosis.
Causal criteria: disease definition and classification based on the
cause of the condition,
b. Manifestational criteria: Examples are cancers, arthritis,
cholescystitis, schizophrenia, depression, addiction, insomnia, . . .
Causal criteria : microbial diseases for which the pathogen has
been identified (syphilis, TB, malaria, yellow fever, influenza, etc.),
lead poisoning, birth trauma,
2. (C)- Other choices are incorrect because controls in case-cohort
studies are not matched to cases (A), contrrols are selected at
random with both designs (B), and cases must be selected without
regard to exposure (D).
3. New cases or events, population at risk or source population,
passage of time
4. The size of the population may have grown (number increases even
though rate does not); the age distribution of the population may
have changed (e.g., influx of families with small children,
outmigration of families with older children), so that agestandardized rate may not change but a greater proportion of the
population may be in the higher risk age range (assuming that
younger children have higher injury rates).
5. (D)- All of the above - use of prevalent cases requires that duration
is not related to exposure, controls should provide estimate of
exposure in study base, and rare disease assumption is required for
OR to estimate RR (though not for OR to estimate IDR).
12.
a. ARP = (I1 - I0) / I1 = (RR-1) / RR = (1.34-1.04) / 1.34 =
0.30 / 1.34 = 22% (after rounding)
The "I can't remember formulas" method:
ARP = attributable cases / all exposed cases = attributable
cases / 135
Attributable cases = attributable risk * Exposed PY = (1.341.04)*100,800 = 30.24
ARP = 30/135 = 22% (after rounding)
Interpretation: Based on these data, 22% (about one in five) strokes
in people who are physically inactive can be attributed to their
physical inactivity; in other words, if physically inactive people
became active early enough in their lives, their stroke incidence
would decrease by 22%
b. A key point here is that 27% is the prevalence of physically
active people, whereas the exposure is physical inactivity, whose
prevalence is therefore 100% - 27% = 73%
13.
a. This is a retrospective cohort study (researchers developed
the hypothesis in 1998).
b. High error profile: (2 + 5 + 6 + 5)/8021 = 2.24 per 1,000
women-years.
Low error profile: (1+3+4) / 12,287 = 0.651 per 1,000 wy
Women-years (WY) are computed as follows:
End
1980
1985
1990
1995
1980
1995
1960
1970
1998
Totals
Start
1930
1930
1930
1930
1930
1930
1930
1930
1930
Years
50
55
60
65
50
65
30
40
68
Women
2
5
6
5
10
15
25
30
52
150
WY
100
275
360
325
500
975
750
1,200
3,536
8,021
d.
Alzheimers Disease
Handwriting Profile
AD Yes
AD No
High error
18
132
Low error
192
Instructions:
o Write the last 4 digits of your ID number in space provide on each page (top
right).
o
Write clearly and legibly; avoid writing on the back of these pages.
1. Which of the following best describes the retrospective design where subjects are
sampled by disease status and is often used when the investigator is interested in rare
diseases. (4 pts)
A. intervention trial
B. case control study
C. retrospective cohort
D. ecologic study
E. none of the above
2. Which of the following best describes the study design that can be either retrospective
or prospective and is often used when the investigators are interested in rare
exposures. (4 pts)
A. intervention trials
B. cohort studies
C. prevalence studies
D. case control study
E. none of the above
3. The strength of an association is one of the criteria for evaluating the cause and effect
relationship between an exposure and outcome. Which of the following is a measure
of the strength of association? (Choose one best answer). (4 pts)
A. incidence rate among the exposed
B. cumulative incidence among the exposed
C. the ratio of odds of exposure among cases to the odds of exposure among the
non-cases
D. odds of disease among exposed relative to the prevalence of exposure in the
source population
E. none of the above
4. Incidence rates of a disease are often referred to as direct measures of risk. Can
incidence rates be calculated from case-control studies? Briefly explain in 1-2
sentences why they can or can not be calculated. (4 pts)
RATE
PROPORTION
NEITHE
R
RATE
PROPORTION
NEITHE
R
c. Prevalence
RATE
PROPORTION
NEITHE
R
d. Relative risk
RATE
PROPORTION
NEITHE
R
____ ____ a. A "J" or "U" shaped relationship of a continuous risk factor and continuous
measure of disease suggests a Pearson product-moment correlation coefficient
of near plus one or minus one.
____ ____
____ ____
____ ____
d. The study base for a case-control study consists of those people who if they
developed the disease could have been counted as cases.
e. The Bradford Hill criterion "coherence" means that the association has been
____ ____ observed repeatedly in different places, by different observers, and at different
times.
____ ____
7. The death rates from various conditions are often compared across geographic areas.
These comparisons are usually based on directly age-standardized mortality rates.
Which of the following best describes what is meant by an age-standardized rate
created by the direct method? (Choose one best answer). (4 pts)
A. The number of events in each age stratum of a standard population is used to
create a weighted average rate.
B. The event rates in each age stratum in the standard population are used to
create a weighted average rate.
C. The event rates in the geographic area of interest are applied to the agestratum sizes of a standard population to create a rate that is a weighted
average.
D. The event rates in the geographic area of interest are compared to the event
rates of a standard population to create a summary rate that is a weighted
average.
8. In order to estimate counts and rates of work-related fatalities, the National Traumatic
Occupational Fatality system has introduced a tick-box on the death certificate to
indicate "injury at work." Kraus et al. (Am J Epidemiol 1995; 141: 973-9) attempted
to validate this "injury at work" classification system against a gold standard
[International Classification of Diseases (ICD) death certificate codes designating
deaths that occurred during work-related activities]. After reviewing a sample of
100,000 death certificates, the authors reported the following: 1,195 true positives;
788 false positives; 97,672 true negatives; 345 false negatives. ("positive" indicates
that the tick-box was checked; "negative" indicates that it was not checked; "true"
indicates agreement between the tick-box and the ICD code).
a. Using the counts provided above, complete the 2x2 table below: (2 pts)
ICD Classification
Death
Certificate
Workrelated
Not workrelated
TOTAL
Work-related
Not workrelated
TOTAL
b. What are the sensitivity and specificity of the "injury at work" classification
system? (4 pts)
Cases of ARM
Never smokers
368
26
Ever smokers
864
79
e. Assuming causality, what is the proportion of cases of agerelated maculopathy that could have been prevented in the
population of males ages 43-86 in Beaver Dam if the smokers
had never smoked? Show your work. (4 pts)
10.
The following data come from a national survey of the
occurrence of back pain. A case of low back pain was defined as
having at least one episode of severe back pain occurring over a
period of 6 months. The number of cases was obtained from
surveys of different occupation groups as well as a national random
sample.
Cell phone
manufacturing
Textile
manufacturing
National random
sample
Age
Perso
ns
case
s
Rate
Perso
ns
Case
s
Rate
Perso
ns
Cas
es
rate
25-39
1000
.002
100
.02
10,00
0
30
.003
40-55
700
25
.037
500
30
.06
15,00
0
900
.06
55+
50
15
.300
1500
150
.100
15,00
0
120
0
.08
Total
1750
42
.024
2100
182
.087
40,00
0
213
0
.053
c. Can these two ratios in part (a) and (b) be compared? Briefly
explain why or why not. (3 pts)
11. The evidence supporting obesity as a risk factor for colon cancer
Number of incident
cases of colon cancer
Person-years
of follow up
<22
28
53,475
22 - <24
41
38,919
24 - <26
36
36,610
26 - <28
40
32,635
28 - <30
35
21,122
30+
42
34,904
mass index*
Crude incidence
rate/100,000
PY
b. Complete the table by calculating the crude body mass indexspecific incidence rates. (3 pts)
d. Calculate the attributable risk proportion of those in the 28<30 BMI category. In one sentence interpret your answer.
(the attributable risk formulas provided in class can be used
even though the data provide is for rates) (2 pts)
12.
Analyses of data from cohort studies often have to deal with
the reality that participants have unequal lengths of follow up.
Given the data below, calculate the (a) total person time (month) of
follow up, (b) the overall incidence density rate, (c) 13 month
cumulative incidence, and (d) the product limit estimate of failure.
Each horizontal line represents a cohort participant. Each vertical
line represents one month. Arrows indicate time of loss to follow
up. Black boxes indicate onset of disease (failure). (2 pts each)
a. ______________
b. ______________
c. ______________
d. ______________
Answer Guide
1. B. Case-control studies are said to use sampling by disease and are
suited for studying rare diseases.
2. B. Cohort studies can be either retrospective or prospective and
are often used to study rare exposures.
3. The ratio of odds of exposure among cases to odds of exposure
among noncases is the odds ratio, which is a measure of
association.
Workrelated
Not workrelated
TOTAL
1195
788
1,983
345
97,672
98,017
1,540
98,460
100,000
Never
smokers
Total
Case
ARM
cases
79
26
105
Status
Non-cases
785
342
1127
Total
864
368
1232
<22
28
53,475
52.4
22 - <24
41
38,919
105.3
24 - <26
36
36,610
98.3
26 - <28
40
32,635
122.6
28 - <30
35
21,122
165.7
30+
42
34,904
a. 43 person-months
2. The authors describe the study design they used as a "populationbased case-control study". Briefly explain how this is different than
a non-population based case-control study. Include in your answer
issues regarding the selection of cases, selection of controls, and
validity. (3 pts)
Self Report =
YES
Self report =
NO
YES
NO
7. Cases in this study were histologically confirmed. This is an
example of which of the following disease classification criteria?
Choose one best answer. (3 pts)
A. Causal criteria
B. Ecologic criteria
C. Manifestational criteria
D. Etiologic criteria
E. None of the above
8. Consider the data presented in Table 1 of this article. Which of the
following best represents the proportion of the risk of brain cancer
in the population that is attributable to working on a farm (farm
occupation). Assume that a farm occupation is causally related to
brain cancer risk. Choose one best answer. (4 pts)
A. 33%
B. 57%
C. 10%
D. 29%
E. Cannot be calculated from case-control studies
9. A case-control study like the one described in this paper is most
useful when it helps us understand what is happening in the study
base (underlying population). Which of the following best describes
the study base in this article? Choose one best answer. (3 pts)
A. The study base is those who if they developed brain cancer
could have been selected as a case.
B. The study base is those who have an equal probability to be
selected as a case or control.
C. The study base is those who are identified as cases or
controls after excluding non-responders.
14.
The authors state that they "found a dose-response
relationship among men between brain cancer and duration of
consuming drinking water from chlorinated surface water". Using
3 Bradford Hill criteria, in 3-4 sentences, address causality (or the
lack of causality) of the relationship of drinking water to brain
cancer. (4 pts)
15.
An early study of drinking water and brain cancer was an
ecological study conducted by the lead author of the present
article. In this study, brain cancer mortality rates in 923 U.S.
counties were compared with average levels of THM measured in
the drinking water supplies of those counties. For counties in which
the sampled water supply served at least 85% of the residents of
that county, the correlation coefficient between county-specific
mortality rates from brain cancer and trihalomethane levels was
0.24 in White men and 0.19 in White women. After reviewing this
paper, your colleague concluded that THM in drinking water are
causally related to brain cancer. However, you are more cautious in
your interpretation, citing the "ecological fallacy." Please define the
ecologic fallacy (2 pts) and describe why it limits the causal
16.
The authors used information provided by cases and controls
on place of residence, primary source of drinking water, and tap
water and total fluid consumption to create an index of cumulative
lifetime exposure. However, the natural history of cancer
(initiation, promotion, conversion, and progression) may encompass
many years. If drinking water is involved at the earliest stages of
brain cancer (initiation), then drinking water exposures in the
recent past may be more important than present exposures or
those in the distant past (e.g., in childhood). As defined in class,
which of the following periods would be important in defining the
minimal and maximal length of time expected between drinking
water exposure and diagnosis with histologically confirmed glioma?
Choose one best answer. (3 pts)
A. Induction period
B. One year case fatality
C. Latent period
D. Both a and c
E. None of above
17.
The authors included all cases of histologically confirmed
malignant brain cancers, including glioblastoma, fibrillary and
gemistocytic astrocytoma, and mixed glioma. If authors suspected
that drinking water exposure was associated with only certain
subtypes of brain cancer (i.e., disease heterogeneity), which of the
following strategies could they employ at the analysis stage? (3 pts)
A. Adjustment for cancer type using mathematical modeling
(e.g., logistic regression)
B. Stratification of cases by brain cancer type
C. Direct standardization by brain cancer type
D. Indirect standardization by brain cancer type
E. Matching cases and control by brain cancer type
18.
The authors restricted their analysis to those cases and
controls with at least 70 percent of their lifetime years with a
known source of drinking water. This approach was used to reduce
which type of bias? Choose one best answer (3 pts)
A. Confounding bias
B. Selection bias
C. Information bias
D. Random error
E. None of the above
19.
20.
a. Using the data in Table 3, label and complete a 2x2 table for
the association between brain cancer and >=40 years
residence with a chlorinated surface water source (versus 0
years), collapsing over sex (i.e., combine the data for men
and women). (4 pts)
b. Calculate the odds ratio for your 2x2 table in part a. Show
your work. (3 pts)
21.
Characteristics of cases and controls included in this study
are shown in Table 1. Using this information answer the following
questions.
a. Calculate the appropriate crude (unadjusted) measure of
association between farm occupation and brain cancer. Consider
those ever working on a farm as sufficient to be classified as having
a farm occupation. In 2 sentences or less interpret what this odds
ratio means. (4 pts)
Farm Occupation
CASE
CONTROL
YES
NO
b. Assume that 10% of the cases that were labeled as never having
worked on a farm truly had worked in such an environment.
Furthermore assume that 15% of the controls that were labeled as
having ever worked on a farm, in fact never really did work on a
farm. What would the true association be between farm occupation
and brain cancer? Assume that the classification of disease status is
valid. (4 pts)
Answer Guide
1. Case-control studies are well-suited for studying risk factors for
brain cancer because the disease is rare (hence difficult to study in
a cohort design). Also, the case-control design facilitates examining
many risk factors of current interest, a substantial advantage when
so few risk factors have been identified. A retrospective cohort
study can examine only exposures for which historical data are
available.
2. A "population-based case-control study" is a case-control study for
which the study base is a defined population. With a hospital-based
case-control study, it is difficult to specify the study base, since
which cases come to a given hospital is influenced by such factors
as seriousness and treatability of the disease, type of hospital, and
health care financing ability and arrangements. A representative
sample from this same defined population yields a control group
that permits valid estimation of odds ratios. In contrast, the validity
of measures of association estimated using a control group selected
from among hospitalized persons is always somewhat uncertain,
since it is generally impossible to know how well such controls
provide valid estimates of the study base.
3. B. The method of finding cases was passive surveillance.
4. D. Using incident cases allows the odds ratio to estimate the
incidence density ratio or risk ratio. In contrast, the exposure
Yes
No
Total
Yes private
well
20
15
35
No private
well
44
50
26
59
85
Total
water
40+ years
< 40 years
Cases
30
Control
s
38
423
13.
Average population
50,010
2,500
Cases
32
112
Control
s
246
780
20.
a.
Risk of brain cancer by number of years resided in a
dwelling supplied with chlorinated surface water
Cases
Controls
Total
>=40 years
None
Total
13 + 7 = 20
92 + 78 = 170
190
81 + 60 = 141
875 + 400 =
1275
1416
161
1445
1606
a.
Farming occupation and brain
cancer risk
Farming occupation
Yes
No
Total
Cases
85
206
291
Control
s
628
1355
1983
Total
713
1561
2274
OR = (85x1355) / (206x628)
approx.= 0.89
The OR of 0.89 indicates no (or possibly a slight inverse)
crude association between brain cancer risk and having had a
farming occupation.
b. If 10% of "unexposed" cases in fact had had a farming
occupation, then 0.10x206=21 cases should be reclassified as
exposed; if 15% of "exposed" controls in fact had not had a
farming occupation, then 0.15628=94 controls should be
reclassified as exposed. The resulting table and OR would be:
Farming occupation and brain cancer risk
Farming occupation
Yes
No
Total
Cases
85 + 21 = 106
206 21 =
185
291
Control
s
628 94 =
534
1355 + 94 =
1449
1983
Total
640
1634
2274
1. Briefly list two reasons why a case control study is (or is not) appropriate to
examine individual risk factors for hip osteoarthritis. (2 pts)
2. The authors state that their cases come from a defined population. List four
features of the population or the study design that support this statement or helped the
authors to achieve it? (4 pts)
3. Considering the study population, study design, and other information in the article,
which of the following statements is (are) TRUE and which is (are) FALSE. (2 pts
each)
a. In these two health districts, the incidence density of symptomatic hip
osteoarthritis of sufficient severity to warrant hip arthroplasty exceeds 40 per
100,000 person-years.
b. If about 12% of the population was age 65 years or older, then about 12,000
people age 65 years or older in the two districts have radiographic evidence of
hip osteoarthritis.
c. The data in Table 1 demonstrate that women are 1.9 times as likely to
develop severe symptomatic hip osteoarthritis as are men.
d. The data in Table 2 indicate that female gender is not a risk factor for hip
osteoarthritis.
e. In this study, matching the control group to the cases on age, as opposed to a
random sample of the general adult population, probably resulted in greater
statistical power and precision.
4. The case identification process was based on a register in each district made up of
persons on a waiting list for a total hip arthoplasty (surgical reformation of the hip
joint). Waiting lists for procedures are common in societies with a nationa l or social
medicine system. In the United States, a region wide waiting list for a hip arthoplasty
is unlikely, as the availability of receiving this procedure would be more related to
insurance status or ability to afford such a procedure. Explain how using the register
system in the Untied Kingdom to select cases either increases or decreases the
possibility of selection bias as compared to a study conducted in the United States. (4
pts)
5. How was the diagnosis of hip osteoarthritis made in this study? Was this based on
manifestional or causal criteria? Explain your answer. (3 pts)
6. According to the authors: "For each case, a control of the same sex and age was
selected from the list of the same general practice held by the county Family Health
Service Association". State in one sentence the rationale for using a list from ge neral
practioners? (3pts)
7. Eighty-four percent of the patients listed for total hip arthroplasty fulfilled the
criteria for entry into the study as cases. Which of the following best describes the
criteria: (3 pts)
a. age > 45 years, being on the waiting list for hip arthroplasty, and the
presence of Heberdens nodes.
b. age > 45 years, pain duration at least for 36 months, and presence of
Heberdens nodes.
c. history of hip fracture within the past year, being on the waiting list for hip
arthroplasty and reside in the study area.
d. presence of Heberdens nodes, history of hip fracture within the past year,
and reside in the study area.
e. being on the waiting list for hip arthroplasty, reside in the study area, and
age > 45 years
8. The authors report that 89% of the eligible cases agreed to participate and 60% of
the 1060 controls approached agreed to participate. Which of the following best states
a condition regarding the non-responders that could lead to an odds ratio re ported for
the risk of osteoarthritis associated with previous hip injury that is biased away from
the null (>1). Choose one best answer. (3 pts)
a. control non-responders are more likely to have a history of hip injury
compared to case non-responders.
10. The authors selected controls who were individually matched to cases by age,
gender, and family practitioner. Matching in the design stage is usually considered
only for those variables that are known to be confounders. Under which of the follow
ing circumstances could gender be a confounder of the association between a risk
factor (obesity) and the outcome (hip osteoarthritis)? Circle all that apply. (4 pts)
a. the prevalence of obesity and the prevalence of hip osteoarthritis are both
higher in men that in women
b. the prevalence of obesity is lower in men than women, but the prevalence of
hip osteoarthritis is higher in men than women.
c. the prevalence of obesity is higher in men than women, but the prevalence
of hip osteoarthritis is the same in men and women.
d. the prevalence of obesity is the same in men and women, but the prevalence
of hip osteoarthritis is higher in men than women.
11. The odds ratios in Table 2 are "mutually adjusted for the other two variables" by
logistic regression. The following questions concern the models used to estimate the
odds ratios in the table (ignore the fact that it was "condit ional" logistic regression
and ignore the middle categories for body mass index and presence of Heberdens
nodes) (2 pts each):
a. How many logistic models were necessary to estimate the odds ratios for
body mass index >28.0, definite Heberdens nodes, and previous hip injury
among women.
b. The odds ratio estimate for hip injury in women was 2.8. What must the
logistic coefficient have been?
c. From this table, estimate the odds ratio for women who had both definite
Heberdens nodes and previous hip injury compared to women who had
neither.
12. In this study, information on medical history, life style, and leisure time physical
activities was obtained through a "structured interviewer-administered questionnaire".
(page 517). It is possible that persons on a waiting list for a hip arthoplasty would be
more keenly aware of hip injuries they may have had in the past than controls. If true,
this is an example of which of the following? Choose one best answer. (3 pts)
a. differential case ascertainment bias
b. differential misclassification bias
c. differential selection bias
d. differential precision bias
e. none of the above
13. Among women, the odds of previous hip injury is higher among cases than
controls (Table 2; OR=2.8). As indicated in the footnotes for Table 2, the odds ratio
for pervious hip injury is adjusted or controlled for the other two variables in the Ta
ble (body mass index and Heberdens nodes). Using the counts shown in Table 2,
calculate an unadjusted (crude) odds ratio for previous hip injury in women. (3 pts)
Unadjusted (crude) odds ratio = _________
14. Which of the following conclusions can be made from the above results? (choose
one best answer) (3 pts)
a. the unadjusted (crude) association between hip injury and hip osteoarthritis
in women is completely confounded by body mass index and Heberdens
nodes.
b. since the unadjusted and adjusted odds ratios are similar, the risk factor (hip
injury) must not be associated with the adjustment variables (body mass index
and Heberdens nodes)
c. since the unadjusted and adjusted odds ratios are similar, there is no effectmeasure modification of the association between hip injury and hip
osteoarthritis.
d. none of the above
15. The odds ratios presented in Table 5 are adjusted for previous hip injury. Why
might they still be confounded by hip injury? (3 pts)
16. In Table 6, is the crude association between previous hip injury and risk of
unilateral hip osteoarthritis biased towards the null or away from the null? (2 pts)
17. Based on the data in Table 3, what is the odds ratio for Heberden's nodes (definite
versus none) for persons in the Upper tertile of body mass index? (3 pts)
18. Rothman has proposed that "public health synergism" is present when an observed
joint effect exceeds that expected under the additive model. Do the odds ratios in
Table 3 indicate the presence of "public health synergism" for effect of Heberden 's
nodes and elevated body mass index on hip osteoarthiritis? If not, do the odds ratios
conform to a multiplicative model? Include in your answer a 1-2 sentence assessment
of whether these data indicate "public health synergism". (For this question, ignore
the row for "Possible" Heberden's nodes and the column for the middle tertile of body
mass index, and assume that both Heberdens nodes and elevated BMI reflect casual
risk factors for hip osteoarthritis. Note: do not necessarily rely on the autho rs'
description of this table.) (6 pts)
19. The authors investigated the association of specific sporting activities with risk of
hip osteoarthritis. Their data are presented in Table 5. Using their data, compute
separately the unadjusted (crude) risk of osteoarthritis associated with pla ying golf
and for swimming in men and women combined. Consider those who do not
participate in any sport as the reference group and assume no missing data. Show two
appropriate 2x2 table and your calculations. (4 pts)
19a. Compare these unadjusted (crude) odds ratios with the ones presented in Table 3.
Briefly describe and explain the comparison. (3 pts)
19b. Consider the possibility that golfers who have hip osteoarthritis are reluctant to
seek medical attention for their condition for fear it will mean the end of their ability
to play golf. Therefore, cases who golf are less likely to be se lected for this study
than cases who do not golf. If the true OR associated with golf is 2.0, then which of
the following best describes the selection bias and its impact on the odds ratio you
computed. (3 pts)
a. non-differential selection bias resulting in an odds ratio biased toward the
null.
b. non-differential selection bias resulting in an odds ratio biased away from
the Null.
c. differential selection bias resulting in an odds ratio biased away from the
null.
d. differential selection bias resulting in an odds ratio biased toward the null.
e. none of the above
19c. The authors state that "...the association with swimming may have arisen because
patients with hip osteoarthritis were advised to swim..." (page 521). Suppose that 25%
of the cases had been incorrectly classified as swimmers and assume that the
misclassified cases had not participated in any other sporting activity, either. Recompute the odds ratio for the association of hip osteoarthritis and swimming, after
re-classifying these individuals, using the number from the 2x2 table in question 19
above. Briefly discuss how your conclusion about the role of swimming does (or does
not) change. In what direction did misclassification bias the study OR? (3 pts)
20. The odds ratio (95% confidence interval) estimating the risk of osteoarthritis
associated with a previous hip injury was 24.8 (3.1-199.3) in men and 2.8 (1.4-5.8) in
women (see Table 2).
a. Which estimate indicates a stronger association? (2 pts)
b. Which estimate is more precise? (2 pts)
21. Which one of the statements best interprets the following passage? (3 pts)
"In a previous case-control study (17) of men aged 60-76 years, we observed a
doubling of risk for hip osteoarthritis among those in the highest third of body
mass index distribution, as compared with those in the lowest third, although
the increased risk was not statistically significant." (p519 bottom of right
column)
a. Hip osteoarthritis is not as significant when it occurs in obese older patients,
because it is expected that overweight that lasts for many years will lead to
damage to the joints.
b. A doubling of risk is not significant from a statistical perspective, because it
represents only a moderate association.
c. The doubling of risk was not statistically significant because a p-value was
not computed, so it is not possible for the authors to know whether the
increased risk was due to chance.
d. If 1,000 independent random samples the same size as that study population
were drawn from a population with no increased risk of hip osteoarthritis,
fewer than 950 would have an OR between 0.5 and 2.0.
e. If 1,000 independent random samples the same size as that study population
were drawn from a population with a doubling of risk of hip osteoarthritis for
the highest third of the body mass distribution, as compared with the lowest
third, more th an 5% of the samples would display no elevation in risk.
f. If 1,000 independent random samples the same size as that study population
were drawn from a population with a doubling of risk of hip osteoarthritis for
the highest third of the body mass distribution, as compared with the lowest
third, fewer t han 80% would display an association of that magnitude.
22. A medical journalist, confused by the thrust of this article, comes to you and says:
"I've read this article several times, but I can't figure out what it shows about the
relationship of body mass index, Heberden's nodes, and hip osteoarthri tis. The
authors explain that 'two broad mechanisms are believed to underlie the pathogenesis
of osteoarthritis at any joint site: mechanical stress and a generalized predisposition to
the disorder' as indexed by Heberdens nodes [p519 right column]. T hat seems
straightforward enough, and they later conclude that the analysis 'supports the notion
that this condition arises through an interaction between a generalized predisposition
to the disorder and specific mechanical insults to the hip' [p521]. Y et on page 518
[right column], the authors state that there was 'no statistically significant interaction'
between body mass index and Heberden's nodes, and on page 519 [left column] they
refer to obesity and a tendency to polyarticular involvement as 'i ndependent risk
factors for hip osteoarthritis'. Would you please assess for me what this article shows
about the relationship among body mass index, Heberden's nodes, and hip
osteoarthritis? I have room for 40-60 words. Thanks!" (6 pts)
23. Write a brief statement for or against a causal relationship between hip injury and
risk of osteoarthritis. Comment specifically on at least two of Bradford Hills criteria
for causal inference. Support your conclusion with data or statements f rom the
article. (4 pts)
Answer Guide
1. Briefly list two reasons why a case control study is (or is not)
appropriate to examine individual risk factors for hip osteoarthritis. (2
pts)
Condition rare, faster to complete than cohort study, wide range of
exposures of interest.
2. The authors state that their cases come from a defined population.
List four features of the population or the study design that support this
statement or helped the authors to achieve it? (4 pts)
1. The two health districts had a centralized orthopedic facility for
assessment and treatment of hip osteoarthritis;
2. Local orthopedic surgeons were willing to enter all patients into
the study;
3. All men and women 45 years and older who were placed on the
waiting list for primary total hip arthoplasty were considered for
the study;
4. The authors included patients who consulted orthopedic
surgeons privately.
5. The study excluded patients who lived outside the two districts.
The diverse socioeconomic profile was an advantage for
generalizability but does not make this a defined population.
e. being on the waiting list for hip arthroplasty, reside in the study
area, and age > 45 years (answer)
8. The authors report that 89% of the eligible cases agreed to participate
and 60% of the 1060 controls approached agreed to participate. Which of
the following best states a condition regarding the non-responders that
could lead to an odds ratio reported for the risk of osteoarthritis
associated with previous hip injury that is biased away from the null
(>1). Choose one best answer. (3 pts)
a. control non-responders are more likely to have a history of hip
injury compared to case non-responders. (answer)
b. control non-responders are less likely to have a history of hip
injury compared to case non-responders.
c. being a non-respondent is not related to previous hip injury.
d. none of the above
9. What was accomplished by replacing controls who refused to
participate? (Choose one best answer) (3 pts) If controls who refused
had not been replaced:
a. selection bias would have been greater;
b. the control group would have been less representative of the
study base;
c. probability of a Type I error would have been greater;
d. probabillty of a Type II error would have been greater; (answer)
e. nondifferential misclassification bias would have been greater.
f. it would have been necessary to control for age and sex in the
analysis.
Answer: d. Failure to replace controls who refused would have
reduced both the number of controls and of cases (due to the
matching), with a loss of statistical power and increase in the
probability of a type II error.
10. The authors selected controls who were individually matched to cases
by age, gender, and family practitioner. Matching in the design stage is
usually considered only for those variables that are known to be
confounders. Under which of the following circumstances could gender
be a confounder of the association between a risk factor (obesity) and the
outcome (hip osteoarthritis)? Circle all that apply. (4 pts)
Body mass
index
Lowest third
Middle third
Highest third
1.0
1.1 (0.7-1.8)*
1.6 (1.0-2.7)
Possible
1.5 (0.8-2.7)
1.5 (0.8-2.6)
2.0 (1.1-3.6)
Definite
1.4 (0.9-2.3)
2.2 (1.4-3.7)
3.2 (1.9-5.4)
None
Cases
Controls
YES
51
34
NO
140
162
OR = 1.7
Swimming
Cases
Controls
YES
156
110
NO
140
162
OR = 1.6
19a. Compare these unadjusted (crude) odds ratios with the
ones presented in Table 3. Briefly describe and explain the
comparison. (3 pts)
Table shows 1.4 and 1.5, respectively. This suggests
that BMI, nodes, and hip injury explain very little of the
association of these two sports with hip osteoarthritis.
19b. Consider the possibility that golfers who have hip
osteoarthritis are reluctant to seek medical attention for their
condition for fear it will mean the end of their ability to play
golf. Therefore, cases who golf are less likely to be selected
for this study than cases who do not golf. If the true OR
associated with golf is 2.0, then which of the following best
describes the selection bias and its impact on the odds ratio
you computed. (3 pts)
a. non-differential selection bias resulting in an odds
ratio biased toward the null.
b. non-differential selection bias resulting in an odds
ratio biased away from the null.
c. differential selection bias resulting in an odds ratio
biased away from the null.
d. differential selection bias resulting in an odds ratio
biased toward the null. (answer)
e. none of the above
Cases
Controls
YES
156-25% = 117
110
NO
140 + 39 = 179
162
Match the term from column A with the most appropriate topic or
concept from column B (use each term only once and each topic only
once). (1 pt each = 12 pts)
Column A - Terms
Column B - Topics
____
cumulative incidence
1. Case-control studies
____
incidence density
2. Causal inference
prevalence
3. Confounds cross-sectional
____
dose response
4. Death certificate
____
induction period
5. Descriptive epidemiology
____
odds ratio
6. Diagnostic tests
____
7. Estimates risk
____
8. Measures impact
____
____
____
migrant studies
11. Proportion
____
cohort effect
____
data
2.
myocardial infarction?
____
a.
manifestational criteria
____
b.
Bradford criteria
____
c.
causal criteria
____
d.
etiologic criteria
3.
many
a.
____
b.
____
c.
____
d.
4.
a.
____
b.
____ c. prevent bias introduced when the investigators know what type
of treatment the patients are receiving
____
5.
____
d.
b and c
The indirect method of age standardization applies stratumspecific rates from an external population to the age
distribution of the study population.
____
b.
A standardized mortality ratio is an example of a stratumspecific crude rate.
____
c.
Standardized mortality
comparisons among multiple populations.
____
d.
ratios
are
perferred
for
making
be compared.
Placebo
All
121
121
242
Non-infected
112
90
202
HIV-infected
31
40
7.4
25.6
16.5
Births (no.)
Infection status of infant
___________________________________________________________________
7A.
____
Which one answer best describes the transmission rate in the table?
(4 pts)
a.
proportion
____
b.
relative rate
____
c.
absolute rate
____
d.
odds
7B.
Using the data in the table, estimate the relative risk of HIV
infection for infants whose mothers took zidovudine relative to
infants of mothers who took placebo. Show formula and
calculations. (4 pts)
_______________________________________________________________
_______________________________________________________________
_______________________________________________________________
_______________________________________________________________
_______________________________________________________________
_______________________________________________________________
7C.
7D.
Zidovudine is now routinely offered in association with all
pregnancies to known HIV-seropositive mothers in the United States.
However, growth of resistant strains will reduce the drug's effectiveness
in preventing perinatal HIV transmission.
Observational studies for
assessing zidovudine's effectiveness have serious methodologic problems,
but which of the following case-control designs would be the most nearly
valid? (Choose one best answer.) (4 pts)
____ a.
infants.
Results:
Table 1. Behavioral variables in 966 subjects
___________________________________________________________________
Variable
Mean (SD)
Range
Median
8B.
mean
b.
SD
c.
range
d.
median
b.
c.
d.
Table 2.
7/930
Gonorrhea
42/940
Chlamydia
66/957
109/908
_______________________________________________________
8C.
Based on the above data and assuming that the the two diseases have
the same average duration, how do their incidence rates compare in
this population? (Choose the one correct answer.) (3 pts)
a.
b.
c.
8D. Based on the above data but this time assuming that the two
diseases have the same incidence, how do their average durations compare in
this population? (Choose the one correct answer.) (2 pts)
a.
b.
9C. What measure would you use to quantify the strength of association
between cigarette smoking and AA-10? Show the formula for this measure,
substitute the appropriate numbers for that formula, compute the result,
and state its meaning in one sentence. (4 pts)
a. Formula
b. Substitution
c. Result
d. Meaning ____________________________________________________
_______________________________________________________________
_______________________________________________________________
_______________________________________________________________
Syphilis
Gonorrhea
Chlamydia
0
10
15
1
8
23
0
15
8
3
21
18
1
11
17
Dropouts (cumulative)
10
30
50
90
120
21
24
2
12
17
3
19
14
4
24
11
140
190
270
Number tested
890
870
850
810
780
760
710
630
____________________________________________________________________
(Subjects can become infected with the same organism more than once
and/or become co-infected with more than one organism.)
10A.
pts)
10B.
What is the average incidence density (per 100 person months or per
100 person years) of chlamydia for the two years of follow up?
Assume that: dropouts contribute no time to follow up after the last time
they are tested; subjects remain at risk even while infected. (3 pts)
10C. Give two reasons for preferring incidence density over cumulative
incidence for assessing frequency of infection in this cohort. (6 pts)
i. ___________________________________________________________
_______________________________________________________________
ii. ___________________________________________________________
_______________________________________________________________
11.
design
that
makes
it
_______________________________________________________________
_______________________________________________________________
_______________________________________________________________
11B. Many of the criteria for causal inference pertain to the evaluation
of evidence from multiple studies, but several can also apply to a single
study. Name two (2) such criteria and use them to evaluate (quantitatively
where possible) the evidence from the above study. (6 pts)
i. ___________________________________________________________
_______________________________________________________________
_______________________________________________________________
_______________________________________________________________
_______________________________________________________________
ii. ___________________________________________________________
_______________________________________________________________
_______________________________________________________________
_______________________________________________________________
_______________________________________________________________
Answer Guide
1.
data
Matching (1 pt each):
Column A - Terms
7 cumulative incidence (11 is ok)
12 incidence density
11 prevalence (7 is ok)
2
9
1
8
4
6
10
5
3
1.
2.
3.
Column B - Topics
Case-control studies
Causal inference
Confounds cross-sectional
dose response
4. Death certificate
induction period
5. Descriptive epidemiology
odds ratio
6. Diagnostic tests
preventive fraction in the exposed 7. Estimates risk
underlying cause of death
8. Measures impact
positive predictive value
9. Natural history of disease
detectable, pre-clinical phase
10. Population screening
migrant studies
11. Proportion
cohort effect
12. Relative rate
(Credit was also given for some other pairings.)
5.
T
_ _ _ _ _ _ _ _ _ _ _ _
|
|
^
|
|
|
|
|
|
|
v
|_______________________
|
|
s
.
|_______________________
(25.6% - 7.4%) / 25.6%
0
=
Since both diseases have the same incidence, the ratio of their
durations equals the ratio of their prevalence odds:
prev. odds for gonorrea
-----------------------prev. odds for chlamydia
42 / 898
-------66 / 891
0.468
------0.741
= 0.63
(Credit was also given for "prevalence = incidence x duration", though this
true only approximately.)
9A.
AA-10*
Absent fewer than
10 days
Smokers
------100
Nonsmokers
---------300
Total
----400
1,100
6,500
7,600
------
-----
-----
Total
1,200**
6,800
8,000
9C.
CI in smokers
----------------CI in nonsmokers
8.3%
-----4.4%
1.89
8.3% = incidence
in
|XXXXXXXXXXXXXXX|
|
|
|
|
3.9% x 1,200 |
= 47
|
persons
3.9% =
|XXX
XXXX|
|\\\\\\\\\\\\\\\|
| 4.4% x 1,200 |
|\\
= 53
\\|
300
6,800
Nonsmokers
risk"
1,200 (15%)
Smokers
4.4% = incidence
in
persons
This number
Number of smokers
(I1 - I0)
(8.3% - 4.4%)
x
x
1,200
1,200
(Total) Cases
--------------------(Total) person-time
123 cases
= ------------------ = 0.65/100 person-months = 7.8/100 person-yrs
18,930 person-months
10C
These diseases have an extended risk period (i.e., one longer than
the period of observation)
People can acquire these diseases more than once
Different lengths of follow-up time per subject
"I have neither given nor received help from others in completing
this examination."
o
___________________________________________________________________________
___________
b. histologically-confirmed
12.
For each of the following statements, indicate if it is
TRUE OR FALSE: (1 pt each)
13.
A list of control variables for use in the logistic
regression models appears on page 325, middle of column 2.
These variables have been chosen because they (choose one
best answer): (2 pts)
A. are likely to be associated with breast cancer risk in the
bottle-fed women.
B. are known or suspected risk factors for breast cancer,
or at least proxies for such factors
C. are likely to be associated with infant feeding history in
the controls
D. are likely to be associated with infant feeding history in
the cases
14.
The presentation of data in Table 2 can be used to
examine a number of relationships. Using these data give a
numerical example of each of the following (show your work
and in one sentence explain what the number means): (2 pts
eac h)
a. An association between breast cancer risk and having
zero pregnancies. Use > 3 pregnancies as a reference.
b. An association between having been breastfed and
being over 165 cm in height. Use <160 cm as a
reference.
c. An association between breast cancer and having been
breastfed, overall.
15.
On page 326, 2nd column, the authors state "As shown
in Table 3, the risk of breast cancer associated with having
been breastfed, was about 0.7 for both pre- and
postmenopausal women." In this context, to which of the
following epi demiologic measures does the term "risk"
refer? Choose one best answer. (2 pts)
A. Cumulative incidence
B. Incidence density
C. Attributable risk
D. Odds ratio
16.
Using the data in Table 3, estimate AND state the
meaning of the following measures (for this question you
may ignore the possibility of selection bias in cases and
controls):
a. Attributable Risk Proportion (ARP) for NOT having
been breastfed for all breast cancer (both
premenopausal and postmenopausal breast cancer,
combined). Note that an ARP is also known as the
etiologic fraction in the e xposed. (3 pts)
b. Population Attributable Risk Proportion (PARP) for
NOT having been breastfed for premenopausal and for
postmenopausal breast cancer, separately (i.e., 2
PARP's). Note that the PARP is also known as the
etiologic fract ion. (4 pts)
17.
In the multiple logistic model referred to as Model 2 in
Table 3, what was the coefficient for the variable not-havingbeen-breastfed among all breast cancer cases? (2 pts)
Which of the following assumptions is involved in that
model? Indicate True or False for each assumption. (1 pt
each)
a. The odds of breast cancer vary as the product of the
odds for age and the odds for education.
b. The odds of breast cancer vary as the sum of the odds
for age and the odds for education.
c. Age, education, and not having been breastfed were
independent of (i.e., uncorrelated with) each other.
d. Breast cancer is a rare disease.
18.
Suppose that cases who refused to participate in this
study were less likely to have been breastfed as infants than
those who participated in the study. Which of the following
best describes what this fact would imply for the obser ved
relative risk associated with being breastfed compared with
what would have been observed had all persons participated
I the study? (choose one best answer). (2 pts)
A. the observed relative risk would be biased away from
the null.
B. the observed relative risk would be subject to selection
bias and the direction of the bias can not be estimated.
C. the observed relative risk would be biased toward the
null.
D. the observed relative risk would be subject to
misclassification bias and the direction of the bias can
not be estimated.
19.
In table 3, the confidence intervals for the OR's for all
women do not include the value 1.0, whereas all but one of
the OR's for premenopausal breast cancer and
postmenopausal breast cancer do. Mathematically, what does
this patte rn reflect? (2 pts)
Age > 60
Total
Breastf Bottlefe
ed
d
Breastf Bottlefe
ed
d
Breastf Bottlefe
ed
d
Cases
24
40
256
100
280
140
Contro
ls
79
86
204
54
280
140
OR
0.653
0.678
1.0
21.
An epidemiology graduate student finds evidence in the
literature that childhood sunlight exposure may affect adult
breast cancer risk. To explore this hypothesis, she obtains
from the authors the palace of birth for all of the sub jects in
the present study and constructs a sunlight exposure
variable ('high" or "low") based on geologic and meteorologic
data for the years of the subject's childhood. Her data show
Control
s
Total
24
Low sunlight
Case
s
Control
s
Total
67
81
36
191
284
22.
Use the data from Table 2 (Distribution of
Characteristics of Postmenopausal Cases and Controls) to
draw separate 2 x 2 tables for women who have had : 0
pregnancies, 1-2 pregnancies, and >=3 pregnancies. (5 pts)
a. calculate odds ratios for each of these three categories.
b. Assuming no effects of confounding, interpret your
findngs in part (a).
23.
A hypothetical cross-sectional ancillary study to this
report was conducted. In that study a survey of breast cancer
annual incidence rates in geographically distinct areas was
completed. Region A in the upper Midwest were breast c
ancer mortality is high, and Region B the Southeast where
mortality from breast cancer is low. The following data were
obtained.
Region A
Region B
Age
No.
of
cas
es
Populati
on
Rate/1,0
00
No.
of
cas
es
Populati
on
Rate/1,0
00
4050
10
7,000
1.4
10
15,000
0.7
5160
15
10,000
1.5
20
5,000
4.0
6165
30
3,000
10
600
55,000
10.9
Tot
al
55
20,000
630
75,000
4050
1,000
5.0
2,000
3.0
5160
2,000
2.5
10
15,000
0.7
6165
500
8.0
1,000
4.0
Tot
al
14
3,500
20
18,000
69
23,500
650
93,000
< High
School
Educati
on
>=
High
School
Educati
on
Grand total
Crude
2.9
24.
Write a brief statement for or against a causal
relationship between breastfeeding in infancy and risk of
breast cancer as an adult. Comment specifically on at least
two of Bradford Hill's criteri for causal inference. Include in
y our comments data or statements from the article. (5 pts)
25.
Assuming that this relationship is causal, why might a
similar study, 50 years from now, fail to find as strong a
relationship? (2 pts)
Answer Guide
1.
C.
2.
c.
D.
a.
b.
8.
B.
9.
A.
10.
Table:
breastfed
Breastfeeding biomarker found
S
e
l
f
r
e
p
o
r
t
Yes
No
Total
-------------------------------------------Breastfed
70
26
96
Not breastfed
80
28
108
-------------------------------------------Total
150
54
204
b.
c.
d.
a. Table:
Adult breast cancer by having been breastfed as an infant,
among premenopausal women with education beyond high school
Breastfed
Not breastfed
Case
Control Total
-----------------------61
93
154
69
61
130
-------------------------
Total
130
154
284
Table:
Adult breast cancer by having been breastfed as an infant,
among premenopausal women with education beyond high school,
assuming that 20% of controls who reported having been
breastfed had in fact not been
Breastfed
Not breastfed
Total
Cases
Controls Total
------------------------61
74
135
69
80
149
------------------------130
154
284
no association.
Body mass
index (kg/mz)
16-22
23-27
>27
Cases
------------------------Breastfed
Not breastfed
---------- -------------48
103
90
15
26
17
Controls
------------------------Breastfed
Not breastfed
--------------------89
125
91
19
16
16
To show the details, here is a table for estimating OR's for body mass
index and breast cancer:
Body mass
index (kg/m sq)
16-22
23-27
>27
Breastfed
--------------Cases Controls
48
89
103
125
90
91
Not breastfed
--------------Cases Controls
15
19
26
16
17
16
Total
--------------Cases Controls
63
108
129
141
107
107
and the resulting OR's are [e.g., (90 * 89) / (48 * 91) = 1.83]:
Breastfed
Body mass
--------index (kg/m sq)
16-22 (ref. level)
1.0
23-27
1.83
>27
1.83
Not breastfed
-------------
Total
---------
1.0
2.06
1.34
1.0
1.57
1.71
The OR's in the total column are shown to illustrate that in this
case there is some confounding by breastfeeding history, at body
mass index level 23-27 kg/m sq. Within either breastfed or not
breastfed group there is no "dose-response" relationship.
g. True - Generally, generally an outbreak investigation begins after
the outbreak has begun and the investigation seeks to determine what
characteristics of cases might have been responsible for their disease. If
the cases happened to be part of an existing cohort for which the requisite
exposure information was already available in some form, then a
retrospective cohort study would be another possibility.
If cases are
still occurring a prospective cohort study might be initiated, but the
better an idea the investigators have about which exposures to assess, the
more they should intervene to minimize the occurrence of additional cases.
h.
14.
a.
No pregnancies
Cases
Controls
Total
------------------------------50
38
88
>= 3 pregnancies
Total
167
216
383
------------------------------217
254
471
OR = (50 x 216) / (38 x 167) = 1.7 (for zero vs. >= 3 pregnancies)
Interpretation: having never been pregnant was associated with an
increased breast cancer rate, with an apparent 70% greater rate
among nulligravidae (women who have never been pregnant).
Other choices of a reference level produce the same result, e.g.,
1-2 pregnancies as the reference level:
OR = (50 x 102) / (38 x 82) = 1.6.
If both groups, 1-2 pregnancies and 3+ pregnancies are combined
and used as the reference group, then:
OR = (50 x 318) / (38 x 249) = 1.7
b.
> 165 cm
< 160 cm
Total
----------------------------------148
183
331
Not breastfed
Total
41
25
66
---------------------------------189
208
397
Breastfed
241
Not breastfed
305
546
Total
58
51
109
---------------------------------299
356
655
D.
16.
a.
(RR - 1) / RR
(1.45 - 1) / 1.45
0.45/1.45
0.31
If know the formula (or can derive it from the diagram and the
"grand synthesis"):
PARP
P(E|D) (RR-1)
--------------RR
Premenopausal:
(117)
----------- (1.47-1)
(117+112)
----------------------1.47
(0.51) (0.47)
--------------1.47
0.16
AND
(58)
-------------- (1.45-1)
(58+241)
Postmenopausal: -------------------------
(0.19) (0.45)
--------------- = 0.06
1.45
1.45 Meaning: In women who wre not breastfed, some 16% of premenopausal
breast cancer and some 6% of postmenopausal breast cancer were attributable
to their having not been breastfed.
=
c.
17.
Assumptions:
a. True - The odds of breast cancer vary as the product of the odds
for age and the odds for education.
b. False - Only in a few special cases will the product of two odds
equal their sum (e.g., both odds equal zero or both odds equal two). The
logistic model is additive in the logit (logarithm of odds), multiplicative
in the odds.
c. False - One of the reasons for using mathematical modeling is that
the risk factors (exposures and potential confounders) ARE associated
(i.e., not independently distributed)
d.
18.
C.
19.
20.
Cases
Controls
OR
AGE < 60
AGE > 60
TOTAL
---------------------------------------------------Breast Bottle
Breast Bottle
Breast Bottle
------ ----------- ----------- -----24
40
256
100
280
140
79
86
204
54
280
140
---------------------------------------------------0.653
0.678
1.0
a.
b.
the 219 premenopausal women who were NOT breastfed as infants grew up with
"high" sunlight exposure. Based on this fact and the partially-completed
tables below,
(a) calculate the odds ratio of breast cancer with respect to breastmilk
exposure within each of the two sunlight exposure strata, and
(b) briefly describe the relationship of the sunlight exposure variable to
the association between breast cancer and breastmilk exposure (i.e. in
relation to confounding and effect modification. (4 pts)
22.
High Sunlight
Breastfed Yes
Breastfed No
Total
Cases
44
81
125
Controls
24
*42
66
Total
68
123
191
Low Sunlight
Breastfed Yes
Breastfed No
Total
Cases
67
36
103
Controls
*120
*61
181
Total
187
97
284
Breast
Bottle
Total
a)
0 pregnancies
Cases Controls
34
35
16
3
50
38
1-2 pregnancies
Cases Controls
71
90
11
12
82
102
3 pregnancies
Cases Controls
136
180
31
36
167
216
b)
Region A
Cases Population Rate/1000
< High School Education
Age
40-50
10
7,000
1.4
51-60
15
10,000
1.5
61-65
30
3,000
10
Total
55
20,000
5.0
2.5
8.0
Region B
Population Rate/1000
10
20
600
15,000
5,000
55,000
630
75,000
6
10
4
2,000
15,000
1,000
14
3,500
20
18,000
69
23,500
650
93,000
Crude
a.
Cases
0.7
4.0
10.9
3.0
0.7
4.0
2.9
the
2.
on
(3 pts)
3.
this
4.
example
Active surveillance
Ongoing crossectional survey
Passive surveillance
Follow up study of dynamic population
Choose
5.
of
apply
This study determined exposure and outcomes using data from "a list
all members of the agricultural community who were certified to
restricted-use pesticides in 1991" (p. 394-methods) and from "all
inthrough
wedlock live births recorded in the state for the years 1989
1992" (p. 394-methods). Briefly assess the strength of these data
sources in establishing the temporal sequence of pesticide exposure
and birth defects and provide support for your assessment. (4 pts)
6.
it
rate
proportion
ratio
rate
proportion
ratio
rate
proportion
ratio
rate
proportion
ratio
one
than
B.
did
C.
D.
than
9.
is
what
pts)
___________________________________________________________________
10.
birth
birth
11.
ratio
birth
and
have
musculoskeletal
12.
have
in
smaller
of
____ ____
birth
the
13.
Table 4 shows the frequency per 1000 births of major anomalies for
the
ecologic study
prospective cohort study
retrospective cohort study
region-specific case control study
need
causal
This
15.
defects
Suppose that after this publication came out, another study was
conducted in Illinois to investigate the hypothesis that birth
occurred more often in Illinois
However,
as compared to Minnesota.
in this new study the authors thought that the type of water
consumed
Appliers
Normal
anomalies
Water Type
rate*
Well water only
____
City water only
____
Bottled water only
____
Total
____
With anomalies
Illinois Pesticide
Normal
With
(#)
(#)
rate*
(#)
(#)
3379
93
26.8
100
874
27
30.0
200
206
23.7
7293
145
4456
125
28.0
7593
153
calculate the crude rate and the water-type specific rates for
Illinois. Briefly describe how these two states compare in
rates of birth anomalies. (4 pts)
b.
calculate
the
the
17.
Which of the following statements about the present study are (is)
TRUE and which are (is) FALSE. Indicate TRUE or FALSE for each
statement. (2 pts each)
TRUE FALSE
____ ____
____ ____
____ ____
is
____ ____
____ ____
variables
____ ____
____ ____
exposure.
[question #18 has been removed, 10/7/97]
19.
in
3.
with
C.
basic
with a view to
were then
derives from an
Passive surveillance
(or
=
=
births
Meaning:
by
RR)
or
27%
ARP
(OR - 1) / OR
=
AR / P1
Meaning:
all
(1.37 - 1) / 1.37
(using OR for
0.270
27%
10B.
births
So PAR = AR x Pe
births.
0.02425
0.0072466 x 0.02425
=
=
2.4% of live
0.0655 = 0.000176
1.8 per 10,000 live
or PCrude - P0
10,000
Meaning:
0.020217 - 0.02004
0.000177
1.8 /
general
ARP
Or, PARP
Pe|d (OR-1) / OR
= Pe x ARP
0.008905
from part a.
Meaning: Approximately 1% of all Minnesota live births with
anomalies
are attributable to pesticide exposure in pesticide appliers.
are
fewer
(Note:
primarily due to the fact that the OR of 1.37 has been rounded to
significant digits than are the prevalences computed above.
11.
OR = 1.04 (Derivation:
"Corrected" cases in exposed
Proportion in exposed = 96 /
"Corrected" cases in control
Proportion in control = 3697
0.0211 / 0.0202 = 1.04
14.
and
causal
this
which
one in
which the denominators equal the sum of these numbers plus the
numbers of
births with anomalies. In addition, full credit is given if the
rates
for Minnesota were recomputed.
Here is the version in which the
stated
rates were used and the # of live births column was treated as if it
meant "Total live births":
Birth anomaly prevalences for Illinois, by water type:
Well water: 2/100 = 20.0 per 1000 live births
City water: 6/200 = 30.0 per 1000 live births
Bottled water: 145/7293 = 19.9 per 1000 live births
Overall (crude): 153/7593 = 20.2 per 1000 live births
Thus, the crude prevalence is higher in Minnesota than in Illinois.
Number of live births (both states combined)
-------------------------------------------Well water
3479
City water
1074
Bottled water
7499
Total
12,052
Standardized prevalence for MN:
3479 x 26.8 + 1074 x 30.0 + 7499 x 23.7
---------------------------------------12,052 x 1000
12,052 x 1000
The standardized prevalence for Minnesota also exceeds that for
Illinois, though by a smaller amount than the difference in the
crude
to
of
False
True
False
False
True - (however, a correlation coefficient indicates the extent
association in the sense of two variables moving in tandem; it
does
sense
change
18.
19.
Points in favor of action at this time are the evidence that the
relationship is causal (biological plausibility, consistency
between
183,721
studies
Against taking action other than continuing research are that the
evidence is still not very strong (biological mechanisms not yet
elucidated, relationship is not highly specific, epidemiologic
limited and not entirely consistent, experimental evidence not
available), the potential impact on agriculture and therefore food
prices is considerable, and the costs to industry and commerce from
restrictions on a major product are substantial. Moreover, the
relative weakness of the odds ratios (below 2.0) indicates a
significant possibility that other factors could be responsible for
the increase in birth anomaly prevalence seen in association with
pesticide exposure, a possibility whose investigation requires
better
your
disease
than
bowel
than
with
A. Manifestational criteria
B. Causal criteria
C. Both manifestational and causal criteria
D. Neither
3. Medical records were used to validate the hospital diagnoses of
Crohn's
disease and ulcerative colitis. By using this validation process
instead of relying on hospital discharges coding alone, the authors
are
reducing which of the following sources of error? (Choose one best
answer) (3 pts)
A.
Selection bias
B.
Prevalence-incidence bias
C.
Information bias
D.
Surveillance bias
-2-
the
population of cases.
5.
past
would
best
(Choose one
answer) (3 pts)
A. Controls underreported sucrose intake but cases did not.
B. Cases underreported sucrose intake but controls did not.
C. Both cases and controls underreported sucrose intake.
D. Both cases and controls overreport sucrose intake.
6.
were
the
7.
and
_
9.
with
to
12.
the
Briefly
not
explain why based on these data the authors state that smoking did
confound these associations. (3 pts)
13.
Low
Controls
Cases
Controls
Cases
12
10
14
150
100
135
28
Fast foods
1+ times/wk
None
(3 pts)
risk
-414.
in
In the discussion (page 50), the authors state that if the change
diet is the same in cases as in controls, then the relative risk
estimates would be biased toward unity. This is an example of
which of
the following?
A.
B.
C.
D.
15.
This articles does not present p-values yet reports 95% confidence
intervals for all odds ratios. Which of the following best
describes
what information a confidence interval conveys that a p-value does
not.
(Choose one best answer) (3 pts)
context
the
point estimate.
power
point
16.
17.
disease.
18.
(per
the
Swedish population.
2/week
None
18.0
9.1
6.8
3.7
data?
a. Which model for the joint effect of these two food items, the
additive model or the multiplicative model, better fits the
Your answer should give the formula for each model and show how
to
(5 pts)
1. 20% of the heavy coffee drinkers ( 3 cups per day) among cases
only decaffeinated coffee.
20.
model
pts)
A. Age
B. Gender
C. Total energy intake
D. Ulcerative colitis
21. In the multiple logistic model that yielded the relative risk
estimate
of 0.7 for Ulcerative colitis in relation to daily vegetable
consumption
(Table 4), what was the value of the coefficient for the vegetable
consumption variable assuming that it was coded as 1=daily, 0=less
frequently? Write the conversion equation of coefficient to
relative
risk estimate. (3 pts)
22. Assume that the population of Stockholm County in the age range
covered
by this study was 1,000,000 in 1980 and remained constant
throughout the
decade. What was the average annual incidence of hospitaldiagnosed
Crohn's disease during that period regardless of when their medical
record became available? (3 pts)
23. Using the data in Table 2, for which of the following two
associations
is there more of an indication of confounding by age and total
energy
intake in WOMEN? Support your answer with relevant data and/or
computations. (3 pts)
a. Crohn's disease and sucrose intake (highest versus lowest level)
level)
24.
study
_
Briefly state one major strength and one major limitation of this
(2 pts)
-6-
25. List two Bradford Hill criteria for evaluating whether dietary
sucrose
intake is causally related to inflammatory bowel disease. Evaluate
each
using specific facts from the article. (4 pts)
26.
are
TRUE and which are FALSE (answer TRUE or FALSE for each statement).
(2
pts each)
a. In women, the rate of (hospitalized) ulcerative colitis was
higher
involved
case
recall over longer periods of time, on the average, than was the
for the ulcerative colitis cases.
d. The proportion of controls with high dietary fat intake was
higher
27.
twin
friend,
Your
Your friend comes to visit with you over the holidays, and while
sleeping late one morning she comes across your class notes from
168.
"Suppose that fast foods, soft drinks, whole grain bread, and
type cereal affect Crohn's disease risk independently, and that I
ignore other risk factors. Suppose also that the excess risks are
additive. Is my twin sister's risk of Crohn's disease 10 times my
own?"
that
She shows you how she used the information in Table 3 to obtain
estimate:
(3.4 - 1) + (2.8 - 1) + ((1/0.4) - 1) + ((1/0.2) - 1) + 1 = 10.7
She goes on to explain "(3.4 -1) is the excess risk from fast
foods, and
((1/0.4) - 1) is the excess risk from eating bread that is not
whole
grain."
Even though you're not quite fully awake, you feel justifiable
pride in
big
of
Answer Guide
1. A. To test the hypothesis that persons with inflammatory bowel
disease
are more likely to have been exposed to certain dietary factors
than
those without inflammatory bowel disease.
2.
A. Manifestational criteria
3.
C. Information bias
4.
from
5.
9.
D. Ratio (The response scale for each item was ordinal, but in order
to
create the total energy variable the authors had to convert each
response into calories.)
B. Matching plus mathematical modeling.
11.
The odds ratios for 80 to 104 grams per day was 1.4 and for intakes
of
10.
greater than 105 grams per day the odds ratio was 1.3. This
suggests a
tendency for cases to have a greater proportion of high fat eaters
than
controls.
low
a. The crude (with respect to smoking) and adjusted odds ratios are
same.
between
the
the
values in Table 2.
13.
2.7
a. Odds ratios:
ratio
which
disease
Crohn's disease risk] (i.e., lower fast food intake and education
as independent main effects to influence Crohn's disease risk).
A. Nondifferential misclassification bias
B. A confidence interval provides information on the precision of
point estimate.
There appears to be a strong protective effect of daily consumption
Muesli-type breakfast cereals and Crohn's disease (odds ratio = 0.2
The authors state that sucrose and fiber intake could be associated
18.
two
a. Under the additive model, we expect the joint excess rate of the
one another as well as with Crohn's disease and thus each factor
might be
a confounder of the associations between Crohn's disease and the
other
("mutual confounding"). The odds ratio was 2.6 for a high sucrose
intake
(bottom page 48). When adjusted for fiber the sucrose odds ratio
changed
only slightly to 2.5. Therefore, fiber was a only a slight modifier
of
the sucrose and Crohn's disease relationship.
factors will be equal to the sum of the excess rate from each
factor
separately.
rates:
and =2
fast foods per week = rate (daily soft drinks, without fast food)
rate (less freq. soft drink, =2 fast food per week) - rate
(neither).
(absent),
equation
9.1
18.0.
the
This
Therefore, the additive model does not explain the full amount of
observed joint risk.
the
each
expressed
numbers
is
The close agreement for the observed joint rate and that
with
model.
who
posting
19.
for
group.
Under this model the odds ratio for =3 cups caffeinated coffee,
relative
to none or only decaffeinated = (56 x 201) / (50 x 18) = 12.5
20.
21.
1.2
a.
b.
c.
d.
F
F
T
F