Escolar Documentos
Profissional Documentos
Cultura Documentos
Developing symptom-based
predictive models of endometriosis
as a clinical screening tool: results
from a multicenter study
Kelechi E. Nnoaham, M.D.,a Lone Hummelshoj,b Stephen H. Kennedy, M.R.C.O.G.,c Crispin Jenkinson, D.Phil.,d
and Krina T. Zondervan, D.Phil.,d,e on behalf of the World Endometriosis Research Foundation Women's
Health Symptom Survey Consortium
a
Department of Public Health, University of Oxford, Oxford; b World Endometriosis Research Foundation, London;
Nufeld Department of Obstetrics & Gynaecology, University of Oxford, Oxford; d Health Services Research Unit,
University of Oxford, Oxford; and e Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United
Kingdom
c
Objective: To generate and validate symptom-based models to predict endometriosis among symptomatic women prior to undergoing
their rst laparoscopy.
Design: Prospective, observational, two-phase study, in which women completed a 25-item questionnaire prior to surgery.
Setting: Nineteen hospitals in 13 countries.
Patient(s): Symptomatic women (n 1,396) scheduled for laparoscopy without a previous surgical diagnosis of endometriosis.
Intervention(s): None.
Main Outcome Measure(s): Sensitivity and specicity of endometriosis diagnosis predicted by symptoms and patient characteristics
from optimal models developed using multiple logistic regression analyses in one data set (phase I), and independently validated in
a second data set (phase II) by receiver operating characteristic (ROC) curve analysis.
Result(s): Three hundred sixty (46.7%) women in phase I and 364 (58.2%) in phase II were diagnosed with endometriosis at
laparoscopy. Menstrual dyschezia (pain on opening bowels) and a history of benign ovarian cysts most strongly predicted both any
and stage III and IV endometriosis in both phases. Prediction of any-stage endometriosis, although improved by ultrasound scan
evidence of cyst/nodules, was relatively poor (area under the curve [AUC] 68.3). Stage III and IV disease was predicted with good
accuracy (AUC 84.9, sensitivity of 82.3% and specicity 75.8% at an optimal cut-off of 0.24).
Conclusion(s): Our symptom-based models predict any-stage endometriosis relatively poorly and stage III and IV disease with good
accuracy. Predictive tools based on such models could help to prioritize women for surgical
investigation in clinical practice and thus contribute to reducing time to diagnosis. We invite
Use your smartphone
other researchers to validate the key models in additional populations. (Fertil Steril
to scan this QR code
2012;98:692701. 2012 by American Society for Reproductive Medicine.)
and connect to the
Key Words: Endometriosis, predictive model, logistic regression
Discuss: You can discuss this article with its authors and with other ASRM members at http://
fertstertforum.com/nnoahamk-symptom-based-predictive-models-endometriosis/
Received February 18, 2012; revised April 2, 2012; accepted April 13, 2012; published online May 30,
2012.
K.E.N. has nothing to disclose. L.H. has nothing to disclose. S.H.K. has nothing to disclose. C.J. has
nothing to disclose. K.T.Z. has nothing to disclose.
Supported in part by a grant to the World Endometriosis Research Foundation (WERF) from TAP
Pharmaceuticals (now Abbott Endocrine) (L.H., K.T.Z., and S.H.K.). K.T.Z. received payment for
translation of the study questionnaire into Dutch. L.H. and S.H.K. have been consultants for Abbott.
Reprint requests: Kelechi E. Nnoaham, M.D., Directorate of Public Health, NHS Berkshire West, 57-59
Bath Road, Reading RG30 2BA, United Kingdom (E-mail: nnoaham.kelechi@berkshire.nhs.uk).
Fertility and Sterility Vol. 98, No. 3, September 2012 0015-0282
Copyright 2012 American Society for Reproductive Medicine, Published by Elsevier Inc.
Open access under CC BY license. doi:10.1016/j.fertnstert.2012.04.022
692
the drop in Nagelkerke's R2 (assessing the disease variance explained by the variables in the model) when removing variables
was not to exceed 10%. Variables from each of the submodels
were then included in the complete models, which were again
reduced iteratively by considering [1] the Hosmer and Lemeshow test for goodness of t, [2] the drop in Nagelkerke's
R2 when removing variables, and [3] the signicance of the association of each variable in the model with endometriosis
(dropping variables from P>.5 down to P>.2). This resulted
in best-tting nal models for the prediction of any-stage
and stage III and IV endometriosis based on the phase I data.
For each of the two diagnostic outcomes, a model including
and excluding preoperative ultrasound evidence of cysts/nodules was generated (Supplemental Table 1). Final models were
subsequently tted to phase II data to assess their predictive
performance (see model validation below). In addition, reduced
models were generated, which excluded variables in the besttting nal models that had opposite directions of effect in
phase I and II data sets. These reduced models were also
assessed for performance; thus, a total of eight model types
were generated and assessed (Supplemental Table 1).
Model validation. To illustrate the performance of the derived
predictive models in the phase I population data, and the
relative drop in performance when externally validating in
phase II data, the accuracy of the models in predicting anystage and stage III and IV endometriosis was assessed by
analyzing the receiver operating characteristic (ROC) curve
in both phase I and II. The ROC curve displays the relationship
FIGURE 1
694
TABLE 1
Association results and regression coefcients from best tting nal full models for stage III and IV endometriosis excluding (model 5) and including (model 7) ultrasound evidence in phase I, and when
subsequently applied to phase II.
Phase I
Model
Stage III and IV no ultrasound
(model 5)
Variable
Surgery/consultation for ovarian cyst*
OR (95% CI)
2.61 (1.355.06)
P value
.004
Phase II
b
0.961
OR (95% CI)
2.65 (1.444.86)
695
.002
0.973
.16
.99
.057
.28
.006
.84
.011
.78
.13
.06
0.418
0.002
0.257
0.515
1.129
0.040
0.434
0.043
0.222
0.135
.83
.51
.29
.028
.21
0.044
0.147
0.161
0.429
0.453
< .001
.33
.71
1.336
0.577
0.150
.81
0.150
.004
.001
.59
1.039
1.441
0.224
< .001
2.415
0.63 (0.470.83)
1.80 (0.943.46)
2.58 (1.275.26)
2.06 (0.745.78)
.001
.076
.009
.17
0.465
0.589
0.949
0.725
0.79 (0.601.06)
1.41 (0.802.46)
3.38 (1.537.46)
1.95 (0.636.04)
.11
.23
.003
.25
0.232
0.341
1.217
0.667
3.43 (1.249.44)
0.43 (0.131.40)
0.71 (0.530.94)
0.59 (0.380.92)
0.28 (0.120.67)
.017
.16
.015
.019
.004
1.231
0.843
0.350
0.530
1.270
2.43 (0.767.81)
0.47 (0.131.76)
0.81 (0.641.03)
0.85 (0.591.24)
0.74 (0.381.47)
.14
.26
.079
.41
.40
0.887
0.752
0.210
0.159
0.296
2.82 (1.565.09)
0.39 (0.101.49)
.001
.17
1.037
0.941
3.96 (2.247.00)
0.54 (0.171.71)
< .001
.29
1.376
0.620
P value
696
0.720
1.959
0.325
.049
< .001
.47
RESULTS
Description of the Predictive Models
* Variables retained in the reduced models. Variables in the best-tting nal models that had opposite directions of effect in phase I and II data sets were dropped in the reduced models. IBS irritable bowel syndrome.
a
Included as a linear term: OR represents unit change from light to moderate to heavy.
b
Included as a linear term: OR represents unit change from <21 days, 2224 days, 2528 days, 2932 days, 3335 days, to 36 days.
c
Included as a linear term: OR represents unit change from never, occasionally (1 every 3), often (2 every 3), to always.
d
Included as a linear term: OR represents unit change from never/rarely, sometimes, often, most of the time, to always.
e
Included as a linear term: OR represents unit change from 0 to 10.
f
Included as a linear term: OR represents unit change from not at all, less than 1 time in 5, less than half the time, about half the time, more than half the time, to almost always.
g
Included as a linear term: OR represents unit change from none, 1 time, 2 times, 3 times, 4 times, to 5 times.
h
Included as a linear term: OR represents unit change from 0, 1, 2, to 3.
i
Ethnicity is included as a categorical term, with largest group (white) as reference.
0.299
0.640
0.774
.69
.037
.26
P value
P value
Model
Continued.
TABLE 1
Variable
OR (95% CI)
Phase I
OR (95% CI)
Phase II
1.09
0.25
1.16
0.30
3.39
0.23
7.62
0.26
4.63
0.34
3.08
0.21
1.04
0.63
1.05
0.77
5.39
0.47
4.7
0.31
1.5
0.35
3.61
0.33
Note: LR logistic regression.
a
Binomial exact.
42.6
36.2
79.3
87.6
78.9
85.5
58.7
79.8
78.7
73.1
76.4
49.7
Phase II
Phase I
56.4 (52.8
59.9)
94.4 (90.0
97.3)
18.4 (15.3
21.7)
0.262
84.9 (81.4
88.0)
82.26 (76.0
87.5)
75.76 (70.5
80.5)
0.241
Phase II
Phase I
90.8 (88.1
93.0)
76.71 (69.0
83.3)
89.93 (86.6
92.6)
0.373
83.3 (79.6
86.6)
70.9 (63.5
77.5)
84.70 (79.9
88.7)
0.452
Phase II
Phase I
87.3 (84.3
89.8)
85.11 (78.1
90.5)
72.33 (68.0
76.4)
0.172
51.8 (47.8
55.8)
93.7 (90.7
96.0)
9.96 (6.6
14.3)
0.592
Phase II
Phase I
Phase II
52.0 (48.4
55.5)
86.7 (82.7
90.0)
17.3 (13.7
21.3)
0.479
87.3 (84.2
90.0)
74.3 (68.5
79.6)
84.2 (79.5
88.1)
0.473
Phase I
Phase II
68.3 (63.9
72.4)
84.8 (80.1
88.7)
43.5 (36.1
51.1)
0.302
Phase I
84.2 (81.1
87.0)
74.0 (68.5
79.0)
79.5 (74.8
83.6)
0.471
Model parameter
80.0 (75.6
83.3)
58.1 (52.1
63.9)
89.2 (83.5
93.5)
0.795
TABLE 2
Diagnostic value of the nal full models for prediction of any-stage and stage III and IV endometriosis in the phase I study and phase II validation populations.
Ultrasound
scan only
53.9 (49.9
57.9)
97.4 (94.4
99.0)
10.5 (7.6
14.0)
0.389
697
FIGURE 2
Receiver operating characteristic curves for the full models predicting any-stage and stage III or IV endometriosis.
Nnoaham. Predicting endometriosis from symptoms. Fertil Steril 2012.
DISCUSSION
In this multicenter study of symptomatic premenopausal
women presenting with symptoms that were potentially
indicative of endometriosis, we show that a combination of
symptom characteristics and variables in the medical history,
with or without ultrasound evidence of cysts/nodules, can
predict the nding of stage III and IV endometriosis at laparoscopy with reasonably good accuracy. The best-tting predictive model included, along with ultrasound evidence,
menstrual dyschezia, ethnicity, and a history of benign ovarian cysts as the variables with the strongest predictive performance. These variables are mostly disease risk factors
reported in previous studies. Specically, menstrual dyschezia is strongly associated with deep inltrating endometriosis,
698
As far as we know, this is the rst study to use robust modeling techniques for model generation, followed by external
validation to generate and validate symptom-based predictive
models of endometriosis in a large prospectively recruited cohort of women across different countries and ethnicities. Previous attempts, focused on subtypes of endometriosis (13, 14),
have been hampered by small sample sizes (11), and failed to
validate models in populations independent to those from
which model parameters were generated (34). The
enrollment of women from diverse backgrounds according
to a uniform set of criteria potentially addresses issues with
the global utility of clinical prediction of endometriosis
arising from inconsistencies in disease denition across
studies and population. We invite other research groups to
validate the key models in this paper in additional
populations, as well as in subgroups that may be of specic
clinical or population-based interest (e.g., those women who
had infertility as the only surgical indication, who had
biopsy-proven disease, or who were of a particular ethnicity).
Although the WHSS was designed to improve on the
limitations of earlier studies, it had itself potential limitations.
First, endometriosis was diagnosed visually, without histologic conrmation, although this followed the European
Society of Human Reproduction and Embryology guideline
(23), based on the premise that negative histology does not
exclude the presence of disease. Consequently, disease status
may have been inappropriately assigned; however, participating hospitals were experienced in diagnosing endometriosis.
In a separate diagnostic validation study, 29 surgeons from
the participating centers viewed nine standardized videos to
allow, in a blinded manner, the assessment of consistency
in diagnosis and staging of disease. Preliminary analysis suggested substantial inter-rater agreement in disease identication and staging (both Fleiss k > 0.60; C. Becker and K. May,
unpublished data). Second, the generation of the models with
reduced numbers of variables was based on both phase I and
phase II data. More parsimonious models are always preferable (38); however, because they are partly based on phase II
evidence, they would require additional external validation.
Third, although a strength of the study was that results were
generated using data from a wide variety of clinical centers
worldwide covering a range of patient proles, this meant
that the results may have been affected to some extent by
selection bias possibly arising from [1] differential frequency
of concomitant pathologies, in particular the higher proportion of women with nonendometriotic adhesions amongst
controls compared to cases (32.4% vs. 14.7% in phase I), and
[2] the signicant variations across centers in proportions of
cases in the sample populations. This is another reason why
we call for further independent validation of the models in
additional clinical populations, which are likely to each
have their own unique patient population.
In conclusion, the diagnostic delay, high investigation
costs, and personal suffering associated with endometriosis
might be reduced by access to a screening tool that predicts
endometriosis with good accuracy in women presenting in
a clinical setting. Although prediction of any-stage endometriosis is relatively poor, the symptom-based models developed and validated in this study predict stage III and IV
699
REFERENCES
1.
2.
3.
4.
5.
6.
7.
700
31.
32.
33.
34.
35.
36.
37.
38.
701
SUPPLEMENTAL FIGURE 1
701.e1
SUPPLEMENTAL TABLE 1
Model types generated based on phase I data alone (full models) and on additional association information from phase II data (reduced
models).
Model name
Diagnosis
Any-stage endometriosis
Stage III or IV endometriosis
Full model
Reduced model
701.e2
SUPPLEMENTAL TABLE 2
Characteristics of the 1,396 women in the Women's Health Symptom Survey, phases I and II.
Phase I (N [ 771)
Age (y), mean SD
Ultrasound scan
Suggested endometrioma
Suggested nodule
Pathology observed at surgery
Any-stage endometriosis
ASRM stage
I
II
III
IV
Bladder nodule
Cul-de-sac nodule
Bowel nodule
Fibroids
Adhesions
Nonendometriotic ovarian cyst
Phase II (N [ 625)
Cases (n [ 360)
Controls (n [ 411)
Cases (n [ 364)
Controls (n [ 261)
31.2 6.0
312 (86.7)
108 (34.6)
17 (5.4)
360 (100.0)
360 (100.0)
31.0 6.7
340 (82.7)
12 (3.5)
1 (0.3)
298 (72.5)
0 (0.0)
32.4 6.3
341 (93.7)
165 (48.4)
35 (10.3)
364 (100.0)
364 (100.0)
32.2 6.2
235 (90.0)
6 (2.6)
11 (4.7)
188 (72.0)
0 (0.0)
118 (32.8)
59 (16.4)
89 (24.7)
91 (25.3)
16 (4.4)
46 (12.8)
25 (6.9)
58 (16.1)
53 (14.7)
44 (12.2)
0 (0.0)
0 (0.0)
0 (0.0)
0 (0.0)
0 (0.0)
0 (0.0)
0 (0.0)
62 (15.1)
133 (32.4)
91 (22.1)
91 (25.0)
39 (10.7)
101 (27.7)
128 (35.2)
12 (3.3)
61 (16.8)
18 (4.9)
77 (21.2)
77 (21.2)
39 (10.7)
0 (0.0)
0 (0.0)
0 (0.0)
0 (0.0)
0 (0.0)
0 (0.0)
0 (0.0)
57 (21.8)
65 (24.9)
71 (27.2)
701.e3
SUPPLEMENTAL TABLE 3
Association results and regression coefcients from best-tting nal full models for any-stage endometriosis excluding (model 1) and including (model 3) ultrasound evidence in phase I, and when
subsequently applied to phase II.
Phase I
Model
Any-stage no ultrasound (model 1)
Variable
OR (95% CI)
P value
Phase II
b
OR (95% CI)
P value
0.679
0.427
0.026
0.076
0.112
1.24
0.632
0.471
0.202
0.056
0.134
0.112
0.618
1.22
0.501
0.209
0.009
1.42
0.182
0220
0.568
0.273
0.314
1.74
0.267
0.018
4.806
0.503
0.338
0.165
1.137
1.035
0.144
0.196
0.199
701.e4
Continued.
Phase I
Model
Variable
OR (95% CI)
P value
Phase II
b
OR (95% CI)
P value
b
0.174
0.621
0.507
0.021
0.264
1.072
0.148
1.863
0.851
0.432
0.045
2.064
0.396
0.071
701.e5
SUPPLEMENTAL TABLE 3