External validation of a dynamic prediction model for repeated predictions of natural conception over time

Human Reproduction · 2018 · vol. 33(12) , pp. 2268–2275 · doi:10.1093/humrep/dey317 · PMID:30358841 · W2898446336
article OA: green CC0 ⤵ 1 in-corpus citation
AI-generated summary by claude@2026-06, 2026-06-08

A dynamic prediction model for natural conception was externally validated in a Scottish cohort, showing good calibration and moderate discrimination for predicting pregnancy chances over time.

One-sentence paraphrase of the abstract; not a substitute for reading it. No clinical advice. How this works

AI-generated deep summary by claude@2026-06, 2026-06-09 · read from full text

This study externally validated a previously developed dynamic prediction model for repeated predictions of natural conception in a Scottish registry cohort of couples with unexplained subfertility who registered at a fertility clinic between 1998 and 2011, using record-linkage to obtain follow-up and an endpoint defined as time to natural conception leading to ongoing pregnancy (≥12 weeks). The model incorporated clinical factors (e.g., female age, duration of subfertility, sperm motility, primary/secondary subfertility, and referral source) and updated predictions based on the number of menstrual cycles elapsed since completion of the fertility workup, evaluating calibration, discrimination, and clinical utility at multiple time points. It showed good calibration with moderate discrimination (c ~0.60–0.64) and a sufficiently wide range of predicted probabilities to separate good versus poor prognosis groups, but slightly overestimated conception chances by about 2–3 percentage points, attributed to missing exact workup completion dates that were estimated. This paper does not explicitly discuss endometriosis or adenomyosis; it was included in the corpus via a keyword match in the upstream search index.

Read from the paper's body, not the abstract. Not a substitute for reading the paper. No clinical advice. How this works

Abstract

STUDY QUESTION: How well does a previously developed dynamic prediction model perform in an external, geographical validation in terms of predicting the chances of natural conception at various points in time? SUMMARY ANSWER: The dynamic prediction model performs well in an external validation on a Scottish cohort. WHAT IS KNOWN ALREADY: Prediction models provide information that can aid evidence-based management of unexplained subfertile couples. We developed a dynamic prediction model for natural conception (van Eekelen model) that is able to update predictions of natural conception when couples return to their clinician after a period of unsuccessful expectant management. It is not known how well this model performs in an external population. STUDY DESIGN, SIZE, DURATION: A record-linked registry study including the long-term follow-up of all couples who were considered unexplained subfertile following a fertility workup at a Scottish fertility clinic between 1998 and 2011. Couples with anovulation, uni/bilateral tubal occlusion, mild/severe endometriosis or impaired semen quality according to World Health Organization criteria were excluded. PARTICIPANTS/MATERIALS, SETTING, METHODS: The endpoint was time to natural conception, leading to an ongoing pregnancy (defined as reaching a gestational age of at least 12 weeks). Follow-up was censored at the start of treatment, at the change of partner or at the end of study (31 March 2012). The performance of the van Eekelen model was evaluated in terms of calibration and discrimination at various points in time. Additionally, we assessed the clinical utility of the model in terms of the range of the calculated predictions. MAIN RESULTS AND THE ROLE OF CHANCE: Of a total of 1203 couples with a median follow-up of 1 year and 3 months after the fertility workup, 398 (33%) couples conceived naturally leading to an ongoing pregnancy. Using the dynamic prediction model, the mean probability of natural conception over the course of the first year after the fertility workup was estimated at 25% (observed: 23%). After 0.5, 1 and 1.5 years of expectant management after the completion of the fertility workup, the average probability of conceiving naturally over the next year was estimated at 18% (observed: 15%), 14% (observed: 14%) and 12% (observed: 12%). Calibration plots showed good agreement between predicted chances and the observed fraction of ongoing pregnancy within risk groups. Discrimination was moderate with c statistics similar to those in the internal validation, ranging from 0.60 to 0.64. The range of predicted chances was sufficiently wide to distinguish between couples having a good and poor prognosis with a minimum of zero at all times and a maximum of 55% over the first year after the workup, which decreased to maxima of 43% after 0.5 years, 34% after 1 year and 29% after 1.5 years after the fertility workup. LIMITATIONS, REASONS FOR CAUTION: The model slightly overestimated the chances of conception by ~2-3% points on group level in the first-year post-fertility workup and after 0.5 years of expectant management, respectively. This is likely attributable to the fact that the exact dates of completion of the fertility workup for couples were missing and had to be estimated. WIDER IMPLICATIONS OF THE FINDINGS: The van Eekelen model is a valid and robust tool that is ready to use in clinical practice to counsel couples with unexplained subfertility on their individualized chances of natural conception at various points in time, notably when couples return to the clinic after a period of unsuccessful expectant management. STUDY FUNDING/COMPETING INTEREST(S): This work was supported by a Chief Scientist Office postdoctoral training fellowship in health services research and health of the public research (ref PDF/12/06). There are no conflicts of interest.
Full text 39,367 characters · extracted from oa-pdf · 10 sections · click to expand

Introduction

Approximately 10% of all couples who wish to have a child do not con- ceive within the first year of trying ( Gnoth et al., 2003 ; Wang et al., 2003). For approximately half of these couples, no clear barrier for conception can be found during the workup and these couples are considered unexplained subfertile ( Aboulghar et al., 2009 ; Brandes et al., 2010 ). It is unclear whether these couples should start with ART; first, since observational studies report that 18 –38% of unex- plained subfertile couples will conceive naturally in the year after the fertility workup ( Hunault et al., 2004 ; van der Steeg et al., 2007 ; van Eekelen et al., 2017a ) and second, since there remains uncertainty regarding the effectiveness of ART for unexplained subfertile couples (Pandian et al., 2015 ; Tjon-Kon-Fat et al., 2016 ; Veltman-Verhulst et al., 2016; van Eekelen et al., 2017b ). In the absence of clear evidence on the management of unexplained subfertile couples and when to offer ART, an enticing option is to cal- culate chances of natural conception and to base counselling on this estimated prognosis ( van Eekelen et al., 2017b ). Fundamental to this approach is to identify couples that are expected to bene fit from treat- ment and those who are not. In clinical practice, this would imply that couples with a good prognosis to conceive naturally are advised to continue to try and become pregnant by sexual intercourse, while cou- ples with an unfavourable prognosis are advised to start ART. Several prediction models for natural conception have been published of which the model by Hunault et al., which calculates a prognosis of con- ception leading to live birth over the first year after the completion of the fertility workup, has been externally validated and subsequently implemented in the national guidelines and clinical practice in the Netherlands ( Hunault et al., 2004 ; van der Steeg et al., 2007 ; Leushuis et al., 2009; NVOG, 2010). A practical drawback of the Hunault mod- el is that it cannot give a prediction at later time points when couples who continued expectant management after the fertility workup but did not conceive, return to the clinic. This is because applying the Hunault model at later time points leads to overestimation due to the selection of less fertile couples over time that is not incorporated in the Hunault model ( van Eekelen et al., 2017b ). Van Eekelen et al. recently developed a dynamic prediction model that accommodates the need for repeated predictions ( van Eekelen et al., 2017a ). This model comprises the clinical factors female age, duration of subfertility (both at completion of the fertility workup), percentage of progressively motile sperm, primary or secondary sub- fertility and being referred to the fertility clinic by a general practitioner or a specialist. In addition to these factors, the model uses as an input the number of menstrual cycles that have passed since the completion of the fertility workup, with zero cycles denoting the prediction is made immediately after the workup. The output is the predicted prob- ability to conceive naturally in the following cycle, leading to ongoing pregnancy, which can be extended to predict over any given number of cycles with a maximum of 2.5 years after the workup (~28 –34 cycles). When couples return after a period of expectant manage- ment, the number of cycles that have passed since the workup can be changed to update the predicted probability over subsequent cycles. The model developed by van Eekelen et al. showed promising

Results

in the internal validation, but this in itself is insuf ficient to advise clinical implementation since models tend to perform better in the cohort they were developed on than in another cohort in which the model may be applied ( Steyerberg, 2009). The aim of this study was to externally validate the van Eekelen model on a large cohort that followed couples for natural conception after registration in the fertility clinic of the Grampian region of Scotland, UK. This is the largest contemporary cohort following cou- ples for natural conception, aside from the Dutch cohort on which the dynamic model was developed.

Materials and methods

We included couples diagnosed with unexplained subfertility residing in the Grampian region of Scotland who registered with the Aberdeen Fertility Centre (AFC) from 1998 to 2011 ( Pandey et al., 2014 ). Only patients from the Grampian region visiting the AFC were selected because there is no other fertility clinic in the region and it was considered import- ant to have a complete overview of a couple ’s trajectory after the fertility workup, which includes treatment information. We combined the AFC registration database with three other data sources using record-linkage to get the complete follow-up for couples from the registration at the AFC until ongoing pregnancy, treatment or end of the study, which was the 31 March 2012. The AFC database comprises patient characteristics and diagnostic information. Data entry in the AFC database is validated and checked by regular case note audits. First, we record-linked couples registered in the AFC database to the centre ’s Assisted Reproduction Unit database which contained dates when treatment was started. Second, we identi fied natural conceptions leading to an ongoing preg- nancy by record-linkage of the AFC database with the Aberdeen Maternity and Neonatal Databank, which contained gestational age, outcome and delivery date of (early) pregnancies for all women residing in Aberdeen 2269External validation of a dynamic prediction model City District. Third, we performed record-linkage with the national Scottish Morbidity Records Maternity database for identifying gestational age, outcome and delivery date of (early) pregnancies for women who delivered elsewhere in Scotland. The Data Management Team of the University of Aberdeen created a new pseudonomized identi fier for all women by using the Community Health Index identi fier. This new study-speci fic identi fier cannot be used to trace back to individuals and was then used by author D.J.M. to record- link the databases within the Grampian Data Safe Haven environment. This process was carried out according to the Standard Operating Procedures of the Data Management Team, University of Aberdeen. The resulting linked dataset was thus a combination of these four data sources. Ethical approval was provided by the North of Scotland Research Ethics Committee (reference: 12/NS/0120). Access to the Aberdeen Fertility Clinic and the Assisted Reproduction Unit databases was approved by the Aberdeen Fertility Databases Steering Committee. Access to the Aberdeen Maternity and Neonatal Databank was approved by the Aberdeen Maternity and Neonatal Database Steering C ommittee. Access to the Scottish Morbidity Records Maternity database was approved by the Privacy Advisory Committee of Information Services Division Scotland. We de fined unexplained subfertility as couples who tried to conceive for more than 50 weeks before the fertility workup was completed and who had no obvious barriers to conception in terms of uni or bilateral tubal occlusion, anovulation, mild or severe endometriosis according to the revised American Society for Reproductive Medicine (ASRM) score ( ASRM, 1997 ) or impaired semen quality according to World Health Organization (WHO) criteria ( WHO, 1999 , 2010). We used the gesta- tional age at birth or early pregnancy outcome to derive the date of con- ception and included only pregnancies in the analysis that occurred after registration of the couple at the clinic and that were ongoing, de fined as reaching a gestational age of at least 12 weeks. Time to conception was censored at the date of start of IUI, start of IVF, when the woman returned to the fertility centre with a different male partner or at the end of study. Missing data The date of completion of the fertility workup was not reported in the AFC database. The van Eekelen model uses this date as the starting point of follow-up, i.e. the time point from which onwards the model can be used to estimate a prognosis. The date of registration and the diagnosis category were available in the database. Judging from local protocols, we assumed there were 3 months in between registration and completion of the fertility workup for all couples. In a sensitivity analysis, we repeated the validation study assuming 1.5 or 4.5 months between registration and completion of the fertility workup for all couples. Menstrual cycle length is used to determine the number of elapsed men- strual cycles since the fertility workup when updating predictions using the dynamic prediction model. Cycle length was not recorded in the AFC database and we therefore assumed an average cycle length of 28 days for all women. Data on outcomes or at least one prognostic factor were missing for ~4% of couples; 0.5% on pregnancy or follow-up, 0.5% on female age, 2.3% on duration of subfertility, 0.5% on primary or secondary subfertility, 1.9% on the percentage of progressive motile sperm and 0.5% on referral status. We had no reason to believe that couples with missing data differed systematically from couples with complete data and we analysed couples for which data was complete. Analysis We calculated the predicted probabilities of natural conception over 1 year for all couples in the validation cohort using the formula in the Appendix of the paper by van Eekelen et al. ( van Eekelen et al., 2017a ). To test the model ’s ability to not only predict after the completion of the fer- tility workup but also when a couple returns after an unsuccessful period of expectant management, we calculated the prognosis at four time points: directly after completion of the workup, after 0.5, 1 and 1.5 years of expectant management. We evaluated model performance in terms of calibration, i.e. the degree of agreement between observed and predicted natural conception rates, and discrimination, i.e. the ability of the dynamic prediction model to distinguish between couples who do conceive and couples who do not conceive. To assess calibration, we first explored whether the overall prediction of the model was correct by comparing the average predicted probability over a time period with the observed conception rate over that same time period. This is referred to as calibration-in-the-large and assesses whether the model systematically under or overestimates the observed conception rate ( Steyerberg, 2009). Second, we assessed whether the effects of patient characteristics were estimated correctly in three ways: by visuals using calibration plots for risk groups, by calibration within groups with similar patient characteristics and by calculating a calibration slope. For the calibration plots, we ordered the predicted probabilities of couples and divided them in risk groups with similar predictions ( n = 135 per risk group). We compared the mean pre- dicted chances within these groups with the corresponding observed frac- tion of ongoing pregnancy as estimated by the Kaplan –Meier method. We visualized the observed fractions and predicted probabilities per risk group in plots and tabulated the absolute differences. In the plots, the 45 ° line indicates what would be a perfect agreement between the observed frac- tion and average predicted probability within a risk group. We repeated the calibration procedure but instead of grouping based on predicted risks, we grouped couples based on having similar patient characteristics. We again compared the mean predicted chances within these groups with the corresponding observed fraction of ongoing preg- nancy as estimated by the Kaplan –Meier method and tabulated the results. To calculate the calibration slope, we used the prognostic index (i.e. the sum of the multiplication between all patient characteristics and the coef fi- cients from the model) as an explanatory variable in a Cox model for each of the four evaluated time periods ( van Houwelingen, 2000 ). Ideally, the calibration slope is unity, i.e. 1, indicating that the strength of the patient characteristics in the evaluated model perfectly matches the validation data. Third, we used a recalibration procedure as an alternative way to assess the systematic under or overestimation (calibration-in-the-large) and the strength of the patient characteristics (calibration slope) in the model. We did this by using the same coef ficients for the patient characteristics as reported by van Eekelen et al. to calculate a prognostic index but re-estimated the other parameters of the beta-geometric model in the val- idation dataset ( Bongaarts, 1975; Weinberg and Gladen, 1986). The recali- bration model re-estimates three parameters, which we compared to those in the van Eekelen model and tested for the difference between the two using independent samples z-tests. Systematic under or overesti- mation was assessed by comparing the intercept and the variance para- meters. The intercept parameter indicates the estimated pregnancy chances in the first cycle after the fertility workup and the variance param- eter indicates how fast the estimated chances decrease over consecutive failed natural cycles. Similarity in strength of the patient characteristics was assessed by again calculating a calibration slope parameter, which would ideally be 1. We assessed discrimination by calculating Harrel ’s c statistic at the four time points, which we compared to those found at internal validation ( Harrell et al., 1996). Finally, we explored the range of predicted probabilities at the four time points to see if they facilitate meaningful prognostic strati fication of couples (Coppus et al., 2009). 2270 van Eekelen et al. All analyses were conducted in R version 3.4.3 and RStudio ( R Core Team, 2013). A P-value below 0.05 was considered statistically signi ficant.

Results

Data of 1203 couples were included (Fig. 1). The baseline characteris- tics of the couples are shown in Table I. In total, 398 (33%) couples conceived naturally, leading to an ongoing pregnancy. The median follow-up was 1 year and 3 months after the completion of the workup (average follow-up 2 years and 6 months). The observed rates of natural conception up to 2.5 years are depicted in Fig. 2 (upper panel). For couples who did not yet conceive after 0.5, 1 or 1.5 years after the completion of the fertility workup, the observed rates of natural conception over the following year are depicted in Fig. 2 (lower panel). The mean probability of natural con- ception as predicted by the dynamic model over the course of the first year after the fertility workup was 25%, while the observed fraction was 23% (95%CI 20 –25). For couples who did not conceive after 0.5, 1 and 1.5 years of expectant management, the mean estimated prob- ability of conceiving over the course of the following year was esti- mated at 18, 14 and 12%. The observed rates were 15% (13 –18%), 14% (11 –17%) and 12% (9 –15%) for these three time periods, respectively (Fig. 2, lower panel). Except for the second period during which the model slightly overestimated the pregnancy chances by 3% points, the mean predicted probabilities fell within their respective confidence limits of the observed rates, indicating good agreement between the average prediction rendered by the dynamic model and the corresponding observed rate of natural conception. The calibration plots for the four time periods are presented in Fig. 3. The dynamic prediction model was well calibrated based on the upward trends observed in the four plots, indicating that higher pre- dicted probabilities correspond to higher observed rates, and the CIs from the observed rates which all but one cover the ideal 45 ° line. The second calibration plot starting at 0.5 year after the fertility workup showed a slight overestimation since all points are below the 45 ° line. The absolute differences between observed fractions and predicted probabilities of natural conception within risk groups are shown in Table II. This was on average 2.8% points and 9.6 at the highest. The results for the calibration grouping couples by similar character- istics are shown in Supplementary data. Results were similar to those in the calibration using risk groups, with a slight overestimation in the time periods right after the completion of the fertility workup and after 0.5 year of expectant management. The calibration slopes using Cox models were 0.86, 1.01, 1.01 and 0.62 for the four time periods, respectively. None of the correspond- ing P-values were below 0.05, indicating no statistical evidence for under or overfitting. In the recalibration model, the intercept and variance parameters were similar to those reported by van Eekelen et al. (P = 0.69 and P = 0.29 for the difference, respectively), indicating similar underlying chances of pregnancy in the first cycle after the workup and a similar decrease in chances as time progresses. The slope was 0.90 ( P = 0.37), indicating a similar strength of patient characteristics in the valid- ation cohort and no signi ficant difference from 1. The discriminative ability of the model in the validation cohort was moderate and similar to that in the Dutch development cohort, ran- ging over time from a c statistic of 0.61 (95%CI 0.57 –0.64) in the first 5466 couples registered between 1998 and 2011 in the Aberdeen Fer/g415lity Clinic Women excluded with diagnoses other than unexplained subfer/g415lity (n = 3945) 1521 couples with unexplained subfer/g415lity 1203 couples in the final analysis Couples that did not provide consent for treatment data to be used for research (n = 10) Couples conceived before comple/g415on of fer/g415lity workup (n = 234) Couples excluded with missing outcome data (n = 8) Couples excluded with missing predictor values (n = 39) Couples excluded that were followed for less than one cycle of expectant management (n = 6) Couples with a dura/g415on of subfer/g415lity of 50 weeks or less (n = 21) Figure 1 Flow chart of couples with unexplained subfertility who were considered for inclusion in the external validation. 2271External validation of a dynamic prediction model year, 0.62 (95% CI 0.58 –0.67) from 0.5 year, 0.63 (95% CI 0.57 –0.69) from 1 year, to 0.60 (95% CI 0.52 –0.67) for 1.5 years after the com- pletion of the fertility workup, all for conceiving in the following year. The c statistics were around 0.61 for all four time periods and seemed stable over time. The range of predictions varied between 0% and 55% over the course of the first year after the fertility workup. After 0.5, 1 and 1.5 years of expectant management the ranges narrowed to 0 –43%, 0–34% and 0 –29% respectively, all over the course of the following year, facilitating a distinction between couples with a good or poor prognosis. Sensitivity analyses

Results

from the two sensitivity analyses are reported online as Supplementary data. Theanalysis where we assumed 1.5 months between registration and completion of the fertility workup showed a very good performance of the dynamic prediction model (Supplementary Table SI, Supplementary Figs S1 and S2). The analysis assuming 4.5 months between registration and completion of the fertility workup showed similar results to the primary analysis but with slightly more overestimation of chances by the model (Supplementary Table SII, Supplementary Figs S3 and S4).

Discussion

We conducted an external, geographical validation of the van Eekelen model that can be used for repeated predictions of natural conception when couples return to the clinic after unsuccessful expectant manage- ment. The model performed well in a Scottish cohort of couples with unexplained subfertility that visited a fertility clinic and the model is expected to be generalizable to other fertility centres and countries where the procedure of managing unexplained subfertile couples is comparable to the Netherlands and the UK. In addition, the predicted probabilities varied suf ficiently to aid in distinguishing between couples with a good and poor prognosis in terms of natural conception. The data from the AFC was of high quality, registering every unex- plained subfertile couple in the Grampian region. All natural concep- tions leading to ongoing pregnancy, including after miscarriages and other early pregnancy outcomes, were found using data linkage with maternity records. Indications for the fertility workup and de finitions of censoring and prognostic characteristics in the Scottish cohort were very similar to the Dutch cohort, aiding comparability ( van Eekelen et al., 2017a). The model was well calibrated, which we consider of higher import- ance than discrimination since the c statistics can be expected to be moderate due to the limited range of predicted chances in fertility (Mol et al., 2005 ; Cook, 2007). This restricts the maximum possible c statistics, even if a model was to produce perfect predictions. Recalibration, in which one or more parameters of the prediction model are updated to accommodate better predictions in a different country or clinical setting, was not necessary since the recalibration model showed similar values for all parameters as observed in the development cohort. The main limitation to our study was missing data in terms of dates of completion of the fertility workup and menstrual cycle lengths. Menstrual cycle length was not considered very in fluential since the estimations of the number of cycles per individual are reasonable approximations due to the narrow range of possible cycle lengths in our selection of unexplained subfertile couples, but we did have to make strong assumptions about the date of completion of the fertility workup. We assumed 3 months between registration and completion of the fertility workup, which resulted in ongoing pregnancies before 3 months after registration being excluded. The ‘starting’ moment of follow-up thus differed from the Dutch development cohort since in the latter, the date of last tubal test was used as the end of the workup. Some Dutch clinics did not conduct a visual test of tubal patency, i.e. laparoscopy or hysterosalpingography after a negative

Result

for the chlamydia antibody test. In those Dutch clinics, the workup was thus considered as complete earlier after registration compared to the AFC where visual tests of tubal patency are a part of the standard protocol. This may have led to the observed slight over- estimation in the first year after the fertility workup and after 0.5 year of expectant management but, despite these differences, the dynamic model was still able to estimate a prognosis that was reasonably accur- ate on cohort and risk group level. The results from the sensitivity ana- lysis assuming 1.5 months between registration and completion of the fertility workup were very good because the resulting population more closely resembled that of the Dutch development cohort in which the same average duration was observed between registration and the workup completion. Accordingly, in the analysis assuming 4.5 months between registration and completion of the fertility workup, the per- formance of the dynamic model was poorer because the populations differed more due to additional selection that occurred. The dynamic model is able to reassess the chance of natural con- ception after any given period of expectant management from the completion of the fertility workup onwards. For example, a couple with 1-year secondary subfertility is referred by a general practitioner to the fertility clinic of which the woman is 33 years old at the comple- tion of the fertility workup and the man has 40% progressive motile sperm. Applying our model gives a predicted 38% chance of natural conception over the first year after the workup and they might be advised expectant management. When the couple returns to the clinic after 10 unsuccessful months/cycles, reapplying the model yields 25% chance over the following year, which is a realistic decrease given they have tried for an additional 10 months. This could be a reason to con- sider starting treatment. ........................................................................................ Table I Baseline characteristics at completion of the fertility workup. n = 1203 Mean or n 5th–95th Percentile or % Female age, in years 33.3 25 –41 Duration of subfertility, in years 2.7 1.3 –5.6 Primary female subfertility 697 58% Percentage of progressive motile sperm 51 24 –76 Referral by secondary care 84 7% 2272 van Eekelen et al. Both the Hunault model and the dynamic model performed well in external validations, indicating that the added value of the dynamic model lies in the ability to update predictions at later time points ( van Eekelen et al., 2017a ). This provides clinicians and patients with infor- mation regarding their prognosis of natural conception not only right after the completion of the fertility workup but also when the couple returns after an additional, unsuccessful period of expectant manage- ment, thus aiding in making clinical decisions at multiple time points throughout a couple’s trajectory. The ability to update predictions also aids in studies which include the prognosis of natural conception as an in- or exclusion criterion, since the prognosis of couples who return after unsuccessful expectant management can be updated accurately, leading to the desired homogeneity of the study sample ( van den Boogaard et al., 2014 ). The dynamic model is flexible and can be used to predict over any desired number of menstrual cycles, for instance when the couple is interested in time periods shorter or longer than 1 year. In short, the dynamic model has a wider clinical applicability than the Hunault model and should be the model of choice.

Conclusion

The van Eekelen model is a valid and robust tool that is ready to use in clinical practice to counsel couples with unexplained subfertility on their individualized chances of natural conception at various points in time, notably when couples return to the clinic after a period of unsuc- cessful expectant management. Figure 2 Cumulative chances of natural conception leading to ongoing pregnancy. Cumulative chances after the completion of fertility workup (upper panel) and updated chances of natural conception over the course of 1 year at the completion of the fertility workup or 0.5, 1 and 1.5 years thereafter (lower panel) in the validation cohort. Percentages are Kaplan –Meier estimates of the observed fraction of natural conception leading to ongoing pregnancy. 2273External validation of a dynamic prediction model Supplementary data Supplementary data are available at Human Reproduction online.

Acknowledgements

The authors would like to thank Prof. Egbert te Velde for all of his efforts regarding development of the dynamic prediction model and the current validation study. The authors acknowledge the data man- agement support of the Grampian Data Safe Haven (DaSH) and the associated financial support of NHS Research Scotland, through NHS Grampian investment in the Grampian DaSH. For more information, Figure 3 Calibration of the predictions of the dynamic prediction model: predicted vs observed 1-year natural conception rates at fourfixed time points. .......................................................................................... Table II Calibration of the dynamic prediction model by risk groups. Mean difference Max difference Number of risk groups After completion of workup 3.2 9.6 9 After 0.5-year EM 3.0 4.7 7 After 1-year EM 2.1 3.5 5 After 1.5-year EM 2.7 4.5 4 Total 2.8 9.6 25 Data are the mean and maximum of the absolute differences (in percentage points) between predicted and observed 1-year natural conception rates per risk group of n = 135, stratified by the elapsed period of expectant management (EM). 2274 van Eekelen et al. visit the DaSH website http://www.abdn.ac.uk/iahs/facilities/grampian- data-safe-haven.php. The authors would like to thank all the staff at Aberdeen Fertility Clinic for their help with database queries. Authors’ roles NvG, MDJ, BS, FvdV, MvW and MJE conceived the study. MDJ per- formed the data linkage, storage in the Safe Haven and cleaned the data. RvE, NvG and MJE designed the statistical analysis plan. RvE, MDJ and NvG analysed the data. RvE drafted the manuscript. All authors contributed critical revision to the paper and approved the final manuscript. Funding Chief Scientist Office postdoctoral training fellowship in health services research and health of the public research (ref PDF/12/06). The views expressed here are those of the authors and not necessarily those of the Chief Scientist Office. The funder did not have any role in the study design; the collection, analysis and interpretation of data; the writing of the report nor the decision to submit the paper for publication. Conflict of interest None declared.

References

Aboulghar M, Baird D, Collin J, Evers J, Fauser B, Lambalk C, Other A. Intrauterine insemination. Hum Reprod Update 2009;15:265–277. ASRM. Revised American Society for Reproductive Medicine classi fication of endometriosis: 1996. Fertil Steril 1997;67:817–821. Bongaarts J. A method for the estimation of fecundability. Demography 1975;12:645–660. Brandes M, Hamilton CJ, de Bruin JP, Nelen WL, Kremer JA. The relative contribution of IVF to the total ongoing pregnancy rate in a subfertile cohort. Hum Reprod 2010;25:118–126. Cook NR. Use and misuse of the receiver operating characteristic curve in risk prediction. Circulation 2007;115:928–935. Coppus SF, van der Veen F, Opmeer BC, Mol BW, Bossuyt PM. Evaluating prediction models in reproductive medicine. Hum Reprod 2009;24: 1774–1778. Gnoth C, Godehardt D, Godehardt E, Frank-Herrmann P, Freundl G. Time to pregnancy: results of the German prospective study and impact on the management of infertility. Hum Reprod 2003;18:1959–1966. Harrell FE Jr., Lee KL, Mark DB. Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measur- ing and reducing errors. Stat Med 1996;15:361–387. Hunault CC, Habbema JD, Eijkemans MJ, Collins JA, Evers JL, te Velde ER. Two new prediction rules for spontaneous pregnancy leading to live birth among subfertile couples, based on the synthesis of three previous models. Hum Reprod 2004;19:2019–2026. Leushuis E, van der Steeg JW, Steures P, Bossuyt PM, Eijkemans MJ, van der Veen F, Mol BW, Hompes PG. Prediction models in reproductive medicine: a critical appraisal. Hum Reprod Update 2009;15:537–552. Mol BW, Coppus SF, Van der Veen F, Bossuyt P. ROC-curves are mislead- ing; calibration is not! Fertil Steril 2005;84:253–254. NVOG, Dutch Society for Obstetrics and Gynaecology. Guideline on: subferti- lity (2010). http://bit.ly/1UhuYMV. (5 February 2017, date last accessed). Pandey S, McLernon DJ, Scotland G, Mollison J, Wordsworth S, Bhattacharya S. Cost of fertility treatment and live birth outcome in women of different ages and BMI. Hum Reprod 2014;29:2199–2211. Pandian Z, Gibreel A, Bhattacharya S. In vitro fertilisation for unexplained subfertility. Cochrane Database Syst Rev 2015;2:Cd003357. R Core Team. R: A Language and Environment for Statistical Computing . Vienna, Austria: R Foundation for Statistical Computing, 2013. http:// www.R-project.org/. Steyerberg E. Clinical Prediction Models: A Practical Approach to Development, Validation and Updating. New York, USA: Springer, 2009. Tjon-Kon-Fat RI, Bensdorp AJ, Scholten I, Repping S, van Wely M, Mol BW, van der Veen F. IUI and IVF for unexplained subfertility: where did we go wrong? Hum Reprod 2016;31:2665–2667. van den Boogaard NM, Bensdorp AJ, Oude Rengerink K, Barnhart K, Bhattacharya S, Custers IM, Coutifaris C, Goverde AJ, Guzick DS, Hughes EC et al . Prognostic pro files and the effectiveness of assisted conception: secondary analyses of individual patient data. Hum Reprod Update 2014;20:141–151. van der Steeg JW, Steures P, Eijkemans MJ, Habbema JD, Hompes PG, Broekmans FJ, van Dessel HJ, Bossuyt PM, van der Veen F, Mol BW. Pregnancy is predictable: a large-scale prospective external validation of the prediction of spontaneous pregnancy in subfertile couples. Hum Reprod 2007;22:536–542. van Eekelen R, Scholten I, Tjon-Kon-Fat RI, van der Steeg JW, Steures P, Hompes P, van Wely M, van der Veen F, Mol BW, Eijkemans MJ et al . Natural conception: repeated predictions over time. Hum Reprod 2017a;32:346–353. van Eekelen R, van Geloven N, van Wely M, McLernon DJ, Eijkemans MJ, Repping S, Steyerberg EW, Mol BW, Bhattacharya S, van der Veen F. Constructing the crystal ball: how to get reliable prognostic information for the management of subfertile couples. Hum Reprod 2017b;32: 2153–2158. van Houwelingen HC. Validation, calibration, revision and combination of prognostic survival models. Stat Med 2000;19:3401–3415. Veltman-Verhulst SM, Hughes E, Ayeleke RO, Cohlen BJ. Intra-uterine insemination for unexplained subfertility. Cochrane Database Syst Rev 2016;2:Cd001838. Wang X, Chen C, Wang L, Chen D, Guang W, French J. Conception, early pregnancy loss, and time to clinical pregnancy: a population-based pro- spective study. Fertil Steril 2003;79:577–584. Weinberg CR, Gladen BC. The beta-geometric distribution applied to comparative fecundability studies. Biometrics 1986;42:547–560. WHO. World Health Organization. Laboratory Manual for the Examination of Human Semen and Sperm-Cervical Mucus Interaction , 4th edn. Cambridge: Cambridge University Press, 1999. WHO. World Health Organisation. Laboratory Manual for the Examination and Processing of Human Semen , 5th edn. Geneva: World Health Organization, 2010. 2275External validation of a dynamic prediction model

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-pdf

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Condition tags

endometriosis

MeSH descriptors

Fertilization Infertility Models, Biological Adult Female Fertilization Humans Infertility Male Pregnancy Pregnancy Rate Prognosis Registries Scotland

Citation neighborhood (sparse)

Too few in-corpus citations on either side for a chart; here are the lists.

Cites (2)

Cited by (1)

References (31)

Cited by (1)

Source provenance

europepmc
last seen: 2026-06-11T06:19:48.454388+00:00
openalex
last seen: 2026-05-10T10:51:49.019974+00:00
pubmed
last seen: 2026-05-13T22:19:31.300640+00:00
License: CC0 · commercial use OK