Development and validation of a clinical nomogram for predicting cumulative live birth rate in ovarian endometrioma patients undergoing ethanol sclerotherapy and in vitro fertilization/intracytoplasmic sperm injection

other OA: gold CC-BY-4.0

Abstract

BACKGROUND: Effective individualized tools to predict cumulative live birth rates are lacking for patients with ovarian endometriomas undergoing In Vitro Fertilization or Intracytoplasmic Sperm Injection after ultrasound-guided ethanol sclerotherapy. This study aimed to construct and validate a nomogram model for predicting the cumulative live birth rate in this specific population. METHODS: This retrospective study analyzed the clinical data of 194 patients with ovarian endometriosis who underwent ethanol sclerotherapy followed by In Vitro Fertilization or Intracytoplasmic Sperm Injection between January 2020 and December 2024. All patients were randomly divided into training (n = 135) and validation (n = 59) cohorts in a 7:3 ratio. Independent risk factors for the cumulative live birth rate were identified through a comprehensive three-stage screening process that included univariate regression, Least Absolute Shrinkage and Selection Operator regression, and multivariate logistic regression analyses. The nomogram model was constructed using the selected variables, and its discrimination, calibration, and clinical utility were assessed using Receiver Operating Characteristic curves, calibration curves, and Decision Curve Analysis. RESULTS: A total of 194 patients were included, with an overall cumulative live birth rate of 50.0% (97/194). Multivariate regression analysis identified four independent predictors of the cumulative live birth rate: previous live birth history, controlled ovarian hyperstimulation protocol, number of oocytes retrieved, and cyst diameter. The nomogram constructed using these variables exhibited good discriminatory ability in both the training and validation cohorts, with areas under the curve of 0.849 (95% Confidence Interval: 0.782-0.912) and 0.853 (95% Confidence Interval: 0.754-0.952), respectively. Bootstrap validation (500 iterations) confirmed the stability of the model. Calibration was acceptable (Hosmer-Lemeshow test, P>0.05), and decision curve analysis indicated a favorable net benefit across a threshold probability range of 10-50%, demonstrating its clinical utility. CONCLUSIONS: This study developed and validated a parsimonious nomogram model that accurately predicts the cumulative live birth rate in patients with ovarian endometriosis undergoing In Vitro Fertilization/Intracytoplasmic Sperm Injection after ethanol sclerotherapy. Based on four key predictors (previous live birth history, controlled ovarian hyperstimulation protocol, number of oocytes retrieved, and cyst diameter), the model demonstrates excellent and robust predictive performance. It serves as an intuitive and reliable clinical tool to support individualized counseling and shared treatment decision-making.
Full text 46,004 characters · extracted from pmc · 5 sections · click to expand

Intro

Endometriosis (EMs) affects approximately 10% of reproductive-aged women and is a major cause of pelvic pain and infertility ( 1 , 2 ). Ovarian endometriomas (OMAs) occur in 17–44% of cases and impair fertility through pelvic anatomical distortion, proinflammatory environments, and damage to adjacent ovarian tissue, leading to a reduced ovarian reserve ( 3 , 4 ). Managing OMAs in fertility-seeking patients requires careful balancing of symptom control and ovarian preservation. The stripping technique removes the healthy ovarian cortex alongside the cyst walls, substantially lowering postoperative anti-Müllerian hormone (AMH) levels and antral follicle count (AFC) ( 5 , 6 ). This functional decline is particularly concerning In Vitro Fertilization/Intracytoplasmic Sperm Injection (IVF/ICSI) candidates, as a reduced ovarian reserve correlates with poor stimulation responses and decreased live birth rates ( 7 ). Ultrasound-guided ethanol sclerotherapy (EST) has emerged as a minimally invasive alternative for the management of OMAs. This procedure involves cyst aspiration followed by ethanol instillation, which chemically ablates the cyst lining while preserving the ovarian reserve ( 8 , 9 ). Meta-analyses reveal effective cyst reduction and pain relief, with AMH levels remaining stable or improving, in contrast to the significant decline observed post-cystectomy ( 10 ). The mechanism of action involves disruption of the cyst’s epithelial lining through the direct cytotoxic effects of ethanol, followed by inflammation and fibrosis, which ultimately leads to cyst obliteration ( 11 ). Unlike surgical excision, EST precisely targets OMAs without damaging the surrounding normal ovarian tissue, thereby better preserving the ovarian reserve and follicular pool ( 12 ). Systematic reviews have demonstrated that EST results in comparable or superior oocyte retrieval compared with laparoscopic cystectomy, with pregnancy rates ranging from 20% to 57% ( 8 , 11 ). When performed before IVF/ICSI, EST may enhance reproductive outcomes by reducing local inflammation and supporting folliculogenesis ( 13 ). Despite these advantages, predicting individual live birth outcomes remains challenging. IVF/ICSI success depends on the complex interactions among patient characteristics, disease factors, and treatment parameters ( 14 ). Although biomarkers such as AMH are routinely used, their predictive accuracy for live births remains limited ( 15 ). Nomograms have shown superior accuracy over single-variable models in reproductive medicine ( 16 , 17 ); however, there has been no prediction of the cumulative live birth rate (CLBR) in patients with OMAs undergoing IVF/ICSI after EST. This study sought to develop and validate a nomogram for predicting CLBR in OMAs who underwent EST prior to IVF/ICSI. We hypothesized that a multivariable model incorporating clinical, sonographic, and treatment variables would yield accurate individualized predictions to enhance patient counseling and inform clinical decision-making.

Results

This study retrospectively analyzed 220 patients with OMAs who underwent ultrasound-guided EST followed by IVF/ICSI between January 2020 and December 2024 at our institution. In accordance with the stringent inclusion and exclusion criteria, 26 patients were excluded from the study: 11 were lost to follow-up, 5 had incomplete data, 7 presented with severe male factor infertility, and 3 did not undergo IVF/ICSI. Consequently, 194 patients were deemed eligible for the final analysis ( Figure 1 ). Patients were randomly split 7:3 into a training cohort (n=135, 70%) for model development and a validation cohort (n=59, 30%) for internal validation, establishing a statistical foundation. The median age of the study population was 33.0 years, with a median AMH level of 2.83 ng/mL, eight oocytes retrieved, and four usable embryos produced. Among all baseline variables, only bT (training: 28.22 ng/dL vs. validation: 22.75 ng/dL, P = 0.045) and AFC (training: 11.00 vs. validation: 9.00, P = 0.030) showed statistically significant differences. All other variables, including age, BMI, infertility features, cyst parameters, COH protocols, and embryological outcomes, were comparable (P>0.05) ( Table 1 ). This confirmed successful randomization and a good baseline balance, providing a reliable dataset for the subsequent model construction. Flowchart of study population selection and cohort allocation. EMRS, Emergency Medical Response System; LASSO, Least Absolute Shrinkage and Selection Operator; IVF, In Vitro Fertilization; ICSI, Intracytoplasmic Sperm Injection. Baseline characteristics and clinical parameters of study participants. Continuous variables with a normal distribution are presented as mean ± standard deviation (SD), while those with a non-normal distribution are shown as median [interquartile range]. Categorical variables are expressed as numbers (n) and percentages (%). P-values were determined using the independent t-test for continuous variables with normal distribution, the Mann-Whitney U test for those with non-normal distribution, and the Chi-square test or Fisher’s exact test for categorical variables. AFC, Antral Follicle Count; AMH, Anti-Müllerian Hormone; BMI, Body Mass Index; bFSH, Basal Follicle-Stimulating Hormone; bLH, Basal Luteinizing Hormone; bE 2 , Basal Estradiol; bT, Basal Testosterone; bPRL, Basal Prolactin; bP, Basal Progesterone; COH, Controlled Ovarian Hyperstimulation; GnRH, Gonadotropin-Releasing Hormone; IVF, In Vitro Fertilization; ICSI, Intracytoplasmic Sperm Injection; PPOS, Progestin-Primed Ovarian Stimulation; MII, Metaphase II. We conducted a univariate logistic regression analysis to evaluate the association between each candidate variable and CLBR. Seventy-one variables spanning six clinical domains were assessed. Using a liberal screening threshold of P < 0.10 to avoid prematurely discarding potentially informative predictors, 26 variables showed statistically significant associations with CLBR ( Table 2 ). The complete univariate results for all 71 candidate variables are presented in Supplementary Table S1 . From the 26 identified variables, the LASSO regression analysis, utilizing 10-fold cross-validation, determined an optimal λ value of 0.0807 ( Figures 2A, B ). Consequently, nine variables were retained: female age, male age, AFC, previous live birth history, COH protocol, number of oocytes retrieved, number of cryopreserved embryos, LH level on HCG trigger day, and cyst diameter. The nine variables were further refined using multivariate logistic regression with backward elimination. This process yielded a streamlined final model comprising four independent predictors: previous live birth history, COH protocol, total number of oocytes retrieved, and endometrioma diameter ( Table 3 ). Univariate logistic regression analysis results (P < 0.10) associated with cumulative live birth rate. Results are presented as regression coefficients (β), standard errors (SE), odds ratios (OR) with 95% confidence intervals (CI), Z statistics, and P-values. P-values < 0.05 are considered statistically significant. Ref, Reference category. AFC, Antral Follicle Count; AMH, Anti-Müllerian Hormone; CI, Confidence Interval; COH, Controlled Ovarian Hyperstimulation; GnRH, Gonadotropin-Releasing Hormone; HCG, Human Chorionic Gonadotropin; LH, Luteinizing Hormone; MII, Metaphase II; OR, Odds Ratio; PPOS, Progestin-Primed Ovarian Stimulation. Results of the LASSO regression. (A) Coefficient profiles of the 26 candidate variables. (B) Cross-validation curve for model selection. Multivariate logistic regression analysis of factors associated with cumulative live birth rate. Data are presented as beta coefficient (β), standard error (SE), odds ratio (OR) with 95% confidence interval (CI), Z-statistic, and P-value. P-values < 0.05 are considered statistically significant. AFC, antral follicle count; AMH, anti-Müllerian hormone; CI, confidence interval; COH, controlled ovarian hyperstimulation; GnRH, gonadotropin-releasing hormone; HCG, human chorionic gonadotropin; LH, luteinizing hormone; MII, metaphase II; OR, odds ratio; PPOS, progestin-primed ovarian stimulation; Ref, reference category; SE, standard error. We constructed a nomogram to predict the CLBR in women with OMAs after EST and IVF/ICSI treatment. The nomogram included previous live birth history, COH protocol, cyst diameter, and number of oocytes retrieved ( Figure 3 ). To calculate individualized CLBR prediction probabilities using the nomogram, first, find values for the five predictors and read their scores on the “Points” scale. Then, sum these scores to get “Total Points.” Finally, the total points were projected downward to the “Diagnostic Possibility” scale to determine the predicted CLBR probability. Model calibration showed a clear link between total points and the predicted CLBR probability. A total score of 90 corresponded to a 10% CLBR, 115 to 20%, 131 to 30%, 145 to 40%, 157 to 50%, 170 to 60%, 183 to 70%, 199 to 80%, and 224 to 90%. This probability gradient helps doctors predict patient outcomes, provide advice, and set realistic treatment goals and decisions. Nomogram for predicting cumulative live birth rate in women with ovarian endometriomas after ethanol sclerotherapy and subsequent IVF/ICSI. To use the nomogram, locate the patient’s value for each predictor on the corresponding axis, draw a vertical line to the ‘Points’ axis to determine the points, sum the points for all predictors, and locate the total points on the ‘Total Points’ axis. Draw a vertical line down to the ‘Diagnostic Possibility’ axis to obtain the predicted CLBR. COH, controlled ovarian hyperstimulation; PPOS, progestin-primed ovarian stimulation; GnRH, Gonadotropin−Releasing Hormone. The discriminatory capacity of the nomogram was assessed using ROC curve analysis in both datasets to differentiate between patients who achieved and did not achieve cumulative live birth. The nomogram showed good discrimination, with an AUC of 0.849 (95% CI: 0.782-0.912) in the training dataset ( Figure 4A ) and 0.853 (95% CI: 0.754-0.952) in the validation dataset ( Figure 4B ). The AUC in the validation set was higher than that in the training set, suggesting minimal overfitting and good model generalizability. Overlapping CIs indicate the model’s stable discriminative ability. The DeLong test indicated no statistically significant difference between the AUCs of the training and validation sets (P = 0.9527), suggesting consistent discriminative performance across the datasets. Bootstrap resampling validation with 500 iterations was performed for both cohorts. The Bootstrap-validated AUC was 0.877 (95% CI: 0.817-0.937) in the training cohort and 0.879 (95% CI: 0.792-0.967) in the validation cohort, as shown in Figures 4C, D . Bootstrap CIs aligned with the original ROC analysis confirmed the robustness of the model discrimination. The Bootstrap AUC in the validation cohort exceeded that in the training cohort, demonstrating strong generalizability without overfitting. Receiver operating characteristic (ROC) curves of the nomogram for predicting the cumulative live birth rate. (A) ROC curve of the training set. (B) ROC curve for the validation set. (C) ROC curve of the training set after 500 bootstrap resampling. (D) ROC curve of the validation set after 500 bootstrap resampling. The red line represents the apparent ROC, and the gray lines represent the ROC curves generated from 500 bootstrap samples. ROC, receiver operating characteristic; AUC, area under the ROC curve. Calibration analysis was performed to evaluate the agreement between the predicted probabilities and the observed outcomes. Calibration plots were constructed to visually assess the relationship between the predicted and actual CLBR. In the training set, the model demonstrated good calibration, with the calibration curve generally aligned with the ideal line. The key calibration metrics included a Brier score of 0.151, an intercept of 0.000, and a slope of 1.000, indicating accurate probability estimates without any systematic bias. Furthermore, the Hosmer-Lemeshow goodness-of-fit test yielded a non-significant result (P = 0.415), supporting the conclusion that there was no statistically significant difference between the predicted and observed CLBR. In the validation set, the model maintained a good calibration ( Figure 5B ). The Brier score was 0.158, which is comparable to that of the training set and indicates a low prediction error. The calibration intercept was -0.549 with a slope of 0.915, indicating a slight tendency to overestimate risk at lower probabilities and underestimate it at higher ones; nonetheless, the overall performance remained robust. Crucially, the Hosmer-Lemeshow test was also non-significant in the validation cohort (P = 0.083), confirming that the model’s predictions were well-calibrated and reliable for external data. Calibration analysis of the nomogram for predicting cumulative live birth rate. (A) Calibration curve for the training set. (B) Calibration curve for the validation set. DCA was conducted to evaluate the clinical utility and net benefit of using the nomogram for decision-making in both the training and validation sets ( Figure 6 ). DCA determines the range of threshold probabilities at which the model provides a greater net benefit than the default strategies of treating all or none of the patients. In the training set ( Figure 6A ), the nomogram demonstrated a significant net benefit across a wide range of threshold probabilities, from approximately 5% to 90%. This indicates that if a clinician’s threshold for intervention lies within this range, using the nomogram to guide decisions is superior to “treat all” and “treat none” patients. The model achieved a peak net benefit of approximately 0.48 at a 10% threshold. Even at more moderate thresholds, the benefit remained substantial; for instance, at a threshold probability of 25%, the net benefit was approximately 0.40, and at 50%, it was approximately 0.30. This means that at a 25% threshold, using the model is equivalent to a strategy that correctly identifies 40 additional CLBR cases per 100 patients without increasing unnecessary interventions compared with the default strategies. In the validation set ( Figure 6B ), the clinical value of the model was confirmed. The nomogram provided a positive net benefit for threshold probabilities ranging from approximately 5%–85%. The net benefit curve for the nomogram was consistently higher than that of both the “treat all” and “treat none” strategies within this range. The peak net benefit in the validation cohort was approximately 0.38, observed at a low threshold of approximately 5-10%. At the 25% threshold, the net benefit was approximately 0.25, and at the 40% threshold, it was approximately 0.20. These results robustly demonstrate that the nomogram is not only statistically accurate but also clinically practical, offering tangible benefits for guiding personalized treatment decisions in women with OMAs undergoing IVF/ICSI. Decision curve analysis for the nomogram predicting cumulative live birth rate. (A) Decision curve analysis for training set. (B) Decision curve analysis for validation set. The blue line represents the nomogram model, the red line represents the “treat all” strategy, and the green line represents the “treat none” strategy. The x-axis shows the threshold probability, and the y-axis shows the net benefit. The shaded area at the bottom indicates the range in which the nomogram is recommended for clinical use.

Discussion

In this study, we developed and validated a comprehensive nomogram to predict CLBR in women with OMAs undergoing EST and IVF/ICSI. Our model incorporated four clinical variables: previous live birth history, cyst diameter, COH protocol, and number of oocytes retrieved, showing good discriminative ability with AUCs of 0.849 and 0.853 in the training and validation sets, respectively. It demonstrated good calibration and clinical utility for individualized fertility counseling in the study. Bootstrap resampling validation (500 iterations) further confirmed the robustness of the model, with Bootstrap AUCs of 0.877 and 0.879 in the training and validation sets, respectively. The Bootstrap CIs were consistent with the original analysis, demonstrating reliable model performance estimates. The development of this nomogram addresses a critical gap, as women with OMAs undergoing ovary-sparing EST represent a unique patient subset. Lavadia et al. ( 9 ) confirmed that EST is a viable treatment option that preserves the ovarian reserve and improves IVF/ICSI outcomes compared to cystectomy. Olawade et al. ( 26 ) demonstrated that AI-driven personalized treatment protocols for IVF could optimize ovarian stimulation and improve clinical outcomes using individualized approaches. Our model builds on this by providing a tailored prognostic tool that quantifies individual success probabilities, moving beyond generalized treatment recommendations to facilitate precise, patient-centered, and clinical decision-making. Moreover, our model can be integrated into clinical workflows at various decision points, ranging from the initial consultation to treatment monitoring, thereby facilitating the advancement of precision in reproductive medicine. The inclusion of endometrioma diameter as a significant negative predictor highlights the complex interplay between cyst size, disease burden, and reproductive outcomes. Our findings are corroborated by evidence demonstrating that endometriomas measuring ≥3 cm are associated with significantly lower live birth rates (28.6% vs. 70.0%; P = 0.004) than smaller cysts, even when ovarian reserve parameters are comparable ( 27 ). This size-dependent effect occurs through multiple mechanisms, including inflammatory infiltration, pro-inflammatory cytokine production, and mechanical distortion of the ovarian architecture, which impairs the follicular microenvironment ( 28 , 29 ). Although an overall reduction in live birth rates with EMs was not observed in some studies, larger cysts have been shown to exert more pronounced local effects ( 30 ). Therefore, EST represents a minimally invasive alternative to surgery, which has been highlighted as a risk for further compromise of the ovarian reserve ( 31 ). The positive correlation between the number of oocytes retrieved and CLBR is among the most well-established findings in reproductive medicine, highlighting the critical role of ovarian response in determining treatment success. A pivotal study by Becker et al. ( 32 ) provided a comprehensive analysis of one-year CLBRs, showing that CLBR surpassed 60% when more than 15 oocytes were retrieved and exceeded 70% with over 20 oocytes, although this relationship was influenced by maternal age. This aligns with the finding that 8–14 oocytes may be optimal for fresh transfer cycles, whereas a larger yield confers a cumulative benefit ( 33 ). Furthermore, a study by Chai et al. ( 34 ) in women with an expected poor ovarian response according to the Bologna criteria showed that those who had more than three oocytes retrieved achieved significantly higher CLBR than those with ≤3 oocytes (44.0% vs. 18.6%, P = 0.006), highlighting the critical role of oocyte yield even in challenging patient groups such as those with endometriosis. The selection of a COH protocol in women with EMs is critical, extending beyond ovarian response to encompass the interplay between hormonal stimulation, inflammation, and reproductive outcomes. Our model’s identification of the COH protocol as a significant predictor shows it is a key therapeutic intervention. However, the optimal protocol remains debated, with evidence suggesting that a uniform approach is inadequate for this heterogeneous patient population. The GnRH agonist long protocol is favored as pituitary desensitization can reduce pelvic inflammation in EMs. This pre-stimulation period may improve endometrial receptivity and follicular environment. Anti-inflammatory protocols may benefit severe EMs or cases with elevated inflammatory markers ( 35 ). A retrospective study by Kolanska et al. ( 36 ) provided evidence supporting this, demonstrating that a GnRH agonist protocol resulted in significantly higher live birth rates after fresh embryo transfer than an antagonist protocol. They suggested the antagonist protocol might impair endometrial receptivity in fresh cycles, aligning with endometriosis’s known effects on the endometrium ( 37 ). However, these potential benefits must be weighed against the longer treatment duration and higher gonadotropin requirement. In contrast, the GnRH antagonist protocol offers a more flexible, shorter, and patient-friendly regimen with a significantly lower risk of Ovarian Hyperstimulation Syndrome (OHSS). A more recent and comprehensive systematic review by Kuan et al. ( 38 ) found that both GnRH agonist and antagonist protocols generally yielded similar clinical pregnancy and live birth rates. This suggests that with the now-common practice of a ‘freeze-all’ strategy, any potential negative impact of the antagonist protocol on the endometrium in a fresh cycle can be effectively bypassed. The review noted that the GnRH agonist protocol might yield a higher total number of oocytes, which could translate to a higher cumulative pregnancy rate, reinforcing the importance of oocyte yield in this patient group ( 38 ). Recently, the PPOS protocol has emerged as a viable all-oral alternative that effectively prevents premature LH surges and enhances patient comfort. Its role in endometriosis, particularly in patients with diminished ovarian reserve after surgery, is an area of active investigation. A retrospective study by Yang et al. ( 39 ) compared PPOS with ultra-long GnRH agonist and antagonist protocols in patients with endometriomas. The study found that while PPOS was associated with a lower live birth rate than the ultra-long GnRH agonist protocol in the first embryo transfer cycle (OR 2.5, 95% CI 1.1-5.7), its outcomes were comparable to those of the GnRH antagonist protocol. Another study by Li et al. ( 40 ) focusing on patients with DOR after OMA surgery found no significant difference in the CLBR among GnRH antagonist, PPOS, and mild-stimulation protocols, although it concluded that both antagonist and PPOS protocols were superior to mild-stimulation. Murria et al. ( 41 ) found that PPOS cycles might produce lower-quality blastocysts than agonist and antagonist protocols do. Evidence shows that PPOS and GnRH antagonist protocols are effective, although PPOS may offer advantages, such as improved cycle flexibility and reduced inflammation ( 42 ). This finding highlights the ongoing debate and reinforces the fact that no single protocol is universally superior. This finding highlights the ongoing debate and reinforces the fact that no single protocol is universally superior to the others. Thus, the selection process should be based on a detailed analysis of the patient’s specific clinical profile, which includes ovarian reserve, past treatment responses, and distinct inflammatory and hormonal conditions linked to endometriosis, rather than adopting a generalized approach. A counterintuitive yet clinically insightful finding from our model is that a history of previous live births acts as a negative predictor of CLBR. This challenges the conventional belief that parous women conceive more easily and highlights the nature of EMs as a chronic, progressive disease. We focus our interpretation of this phenomenon on three core concepts: “secondary infertility,” “disease progression,” and “interpregnancy interval” interpregnancy interval. First, EMs is not a static condition; its pathophysiological processes evolve over time. Even if a woman has previously achieved a successful pregnancy naturally or with simple assistance, the underlying EMs lesions can continue to progress postpartum. Inflammation, oxidative stress, and hormonal resistance persistently erode reproductive potential. A review showed that inflammation and oxidative stress are key mechanisms by which EMs impairs fertility, affecting oocyte quality, sperm, and embryo viability ( 43 ). When these patients seek fertility treatment years later, they face a more hostile internal environment than during their first pregnancy. IVF/ICSI requirement may indicate selection bias of advanced disease stages. The interpregnancy interval is crucial, with a retrospective ART study showing decreased pregnancy and live birth rates for intervals ≥24 months compared to 12–18 months ( 44 ). For patients with EMs, this time window is critical as it corresponds to disease progression. Although secondary infertility has a better prognosis in the general infertile population ( 45 ), the progressive nature of EMs makes it a special case. For parous patients with EMs, prior live births likely occurred earlier in disease progression, while current infertility reflects cumulative damage. A national cohort study showed women with EMs have lower first-birth and fertility rates than the general population, further supporting the persistent negative impact of EMs on fertility ( 46 ). The impact of EMs on fertility varies by age, with a prospective cohort study showing stronger infertility risk association in women under 35 ( 47 ). For women diagnosed with EMs at a younger age, cumulative fertility damage may be more pronounced over time. A history of previous live births should not be viewed as a good prognostic indicator in EMs patients, but rather as a timestamp indicating longer disease development. This emphasizes the need for comprehensive evaluation of parous patients with endometriosis and secondary infertility. Clinicians should not be reassured by previous successful pregnancy but remain vigilant about disease progression impacts. The negative link between previous births and CLBR needs cautious interpretation due to potential selection bias. Women with a history of live birth who subsequently seek IVF treatment may represent a subgroup with more advanced or progressive disease, as they were able to conceive naturally or with less invasive treatments in the past but now require IVF due to the disease progression. This nomogram serves as a practical tool for translating complex statistical data into individualized clinical guidance, fostering shared decision-making for women with OMAs undergoing IVF/ICSI after EST. Its primary utility lies in pre-treatment counseling, where it provides a quantitative estimate of the CLBR to manage patient expectations. Based on our DCA results, a threshold probability of 30% was a reasonable benchmark. For patients with a predicted CLBR above this threshold, clinicians can offer reassurance and encourage proceeding with IVF/ICSI treatment. Conversely, for those with a prognosis below 30%, the model facilitates crucial discussions on alternative strategies. This may include optimizing the COH protocol, considering a repeat EST for larger cysts, or realistically counseling on the potential need for multiple retrieval cycles. It is imperative to underscore that this nomogram was designed as a decision-support tool to augment, not supplant, clinical judgment. The predictions provided by the model should be integrated with the clinician’s experience, the patient’s individual circumstances, and shared decision-making. It serves as an objective reference to facilitate more personalized counseling and expectation management, helping both clinicians and patients to better understand the likelihood of success before embarking on the IVF/ICSI journey. This study had several notable strengths that enhanced the validity and clinical applicability of our findings. To our knowledge, this is the first study to develop and validate a prediction model specifically for the CLBR in women with OMAs following EST and subsequent IVF/ICSI treatment, addressing a critical gap in reproductive medicine. Second, we employed a rigorous statistical methodology adhering to the TRIPOD guidelines, including a comprehensive three-stage variable selection process (univariate regression, LASSO regression, and multivariate logistic regression) to minimize overfitting and ensure the parsimony of the model. Third, Bootstrap resampling validation with 500 iterations provided a robust assessment of model stability and reproducibility, confirming the reliability of our predictions. Fourth, we used CLBR as the primary outcome, which is more clinically meaningful than the single-cycle pregnancy rate, as it captures the full reproductive potential of all embryos from a single oocyte retrieval cycle. Finally, the model demonstrated consistent performance across both the training and validation cohorts, with good discrimination (AUC 0.849-0.853), acceptable calibration, and favorable clinical utility, as evidenced by decision curve analysis, supporting its potential for clinical implementation. This study has some limitations that should be considered when interpreting its findings. First, the retrospective, single-center design may limit its generalizability. Although our sample size of 194 patients was adequate, larger multicenter cohorts are required for robustness. Second, our model excluded variables such as genetic polymorphisms, inflammatory biomarkers, and surgical history due to data constraints. Future studies should include these factors to improve performance. Third, the finding that a history of previous live birth is a negative predictor requires cautious interpretation, possibly reflecting endometriosis progression, selection bias, or unmeasured confounders such as the time interval since the last live birth. External validation is essential for confirming this association. Another limitation of this study is the statistically significant difference in two key ovarian reserve markers—basal testosterone and AFC—between the training and validation sets, despite random allocation. This baseline imbalance could potentially challenge the assessment of the model’s generalizability. Ideally, methods such as Propensity Score Matching or rigorous covariate adjustment should be employed to eliminate such confounding bias. Due to the retrospective nature and limited sample size of our study, we did not implement these more complex statistical corrections. Although our model demonstrated good predictive performance on the validation set despite the baseline differences, which to some extent proves its robustness, we must acknowledge that this methodological shortcoming may limit the direct applicability of our findings. Future research, particularly in larger prospective cohorts, should prioritize ensuring comparability between groups through strict matching or stratified designs to obtain more reliable validation results. While this study establishes a foundation for individualized CLBR prediction in women with OMAs after EST and IVF/ICSI, several directions warrant further investigation. First, external validation in independent multicenter cohorts across diverse geographic regions and clinical settings is essential to confirm the generalizability and robustness of the nomogram. Second, prospective studies are needed to evaluate the model’s performance in real-world clinical practice and assess its impact on clinical decision-making, patient satisfaction, and reproductive outcomes. Third, the incorporation of emerging biomarkers (e.g., AMH kinetics, inflammatory markers, and genetic polymorphisms) and advanced analytical techniques (e.g., machine learning algorithms and artificial intelligence) may further enhance predictive accuracy and enable dynamic risk stratification. Finally, extending this prediction framework to other endometrioma management strategies (e.g., laparoscopic cystectomy and expectant management) and investigating the cost-effectiveness of nomogram-guided treatment selection represent important areas for future research to optimize fertility care in women with OMAs.

Conclusions

We developed and validated the first nomogram to predict the CLBR in women undergoing IVF/ICSI after EST for OMAs. The model incorporates four clinical variables (previous live birth history, cyst diameter, COH protocol, and number of oocytes retrieved) and demonstrates good discrimination (AUC 0.849-0.853), acceptable calibration, and favorable clinical utility. Notably, a history of previous live birth was identified as a negative predictor of CLBR, providing new insights into reproductive outcomes in this population. This nomogram acts as a useful resource for personalized fertility advice and treatment planning, facilitating collaborative decision-making in clinical settings.

Materials|Methods

This retrospective cohort study was conducted at the Reproductive Medicine Center of Shenzhen Hengsheng Hospital in accordance with the established guidelines for observational research in the field of reproductive medicine. The study population consisted of women diagnosed with OMAs who underwent ultrasound-guided EST before their IVF/ICSI cycle between January 2020 and December 2024. The study protocol was approved by the hospital’s Institutional Review Board (HSYY2022-12-22) and adhered to the principles outlined in the Declaration of Helsinki for medical research involving human subjects. Owing to the retrospective design of the study, the requirement for informed consent was waived, and appropriate data anonymization procedures were implemented. The reporting of this manuscript adheres to the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) statement for prediction model development and validation ( 18 ). The eligibility for inclusion in the study was determined based on the following criteria: (1) female participants aged between 20 and 45 years at the time of ultrasound-guided EST; (2) a diagnosis of OMAs confirmed via transvaginal ultrasound, characterized by unilocular or multilocular cysts with homogeneous, low-level “ground-glass” echogenicity, with at least one cyst measuring 4 cm or larger in diameter; (3) receipt of EST as the primary treatment for OMAs, followed by the initiation of at least one complete IVF/ICSI cycle within 6 months; and (4) availability of comprehensive clinical data, including complete baseline characteristics, controlled ovarian hyperstimulation (COH) parameters, embryological outcomes, and follow-up records until the determination of pregnancy outcomes or cycle cancellation. Patients were excluded from the study if they met any of the following criteria: (1) absence of more than 20% of key predictor variables; (2) history of ovarian surgery for OMAs or adnexal pathologies that compromise ovarian reserve; (3) severe male factor infertility; (4) known chromosomal abnormalities or genetic disorders in either partner; (5) concurrent malignancy or severe systemic disease; and (6) inability to follow up before the determination of pregnancy outcomes. The primary outcome was the CLBR, defined as the delivery of a live-born infant with a gestational age of at least 28 weeks resulting from all fresh and frozen-thawed embryo transfers following the first completed oocyte retrieval cycle after EST ( 19 ). The CLBR was determined for each patient, considering embryo transfers until a live birth was achieved or all embryos from the initial cycle were utilized. The follow-up period extended from oocyte retrieval to either live birth or final embryo transfer, with a minimum duration of 12 months, to adequately capture reproductive outcomes. All EST procedures were conducted by reproductive endocrinologists with over five years of experience, adhering to a standardized institutional protocol informed by established international guidelines ( 20 ). A 17-gauge needle was used to puncture the endometrioma under transvaginal ultrasound guidance. Complete aspiration of the cyst contents was performed, and the aspirated volume and fluid characteristics were documented. Absolute ethanol (95%) was introduced into the cyst cavity, comprising 50–80% of the aspirated volume, with a maximum volume of 8 mL for 10–15 min. During this period, gentle agitation was employed to ensure optimal contact between the ethanol and the cyst wall. Subsequently, ethanol was aspirated, and the cavity was irrigated with saline to eliminate any residual sclerosants from the cavity. Patients were administered antibiotics and observed for 1–2 h before discharge. A follow-up transvaginal ultrasound was conducted at 4–6 weeks to evaluate cyst reduction and treatment efficacy before proceeding with IVF/ICSI cycles. COH protocols were selected according to current evidence-based guidelines and individualized based on patient characteristics ( 21 , 22 ). Available protocols include ultra-long, long, antagonist, mild stimulation, progestin-primed ovarian stimulation (PPOS), and natural cycle approaches to ovarian stimulation. Protocol selection followed standard clinical practice, considering the patient’s age, ovarian reserve markers, and previous treatment responses ( 23 ). The starting dose of gonadotropins (recombinant Follicle-Stimulating Hormone (FSH) or human menopausal gonadotropin) was 150–300 IU daily. Dose adjustments were based on follicular development, which was monitored through transvaginal ultrasound and hormone tests. Final oocyte maturation was induced with human Chorionic Gonadotropin (hCG) or Gonadotropin-Releasing Hormone (GnRH) agonist when two dominant follicles reached a diameter of ≥ 18 mm. Transvaginal oocyte retrieval was performed 34–36 h after the trigger under ultrasound guidance. Fertilization was performed via conventional IVF or ICSI, based on semen parameters and history. Embryos were cultured to the cleavage (day 3) or blastocyst stage (day 5/6) and assessed for quality. Fresh embryo transfer was conducted, or viable embryos were cryopreserved for frozen embryo transfer (FET) cycles. Luteal support was provided using progesterone, according to the institutional protocols. We collected 71 candidate predictor variables from the electronic medical record system (EMRS). These variables were categorized into eight groups: (1) demographic characteristics (e.g., female age and body mass index (BMI)); (2) reproductive history (e.g., live birth history, gravidity, and parity); (3) basal hormonal profiles (e.g., FSH, luteinizing hormone (LH), estradiol (E 2 ), and testosterone (T)); (4) ovarian reserve parameters (e.g., AMH and AFC); (5) endometrioma characteristics (e.g., cyst diameter and laterality, where the diameter of the endometrioma is documented as the largest measurement taken through transvaginal ultrasound before EST); (6) treatment protocols (e.g., COH protocol); (7) laboratory outcomes (e.g., number of oocytes retrieved, number of metaphaseII(MII)oocyte, and fertilization rate); and (8) embryological outcomes (e.g., number of usable embryos and number of blastocysts formed). All variables were measured before or during the IVF/ICSI treatment cycle. The linearity assumption for continuous variables was assessed using restricted cubic splines with three knots. No significant nonlinear relationships were detected; thus, continuous variables were modeled as linear terms. Patients with incomplete data on the primary outcomes or key predictors were excluded from the final analysis. For the remaining candidate predictors, the missing data were minimal (<5%). A complete case analysis was performed for the model development. Missing data were assessed and found to be minimal (0.05). Therefore, a complete case analysis was deemed appropriate. Participants were randomly assigned to training (70%) and validation (30%) cohorts using computer-generated random sequences, with stratification by outcome to ensure balanced representation of live birth events across both datasets. A random seed was established to ensure reproducibility, and baseline characteristics were compared between cohorts to verify successful randomization. A systematic three-step variable selection methodology was employed to construct a parsimonious yet robust predictive model. Step 1: Univariate analysis. All candidate predictors were subjected to univariate logistic regression analyses within the training cohort. Variables exhibiting associations with the primary outcome (P < 0.10) were retained for subsequent analysis. This liberal threshold was selected to prevent the premature exclusion of potentially significant predictors while maintaining statistical efficiency. Step 2: The variables identified in Step 1 were subjected to Least Absolute Shrinkage and Selection Operator (LASSO) regression analysis, utilizing 10-fold cross-validation repeated 100 times to determine the optimal penalty parameter (λ). We adopted the one-standard-error rule (λ.1se) to select the optimal penalty parameter, which favors a more parsimonious model by choosing the largest λ value within one standard error of the minimum cross-validated error. The rationale for prioritizing a parsimonious model in this clinical context is threefold. First, models with fewer variables are more readily adopted in clinical settings, thereby enhancing their practical utility. Second, simpler models reduce overfitting and improve generalizability when applied to new populations. Third, focusing on key predictors reduces the data collection burden and costs, making it more efficient for clinical decision-making. This approach balances predictive accuracy and clinical applicability ( 24 ). LASSO regression inherently addresses multicollinearity and facilitates feature selection by reducing the coefficients of less significant variables toward zero. Step 3: Multivariate logistic regression with stepwise selection. Variables with non-zero LASSO coefficients were included in a multivariable logistic regression analysis, utilizing backward stepwise elimination based on statistical significance (P < 0.05) to formulate the final model. The model’s assumptions were thoroughly checked, including the linearity of continuous variables (assessed through restricted cubic splines), absence of multicollinearity (variance inflation factor<5), and sufficiency of the sample size. A nomogram was constructed using the final multivariable logistic regression model with the rms package in R, serving as an intuitive graphical tool for personalized CLBR prediction. It integrates all significant predictors with appropriate scaling and point allocation based on regression coefficients, allowing clinicians to compute individualized probability estimates for patient counseling and treatment planning purposes. The model’s ability to distinguish between outcomes was measured using the Area Under the Receiver Operating Characteristic (ROC) curve (AUC), with 95% confidence intervals (CIs). Performance was rated as follows: 0.5, no discrimination; 0.6–0.7, poor to fair; 0.7–0.8, moderate; 0.8–0.9, good; and 0.9 or higher, excellent. Bootstrap resampling validation was performed separately on the training and validation groups to evaluate the stability and reliability of the model performance estimates. This involved taking 500 random samples with replacement from each group, with each sample being the same size as the original group. After 500 iterations, the AUC distribution was obtained, and the 95% confidence interval (CI) was calculated using the DeLong method. The agreement between the predicted probabilities and observed frequencies of the CLBR was evaluated using calibration plots and the Hosmer-Lemeshow goodness-of-fit test. A well-calibrated model would have a calibration curve close to the 45-degree diagonal line and a P > 0.05 in the Hosmer-Lemeshow test. Decision curve analysis (DCA) was conducted to evaluate the net clinical benefit of the nomogram across a range of threshold probabilities, comparing the model’s performance against strategies for treating all patients or treating no patients. The analysis quantified the clinical value of the prediction model in supporting treatment decisions. Sample size calculation followed the 10 events per variable (EPV) guideline for the logistic regression models ( 25 ). With an expected live birth rate of 50% and 5–8 predictor variables, 50–80 events (100–160 patients) were needed. Accounting for data exclusion and 20% follow-up loss, the target was set at 125–200 patients. According to the EPV principle, our sample size of 194 patients, which included 97 live-birth events, yielded an EPV of 19.4. This was sufficient for developing a prediction model that incorporated the five variables. Continuous variables were described using the mean ± standard deviation (SD) for data with a normal distribution or median with interquartile range (IQR) for data not normally distributed, with normality checked via the Shapiro-Wilk test and visual inspection of histograms and Q-Q plots. Categorical variables were expressed as frequency and percentage. Comparisons between groups were conducted using Student’s t-test or the Mann-Whitney U test for continuous variables and the chi-square test or Fisher’s exact test for categorical variables, with effect sizes reported as Cohen’s d for continuous variables and Cramér’s V for categorical variables. We used the R software (version 4.2.2) for all statistical analyses. This included packages such as rms for regression modeling and creating nomograms, glmnet for LASSO regression, pROC for ROC analysis, and mice for multiple imputation. SPSS version 26.0 (IBM Corp., Armonk, NY, USA) was used for the supplementary analyses. All analyses employed a two-sided significance level of P < 0.05, and random seeds were used to ensure the reproducibility of the results.

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: pmc

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Condition tags

endometriosisendometrioma

MeSH descriptors

Birth Rate Birth Rate Birth Rate Birth Rate Birth Rate Birth Rate Birth Rate Birth Rate Birth Rate Birth Rate Birth Rate Birth Rate Birth Rate Birth Rate Birth Rate Endometriosis Endometriosis Endometriosis Endometriosis Endometriosis

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-06-12T06:13:51.797165+00:00
pmc
last seen: 2026-05-13T20:22:03.195721+00:00
pubmed
last seen: 2026-06-12T06:09:34.012427+00:00
unpaywall
last seen: 2026-05-11T08:34:28.763810+00:00
License: CC-BY-4.0 · commercial use OK · attribution required
Courtesy of the U.S. National Library of Medicine