Identifying key determinants of cumulative live birth in women with ovarian endometrioma undergoing ethanol sclerotherapy followed by in vitro fertilization or intracytoplasmic sperm injection: an interpretable machine learning analysis

In: Frontiers in Cell and Developmental Biology · 2026 · vol. 14 , pp. 1742816 · doi:10.3389/fcell.2026.1742816 · PMID:41970956 · W7140606668
article OA: gold CC0
AI-generated summary by claude@2026-06, 2026-06-09

This study developed an interpretable machine learning model, identifying antral follicle count, progesterone, downregulation, cyst diameter, and prior live birth as key predictors of cumulative live birth after ethanol sclerotherapy and IVF/ICSI for ovarian endometrioma.

One-sentence paraphrase of the abstract; not a substitute for reading it. No clinical advice. How this works

Abstract

Background: fertilization remains challenging. This study aimed to develop and validate a machine learning model for predicting the cumulative live birth rate in women with endometriomas who underwent alcohol sclerotherapy followed by assisted reproduction. Methods: fertilization or intracytoplasmic sperm injection cycles between January 2020 and December 2024 at our institution. Patients were allocated to the training (135 patients, 70%) and validation (59 patients, 30%) groups. Feature selection used univariate logistic regression (p < 0.10) to identify 19 predictors, which were refined using the Boruta, Recursive Feature Elimination, and maximum relevance minimum redundancy algorithms. Features identified by all methods were selected as the final predictors. Four machine learning algorithms (Decision Tree, Random Forest, Extreme Gradient Boosting, Support Vector Machine) were compared using discrimination, calibration, and utility metrics. SHapley Additive exPlanations analysis was used to interpret the model. Results: The cumulative live birth rate was 50.0% (97/194). Five predictors were identified: antral follicle count, progesterone level on gonadotropin starting day, downregulation, cyst diameter, and previous live birth history. The Extreme Gradient Boosting model showed optimal performance, with an AUC of 0.830 (95% confidence interval: 0.719-0.941), sensitivity of 0.783, specificity of 0.750, and Brier score of 0.176. SHapley analysis revealed​ that a higher antral follicle count and downregulation positively impacted birth prediction, whereas elevated progesterone levels and larger cyst diameters had negative effects. Conclusion: We developed an explainable Extreme Gradient Boosting model for predicting cumulative live birth rates in women with ovarian endometriomas after ethanol sclerotherapy and assisted reproductive technology. SHapley Additive exPlanations analysis identified key predictors and revealed their non-linear contributions to outcomes, providing transparent explanations for predictions. This interpretable machine learning approach offers a clinical decision-support tool for patient counseling and treatment optimization, advancing beyond traditional methods in capturing reproductive outcomes.

My notes (saved in your browser only)

Condition tags

endometrioma

Citation neighborhood

Papers in the corpus that this work cites (lower rings, blue) and that cite this one (upper rings, green). Dot size scales with the paper's in-corpus citation count — bigger dot = more influential within the endo/adeno field. Click a dot to open that paper. [ expand to 2 hops ] — adds papers reached through this work's immediate citers/citees. Heavier; up to 60 extra dots.

References (44)

Source provenance

openalex
last seen: 2026-06-10T17:14:06.276822+00:00
License: CC0 · commercial use OK