Enhancing Genetic Association Power in Endometriosis through Unsupervised Clustering of Clinical Subtypes Identified from Electronic Health Records
preprint
OA: green
CC0
⤵ 2 in-corpus citations
Abstract
Abstract Background Endometriosis affects 10% of reproductive-age women, and yet, it goes undiagnosed for 3.6 years on average after symptoms onset. Despite large GWAS meta-analyses (N > 750,000), only a few dozen causal loci have been identified. We hypothesized that the challenges in identifying causal genes for endometriosis stem from heterogeneity across clinical and biological factors underlying endometriosis diagnosis. Methods We extracted known endometriosis risk factors, symptoms, and concomitant conditions from the Penn Medicine Biobank (PMBB) and performed unsupervised spectral clustering on 4,078 women with endometriosis. The 5 clusters were characterized by utilizing additional electronic health record (EHR) variables, such as endometriosis-related comorbidities and confirmed surgical phenotypes. From four EHR-linked genetic datasets, PMBB, eMERGE, AOU, and UKBB, we extracted lead variants and tag variants 39 known endometriosis loci for association testing. We meta-analyzed ancestry-stratified case/control tests for each locus and cluster in addition to a positive control (Total N endometriosis cases = 10,108). Results We have designated the five subtype clusters as pain comorbidities, uterine disorders, pregnancy complications, cardiometabolic comorbidities, and EHR-asymptomatic based on enriched features from each group. One locus, RNLS , surpassed the genome-wide significant threshold in the positive control. Thirteen more loci reached a Bonferroni threshold of 1.3 x 10 -3 (0.05 / 39) in the positive control. The cluster-stratified tests yielded more significant associations than the positive control for anywhere from 5 to 15 loci depending on the cluster. Bonferroni significant loci were identified for four out of five clusters, including WNT4 and GREB1 for the uterine disorders cluster, RNLS for the cardiometabolic cluster, FSHB for the pregnancy complications cluster, and SYNE1 and CDKN2B-AS1 for the EHR-asymptomatic cluster. This study enhances our understanding of the clinical presentation patterns of endometriosis subtypes, showcasing the innovative approach employed to investigate this complex disease.
My notes (saved in your browser only)
Condition tags
Citation neighborhood
Papers in the corpus that this work cites (lower rings, blue) and that cite this one (upper rings, green). Dot size scales with the paper's in-corpus citation count — bigger dot = more influential within the endo/adeno field. Click a dot to open that paper. [ expand to 2 hops ] — adds papers reached through this work's immediate citers/citees. Heavier; up to 60 extra dots.
References (46)
- Central changes associated with chronic pelvic pain and endometriosis via openalex
- Economic burden of endometriosis via openalex
- Endometriosis and pelvic pain: epidemiological evidence of the relationship and implications via openalex
- Epigenetic role of the nuclear factor NF-Y on ID gene family in endometrial tissues of women with endometriosis: a case control study via openalex
- Expression levels of MCP-1, HGF, and IGF-1 in endometriotic patients compared with non-endometriotic controls via openalex
- Factors Associated with Time to Endometriosis Diagnosis in the United States via openalex
- Heritability of endometriosis via openalex
- Leveraging electronic health record data for endometriosis research via openalex
- Pathogenesis and pathophysiology of endometriosis via openalex
- Real-World Evaluation of Direct and Indirect Economic Burden Among Endometriosis Patients in the United States via openalex
- REVIEW: Accuracy of laparoscopy in the diagnosis of endometriosis: a systematic quantitative review via openalex
- Short-acting and Long-acting Opioids Utilization among Women Diagnosed with Endometriosis in the United States: A Population-based Claims Study via openalex
- Strong Association Between Endometriosis and Symptomatic Leiomyomas via openalex
- Surgery for endometriosis: beyond medical therapies via openalex
- Surgical Treatment of Endometriosis via openalex
- The burden of endometriosis: costs and quality of life of women with endometriosis and treated in referral centres via openalex
- W2123971761 via openalex
- W2114410175 via openalex
- W2109887283 via openalex
- W2161160262 via openalex
- W2161633633 via openalex
- W2099085143 via openalex
- W2096791516 via openalex
- W2077955816 via openalex
- W2617005810 via openalex
- W2027867013 via openalex
- W2811066984 via openalex
- W2895486342 via openalex
- W2899020224 via openalex
- W2017733151 via openalex
- W2950099124 via openalex
- W1993137812 via openalex
- W3206719626 via openalex
- W4205170775 via openalex
- W4214754424 via openalex
- W1976383685 via openalex
- W4226083203 via openalex
- W4280645308 via openalex
- W4310375294 via openalex
- W4313453265 via openalex
- W4320857121 via openalex
- W4324045019 via openalex
- W1715364430 via openalex
- W4389148856 via openalex
- W4391925485 via openalex
- W4392083189 via openalex
Cited by (2)
Source provenance
- europepmc
- last seen: 2026-06-04T01:45:00.660873+00:00
- openalex
- last seen: 2026-06-04T00:00:01.174412+00:00
- pmc
- last seen: 2026-05-17T02:30:03.883495+00:00
- pubmed
- last seen: 2026-05-30T00:32:59.063209+00:00
License: CC0
· commercial use OK