Exosome-Enriched Hub Gene Networks Identify Diagnostic Biomarkers and Repurposable Therapeutic Targets in Endometriosis

other OA: gold CC-BY-4.0
AI-generated deep summary by claude@2026-06, 2026-06-11 · read from full text

This study integrated transcriptomic data from three GEO microarray cohorts (GSE7305, GSE25628, GSE11691) to derive a consensus differentially expressed gene signature of endometriosis, using TAC reprocessing, PCA-based outlier removal, and intersection of DEGs with directionally consistent log2 fold-changes. Functional enrichment (DAVID, KEGG) of 262 shared genes highlighted immune/inflammatory pathways and processes related to cell adhesion, migration, and angiogenesis, with “extracellular exosome” emerging as a prominent cellular component term. The authors then prioritized exosome-associated hub candidates by mapping differentially expressed genes to extracellular vesicle resources (ExoCarta and Vesiclepedia) and planned protein–protein interaction network and LINCS L1000 connectivity analyses to nominate repurposable small molecules, with key caveats including reliance on microarray cohorts and variability in initial group separation that required reprocessing and outlier removal. This paper is centrally about endometriosis — it derives an exosome-enriched hub gene network and proposes LINCS L1000-based repurposing candidates using consensus endometriosis transcriptomic signatures.

Read from the paper's body, not the abstract. Not a substitute for reading the paper. No clinical advice. How this works

Abstract

Endometriosis is a heterogeneous chronic inflammatory disorder associated with substantial diagnostic delay and limited therapeutic options, highlighting the need of robust non-invasive biomarkers and actionable molecular targets to complement existing low-sensitivity tests. To identify conserved pathogenic mechanisms with translational potential, here, we uniformly reprocessed three independent the Gene Expression Omnibus (GEO) microarray cohorts (GSE7305, GSE25628, and GSE11691) and applied a strict, directionally consistent intersection strategy to identify conserved transcriptional signals. We identified 262 consensus differentially expressed genes enriched for immunity/inflammation, cell adhesion and migration, and angiogenesis, consistent with key biological hallmarks of lesion establishment and persistence. Protein-protein interaction topology prioritized 11 highly connected hub genes (VCAM1, CCL2, MCAM, CD14, CD24, FGFR1, SIRPA, CSF1R, S100A9, S100A8, and LY96) that likely act as an integrated immune-adhesion-angiogenesis axis. Notably, 63/262 (24%) of the consensus genes were annotated to the extracellular exosome compartment, supporting their translational relevance as liquid-biopsy candidates. Finally, connectivity mapping using the LINCS L1000 framework nominated small-molecule perturbagens predicted to reverse the endometriosis-associated signature, providing a rational starting point for drug-repurposing experiments. In conclusion, this study elucidates a conserved immune-adhesion-angiogenesis axis driven by an 11-gene hub network in endometriosis. These core regulators represent promising candidates for the development of non-invasive liquid biopsies and precision, non-hormonal therapeutics.
Full text 37,408 characters · extracted from pmc · 5 sections · click to expand

Section 4

Human endometriosis gene expression datasets were retrieved from the NCBI Gene Expression Omnibus (GEO). To ensure compatibility with our uniform reprocessing pipeline, we specifically selected studies utilizing Affymetrix platforms that provided raw probe-level data (CEL files). The inclusion criteria were defined as follows: (i) samples derived from Homo sapiens, (ii) availability of clear metadata defining case (endometriosis/ectopic) and control (normal/eutopic) groups, and (iii) absence of cohort duplication. Following metadata harmonization and quality screening, three independent datasets, GSE7305 , GSE25628 , and GSE11691 , were selected for downstream analysis. We processed all raw CEL files in Transcriptome Analysis Console (TAC; Affymetrix/Thermo Fisher, Waltham, MA, USA) following a uniform, pre-specified workflow. Briefly, arrays were imported and assigned to two biological groups EM (endometriosis) and Normal using the series/sample annotations curated from GEO. We performed routine quality control in TAC (array-level signal distributions, boxplots, principal component analysis, and probe-level diagnostics) to screen for technical outliers and artifacts. Specifically, we visualized the first three principal components (PC1, PC2, and PC3) to assess global data structure and sample clustering in three-dimensional space. Expression values were then computed using TAC’s implementation of the Robust Multi-array Average (RMA) procedure, background correction, quantile normalization, and log 2 summarization to the probeset level, followed by gene-level consolidation using the corresponding platform annotation files. For each dataset, TAC generated a matrix of gene-wise group summaries (mean expression in EM and Normal) and effect sizes (log 2 fold-change), along with accompanying p -values for the EM and Normal contrast. Specifically, genes were considered significantly differentially expressed if they met the threshold criteria of a nominal p -value < 0.05 and an absolute fold change ≥ 2, determined via Empirical Bayes ANOVA. The resulting genes by sample matrices and DEG tables were exported for downstream curation and pathway analyses. For orthogonal in silico validation, an independent microarray dataset ( GSE201912 ) was obtained from the GEO database. This dataset was processed and analyzed for differential expression using the identical RMA normalization and statistical thresholds ( p < 0.05, the absolute fold change ≥ 2) as applied to the primary discovery cohorts. After generating gene-level summaries of mean expression in the endometriosis group, denoted as EM, and in the Normal group, together with the corresponding fold changes, we harmonized probe annotations and resolved cases in which a single gene was represented by multiple probes. When a gene symbol appeared more than once, we retained the probeset with the smallest p -value and removed the remaining entries to avoid inflating downstream counts. Any probeset that lacked a valid gene symbol was excluded to prevent ambiguity in gene-level analyses. We then performed an intersection analysis across the three datasets using Microsoft Access to identify differentially expressed genes shared by all cohorts. The intersecting gene list was submitted to DAVID for GO enrichment across the categories BP, CC, and MF, as well as pathway analysis using the KEGG. For each GO term, we retained the term name, the number of contributing genes, the gene ratio, and the associated p -value, and we removed entries with p -values greater than 0.05. The remaining terms were sorted in descending order by gene count, and the top twenty were visualized as an enrichment plot generated with the ggplot2 package in R version 4.3.1 [ 84 ]. Building on the statistically significant results from the standardized GO BP database ( p < 0.05; Figure 5 A), we applied a manual curation step as a secondary biological filter to consolidate highly redundant functional annotations and prioritize processes directly driving the clinical pathophysiology of endometriosis. Specifically, rather than selecting broad, generic cellular functions (e.g., signal transduction) or multiple overlapping parent–child terms, we explicitly focused on highly enriched, distinct representative modules corresponding to the established hallmarks of ectopic lesion development and survival: innate immune response, adaptive immune response, inflammatory response, angiogenesis, cell migration, and cell adhesion. For each selected term, we extracted the corresponding differentially expressed genes using official HGNC gene symbols. We then submitted each term-specific gene list to STRING version 12.0 with the organism set to Homo sapiens to assemble PPI networks, drawing on interaction evidence from curated databases, experimental studies, and coexpression. To limit spurious edges, we required a medium to high confidence threshold, defined as a combined score of at least 0.40. The interaction tables were downloaded in tab-separated (TSV) format and imported into Cytoscape version 3.10.4. Within Cytoscape, we removed isolated nodes, defined as nodes with degree equal to zero, and, where applicable, restricted subsequent analysis to the largest connected component to ensure stable centrality estimates. Network topology was quantified with cytoHubba version 0.1, using Degree as the primary ranking metric, where Degree is the number of first-neighbor connections per node. Specifically, the top 7 to 10 nodes with the highest degree scores within each BP-specific subnetwork were extracted. Finally, a manual curation step was applied to identify intersecting hubs; only those top-ranked genes that concurrently participated in two or more of the selected BP terms were designated as the final core hub genes for downstream analysis. Candidate small-molecule compounds were predicted using the L1000 platform from the Library of Integrated Network-Based Cellular Signatures (LINCS) [ 28 ]. Genes that were commonly deregulated across the three datasets were first identified and then split into upregulated and downregulated gene sets, which were submitted separately to L1000 in “reverse” mode to search for compounds that could oppositely regulate the input signature. For each query, L1000 returned the top 50 ranked small molecules. These candidates were subsequently cross-checked in PubChem to obtain their validated chemical names, and, based on these annotations, each compound was further classified according to its predominant inhibitor type.

Intro

Endometriosis (EM) is a chronic gynecological condition characterized by the presence of endometrium-like tissue outside the uterine cavity [ 1 ]. It is estimated to affect 11% of reproductive-aged women worldwide, approximately 200 million individuals, and is identified in up to 50% of patients with chronic pelvic pain or infertility [ 2 , 3 , 4 , 5 , 6 ]. Despite its prevalence, the pathogenesis of endometriosis remains incompletely understood. Several theories have been proposed, among which three are most widely cited: the retrograde menstruation hypothesis, the coelomic metaplasia hypothesis, and the Müllerian remnants hypothesis [ 2 , 4 , 7 , 8 , 9 ]. Sampson’s retrograde menstruation model, the most broadly accepted, posits that endometrial fragments shed during menses reflux through the fallopian tubes into the peritoneal cavity, where they implant and proliferate to form ectopic lesions [ 1 ]. Nonetheless, this mechanism alone does not fully account for extra-pelvic or rare gonadal-independent presentations, prompting alternative explanations. The coelomic metaplasia hypothesis suggests that peritoneal mesothelial cells can transdifferentiate into endometrium-like tissue, whereas the Müllerian remnants hypothesis proposes that displaced embryonic Müllerian-derived cells persist and, under specific stimuli such as estrogen exposure or diethylstilbestrol, give rise to ectopic lesions [ 10 ]. Current diagnosis typically integrates clinical presentation with imaging and serum biomarkers. Transvaginal ultrasonography and MRI aid in detecting ovarian endometriomas. Although CA-125 is the most commonly used biomarker, its sensitivity and specificity are poor, particularly in early-stage disease. Other markers, such as CA 19-9 or HE4, are primarily used to assist in excluding malignancy rather than for direct diagnosis. Consequently, currently available blood markers lack sufficient accuracy for stand-alone diagnosis [ 5 , 7 , 11 ]. Hallmark symptoms include chronic pelvic pain, dysmenorrhea, dyspareunia, bowel dysfunction, and infertility [ 5 , 7 , 11 ]. Some patients also report gastrointestinal complaints (e.g., nausea, vomiting, increased bowel frequency with pelvic pain) and lower urinary tract symptoms, particularly cyclical frequency and urgency [ 12 , 13 ]. Pelvic examination can help localize tenderness, yet definitive diagnosis often relies on invasive methods, underscoring the urgent clinical need to develop novel non-invasive markers with high diagnostic potential. Disease severity is commonly graded using the revised American Society for Reproductive Medicine (rASRM) system, which stratifies endometriosis into four stages. In Stage I, lesions are few and superficial without notable adhesions; Stage II involves a greater number of predominantly superficial implants with minimal deep disease and no significant adhesions; Stage III features increased lesion burden with the onset of deep infiltration, small ovarian endometriomas (4 cm), and dense adhesions that may distort pelvic anatomy and tether the ovaries, fallopian tubes, uterus, and bowel (rASRM). Patients with Stage I–II disease may experience limited disruption of daily activities, whereas Stage III–IV disease is frequently associated with severe pain and impaired fertility, leading to a substantial reduction in quality of life. By anatomic distribution, EM encompasses several phenotypes, including superficial peritoneal lesions, ovarian endometrioma, deep infiltrating endometriosis (DIE), and uterine adenomyosis/adenomyoma. These phenotypes can coexist within the same patient and may reflect partially distinct biological programs [ 14 , 15 , 16 , 17 , 18 ]. Beyond benign morbidity, EM is associated with an increased risk of certain malignancies, most notably ovarian cancer, including endometrioid and clear cell subtypes [ 19 , 20 ]. Although epidemiologic associations are well described, the molecular determinants of malignant transformation remain incompletely defined, reinforcing the value of mechanistic studies that can inform prevention and targeted intervention. The management of EM is tailored to the severity of the disease and the localization of lesions, utilizing pharmacological, surgical, or combined modalities. The primary goals of medical therapy are to alleviate symptoms and prevent postoperative recurrence [ 21 ]. Currently, combined oral contraceptives (COCs) and high-dose progestin regimens are widely employed to suppress ovulation and induce decidualization, thereby reducing lesion size. These therapies are generally well tolerated and cost-effective, providing pain relief for the majority of patients. However, a subset of women may not respond to treatment or may experience adverse effects such as breakthrough bleeding, breast tenderness, nausea, and headaches [ 22 , 23 ]. For patients who are unresponsive to progestin-based therapies, low estrogen regimens serve as an effective second-line alternative. Specifically, gonadotropin-releasing hormone (GnRH) agonists function via a negative feedback loop on the pituitary gland; this inhibits gonadotropin secretion and subsequently downregulates ovarian steroidogenesis. A significant limitation of GnRH agonists, however, is their instability in the digestive tract, which precludes oral administration. Consequently, these agents must be delivered via parenteral, subcutaneous, intramuscular, intranasal, or vaginal routes [ 24 ]. Given that current clinical interventions are not universally effective and are often accompanied by adverse effects, there is an imperative need to develop novel therapeutic agents. Bioinformatics provides the essential computational framework to interpret the massive datasets generated by modern high-throughput technologies, such as next-generation sequencing (NGS) and microarrays [ 25 ]. To decipher biological context, the Database for Annotation, Visualization and Integrated Discovery (DAVID; https://davidbioinformatics.nih.gov/ , accessed on 8 August 2025) platform facilitates Gene Ontology (GO) and pathway enrichment analysis, linking gene expression to specific biological processes (BP), cellular components (CC), and molecular functions (MF). Following this, protein–protein interaction networks can be constructed via STRING and visualized in Cytoscape, where the cytoHubba plugin allows for the identification of core hub genes based on topological ranking [ 26 ]. These analyses are further complemented by databases like Wiki Pathways and the Kyoto Encyclopedia of Genes and Genomes (KEGG) to contextualize signaling alterations. Notably, DAVID’s compatibility with diverse gene identifiers makes it a robust tool for integrating heterogeneous data sources [ 27 ]. Beyond elucidating molecular mechanisms, transcriptomic signatures serve as a powerful template for therapeutic discovery. To bridge the gap between gene expression data and clinical translation, the Library of Integrated Network-based Cellular Signatures (LINCS) L1000 dataset provides a comprehensive resource for connectivity mapping. By querying the differentially expressed genes (DEGs) against the L1000 database, it is possible to identify small molecules that induce a gene expression profile inverse to that of the disease state. This “signature reversion” strategy allows for the high-throughput screening of potential therapeutic agents and accelerates the identification of candidates for drug repurposing [ 28 ]. In this study, we integrated transcriptomic data from three independent GEO cohorts to derive a robust consensus signature of EM. We then used functional enrichment and network topology to prioritize hub genes with plausible mechanistic roles and translational accessibility, including extracellular vesicle/exosome-associated targets. Finally, we applied the LINCS L1000 connectivity framework to nominate small molecules predicted to reverse the EM signature, providing experimentally testable repurposing hypotheses.

Results

Raw Affymetrix microarray CEL files were downloaded from GEO and processed in TAC using a unified workflow. TAC performed automated background correction and quantile normalization, after which we generated PCA plots to assess separation between the EM and Normal groups. Across the three datasets ( GSE7305 , GSE25628 , GSE11691 ), only GSE7305 displayed clear group separation in the initial PCA ( Figure 1 A). In contrast, GSE25628 and GSE11691 showed limited separation with partial mixing of EM and Normal samples ( Figure 1 B,C). To reduce the influence of outlier samples on normalization and downstream differential expression, we examined sample coordinates along PC1 and removed outlier samples. Because GSE25628 and GSE11691 use paired designs, outliers were removed symmetrically within each pair to preserve pairing. After reprocessing in TAC, PCA plots showed improved clustering in both datasets, with EM and Normal forming two distinct groups ( Figure 2 A,B). Following differential expression analysis, the unfiltered numbers of DEGs were: GSE7305 , 3966 (2238 upregulated; 1728 downregulated); GSE25628 , 1712 (829 up; 883 down); and GSE11691 , 859 (581 up; 278 down). We then curated these lists by (i) excluding entries that had a probeset ID but no mapped gene symbol as such records cannot be incorporated into GO analyses, and (ii) collapsing duplicate gene symbols by retaining, for each gene, the entry with the lowest p -value. After curation, the final DEG counts were: GSE7305 , 2539 (1362 up; 1177 down; Figure 3 A); GSE25628 , 1417 (675 up; 742 down; Figure 3 B); and GSE11691 , 693 (465 up; 228 down; Figure 3 C). The DEG lists for all three datasets will be provided in Supplementary File S1 . To identify a robust molecular signature, we performed an intersection analysis across GSE7305 , GSE25628 , and GSE11691 , requiring directionally consistent log 2 fold-changes. The overlap structure, including pairwise comparisons and the three-way intersection, is visualized in Figure 4 . This consensus approach yielded 262 shared differentially expressed genes (200 upregulated and 62 downregulated; Supplementary File S2 ). Functional enrichment analysis via DAVID revealed that these genes are highly concentrated in biological processes critical to endometriosis pathology. Specifically, GO BP terms were enriched for innate and adaptive immune responses, inflammation, cell adhesion, cell migration, and angiogenesis ( Figure 5 A). In the Cellular Component category, “extracellular exosome” emerged as a prominent term, consistent with vesicle-mediated communication in lesion biology ( Figure 5 B). KEGG pathway analysis further corroborated these findings, highlighting immune regulation and matrix dynamics pathways such as PI3K/Akt, MAPK, Complement cascades, and Cell Adhesion Molecules ( Figure 5 D). The top 20 significant terms were visualized as bubble plots ( Figure 5 ), with full results detailed in Supplementary File S3 . Collectively, these functional profiles demonstrate that the 262 consensus genes successfully capture the core pathological landscape of endometriosis spanning immunity, inflammation, and adhesion. This biological coherence provides a robust rationale for using these specific gene sets to construct protein–protein interaction networks and prioritize key regulatory hub genes in the subsequent analysis. Guided by the enrichment results, we focused on the CC term extracellular exosome, which encompassed 63 differentially expressed genes, 24% of the three-way overlap. We cross-referenced these 63 genes against two curated exosome resources, ExoCarta and Vesiclepedia. Of these, 12 genes were listed in both databases, 60 appeared only in ExoCarta, and 3 were absent from both resources. Gene-level annotations and database matches are summarized in Table 1 . The high concordance with established exosome databases reinforces the secretory nature of these targets, suggesting that they are likely released into the circulation and could serve as accessible candidates for non-invasive liquid biopsy. Guided by the GO BP analysis, we selected highly enriched BP terms corresponding to the core pathophysiological mechanisms that drive the clinical manifestations of endometriosis. Specifically, we focused on innate immune response, adaptive immune response, inflammatory response, angiogenesis, cell migration, and cell adhesion, and retrieved the genes corresponding to each term. For each term-specific gene set, we constructed a PPI network using the STRING database with the organism restricted to Homo sapiens, and imported the resulting interaction tables into Cytoscape. Network prioritization was then performed with the cytoHubba plugin using degree centrality (defined as the number of first-neighbor connections for each node) to identify highly connected candidate hub genes. For each term, nodes were ranked by degree, and the top 7–10 genes were retained; the corresponding core subnetworks were visualized to highlight putative control points within each process. This workflow yielded hub networks for innate immune response (yellow), adaptive immune response (blue), inflammatory response (pink), angiogenesis (orange), cell migration (green), and cell adhesion (grey). Several key hub genes (red) participated in two or more BP terms ( Figure 6 ), suggesting that these genes may represent critical regulators underlying BP terms associated with common clinical manifestations of endometriosis. Notably, VCAM1, CCL2, MCAM, CD14, CD24, FGFR1, SIRPA, CSF1R, S100A9, S100A8, and LY96 repeatedly appeared in at least two functional terms. Among these genes, we found that the functional annotations related to common clinical manifestations of endometriosis are tightly interconnected and mutually reinforcing. In addition, VCAM1 is annotated as a component of extracellular vesicles, further enhancing its potential importance in EM. Guided by the LINCS L1000 framework, we adopted a reverse-signature strategy to nominate compounds predicted to counteract disease-relevant transcriptional changes, i.e., suppress genes commonly up-regulated across the three datasets and/or restore genes commonly down-regulated, thereby potentially exerting therapeutic effects. For each query, L1000 returns a ranked list of candidate perturbagens. Compounds whose mechanisms of action could not be corroborated from the published literature were excluded from further consideration. After curation, the retained compounds clustered into several mechanistic classes. A subset was predicted to modulate immune-related signaling, including WHI-P97 (JAK3 inhibitor), Torin-1 (mTOR inhibitor), and ellipticine (reported as a TLR3 inhibitor). Others primarily targeted proliferation and growth pathways, such as PIK-75 (PI3Kα inhibitor), radicicol (HSP90 inhibitor), and PHA-767491 (CDC7 inhibitor). We also identified BRD-K05151076 as an estrogen-receptor inhibitor. Protein-synthesis inhibitors, including thiostrepton, brefeldin A, and puromycin, were likewise recovered. For each small molecule, we further summarized its perturbed genes and whether those genes participate in endometriosis-related Gene Ontology biological processes (GO BP), including cell adhesion, cell migration, angiogenesis, inflammatory response, innate immune response, and adaptive immune response (details in Table 2 ). Notably, we also recovered cerulenin (a fatty-acid synthase inhibitor) and LRRK2-IN-1 (an LRRK2 kinase inhibitor currently investigated in the context of Alzheimer’s disease); both may have additional therapeutic potential for endometriosis. Finally, several predicted small molecules are already approved for clinical use in other indications; their names, approved indications, and regions of approval are summarized in Table 3 . As a next step, we plan to evaluate these candidates in cellular and animal models to assess their efficacy for drug repurposing in endometriosis. To orthogonally validate our 11-gene hub network, we analyzed an additional, independent endometriosis microarray dataset ( GSE201912 ). To capture the most distinct pathological signature, we specifically contrasted normal endometrium (controls) against ovarian endometrioma (ectopic lesions). The complete list of differentially expressed genes for this validation cohort is provided in Supplementary File S5 . The differential expression status of the 11 hub genes in this validation cohort is summarized in Table 4 . Remarkably, 7 out of the 11 hub genes ( CCL2 , CD14 , CD24 , VCAM1 , CSF1R , SIRPA , and LY96 ) were significantly differentially expressed in this independent test set ( p < 0.05, the absolute fold change ≥ 2). Importantly, the directionality of expression for all 7 replicated genes was perfectly consistent with our primary multi-cohort consensus signature, reinforcing their robustness as highly conserved pathogenic markers in endometriosis.

Discussion

In this multi-cohort re-analysis of three independent GEO series, we identified a robust intersection of differentially expressed genes shared across datasets. These genes were significantly enriched in biological processes highly relevant to endometriosis, including innate and adaptive immune responses, inflammation, extracellular matrix adhesion, directed cell migration, and angiogenesis. Network analysis further converged on a set of highly connected nodes, prioritizing VCAM1 , CCL2 , MCAM , CD14 , CD24 , FGFR1 , SIRPA , CSF1R , S100A9 , S100A8 , and LY96 as key hubs. Rather than acting in isolation, these genes likely constitute a coherent pathogenic network that sustains the chronic inflammatory and angiogenic microenvironment of ectopic lesions. By bridging immune activation and tissue remodeling, these central mediators contribute to the persistence of clinical symptoms and represent promising candidate targets for future therapeutic intervention. In our independent in silico validation using dataset GSE201912 , 7 out of the 11 hub genes were successfully replicated with consistent directionality when specifically comparing normal endometrium to ovarian endometrioma. This high replication rate reinforces their roles as a robust, highly conserved core signature. Although the remaining 4 hub genes (FGFR1, MCAM, S100A8, and S100A9) were detected but did not strictly meet the stringent differential expression thresholds in this specific validation cohort, likely reflecting platform-specific probe variances or patient-level heterogeneity, substantial experimental evidence firmly supports their pathological relevance. For instance, FGFR1 is significantly overexpressed in ectopic endometrium, where its signaling was correlated with pain symptoms [ 54 ]. Similarly, MCAM is a critical cell adhesion molecule that identifies highly angiogenic endometrial mesenchymal stem-like populations [ 55 ]. Furthermore, S100A8 and S100A9 function as potent pro-inflammatory damage-associated molecular patterns (DAMPs) secreted by macrophages to facilitate endometriotic vascularization and fibrogenesis [ 56 , 57 ]. Thus, these 11 genes, collectively, remain biologically critical nodes within the endometriosis immune-adhesion-angiogenesis axis. The enrichment of immune and inflammatory pathways aligns with the well-recognized inflammatory milieu of EM, where dysregulated leukocyte recruitment and cytokine signaling can facilitate lesion survival [ 58 , 59 , 60 ]. Adhesion and migration terms align with the capacity of shed endometrial fragments to attach to the peritoneum and invade [ 60 , 61 ], while angiogenesis supports lesion survival in ectopic niches [ 62 ]. Positioning the identified 11 hub genes spanning chemokines (e.g., CCL2 ), adhesion molecules (e.g., VCAM1 , MCAM ), and immune regulators (e.g., CD14 , CSF1R , S100A8 / S100A9 ) at the convergence of these processes suggests they function as a coordinated pathogenic network. Rather than acting independently, these hubs likely orchestrate the complex interplay between leukocyte-stromal interactions and vesicle-mediated communication that drives lesion microenvironment dysfunction. The repeated appearance of VCAM1 , CCL2 , MCAM , and innate immune regulators (e.g., CD14 , CSF1R , S100A8 / S100A9 ) across multiple GO terms supports their role as convergent regulators bridging immune-stromal crosstalk and tissue remodeling. Our network results generate actionable hypotheses. If this hub network is central to disease pathology, we expect (i) elevated protein levels of these key targets in eutopic/ectopic endometrium and patient plasma (potentially within exosomes) relative to controls; (ii) perturbing their specific signaling axes such as blocking VCAM1 mediated adhesion or CCL2 -driven chemotaxis in endometrial stromal/epithelial models will attenuate adhesion/migration readouts; and (iii) the aggregate expression profile of this multigene signature may correlate with pain burden, rASRM stage, or implantation outcomes. While these mechanistic predictions can be evaluated using multiplex IHC/IF, ELISA, and in vitro functional assays, we further prioritize the translational validation of our drug repurposing findings. Specifically, we plan to evaluate the in vivo therapeutic efficacy of the six clinically established compounds listed in Table 3 , including HDAC inhibitors (e.g., Vorinostat, Panobinostat) and immunomodulators, using murine models of endometriosis. By assessing their ability to suppress lesion growth and alleviate pain behaviors in a physiological context, we aim to confirm whether these repurposable agents can offer a rapid and effective alternative to current hormonal therapies. While previous transcriptomic analyses have explored the pathogenesis of endometriosis, the present study introduces several innovative aspects that distinguish it from standard descriptive bioinformatics. First, we employed a strict, directionally consistent intersection strategy across three independent cohorts. Rather than merely pooling datasets, this stringent filtering isolates a highly conserved molecular signature, minimizing cohort-specific or platform-specific artifacts. Second, we uniquely highlighted the translational potential of the exosomal compartment. By revealing that nearly a quarter (24%) of the consensus genes are enriched in extracellular exosomes, our study actively bridges basic transcriptomic discovery with the urgent clinical need for non-invasive liquid biopsies. Finally, we advanced the analysis beyond biomarker identification by integrating the LINCS L1000 connectivity mapping framework. This signature-reversion approach pivots the research toward actionable precision medicine, successfully nominating both FDA-approved agents and novel small molecules as rational candidates for non-hormonal drug repurposing. Together, these integrated multi-omics strategies provide a comprehensive and actionable roadmap from pathogenic network discovery to clinical therapeutic application. EM has been clinically linked to several conditions, including infertility [ 63 ], endometrioid endometrial carcinoma [ 64 ], and ovarian clear cell carcinoma [ 65 , 66 ]. With respect to infertility, the association between EM and impaired fecundity is well recognized, yet the underlying mechanisms remain incompletely defined. EM can influence reproduction through multiple, often intersecting pathways: chronic pelvic pain, persistent inflammation, pelvic adhesions, diminished ovarian reserve and function, and reduced endometrial receptivity collectively shape the phenotype of EM-related infertility [ 63 ]. Recent reports also highlight the role of uterine natural killer (uNK) cells in implantation [ 67 ], and several studies have documented a significant reduction in uNK cells in samples from patients with EM [ 68 ]. Together, these observations suggest that EM-associated decreases in uNK abundance may contribute to implantation failure and thus represent one plausible, indirect mechanism linking EM to infertility. Current management of EM centers on surgical and medical approaches. Surgically, focal to moderate lesions are typically treated by excision or carbon dioxide laser vaporization [ 69 ], followed by strategies to prevent postoperative adhesions and their sequelae [ 70 , 71 , 72 ]. Medically, because EM growth is estrogen-dependent, therapies that counteract endometrial proliferation are prioritized: progestins and oestro-progestins are widely used to alleviate EM-associated pain by suppressing endometrial growth and inducing secretory transformation [ 69 ], although a subset of patients exhibits progesterone resistance [ 4 ]. Among the predicted compounds, we also identified ER inhibitors. As endometrial proliferation is driven by estrogen stimulation, as discussed above, combining ER inhibitors with current treatment strategies may further enhance therapeutic efficacy. In addition, we plan to experimentally validate the effects of Torin-1, Amuvatinib, LRRK2-IN-1, and other predicted candidates in cellular models of endometriosis to determine whether these agents exert anti-endometriotic activity and could be repurposed as potential adjuvant therapies for EM. Notably, LRRK2-IN-1 has garnered attention beyond its original scope; studies have demonstrated its potent anti-cancer activity against colorectal and pancreatic cancer via inhibition of doublecortin-like kinase 1 [ 73 ], as well as its involvement in modulating PINK1/Parkin-dependent mitophagy in the context of Parkinson’s disease [ 74 ]. The demonstrated bioactivity of LRRK2-IN-1 across these diverse pathologies suggests it may represent another promising candidate for drug repurposing in endometriosis, warranting further investigation into its specific mechanisms within the endometrial microenvironment. Over the past decade, an expanding literature has also explored adjunctive or alternative modalities, including acupuncture [ 75 ], herbal medicine [ 76 ], dietary modification [ 77 ], physiotherapy, and anti-inflammatory/antioxidant interventions [ 78 ] reflecting interest in multimodal, individualized care alongside standard treatments. Looking forward, we envision the findings of this study translating into clinical practice through two primary translational avenues: non-invasive diagnostics and precision therapeutics. Diagnostically, a major clinical hurdle in endometriosis is the substantial diagnostic delay, which often necessitates invasive laparoscopy. Given that 24% of our consensus signature is significantly enriched in the extracellular exosome compartment, these targets hold substantial promise for the development of blood-based liquid biopsies. In future clinical practice, integrating these exosomal biomarker panels with machine-learning algorithms could allow clinicians to objectively risk-stratify symptomatic patients via a simple blood draw, prioritizing high-risk individuals for early imaging or surgical intervention. Therapeutically, current medical management heavily relies on hormonal suppression, which is unsuitable for patients desiring immediate pregnancy and is often limited by progesterone resistance or adverse effects. Our connectivity mapping rationally nominated several clinically approved compounds, such as HDAC inhibitors and immunomodulators, predicted to reverse the core pathogenic signature. Clinically, we envision these candidates serving as a data-driven foundation for phase II drug-repurposing trials. Ultimately, repurposing these approved agents could provide rapidly deployable, non-hormonal adjuvant therapies for patients who fail standard first-line treatments. Beyond utilizing signature-reversing compounds identified via LINCS L1000, we also foresee the direct pharmacological targeting of our identified hub genes as a highly viable therapeutic strategy, both currently and in the future. Because our 11-gene hub network constitutes the core of the immune-adhesion-angiogenesis axis, specifically neutralizing these nodes could disrupt lesion survival. For instance, CCL2 is a potent chemokine driving macrophage recruitment. Consistent with similar transcriptomic studies that highlighted the macrophage–chemokine axis in endometriosis, experimental murine models have demonstrated that pharmacological blockade of the CCL2/CCR2 signaling pathway significantly reduces macrophage infiltration and limits ectopic lesion growth [ 79 ]. Similarly, targeting angiogenesis via FGFR1 offers another direct approach; several FDA-approved FGFR inhibitors (e.g., Erdafitinib) currently used in oncology could be repurposed and evaluated for endometriosis to starve lesions of their blood supply [ 80 ]. Furthermore, future immunomodulatory strategies could specifically target CSF1R or S100A9 to deplete disease-associated macrophages or neutralize pro-inflammatory DAMPs within the pelvic microenvironment [ 81 ]. Collectively, these hub-targeted approaches align with emerging literature advocating for precision, non-hormonal immunotherapies [ 82 , 83 ], providing a rational roadmap for transitioning these in silico markers into clinical therapeutic targets.

Conclusions

The limited performance of existing non-invasive tests (including CA-125) highlights the need for sensitive and specific biomarkers for endometriosis. By uniformly reanalyzing three independent cohorts, we defined a conserved transcriptional signature and prioritized an 11-gene hub network ( VCAM1 , CCL2 , MCAM , CD14 , CD24 , FGFR1 , SIRPA , CSF1R , S100A9 , S100A8 , and LY96 ) that collectively sustains the inflammatory and angiogenic microenvironment of endometriosis. The enrichment of extracellular vesicle/exosome-associated genes supports the feasibility of translating these targets into liquid-biopsy panels. Connectivity mapping further nominated repurposable compounds predicted to reverse the EM signature, providing a rational, testable starting point for experimental validation in cellular and in vivo models. Moving forward, we plan to integrate laboratory-developed AI algorithms to score and weight these markers, validating their expression in endometrial tissues and circulating exosomes. This multi-modal strategy aims to construct a precise, non-invasive risk-prediction model and accelerate the development of targeted therapies for endometriosis. Limitations: This study is based on retrospective in silico re-analysis of public microarray datasets; findings require orthogonal validation at the protein level and in independent prospective cohorts. In addition, clinical heterogeneity (lesion type, stage, cycle phase) and platform-specific effects may influence gene expression estimates despite uniform preprocessing.

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: pmc

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Outcome instruments

rASRM

Condition tags

endometriosis

MeSH descriptors

Biomarkers Biomarkers Biomarkers Biomarkers Biomarkers Biomarkers Biomarkers Biomarkers Biomarkers Biomarkers Biomarkers Biomarkers Biomarkers Biomarkers Biomarkers Biomarkers Biomarkers Biomarkers Biomarkers Biomarkers

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-06-25T06:14:32.897245+00:00
pmc
last seen: 2026-05-13T20:22:03.195721+00:00
pubmed
last seen: 2026-06-25T06:09:59.281045+00:00
unpaywall
last seen: 2026-05-11T08:34:28.763810+00:00
License: CC-BY-4.0 · commercial use OK · attribution required
Courtesy of the U.S. National Library of Medicine