Identification and Subtype Analysis of Lipid Metabolism-Related Diagnostic Biomarkers for Endometriosis Based on WGCNA and Machine Learning

other OA: closed public-domain-us
Full text JSON View on PubMed View at publisher

Abstract

BACKGROUND: Endometriosis (EM), a disorder driven by persistent systemic inflammation, impacts around 10% of women in their reproductive period, often diagnosed only via surgery. Metabolic alterations, particularly in lipid metabolism, may uncover novel biomarkers. We aimed to identify diagnostic markers and molecular subtypes by integrating lipid metabolism gene expression and machine learning. METHODS: We downloaded gene expression datasets (GSE51981 and GSE7305) from the Gene Expression Omnibus (GEO) database. Differential expression was analyzed using limma (|log2FC| > 1, p.adj < 0.05); intersected with lipid genes to yield candidate genes. Weighted gene co-expression network analysis (WGCNA) demonstrated endometriosis-connected gene modules. Integrating lipid metabolism-related differentially expressed genes with WGCNA hub genes, followed by least absolute shrinkage and selection operator (LASSO) and XGBoost machine learning, identified diagnostic biomarkers. Their performance was validated using receiver operating characteristic (ROC) curves in an independent dataset. Immune infiltration, including CIBERSORT and single-sample GSEA (ssGSEA), gene set enrichment analysis (GSEA), and non-negative matrix factorization (NMF)-based subtype analyses were performed. MicroRNA (miRNA) and transcription factor (TF) regulatory networks were constructed using online databases. RESULTS: We identified 106 lipid metabolism-related differential genes. WGCNA revealed the turquoise module strongly correlated with endometriosis. ELOVL6 and MED20 were identified as key genes through machine learning algorithms. The two key genes emerged as robust diagnostic biomarkers, showing high area under the ROC curves (AUCs) across both training and validation sets. Immune infiltration analysis revealed distinct immune cell patterns in endometriosis, with ELOVL6 and MED20 correlating with specific immune cells. Subtype analysis, based on lipid metabolism scores, stratified patients into high and low score groups with differential gene expression and immune cell infiltration. Regulatory networks identified miRNAs and TFs targeting ELOVL6 and MED20. CONCLUSION: Our study identified ELOVL6 and MED20 as promising lipid metabolism-related diagnostic biomarkers for endometriosis. We also uncovered distinct molecular subtypes linked to lipid metabolism, providing novel insights into endometriosis heterogeneity and potential therapeutic targets.
Full text 2,807 characters · extracted from oa-doi-fallback · 4 sections · click to expand

Abstract

Background Endometriosis (EM), a disorder driven by persistent systemic inflammation, impacts around 10% of women in their reproductive period, often diagnosed only via surgery. Metabolic alterations, particularly in lipid metabolism, may uncover novel biomarkers. We aimed to identify diagnostic markers and molecular subtypes by integrating lipid metabolism gene expression and machine learning.

Methods

We downloaded gene expression datasets (GSE51981 and GSE7305) from the Gene Expression Omnibus (GEO) database. Differential expression was analyzed using limma (|log2FC| > 1, p.adj < 0.05); intersected with lipid genes to yield candidate genes. Weighted gene co-expression network analysis (WGCNA) demonstrated endometriosis-connected gene modules. Integrating lipid metabolism-related differentially expressed genes with WGCNA hub genes, followed by least absolute shrinkage and selection operator (LASSO) and XGBoost machine learning, identified diagnostic biomarkers. Their performance was validated using receiver operating characteristic (ROC) curves in an independent dataset. Immune infiltration, including CIBERSORT and single-sample GSEA (ssGSEA), gene set enrichment analysis (GSEA), and non-negative matrix factorization (NMF)-based subtype analyses were performed. MicroRNA (miRNA) and transcription factor (TF) regulatory networks were constructed using online databases.

Results

We identified 106 lipid metabolism-related differential genes. WGCNA revealed the turquoise module strongly correlated with endometriosis. ELOVL6 and MED20 were identified as key genes through machine learning algorithms. The two key genes emerged as robust diagnostic biomarkers, showing high area under the ROC curves (AUCs) across both training and validation sets. Immune infiltration analysis revealed distinct immune cell patterns in endometriosis, with ELOVL6 and MED20 correlating with specific immune cells. Subtype analysis, based on lipid metabolism scores, stratified patients into high and low score groups with differential gene expression and immune cell infiltration. Regulatory networks identified miRNAs and TFs targeting ELOVL6 and MED20.

Conclusion

Our study identified ELOVL6 and MED20 as promising lipid metabolism-related diagnostic biomarkers for endometriosis. We also uncovered distinct molecular subtypes linked to lipid metabolism, providing novel insights into endometriosis heterogeneity and potential therapeutic targets. Conflicts of Interest The authors declare no conflicts of interest. Data Availability Statement The data and materials in the current study are available from the corresponding author on reasonable request.

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-doi-fallback

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Condition tags

endometriosis

MeSH descriptors

Endometriosis Endometriosis Endometriosis Endometriosis Endometriosis Endometriosis Endometriosis Endometriosis Endometriosis Endometriosis Endometriosis Endometriosis Endometriosis Endometriosis Endometriosis Endometriosis Endometriosis Endometriosis Endometriosis Endometriosis

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-06-11T06:19:48.454388+00:00
pubmed
last seen: 2026-06-11T06:15:35.408417+00:00
unpaywall
last seen: 2026-05-11T08:34:28.763810+00:00
License: public-domain-us · commercial use OK · attribution required
Courtesy of the U.S. National Library of Medicine