Multi-omics analysis identifies CLDN1 as a key regulator of gallbladder cancer progression and prognosis | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Multi-omics analysis identifies CLDN1 as a key regulator of gallbladder cancer progression and prognosis Zhicheng Pan, Guodong Liu, Fei Tong, Zhenggen Hu, Cheng Qian, and 5 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-9088571/v1 This work is licensed under a CC BY 4.0 License Status: Under Review Version 1 posted 12 You are reading this latest preprint version Abstract Gallbladder cancer (GBC) is a highly aggressive biliary tract malignancy characterized by poor prognosis and a lack of reliable biomarkers for early diagnosis. Despite its clinical aggressiveness, the molecular drivers and cellular regulatory networks underlying GBC progression remain incompletely characterized. In this study, we identified CLDN1 as a key epithelial-associated gene in GBC and revealed an epithelial-centered immune regulatory network potentially involved in tumor progression. Integrated bulk transcriptome and single-cell RNA sequencing (scRNA-seq) analyses revealed that CLDN1 is markedly upregulated in GBC, with expression localized predominantly in epithelial cells. At the single-cell level, epithelial cells emerged as a central hub of cell–cell communication, exhibiting extensive interactions with neutrophils, natural killer (NK) cells, and monocytes, thereby associating with immunosuppressive or inflammatory landscape of the tumor microenvironment (TME). Functional enrichment and intercellular interaction analyses suggested that CLDN1 -high epithelial cells promote immune modulation via cytokine secretion and facilitate TME remodeling and metastasis through crosstalk with mesenchymal and endothelial cells. Notably, GBC epithelial cells exhibited significantly higher CLDN1 expression compared to those in chronic cholecystitis, reinforcing its disease-specific relevance. By integrating differential expression analysis, weighted gene co-expression network analysis (WGCNA), and machine learning approaches, CLDN1 was consistently identified as a key molecular feature of GBC. Collectively, our findings suggest that CLDN1 is closely associated with GBC initiation and progression through modulation of epithelial stability and epithelial–immune crosstalk, highlighting its potential as a biomarker for tumor classification, prognosis, and therapeutic targeting. gallbladder cancer transcriptome scRNA sequencing CLDN1 Figures Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 Figure 6 Figure 7 Introduction Gallbladder cancer (GBC) is the most common malignancy of the biliary system and ranks among the top six gastrointestinal neoplasms worldwide [ 1 ]. Globally, GBC accounts for more than 150,000 cancer-related deaths annually, representing approximately 1.5% of all cancer fatalities [ 2 ]. Its incidence exhibits marked geographic and demographic disparities, with high prevalence reported in Chile, India, parts of Asia, Eastern Europe, and South America [ 3 – 6 ]. GBC is more frequently diagnosed in elderly individuals, with a mean age of approximately 65 years, and occurs 2–6 times more often in women than in men [ 7 ]. These variations are thought to result from complex interactions between genetic susceptibility and environmental risk factors, including gallstones, liver fluke infection, chronic inflammation, abnormal pancreaticobiliary duct junctions, microbial infections, and exposure to toxic substances or carcinogens [ 8 – 11 ]. Clinically, GBC remains one of the most lethal biliary tract malignancies due to its insidious onset and lack of specific early symptoms. As a result, most patients are diagnosed at advanced stages, and the rate of potentially curative radical resection is limited to approximately 20% [ 12 ]. Although current diagnostic strategies rely on blood-based biomarkers combined with imaging modalities, their sensitivity and specificity are insufficient for detecting early-stage or small lesions [ 13 ]. Even with aggressive treatments such as radical cholecystectomy and adjuvant chemotherapy, postoperative recurrence and distant metastasis are common. The absence of reliable biomarkers for early diagnosis and disease progression prediction underscores an urgent need to identify novel molecular markers and regulatory mechanisms underlying GBC. Accumulating evidence suggests that genetic, epigenetic, and transcriptomic alterations contribute to GBC initiation and progression. Variations in genes involved in immune and inflammatory signaling, such as TLR4, have been associated with increased GBC risk, particularly in female patients [ 14 ]. Epigenetic dysregulation also plays an important role; for example, elevated expression of the histone acetyltransferase KAT5 promotes GBC cell proliferation, while its knockdown induces caspase-9–mediated apoptosis and suppresses tumor growth [ 15 ]. Transcriptome-wide analyses have revealed widespread dysregulation of mRNAs, lncRNAs, and microRNAs in GBC, implicating key oncogenic pathways including PI3K-Akt, Hedgehog, Wnt, KRAS, androgen receptor, and interferon-γ signaling [ 16 , 17 ]. However, most existing studies focus on individual molecules or pathways, and a systematic understanding of how these alterations integrate at the cellular and microenvironmental levels remains lacking. To address these limitations, we performed an integrative analysis combining bulk transcriptome and single-cell RNA sequencing (scRNA-seq) data from public databases[ 18 – 20 ]. Through transcriptome-based screening, we identified CLDN1 as a core characteristic gene significantly upregulated in GBC tissues. Subsequent single-cell analysis revealed that epithelial cells act as central regulators within the GBC tumor microenvironment, orchestrating immune cell interactions and signaling networks. Together, our study highlights a potential CLDN1 -driven epithelial–immune regulatory axis in GBC and provides new insights into disease mechanisms and candidate targets for diagnosis and therapy. Materials and methodsData acquisition The bulk RNA-seq dataset GSE276931 and GSE139682 used in this study were retrieved from Gene Expression Omnibus (GEO) database. Tissue specimens of gallbladder cancer of GSE276931 dataset including 7 tumors and 5 adjacent non-tumors. Tissue specimens of gallbladder cancer of GSE139682 dataset including 10 tumors and 10 adjacent non-tumors. The single-cell RNA dataset used in this study was obtained from a previously published multi-omic analysis of gallbladder cancer [46], which contains 4 gallbladder cancer tissue (GBC) and 4 cholecystitis tissue (CC) single-cell RNA sequencing samples. Chronic cholecystitis tissue has a high degree of similarity in cellular composition with normal gallbladder tissue[21]. Differential gene expression analysis in bulk-RNAseq Differentially expressed genes (DEGs) between the tumors and adjacent non-tumors in the GSE276931 dataset and GSE139682 dataset were visualized using the limma package and met the criteria of P values < 0.05 and Fold change log2FC ≥ 1 or log2FC ≤ − 1.The results were visualized through heatmaps and volcano plots to highlight the significantly upregulated and downregulated genes. Weighted gene co-expression network analysis (WGCNA) in bulk-RNAseq To further screen the key genes closely related to gallbladder atrophy, the weighted gene coexpression network analysis (WGCNA)[22] was used to construct the gene coexpression module for the dataset GSE276931. Quality control was conducted by removing genes and samples with excessive missing values or low expression levels. The soft-thresholding power was chosen the scale free topology fit index reached 0.8 for the first time, ensuring that the constructed network approximates a scale-free topology. This thresholding was used to calculate the adjacency matrix representing gene co-expression similarity. Genes were clustered into modules using average linkage hierarchical clustering based on topological overlap, and each module was assigned a unique color. Finally, cluster dendrogram and module-trait relationships were visualized to interpret the results. GO and KEGG pathway enrichment analysis in bulk-RNAseq After identifying the overlaping genes by intersecting WGCNA results and DEGs from GSE276931 dataset and GSE139682 dataset, respectively, we conducted KEGG and GO enrichment analyses. Functional enrichment analyses, including Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG), were conducted using the R package clusterProfiler (version 4.14.6). Gene annotations and mappings were performed based on the org.Hs.eg.db (version 3.20.0). Specifically, the enrichGO and enrichKEGG functions were employed to identify significantly enriched pathways (P-value < 0.05). The KEGG pathway data were sourced from the KEGG API ( https://www.genome.jp/kegg/pathway.html ), and GO terms were retrieved via the Gene Ontology resource ( http://geneontology.org/).Gen e Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses[23] were employed to detect the molecular functions and potential pathways of differentially expressed genes (DEGs). GO analysis primarily identifies key features of genes concerning molecular functions(MF), cellular components(CC), and biological processes(BP). KEGG analysis mainly reveals gene enrichment in specific signaling pathways, elucidating the potential roles and mechanisms of genes in disease development. KEGG and GO enrichment analyses were presented using bubble Charts. Machine learning in bulk-RNAseq We used Least Absolute Shrinkage and Selection Operator(LASSO), Support Vector Machine - Recursive Feature Elimination (SVM-RFE), and Random Forest (RF) [24]to the overlaping genes to futher identify key regulatory genes for GBC. Three models were constructed using the R package. LASSO uses L1 regularized linear regression method to achieve feature sparsity through the absolute value of penalty coefficient and automatically screen important variables. The SVM-RFE method selects the genes with the strongest discrimination ability based on the principle of minimum classification error and highest accuracy. RF selected the top 20 key genes according to the importance score of genes. Single‑cell sequencing data analysis The scRNA-seq data was processed by the Seurat package. Canonical Correlation Analysis (CCA) was used to find Mutual Nearest Neighbours (MNNs)[25]. Cells with more than 2,500 or fewer than 300 gene counts was filtered out. Subsequently, cells exceeding a mitochondrial gene percentage of 5% was filtered out to eliminate low-quality cells. To identify cell clusters, PCA analysis was first performed on the list of highly variable genes. Clustering was then performed using the findclusters function. Finally, Umap was used for visualization. For normalized gene expression data, we listed markers for each cell cluster using the findallmarkers function. A total of 12 cell types were finally obtained. Cell–cell interaction analysis To visualize and analyze intercellular communications, we conducted CellChat analysis[ 26 ]. CellChat identified differentially overexpressed ligands and receptorsfor each cell group and associated each interaction with a probability value to quantify communications between the two cell groups mediated by these signaling genes. Significant interactions were identified on the basis of a statistical test that randomly permuted the group labels of cells and then recalculated the interaction probability. Cell interaction with a P value < 0.05 was defined as significance.The results were visualized using a Circle plot. Differential gene expression analysis in scRNA-seq To identify the differential expression genes (DEGs) of epithelial cells, we used model-based analysis of single-cell transcriptomics test with genes detected in a minimum of 10% of all cells, met the criteria of P values < 0.05 and |Fold change log2FC| ≥ 1 in the Findmarker function by Seurat. Enrichment analysis in scRNA-seq After identifying the DEGs from scRNA-seq, we conducted KEGG and GO enrichment analyses.Gene enrichment analysis was conducted using the R package clusterProfiler. KEGG and GO enrichment analyses were presented using bar Charts. Subsequently, we used R software GSVA package to conduct ssGSEA to calculate the inflammation-related pathways score[27]. The result was visualized into heatmaps. Results Differential expression analysis of genes and functional analysis of DEGs We identified a total of 1387 differential expression genes (DEGs) from GSE276931 between the GBC and CC using the limma package with multiple testing correction. Applying a P value of < 0.05 and |Fold change log2FC| ≥ 1, 776 genes were significantly upregulated and 611 genes were significantly downregulated. The expression patterns of these DEGs are shown in the volcano plot (Fig. 1 A). The Top50 gene of these DEGs are shown in the heatmap (Fig. 1 B).In GSE139682, a total of 791 DEGs were identified, among them, 195 genes were significantly upregulated and 596 genes were significantly downregulated. The expression patterns of these DEGs are shown in the volcano plot (Fig. 1 C). The Top50 gene of these DEGs are shown in the heatmap (Fig. 1 D). Identification of GBC-associated co-expression modules and functional signatures via WGCNA To identify critical genes associated with GBC, weighted gene co-expression network analysis (WGCNA) was performed on 1,387 DEGs from the GSE276931 dataset to construct co-expression modules. Using a soft-threshold power of 30 (scale-free R 2 = 0.77), a total of 16 co-expression modules were identified (Fig. 2 A). Among these, the black, yellow, and lavenderblush3 modules exhibited significant positive correlations with GBC (P < 0.05), with the black module showing the highest correlation coefficient and the most significant P value (Fig. 2 B–C). By intersecting genes within the black module with DEGs from both GSE276931 and GSE139682, we identified 32 overlapping genes (Fig. 2 D).Functional enrichment analyses indicated that these genes were primarily involved in GO terms related to cell junction organization and epithelial–mesenchymal transition (EMT). Specifically, "apical junction complex" and "bicellular tight junction" were significantly enriched in the cellular component category, while "substrate adhesion-dependent cell spreading," "mitotic metaphase plate congression," and "positive regulation of protein kinase activity" dominated the biological process category (Fig. 2 E). Furthermore, KEGG pathway analysis revealed significant enrichment in the IL-17 signaling pathway, tight junctions, and cell adhesion molecules, suggesting that GBC progression is driven by inflammatory-mediated immune responses and junctional remodeling (Fig. 2 F). Additionally, the enrichment of amino acid and drug metabolism pathways points toward systemic metabolic reprogramming and altered detoxification functions during GBC development. Machine learning and screening of key regulatory genes To further screen the key regulatory genes in the process of gallbladder cancer, applied three machine learning methods-LASSO, SVM-RFE, and RF, to the 32 overlapping genes. LASSO identified 3 characteristic genes- CLDN1 , SHANK2 and EFNA1 significantly associated with GBC from top 10 genes through lasso coefficient path analysis and cross validation error (Fig. 3 A). Random forest analysis selected the top 20 key genes according to the importance score of genes (Fig. 3 B-C). The SVM-RFE method selected the gene set with the strongest discrimination ability based on the principle of minimum classification error and highest accuracy (Fig. 3 D). we focused on the intersection of genes consistently identified across all 3 methods, resulting in one common gene: CLDN1 (Fig. 3 E). The expression level of the CLDN1 gene in the GBC group is significantly higher than that in the adjacent cancer group (Fig. 3 F). Integration and clustering of scRNA-seq data Single-cell analysis was performed on openly available datasets. Cells were classified into 12 different distinct clusters following umap visual analysis after quality control and preprocessing of the data (Fig. 4 A). Subsequently, we provided detailed annotations on the cell subpopulations of the GBC group and CC group. By analyzing the expression of established molecular markers, the specific types of cells we have identified were mainly immune cells and structural cells, such as: T cells, monocytes, NK cells, B cells, dendritic cells (DCS), Epithelial cells, Endothelial cells, Mesenchymal cells, Neutrophils, Mast cells and plasma cells, etc. in GBC group and CC group (Fig. 4 B-C). Although the cell types of GBC group and CC group were similar, there were some differences in the proportion and distribution pattern (Fig. 4 E). In the CC group, T cells dominate, followed by Mononuclear cells and mesenchymal cells, which exhibited stronger cellular immune response characteristics and retains more mesenchymal matrix components in the cellular microenvironment. In the GBC group, mononuclear cells were the predominant population, followed by mesenchymal cells, reflecting enhanced innate immune activity and matrix remodeling. The accuracy of cell type annotation was further confirmed by typical marker gene expression heatmaps (Fig. 4 D). Epithelial cells were located in the core regulatory position in GBC To quantify intercellular communications, we employed CellChat to construct signaling networks within the GBC microenvironment. Epithelial cells emerged as the central hub of the interaction network, exhibiting significant crosstalk with neutrophils, monocytes, NK cells, and stromal cells (Fig. 5 A). Compared to the balanced interaction pattern in chronic cholecystitis (CC) (Fig. 5 C), GBC tissues showed distinct remodeling of the tumor microenvironment (TME) (Fig. 5 B). Notably, interactions between epithelial cells and neutrophils or mesenchymal cells were markedly enhanced in GBC, with the latter showing the most significant intensity, potentially driving TME formation (Fig. 5 D).Further investigation using pseudo-temporal trajectory analysis revealed that epithelial cells in GBC exhibited a more extensive distribution compared to CC (Fig. 5 E). Specifically, a subset of cells in GBC shifted toward advanced evolutionary stages, suggesting they may have undergone epithelial–mesenchymal transition (EMT) to promote tumor progression. Collectively, these findings indicate that epithelial cells act as a primary orchestrator of immune regulation and structural remodeling in GBC. Differential expression analysis and functional enrichment of epithelial cells To elucidate the molecular characteristics of epithelial cells in GBC, we performed differential expression and functional enrichment analyses. We identified 3,302 significantly upregulated and 2,976 downregulated genes in GBC epithelial cells compared to the CC group (Fig. 6 A). GO enrichment analysis (Fig. 6 B) revealed that these genes were primarily associated with neutrophil-mediated immunity, antigen processing and presentation (MHC class I), and cell–substrate junctions. These findings suggest that GBC epithelial cells actively orchestrate immune responses and facilitate intercellular signaling through specialized membrane structures. KEGG pathway analysis further highlighted the enrichment of phagosome, cell cycle, and endocytosis pathways (Fig. 6 C), suggesting their contribution to the malignant phenotype of GBC. Notably, inflammatory pathways were consistently enriched across both bulk and single-cell datasets. Subsequent ssGSEA scoring confirmed that GBC epithelial cells possessed significantly higher inflammation scores than those in the CC group (Fig. 6 D), reflecting a pro-inflammatory state within the tumor microenvironment that may drive disease progression. Expression of CLDN1 in epithelial cells We expressed the CLDN1 gene overlapped by machine learning in the U-Map plot and found that it is mainly expressed in epithelial cells (Fig. 7 A-B).Comparing the expression levels of CLDN1 in the epithelial cells of CC group and GBC group, it was found that the expression level in GBC group was significantly higher than that of CC group with P < 0.05 (Fig. 7 C). Discussion Gallbladder cancer (GBC) is typically diagnosed at advanced stages, rendering early detection a formidable clinical challenge. While early-stage diagnosis significantly improves survival, the overall prognosis remains suboptimal, with only a fraction of patients achieving favorable long-term outcomes [ 29 , 30 ]. These therapeutic limitations underscore an urgent imperative to identify robust biomarkers and to elucidate the fundamental biological mechanisms driving GBC progression. In this study, we identified CLDN1 as a core molecular feature of GBC and, for the first time, systematically linked its expression to an epithelial-centered immune regulatory network at single-cell resolution. Moving beyond the traditional focus on isolated genes, our findings illuminate the pivotal role of epithelial cells as primary coordinators of tumor progression and immune landscape modulation in GBC. CLDN1 is a scaffold component of tight junction complexes, which, alongside occludin and junctional adhesion molecules, maintains epithelial barrier integrity and intercellular cohesion [ 31 ]. Dysregulated CLDN1 has been implicated in the initiation and invasion of various malignancies, driven by oncogenic signaling, genetic aberrations, and microenvironmental cues. For instance, BRAF V600E mutations upregulate CLDN1 via NF-κB activation in thyroid cancer [ 32 ], with parallel associations observed in colorectal polyps [ 33 ]. In head and neck squamous cell carcinoma and hepatocellular carcinoma (HCC), CLDN1 overexpression facilitates tumor advancement through Wnt/β-catenin signaling [ 34 ], contributing to epithelial–mesenchymal transition (EMT), metabolic reprogramming, and tumor immune microenvironment (TIME) remodeling [ 35 ]. While the specific role of CLDN1 in GBC has remained largely elusive, our functional enrichment analyses suggest it exerts analogous oncogenic effects. GO and KEGG pathways were significantly enriched in substrate adhesion, cadherin binding, and tight junction organization—processes integral to cell migration and metastatic potential [ 36 – 38 ]. The disruption of these adhesive architectures often triggers EMT, thereby enhancing tumor aggressiveness [ 37 , 38 ]. Crucially, these functional patterns were predominantly localized within epithelial cells, reinforcing their role as the primary engine of GBC progression. Beyond structural maintenance, we discovered that epithelial cells possess potent immunomodulatory properties. Our cell–cell communication analysis identified epithelial cells as a central hub within the interaction network, orchestrating crosstalk between neutrophils, natural killer (NK) cells, and monocytes. Epithelial cells are known to secrete a spectrum of cytokines, such as IL-1, IL-8, IL-18, TNF-α , and TGF-β , which act as critical mediators of inflammatory responses and immune polarization [ 38 – 40 ]. Furthermore, epithelial-derived metabolites like retinoic acid are essential for maintaining immune homeostasis [ 41 ]. Our findings support a paradigm shift: viewing GBC epithelial cells not merely as structural units but as active immunometabolic regulators of the TME. The interplay between epithelial-mesenchymal plasticity and the TME is a hallmark of metastasis. The intensified interactions between epithelial, mesenchymal, and endothelial cells observed in our study likely facilitate microenvironmental remodeling and systemic dissemination. This aligns with evidence that microRNA clusters, such as miR-214/miR-3120, can modulate EMT and autophagy to suppress GBC progression [ 42 ]. Consequently, this epithelial-centered interaction axis represents a critical vulnerability in GBC pathogenesis that could be exploited for therapeutic intervention. Finally, our single-cell validation confirmed that CLDN1 is both cell-type specific and significantly upregulated in GBC compared to chronic cholecystitis, emphasizing its diagnostic relevance. Given that CLDN1 correlates with prognosis in colorectal cancer [ 43 ], and considering the clinical success of CLDN1 8.2-targeted therapies in gastric cancers [ 44 , 45 ], CLDN1 emerges as a compelling target for GBC. Our data suggest that CLDN1 could serve as a versatile tool for early diagnosis, molecular classification, and the development of precision therapies. In conclusion, our study identifies a previously unrecognized CLDN1 –epithelial–immune regulatory axis in GBC. These insights not only deepen our understanding of GBC biology but also provide a conceptual framework for developing epithelial-targeted diagnostic and therapeutic strategies. Despite the insights gained from our multi-omics analysis, this study has several limitations. First, the sample sizes of the primary datasets (GSE276931 and GSE139682) are relatively small, which may affect the generalizability of the findings. Second, while CLDN1 was identified as a key regulator through bioinformatics, further in vitro and in vivo experimental validations are required to confirm its functional roles and underlying mechanisms in gallbladder cancer. Future studies with larger clinical cohorts and functional assays are warranted to strengthen these conclusions. Conclusion This study integrates bulk and single-cell transcriptomic analyses to elucidate key regulatory mechanisms in gallbladder cancer. We demonstrate that epithelial cells act as the central hub of the cell–cell interaction network, coordinating immune regulation through interactions with neutrophils, natural killer cells, and monocytes, while crosstalk with mesenchymal and endothelial cells may promote tumor microenvironment remodeling and metastasis. Notably, CLDN1 was identified as a core gene predominantly expressed in epithelial cells and may contribute to GBC progression by regulating epithelial integrity, inflammatory responses, and the tumor microenvironment. These findings suggest that CLDN1 and epithelial-centered regulatory networks represent promising candidates for tumor classification, targeted therapy, and prognosis assessment, warranting further functional and clinical validation. Declarations Authors contribution Jianbo Han, Yun Xiao, and Dayun Lu conceptualized the study, developed the methodology, and supervised the project and manuscript revision. Zhicheng Pan, Guodong Liu, and Fei Tong executed data collection, performed machine learning-based formal analysis, and drafted the original manuscript. Zhenggen Hu, Cheng Qian, Shijing Xing, and Xinyi Shen contributed to data curation, methodological implementation, and the visualization of research results. All authors read and approved the final manuscript. Conflict of interests The authors declare no competing interests. Funding The authors declare that no funds, grants, or other support were received during the preparation of this manuscript. Data Availability Statement The datasets analyzed in this study are publicly available in the Gene Expression Omnibus (GEO) repository. The bulk RNA-seq data can be accessed via the following links: GSE276931 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE276931) and GSE139682 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE139682). The single-cell RNA-seq data of gallbladder cancer (Reference [46]) is available at [https://zenodo.org/records/15400138]. Functional enrichment resources can be accessed via Gene Ontology (http://geneontology.org/) and KEGG (https://www.genome.jp/kegg/pathway.html). Ethics declaration Ethics declaration: Not applicable. Clinical trial number Clinical trial number: not applicable. Consent to Participate declaration Consent to Participate declaration: not applicable. Consent to Publish declaration Consent to Publish declaration: not applicable. References Song, X., et al ., Overview of current targeted therapy in gallbladder cancer. Signal Transduct Target Ther, 2020. 5 (1): p. 230. Rawla, P., et al ., Epidemiology of gallbladder cancer. Clin Exp Hepatol, 2019. 5 (2): p. 93-102. Krell, R.W. and A.C. Wei, Gallbladder cancer: surgical management. Chin Clin Oncol, 2019. 8 (4): p. 36. Thomas, T.S., et al ., Advancing age and the risk of bleomycin pulmonary toxicity in a largely older cohort of patients with newly diagnosed Hodgkin Lymphoma. J Geriatr Oncol, 2020. 11 (1): p. 69-74. Kumar, A., et al ., Gallbladder cancer: Progress in the Indian subcontinent. World J Clin Oncol, 2024. 15 (6): p. 695-716. Noel, R., et al ., Does the Frequency of Cholecystectomy Affect the Ensuing Incidence of Gallbladder Cancer in Sweden? A Population-Based Study with a 16-Year Coverage. World J Surg, 2016. 40 (1): p. 66-72. Van Dyke, A.L., et al ., Biliary tract cancer incidence and trends in the United States by demographic group, 1999-2013. Cancer, 2019. 125 (9): p. 1489-1498. Shin, H.R., et al ., Epidemiology of cholangiocarcinoma: an update focusing on risk factors. Cancer Sci, 2010. 101 (3): p. 579-85. Rawla, P., T. Sunkara, and A. Barsouk, Epidemiology of colorectal cancer: incidence, mortality, survival, and risk factors. Prz Gastroenterol, 2019. 14 (2): p. 89-103. Dixit, R., et al ., Comparative Analysis of Mutational Profile of Sonic hedgehog Gene in Gallbladder Cancer. Dig Dis Sci, 2017. 62 (3): p. 708-714. Lee, W. and V.S. Chandan, Gallbladder carcinomas: review and updates on morphology, immunohistochemistry, and staging. Hum Pathol, 2023. 132 : p. 149-157. Ren, T., et al ., [Analysis of treatment modalities and prognosis of patients with gallbladder cancer in China from 2010 to 2017]. Zhonghua Wai Ke Za Zhi, 2020. 58 (9): p. 697-706. Deo, K.B., et al ., Misdiagnosis of carcinoma gall bladder in endemic regions. BMC Surg, 2022. 22 (1): p. 343. Seretis, C., et al ., Metaplastic changes in chronic cholecystitis: implications for early diagnosis and surgical intervention to prevent the gallbladder metaplasia-dysplasia-carcinoma sequence. J Clin Med Res, 2014. 6 (1): p. 26-9. Feinberg, A.P. and B. Tycko, The history of cancer epigenetics. Nat Rev Cancer, 2004. 4 (2): p. 143-53. Dixit, R., et al ., Unravelling of the comparative Transcriptomic Profile of Gallbladder Cancer using mRNA sequencing. Mol Biol Rep, 2022. 49 (7): p. 6395-6403. Saxena, R., et al ., Next generation sequencing uncovers multiple miRNAs associated molecular targets in gallbladder cancer patients. Sci Rep, 2023. 13 (1): p. 19101. Ozsolak, F. and P.M. Milos, RNA sequencing: advances, challenges and opportunities. Nat Rev Genet, 2011. 12 (2): p. 87-98. Zhang, X., F. Zhang, and X. Xu, Single-cell RNA sequencing in exploring the pathogenesis of diabetic retinopathy. Clin Transl Med, 2024. 14 (7): p. e1751. Van Hove, I., et al ., Single-cell transcriptome analysis of the Akimba mouse retina reveals cell-type-specific insights into the pathobiology of diabetic retinopathy. Diabetologia, 2020. 63 (10): p. 2235-2248. Zhou, T., et al ., Multi-omic analysis of gallbladder cancer identifies distinct tumor microenvironments associated with disease progression. Nat Genet, 2025. 57 (8): p. 1935-1949. Langfelder, P. and S. Horvath, WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics, 2008. 9 : p. 559. Kanehisa, M., et al ., KEGG for taxonomy-based analysis of pathways and genomes. Nucleic Acids Res, 2023. 51 (D1): p. D587-d592. Alazaidah, R., et al ., Potential of Machine Learning for Predicting Sleep Disorders: A Comprehensive Analysis of Regression and Classification Models. Diagnostics (Basel), 2023. 14 (1). Slovin, S., et al ., Single-Cell RNA Sequencing Analysis: A Step-by-Step Overview. Methods Mol Biol, 2021. 2284 : p. 343-365. Jin, S., et al ., Inference and analysis of cell-cell communication using CellChat. Nat Commun, 2021. 12 (1): p. 1088. Hänzelmann, S., R. Castelo, and J. Guinney, GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics, 2013. 14 : p. 7. Liu, L., et al ., SIRT3 inhibits gallbladder cancer by induction of AKT-dependent ferroptosis and blockade of epithelial-mesenchymal transition. Cancer Lett, 2021. 510 : p. 93-104. Feo, C.F., et al ., Current management of incidental gallbladder cancer: A review. Int J Surg, 2022. 98 : p. 106234. Roa, J.C., et al ., Gallbladder cancer. Nat Rev Dis Primers, 2022. 8 (1): p. 69. Ding, L., et al ., The claudin family of proteins in human malignancy: a clinical perspective. Cancer Manag Res, 2013. 5 : p. 367-75. Ross-Orozco, A., et al ., Association between BRAFV600E mutation and gene expression of CLDN1, TIMP1, and KRT19 in papillary thyroid cancer. Mol Cell Endocrinol, 2025. 607 : p. 112614. Caruso, M., et al ., Claudin-1 Expression Is Elevated in Colorectal Cancer Precursor Lesions Harboring the BRAF V600E Mutation. Transl Oncol, 2014. 7 (4): p. 456-63. Kayamori, K., et al ., Role of Stromal Fibroblast-Induced WNT7A Associated with Cancer Cell Migration Through the AKT/CLDN1 Signaling Axis in Oral Squamous Cell Carcinoma. Lab Invest, 2023. 103 (10): p. 100228. Wang, W., et al ., Claudins and hepatocellular carcinoma. Biomed Pharmacother, 2024. 171 : p. 116109. Al Azim, M. and J.S. Di Martino, ECM, integrins, and DDRs: A nexus of cancer progression, therapy, and future directions. Matrix Biol, 2025. 138 : p. 27-43. Danilova, N.V. and N.A. Oleynikova, [E-cadherin: structure and functions, role in gastric cancer carcinogenesis]. Arkh Patol, 2023. 85 (4): p. 70-77. Andrews, C., M.H. McLean, and S.K. Durum, Cytokine Tuning of Intestinal Epithelial Function. Front Immunol, 2018. 9 : p. 1270. Scarpa, M., et al ., The epithelial danger signal IL-1α is a potent activator of fibroblasts and reactivator of intestinal inflammation. Am J Pathol, 2015. 185 (6): p. 1624-37. Eckmann, L., M.F. Kagnoff, and J. Fierer, Epithelial cells secrete the chemokine interleukin-8 in response to bacterial entry. Infect Immun, 1993. 61 (11): p. 4569-74. Cao, Y.G., et al ., Faecalibaculum rodentium remodels retinoic acid signaling to govern eosinophil-dependent intestinal epithelial homeostasis. Cell Host Microbe, 2022. 30 (9): p. 1295-1310.e8. Li, W., et al ., The microRNA cluster miR-214/miR-3120 prevents tumor cell switching from an epithelial to a mesenchymal-like phenotype and inhibits autophagy in gallbladder cancer. Cell Signal, 2021. 80 : p. 109887. Ouban, A., Claudin-1 role in colon cancer: An update and a review. Histol Histopathol, 2018. 33 (10): p. 1013-1019. Nakayama, I., et al ., Claudin 18.2 as a novel therapeutic target. Nat Rev Clin Oncol, 2024. 21 (5): p. 354-369. Shah, M.A., et al ., Zolbetuximab plus CAPOX in CLDN18.2-positive gastric or gastroesophageal junction adenocarcinoma: the randomized, phase 3 GLOW trial. Nat Med, 2023. 29 (8): p. 2133-2141. Multi-omic analysis of gallbladder cancer identifies distinct tumor microenvironments associated with disease progression. https://zenodo.org/records/15400138. Accessed 03 Feb 2026. Additional Declarations No competing interests reported. Cite Share Download PDF Status: Under Review Version 1 posted Reviews received at journal 08 May, 2026 Reviewers agreed at journal 30 Apr, 2026 Reviews received at journal 29 Apr, 2026 Reviewers agreed at journal 29 Apr, 2026 Reviews received at journal 27 Apr, 2026 Reviewers agreed at journal 27 Apr, 2026 Reviewers agreed at journal 07 Apr, 2026 Reviewers invited by journal 03 Apr, 2026 Editor invited by journal 23 Mar, 2026 Editor assigned by journal 22 Mar, 2026 Submission checks completed at journal 19 Mar, 2026 First submitted to journal 18 Mar, 2026 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-9088571","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":619593866,"identity":"b9c133c2-38f9-470e-b4d0-4c4008c3b963","order_by":0,"name":"Zhicheng Pan","email":"","orcid":"","institution":"Department of general surgery,Nanjing Red Cross Hospital,Nanjing 210002, China","correspondingAuthor":false,"prefix":"","firstName":"Zhicheng","middleName":"","lastName":"Pan","suffix":""},{"id":619593868,"identity":"7c7e663e-392c-454b-877b-24b295aa1ac8","order_by":1,"name":"Guodong Liu","email":"","orcid":"","institution":"Department of general surgery,Nanjing Red Cross Hospital,Nanjing 210002, China","correspondingAuthor":false,"prefix":"","firstName":"Guodong","middleName":"","lastName":"Liu","suffix":""},{"id":619593870,"identity":"b8fa34ac-64a1-4265-85c3-b272b67e54c5","order_by":2,"name":"Fei Tong","email":"","orcid":"","institution":"Department of general surgery,Nanjing Red Cross Hospital,Nanjing 210002, China","correspondingAuthor":false,"prefix":"","firstName":"Fei","middleName":"","lastName":"Tong","suffix":""},{"id":619593871,"identity":"92a13abf-610b-4fb7-b523-e6e870148356","order_by":3,"name":"Zhenggen Hu","email":"","orcid":"","institution":"Department of general surgery,Nanjing Red Cross Hospital,Nanjing 210002, China","correspondingAuthor":false,"prefix":"","firstName":"Zhenggen","middleName":"","lastName":"Hu","suffix":""},{"id":619593872,"identity":"d627301c-9b7a-4907-9157-72da61cc208a","order_by":4,"name":"Cheng Qian","email":"","orcid":"","institution":"Department of general surgery,Nanjing Red Cross Hospital,Nanjing 210002, China","correspondingAuthor":false,"prefix":"","firstName":"Cheng","middleName":"","lastName":"Qian","suffix":""},{"id":619593873,"identity":"ffbb3a83-1de8-4a1b-be1d-a7f15a7a0a4c","order_by":5,"name":"Shijing Xing","email":"","orcid":"","institution":"Department of general surgery,Nanjing Red Cross Hospital,Nanjing 210002, China","correspondingAuthor":false,"prefix":"","firstName":"Shijing","middleName":"","lastName":"Xing","suffix":""},{"id":619593874,"identity":"a392a89c-72d0-4556-8ada-ae8b74f6e9b8","order_by":6,"name":"Xinyi Shen","email":"","orcid":"","institution":"Department of general surgery,Nanjing Red Cross Hospital,Nanjing 210002, China","correspondingAuthor":false,"prefix":"","firstName":"Xinyi","middleName":"","lastName":"Shen","suffix":""},{"id":619593875,"identity":"1ccf8164-f5ce-4dbc-ba5c-d6eb397948bb","order_by":7,"name":"Dayun Lu","email":"","orcid":"","institution":"School of Medicine, Nanjing University of Chinese Medicine, Nanjing 210023, China","correspondingAuthor":false,"prefix":"","firstName":"Dayun","middleName":"","lastName":"Lu","suffix":""},{"id":619593876,"identity":"4e88513e-c5a8-4fbd-a395-d20c8278d2be","order_by":8,"name":"Yun Xiao","email":"","orcid":"","institution":"Department of general surgery,Nanjing Red Cross Hospital,Nanjing 210002, China","correspondingAuthor":false,"prefix":"","firstName":"Yun","middleName":"","lastName":"Xiao","suffix":""},{"id":619593877,"identity":"ea732e50-1bb7-4f6b-8fa3-65ac1ceac984","order_by":9,"name":"Jianbo Han","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA8klEQVRIiWNgGAWjYBACAxBOYGBIYGBmPvjgQ4WEnDzxWtjZkg1nnLEwNmwgQgsIJDDw85hJ87ZVJDIcIKDFnP3sgYIHNXfy+Jl5jA1450kkMDYwP3x0A48Wy568BIOEY8+KJZvZCh9IbpPIY2dgMzbOweewAzkGBglshxM3HGbebGC4TaKYsYGHTRqvlvNvgFr+gbQwmEkkzpFIbDhASMsNoC2JbSAtLGYSBxuI0gK0JbHvMMgvyYYNxySMDZsJ+eV8jpnhj2+H8/j5Dx98/KemTk6evfnhY3xagIDNAJXPjF85WMkDwmpGwSgYBaNgRAMAVSpPhWPnuQwAAAAASUVORK5CYII=","orcid":"","institution":"Department of general surgery,Nanjing Red Cross Hospital,Nanjing 210002, China","correspondingAuthor":true,"prefix":"","firstName":"Jianbo","middleName":"","lastName":"Han","suffix":""}],"badges":[],"createdAt":"2026-03-11 01:23:58","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-9088571/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-9088571/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":106725611,"identity":"06692c6a-d574-4e87-b578-b28bfe396095","added_by":"auto","created_at":"2026-04-12 18:33:07","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":767201,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eDifferent expression genes identified from GSE276931 and GSE139682 between tumor group and adjacent tumor group. (A)\u003c/strong\u003e The red and blue dots refer to DEGs with upregulated and downregulated gene expression, respectively, from GSE276931.\u003cstrong\u003e (B) \u003c/strong\u003eHierarchical clustering analysis of the DEGs from GSE276931. The color bar was employed to represent the expression levels of the DEGs, where red color was indicative of high expression and blue color was indicative of low expression. \u003cstrong\u003e(C)\u003c/strong\u003e The red and blue dots refer to DEGs with upregulated and downregulated gene expression, respectively, from GSE139682.\u003cstrong\u003e (D) \u003c/strong\u003eHierarchical clustering analysis of the DEGs from GSE139682. The color bar was employed to represent the expression levels of the DEGs, where red color was indicative of high expression and blue color was indicative of low expression.\u003c/p\u003e","description":"","filename":"floatimage1.png","url":"https://assets-eu.researchsquare.com/files/rs-9088571/v1/d02a73c00a8da67ea3448af7.png"},{"id":106519101,"identity":"db66aa52-3ed0-4bd0-b8fb-82fc86657a17","added_by":"auto","created_at":"2026-04-09 12:27:39","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":607054,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eCo-expression network of DEGs from GSE276931 and Screening and functional enrichment analysis of key genes associated with GBC. (A)\u003c/strong\u003e Cluster tree dendrogram of the co-expression modules. Different colors represent distinct co-expression modules. \u003cstrong\u003e(B)\u003c/strong\u003e Module-Trait Relationships Heatmap, correlation analysis between module eigengenes and clinical status. Each row represents a module; each column represents a clinical status.\u003cstrong\u003e (C) \u003c/strong\u003eModule-Trait Relationships barplot, correlation coefficients between modules and traits are expressed as bar lengths, * p \u0026lt; 0.05, * * p \u0026lt; 0.01, Numbers in parentheses indicate the number of genes included in the module. \u003cstrong\u003e(D) \u003c/strong\u003eThe Venn plot shows that 32 overlapping genes were identified by intersecting black module genes from WGCNA and DEGs from GSE276931 and GSE139682, respectively. \u003cstrong\u003e(E)\u003c/strong\u003eGo functional enrichment of overlapping genes, including biological processes, cellular components, and molecular functions.\u003cstrong\u003e (F)\u003c/strong\u003e KEGG analysis of overlapping genes.\u003c/p\u003e","description":"","filename":"floatimage2.png","url":"https://assets-eu.researchsquare.com/files/rs-9088571/v1/748d1bf9e48cc43b878b6e48.png"},{"id":106519102,"identity":"f3cd377c-2396-4b4d-aeb3-d5d1a364713e","added_by":"auto","created_at":"2026-04-09 12:27:39","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":378576,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eThree machine learning methods for identifying the core characteristic genes. (A)\u003c/strong\u003eLasso regression analysis to screen GBC key genes, Left: the coefficient of top 10 gene changes with log lambda; Right: cross validation error curve. \u003cstrong\u003e(B) \u003c/strong\u003eTop 20 gallbladder cancer related genes screened by random forest model and their feature importance scores.\u003cstrong\u003e (C) \u003c/strong\u003eVariation curve of OOB error rate of random forest with the number of trees.\u003cstrong\u003e (D)\u003c/strong\u003e Variation curve of cross validation accuracy of SVM-RFE model with the number of features.\u003cstrong\u003e (E) \u003c/strong\u003eWayne diagram of core characteristic genes screened by SVM-RFE, Lasso, and random forest.\u003cstrong\u003e (F)\u003c/strong\u003e Expression of \u003cem\u003eCLDN1\u003c/em\u003e gene in tumor group and adjacent tumor group.\u003c/p\u003e","description":"","filename":"floatimage3.png","url":"https://assets-eu.researchsquare.com/files/rs-9088571/v1/89f96459a196a3c1e3715f6d.png"},{"id":106724869,"identity":"590b6a3c-6e64-4057-834e-a5635c51dc20","added_by":"auto","created_at":"2026-04-12 18:30:12","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":568263,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003escRNA-seq analysis of gallbladder cancer tissue (GBC) and cholecystitis tissue (CC).\u003c/strong\u003e \u003cstrong\u003e(A) \u003c/strong\u003e12 different distinct clusters analysis by PCA and UMAP. \u003cstrong\u003e(B)\u003c/strong\u003e U-MAP plot showing clusters of cells in GBC. \u003cstrong\u003e(C)\u003c/strong\u003e U-MAP plot showing clusters of cells in CC. \u003cstrong\u003e(D)\u003c/strong\u003e Heatmap showing the expression of marker genes in each cell type. \u003cstrong\u003e(E)\u003c/strong\u003eProportion of cell type clusters present in each GBC and CC groups.\u003c/p\u003e","description":"","filename":"floatimage4.png","url":"https://assets-eu.researchsquare.com/files/rs-9088571/v1/073056a184e4e0159496dcbc.png"},{"id":106519104,"identity":"cd74ea14-4696-468c-86ed-08aa17043f86","added_by":"auto","created_at":"2026-04-09 12:27:39","extension":"png","order_by":5,"title":"Figure 5","display":"","copyAsset":false,"role":"figure","size":704345,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eCell–cell interaction analysis and pseudotime analysis.\u003c/strong\u003e \u003cstrong\u003e(A)\u003c/strong\u003eoverall cell–cell interaction. \u003cstrong\u003e(B) \u003c/strong\u003eGBC cell–cell interaction.\u003cstrong\u003e (C) \u003c/strong\u003eNon-GBC cell–cell interaction. \u003cstrong\u003e(D) \u003c/strong\u003eEpithelial cells-other interaction. \u003cstrong\u003e(E)\u003c/strong\u003e Pseudotime analysis.\u003c/p\u003e","description":"","filename":"floatimage5.png","url":"https://assets-eu.researchsquare.com/files/rs-9088571/v1/06733e6dec255af24c7f7426.png"},{"id":106724780,"identity":"850d1133-48e7-44a3-9853-9948d4934e55","added_by":"auto","created_at":"2026-04-12 18:29:43","extension":"png","order_by":6,"title":"Figure 6","display":"","copyAsset":false,"role":"figure","size":548974,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eFunctional enrichment of epithelial cells.\u003c/strong\u003e \u003cstrong\u003e(A) \u003c/strong\u003eDifferential expression analysis on epithelial cells. \u003cstrong\u003e(B)\u003c/strong\u003e Top 5 GO terms in BP, CC and MF of epithelial cells. \u003cstrong\u003e(C) \u003c/strong\u003eTop 15 KEGG pathways of epithelial cells. \u003cstrong\u003e(D) \u003c/strong\u003eThe ssGSEA analysis of inflammation-related pathways.\u003c/p\u003e","description":"","filename":"floatimage6.png","url":"https://assets-eu.researchsquare.com/files/rs-9088571/v1/d15b2d616565e70a098ca35e.png"},{"id":106519106,"identity":"af58f031-bf82-4e08-99a6-701683364eda","added_by":"auto","created_at":"2026-04-09 12:27:39","extension":"png","order_by":7,"title":"Figure 7","display":"","copyAsset":false,"role":"figure","size":358297,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eExpression validation of \u003c/strong\u003e\u003cem\u003e\u003cstrong\u003eCLDN1\u003c/strong\u003e\u003c/em\u003e\u003cstrong\u003e.\u003c/strong\u003e \u003cstrong\u003e(A)\u003c/strong\u003eThe expression distribution of \u003cem\u003eCLDN1\u003c/em\u003e. \u003cstrong\u003e(B)\u003c/strong\u003e \u003cem\u003eCLDN1\u003c/em\u003e expression across cell types. \u003cstrong\u003e(C) \u003c/strong\u003e\u003cem\u003eCLDN1\u003c/em\u003e expression in CC group and GBC group.\u003c/p\u003e","description":"","filename":"floatimage7.png","url":"https://assets-eu.researchsquare.com/files/rs-9088571/v1/e2d3bc8370d365c9c8df1992.png"},{"id":106959648,"identity":"c0ed0494-8633-4991-96c4-1d178fe2c92e","added_by":"auto","created_at":"2026-04-15 09:12:54","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":5134495,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-9088571/v1/8f3f5008-241d-436d-afaf-9c8191b84f63.pdf"}],"financialInterests":"No competing interests reported.","formattedTitle":"Multi-omics analysis identifies CLDN1 as a key regulator of gallbladder cancer progression and prognosis","fulltext":[{"header":"Introduction","content":"\u003cp\u003eGallbladder cancer (GBC) is the most common malignancy of the biliary system and ranks among the top six gastrointestinal neoplasms worldwide [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e]. Globally, GBC accounts for more than 150,000 cancer-related deaths annually, representing approximately 1.5% of all cancer fatalities [\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e]. Its incidence exhibits marked geographic and demographic disparities, with high prevalence reported in Chile, India, parts of Asia, Eastern Europe, and South America [\u003cspan additionalcitationids=\"CR4 CR5\" citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e]. GBC is more frequently diagnosed in elderly individuals, with a mean age of approximately 65 years, and occurs 2\u0026ndash;6 times more often in women than in men [\u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e]. These variations are thought to result from complex interactions between genetic susceptibility and environmental risk factors, including gallstones, liver fluke infection, chronic inflammation, abnormal pancreaticobiliary duct junctions, microbial infections, and exposure to toxic substances or carcinogens [\u003cspan additionalcitationids=\"CR9 CR10\" citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eClinically, GBC remains one of the most lethal biliary tract malignancies due to its insidious onset and lack of specific early symptoms. As a result, most patients are diagnosed at advanced stages, and the rate of potentially curative radical resection is limited to approximately 20% [\u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e12\u003c/span\u003e]. Although current diagnostic strategies rely on blood-based biomarkers combined with imaging modalities, their sensitivity and specificity are insufficient for detecting early-stage or small lesions [\u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e13\u003c/span\u003e]. Even with aggressive treatments such as radical cholecystectomy and adjuvant chemotherapy, postoperative recurrence and distant metastasis are common. The absence of reliable biomarkers for early diagnosis and disease progression prediction underscores an urgent need to identify novel molecular markers and regulatory mechanisms underlying GBC.\u003c/p\u003e \u003cp\u003eAccumulating evidence suggests that genetic, epigenetic, and transcriptomic alterations contribute to GBC initiation and progression. Variations in genes involved in immune and inflammatory signaling, such as TLR4, have been associated with increased GBC risk, particularly in female patients [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e]. Epigenetic dysregulation also plays an important role; for example, elevated expression of the histone acetyltransferase KAT5 promotes GBC cell proliferation, while its knockdown induces caspase-9\u0026ndash;mediated apoptosis and suppresses tumor growth [\u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e]. Transcriptome-wide analyses have revealed widespread dysregulation of mRNAs, lncRNAs, and microRNAs in GBC, implicating key oncogenic pathways including PI3K-Akt, Hedgehog, Wnt, KRAS, androgen receptor, and interferon-γ signaling [\u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e, \u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e]. However, most existing studies focus on individual molecules or pathways, and a systematic understanding of how these alterations integrate at the cellular and microenvironmental levels remains lacking.\u003c/p\u003e \u003cp\u003eTo address these limitations, we performed an integrative analysis combining bulk transcriptome and single-cell RNA sequencing (scRNA-seq) data from public databases[\u003cspan additionalcitationids=\"CR19\" citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR20\" class=\"CitationRef\"\u003e20\u003c/span\u003e]. Through transcriptome-based screening, we identified \u003cem\u003eCLDN1\u003c/em\u003e as a core characteristic gene significantly upregulated in GBC tissues. Subsequent single-cell analysis revealed that epithelial cells act as central regulators within the GBC tumor microenvironment, orchestrating immune cell interactions and signaling networks. Together, our study highlights a potential \u003cem\u003eCLDN1\u003c/em\u003e-driven epithelial\u0026ndash;immune regulatory axis in GBC and provides new insights into disease mechanisms and candidate targets for diagnosis and therapy.\u003c/p\u003e \u003cp\u003e \u003cb\u003eMaterials and methodsData acquisition\u003c/b\u003e The bulk RNA-seq dataset GSE276931 and GSE139682 used in this study were retrieved from Gene Expression Omnibus (GEO) database. Tissue specimens of gallbladder cancer of GSE276931 dataset including 7 tumors and 5 adjacent non-tumors. Tissue specimens of gallbladder cancer of GSE139682 dataset including 10 tumors and 10 adjacent non-tumors. The single-cell RNA dataset used in this study was obtained from a previously published multi-omic analysis of gallbladder cancer [46], which contains 4 gallbladder cancer tissue (GBC) and 4 cholecystitis tissue (CC) single-cell RNA sequencing samples. Chronic cholecystitis tissue has a high degree of similarity in cellular composition with normal gallbladder tissue[21].\u003c/p\u003e \u003cp\u003e \u003cb\u003eDifferential gene expression analysis in bulk-RNAseq\u003c/b\u003eDifferentially expressed genes (DEGs) between the tumors and adjacent non-tumors in the GSE276931 dataset and GSE139682 dataset were visualized using the limma package and met the criteria of P values\u0026thinsp;\u0026lt;\u0026thinsp;0.05 and Fold change log2FC\u0026thinsp;\u0026ge;\u0026thinsp;1 or log2FC\u0026thinsp;\u0026le;\u0026thinsp;\u0026minus;\u0026thinsp;1.The results were visualized through heatmaps and volcano plots to highlight the significantly upregulated and downregulated genes.\u003c/p\u003e \u003cp\u003e \u003cb\u003eWeighted gene co-expression network analysis (WGCNA) in bulk-RNAseq\u003c/b\u003eTo further screen the key genes closely related to gallbladder atrophy, the weighted gene coexpression network analysis (WGCNA)[22] was used to construct the gene coexpression module for the dataset GSE276931. Quality control was conducted by removing genes and samples with excessive missing values or low expression levels. The soft-thresholding power was chosen the scale free topology fit index reached 0.8 for the first time, ensuring that the constructed network approximates a scale-free topology. This thresholding was used to calculate the adjacency matrix representing gene co-expression similarity. Genes were clustered into modules using average linkage hierarchical clustering based on topological overlap, and each module was assigned a unique color. Finally, cluster dendrogram and module-trait relationships were visualized to interpret the results.\u003c/p\u003e \u003cp\u003e \u003cb\u003eGO and KEGG pathway enrichment analysis in bulk-RNAseq\u003c/b\u003eAfter identifying the overlaping genes by intersecting WGCNA results and DEGs from GSE276931 dataset and GSE139682 dataset, respectively, we conducted KEGG and GO enrichment analyses. Functional enrichment analyses, including Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG), were conducted using the R package clusterProfiler (version 4.14.6). Gene annotations and mappings were performed based on the org.Hs.eg.db (version 3.20.0). Specifically, the enrichGO and enrichKEGG functions were employed to identify significantly enriched pathways (P-value\u0026thinsp;\u0026lt;\u0026thinsp;0.05). The KEGG pathway data were sourced from the KEGG API (\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://www.genome.jp/kegg/pathway.html\u003c/span\u003e\u003cspan address=\"https://www.genome.jp/kegg/pathway.html\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e), and GO terms were retrieved via the Gene Ontology resource (\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttp://geneontology.org/).Gen\u003c/span\u003e\u003cspan address=\"http://geneontology.org/).Gen\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003ee Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses[23] were employed to detect the molecular functions and potential pathways of differentially expressed genes (DEGs). GO analysis primarily identifies key features of genes concerning molecular functions(MF), cellular components(CC), and biological processes(BP). KEGG analysis mainly reveals gene enrichment in specific signaling pathways, elucidating the potential roles and mechanisms of genes in disease development. KEGG and GO enrichment analyses were presented using bubble Charts.\u003c/p\u003e \u003cp\u003e \u003cb\u003eMachine learning in bulk-RNAseq\u003c/b\u003eWe used Least Absolute Shrinkage and Selection Operator(LASSO), Support Vector Machine - Recursive Feature Elimination (SVM-RFE), and Random Forest (RF) [24]to the overlaping genes to futher identify key regulatory genes for GBC. Three models were constructed using the R package. LASSO uses L1 regularized linear regression method to achieve feature sparsity through the absolute value of penalty coefficient and automatically screen important variables. The SVM-RFE method selects the genes with the strongest discrimination ability based on the principle of minimum classification error and highest accuracy. RF selected the top 20 key genes according to the importance score of genes.\u003c/p\u003e \u003cp\u003e \u003cb\u003eSingle‑cell sequencing data analysis\u003c/b\u003eThe scRNA-seq data was processed by the Seurat package. Canonical Correlation Analysis (CCA) was used to find Mutual Nearest Neighbours (MNNs)[25]. Cells with more than 2,500 or fewer than 300 gene counts was filtered out. Subsequently, cells exceeding a mitochondrial gene percentage of 5% was filtered out to eliminate low-quality cells. To identify cell clusters, PCA analysis was first performed on the list of highly variable genes. Clustering was then performed using the findclusters function. Finally, Umap was used for visualization. For normalized gene expression data, we listed markers for each cell cluster using the findallmarkers function. A total of 12 cell types were finally obtained.\u003cb\u003eCell\u0026ndash;cell interaction analysis\u003c/b\u003e\u003c/p\u003e \u003cp\u003eTo visualize and analyze intercellular communications, we conducted CellChat analysis[\u003cspan citationid=\"CR26\" class=\"CitationRef\"\u003e26\u003c/span\u003e]. CellChat identified differentially overexpressed ligands and receptorsfor each cell group and associated each interaction with a probability value to quantify communications between the two cell groups mediated by these signaling genes. Significant interactions were identified on the basis of a statistical test that randomly permuted the group labels of cells and then recalculated the interaction probability. Cell interaction with a P value\u0026thinsp;\u0026lt;\u0026thinsp;0.05 was defined as significance.The results were visualized using a Circle plot. \u003cb\u003eDifferential gene expression analysis in scRNA-seq\u003c/b\u003e\u003c/p\u003e \u003cp\u003eTo identify the differential expression genes (DEGs) of epithelial cells, we used model-based analysis of single-cell transcriptomics test with genes detected in a minimum of 10% of all cells, met the criteria of P values\u0026thinsp;\u0026lt;\u0026thinsp;0.05 and |Fold change log2FC| \u0026ge; 1 in the Findmarker function by Seurat.\u003c/p\u003e \u003cp\u003e \u003cb\u003eEnrichment analysis in scRNA-seq\u003c/b\u003eAfter identifying the DEGs from scRNA-seq, we conducted KEGG and GO enrichment analyses.Gene enrichment analysis was conducted using the R package clusterProfiler. KEGG and GO enrichment analyses were presented using bar Charts. Subsequently, we used R software GSVA package to conduct ssGSEA to calculate the inflammation-related pathways score[27]. The result was visualized into heatmaps.\u003c/p\u003e"},{"header":"Results","content":"\u003cp\u003e \u003cb\u003eDifferential expression analysis of genes and functional analysis of DEGs\u003c/b\u003eWe identified a total of 1387 differential expression genes (DEGs) from GSE276931 between the GBC and CC using the limma package with multiple testing correction. Applying a P value of \u0026lt;\u0026thinsp;0.05 and |Fold change log2FC| \u0026ge; 1, 776 genes were significantly upregulated and 611 genes were significantly downregulated. The expression patterns of these DEGs are shown in the volcano plot (Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003eA). The Top50 gene of these DEGs are shown in the heatmap (Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003eB).In GSE139682, a total of 791 DEGs were identified, among them, 195 genes were significantly upregulated and 596 genes were significantly downregulated. The expression patterns of these DEGs are shown in the volcano plot (Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003eC). The Top50 gene of these DEGs are shown in the heatmap (Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003eD).\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cdiv id=\"Sec3\" class=\"Section2\"\u003e \u003ch2\u003eIdentification of GBC-associated co-expression modules and functional signatures via WGCNA\u003c/h2\u003e \u003cp\u003eTo identify critical genes associated with GBC, weighted gene co-expression network analysis (WGCNA) was performed on 1,387 DEGs from the GSE276931 dataset to construct co-expression modules. Using a soft-threshold power of 30 (scale-free R\u003csup\u003e2\u003c/sup\u003e\u0026thinsp;=\u0026thinsp;0.77), a total of 16 co-expression modules were identified (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eA). Among these, the black, yellow, and lavenderblush3 modules exhibited significant positive correlations with GBC (P\u0026thinsp;\u0026lt;\u0026thinsp;0.05), with the black module showing the highest correlation coefficient and the most significant P value (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eB\u0026ndash;C). By intersecting genes within the black module with DEGs from both GSE276931 and GSE139682, we identified 32 overlapping genes (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eD).Functional enrichment analyses indicated that these genes were primarily involved in GO terms related to cell junction organization and epithelial\u0026ndash;mesenchymal transition (EMT). Specifically, \"apical junction complex\" and \"bicellular tight junction\" were significantly enriched in the cellular component category, while \"substrate adhesion-dependent cell spreading,\" \"mitotic metaphase plate congression,\" and \"positive regulation of protein kinase activity\" dominated the biological process category (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eE). Furthermore, KEGG pathway analysis revealed significant enrichment in the \u003cem\u003eIL-17\u003c/em\u003e signaling pathway, tight junctions, and cell adhesion molecules, suggesting that GBC progression is driven by inflammatory-mediated immune responses and junctional remodeling (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eF). Additionally, the enrichment of amino acid and drug metabolism pathways points toward systemic metabolic reprogramming and altered detoxification functions during GBC development.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003e \u003cb\u003eMachine learning and screening of key regulatory genes\u003c/b\u003eTo further screen the key regulatory genes in the process of gallbladder cancer, applied three machine learning methods-LASSO, SVM-RFE, and RF, to the 32 overlapping genes. LASSO identified 3 characteristic genes- \u003cem\u003eCLDN1\u003c/em\u003e, \u003cem\u003eSHANK2\u003c/em\u003e and \u003cem\u003eEFNA1\u003c/em\u003e significantly associated with GBC from top 10 genes through lasso coefficient path analysis and cross validation error (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eA). Random forest analysis selected the top 20 key genes according to the importance score of genes (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eB-C). The SVM-RFE method selected the gene set with the strongest discrimination ability based on the principle of minimum classification error and highest accuracy (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eD). we focused on the intersection of genes consistently identified across all 3 methods, resulting in one common gene: \u003cem\u003eCLDN1\u003c/em\u003e (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eE). The expression level of the \u003cem\u003eCLDN1\u003c/em\u003e gene in the GBC group is significantly higher than that in the adjacent cancer group (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eF).\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003c/div\u003e\n\u003ch3\u003eIntegration and clustering of scRNA-seq data\u003c/h3\u003e\n\u003cp\u003eSingle-cell analysis was performed on openly available datasets. Cells were classified into 12 different distinct clusters following umap visual analysis after quality control and preprocessing of the data (Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003eA). Subsequently, we provided detailed annotations on the cell subpopulations of the GBC group and CC group. By analyzing the expression of established molecular markers, the specific types of cells we have identified were mainly immune cells and structural cells, such as: T cells, monocytes, NK cells, B cells, dendritic cells (DCS), Epithelial cells, Endothelial cells, Mesenchymal cells, Neutrophils, Mast cells and plasma cells, etc. in GBC group and CC group (Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003eB-C). Although the cell types of GBC group and CC group were similar, there were some differences in the proportion and distribution pattern (Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003eE).\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eIn the CC group, T cells dominate, followed by Mononuclear cells and mesenchymal cells, which exhibited stronger cellular immune response characteristics and retains more mesenchymal matrix components in the cellular microenvironment. In the GBC group, mononuclear cells were the predominant population, followed by mesenchymal cells, reflecting enhanced innate immune activity and matrix remodeling. The accuracy of cell type annotation was further confirmed by typical marker gene expression heatmaps (Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003eD).\u003c/p\u003e\n\u003ch3\u003eEpithelial cells were located in the core regulatory position in GBC\u003c/h3\u003e\n\u003cp\u003eTo quantify intercellular communications, we employed CellChat to construct signaling networks within the GBC microenvironment. Epithelial cells emerged as the central hub of the interaction network, exhibiting significant crosstalk with neutrophils, monocytes, NK cells, and stromal cells (Fig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003eA). Compared to the balanced interaction pattern in chronic cholecystitis (CC) (Fig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003eC), GBC tissues showed distinct remodeling of the tumor microenvironment (TME) (Fig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003eB). Notably, interactions between epithelial cells and neutrophils or mesenchymal cells were markedly enhanced in GBC, with the latter showing the most significant intensity, potentially driving TME formation (Fig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003eD).Further investigation using pseudo-temporal trajectory analysis revealed that epithelial cells in GBC exhibited a more extensive distribution compared to CC (Fig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003eE). Specifically, a subset of cells in GBC shifted toward advanced evolutionary stages, suggesting they may have undergone epithelial\u0026ndash;mesenchymal transition (EMT) to promote tumor progression. Collectively, these findings indicate that epithelial cells act as a primary orchestrator of immune regulation and structural remodeling in GBC.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e\n\u003ch3\u003eDifferential expression analysis and functional enrichment of epithelial cells\u003c/h3\u003e\n\u003cp\u003eTo elucidate the molecular characteristics of epithelial cells in GBC, we performed differential expression and functional enrichment analyses. We identified 3,302 significantly upregulated and 2,976 downregulated genes in GBC epithelial cells compared to the CC group (Fig.\u0026nbsp;\u003cspan refid=\"Fig6\" class=\"InternalRef\"\u003e6\u003c/span\u003eA). GO enrichment analysis (Fig.\u0026nbsp;\u003cspan refid=\"Fig6\" class=\"InternalRef\"\u003e6\u003c/span\u003eB) revealed that these genes were primarily associated with neutrophil-mediated immunity, antigen processing and presentation (MHC class I), and cell\u0026ndash;substrate junctions. These findings suggest that GBC epithelial cells actively orchestrate immune responses and facilitate intercellular signaling through specialized membrane structures. KEGG pathway analysis further highlighted the enrichment of phagosome, cell cycle, and endocytosis pathways (Fig.\u0026nbsp;\u003cspan refid=\"Fig6\" class=\"InternalRef\"\u003e6\u003c/span\u003eC), suggesting their contribution to the malignant phenotype of GBC. Notably, inflammatory pathways were consistently enriched across both bulk and single-cell datasets. Subsequent ssGSEA scoring confirmed that GBC epithelial cells possessed significantly higher inflammation scores than those in the CC group (Fig.\u0026nbsp;\u003cspan refid=\"Fig6\" class=\"InternalRef\"\u003e6\u003c/span\u003eD), reflecting a pro-inflammatory state within the tumor microenvironment that may drive disease progression.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003e \u003cb\u003eExpression of\u003c/b\u003e \u003cb\u003eCLDN1\u003c/b\u003e \u003cb\u003ein epithelial cells\u003c/b\u003e\u003c/p\u003e \u003cp\u003eWe expressed the \u003cem\u003eCLDN1\u003c/em\u003e gene overlapped by machine learning in the U-Map plot and found that it is mainly expressed in epithelial cells (Fig.\u0026nbsp;\u003cspan refid=\"Fig7\" class=\"InternalRef\"\u003e7\u003c/span\u003eA-B).Comparing the expression levels of \u003cem\u003eCLDN1\u003c/em\u003e in the epithelial cells of CC group and GBC group, it was found that the expression level in GBC group was significantly higher than that of CC group with P\u0026thinsp;\u0026lt;\u0026thinsp;0.05 (Fig.\u0026nbsp;\u003cspan refid=\"Fig7\" class=\"InternalRef\"\u003e7\u003c/span\u003eC).\u003c/p\u003e \u003cp\u003e \u003c/p\u003e"},{"header":"Discussion","content":"\u003cp\u003eGallbladder cancer (GBC) is typically diagnosed at advanced stages, rendering early detection a formidable clinical challenge. While early-stage diagnosis significantly improves survival, the overall prognosis remains suboptimal, with only a fraction of patients achieving favorable long-term outcomes [\u003cspan citationid=\"CR29\" class=\"CitationRef\"\u003e29\u003c/span\u003e, \u003cspan citationid=\"CR30\" class=\"CitationRef\"\u003e30\u003c/span\u003e]. These therapeutic limitations underscore an urgent imperative to identify robust biomarkers and to elucidate the fundamental biological mechanisms driving GBC progression.\u003c/p\u003e \u003cp\u003eIn this study, we identified \u003cem\u003eCLDN1\u003c/em\u003e as a core molecular feature of GBC and, for the first time, systematically linked its expression to an epithelial-centered immune regulatory network at single-cell resolution. Moving beyond the traditional focus on isolated genes, our findings illuminate the pivotal role of epithelial cells as primary coordinators of tumor progression and immune landscape modulation in GBC.\u003c/p\u003e \u003cp\u003e \u003cem\u003eCLDN1\u003c/em\u003e is a scaffold component of tight junction complexes, which, alongside occludin and junctional adhesion molecules, maintains epithelial barrier integrity and intercellular cohesion [\u003cspan citationid=\"CR31\" class=\"CitationRef\"\u003e31\u003c/span\u003e]. Dysregulated \u003cem\u003eCLDN1\u003c/em\u003e has been implicated in the initiation and invasion of various malignancies, driven by oncogenic signaling, genetic aberrations, and microenvironmental cues. For instance, BRAF V600E mutations upregulate \u003cem\u003eCLDN1\u003c/em\u003e via NF-κB activation in thyroid cancer [\u003cspan citationid=\"CR32\" class=\"CitationRef\"\u003e32\u003c/span\u003e], with parallel associations observed in colorectal polyps [\u003cspan citationid=\"CR33\" class=\"CitationRef\"\u003e33\u003c/span\u003e]. In head and neck squamous cell carcinoma and hepatocellular carcinoma (HCC), \u003cem\u003eCLDN1\u003c/em\u003e overexpression facilitates tumor advancement through Wnt/β-catenin signaling [\u003cspan citationid=\"CR34\" class=\"CitationRef\"\u003e34\u003c/span\u003e], contributing to epithelial\u0026ndash;mesenchymal transition (EMT), metabolic reprogramming, and tumor immune microenvironment (TIME) remodeling [\u003cspan citationid=\"CR35\" class=\"CitationRef\"\u003e35\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eWhile the specific role of \u003cem\u003eCLDN1\u003c/em\u003e in GBC has remained largely elusive, our functional enrichment analyses suggest it exerts analogous oncogenic effects. GO and KEGG pathways were significantly enriched in substrate adhesion, cadherin binding, and tight junction organization\u0026mdash;processes integral to cell migration and metastatic potential [\u003cspan additionalcitationids=\"CR37\" citationid=\"CR36\" class=\"CitationRef\"\u003e36\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR38\" class=\"CitationRef\"\u003e38\u003c/span\u003e]. The disruption of these adhesive architectures often triggers EMT, thereby enhancing tumor aggressiveness [\u003cspan citationid=\"CR37\" class=\"CitationRef\"\u003e37\u003c/span\u003e, \u003cspan citationid=\"CR38\" class=\"CitationRef\"\u003e38\u003c/span\u003e]. Crucially, these functional patterns were predominantly localized within epithelial cells, reinforcing their role as the primary engine of GBC progression.\u003c/p\u003e \u003cp\u003eBeyond structural maintenance, we discovered that epithelial cells possess potent immunomodulatory properties. Our cell\u0026ndash;cell communication analysis identified epithelial cells as a central hub within the interaction network, orchestrating crosstalk between neutrophils, natural killer (NK) cells, and monocytes. Epithelial cells are known to secrete a spectrum of cytokines, such as \u003cem\u003eIL-1, IL-8, IL-18, TNF-α\u003c/em\u003e, and \u003cem\u003eTGF-β\u003c/em\u003e, which act as critical mediators of inflammatory responses and immune polarization [\u003cspan additionalcitationids=\"CR39\" citationid=\"CR38\" class=\"CitationRef\"\u003e38\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR40\" class=\"CitationRef\"\u003e40\u003c/span\u003e]. Furthermore, epithelial-derived metabolites like retinoic acid are essential for maintaining immune homeostasis [\u003cspan citationid=\"CR41\" class=\"CitationRef\"\u003e41\u003c/span\u003e]. Our findings support a paradigm shift: viewing GBC epithelial cells not merely as structural units but as active immunometabolic regulators of the TME.\u003c/p\u003e \u003cp\u003eThe interplay between epithelial-mesenchymal plasticity and the TME is a hallmark of metastasis. The intensified interactions between epithelial, mesenchymal, and endothelial cells observed in our study likely facilitate microenvironmental remodeling and systemic dissemination. This aligns with evidence that microRNA clusters, such as miR-214/miR-3120, can modulate EMT and autophagy to suppress GBC progression [\u003cspan citationid=\"CR42\" class=\"CitationRef\"\u003e42\u003c/span\u003e]. Consequently, this epithelial-centered interaction axis represents a critical vulnerability in GBC pathogenesis that could be exploited for therapeutic intervention.\u003c/p\u003e \u003cp\u003eFinally, our single-cell validation confirmed that \u003cem\u003eCLDN1\u003c/em\u003e is both cell-type specific and significantly upregulated in GBC compared to chronic cholecystitis, emphasizing its diagnostic relevance. Given that \u003cem\u003eCLDN1\u003c/em\u003e correlates with prognosis in colorectal cancer [\u003cspan citationid=\"CR43\" class=\"CitationRef\"\u003e43\u003c/span\u003e], and considering the clinical success of \u003cem\u003eCLDN1\u003c/em\u003e8.2-targeted therapies in gastric cancers [\u003cspan citationid=\"CR44\" class=\"CitationRef\"\u003e44\u003c/span\u003e, \u003cspan citationid=\"CR45\" class=\"CitationRef\"\u003e45\u003c/span\u003e], \u003cem\u003eCLDN1\u003c/em\u003e emerges as a compelling target for GBC. Our data suggest that \u003cem\u003eCLDN1\u003c/em\u003e could serve as a versatile tool for early diagnosis, molecular classification, and the development of precision therapies.\u003c/p\u003e \u003cp\u003eIn conclusion, our study identifies a previously unrecognized \u003cem\u003eCLDN1\u003c/em\u003e\u0026ndash;epithelial\u0026ndash;immune regulatory axis in GBC. These insights not only deepen our understanding of GBC biology but also provide a conceptual framework for developing epithelial-targeted diagnostic and therapeutic strategies.\u003c/p\u003e \u003cp\u003eDespite the insights gained from our multi-omics analysis, this study has several limitations. First, the sample sizes of the primary datasets (GSE276931 and GSE139682) are relatively small, which may affect the generalizability of the findings. Second, while CLDN1 was identified as a key regulator through bioinformatics, further \u003cem\u003ein vitro\u003c/em\u003e and \u003cem\u003ein vivo\u003c/em\u003e experimental validations are required to confirm its functional roles and underlying mechanisms in gallbladder cancer. Future studies with larger clinical cohorts and functional assays are warranted to strengthen these conclusions.\u003c/p\u003e"},{"header":"Conclusion","content":"\u003cp\u003eThis study integrates bulk and single-cell transcriptomic analyses to elucidate key regulatory mechanisms in gallbladder cancer. We demonstrate that epithelial cells act as the central hub of the cell\u0026ndash;cell interaction network, coordinating immune regulation through interactions with neutrophils, natural killer cells, and monocytes, while crosstalk with mesenchymal and endothelial cells may promote tumor microenvironment remodeling and metastasis. Notably, \u003cem\u003eCLDN1\u003c/em\u003e was identified as a core gene predominantly expressed in epithelial cells and may contribute to GBC progression by regulating epithelial integrity, inflammatory responses, and the tumor microenvironment. These findings suggest that \u003cem\u003eCLDN1\u003c/em\u003e and epithelial-centered regulatory networks represent promising candidates for tumor classification, targeted therapy, and prognosis assessment, warranting further functional and clinical validation.\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eAuthors contribution\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eJianbo Han, Yun Xiao, and Dayun Lu conceptualized the study, developed the methodology, and supervised the project and manuscript revision. Zhicheng Pan, Guodong Liu, and Fei Tong executed data collection, performed machine learning-based formal analysis, and drafted the original manuscript. Zhenggen Hu, Cheng Qian, Shijing Xing, and Xinyi Shen contributed to data curation, methodological implementation, and the visualization of research results.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eAll authors read and approved the final manuscript.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eConflict of interests\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe authors declare no competing interests.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eFunding\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe authors declare that no funds, grants, or other support were received during the preparation of this manuscript.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eData Availability Statement\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe datasets analyzed in this study are publicly available in the Gene Expression Omnibus (GEO) repository. The bulk RNA-seq data can be accessed via the following links: \u003cstrong\u003eGSE276931\u003c/strong\u003e (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE276931) and \u003cstrong\u003eGSE139682\u003c/strong\u003e (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE139682). The single-cell RNA-seq data of gallbladder cancer (Reference [46]) is available at [https://zenodo.org/records/15400138]. Functional enrichment resources can be accessed via \u003cstrong\u003eGene Ontology\u003c/strong\u003e (http://geneontology.org/) and \u003cstrong\u003eKEGG\u003c/strong\u003e (https://www.genome.jp/kegg/pathway.html).\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eEthics declaration\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eEthics declaration: Not applicable.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eClinical trial number\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eClinical trial number: not applicable.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eConsent to Participate declaration\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eConsent to Participate declaration: not applicable.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eConsent to Publish declaration\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eConsent to Publish declaration: not applicable.\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\n\u003cli\u003eSong, X., \u003cem\u003eet al\u003c/em\u003e., Overview of current targeted therapy in gallbladder cancer. Signal Transduct Target Ther, 2020. \u003cstrong\u003e5\u003c/strong\u003e(1): p. 230.\u003c/li\u003e\n\u003cli\u003eRawla, P., \u003cem\u003eet al\u003c/em\u003e., Epidemiology of gallbladder cancer. Clin Exp Hepatol, 2019. \u003cstrong\u003e5\u003c/strong\u003e(2): p. 93-102.\u003c/li\u003e\n\u003cli\u003eKrell, R.W. and A.C. Wei, Gallbladder cancer: surgical management. Chin Clin Oncol, 2019. \u003cstrong\u003e8\u003c/strong\u003e(4): p. 36.\u003c/li\u003e\n\u003cli\u003eThomas, T.S., \u003cem\u003eet al\u003c/em\u003e., Advancing age and the risk of bleomycin pulmonary toxicity in a largely older cohort of patients with newly diagnosed Hodgkin Lymphoma. J Geriatr Oncol, 2020. \u003cstrong\u003e11\u003c/strong\u003e(1): p. 69-74.\u003c/li\u003e\n\u003cli\u003eKumar, A., \u003cem\u003eet al\u003c/em\u003e., Gallbladder cancer: Progress in the Indian subcontinent. World J Clin Oncol, 2024. \u003cstrong\u003e15\u003c/strong\u003e(6): p. 695-716.\u003c/li\u003e\n\u003cli\u003eNoel, R., \u003cem\u003eet al\u003c/em\u003e., Does the Frequency of Cholecystectomy Affect the Ensuing Incidence of Gallbladder Cancer in Sweden? A Population-Based Study with a 16-Year Coverage. World J Surg, 2016. \u003cstrong\u003e40\u003c/strong\u003e(1): p. 66-72.\u003c/li\u003e\n\u003cli\u003eVan Dyke, A.L., \u003cem\u003eet al\u003c/em\u003e., Biliary tract cancer incidence and trends in the United States by demographic group, 1999-2013. Cancer, 2019. \u003cstrong\u003e125\u003c/strong\u003e(9): p. 1489-1498.\u003c/li\u003e\n\u003cli\u003eShin, H.R., \u003cem\u003eet al\u003c/em\u003e., Epidemiology of cholangiocarcinoma: an update focusing on risk factors. Cancer Sci, 2010. \u003cstrong\u003e101\u003c/strong\u003e(3): p. 579-85.\u003c/li\u003e\n\u003cli\u003eRawla, P., T. Sunkara, and A. Barsouk, Epidemiology of colorectal cancer: incidence, mortality, survival, and risk factors. Prz Gastroenterol, 2019. \u003cstrong\u003e14\u003c/strong\u003e(2): p. 89-103.\u003c/li\u003e\n\u003cli\u003eDixit, R., \u003cem\u003eet al\u003c/em\u003e., Comparative Analysis of Mutational Profile of Sonic hedgehog Gene in Gallbladder Cancer. Dig Dis Sci, 2017. \u003cstrong\u003e62\u003c/strong\u003e(3): p. 708-714.\u003c/li\u003e\n\u003cli\u003eLee, W. and V.S. Chandan, Gallbladder carcinomas: review and updates on morphology, immunohistochemistry, and staging. Hum Pathol, 2023. \u003cstrong\u003e132\u003c/strong\u003e: p. 149-157.\u003c/li\u003e\n\u003cli\u003eRen, T., \u003cem\u003eet al\u003c/em\u003e., [Analysis of treatment modalities and prognosis of patients with gallbladder cancer in China from 2010 to 2017]. Zhonghua Wai Ke Za Zhi, 2020. \u003cstrong\u003e58\u003c/strong\u003e(9): p. 697-706.\u003c/li\u003e\n\u003cli\u003eDeo, K.B., \u003cem\u003eet al\u003c/em\u003e., Misdiagnosis of carcinoma gall bladder in endemic regions. BMC Surg, 2022. \u003cstrong\u003e22\u003c/strong\u003e(1): p. 343.\u003c/li\u003e\n\u003cli\u003eSeretis, C., \u003cem\u003eet al\u003c/em\u003e., Metaplastic changes in chronic cholecystitis: implications for early diagnosis and surgical intervention to prevent the gallbladder metaplasia-dysplasia-carcinoma sequence. J Clin Med Res, 2014. \u003cstrong\u003e6\u003c/strong\u003e(1): p. 26-9.\u003c/li\u003e\n\u003cli\u003eFeinberg, A.P. and B. Tycko, The history of cancer epigenetics. Nat Rev Cancer, 2004. \u003cstrong\u003e4\u003c/strong\u003e(2): p. 143-53.\u003c/li\u003e\n\u003cli\u003eDixit, R., \u003cem\u003eet al\u003c/em\u003e., Unravelling of the comparative Transcriptomic Profile of Gallbladder Cancer using mRNA sequencing. Mol Biol Rep, 2022. \u003cstrong\u003e49\u003c/strong\u003e(7): p. 6395-6403.\u003c/li\u003e\n\u003cli\u003eSaxena, R., \u003cem\u003eet al\u003c/em\u003e., Next generation sequencing uncovers multiple miRNAs associated molecular targets in gallbladder cancer patients. Sci Rep, 2023. \u003cstrong\u003e13\u003c/strong\u003e(1): p. 19101.\u003c/li\u003e\n\u003cli\u003eOzsolak, F. and P.M. Milos, RNA sequencing: advances, challenges and opportunities. Nat Rev Genet, 2011. \u003cstrong\u003e12\u003c/strong\u003e(2): p. 87-98.\u003c/li\u003e\n\u003cli\u003eZhang, X., F. Zhang, and X. Xu, Single-cell RNA sequencing in exploring the pathogenesis of diabetic retinopathy. Clin Transl Med, 2024. \u003cstrong\u003e14\u003c/strong\u003e(7): p. e1751.\u003c/li\u003e\n\u003cli\u003eVan Hove, I., \u003cem\u003eet al\u003c/em\u003e., Single-cell transcriptome analysis of the Akimba mouse retina reveals cell-type-specific insights into the pathobiology of diabetic retinopathy. Diabetologia, 2020. \u003cstrong\u003e63\u003c/strong\u003e(10): p. 2235-2248.\u003c/li\u003e\n\u003cli\u003eZhou, T., \u003cem\u003eet al\u003c/em\u003e., Multi-omic analysis of gallbladder cancer identifies distinct tumor microenvironments associated with disease progression. Nat Genet, 2025. \u003cstrong\u003e57\u003c/strong\u003e(8): p. 1935-1949.\u003c/li\u003e\n\u003cli\u003eLangfelder, P. and S. Horvath, WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics, 2008. \u003cstrong\u003e9\u003c/strong\u003e: p. 559.\u003c/li\u003e\n\u003cli\u003eKanehisa, M., \u003cem\u003eet al\u003c/em\u003e., KEGG for taxonomy-based analysis of pathways and genomes. Nucleic Acids Res, 2023. \u003cstrong\u003e51\u003c/strong\u003e(D1): p. D587-d592.\u003c/li\u003e\n\u003cli\u003eAlazaidah, R., \u003cem\u003eet al\u003c/em\u003e., Potential of Machine Learning for Predicting Sleep Disorders: A Comprehensive Analysis of Regression and Classification Models. Diagnostics (Basel), 2023. \u003cstrong\u003e14\u003c/strong\u003e(1).\u003c/li\u003e\n\u003cli\u003eSlovin, S., \u003cem\u003eet al\u003c/em\u003e., Single-Cell RNA Sequencing Analysis: A Step-by-Step Overview. Methods Mol Biol, 2021. \u003cstrong\u003e2284\u003c/strong\u003e: p. 343-365.\u003c/li\u003e\n\u003cli\u003eJin, S., \u003cem\u003eet al\u003c/em\u003e., Inference and analysis of cell-cell communication using CellChat. Nat Commun, 2021. \u003cstrong\u003e12\u003c/strong\u003e(1): p. 1088.\u003c/li\u003e\n\u003cli\u003eH\u0026auml;nzelmann, S., R. Castelo, and J. Guinney, GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics, 2013. \u003cstrong\u003e14\u003c/strong\u003e: p. 7.\u003c/li\u003e\n\u003cli\u003eLiu, L., \u003cem\u003eet al\u003c/em\u003e., SIRT3 inhibits gallbladder cancer by induction of AKT-dependent ferroptosis and blockade of epithelial-mesenchymal transition. Cancer Lett, 2021. \u003cstrong\u003e510\u003c/strong\u003e: p. 93-104.\u003c/li\u003e\n\u003cli\u003eFeo, C.F., \u003cem\u003eet al\u003c/em\u003e., Current management of incidental gallbladder cancer: A review. Int J Surg, 2022. \u003cstrong\u003e98\u003c/strong\u003e: p. 106234.\u003c/li\u003e\n\u003cli\u003eRoa, J.C., \u003cem\u003eet al\u003c/em\u003e., Gallbladder cancer. Nat Rev Dis Primers, 2022. \u003cstrong\u003e8\u003c/strong\u003e(1): p. 69.\u003c/li\u003e\n\u003cli\u003eDing, L., \u003cem\u003eet al\u003c/em\u003e., The claudin family of proteins in human malignancy: a clinical perspective. Cancer Manag Res, 2013. \u003cstrong\u003e5\u003c/strong\u003e: p. 367-75.\u003c/li\u003e\n\u003cli\u003eRoss-Orozco, A., \u003cem\u003eet al\u003c/em\u003e., Association between BRAFV600E mutation and gene expression of CLDN1, TIMP1, and KRT19 in papillary thyroid cancer. Mol Cell Endocrinol, 2025. \u003cstrong\u003e607\u003c/strong\u003e: p. 112614.\u003c/li\u003e\n\u003cli\u003eCaruso, M., \u003cem\u003eet al\u003c/em\u003e., Claudin-1 Expression Is Elevated in Colorectal Cancer Precursor Lesions Harboring the BRAF V600E Mutation. Transl Oncol, 2014. \u003cstrong\u003e7\u003c/strong\u003e(4): p. 456-63.\u003c/li\u003e\n\u003cli\u003eKayamori, K., \u003cem\u003eet al\u003c/em\u003e., Role of Stromal Fibroblast-Induced WNT7A Associated with Cancer Cell Migration Through the AKT/CLDN1 Signaling Axis in Oral Squamous Cell Carcinoma. Lab Invest, 2023. \u003cstrong\u003e103\u003c/strong\u003e(10): p. 100228.\u003c/li\u003e\n\u003cli\u003eWang, W., \u003cem\u003eet al\u003c/em\u003e., Claudins and hepatocellular carcinoma. Biomed Pharmacother, 2024. \u003cstrong\u003e171\u003c/strong\u003e: p. 116109.\u003c/li\u003e\n\u003cli\u003eAl Azim, M. and J.S. Di Martino, ECM, integrins, and DDRs: A nexus of cancer progression, therapy, and future directions. Matrix Biol, 2025. \u003cstrong\u003e138\u003c/strong\u003e: p. 27-43.\u003c/li\u003e\n\u003cli\u003eDanilova, N.V. and N.A. Oleynikova, [E-cadherin: structure and functions, role in gastric cancer carcinogenesis]. Arkh Patol, 2023. \u003cstrong\u003e85\u003c/strong\u003e(4): p. 70-77.\u003c/li\u003e\n\u003cli\u003eAndrews, C., M.H. McLean, and S.K. Durum, Cytokine Tuning of Intestinal Epithelial Function. Front Immunol, 2018. \u003cstrong\u003e9\u003c/strong\u003e: p. 1270.\u003c/li\u003e\n\u003cli\u003eScarpa, M., \u003cem\u003eet al\u003c/em\u003e., The epithelial danger signal IL-1\u0026alpha; is a potent activator of fibroblasts and reactivator of intestinal inflammation. Am J Pathol, 2015. \u003cstrong\u003e185\u003c/strong\u003e(6): p. 1624-37.\u003c/li\u003e\n\u003cli\u003eEckmann, L., M.F. Kagnoff, and J. Fierer, Epithelial cells secrete the chemokine interleukin-8 in response to bacterial entry. Infect Immun, 1993. \u003cstrong\u003e61\u003c/strong\u003e(11): p. 4569-74.\u003c/li\u003e\n\u003cli\u003eCao, Y.G., \u003cem\u003eet al\u003c/em\u003e., Faecalibaculum rodentium remodels retinoic acid signaling to govern eosinophil-dependent intestinal epithelial homeostasis. Cell Host Microbe, 2022. \u003cstrong\u003e30\u003c/strong\u003e(9): p. 1295-1310.e8.\u003c/li\u003e\n\u003cli\u003eLi, W., \u003cem\u003eet al\u003c/em\u003e., The microRNA cluster miR-214/miR-3120 prevents tumor cell switching from an epithelial to a mesenchymal-like phenotype and inhibits autophagy in gallbladder cancer. Cell Signal, 2021. \u003cstrong\u003e80\u003c/strong\u003e: p. 109887.\u003c/li\u003e\n\u003cli\u003eOuban, A., Claudin-1 role in colon cancer: An update and a review. Histol Histopathol, 2018. \u003cstrong\u003e33\u003c/strong\u003e(10): p. 1013-1019.\u003c/li\u003e\n\u003cli\u003eNakayama, I., \u003cem\u003eet al\u003c/em\u003e., Claudin 18.2 as a novel therapeutic target. Nat Rev Clin Oncol, 2024. \u003cstrong\u003e21\u003c/strong\u003e(5): p. 354-369.\u003c/li\u003e\n\u003cli\u003eShah, M.A., \u003cem\u003eet al\u003c/em\u003e., Zolbetuximab plus CAPOX in CLDN18.2-positive gastric or gastroesophageal junction adenocarcinoma: the randomized, phase 3 GLOW trial. Nat Med, 2023. \u003cstrong\u003e29\u003c/strong\u003e(8): p. 2133-2141.\u003c/li\u003e\n\u003cli\u003eMulti-omic analysis of gallbladder cancer identifies distinct tumor microenvironments associated with disease progression. https://zenodo.org/records/15400138. Accessed 03 Feb 2026.\u003c/li\u003e\n\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"discover-oncology","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"dion","sideBox":"Learn more about [Discover Oncology](https://www.springer.com/12672)","snPcode":"","submissionUrl":"","title":"Discover Oncology","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"Discover Series","inReviewEnabled":true,"inReviewRevisionsEnabled":true},"keywords":"gallbladder cancer, transcriptome, scRNA sequencing, CLDN1","lastPublishedDoi":"10.21203/rs.3.rs-9088571/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-9088571/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eGallbladder cancer (GBC) is a highly aggressive biliary tract malignancy characterized by poor prognosis and a lack of reliable biomarkers for early diagnosis. Despite its clinical aggressiveness, the molecular drivers and cellular regulatory networks underlying GBC progression remain incompletely characterized. In this study, we identified \u003cem\u003eCLDN1\u003c/em\u003e as a key epithelial-associated gene in GBC and revealed an epithelial-centered immune regulatory network potentially involved in tumor progression. Integrated bulk transcriptome and single-cell RNA sequencing (scRNA-seq) analyses revealed that \u003cem\u003eCLDN1\u003c/em\u003e is markedly upregulated in GBC, with expression localized predominantly in epithelial cells. At the single-cell level, epithelial cells emerged as a central hub of cell\u0026ndash;cell communication, exhibiting extensive interactions with neutrophils, natural killer (NK) cells, and monocytes, thereby associating with immunosuppressive or inflammatory landscape of the tumor microenvironment (TME). Functional enrichment and intercellular interaction analyses suggested that \u003cem\u003eCLDN1\u003c/em\u003e-high epithelial cells promote immune modulation via cytokine secretion and facilitate TME remodeling and metastasis through crosstalk with mesenchymal and endothelial cells. Notably, GBC epithelial cells exhibited significantly higher \u003cem\u003eCLDN1\u003c/em\u003e expression compared to those in chronic cholecystitis, reinforcing its disease-specific relevance. By integrating differential expression analysis, weighted gene co-expression network analysis (WGCNA), and machine learning approaches, \u003cem\u003eCLDN1\u003c/em\u003e was consistently identified as a key molecular feature of GBC. Collectively, our findings suggest that \u003cem\u003eCLDN1\u003c/em\u003e is closely associated with GBC initiation and progression through modulation of epithelial stability and epithelial\u0026ndash;immune crosstalk, highlighting its potential as a biomarker for tumor classification, prognosis, and therapeutic targeting.\u003c/p\u003e","manuscriptTitle":"Multi-omics analysis identifies CLDN1 as a key regulator of gallbladder cancer progression and prognosis","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2026-04-09 12:27:35","doi":"10.21203/rs.3.rs-9088571/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"editorInvitedReview","content":"","date":"2026-05-08T05:27:21+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"42870025316635228591362138488056299578","date":"2026-05-01T02:49:32+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-04-29T14:09:06+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"254360859769294325298253756347437538296","date":"2026-04-29T13:43:31+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-04-27T18:34:31+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"52179102468158413283357104352885093215","date":"2026-04-27T11:21:19+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"78647163512555510462538328611772157359","date":"2026-04-07T10:40:02+00:00","index":"hide","fulltext":""},{"type":"reviewersInvited","content":"","date":"2026-04-03T10:20:27+00:00","index":"","fulltext":""},{"type":"editorInvited","content":"","date":"2026-03-23T05:02:27+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2026-03-23T03:47:48+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2026-03-19T07:33:42+00:00","index":"","fulltext":""},{"type":"submitted","content":"Discover Oncology","date":"2026-03-18T09:17:26+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"discover-oncology","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"dion","sideBox":"Learn more about [Discover Oncology](https://www.springer.com/12672)","snPcode":"","submissionUrl":"","title":"Discover Oncology","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"Discover Series","inReviewEnabled":true,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"08b1054b-b842-459a-8745-3b64e40cb318","owner":[],"postedDate":"April 9th, 2026","published":true,"recentEditorialEvents":[{"type":"editorInvitedReview","content":"","date":"2026-05-08T05:27:21+00:00","index":101,"fulltext":""},{"type":"reviewerAgreed","content":"42870025316635228591362138488056299578","date":"2026-05-01T02:49:32+00:00","index":100,"fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-04-29T14:09:06+00:00","index":99,"fulltext":""},{"type":"reviewerAgreed","content":"254360859769294325298253756347437538296","date":"2026-04-29T13:43:31+00:00","index":98,"fulltext":""}],"rejectedJournal":[],"revision":"","amendment":"","status":"under-review","subjectAreas":[],"tags":[],"updatedAt":"2026-04-09T12:27:35+00:00","versionOfRecord":[],"versionCreatedAt":"2026-04-09 12:27:35","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-9088571","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-9088571","identity":"rs-9088571","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.