Integrative Functional Genomics Analysis of Kaposi Sarcoma Cohorts | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Integrative Functional Genomics Analysis of Kaposi Sarcoma Cohorts Ezequiel Lacunza, Valeria Fink, Julian Naipauer, María E. Salas, and 10 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-6146471/v1 This work is licensed under a CC BY 4.0 License Status: Posted Version 1 posted You are reading this latest preprint version Abstract Kaposi sarcoma (KS) is an AIDS-defining cancer and a significant global health challenge caused by KS-associated herpesvirus (KSHV). NGS-based approaches have profiled KS lesions in a minimal number of studies compared with other neoplastic diseases. Here we present a compiled and harmonized dataset of 131 KS and non-tumor cutaneous samples in the context of their predicted pathway activities, immune infiltrate, KSHV and HIV gene expression profiles, and their associated clinical data representing patient populations from Argentina, United States (USA), and Sub-Saharan Africa cohorts. RNA-seq data from 9 Argentinian KS lesions were generated and integrated with previously published datasets derived from the USA and sub-Saharan African cohorts from Tanzania, Zambia, and Uganda. An unsupervised analysis of 131 KS-related samples allowed us to identify four KS clusters based on their host and KSHV gene expression profiles, immune infiltrate, and the activity of specific signaling pathways. The compiled RNA-seq profile is shared with the research community through the UCSC Xena browser for further visualization, download, and analysis (https://kaposi.xenahubs.net/ ). These resources will allow biologists without bioinformatics knowledge to explore and correlate the host and viral transcriptome in a curated dataset of different KS RNA-seq-based cohorts, which can lead to novel biological insights and biomarker discovery. Kaposi's Sarcoma RNAseq Transcriptome Xena browser Figures Figure 1 Figure 2 Figure 3 1 INTRODUCTION Kaposi sarcoma (KS) is a prominent AIDS-defining malignancy, and a critical global health challenge primarily linked to infection with KS-associated herpesvirus (KSHV) (Mesri et al., 2010 ; Dittmer and Damania 2016 ; Cesarman et al., 2019 ). KSHV contains a complex genome comprising over 80 genes, which are regulated to facilitate either latency with minimal viral expression or lytic replication, resulting in the production of new virions. KSHV can infect epithelial cells, endothelial cells, B cells, and, more recently, it has been found to infect neurons and mesenchymal stem cells (MSCs), further highlighting its diverse pathogenic potential (Mesri et al., 2010 ; Dittmer and Damania 2016 ; Cesarman et al., 2019 ; Naipauer and Mesri 2023 ; Lacunza et al., 2024 ). Among the various forms of KS, the most aggressive variant is observed in individuals with HIV, known as acquired immunodeficiency syndrome-associated KS (AIDS-KS). This form is characterized by widespread dissemination, affecting the skin and visceral organs, including the gastrointestinal tract and lungs (Haverkos et al 1985). Moreover, in men who have sex with men (MSM), the HIV infection rate is 4.9–10.5%, and in transgender women (TGW), HIV reported prevalence is up to 58%. These two populations have disproportionately higher risks of developing KS caused by KSHV (Hutchison et al., 2018 ; AIDS-defining Cancer Project Working Group 2017). NGS-based transcriptomic analyses of KS have revealed significant molecular insights into the pathogenesis of the disease. For example, Tso et al. ( 2018 ) reported alterations in glucose and lipid metabolism, with changes in genes associated with metabolic disorder pathways. A recent study further stratified HIV + KS lesions into two molecular subtypes—one characterized by endothelial and proliferative features and the other enriched in inflammatory transcripts (Moorad et al., 2023 ). Additionally, profiling KS lesions from different anatomical sites has revealed variations in immune composition, angiogenic factors, and KSHV gene expression, underscoring the role of the tumor microenvironment in disease progression (Ramaswami et al., 2023 ). Unsupervised clustering of KSHV gene expression has also identified distinct tumor groups with varying latent and lytic viral profiles (Tso et al., 2018 ; Rose et al., 2018 ; Lidenge et al., 2020 ; Lacunza et al., 2024 ). Furthermore, morphologically distinct KS tumors from the same individual exhibit similar KSHV gene expression patterns, suggesting that viral activation is influenced by the local microenvironment and host immune response (Rose et al., 2018 ). This molecular heterogeneity emphasizes the need for further studies to develop predictive biomarkers and optimize therapeutic strategies for KS. Most of these RNAseq studies come from subjects in sub-Sahara Africa where KSHV seroprevalence is notably high, with over 50% of the population exhibiting detectable anti-KSHV antibodies (Uldrik et al., 2011; Damania and Dittmer 2023 ). This raises the need for a broader integrative analysis including diverse cohorts to capture the global spectrum of KS pathology. This study aimed to integrate RNAseq-based gene expression profiles from four distinct cohorts of KS covering North America, South American, and sub-Saharan Africa to enhance our understanding of the disease. Additionally, we intend to share these valuable datasets with the research community through the UCSC Xena resource, promoting further exploration and discovery in the field of KS research. 2 MATERIALS AND METHODS 2.1 Sample Collection and RNA Sequencing Nine KS skin lesions were collected within an Argentinian study that included men at birth with or without HIV, recruited at Fundación Huésped and Hospitals network as part of our collaborative U54 consortium (Miami CFAR SCCC - Argentina Consortium for Research and Training in virally induced AIDS-Malignancies). Participants were > 18 years old (median: 42 years; range: 34–62 years). Approval of the protocol and study-related documents was obtained from the Ethics Committee. All participants included in this study signed informed consent before undergoing any study procedures. Total RNA was isolated from KS samples preserved in RNAlater using the miRNeasy Tissue/Cells Advanced Kits (Qiagen) following standard manufacturer's protocol. RNA concentration and integrity were measured on an Agilent 2100 Bioanalyzer (Agilent Technologies). RNA samples with RNA integrity number (RIN) over 5 were considered for RNA sequencing. The RNA samples were processed for directional RNA-seq library construction using the Illumina Total RNA Prep with Ribo-Zero Plus library preparation kit according to the manufacturer's protocol. We performed 101 nt paired-end sequencing using an Illumina Novaseq 6000 platform and obtained ~ 60 million clusters per sample with 92% >Q30. The RNAseq raw data has been submitted to NCBI GEO database with accession number GSE271303. 2.2 RNAseq data pre-processing and integration of Kaposi's Sarcoma cohorts RNAseq raw data were retrieved from GEO/SRA using the SRA Toolkit ( https://github.com/ncbi/sra-tools ) from GSE147704 (Tanzania & Zambia cohort: KS = 24, Control = 24, and Normal = 3), GSE241095 (USA cohort: KS = 10 and Control = 10) and SRP486827 (Uganda cohort: KS = 51). The raw short-read sequences generated from the Argentinian KS samples and retrieved from GEO/SRA were quality-checked and trimmed to remove adapters and low-quality bases using the Rfastp R/Bioconductor package. The preprocessed reads were then aligned and mapped to the human genome reference GRCh38 using the Subread aligner algorithm provided by the Rsubread R/Bioconductor package. The remaining non-human reads were aligned to the KSHV reference sequence NC_009333. The KSHV reference genome used in our study for read sequence alignment and mapping belongs to the Human herpesvirus 8 strain GK18. It provides a valuable complete genome for global transcriptome analysis, but it does not capture the full extent of KSHV's genetic diversity, including the highly diverse K1 and K15 subtypes present in sub-Saharan populations. The aligned reads (BAM files) from each sample were used to calculate gene expression abundance at the whole-genome level using the featureCounts function provided by the Rsubread package. The raw read counts for each cohort were integrated, and batch effects were diagnosed using the BatchQC R/Bioconductor package (Manimaran et al., 2016 ), followed by batch effects adjustment with the ComBat-seq algorithm from the "sva" R/Bioconductor package (Zhang et al., 2020 ). Briefly, ComBat-seq algorithm implements a negative binomial regression model to estimate batch effects based on the count matrix obtained for each RNA-seq study allowing us to include the condition variable (control or KS) as covariates in the regression model behind this method to account for the impact of batch and retaining biological signal. 2.3 KS and KSHV transcriptome analysis of the compiled dataset The transcriptome profile of the compiled batch-effect adjusted KS dataset consists of 131 KS and non-tumor cutaneous samples and their associated metadata. To identify differentially expressed genes between KS and controls or between KS groups (e.g.: from HIV + vs HIV- or with or without ART, etc.), we computed fold changes and adjusted p-values using the edgeR R/Bioconductor package based on the normalized log2-based count per million values. Genes showing a log-fold change greater than 1 and an adjusted p-value below 0.05 were considered significantly differentially expressed. Functional enrichment analysis and Gene Set Variation Analysis (GSVA) of differentially expressed genes were performed with the clusterProfiler and GSVA R packages. Tumor immune cell infiltration scores were estimated with the ABIS algorithm and the MCP-counter tool from the immunedeconv ( https://github.com/omnideconv/immunedeconv ) and MCPcounter ( https://github.com/ebecht/MCPcounter ) R packages, respectively, on normalized count matrices. The scores, based on transcriptomic markers that are strongly, specifically, and stably expressed in a unique cell population, are proportional to the abundance of each population in the tumor, enabling intersample comparison and large cohort analyses (Becht et al., 2016 ). Unsupervised hierarchical clustering analysis and heatmaps representations were performed with the MultiExperimentViewer (MeV 4.9.0) software. To determine the optimal number of clusters, we applied Principal Component Analysis (PCA) alongside the NbClust package, which evaluates the number of clusters using 30 different indices( https://github.com/cran/NbClust/blob/master/R/NbClust.R ). The relationship between a categorical and a quantitative variable was assessed using the Wilcoxon rank-sum test or the Kruskal–Wallis test. Pearson correlation was used for two quantitative variables. P values were corrected for multiple testing using the Bonferroni or Benjamini–Hochberg methods. 2.4 Data sharing and visualization through a UCSC Xena hub UCSC Xena is a high-performance visualization and analysis tool for both large public repositories and private datasets (Goldman et al., 2020 ). UCSC Xena has two components: the front-end Xena Browser and the back-end Xena Hubs. We have deployed a public Xena Hub (called KS-omics) to host and share the functional genomics data of the compiled KS dataset with the research community ( https://kaposi.xenahubs.net/ ). Briefly, Xena Browser allows biologists without bioinformatics knowledge to explore data with a variety of visualizations and analytic tools. 3 RESULTS AND DISCUSSION 3.1 Integrative Analysis of Host and Viral Gene Expression Profiles in KS Using Multi-Cohort RNA Sequencing To perform an integrative analysis of host and viral gene expression profiles in KS, we combined 94 KS with 37 non-tumor cutaneous cases obtained from four cohorts, one of them generated by us and three available in a public database (Figure 1A ). Clinical variables considered for the study included age, ART treatment, sex at birth, HIV status, origin (endemic/epidemic), and KS morphotype. To generate a homogeneous, uniformly curated, and preprocessed dataset, the RNAseq raw data was directly retrieved from GEO/SRA using the SRA Toolkit and equally preprocessed by our bioinformatics pipeline. Additionally, raw read count matrices from each cohort were integrated. Variation analysis (Figure S1 ) showed that batch effects explained a larger proportion of variation than the condition (control or KS), highlighting the need for batch correction. Batch effects were then adjusted using the ComBat-seq algorithm (Figure S1 ). Importantly, the transcriptomic profile of the four independent KS cohorts were generated using Illumina sequencing platforms (Table 1 ). Table 1. Characteristics of the study subjects included in each KS cohort. Characteristics Tanzania / Zambia GSE147704 Uganda SRP486827 USA GSE241095 Argentina GSE271303 # of Cases 24 KS 24 Controls 3 Normal skin 51 KS 10 KS 10 Controls 9 KS Age (median, IQR) 33 (29, 42) 35 (26, 46) 43 (34, 49) 42 (34, 50) Sex at birth Male Female n/a 16 8 - 44 5 2 10 0 - 9 0 - Race White Black Hispanic 0 24 (100%) 0 0 51 (100%) 0 2 (20%) 4 (40%) 4 (40%) 0 0 9 (100) HIV characteristics HIV co-infection CD4 T-cell count (cells/ul) HIV Viral load (cps/mL) On ART at biopsy 18 (75%) na 12000 18 (75%) 51 (100%) na na 0 (0%) 9 (90%) 39 74000 10 (100%) 9 (90%) 144 125000 3 (33%) Figure 1B shows the scatter plot of cases projected on the first two dimensions in unadjusted data, and in data adjusted by ComBat-seq. We observed a strong batch effect in the unadjusted data, which was well addressed by ComBat-seq. A favorable adjustment would pool control/non-tumor cutaneous samples from the cohorts (batches), while keeping all KS lesions separated from the controls and from each other. In the multidimensional plot of ComBat-seq adjusted data, we observed the expected pattern of data if there were no batch effects, in which the control/normal cases are clustered together, while the KS cases from four cohorts are scattered at different locations. These results suggest a successful adjustment of batch effect from ComBat-seq. UCSC Xena is a high-performance visualization and analysis tool for both large public repositories and private datasets (Goldman et al., 2020). In this sense, we have implemented a public cloud-based backend Xena Hubs hosting the human and KSHV batch effect adjusted gene expression, immune and pathway activity profiles as well as their associated phenotypic data obtained from the four KS cohorts (https://kaposi.xenahubs.net/ ) . The frontend Xena Browser provides a wide variety of visualizations and analyses including scatter plots, bar graphs, statistical tests, genomic signatures, as well as a unique Visual Spreadsheet view (Figure 1C ). The Xena Visual Spreadsheet was designed to enable and enhance integration across diverse data modalities, providing researchers with a more biologically complete understanding of genomic events and tumor biology. The UCSC Xena browser enables users to explore functional genomic datasets for correlations between genomic and phenotypic variables. It also allows for differential expression analysis between user-defined sample groups (Figure 1D ) and functional enrichment analysis of differentially expressed genes (Figure 1E ). 3.2 Unsupervised clustering reveals distinct transcriptomic and immune profiles in KS lesions. To characterize the transcriptomic, immune, and functional profiles of the samples, we employed an unsupervised clustering approach. We defined clusters based on the entire transcriptome (host and KSHV). This approach allows us to dissect the contributions of both host and viral factors to the overall expression landscape, providing a comprehensive understanding of the interplay between the host's immune response and viral activity in KS. The unsupervised analysis revealed a clear distinction between KS lesions and non-tumor controls (Figure 2A ), underscoring the impact of KSHV on the host transcriptome (Tso et al., 2018). To further validate the optimal number of clusters, we applied the NbClust package. The majority rule suggested either 2 or 4 clusters as the most likely options. Considering this, along with the structure observed in the dendrogram and PCA analysis, we determined that 4 clusters were the most appropriate choice for our data (Figure S2 ) We identified at least three distinct groups of lesions, designated as clusters 1 (C1), 3 (C3), and 4 (C4). Uninvolved skin from KS patients and skin from non-KS subjects were classified within the control cluster (C2), which comprised two subclusters and included two KS cases (Figure 2A ). Cluster C1 included most endemic tumors, along with some epidemic cases, while clusters C3 and C4 were primarily composed of epidemic lesions (p<0.001; Data S1 ). Importantly, no significant associations were observed between lesion clusters and HIV status or ART treatment (Data S1 ). However, a notable association with KS morphotype was found, with cluster C4 exhibiting a higher percentage of macular lesions (p<0.01; Data S1 ). We defined the immune fraction profile using the ABIS algorithm, which demonstrated a significant immune infiltrate enrichment in the lesions compared to the controls (Figure 2B ). Clusters C1 and C3 showed the highest percentage of immune fractions, indicating a greater infiltrate compared to clusters C2 and C4. Cluster C1 was characterized by a significant enrichment of memory CD4 T lymphocytes, whereas naive CD4 T cells were predominant in the control group (C2) (Figure S3 ). In contrast, cluster C3 exhibited a notable enrichment of plasmablasts and memory B cells (Figure S3 ). Additionally, all three lesion clusters showed enrichment in CD8 memory T cells compared to the control group (Figure S3 ). To strengthen the analysis, we examined the composition of the sample microenvironment using the microenvironment cell populations (MCP)-counter method (Becht et al., 2016). The cell composition varied significantly between clusters (Figure S3 ). Cluster C1 displayed elevated expression of genes specific to immune populations, including T cells, CD8+ T cells, natural killer (NK) cells, and cytotoxic lymphocytes. However, it was primarily characterized by a significant upregulation of endothelial cell-related genes (Figure S3 ). Cluster C3 showed elevated expression of most immune cell populations, with B lineage signatures and T cells as key determinants. Cluster C4 displayed a generally low immune profile, similar to the control cluster (C2), but differed by a higher expression of endothelial cell markers (Figure S3 ). These results complemented the findings by Tso et al., who reported a significant infiltrate of immune cells in the lesions, primarily of B cells, although T cells were not significantly present. However, they predicted the activation of numerous chemokines that play an important role in the recruitment of T cells (Tso et al., 2018). Additionally, Lidenge et al. have shown that, despite the upregulation of chemokines like CxCL-9, immune cells, including CD4+ T cells, CD8+ T cells, and NK cells, are not co-localized with KSHV-infected regions (Lidenge et al., 2020). This discrepancy highlights the differences between transcriptomic-based immune profiling and protein-level detection, suggesting that immune cells may be recruited to the tumor microenvironment but may not be localized to KSHV-infected areas. In addition, the marginal tissues captured in biopsies likely contain immune populations that are present but infiltrate the tumor at low frequencies. Given the increasing relevance of immune checkpoint inhibitors (ICIs) in the treatment of KS, we evaluated the transcriptomic expression levels of a set of ICIs. A distinct pattern of expression levels was observed, correlating with the infiltrate (Figure 2C ). In this context, clusters C1 and C3 were the most enriched in these genes. These results, while needing further validation, along with the immune profile of the clusters, are relevant in the context of the KS microenvironment and immune checkpoint ICIs immunotherapies. They are also consistent with previous studies. For instance, Petitprez et al. (2018) established an immune-based classification of sarcoma immune classes (SICs) and demonstrated that the immune-high group, characterized by the presence of B cell-rich lineage, exhibited a strong response to PD1 immunotherapy. The tumor microenvironment in KS is known to be composed of activated B cells and tumor-associated macrophages (Joest et al., 2020). More recently, it has been shown that KS, along with other sarcoma subtypes, displayed the highest response rates to ICIs and longest survival, with PD-L1 expression of ≥1% being associated with increased response to ICIs (Lee et al., 2024). 3.3 Pathways activity analysis. The pathway activity analysis based on Gene Ontology (GO) and Hallmarks revealed a reduction in epithelial differentiation, including keratinocyte differentiation and epithelial-mesenchymal transition (EMT), in lesions compared to controls (Figure 2D ). Additionally, decreases in metabolic processes such as lipid metabolism, cholesterol homeostasis, and response to estrogen were observed (Figure 2E ). This finding aligns with previous studies demonstrating that KSHV plays a significant role in cellular transformation and metabolic reprogramming, both of which are key factors in the pathogenesis of KS (Zhu et al., 2014; Tso et al., 2018). Moreover, the impaired EMT is particularly important in KS, as it promotes the formation of new blood vessels and supports tumor growth, emphasizing the complex relationship between these processes in the disease (Naipauer and Mesri 2023; Lacunza et al., 2024). Furthermore, the analysis revealed higher levels of immune pathways—such as adaptive immune responses, T cell differentiation, and B cell differentiation—in the lesions of clusters C1 and C3 compared to cluster C4. KS lesions in cluster C3 exhibited more pronounced humoral immune related pathways than those in cluster C1, which corresponds with its higher proportion of B cells (Figure 2D , Figure S3 ). Since KSHV has a known tropism for B cells and can infect them, leading to lymphoproliferative lesions, cluster C3, enriched in B cell lineages, may reflect this interaction (Rappocciolo et al., 2008; Ballon et al., 2011; Nicol et al., 2016). However, the transcriptomic data does not clarify whether these B cells are directly infected by KSHV or are merely part of the recruited immune response. Lesions in cluster C4 displayed a pattern more similar to non-tumor controls, showing lower enrichment levels across most pathways compared to clusters C1 and C3 (Figure 2D ). However, these lesions exhibited enrichment in pathways related to innate antiviral responses, whereas tumors in clusters C1 and C3 were associated with broader and more robust immune responses (Figure 2D ). Cluster C1 exhibited lesions with a molecular pattern of pathway activation characteristic of KS, marked by increased proliferative activity (e.g., cell cycle, mitotic spindle) and activation of the PI3K/Akt/mTOR and IL6/JAK/STAT3 pathways (Figure 2 D , E ). Both clusters C1 and C3 also displayed angiogenic activity (Figure 2 D, E ). These differences in immune and pathways profiling may have important implications for disease progression and patient response. 3.4 Gene expression profiling analysis. To identify the representative genes or signatures associated with the functional processes in each lesion cluster, we conducted differential expression analysis, comparing controls (C2) with lesions (C1, C3, and C4), as well as among the lesion groups themselves. Cluster C1 exhibited the highest number of dysregulated genes compared to non-tumor samples, with the majority of KSHV genes expressed (Figure S4 , Table S1 ). KSHV genes K15 , K13, ORF72, ORF75, vIRF-2 , and vIRF-4 were among the top upregulated genes. Among the numerous affected host genes, those linked to extracellular matrix remodeling stood out, including several metalloproteinases and collagens (e.g MMP9-17, ADAMTS54-S7, COL4A1 ), along with significant genes such as ITGB1, FLT1, FLT2, PDGFA, PDGFB, AKT3, ANGPT2, CXCL10, ROCK2, IFNG , and CXCR4 , all of which are associated with processes relevant to KS, such as endothelial differentiation, angiogenesis, the PI3K/Akt pathway, and cytokine/chemokine cascades (Cesarman et al., 2019). Additionally, several genes related to cell proliferation (e.g. PLK1 , BUB1B , TTK ) and T cell lymphocytes (e.g. CD86, CD274, CCR7) were also upregulated. There was also a marked decrease in the expression of several genes linked to protein biosynthesis (ribosomal proteins), cellular respiration (e.g. NDUFA1 , NDUFA3, CYC1 ), and lipid metabolism ( ADIPOQ , ACOX1 , ACOX2 , CYP4B1 ) This underscores the substantial impact of KSHV on host cell metabolism, particularly under hypoxic conditions (Delgado et al., 2010; Singh et al., 2018; Méndez-Solís et al., 2021). When comparing cluster C3 with controls (C2), a similar trend emerged, with a high expression of KSHV genes and a notable impact on host genes, though to a lesser extent than in cluster C1 (Figure S4 , Table S1 ). In cluster C3, there was a prominent enrichment of genes associated with lymphocyte activation and proliferation (e.g., CD80 , ITK , CD84 , CD7 , CD28 , CD247 , TLR9 , LYN , CD22 ). Cluster C4 also exhibited expression of most KSHV genes when compared to non-tumor controls; however, the host gene enrichment was primarily linked to the viral infection response (e.g., IFIT1 , IFIT2 , IFIT3 , IRF1 , IRF7 , IF1 6) (Table S1 ). The comparison between lesion clusters revealed distinct differences. In the cluster C1 vs. cluster C3 analysis, cluster C1 exhibits gene expression profiles more like KS tumors, specifically angiogenic genes (e.g. TX1, ANGPTL3, PROX1, SCUBE1, KDR, ROBO4, PDGFB, SERPINA5 ), mitotic genes (e.g. CENPE, CDK1, KIF3C, BRCA2, TOP2A , etc.), and remodeling matrix components including metalloproteinases (Lacunza et al., 2024). Conversely, cluster C3 was distinguished by greater differentiation (higher keratin expression) and increased metabolic activity, evidenced by a higher expression of lipid metabolism-related genes and xenobiotic metabolism genes with activation of numerous cytochrome P450 genes (Table S1 ). The comparison between clusters C1 and C4 reveals distinct metabolic and immune response profiles in the lesions. Cluster C4 shows elevated expression of oxidative and xenobiotic metabolism genes, along with ERBB signaling genes (e.g., ERBB2 , ERBB3 , ERBB4 , EGFR , EREG ), suggesting an adaptive response to environmental stressors and a focus on cellular survival and proliferation (Table S1 ). This points to potential metabolic reprogramming that could support tumorigenesis. The ERBB family of kinases has recently been implicated in regulating the KSHV latent-to-lytic replication switch, emphasizing their role in KSHV-related pathogenesis (Olson et al., 2023). In contrast, cluster C1 was marked by a strong activation of immune-related genes, particularly those associated with lymphocyte activation and inflammatory responses (e.g., CXCL8 , CXCL13 , CXCL2 , IL6 , IFNG , IL10 , CCR5 , CXCR4 ), indicating a robust immune environment potentially engaged in combating the tumor (Table S1 ). Additionally, the comparison of clusters C3 and C4 revealed a lower number of differentially expressed genes (Figure S2 ). C3 exhibited higher expression of genes linked to T and B cell activation (e.g., CD84 , CD86 , CD70 , CD79A , CD79B , IL10 ), while C4 showed increased expression of genes involved in antiviral responses (e.g., IFI44 , IFIH1 , IFI27 , RSAD2 , XAF1 , OAS1 , OAS2 , OAS3 ) (Table S1 ). In summary, the transcriptomic and functional analysis identifies three distinct groups of KS. Cluster C1, primarily composed of endemic cases with some epidemic ones, represents tumors characterized by poorly differentiated, endothelial, and proliferative phenotypes. This cluster also showed enrichment of the PI3K/mTOR pathway and a strong presence of memory CD4 T lymphocytes in the immune fraction. Cluster C3, composed of predominantly epidemic cases, showed greater metabolic activity and was enriched in B cells. In contrast, cluster C4 had the least immune infiltration and highest differentiation, with strong activation of antiviral defense genes and the macular morphotype, typical of early-stage KS (Cesarman et al., 2019). Overall, consistent with the findings of Lidenge et al. (2020), which showed that endemic and epidemic KS transcriptomes are highly similar, with enhanced dysregulation of tumorigenesis and immune functions in endemic KS, we observed a segregation between endemic (C1) and epidemic (C3 and C4) cases. This distinction was driven by the magnitude and the specific subsets of expression changes. C1 (primarily endemic) displayed stronger activation of angiogenesis, endothelial differentiation, immune-related genes, cell proliferation, and tumorigenic pathways such as PI3K/Akt/mTOR. In contrast, C3 (epidemic) exhibited higher metabolic activity, with increased expression of genes related to lipid and xenobiotic metabolism, while C4 showed features typical of an early-stage viral response, including activation of antiviral defense pathways. These differences highlight functional divergence between C1, C3, and C4, supporting the clustering of endemic and epidemic KS lesions based on distinct immune and metabolic signatures. 3.5 KSHV transcriptome analysis. To evaluate the relationship between the clusters identified from the whole transcriptome and KSHV expression levels while minimizing variability from host gene expression, we chose to analyze the KSHV transcriptome alone between clusters. This approach clarified the viral contributions to the disease and helped to determine if different viral expressions correlate with the observed clusters. First, it is important to note that there were no significant differences in overall KSHV expression levels (measured by total mapped KSHV reads, TMKR) across the different cohort populations (Argentina, Tanzania/Zambia, and Uganda; p>0.05; Figure S5 ). However, when comparing the clusters, cluster C1 contained tumors with the highest levels of KSHV expression, as indicated by the TMKR, compared to clusters C3 and C4 (p<0.01; Figure 3 A ). This difference was further supported by the elevated levels of LANA in cluster C1 (Figure 3 A ). The analysis of differentially expressed genes among the lesion clusters confirmed that cluster C1 exhibited the highest expression levels of genes such as ORF72 , ORF73 ( LANA ), ORF74 , ORF75 , as well as K12 , K13 , and K15 , which were among the most highly expressed transcripts across all clusters. These genes have been implicated in various functions, including immune evasion, angiogenesis, cell proliferation, and apoptosis regulation. LANA is known to stabilize the viral genome within host cells, facilitating persistence and contributing to tumorigenesis (Uppal et al., 2014; Nakajima et al; 2024). ORF74 is a viral GPCR that binds various human chemokines ( CXCL1 , CXCL8 ) contributing to immune evasion and consequently viral dissemination (de Munnik et al., 2015; Medina et al., 2020). Similarly, K12 and K13 play roles in modulating host immune responses (Liang et al., 2008), suggesting that the elevated expression of these genes in cluster C1 may correlate with more aggressive tumor behavior or enhanced viral persistence. Although no significant differences were observed in the total mapped reads between clusters C3 and C4 (Figure 3A ), these clusters were differentiated by the expression of specific genes, including LANA and RTA ( ORF50 ). Additionally, lytic genes such as the viral Bcl-2 ( ORF16 ), which inhibits apoptosis, the protease ( ORF17 ) associated with capsid maturation, and the gene expression modulator ( ORF18 ) were all more abundantly expressed in C3 compared to C4 (Figure 3B ; Table S2 ).This suggests that KSHV is contributing to the intrinsic transcriptomic differences among these clusters of epidemic tumors, potentially influencing their pathogenic behavior and responses to treatment. The distinct expression profiles of these genes highlight the complex interplay between viral replication and host cell dynamics in KS. We conducted further correlation analyses to examine the relationship between viral tissue load (quantified via TMKR) and KS morphotypes. The macular group exhibited a significantly lower viral load compared to the plaque and nodular groups (Figure S5 ). Although no significant differences were found between the plaque, nodular, and fungating groups, an increasing trend in mean TMKR values was observed from macular to fungating lesions (Figure S5 ). These findings correlate with the observation that cluster C4, primarily composed of macular lesions, exhibited lower viral load, consistent with the early stage of these lesions. This suggests an association between viral load and lesion stage in KSHV-driven tumorigenesis. We also explored the relationship between immune infiltration patterns and viral loads across clusters (Figure S5) . Linear regression analysis revealed that higher KSHV loads were associated with a decrease in naïve CD4+ T cells and an increase in memory CD4+ T cells, particularly in Cluster C1. A similar trend, though weaker, was observed for CD8+ T cells, with memory CD8+ T cells showing a positive association with KSHV load. No significant associations were found between KSHV load and B cell subsets, suggesting that B cell enrichment in cluster C3 may be an intrinsic feature of these lesions, independent of KSHV load. While our study provides valuable insights into the molecular diversity of KS, several limitations should be acknowledged. First, RNA-seq data, while powerful for transcriptomic profiling, may not fully capture the presence and spatial distribution of immune cells within the tumor microenvironment. Immune profile algorithms infer immune cell composition based on gene expression rather than direct cellular localization, which may not reflect the actual immune cell presence in KS lesions. Additionally, although our findings align with and expand upon previous transcriptomic studies that identified varying KS subtypes based on proliferative, angiogenic, or metabolic pathways, the classification of KS lesions remains complex. The number of subtypes identified across studies has ranged from two to five, highlighting the heterogeneity of KS tumors (Tso et al., 2018; Rose et al., 2018; Lidenge et al., 2020; Ramaswami et al., 2023; Moorad et al., 2023). Despite these limitations, our study’s strength lies in integrating diverse cohorts, providing a more comprehensive view of KS pathology across different populations. Further validation, including spatial transcriptomics, will be necessary to refine our understanding of KS molecular landscapes and their therapeutic implications. 4 CONCLUSIONS In summary, this is the first study to integrate transcriptomic profiles of various cohorts of KS. The RNA-seq profile generated from the Argentinian KS samples compiled with previously published datasets as well as their associated metadata and predicted pathway activities, immune infiltrate, host and viral gene expression profiles are shared with the research community through the development of a KS-omics resource leveraging UCSC Xena browser. This resource will allow biologists without bioinformatics knowledge to explore and correlate the host and viral transcriptomic profiles in a compiled and harmonized dataset to develop novel biological insights and discover biomarkers. Our comprehensive analysis enabled us to define and characterize three distinct lesion groups that reflect both the biology of the virus and the intrinsic molecular characteristics of the lesions. Notably, while endemic lesions were primarily concentrated in cluster C1, epidemic lesions were distributed across all three clusters. Epidemic lesions in clusters C3 and C4 did not show significant differences in overall KSHV expression but differed in their intrinsic functional molecular characteristics. Cluster C3 exhibited a high level of B cell infiltration and ICIs expression, while cluster C4 was characterized by an enrichment of genes involved in early response against viral infection. Whether these different clusters imply differences in treatment approaches or response, it still needs to be explored. Thus, we defined three groups of KS: endothelial, proliferative and well-defined KS tumors (C1); B-cell-enriched and metabolic tumors (C3); and less immunogenic, early-stage KS tumors (C4). Overall, our findings provide valuable insights into the molecular diversity of KS and underscore the potential for tailored therapeutic strategies based on lesion characteristics. Declarations Data availability The raw data have been submitted to NCBI GEO database with accession number GSE271303. The rest of the data are available from the corresponding author upon reasonable request. Acknowledgements The authors want to acknowledge the invaluable contributions from all study participants and from all the research team at Fundación Huésped, where participants were recruited. Authors contributions EL: Investigation, formal analysis, writing the article. VF, MIF: Resources and logistics for sample and clinical data collection. MES, AMG, SW: Methodology, research assistance, clinical data of participants. MJG and JZ: UCSC Xena. JN and EC: Advised on the study and revised the manuscript. PC, OC, JCR: Resources. MCA: Conceived the study, supervision, formal analysis, writing the article. All authors read and approved the final manuscript. Funding Declaration This work was supported by the National Cancer Institute of the National Institutes of Health under Award Number U54CA221208 (OC and JCR), P30CA240139 (JCR), 5U24CA258407 (UCSC Xena Browser), and the National University of La Plata M250 I+D grant (MCA). Ethics declarations MJG is a consultant for the Fred Hutchinson Cancer Research Center. The other authors declare no competing interests. References AIDS-defining Cancer Project Working Group for IeDEA and COHERE in EuroCoord. Comparison of Kaposi Sarcoma Risk in Human Immunodeficiency Virus-Positive Adults Across 5 Continents: A Multiregional Multicohort Study. Clin Infect Dis. 2017 Oct 15;65(8):1316-1326. doi: 10.1093/cid/cix480. PMID: 28531260; PMCID: PMC5850623 Ballon G, Chen K, Perez R, Tam W, Cesarman E. 2011. Kaposi sarcoma herpesvirus (KSHV) vFLIP oncoprotein induces B cell transdifferentiation and tumorigenesis in mice. J Clin Invest 121:1141–1153. doi: 10.1172/JCI44417. Becht, E. et al. Estimating the population abundance of tissue-infiltrating immune and stromal cell populations using gene expression. Genome Biol. 17, 218 (2016). Cesarman E, Damania B, Krown SE, Martin J, Bower M, Whitby D. Kaposi sarcoma. Nature reviews Disease primers. 2019 Jan 31;5(1):9. Damania B, Dittmer DP. Today's Kaposi sarcoma is not the same as it was 40 years ago, or is it? J Med Virol. 2023 May;95(5):e28773. doi: 10.1002/jmv.28773. PMID: 37212317; PMCID: PMC10266714. de Munnik SM, Kooistra AJ, van Offenbeek J, Nijmeijer S, de Graaf C, Smit MJ, Leurs R, Vischer HF. The Viral G Protein-Coupled Receptor ORF74 Hijacks β-Arrestins for Endocytic Trafficking in Response to Human Chemokines. PLoS One. 2015 Apr 20;10(4):e0124486. doi: 10.1371/journal.pone.0124486. Delgado T, Carroll PA, Punjabi AS, Margineantu D, Hockenbery DM, et al. (2010) Induction of the Warburg effect by Kaposi’s sarcoma herpesvirus is required for the maintenance of latently infected endothelial cells. Proc Natl Acad Sci U S A 107: 10696–10701. pmid:20498071 Dittmer DP, Damania B. Kaposi sarcoma–associated herpesvirus: immunobiology, oncogenesis, and therapy. The Journal of clinical investigation. 2016 Sep 1;126(9):3165-75. Goldman, M.J., Craft, B., Hastie, M. et al. Visualizing and interpreting cancer genomics data via the Xena platform. Nat Biotechnol (2020). https://doi.org/10.1038/s41587-020-0546-8 Haverkos HW, Drotman DP. Prevalence of Kaposi's sarcoma among patients with AIDS. The New England Journal of Medicine. 1985;312(23):1518. Hutchison LM, Boscoe FP, Feingold BJ. Cancers Disproportionately Affecting the New York State Transgender Population, 1979-2016. Am J Public Health. 2018 Sep;108(9):1260-1262. Joest B, Kempf W, Berisha A, Peyk P, Tronnier M, Mitteldorf C. Stage-related PD-L1 expression in Kaposi sarcoma tumor microenvironment. J Cutan Pathol. 2020 Oct;47(10):888-895. doi: 10.1111/cup.13716. Lacunza E, Ahuja A, Coso OA, Abba M, Ramos JC, Cesarman E, Mesri EA, Naipauer J. Unveiling the role of KSHV-infected human mesenchymal stem cells in Kaposi's sarcoma initiation. J Med Virol. 2024 May;96(5):e29684. doi: 10.1002/jmv.29684. PMID: 38773828. Lee AQ, Hao C, Pan M, Ganjoo KN, Bui NQ. Histologic and immunologic factors associated with response to immune checkpoint inhibitors in advanced sarcoma. Clin Cancer Res. 2024 Dec 19. doi: 10.1158/1078-0432.CCR-24-3485. Liang C, Lee JS, Jung JU. Immune evasion in Kaposi's sarcoma-associated herpes virus associated oncogenesis. Semin Cancer Biol. 2008 Dec;18(6):423-36. doi: 10.1016/j.semcancer.2008.09.003. Epub 2008 Oct 2. Lidenge SJ, Tso FY, Ngalamika O, Kolape J, Ngowi JR, Mwaiselage J, Wood C, West JT. Lack of CD8+ T-cell co-localization with Kaposi's sarcoma-associated herpesvirus infected cells in Kaposi's sarcoma tumors. Oncotarget. 2020 Apr 28;11(17):1556-1572. Lidenge, S.J., Kossenkov, A.V., Tso, F.Y., Wickramasinghe, J., Privatt, S.R., Ngalamika, O., Ngowi, J.R., Mwaiselage, J., Lieberman, P.M., West, J.T. and Wood, C., 2020. Comparative transcriptome analysis of endemic and epidemic Kaposi’s sarcoma (KS) lesions and the secondary role of HIV-1 in KS pathogenesis. PLoS pathogens, 16(7), p.e1008681. Manimaran S, Selby HM, Okrah K, Ruberman C, Leek JT, Quackenbush J, Haibe-Kains B, Bravo HC, Johnson WE. BatchQC: interactive software for evaluating sample and batch effects in genomic data. Bioinformatics. 2016 Dec 15;32(24):3836-3838. doi: 10.1093/bioinformatics/btw538. Medina MV, D Agostino A, Ma Q, Eroles P, Cavallin L, Chiozzini C, Sapochnik D, Cymeryng C, Hyjek E, Cesarman E, Naipauer J, Mesri EA, Coso OA. KSHV G-protein coupled receptor vGPCR oncogenic signaling upregulation of Cyclooxygenase-2 expression mediates angiogenesis and tumorigenesis in Kaposi's sarcoma. PLoS Pathog. 2020 Oct 15;16(10):e1009006. doi: 10.1371/journal.ppat.1009006. Méndez-Solís O, Bendjennat M, Naipauer J, Theodoridis PR, Ho JJD, Verdun RE, Hare JM, Cesarman E, Lee S, Mesri EA. Kaposi's sarcoma herpesvirus activates the hypoxia response to usurp HIF2α-dependent translation initiation for replication and oncogenesis. Cell Rep. 2021 Dec 28;37(13):110144. doi: 10.1016/j.celrep.2021.110144. PMID: 34965440; PMCID: PMC9121799. Mesri EA, Cesarman E, Boshoff C. Kaposi's sarcoma and its associated herpesvirus. Nature Reviews Cancer. 2010 Oct;10(10):707-19. Moorad R, Kasonkanji E, Gumulira J, Gondwe Y, Dewey M, Pan Y, Peng A, Pluta LJ, Kudowa E, Nyasosela R, Tomoka T, Tweya H, Heller T, Gugsa S, Phiri S, Moore DT, Damania B, Painschab M, Hosseinipour MC, Dittmer DP. A prospective cohort study identifies two types of HIV+ Kaposi Sarcoma lesions: proliferative and inflammatory. Int J Cancer. 2023 Dec 15;153(12):2082-2092. doi: 10.1002/ijc.34689. Naipauer J, Mesri EA. The Kaposi's sarcoma progenitor enigma: KSHV-induced MEndT-EndMT axis. Trends Mol Med. 2023 Mar;29(3):188-200. doi: 10.1016/j.molmed.2022.12.003. Epub 2023 Jan 10. PMID: 36635149; PMCID: PMC9957928. Nakajima K-i, Inagaki T, Espera JM, Izumiya Y. Kaposi's sarcoma-associated herpesvirus (KSHV) LANA prevents KSHV episomes from degradation. J Virol. 2024 Feb 20;98(2):e0126823. doi: 10.1128/jvi.01268-23. Nicol SM, Sabbah S, Brulois KF, Jung JU, Bell AI, Hislop AD. Primary B Lymphocytes Infected with Kaposi's Sarcoma-Associated Herpesvirus Can Be Expanded In Vitro and Are Recognized by LANA-Specific CD4+ T Cells. J Virol. 2016 Mar 28;90(8):3849-3859. doi: 10.1128/JVI.02377-15. Olson AT, Kang Y, Ladha AM, Zhu S, Lim CB, Nabet B, Lagunoff M, Gujral TS, Geballe AP. Polypharmacology-based kinome screen identifies new regulators of KSHV reactivation. PLoS Pathog. 2023 Sep 5;19(9):e1011169. doi: 10.1371/journal.ppat.1011169. PMID: 37669313; PMCID: PMC10503724. Petitprez F, Vano YA, Becht E, Giraldo NA, de Reyniès A, Sautès-Fridman C, Fridman WH. Transcriptomic analysis of the tumor microenvironment to guide prognosis and immunotherapies. Cancer Immunol Immunother. 2018 Jun;67(6):981-988. doi: 10.1007/s00262-017-2058-z. Ramaswami R, Tagawa T, Mahesh G, Serquina A, Koparde V, Lurain K, Dremel S, Li X, Mungale A, Beran A, Ohler ZW. Transcriptional landscape of Kaposi sarcoma tumors identifies unique immunologic signatures and key determinants of angiogenesis. Journal of Translational Medicine. 2023 Sep 22;21(1):653. Rappocciolo G, Hensler HR, Jais M, Reinhart TA, Pegu A, Jenkins FJ, Rinaldo CR. 2008. Human herpesvirus 8 infects and replicates in primary cultures of activated B lymphocytes through DC-SIGN. J Virol 82:4793–4806. doi: 10.1128/JVI.01587-07. Rose TM, Bruce AG, Barcy S, Fitzgibbon M, Matsumoto LR, Ikoma M, Casper C, Orem J, Phipps W. Quantitative RNAseq analysis of Ugandan KS tumors reveals KSHV gene expression dominated by transcription from the LTd downstream latency promoter. PLoS Pathogens. 2018 Dec 17;14(12):e1007441. Singh RK, Lang F, Pei Y, Jha HC, Robertson ES. Metabolic reprogramming of Kaposi's sarcoma associated herpes virus infected B-cells in hypoxia. PLoS Pathog. 2018 May 10;14(5):e1007062. doi: 10.1371/journal.ppat.1007062. PMID: 29746587; PMCID: PMC5963815. Tso FY, Kossenkov AV, Lidenge SJ, Ngalamika O, Ngowi JR, Mwaiselage J, Wickramasinghe J, Kwon EH, West JT, Lieberman PM, Wood C. RNA-Seq of Kaposi’s sarcoma reveals alterations in glucose and lipid metabolism. PLoS pathogens. 2018 Jan 19;14(1):e1006844. Uldrick TS, Whitby D. Update on KSHV epidemiology, Kaposi Sarcoma pathogenesis, and treatment of Kaposi Sarcoma. Cancer letters. 2011 Jun 28;305(2):150-62. Uppal T, Banerjee S, Sun Z, Verma SC, Robertson ES. KSHV LANA--the master regulator of KSHV latency. Viruses. 2014 Dec 11;6(12):4961-98. doi: 10.3390/v6124961. PMID: 25514370; PMCID: PMC4276939. Zhang Y, Parmigiani G, Johnson WE. ComBat-seq: batch effect adjustment for RNA-seq count data. NAR genomics and bioinformatics. 2020 Sep 1;2(3):lqaa078. Zhu, Y, da Silva, S.R. & Gao, SJ. Metabolic reprogramming by KSHV in cellular transformation. Cancer Metab 2 (Suppl 1), P88 (2014). https://doi.org/10.1186/2049-3002-2-S1-P88 Additional Declarations No competing interests reported. Supplementary Files AdditionalFigures.pdf SData1.docx Supplementary Data 1: Cluster comparison across cohorts and clinical variables. STable1.xlsx Supplementary Table 1: Results of differential gene expression analysis. STable2.xlsx Supplementary Table 2: Results of differential KSHV gene expression analysis. Cite Share Download PDF Status: Posted Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-6146471","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":425508354,"identity":"e99552ce-b396-4702-8db4-3ce66471e78d","order_by":0,"name":"Ezequiel Lacunza","email":"","orcid":"","institution":"CINIBA, Universidad Nacional de La Plata","correspondingAuthor":false,"prefix":"","firstName":"Ezequiel","middleName":"","lastName":"Lacunza","suffix":""},{"id":425508357,"identity":"25119557-3639-4993-9828-c571f24a5828","order_by":1,"name":"Valeria Fink","email":"","orcid":"","institution":"Dirección de Investigaciones, Fundación Huésped, Buenos Aires, Argentina.","correspondingAuthor":false,"prefix":"","firstName":"Valeria","middleName":"","lastName":"Fink","suffix":""},{"id":425508358,"identity":"d4934f2e-e65e-4212-87e8-1324a0144b6d","order_by":2,"name":"Julian Naipauer","email":"","orcid":"","institution":"Instituto de Fisiología (IFIBYNE), Biología Molecular y Neurociencias (IFIBYNE), Buenos Aires, Argentina","correspondingAuthor":false,"prefix":"","firstName":"Julian","middleName":"","lastName":"Naipauer","suffix":""},{"id":425508360,"identity":"32a6a5d6-ee8a-467f-a2fc-554289390714","order_by":3,"name":"María E. Salas","email":"","orcid":"","institution":"CINIBA, Universidad Nacional de La Plata","correspondingAuthor":false,"prefix":"","firstName":"María","middleName":"E.","lastName":"Salas","suffix":""},{"id":425508361,"identity":"50cda3d9-ddb8-4c43-9d9b-9a4dc0b0cdb9","order_by":4,"name":"Ana M. Gun","email":"","orcid":"","institution":"Dirección de Investigaciones, Fundación Huésped, Buenos Aires, Argentina.","correspondingAuthor":false,"prefix":"","firstName":"Ana","middleName":"M.","lastName":"Gun","suffix":""},{"id":425508362,"identity":"e8bab9a1-0f1c-469d-875d-ff7b72df8091","order_by":5,"name":"Mary J. Goldman","email":"","orcid":"","institution":"UC Santa Cruz Genomics Institute, University of California","correspondingAuthor":false,"prefix":"","firstName":"Mary","middleName":"J.","lastName":"Goldman","suffix":""},{"id":425508363,"identity":"5e9825fb-8f2d-42f9-a6a1-b8b114dc054e","order_by":6,"name":"Jingchun Zhu","email":"","orcid":"","institution":"UC Santa Cruz Genomics Institute, University of California","correspondingAuthor":false,"prefix":"","firstName":"Jingchun","middleName":"","lastName":"Zhu","suffix":""},{"id":425508364,"identity":"932907f2-9402-405d-82df-cc61bbcacac8","order_by":7,"name":"Sion Williams","email":"","orcid":"","institution":"University of Miami","correspondingAuthor":false,"prefix":"","firstName":"Sion","middleName":"","lastName":"Williams","suffix":""},{"id":425508365,"identity":"5e2531fd-9b52-48c9-b9aa-2ce561d87014","order_by":8,"name":"María I. Figueroa","email":"","orcid":"","institution":"Dirección de Investigaciones, Fundación Huésped, Buenos Aires, Argentina.","correspondingAuthor":false,"prefix":"","firstName":"María","middleName":"I.","lastName":"Figueroa","suffix":""},{"id":425508366,"identity":"ee8362c3-413d-4e29-94d1-372e67d7f746","order_by":9,"name":"Pedro Cahn","email":"","orcid":"","institution":"Dirección de Investigaciones, Fundación Huésped, Buenos Aires, Argentina.","correspondingAuthor":false,"prefix":"","firstName":"Pedro","middleName":"","lastName":"Cahn","suffix":""},{"id":425508367,"identity":"5ecadf61-a943-4183-929a-d02bfc7ee556","order_by":10,"name":"Omar Coso","email":"","orcid":"","institution":"Instituto de Fisiología (IFIBYNE), Biología Molecular y Neurociencias (IFIBYNE), Buenos Aires, Argentina","correspondingAuthor":false,"prefix":"","firstName":"Omar","middleName":"","lastName":"Coso","suffix":""},{"id":425508368,"identity":"454b0806-112f-4cf7-bea9-52bcb75dcdab","order_by":11,"name":"Ethel Cesarman","email":"","orcid":"","institution":"Weill Cornell Medicine","correspondingAuthor":false,"prefix":"","firstName":"Ethel","middleName":"","lastName":"Cesarman","suffix":""},{"id":425508369,"identity":"131e4ce5-0d39-4379-a012-2982d522db15","order_by":12,"name":"Juan C. Ramos","email":"","orcid":"","institution":"University of Miami","correspondingAuthor":false,"prefix":"","firstName":"Juan","middleName":"C.","lastName":"Ramos","suffix":""},{"id":425508370,"identity":"50225173-fe43-4a66-9c18-957dd725894b","order_by":13,"name":"Martín C. Abba","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAAtElEQVRIiWNgGAWjYHACNmYGBhsgzdh4gBQtaSAtDSRpOQxmEadFt/3ws8cFNeft1rYfBtpSYxNNUIvZmTRz4xnHbidvO5MI1HIsLbeBoJYbDGbSPGy3k80OALUwNhwmRgv7N2mef+eSzc4/JFoLj5k0b9sBO7MbRNtyJqfceGZfcoLZDaAtCUT55fjxbY8LvtnZm51Pf/jgQ40NYS0wkAhWmUCschCwJ0XxKBgFo2AUjDAAAPF4Ryh5A4aLAAAAAElFTkSuQmCC","orcid":"","institution":"CINIBA, Universidad Nacional de La Plata","correspondingAuthor":true,"prefix":"","firstName":"Martín","middleName":"C.","lastName":"Abba","suffix":""}],"badges":[],"createdAt":"2025-03-03 13:08:05","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-6146471/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-6146471/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":78262093,"identity":"bb2f8369-e36d-4401-b39d-177a94a3593d","added_by":"auto","created_at":"2025-03-11 11:49:26","extension":"jpeg","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":1575008,"visible":true,"origin":"","legend":"\u003cp\u003eCompiled KS dataset of 131 KS and matched non-tumor tissues. \u003cstrong\u003eA. \u003c/strong\u003eDiagram of the Strategy followed to obtain the compiled KS dataset. \u003cstrong\u003eB.\u003c/strong\u003e Multidimensional scaling plot of the non-adjusted (left) and batch-effect adjusted (right) gene expression data of KS (circle) and matched non-tumor tissues (triangle) among cohorts. \u003cstrong\u003eC.\u003c/strong\u003e An example Xena Browser Visual Spreadsheet examining the gene expression profiles of selected humans (\u003cem\u003eITGB1\u003c/em\u003e, \u003cem\u003eFLT1\u003c/em\u003e, \u003cem\u003ePDGFA\u003c/em\u003e and \u003cem\u003ePDGFB\u003c/em\u003e) and KSHV genes (\u003cem\u003eK12\u003c/em\u003e, \u003cem\u003eLANA\u003c/em\u003e, \u003cem\u003eORF72\u003c/em\u003e) among the compiled KS dataset in the context of their phenotypic data (\u003ca href=\"https://kaposi.xenahubs.net/\"\u003ehttps://kaposi.xenahubs.net/\u003c/a\u003e). Gene expression data is colored red to blue for high to low expression respectively. UCSC Xena browser provides analytic tools to correlate the human and KSHV gene expression levels as well as to identify differentially expressed genes-based groups defined by the user (\u003cem\u003ee.g.\u003c/em\u003e KS lesions vs. control skin). \u003cstrong\u003eD. \u003c/strong\u003eRepresentative volcano plot of differentially expressed genes detected between KS and control samples using UCSC Xena browser. \u003cstrong\u003eE.\u003c/strong\u003e Functional enrichment analysis of differentially expressed genes with Xena.\u003c/p\u003e","description":"","filename":"floatimage1.jpeg","url":"https://assets-eu.researchsquare.com/files/rs-6146471/v1/9a8a195f2f6d621636cc23b2.jpeg"},{"id":78261799,"identity":"50ab03f6-552e-4cb0-b3e5-8c1c4a46faa4","added_by":"auto","created_at":"2025-03-11 11:41:26","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":345880,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eTranscriptomic, Immune and Functional Profiling of Kaposi Sarcoma Lesions. A. \u003c/strong\u003eClustering results reveal distinct groups of KS lesions, with endemic lesions primarily in Cluster C1, while epidemic lesions are distributed across the three clusters. \u003cstrong\u003eB.\u003c/strong\u003eImmune profiling using the ABIS algorithm demonstrates a higher immune cell infiltrate in KS lesions compared to controls. \u003cstrong\u003eC.\u003c/strong\u003e Transcriptomic expression levels of immune checkpoint inhibitors (ICIs) indicate that Clusters C1 and C3 exhibit the highest expression levels\u003cstrong\u003e. D. \u003c/strong\u003ePathway activity analysis using Gene Ontology reveals reduced epithelial differentiation and increased immune activity in KS lesions, particularly in Clusters C1 and C3. Cluster C4 shows lower immune activity but a strong innate antiviral response. \u003cstrong\u003eE.\u003c/strong\u003e Pathway activity analysis using Hallmarks reveals reduced metabolic processes in KS lesions compared to controls, along with increased proliferative activity, angiogenesis, and PI3K/Akt/mTOR signaling, particularly in Clusters C1 and C3.\u003c/p\u003e","description":"","filename":"floatimage2.png","url":"https://assets-eu.researchsquare.com/files/rs-6146471/v1/22c58023d8ac61766d6e9315.png"},{"id":78263166,"identity":"31640de5-cc13-4c99-bf9a-8f0f8ddb8d63","added_by":"auto","created_at":"2025-03-11 11:57:26","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":257776,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eKSHV Transcriptomic Analysis Across Kaposi Sarcoma Lesion Clusters A.\u003c/strong\u003e Heatmap visualization of the expression levels of 85 KSHV genes across control and lesion clusters. Gene expression data is colored red to blue for high to low expression respectively. The bar plot at the bottom displays LANA levels based on mapped reads. The error bar plot on the right illustrates the total KSHV mapped reads in each cluster, highlighting significant differences in cluster C1 compared to clusters C3 and C4 (p\u0026lt;0.01). \u003cstrong\u003eB.\u003c/strong\u003e Bar plots displaying the expression of key KSHV genes that are differentially expressed between clusters. * p \u0026lt;0.05; ** p\u0026lt;0.01; *** p\u0026lt;0.001; **** p\u0026lt;0.0001.\u003c/p\u003e","description":"","filename":"floatimage3.png","url":"https://assets-eu.researchsquare.com/files/rs-6146471/v1/65adaa956e3e719e4a8d5b36.png"},{"id":78306957,"identity":"e69de073-ef9e-423e-924e-8d66c4b65b25","added_by":"auto","created_at":"2025-03-12 01:01:23","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":3045632,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-6146471/v1/02b5e60f-e1e1-4b01-8ad4-3435522eece2.pdf"},{"id":78261801,"identity":"0238b98a-87a1-4e4a-ae5b-9b3dcd62079f","added_by":"auto","created_at":"2025-03-11 11:41:26","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":1117195,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cbr\u003e\u003c/p\u003e","description":"","filename":"AdditionalFigures.pdf","url":"https://assets-eu.researchsquare.com/files/rs-6146471/v1/d7354654167af71ae2d9c6b7.pdf"},{"id":78261800,"identity":"a84dab0d-d94d-4ab3-81e4-926693c7a3b3","added_by":"auto","created_at":"2025-03-11 11:41:26","extension":"docx","order_by":2,"title":"","display":"","copyAsset":false,"role":"supplement","size":22035,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eSupplementary Data 1: \u003c/strong\u003eCluster comparison across cohorts and clinical variables.\u003c/p\u003e","description":"","filename":"SData1.docx","url":"https://assets-eu.researchsquare.com/files/rs-6146471/v1/141eb9db4f14d7da9d85f60e.docx"},{"id":78261816,"identity":"3bb8627f-a67d-43c4-b337-6c7e54276167","added_by":"auto","created_at":"2025-03-11 11:41:27","extension":"xlsx","order_by":3,"title":"","display":"","copyAsset":false,"role":"supplement","size":3976772,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eSupplementary Table 1: \u003c/strong\u003eResults of differential gene expression analysis.\u003c/p\u003e","description":"","filename":"STable1.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-6146471/v1/498444102faf0d2105f0bcfb.xlsx"},{"id":78261805,"identity":"44bf0948-3889-4f36-ad9e-e26b18cd951a","added_by":"auto","created_at":"2025-03-11 11:41:26","extension":"xlsx","order_by":4,"title":"","display":"","copyAsset":false,"role":"supplement","size":25175,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eSupplementary Table 2:\u003c/strong\u003e Results of differential KSHV gene expression analysis.\u003c/p\u003e","description":"","filename":"STable2.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-6146471/v1/128cb3411e2054d9dab534b6.xlsx"}],"financialInterests":"No competing interests reported.","formattedTitle":"\u003cp\u003eIntegrative Functional Genomics Analysis of Kaposi Sarcoma Cohorts\u003c/p\u003e","fulltext":[{"header":"1 INTRODUCTION","content":"\u003cp\u003eKaposi sarcoma (KS) is a prominent AIDS-defining malignancy, and a critical global health challenge primarily linked to infection with KS-associated herpesvirus (KSHV) (Mesri et al., \u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e2010\u003c/span\u003e; Dittmer and Damania \u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e2016\u003c/span\u003e; Cesarman et al., \u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e2019\u003c/span\u003e). KSHV contains a complex genome comprising over 80 genes, which are regulated to facilitate either latency with minimal viral expression or lytic replication, resulting in the production of new virions. KSHV can infect epithelial cells, endothelial cells, B cells, and, more recently, it has been found to infect neurons and mesenchymal stem cells (MSCs), further highlighting its diverse pathogenic potential (Mesri et al., \u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e2010\u003c/span\u003e; Dittmer and Damania \u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e2016\u003c/span\u003e; Cesarman et al., \u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e2019\u003c/span\u003e; Naipauer and Mesri \u003cspan citationid=\"CR23\" class=\"CitationRef\"\u003e2023\u003c/span\u003e; Lacunza et al., \u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e2024\u003c/span\u003e).\u003c/p\u003e \u003cp\u003eAmong the various forms of KS, the most aggressive variant is observed in individuals with HIV, known as acquired immunodeficiency syndrome-associated KS (AIDS-KS). This form is characterized by widespread dissemination, affecting the skin and visceral organs, including the gastrointestinal tract and lungs (Haverkos et al 1985). Moreover, in men who have sex with men (MSM), the HIV infection rate is 4.9\u0026ndash;10.5%, and in transgender women (TGW), HIV reported prevalence is up to 58%. These two populations have disproportionately higher risks of developing KS caused by KSHV (Hutchison et al., \u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e2018\u003c/span\u003e; AIDS-defining Cancer Project Working Group 2017).\u003c/p\u003e \u003cp\u003eNGS-based transcriptomic analyses of KS have revealed significant molecular insights into the pathogenesis of the disease. For example, Tso et al. (\u003cspan citationid=\"CR32\" class=\"CitationRef\"\u003e2018\u003c/span\u003e) reported alterations in glucose and lipid metabolism, with changes in genes associated with metabolic disorder pathways. A recent study further stratified HIV\u0026thinsp;+\u0026thinsp;KS lesions into two molecular subtypes\u0026mdash;one characterized by endothelial and proliferative features and the other enriched in inflammatory transcripts (Moorad et al., \u003cspan citationid=\"CR22\" class=\"CitationRef\"\u003e2023\u003c/span\u003e). Additionally, profiling KS lesions from different anatomical sites has revealed variations in immune composition, angiogenic factors, and KSHV gene expression, underscoring the role of the tumor microenvironment in disease progression (Ramaswami et al., \u003cspan citationid=\"CR28\" class=\"CitationRef\"\u003e2023\u003c/span\u003e). Unsupervised clustering of KSHV gene expression has also identified distinct tumor groups with varying latent and lytic viral profiles (Tso et al., \u003cspan citationid=\"CR32\" class=\"CitationRef\"\u003e2018\u003c/span\u003e; Rose et al., \u003cspan citationid=\"CR30\" class=\"CitationRef\"\u003e2018\u003c/span\u003e; Lidenge et al., \u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e2020\u003c/span\u003e; Lacunza et al., \u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e2024\u003c/span\u003e). Furthermore, morphologically distinct KS tumors from the same individual exhibit similar KSHV gene expression patterns, suggesting that viral activation is influenced by the local microenvironment and host immune response (Rose et al., \u003cspan citationid=\"CR30\" class=\"CitationRef\"\u003e2018\u003c/span\u003e). This molecular heterogeneity emphasizes the need for further studies to develop predictive biomarkers and optimize therapeutic strategies for KS.\u003c/p\u003e \u003cp\u003eMost of these RNAseq studies come from subjects in sub-Sahara Africa where KSHV seroprevalence is notably high, with over 50% of the population exhibiting detectable anti-KSHV antibodies (Uldrik et al., 2011; Damania and Dittmer \u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e2023\u003c/span\u003e). This raises the need for a broader integrative analysis including diverse cohorts to capture the global spectrum of KS pathology.\u003c/p\u003e \u003cp\u003eThis study aimed to integrate RNAseq-based gene expression profiles from four distinct cohorts of KS covering North America, South American, and sub-Saharan Africa to enhance our understanding of the disease. Additionally, we intend to share these valuable datasets with the research community through the UCSC Xena resource, promoting further exploration and discovery in the field of KS research.\u003c/p\u003e"},{"header":"2 MATERIALS AND METHODS","content":"\u003cdiv id=\"Sec3\" class=\"Section2\"\u003e \u003ch2\u003e2.1 Sample Collection and RNA Sequencing\u003c/h2\u003e \u003cp\u003eNine KS skin lesions were collected within an Argentinian study that included men at birth with or without HIV, recruited at Fundaci\u0026oacute;n Hu\u0026eacute;sped and Hospitals network as part of our collaborative U54 consortium (Miami CFAR SCCC - Argentina Consortium for Research and Training in virally induced AIDS-Malignancies). Participants were \u0026gt;\u0026thinsp;18 years old (median: 42 years; range: 34\u0026ndash;62 years). Approval of the protocol and study-related documents was obtained from the Ethics Committee. All participants included in this study signed informed consent before undergoing any study procedures.\u003c/p\u003e \u003cp\u003eTotal RNA was isolated from KS samples preserved in RNAlater using the miRNeasy Tissue/Cells Advanced Kits (Qiagen) following standard manufacturer's protocol. RNA concentration and integrity were measured on an Agilent 2100 Bioanalyzer (Agilent Technologies). RNA samples with RNA integrity number (RIN) over 5 were considered for RNA sequencing. The RNA samples were processed for directional RNA-seq library construction using the Illumina Total RNA Prep with Ribo-Zero Plus library preparation kit according to the manufacturer's protocol. We performed 101 nt paired-end sequencing using an Illumina Novaseq 6000 platform and obtained\u0026thinsp;~\u0026thinsp;60\u0026nbsp;million clusters per sample with 92% \u0026gt;Q30. The RNAseq raw data has been submitted to NCBI GEO database with accession number GSE271303.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec4\" class=\"Section2\"\u003e \u003ch2\u003e2.2 RNAseq data pre-processing and integration of Kaposi's Sarcoma cohorts\u003c/h2\u003e \u003cp\u003eRNAseq raw data were retrieved from GEO/SRA using the SRA Toolkit (\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://github.com/ncbi/sra-tools\u003c/span\u003e\u003cspan address=\"https://github.com/ncbi/sra-tools\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e) from GSE147704 (Tanzania \u0026amp; Zambia cohort: KS\u0026thinsp;=\u0026thinsp;24, Control\u0026thinsp;=\u0026thinsp;24, and Normal\u0026thinsp;=\u0026thinsp;3), GSE241095 (USA cohort: KS\u0026thinsp;=\u0026thinsp;10 and Control\u0026thinsp;=\u0026thinsp;10) and SRP486827 (Uganda cohort: KS\u0026thinsp;=\u0026thinsp;51). The raw short-read sequences generated from the Argentinian KS samples and retrieved from GEO/SRA were quality-checked and trimmed to remove adapters and low-quality bases using the Rfastp R/Bioconductor package. The preprocessed reads were then aligned and mapped to the human genome reference GRCh38 using the Subread aligner algorithm provided by the Rsubread R/Bioconductor package. The remaining non-human reads were aligned to the KSHV reference sequence NC_009333. The KSHV reference genome used in our study for read sequence alignment and mapping belongs to the Human herpesvirus 8 strain GK18. It provides a valuable complete genome for global transcriptome analysis, but it does not capture the full extent of KSHV's genetic diversity, including the highly diverse K1 and K15 subtypes present in sub-Saharan populations. The aligned reads (BAM files) from each sample were used to calculate gene expression abundance at the whole-genome level using the featureCounts function provided by the Rsubread package.\u003c/p\u003e \u003cp\u003eThe raw read counts for each cohort were integrated, and batch effects were diagnosed using the BatchQC R/Bioconductor package (Manimaran et al., \u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e2016\u003c/span\u003e), followed by batch effects adjustment with the ComBat-seq algorithm from the \"sva\" R/Bioconductor package (Zhang et al., \u003cspan citationid=\"CR35\" class=\"CitationRef\"\u003e2020\u003c/span\u003e). Briefly, ComBat-seq algorithm implements a negative binomial regression model to estimate batch effects based on the count matrix obtained for each RNA-seq study allowing us to include the condition variable (control or KS) as covariates in the regression model behind this method to account for the impact of batch and retaining biological signal.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec5\" class=\"Section2\"\u003e \u003ch2\u003e2.3 KS and KSHV transcriptome analysis of the compiled dataset\u003c/h2\u003e \u003cp\u003eThe transcriptome profile of the compiled batch-effect adjusted KS dataset consists of 131 KS and non-tumor cutaneous samples and their associated metadata. To identify differentially expressed genes between KS and controls or between KS groups (e.g.: from HIV\u0026thinsp;+\u0026thinsp;vs HIV- or with or without ART, etc.), we computed fold changes and adjusted p-values using the edgeR R/Bioconductor package based on the normalized log2-based count per million values. Genes showing a log-fold change greater than 1 and an adjusted p-value below 0.05 were considered significantly differentially expressed. Functional enrichment analysis and Gene Set Variation Analysis (GSVA) of differentially expressed genes were performed with the clusterProfiler and GSVA R packages. Tumor immune cell infiltration scores were estimated with the ABIS algorithm and the MCP-counter tool from the immunedeconv (\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://github.com/omnideconv/immunedeconv\u003c/span\u003e\u003cspan address=\"https://github.com/omnideconv/immunedeconv\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e)\u003c/span\u003e and MCPcounter \u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e(\u003c/span\u003e\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://github.com/ebecht/MCPcounter\u003c/span\u003e\u003cspan address=\"https://github.com/ebecht/MCPcounter\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e)\u003c/span\u003e R packages, respectively, on normalized count matrices. The scores, based on transcriptomic markers that are strongly, specifically, and stably expressed in a unique cell population, are proportional to the abundance of each population in the tumor, enabling intersample comparison and large cohort analyses (Becht et al., \u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e2016\u003c/span\u003e). Unsupervised hierarchical clustering analysis and heatmaps representations were performed with the MultiExperimentViewer (MeV 4.9.0) software. To determine the optimal number of clusters, we applied Principal Component Analysis (PCA) alongside the NbClust package, which evaluates the number of clusters using 30 different indices(\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://github.com/cran/NbClust/blob/master/R/NbClust.R\u003c/span\u003e\u003cspan address=\"https://github.com/cran/NbClust/blob/master/R/NbClust.R\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e).\u003c/p\u003e \u003cp\u003eThe relationship between a categorical and a quantitative variable was assessed using the Wilcoxon rank-sum test or the Kruskal\u0026ndash;Wallis test. Pearson correlation was used for two quantitative variables. P values were corrected for multiple testing using the Bonferroni or Benjamini\u0026ndash;Hochberg methods.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec6\" class=\"Section2\"\u003e \u003ch2\u003e2.4 Data sharing and visualization through a UCSC Xena hub\u003c/h2\u003e \u003cp\u003eUCSC Xena is a high-performance visualization and analysis tool for both large public repositories and private datasets (Goldman et al., \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e2020\u003c/span\u003e). UCSC Xena has two components: the front-end Xena Browser and the back-end Xena Hubs. We have deployed a public Xena Hub (called KS-omics) to host and share the functional genomics data of the compiled KS dataset with the research community (\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://kaposi.xenahubs.net/\u003c/span\u003e\u003cspan address=\"https://kaposi.xenahubs.net/\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003cspan type=\"Underline\" class=\"Underline\" name=\"Emphasis\"\u003e).\u003c/span\u003e Briefly, Xena Browser allows biologists without bioinformatics knowledge to explore data with a variety of visualizations and analytic tools.\u003c/p\u003e \u003c/div\u003e"},{"header":"3 RESULTS AND DISCUSSION","content":"\u003cp\u003e\u003cstrong\u003e3.1 Integrative Analysis of Host and Viral Gene Expression Profiles in KS Using Multi-Cohort RNA Sequencing\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eTo perform an integrative analysis of host and viral gene expression profiles in KS, we combined 94 KS with 37 non-tumor cutaneous cases obtained from four cohorts, one of them generated by us and three available in a public database (Figure \u003cstrong\u003e1A\u003c/strong\u003e). Clinical variables considered for the study included age, ART treatment, sex at birth, HIV status, origin (endemic/epidemic), and KS morphotype. To generate a homogeneous, uniformly curated, and preprocessed dataset, the RNAseq raw data was directly retrieved from GEO/SRA using the SRA Toolkit and equally preprocessed by our bioinformatics pipeline. Additionally, raw read count matrices from each cohort were integrated. Variation analysis (Figure \u003cstrong\u003eS1\u003c/strong\u003e) showed that batch effects explained a larger proportion of variation than the condition (control or KS), highlighting the need for batch correction. Batch effects were then adjusted using the ComBat-seq algorithm (Figure \u003cstrong\u003eS1\u003c/strong\u003e). Importantly, the transcriptomic profile of the four independent KS cohorts were generated using Illumina sequencing platforms (Table \u003cstrong\u003e1\u003c/strong\u003e).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eTable 1.\u0026nbsp;\u003c/strong\u003eCharacteristics of the study subjects included in each KS cohort.\u0026nbsp;\u003c/p\u003e\n\u003ctable border=\"1\" cellspacing=\"0\" cellpadding=\"0\" width=\"695\"\u003e\n \u003ctbody\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\" style=\"width: 202px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eCharacteristics\u003c/strong\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 142px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eTanzania / Zambia\u003c/strong\u003e\u003c/p\u003e\n \u003cp\u003eGSE147704\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 104px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eUganda\u003c/strong\u003e\u003c/p\u003e\n \u003cp\u003eSRP486827\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 108px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eUSA\u003c/strong\u003e\u003c/p\u003e\n \u003cp\u003eGSE241095\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 139px;\"\u003e\n \u003cp\u003e\u003cstrong\u003eArgentina\u003c/strong\u003e\u003c/p\u003e\n \u003cp\u003eGSE271303\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\" style=\"width: 202px;\"\u003e\n \u003cp\u003e# of Cases\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 142px;\"\u003e\n \u003cp\u003e24 KS\u003c/p\u003e\n \u003cp\u003e24 Controls\u003c/p\u003e\n \u003cp\u003e\u0026nbsp; 3 Normal skin\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 104px;\"\u003e\n \u003cp\u003e51 KS\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 108px;\"\u003e\n \u003cp\u003e10 KS\u003c/p\u003e\n \u003cp\u003e10 Controls\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 139px;\"\u003e\n \u003cp\u003e9 KS\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\" style=\"width: 202px;\"\u003e\n \u003cp\u003eAge (median, IQR)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 142px;\"\u003e\n \u003cp\u003e33 (29, 42)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 104px;\"\u003e\n \u003cp\u003e35 (26, 46)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 108px;\"\u003e\n \u003cp\u003e43 (34, 49)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 139px;\"\u003e\n \u003cp\u003e42 (34, 50)\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\" style=\"width: 202px;\"\u003e\n \u003cp\u003eSex at birth\u003c/p\u003e\n \u003cp\u003e\u0026nbsp; \u0026nbsp; Male\u003c/p\u003e\n \u003cp\u003e\u0026nbsp; \u0026nbsp; Female\u003c/p\u003e\n \u003cp\u003e\u0026nbsp; \u0026nbsp; n/a\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 142px;\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003cp\u003e16\u003c/p\u003e\n \u003cp\u003e8\u003c/p\u003e\n \u003cp\u003e-\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 104px;\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003cp\u003e44\u003c/p\u003e\n \u003cp\u003e5\u003c/p\u003e\n \u003cp\u003e2\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 108px;\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003cp\u003e10\u003c/p\u003e\n \u003cp\u003e0\u003c/p\u003e\n \u003cp\u003e-\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 139px;\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003cp\u003e9\u003c/p\u003e\n \u003cp\u003e0\u003c/p\u003e\n \u003cp\u003e-\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\" style=\"width: 202px;\"\u003e\n \u003cp\u003eRace\u003c/p\u003e\n \u003cp\u003e\u0026nbsp; \u0026nbsp;White\u003c/p\u003e\n \u003cp\u003e\u0026nbsp; \u0026nbsp;Black\u003c/p\u003e\n \u003cp\u003e\u0026nbsp; \u0026nbsp;Hispanic\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 142px;\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003cp\u003e0\u003c/p\u003e\n \u003cp\u003e24 (100%)\u003c/p\u003e\n \u003cp\u003e0\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 104px;\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003cp\u003e0\u003c/p\u003e\n \u003cp\u003e51 (100%)\u003c/p\u003e\n \u003cp\u003e0\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 108px;\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003cp\u003e2 (20%)\u003c/p\u003e\n \u003cp\u003e4 (40%)\u003c/p\u003e\n \u003cp\u003e4 (40%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 139px;\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003cp\u003e0\u003c/p\u003e\n \u003cp\u003e0\u003c/p\u003e\n \u003cp\u003e9 (100)\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd valign=\"top\" style=\"width: 202px;\"\u003e\n \u003cp\u003eHIV characteristics\u003c/p\u003e\n \u003cp\u003e\u0026nbsp; \u0026nbsp;HIV co-infection\u003c/p\u003e\n \u003cp\u003e\u0026nbsp; \u0026nbsp;CD4 T-cell count (cells/ul)\u003c/p\u003e\n \u003cp\u003e\u0026nbsp; \u0026nbsp;HIV Viral load (cps/mL)\u003c/p\u003e\n \u003cp\u003e\u0026nbsp; \u0026nbsp;On ART at biopsy\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 142px;\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003cp\u003e18 (75%)\u003c/p\u003e\n \u003cp\u003ena\u003c/p\u003e\n \u003cp\u003e12000\u003c/p\u003e\n \u003cp\u003e18 (75%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 104px;\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003cp\u003e51 (100%)\u003c/p\u003e\n \u003cp\u003ena\u003c/p\u003e\n \u003cp\u003ena\u003c/p\u003e\n \u003cp\u003e0 (0%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 108px;\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003cp\u003e9 (90%)\u003c/p\u003e\n \u003cp\u003e39\u003c/p\u003e\n \u003cp\u003e74000\u003c/p\u003e\n \u003cp\u003e10 (100%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd valign=\"top\" style=\"width: 139px;\"\u003e\n \u003cp\u003e\u0026nbsp;\u003c/p\u003e\n \u003cp\u003e9 (90%)\u003c/p\u003e\n \u003cp\u003e144\u003c/p\u003e\n \u003cp\u003e125000\u003c/p\u003e\n \u003cp\u003e3 (33%)\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003c/tbody\u003e\n\u003c/table\u003e\n\u003cp\u003e\u003cstrong\u003e\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eFigure \u003cstrong\u003e1B\u003c/strong\u003e shows the scatter plot of cases projected on the first two dimensions in unadjusted data, and in data adjusted by ComBat-seq. We observed a strong batch effect in the unadjusted data, which was well addressed by ComBat-seq. A favorable adjustment would pool control/non-tumor cutaneous samples from the cohorts (batches), while keeping all KS lesions separated from the controls and from each other. In the multidimensional plot of ComBat-seq adjusted data, we observed the expected pattern of data if there were no batch effects, in which the control/normal cases are clustered together, while the KS cases from four cohorts are scattered at different locations. These results suggest a successful adjustment of batch effect from ComBat-seq.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eUCSC Xena is a high-performance visualization and analysis tool for both large public repositories and private datasets (Goldman et al., 2020). In this sense, we have implemented a public cloud-based backend Xena Hubs hosting the human and KSHV batch effect adjusted gene expression, immune and pathway activity profiles as well as their associated phenotypic data obtained from the four KS cohorts (https://kaposi.xenahubs.net/\u003cu\u003e)\u003c/u\u003e. The frontend Xena Browser provides a wide variety of visualizations and analyses including scatter plots, bar graphs, statistical tests, genomic signatures, as well as a unique Visual Spreadsheet view (Figure \u003cstrong\u003e1C\u003c/strong\u003e). The Xena Visual Spreadsheet was designed to enable and enhance integration across diverse data modalities, providing researchers with a more biologically complete understanding of genomic events and tumor biology. The UCSC Xena browser enables users to explore functional genomic datasets for correlations between genomic and phenotypic variables. It also allows for differential expression analysis between user-defined sample groups (Figure \u003cstrong\u003e1D\u003c/strong\u003e) and functional enrichment analysis of differentially expressed genes (Figure \u003cstrong\u003e1E\u003c/strong\u003e). \u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003e3.2 Unsupervised clustering reveals distinct transcriptomic and immune profiles in KS lesions.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eTo characterize the transcriptomic, immune, and functional profiles of the samples, we employed an unsupervised clustering approach. We defined clusters based on the entire transcriptome (host and KSHV). This approach allows us to dissect the contributions of both host and viral factors to the overall expression landscape, providing a comprehensive understanding of the interplay between the host\u0026apos;s immune response and viral activity in KS.\u003c/p\u003e\n\u003cp\u003eThe unsupervised analysis revealed a clear distinction between KS lesions and non-tumor controls (Figure \u003cstrong\u003e2A\u003c/strong\u003e), underscoring the impact of KSHV on the host transcriptome (Tso et al., 2018). To further validate the optimal number of clusters, we applied the NbClust package. The majority rule suggested either 2 or 4 clusters as the most likely options. Considering this, along with the structure observed in the dendrogram and PCA analysis, we determined that 4 clusters were the most appropriate choice for our data (Figure \u003cstrong\u003eS2\u003c/strong\u003e) We identified at least three distinct groups of lesions, designated as clusters 1 (C1), 3 (C3), and 4 (C4). Uninvolved skin from KS patients and skin from non-KS subjects were classified within the control cluster (C2), which comprised two subclusters and included two KS cases (Figure \u003cstrong\u003e2A\u003c/strong\u003e). Cluster C1 included most endemic tumors, along with some epidemic cases, while clusters C3 and C4 were primarily composed of epidemic lesions (p\u0026lt;0.001; Data \u003cstrong\u003eS1\u003c/strong\u003e). Importantly, no significant associations were observed between lesion clusters and HIV status or ART treatment (Data \u003cstrong\u003eS1\u003c/strong\u003e). However, a notable association with KS morphotype was found, with cluster C4 exhibiting a higher percentage of macular lesions (p\u0026lt;0.01; Data \u003cstrong\u003eS1\u003c/strong\u003e).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eWe defined the immune fraction profile using the ABIS algorithm, which demonstrated a significant immune infiltrate enrichment in the lesions compared to the controls (Figure \u003cstrong\u003e2B\u003c/strong\u003e). Clusters C1 and C3 showed the highest percentage of immune fractions, indicating a greater infiltrate compared to clusters C2 and C4. Cluster C1 was characterized by a significant enrichment of memory CD4 T lymphocytes, whereas naive CD4 T cells were predominant in the control group (C2) (Figure \u003cstrong\u003eS3\u003c/strong\u003e). In contrast, cluster C3 exhibited a notable enrichment of plasmablasts and memory B cells (Figure \u003cstrong\u003eS3\u003c/strong\u003e). Additionally, all three lesion clusters showed enrichment in CD8 memory T cells compared to the control group (Figure \u003cstrong\u003eS3\u003c/strong\u003e).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eTo strengthen the analysis, we examined the composition of the sample microenvironment using the microenvironment cell populations (MCP)-counter method (Becht et al., 2016). The cell composition varied significantly between clusters (Figure \u003cstrong\u003eS3\u003c/strong\u003e). Cluster C1 displayed elevated expression of genes specific to immune populations, including T cells, CD8+ T cells, natural killer (NK) cells, and cytotoxic lymphocytes. However, it was primarily characterized by a significant upregulation of endothelial cell-related genes (Figure \u003cstrong\u003eS3\u003c/strong\u003e). Cluster C3 showed elevated expression of most immune cell populations, with B lineage signatures and T cells as key determinants. Cluster C4 displayed a generally low immune profile, similar to the control cluster (C2), but differed by a higher expression of endothelial cell markers (Figure \u003cstrong\u003eS3\u003c/strong\u003e).\u003c/p\u003e\n\u003cp\u003eThese results complemented the findings by Tso et al., who reported a significant infiltrate of immune cells in the lesions, primarily of B cells, although T cells were not significantly present. However, they predicted the activation of numerous chemokines that play an important role in the recruitment of T cells (Tso et al., 2018). Additionally, Lidenge et al. have shown that, despite the upregulation of chemokines like CxCL-9, immune cells, including CD4+ T cells, CD8+ T cells, and NK cells, are not co-localized with KSHV-infected regions (Lidenge et al., 2020). This discrepancy highlights the differences between transcriptomic-based immune profiling and protein-level detection, suggesting that immune cells may be recruited to the tumor microenvironment but may not be localized to KSHV-infected areas.\u0026nbsp;In addition, the marginal tissues captured in biopsies likely contain immune populations that are present but infiltrate the tumor at low frequencies.\u003c/p\u003e\n\u003cp\u003eGiven the increasing relevance of immune checkpoint inhibitors (ICIs) in the treatment of KS, we evaluated the transcriptomic expression levels of a set of ICIs. A distinct pattern of expression levels was observed, correlating with the infiltrate (Figure \u003cstrong\u003e2C\u003c/strong\u003e). In this context, clusters C1 and C3 were the most enriched in these genes. \u0026nbsp;These results, while needing further validation, along with the immune profile of the clusters, are relevant in the context of the KS microenvironment and immune checkpoint ICIs immunotherapies. They are also consistent with previous studies. For instance, Petitprez et al. (2018) established an immune-based classification of sarcoma immune classes (SICs) and demonstrated that the immune-high group, characterized by the presence of B cell-rich lineage, exhibited a strong response to PD1 immunotherapy. The tumor microenvironment in KS is known to be composed of activated B cells and tumor-associated macrophages (Joest et al., 2020). More recently, it has been shown that KS, along with other sarcoma subtypes, displayed the highest response rates to ICIs and longest survival, with PD-L1 expression of \u0026ge;1% being associated with increased response to ICIs (Lee et al., 2024).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003e3.3 Pathways activity analysis.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe pathway activity analysis based on Gene Ontology (GO) and Hallmarks revealed a reduction in epithelial differentiation, including keratinocyte differentiation and epithelial-mesenchymal transition (EMT), in lesions compared to controls (Figure \u003cstrong\u003e2D\u003c/strong\u003e). Additionally, decreases in metabolic processes such as lipid metabolism, cholesterol homeostasis, and response to estrogen were observed (Figure \u003cstrong\u003e2E\u003c/strong\u003e). This finding aligns with previous studies demonstrating that KSHV plays a significant role in cellular transformation and metabolic reprogramming, both of which are key factors in the pathogenesis of KS (Zhu et al., 2014; Tso et al., 2018). Moreover, the impaired EMT is particularly important in KS, as it promotes the formation of new blood vessels and supports tumor growth, emphasizing the complex relationship between these processes in the disease (Naipauer and Mesri 2023; Lacunza et al., 2024).\u003c/p\u003e\n\u003cp\u003eFurthermore, the analysis revealed higher levels of immune pathways\u0026mdash;such as adaptive immune responses, T cell differentiation, and B cell differentiation\u0026mdash;in the lesions of clusters C1 and C3 compared to cluster C4. KS lesions in cluster C3 exhibited more pronounced humoral immune related pathways than those in cluster C1, which corresponds with its higher proportion of B cells (Figure \u003cstrong\u003e2D\u003c/strong\u003e, Figure \u003cstrong\u003eS3\u003c/strong\u003e). Since KSHV has a known tropism for B cells and can infect them, leading to lymphoproliferative lesions, cluster C3, enriched in B cell lineages, may reflect this interaction (Rappocciolo et al., 2008; Ballon et al., 2011; Nicol et al., 2016). However, the transcriptomic data does not clarify whether these B cells are directly infected by KSHV or are merely part of the recruited immune response.\u003c/p\u003e\n\u003cp\u003eLesions in cluster C4 displayed a pattern more similar to non-tumor controls, showing lower enrichment levels across most pathways compared to clusters C1 and C3 (Figure \u003cstrong\u003e2D\u003c/strong\u003e). However, these lesions exhibited enrichment in pathways related to innate antiviral responses, whereas tumors in clusters C1 and C3 were associated with broader and more robust immune responses (Figure \u003cstrong\u003e2D\u003c/strong\u003e).\u003c/p\u003e\n\u003cp\u003eCluster C1 exhibited lesions with a molecular pattern of pathway activation characteristic of KS, marked by increased proliferative activity (e.g., cell cycle, mitotic spindle) and activation of the PI3K/Akt/mTOR and IL6/JAK/STAT3 pathways (Figure 2\u003cstrong\u003eD\u003c/strong\u003e, \u003cstrong\u003eE\u003c/strong\u003e). Both clusters C1 and C3 also displayed angiogenic activity (Figure 2\u003cstrong\u003eD, E\u003c/strong\u003e). These differences in immune and pathways profiling may have important implications for disease progression and patient response.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003e3.4 Gene expression profiling analysis.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eTo identify the representative genes or signatures associated with the functional processes in each lesion cluster, we conducted differential expression analysis, comparing controls (C2) with lesions (C1, C3, and C4), as well as among the lesion groups themselves. Cluster C1 exhibited the highest number of dysregulated genes compared to non-tumor samples, with the majority of KSHV genes expressed (Figure \u003cstrong\u003eS4\u003c/strong\u003e, Table \u003cstrong\u003eS1\u003c/strong\u003e). KSHV genes \u003cem\u003eK15\u003c/em\u003e, \u003cem\u003eK13, ORF72, ORF75, vIRF-2\u003c/em\u003e, and \u003cem\u003evIRF-4\u003c/em\u003e were among the top upregulated genes. Among the numerous affected host genes, those linked to extracellular matrix remodeling stood out, including several metalloproteinases and collagens (e.g \u003cem\u003eMMP9-17, ADAMTS54-S7, COL4A1\u003c/em\u003e), along with significant genes such as \u003cem\u003eITGB1, FLT1, FLT2, PDGFA, PDGFB, AKT3, ANGPT2, CXCL10, ROCK2, IFNG\u003c/em\u003e, and \u003cem\u003eCXCR4\u003c/em\u003e, all of which are associated with processes relevant to KS, such as endothelial differentiation, angiogenesis, the PI3K/Akt pathway, and cytokine/chemokine cascades (Cesarman et al., 2019). Additionally, several genes related to cell proliferation (e.g. \u003cem\u003ePLK1\u003c/em\u003e, \u003cem\u003eBUB1B\u003c/em\u003e, \u003cem\u003eTTK\u003c/em\u003e) and T cell lymphocytes (e.g. CD86, CD274, CCR7) were also upregulated. There was also a marked decrease in the expression of several genes linked to protein biosynthesis (ribosomal proteins), cellular respiration (e.g. \u003cem\u003eNDUFA1\u003c/em\u003e, \u003cem\u003eNDUFA3,\u003c/em\u003e \u003cem\u003eCYC1\u003c/em\u003e), and lipid metabolism (\u003cem\u003eADIPOQ\u003c/em\u003e, \u003cem\u003eACOX1\u003c/em\u003e, \u003cem\u003eACOX2\u003c/em\u003e, \u003cem\u003eCYP4B1\u003c/em\u003e) This underscores the substantial impact of KSHV on host cell metabolism, particularly under hypoxic conditions (Delgado et al., 2010; Singh et al., 2018; M\u0026eacute;ndez-Sol\u0026iacute;s et al., 2021).\u003c/p\u003e\n\u003cp\u003eWhen comparing cluster C3 with controls (C2), a similar trend emerged, with a high expression of KSHV genes and a notable impact on host genes, though to a lesser extent than in cluster C1 (Figure \u003cstrong\u003eS4\u003c/strong\u003e, Table \u003cstrong\u003eS1\u003c/strong\u003e). In cluster C3, there was a prominent enrichment of genes associated with lymphocyte activation and proliferation (e.g., \u003cem\u003eCD80\u003c/em\u003e, \u003cem\u003eITK\u003c/em\u003e, \u003cem\u003eCD84\u003c/em\u003e, \u003cem\u003eCD7\u003c/em\u003e, \u003cem\u003eCD28\u003c/em\u003e, \u003cem\u003eCD247\u003c/em\u003e, \u003cem\u003eTLR9\u003c/em\u003e, \u003cem\u003eLYN\u003c/em\u003e, \u003cem\u003eCD22\u003c/em\u003e). Cluster C4 also exhibited expression of most KSHV genes when compared to non-tumor controls; however, the host gene enrichment was primarily linked to the viral infection response (e.g., \u003cem\u003eIFIT1\u003c/em\u003e, \u003cem\u003eIFIT2\u003c/em\u003e, \u003cem\u003eIFIT3\u003c/em\u003e, \u003cem\u003eIRF1\u003c/em\u003e, \u003cem\u003eIRF7\u003c/em\u003e, \u003cem\u003eIF1\u003c/em\u003e6) (Table \u003cstrong\u003eS1\u003c/strong\u003e).\u003c/p\u003e\n\u003cp\u003eThe comparison between lesion clusters revealed distinct differences. In the cluster C1 vs. cluster C3 analysis, cluster C1 exhibits gene expression profiles more like KS tumors, specifically angiogenic genes (e.g. \u003cem\u003eTX1, ANGPTL3, PROX1, SCUBE1, KDR, ROBO4, PDGFB, SERPINA5\u003c/em\u003e), mitotic genes (e.g. \u003cem\u003eCENPE, CDK1, KIF3C, BRCA2, TOP2A\u003c/em\u003e, etc.), and remodeling matrix components including metalloproteinases\u003csup\u003e\u0026nbsp;\u003c/sup\u003e(Lacunza et al., 2024). Conversely, cluster C3 was distinguished by greater differentiation (higher keratin expression) and increased metabolic activity, evidenced by a higher expression of lipid metabolism-related genes and xenobiotic metabolism genes with activation of numerous cytochrome P450 genes (Table \u003cstrong\u003eS1\u003c/strong\u003e).\u003c/p\u003e\n\u003cp\u003eThe comparison between clusters C1 and C4 reveals distinct metabolic and immune response profiles in the lesions. Cluster C4 shows elevated expression of oxidative and xenobiotic metabolism genes, along with ERBB signaling genes (e.g., \u003cem\u003eERBB2\u003c/em\u003e, \u003cem\u003eERBB3\u003c/em\u003e, \u003cem\u003eERBB4\u003c/em\u003e, \u003cem\u003eEGFR\u003c/em\u003e, \u003cem\u003eEREG\u003c/em\u003e), suggesting an adaptive response to environmental stressors and a focus on cellular survival and proliferation (Table \u003cstrong\u003eS1\u003c/strong\u003e). This points to potential metabolic reprogramming that could support tumorigenesis. The ERBB family of kinases has recently been implicated in regulating the KSHV latent-to-lytic replication switch, emphasizing their role in KSHV-related pathogenesis (Olson et al., 2023). In contrast, cluster C1 was marked by a strong activation of immune-related genes, particularly those associated with lymphocyte activation and inflammatory responses (e.g., \u003cem\u003eCXCL8\u003c/em\u003e, \u003cem\u003eCXCL13\u003c/em\u003e, \u003cem\u003eCXCL2\u003c/em\u003e, \u003cem\u003eIL6\u003c/em\u003e, \u003cem\u003eIFNG\u003c/em\u003e, \u003cem\u003eIL10\u003c/em\u003e, \u003cem\u003eCCR5\u003c/em\u003e, \u003cem\u003eCXCR4\u003c/em\u003e), indicating a robust immune environment potentially engaged in combating the tumor (Table \u003cstrong\u003eS1\u003c/strong\u003e).\u003c/p\u003e\n\u003cp\u003eAdditionally, the comparison of clusters C3 and C4 revealed a lower number of differentially expressed genes (Figure \u003cstrong\u003eS2\u003c/strong\u003e). C3 exhibited higher expression of genes linked to T and B cell activation (e.g., \u003cem\u003eCD84\u003c/em\u003e, \u003cem\u003eCD86\u003c/em\u003e, \u003cem\u003eCD70\u003c/em\u003e, \u003cem\u003eCD79A\u003c/em\u003e, \u003cem\u003eCD79B\u003c/em\u003e,\u003cem\u003e\u0026nbsp;IL10\u003c/em\u003e), while C4 showed increased expression of genes involved in antiviral responses (e.g., \u003cem\u003eIFI44\u003c/em\u003e, \u003cem\u003eIFIH1\u003c/em\u003e, \u003cem\u003eIFI27\u003c/em\u003e, \u003cem\u003eRSAD2\u003c/em\u003e, \u003cem\u003eXAF1\u003c/em\u003e, \u003cem\u003eOAS1\u003c/em\u003e, \u003cem\u003eOAS2\u003c/em\u003e, \u003cem\u003eOAS3\u003c/em\u003e) (Table \u003cstrong\u003eS1\u003c/strong\u003e).\u003c/p\u003e\n\u003cp\u003eIn summary, the transcriptomic and functional analysis identifies three distinct groups of KS. Cluster C1, primarily composed of endemic cases with some epidemic ones, represents tumors characterized by poorly differentiated, endothelial, and proliferative phenotypes. This cluster also showed enrichment of the PI3K/mTOR pathway and a strong presence of memory CD4 T lymphocytes in the immune fraction. Cluster C3, composed of predominantly epidemic cases, showed greater metabolic activity and was enriched in B cells. In contrast, cluster C4 had the least immune infiltration and highest differentiation, with strong activation of antiviral defense genes and the macular morphotype, typical of early-stage KS (Cesarman et al., 2019).\u003c/p\u003e\n\u003cp\u003eOverall, consistent with the findings of Lidenge et al. (2020), which showed that endemic and epidemic KS transcriptomes are highly similar, with enhanced dysregulation of tumorigenesis and immune functions in endemic KS, we observed a segregation between endemic (C1) and epidemic (C3 and C4) cases. This distinction was driven by the magnitude and the specific subsets of expression changes. C1 (primarily endemic) displayed stronger activation of angiogenesis, endothelial differentiation, immune-related genes, cell proliferation, and tumorigenic pathways such as PI3K/Akt/mTOR. In contrast, C3 (epidemic) exhibited higher metabolic activity, with increased expression of genes related to lipid and xenobiotic metabolism, while C4 showed features typical of an early-stage viral response, including activation of antiviral defense pathways. These differences highlight functional divergence between C1, C3, and C4, supporting the clustering of endemic and epidemic KS lesions based on distinct immune and metabolic signatures.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003e3.5 KSHV transcriptome analysis.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eTo evaluate the relationship between the clusters identified from the whole transcriptome and KSHV expression levels while minimizing variability from host gene expression, we chose to analyze the KSHV transcriptome alone between clusters. This approach clarified the viral contributions to the disease and helped to determine if different viral expressions correlate with the observed clusters. First, it is important to note that there were no significant differences in overall KSHV expression levels (measured by total mapped KSHV reads, TMKR) across the different cohort populations (Argentina, Tanzania/Zambia, and Uganda; p\u0026gt;0.05; Figure \u003cstrong\u003eS5\u003c/strong\u003e). However, when comparing the clusters, cluster C1 contained tumors with the highest levels of KSHV expression, as indicated by the TMKR, compared to clusters C3 and C4 (p\u0026lt;0.01; Figure 3\u003cstrong\u003eA\u003c/strong\u003e). This difference was further supported by the elevated levels of LANA in cluster C1 (Figure 3\u003cstrong\u003eA\u003c/strong\u003e). The analysis of differentially expressed genes among the lesion clusters confirmed that cluster C1 exhibited the highest expression levels of genes such as \u003cem\u003eORF72\u003c/em\u003e, \u003cem\u003eORF73\u003c/em\u003e (\u003cem\u003eLANA\u003c/em\u003e), \u003cem\u003eORF74\u003c/em\u003e, \u003cem\u003eORF75\u003c/em\u003e, as well as \u003cem\u003eK12\u003c/em\u003e, \u003cem\u003eK13\u003c/em\u003e, and \u003cem\u003eK15\u003c/em\u003e, which were among the most highly expressed transcripts across all clusters. These genes have been implicated in various functions, including immune evasion, angiogenesis, cell proliferation, and apoptosis regulation. LANA is known to stabilize the viral genome within host cells, facilitating persistence and contributing to tumorigenesis (Uppal et al., 2014; Nakajima et al; 2024). \u003cem\u003eORF74\u003c/em\u003e is a viral GPCR that binds various human chemokines (\u003cem\u003eCXCL1\u003c/em\u003e, \u003cem\u003eCXCL8\u003c/em\u003e) contributing to immune evasion and consequently viral dissemination (de Munnik et al., 2015; Medina et al., 2020). Similarly, \u003cem\u003eK12\u003c/em\u003e and \u003cem\u003eK13\u003c/em\u003e play roles in modulating host immune responses (Liang et al., 2008), suggesting that the elevated expression of these genes in cluster C1 may correlate with more aggressive tumor behavior or enhanced viral persistence. Although no significant differences were observed in the total mapped reads between clusters C3 and C4 (Figure \u003cstrong\u003e3A\u003c/strong\u003e), these clusters were differentiated by the expression of specific genes, including LANA and RTA (\u003cem\u003eORF50\u003c/em\u003e). Additionally, lytic genes such as the viral Bcl-2 (\u003cem\u003eORF16\u003c/em\u003e), which inhibits apoptosis, the protease (\u003cem\u003eORF17\u003c/em\u003e) associated with capsid maturation, and the gene expression modulator (\u003cem\u003eORF18\u003c/em\u003e) were all more abundantly expressed in C3 compared to C4 (Figure \u003cstrong\u003e3B\u003c/strong\u003e; Table \u003cstrong\u003eS2\u003c/strong\u003e).This suggests that KSHV is contributing to the intrinsic transcriptomic differences among these clusters of epidemic tumors, potentially influencing their pathogenic behavior and responses to treatment. The distinct expression profiles of these genes highlight the complex interplay between viral replication and host cell dynamics in KS.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eWe conducted further correlation analyses to examine the relationship between viral tissue load (quantified via TMKR) and KS morphotypes. The macular group exhibited a significantly lower viral load compared to the plaque and nodular groups (Figure \u003cstrong\u003eS5\u003c/strong\u003e). Although no significant differences were found between the plaque, nodular, and fungating groups, an increasing trend in mean TMKR values was observed from macular to fungating lesions (Figure \u003cstrong\u003eS5\u003c/strong\u003e). These findings correlate with the observation that cluster C4, primarily composed of macular lesions, exhibited lower viral load, consistent with the early stage of these lesions. This suggests an association between viral load and lesion stage in KSHV-driven tumorigenesis. We also explored the relationship between immune infiltration patterns and viral loads across clusters (Figure \u003cstrong\u003eS5)\u003c/strong\u003e. Linear regression analysis revealed that higher KSHV loads were associated with a decrease in na\u0026iuml;ve CD4+ T cells and an increase in memory CD4+ T cells, particularly in Cluster C1. A similar trend, though weaker, was observed for CD8+ T cells, with memory CD8+ T cells showing a positive association with KSHV load. No significant associations were found between KSHV load and B cell subsets, suggesting that B cell enrichment in cluster C3 may be an intrinsic feature of these lesions, independent of KSHV load.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eWhile our study provides valuable insights into the molecular diversity of KS, several limitations should be acknowledged. First, RNA-seq data, while powerful for transcriptomic profiling, may not fully capture the presence and spatial distribution of immune cells within the tumor microenvironment. Immune profile algorithms infer immune cell composition based on gene expression rather than direct cellular localization, which may not reflect the actual immune cell presence in KS lesions. Additionally, although our findings align with and expand upon previous transcriptomic studies that identified varying KS subtypes based on proliferative, angiogenic, or metabolic pathways, the classification of KS lesions remains complex. The number of subtypes identified across studies has ranged from two to five, highlighting the heterogeneity of KS tumors (Tso et al., 2018; Rose et al., 2018; Lidenge et al., 2020; Ramaswami et al., 2023; Moorad et al., 2023). Despite these limitations, our study\u0026rsquo;s strength lies in integrating diverse cohorts, providing a more comprehensive view of KS pathology across different populations. Further validation, including spatial transcriptomics, will be necessary to refine our understanding of KS molecular landscapes and their therapeutic implications.\u003c/p\u003e"},{"header":"4 CONCLUSIONS","content":"\u003cp\u003eIn summary, this is the first study to integrate transcriptomic profiles of various cohorts of KS. The RNA-seq profile generated from the Argentinian KS samples compiled with previously published datasets as well as their associated metadata and predicted pathway activities, immune infiltrate, host and viral gene expression profiles are shared with the research community through the development of a KS-omics resource leveraging UCSC Xena browser. This resource will allow biologists without bioinformatics knowledge to explore and correlate the host and viral transcriptomic profiles in a compiled and harmonized dataset to develop novel biological insights and discover biomarkers.\u003c/p\u003e \u003cp\u003eOur comprehensive analysis enabled us to define and characterize three distinct lesion groups that reflect both the biology of the virus and the intrinsic molecular characteristics of the lesions. Notably, while endemic lesions were primarily concentrated in cluster C1, epidemic lesions were distributed across all three clusters.\u003c/p\u003e \u003cp\u003eEpidemic lesions in clusters C3 and C4 did not show significant differences in overall KSHV expression but differed in their intrinsic functional molecular characteristics. Cluster C3 exhibited a high level of B cell infiltration and ICIs expression, while cluster C4 was characterized by an enrichment of genes involved in early response against viral infection. Whether these different clusters imply differences in treatment approaches or response, it still needs to be explored.\u003c/p\u003e \u003cp\u003eThus, we defined three groups of KS: endothelial, proliferative and well-defined KS tumors (C1); B-cell-enriched and metabolic tumors (C3); and less immunogenic, early-stage KS tumors (C4). Overall, our findings provide valuable insights into the molecular diversity of KS and underscore the potential for tailored therapeutic strategies based on lesion characteristics.\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eData availability\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe raw data have been submitted to NCBI GEO database with accession number GSE271303. The rest of the data are available from the corresponding author upon reasonable request.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAcknowledgements\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe authors want to acknowledge the invaluable contributions from all study participants and from all the research team at Fundaci\u0026oacute;n Hu\u0026eacute;sped, where participants were recruited.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAuthors contributions\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eEL: Investigation, formal analysis, writing the article. VF, MIF: Resources and logistics for sample and clinical data collection. MES, AMG, SW: Methodology, research assistance, clinical data of participants. MJG and JZ: UCSC Xena. JN and EC: Advised on the study and revised the manuscript. PC, OC, JCR: Resources. MCA: Conceived the study, supervision, formal analysis, writing the article. All authors read and approved the final manuscript.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eFunding Declaration\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThis work was supported by the National Cancer Institute of the National Institutes of Health under Award Number U54CA221208 (OC and JCR), P30CA240139 (JCR), 5U24CA258407 (UCSC Xena Browser), and the National University of La Plata M250 I+D grant (MCA).\u003cstrong\u003e\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eEthics declarations\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eMJG is a consultant for the Fred Hutchinson Cancer Research Center. The other authors declare no competing interests.\u003c/p\u003e"},{"header":"References","content":"\u003col start=\"1\" type=\"1\"\u003e\n \u003cli\u003eAIDS-defining Cancer Project Working Group for IeDEA and COHERE in EuroCoord. Comparison of Kaposi Sarcoma Risk in Human Immunodeficiency Virus-Positive Adults Across 5 Continents: A Multiregional Multicohort Study. Clin Infect Dis. 2017 Oct 15;65(8):1316-1326. doi: 10.1093/cid/cix480. PMID: 28531260; PMCID: PMC5850623\u003c/li\u003e\n \u003cli\u003eBallon G, Chen K, Perez R, Tam W, Cesarman E. 2011. Kaposi sarcoma herpesvirus (KSHV) vFLIP oncoprotein induces B cell transdifferentiation and tumorigenesis in mice. J Clin Invest 121:1141\u0026ndash;1153. doi: 10.1172/JCI44417.\u003c/li\u003e\n \u003cli\u003eBecht, E. et al. Estimating the population abundance of tissue-infiltrating immune and stromal cell populations using gene expression. Genome Biol. 17, 218 (2016).\u003c/li\u003e\n \u003cli\u003eCesarman E, Damania B, Krown SE, Martin J, Bower M, Whitby D. Kaposi sarcoma. Nature reviews Disease primers. 2019 Jan 31;5(1):9.\u003c/li\u003e\n \u003cli\u003eDamania B, Dittmer DP. Today\u0026apos;s Kaposi sarcoma is not the same as it was 40 years ago, or is it? J Med Virol. 2023 May;95(5):e28773. doi: 10.1002/jmv.28773. PMID: 37212317; PMCID: PMC10266714.\u003c/li\u003e\n \u003cli\u003ede Munnik SM, Kooistra AJ, van Offenbeek J, Nijmeijer S, de Graaf C, Smit MJ, Leurs R, Vischer HF. The Viral G Protein-Coupled Receptor ORF74 Hijacks \u0026beta;-Arrestins for Endocytic Trafficking in Response to Human Chemokines. PLoS One. 2015 Apr 20;10(4):e0124486. doi: 10.1371/journal.pone.0124486.\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eDelgado T, Carroll PA, Punjabi AS, Margineantu D, Hockenbery DM, et al. (2010) Induction of the Warburg effect by Kaposi\u0026rsquo;s sarcoma herpesvirus is required for the maintenance of latently infected endothelial cells. Proc Natl Acad Sci U S A 107: 10696\u0026ndash;10701. pmid:20498071\u003c/li\u003e\n \u003cli\u003eDittmer DP, Damania B. Kaposi sarcoma\u0026ndash;associated herpesvirus: immunobiology, oncogenesis, and therapy. The Journal of clinical investigation. 2016 Sep 1;126(9):3165-75.\u003c/li\u003e\n \u003cli\u003eGoldman, M.J., Craft, B., Hastie, M. et al. Visualizing and interpreting cancer genomics data via the Xena platform. Nat Biotechnol (2020). https://doi.org/10.1038/s41587-020-0546-8\u003c/li\u003e\n \u003cli\u003eHaverkos HW, Drotman DP. Prevalence of Kaposi\u0026apos;s sarcoma among patients with AIDS. The New England Journal of Medicine. 1985;312(23):1518.\u003c/li\u003e\n \u003cli\u003eHutchison LM, Boscoe FP, Feingold BJ. Cancers Disproportionately Affecting the New York State Transgender Population, 1979-2016. Am J Public Health. 2018 Sep;108(9):1260-1262.\u003c/li\u003e\n \u003cli\u003eJoest B, Kempf W, Berisha A, Peyk P, Tronnier M, Mitteldorf C. Stage-related PD-L1 expression in Kaposi sarcoma tumor microenvironment. J Cutan Pathol. 2020 Oct;47(10):888-895. doi: 10.1111/cup.13716.\u003c/li\u003e\n \u003cli\u003eLacunza E, Ahuja A, Coso OA, Abba M, Ramos JC, Cesarman E, Mesri EA, Naipauer J. Unveiling the role of KSHV-infected human mesenchymal stem cells in Kaposi\u0026apos;s sarcoma initiation. J Med Virol. 2024 May;96(5):e29684. doi: 10.1002/jmv.29684. PMID: 38773828.\u003c/li\u003e\n \u003cli\u003eLee AQ, Hao C, Pan M, Ganjoo KN, Bui NQ. Histologic and immunologic factors associated with response to immune checkpoint inhibitors in advanced sarcoma. Clin Cancer Res. 2024 Dec 19. doi: 10.1158/1078-0432.CCR-24-3485.\u003c/li\u003e\n \u003cli\u003eLiang C, Lee JS, Jung JU. Immune evasion in Kaposi\u0026apos;s sarcoma-associated herpes virus associated oncogenesis. Semin Cancer Biol. 2008 Dec;18(6):423-36. doi: 10.1016/j.semcancer.2008.09.003. Epub 2008 Oct 2.\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eLidenge SJ, Tso FY, Ngalamika O, Kolape J, Ngowi JR, Mwaiselage J, Wood C, West JT. Lack of CD8+ T-cell co-localization with Kaposi\u0026apos;s sarcoma-associated herpesvirus infected cells in Kaposi\u0026apos;s sarcoma tumors. Oncotarget. 2020 Apr 28;11(17):1556-1572.\u003c/li\u003e\n \u003cli\u003eLidenge, S.J., Kossenkov, A.V., Tso, F.Y., Wickramasinghe, J., Privatt, S.R., Ngalamika, O., Ngowi, J.R., Mwaiselage, J., Lieberman, P.M., West, J.T. and Wood, C., 2020. Comparative transcriptome analysis of endemic and epidemic Kaposi\u0026rsquo;s sarcoma (KS) lesions and the secondary role of HIV-1 in KS pathogenesis. PLoS pathogens, 16(7), p.e1008681.\u003c/li\u003e\n \u003cli\u003eManimaran S, Selby HM, Okrah K, Ruberman C, Leek JT, Quackenbush J, Haibe-Kains B, Bravo HC, Johnson WE. BatchQC: interactive software for evaluating sample and batch effects in genomic data. Bioinformatics. 2016 Dec 15;32(24):3836-3838. doi: 10.1093/bioinformatics/btw538.\u003c/li\u003e\n \u003cli\u003eMedina MV, D Agostino A, Ma Q, Eroles P, Cavallin L, Chiozzini C, Sapochnik D, Cymeryng C, Hyjek E, Cesarman E, Naipauer J, Mesri EA, Coso OA. KSHV G-protein coupled receptor vGPCR oncogenic signaling upregulation of Cyclooxygenase-2 expression mediates angiogenesis and tumorigenesis in Kaposi\u0026apos;s sarcoma. PLoS Pathog. 2020 Oct 15;16(10):e1009006. doi: 10.1371/journal.ppat.1009006.\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eM\u0026eacute;ndez-Sol\u0026iacute;s O, Bendjennat M, Naipauer J, Theodoridis PR, Ho JJD, Verdun RE, Hare JM, Cesarman E, Lee S, Mesri EA. Kaposi\u0026apos;s sarcoma herpesvirus activates the hypoxia response to usurp HIF2\u0026alpha;-dependent translation initiation for replication and oncogenesis. Cell Rep. 2021 Dec 28;37(13):110144. doi: 10.1016/j.celrep.2021.110144. PMID: 34965440; PMCID: PMC9121799.\u003c/li\u003e\n \u003cli\u003eMesri EA, Cesarman E, Boshoff C. Kaposi\u0026apos;s sarcoma and its associated herpesvirus. Nature Reviews Cancer. 2010 Oct;10(10):707-19.\u003c/li\u003e\n \u003cli\u003eMoorad R, Kasonkanji E, Gumulira J, Gondwe Y, Dewey M, Pan Y, Peng A, Pluta LJ, Kudowa E, Nyasosela R, Tomoka T, Tweya H, Heller T, Gugsa S, Phiri S, Moore DT, Damania B, Painschab M, Hosseinipour MC, Dittmer DP. A prospective cohort study identifies two types of HIV+ Kaposi Sarcoma lesions: proliferative and inflammatory. Int J Cancer. 2023 Dec 15;153(12):2082-2092. doi: 10.1002/ijc.34689.\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eNaipauer J, Mesri EA. The Kaposi\u0026apos;s sarcoma progenitor enigma: KSHV-induced MEndT-EndMT axis. Trends Mol Med. 2023 Mar;29(3):188-200. doi: 10.1016/j.molmed.2022.12.003. Epub 2023 Jan 10. PMID: 36635149; PMCID: PMC9957928.\u003c/li\u003e\n \u003cli\u003eNakajima K-i, Inagaki T, Espera JM, Izumiya Y. Kaposi\u0026apos;s sarcoma-associated herpesvirus (KSHV) LANA prevents KSHV episomes from degradation. J Virol. 2024 Feb 20;98(2):e0126823. doi: 10.1128/jvi.01268-23.\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eNicol SM, Sabbah S, Brulois KF, Jung JU, Bell AI, Hislop AD. Primary B Lymphocytes Infected with Kaposi\u0026apos;s Sarcoma-Associated Herpesvirus Can Be Expanded In Vitro and Are Recognized by LANA-Specific CD4+ T Cells. J Virol. 2016 Mar 28;90(8):3849-3859. doi: 10.1128/JVI.02377-15.\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eOlson AT, Kang Y, Ladha AM, Zhu S, Lim CB, Nabet B, Lagunoff M, Gujral TS, Geballe AP. Polypharmacology-based kinome screen identifies new regulators of KSHV reactivation. PLoS Pathog. 2023 Sep 5;19(9):e1011169. doi: 10.1371/journal.ppat.1011169. PMID: 37669313; PMCID: PMC10503724.\u003c/li\u003e\n \u003cli\u003ePetitprez F, Vano YA, Becht E, Giraldo NA, de Reyni\u0026egrave;s A, Saut\u0026egrave;s-Fridman C, Fridman WH. Transcriptomic analysis of the tumor microenvironment to guide prognosis and immunotherapies. Cancer Immunol Immunother. 2018 Jun;67(6):981-988. doi: 10.1007/s00262-017-2058-z.\u003c/li\u003e\n \u003cli\u003eRamaswami R, Tagawa T, Mahesh G, Serquina A, Koparde V, Lurain K, Dremel S, Li X, Mungale A, Beran A, Ohler ZW. Transcriptional landscape of Kaposi sarcoma tumors identifies unique immunologic signatures and key determinants of angiogenesis. Journal of Translational Medicine. 2023 Sep 22;21(1):653.\u003c/li\u003e\n \u003cli\u003eRappocciolo G, Hensler HR, Jais M, Reinhart TA, Pegu A, Jenkins FJ, Rinaldo CR. 2008. Human herpesvirus 8 infects and replicates in primary cultures of activated B lymphocytes through DC-SIGN. J Virol 82:4793\u0026ndash;4806. doi: 10.1128/JVI.01587-07.\u003c/li\u003e\n \u003cli\u003eRose TM, Bruce AG, Barcy S, Fitzgibbon M, Matsumoto LR, Ikoma M, Casper C, Orem J, Phipps W. Quantitative RNAseq analysis of Ugandan KS tumors reveals KSHV gene expression dominated by transcription from the LTd downstream latency promoter. PLoS Pathogens. 2018 Dec 17;14(12):e1007441.\u003c/li\u003e\n \u003cli\u003eSingh RK, Lang F, Pei Y, Jha HC, Robertson ES. Metabolic reprogramming of Kaposi\u0026apos;s sarcoma associated herpes virus infected B-cells in hypoxia. PLoS Pathog. 2018 May 10;14(5):e1007062. doi: 10.1371/journal.ppat.1007062. PMID: 29746587; PMCID: PMC5963815.\u003c/li\u003e\n \u003cli\u003eTso FY, Kossenkov AV, Lidenge SJ, Ngalamika O, Ngowi JR, Mwaiselage J, Wickramasinghe J, Kwon EH, West JT, Lieberman PM, Wood C. RNA-Seq of Kaposi\u0026rsquo;s sarcoma reveals alterations in glucose and lipid metabolism. PLoS pathogens. 2018 Jan 19;14(1):e1006844.\u003c/li\u003e\n \u003cli\u003eUldrick TS, Whitby D. Update on KSHV epidemiology, Kaposi Sarcoma pathogenesis, and treatment of Kaposi Sarcoma. Cancer letters. 2011 Jun 28;305(2):150-62.\u003c/li\u003e\n \u003cli\u003eUppal T, Banerjee S, Sun Z, Verma SC, Robertson ES. KSHV LANA--the master regulator of KSHV latency. Viruses. 2014 Dec 11;6(12):4961-98. doi: 10.3390/v6124961. PMID: 25514370; PMCID: PMC4276939.\u003c/li\u003e\n \u003cli\u003eZhang Y, Parmigiani G, Johnson WE. ComBat-seq: batch effect adjustment for RNA-seq count data. NAR genomics and bioinformatics. 2020 Sep 1;2(3):lqaa078.\u003c/li\u003e\n \u003cli\u003eZhu, Y, da Silva, S.R. \u0026amp; Gao, SJ. Metabolic reprogramming by KSHV in cellular transformation. Cancer Metab 2 (Suppl 1), P88 (2014). https://doi.org/10.1186/2049-3002-2-S1-P88\u003c/li\u003e\n\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":true,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":"Kaposi's Sarcoma, RNAseq, Transcriptome, Xena browser","lastPublishedDoi":"10.21203/rs.3.rs-6146471/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-6146471/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eKaposi sarcoma (KS) is an AIDS-defining cancer and a significant global health challenge caused by KS-associated herpesvirus (KSHV). NGS-based approaches have profiled KS lesions in a minimal number of studies compared with other neoplastic diseases. Here we present a compiled and harmonized dataset of 131 KS and non-tumor cutaneous samples in the context of their predicted pathway activities, immune infiltrate, KSHV and HIV gene expression profiles, and their associated clinical data representing patient populations from Argentina, United States (USA), and Sub-Saharan Africa cohorts. RNA-seq data from 9 Argentinian KS lesions were generated and integrated with previously published datasets derived from the USA and sub-Saharan African cohorts from Tanzania, Zambia, and Uganda. An unsupervised analysis of 131 KS-related samples allowed us to identify four KS clusters based on their host and KSHV gene expression profiles, immune infiltrate, and the activity of specific signaling pathways. The compiled RNA-seq profile is shared with the research community through the UCSC Xena browser for further visualization, download, and analysis \u003cu\u003e(https://kaposi.xenahubs.net/\u003c/u\u003e). These resources will allow biologists without bioinformatics knowledge to explore and correlate the host and viral transcriptome in a curated dataset of different KS RNA-seq-based cohorts, which can lead to novel biological insights and biomarker discovery.\u003c/p\u003e","manuscriptTitle":"Integrative Functional Genomics Analysis of Kaposi Sarcoma Cohorts","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2025-03-11 11:41:22","doi":"10.21203/rs.3.rs-6146471/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"d8186b60-0320-43c9-8b16-f2e03e43f767","owner":[],"postedDate":"March 11th, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"posted","subjectAreas":[],"tags":[],"updatedAt":"2025-03-27T23:53:08+00:00","versionOfRecord":[],"versionCreatedAt":"2025-03-11 11:41:22","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-6146471","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-6146471","identity":"rs-6146471","version":["v1"]},"buildId":"8U1c8b4HqxoKbykW_rLl7","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.