A physics informed neural network approach to quantify antigen presentation activities at single cell level using omics data

doi:10.21203/rs.3.rs-5629379/v1

A physics informed neural network approach to quantify antigen presentation activities at single cell level using omics data

2025 · doi:10.21203/rs.3.rs-5629379/v1

preprint OA: closed CC-BY-4.0

📄 Open PDF Full text JSON View at publisher

Full text 214,826 characters · extracted from preprint-html · click to expand

A physics informed neural network approach to quantify antigen presentation activities at single cell level using omics data | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Article A physics informed neural network approach to quantify antigen presentation activities at single cell level using omics data Chi Zhang, Jia Wang, Pengtao Dang, Yuhui Wei, Xiao Wang, Julie Brothwell, and 9 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-5629379/v1 This work is licensed under a CC BY 4.0 License Status: Under Review Version 1 posted You are reading this latest preprint version Abstract Antigen processing and presentation via major histocompatibility complex (MHC) molecules are central to immune surveillance. Yet, quantifying the dynamic activity of MHC class I and II antigen presentation remains a critical challenge, particularly in diseases like cancer, infection and autoimmunity where these pathways are often disrupted. Current methods fall short in providing precise, sample-specific insights into antigen presentation, limiting our understanding of immune evasion and therapeutic responses. Here, we present PSAA (PINN-empowered Systems Biology Analysis of Antigen Presentation Activity), which is designed to estimate sample-wise MHC class I and class II antigen presentation activity using bulk, single-cell, and spatially resolved transcriptomics or proteomics data. By reconstructing MHC pathways and employing pathway flux estimation, PSAA offers a detailed, stepwise quantification of MHC pathway activity, enabling predictions of gene-specific impacts and their downstream effects on immune interactions. Benchmarked across diverse omics datasets and experimental validations, PSAA demonstrates a robust prediction accuracy and utility across various disease contexts. In conclusion, PSAA and its downstream functions provide a comprehensive framework for analyzing the dynamics of MHC antigen presentation using omics data. By linking antigen presentation to immune cell activity and clinical outcomes, PSAA both elucidates key mechanisms driving disease progression and uncovers potential therapeutic targets. Biological sciences/Computational biology and bioinformatics/Machine learning Biological sciences/Computational biology and bioinformatics/Computational models Biological sciences/Immunology/Antigen processing and presentation Biological sciences/Cancer/Cancer microenvironment Figures Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 Introduction Antigen processing and presentation are fundamental mechanisms within the immune system, wherein intact protein antigens undergo degradation to produce peptide fragments that are loaded onto major histocompatibility complex (MHC) molecules and presented on the cell surface of antigen-presenting cells (APCs), further facilitating recognition by T cells. Humans have two different classes of antigen presentation molecules: MHC class I (MHC-I) molecules mainly bind to endogenous antigens. MHC-I is expressed by all nucleated cells throughout the body and recognized by CD8 + cytotoxic T lymphocytes (CTLs). MHC class II (MHC-II) is primarily expressed by professional APCs, mainly bind to exogenous antigens, and are recognized by CD4 + cells. Functional variations in MHC-I and MHC-II have been reported in different disease conditions and cell types. Immune evasion is a hallmark feature of cancer that enables the cancer cells to escape immune recognition and destruction and is often achieved through the reduction or loss of antigen presentation 1 – 3 . Immunotherapy, such as checkpoint blockade treatments, boosts T cell-mediated immune responses 4 – 6 . Immunotherapy relies on the recognition of tumor cells by CTLs, which recognize tumor cells based on tumor associated antigens (TAA) presented on surface of cancer cells via MHC-I 7 – 9 . Impairing this process will ultimately diminish or eliminate CD8 + T cell-mediated tumor cytotoxicity and cause treatment resistance. The MHC complex also plays a significant role in various other diseases, primarily those related to immune system dysfunction 10 , 11 . In autoimmune diseases such as rheumatoid arthritis, multiple sclerosis, and lupus, MHC-II antigen presentation dysregulation leads to the presentation of self-antigens to T cells, triggering an inappropriate immune response against the body's own tissues 12 . Some intracellular pathogens can evade immune responses by interfering with pathways involved in MHC-II antigen presentation 13 . Diseases characterized by chronic inflammation, such as inflammatory bowel disease (IBD), psoriasis, or even cancer, also involve dysregulated MHC-II antigen presentation, contributing to sustained immune activation and tissue damage 10 , 14 , 15 . While substantial efforts have been focused on studying the source and quality of antigens, the activity level and impairment of the MHC-I and MHC-II pathways have been understudied in disease-specific contexts 16 . To the best of our knowledge, there is a lack of a method that can accurately characterize the dyanamics of the MHC-I and MHC-II antigen presentations and their regulators using omics data. To fill this gap, we developed a computational method named P hysics Informed Neural Network (PINN)-empowered S ystems Biology A nalysis of A ntigen Presentation Activity ( PSAA ) to estimate the flux of MHC class I and II antigen presentation pathways at the individual sample or cell resolution using transcriptomics or proteomics data. PINN is a novel learning framework that combines data-driven machine learning models with governing physical laws, encoded as differential equations, to enhance predictions in complex biological systems 17 , 18 . By incorporating known biological constraints, PINNs allow for more accurate and interpretable modeling of biological processes than conventional data driven approaches using omics data. However, directly applying PINNs to estimate the dynamics of antigen presentation presents challenges due to the static nature of transcriptomics and proteomics data, which lack temporal dynamics and may not capture real-time regulatory fluctuations within MHC pathways. This limitation necessitates careful modeling assumptions to infer pathway activity from such static snapshots. PSAA was empowered by a new PINN framework. One neural network was trained for each reaction step in the MHC-I or -II to approximate its sample-wise flux rate by leveraging the goodness of fit of their underlying systems biology model with observed omics data. Specifically, the MHC-I and MHC-II pathways consist of large molecular processing reactions that can be treated as enzyme-catalyzed reactions, in which proteins, peptides, antigens, and MHC complexes across different steps and subcellular localizations act as intermediate substrates. In this context, the antigen source serves as the input, while the MHC complexes presented on the cell surface, or further T cell recognition levels, can be considered as the outcome of the pathway. Under this framework, the system operates under steady-state equilibrium, which PSAA leverages to estimate sample-wise MHC-I and MHC-II antigen presentation levels, as well as the impact of each gene involved in the antigen presentation approach for each individual sample. To develop PSAA, we first curated and reconstructed the MHC-I and MHC-II pathways by collecting genes involved in each reaction step from multiple pathway databases and literature. Based on the steady-state equilibrium hypothesis, we developed a new flux estimation and optimization method to assess the activity level of each step of the MHC-I and MHC-II pathways in each sample using omics data. PSAA, along with its downstream functions, offers several capabilities: (1) predicting the activity level of the entire process and individual biological steps of the MHC-I and MHC-II pathways in each sample using diverse types of transcriptomics or proteomics data, (2) assessing the impact of each individual gene or step on MHC-I and MHC-II antigen presentation, (3) evaluating the association of MHC-I and MHC-II antigen presentation with T cell activity and other biological or clinical features, and (4) inferring potential regulators or varied network structures of the MHC-I and MHC-II pathways in specific disease or tissue contexts. We rigorously benchmarked the prediction accuracy, robustness, and utility of PSAA using a broad spectrum of public omics data including bulk, single-cell, and spatially resolved transcriptomics or proteomics data of cancer, inflammatory, and neurodegenerative conditions. We also generated matched scRNA-seq and SRT data of bacterial infected human skins vs wound to further demonstrate the application of PSAA in analyzing varied antigen presentation mechanisms under different inflammatory conditions. To validate the prediction of PSAA, we conducted CRISPR knock-down experiments of key genes predicted by PSAA to amplify MHC-I antigen presentations in cancer. Results The overall framework of PSAA. PSAA is a new PINN-based model to quantify sample-wise MHC class I and II antigen presentation activity using transcriptomics or proteomics data. As illustrated in Fig. 1 a, PSAA takes bulk, single-cell, or spatially resolved data as the input to approximate the reaction rate of each step in the MHC-I or MHC-II antigen presentation pathways in each sample. PSAA utilizes a new optimization approach, namely message passing and supervised learning (MPSL), to compute the non-linear dependency between gene expression and reaction rate by leveraging (i) the flux balance of the reaction flows and (ii) the goodness of fit to the data. The output of PSAA provides the sample-wise reaction rates for each step in the selected MHC-I or MHC-II pathway. These rates could be directly used to explore potential associations with other molecular or clinical features, to study the functional role or clinical implications of antigen presentation in the disease context. PSAA also enables downstream analysis, including: (1) evaluating the differences in MHC-I or MHC-II antigen presentation across different disease contexts and their correlations with other features, (2) conducting in-silico perturbation analysis to identify genes with the greatest context-specific impact on the antigen presentation pathway to highlight potential drug targets, (3) performing antigen presentation and recognition-informed spatial segmentation for spatially resolved transcriptomics (SRT) data to analyze interactions between APCs and T cells, and (4) inferring with network structure and potential regulators of antigen presentation and T cell recognition. Reconstruction of MHC-I and MHC-II antigen presentation pathways. To accurately estimate the activity level of antigen processing and presentation, we first reconstructed the MHC-I and MHC-II pathways and related branches, ensuring they recapitulate all molecules or their complexes serving as sources, products, and facilitators of all the involved reactions (Fig. 1 b, c). Although databases such as the Kyoto Encyclopedia of Genes and Genomes 19 (KEGG), REACTOME 20 , and GO 21 provide sets of genes related to antigen presentation, to the best of our knowledge, there is a lack of fully annotated MHC-I and MHC-II pathways that comprehensively cover the individual antigen processing, presentation, and salvage steps for systems biology analysis. Thus, we first collected genes that are involved in the MHC-I and MHC-II pathways and related biological processes via integrating the pathways annotated in these databases and further curated the integrated pathway by an extensive literature search 7 , 22 – 24 . Our reconstructed MHC-I pathway consists of eight reaction steps, namely ubiquitination, proteasomal degradation, the action of the transporter associated with antigen processing (TAP), chaperone-assisted assembly of the MHC class I complex, transit from the endoplasmic reticulum (ER) to the Golgi apparatus, and subsequent exocytosis from the Golgi to the cell membrane. Additionally, auxiliary pathways such as de-ubiquitination, verification of MHC class I complex integrity, and recycling (salvage) of cell surface MHC-I complex through endocytosis were also added as branches in this pathway. The final MHC-I antigen presentation pathway includes 284 genes in eight reaction modules. The final MHC-II antigen presentation pathway consists of seven reaction modules, including uptake of extracellular proteins into cells, processing of internalized proteins in endosomal/lysosomal compartments, biosynthesis of MHC-II complex in the ER, transport of MHC-II from the ER to the Golgi, transport of MHC-II from the Golgi to Endosomes, association of MHC-II with antigen, and expression of peptide-MHC-II complexes on the cell surface. The reconstructed MHC-II pathway includes 124 genes in seven reaction modules. Notably, the reconstructed MHC-I and MHC-II pathways were optimized for a systems biology-model based estimation of pathway activity level, which have the following features: (1) the pathways cover the main antigen processing and presentation steps as well as side branches such as de-ubiquitination and recycling of cell surface MHC complexes; (2) the reaction steps within the reconstructed networks were optimized to reduce duplicated genes in adjacent steps; and (3) the reconstructed pathways were represented as directed factor graphs where each intermediate state of antigen or MHC complexes is a factor and each reaction step is a variable. The PINN model was further developed over the directed factor graphs to estimate the activity level of each reaction step using expression changes of the genes or proteins involved in the steps. Detailed pathway reconstructions are provided in Supplementary Methods , and the reconstructed MHC-I and MHC-II pathways are given in Supplemental Tables S1 and S2 . Systems biology considerations, mathematical model, and solution of PSAA . PSAA is built upon the PINN framework that approximates the sample-wise activity level of MHC-I and MHC-II pathways using transcriptomics or proteomics data. PSAA first hypothesizes that the rate of each reaction of the MHC-I and MHC-II pathway could be inferred from the transcriptomics or proteomics changes of the involved genes via unknown non-linear functions. To train these functions, we first analyzed the systems properties of the MHC pathways. All reactions in the MHC-I and MHC-II pathways are large molecule processing reactions, in which proteins, peptides, MHC class I or II complexes, and their binding forms are substrates or products of each reaction step. The whole pathway satisfies the law of conservation of mass under steady or quasi-steady state, known as the flux balance condition, which has been widely utilized in metabolic flux analysis 25 – 27 . Thus, PSAA further assumes that the predicted activities of each reaction in the MHC-I and MHC-II pathway should satisfy a quasi-flux balance condition. PSAA approximates reaction rates using non-time course omics data, which differs from traditional PINN models. To enable a robust and explainable prediction, we developed a new learning paradigm —constrained learning —and a new optimization algorithm, as detailed below and in Methods . Given an omics data set $\:D$ and a pathway of $\:m$ reactions, denote the flux rate of the $\:ith$ reaction ( $\:i=1,\dots\:,m$ ) in the MHC-I or -II pathways in the $\:jth$ sample in $\:D$ as $\:{F}_{i,j}$ . PSAA identifies functions $\:{\mathcal{F}}_{i}$ to estimate $\:{F}_{i,j}$ by $\:{F}_{i,j}={\mathcal{F}}_{i}({D}_{,j},{{\Theta\:}}^{i})$ and minimizes a loss function $\:L={L}_{FB}\left({\widehat{F}}_{,j}^{}\right)+{L}_{p}({\mathcal{F}}_{i},{{\Theta\:}}_{i}|i=1,\dots\:,m)$ , where $\:{\widehat{F}}_{,j}^{}\triangleq\:{\{\widehat{F}}_{i,j}^{}|i=1,\dots\:,m\}$ denotes the predicted reaction rates in sample $\:j$ , $\:{{\Theta\:}}_{i}$ denotes parameters of $\:{\mathcal{F}}_{i}$ , $\:{L}_{FB}$ is a quadratic loss term that regularizes the imbalance of predicted reaction rate, and $\:{L}_{p}$ denotes an aggregated computational loss term to avoid trivial solutions and to conduct variable selection. Recognizing that the PSAA framework is neither supervised nor unsupervised learning, we developed a new optimization approach called MPSL (Message Passing enhanced Supervised Learning) to enable a robust and efficient solution of $\:{\mathcal{F}}_{i}$ and $\:{{\Theta\:}}^{i}$ . As illustrated in Fig. 1 a, a two-step optimization approach is introduced to iteratively (1) Message Passing Optimization (MPO) step: Computing $\:{\widehat{F}}_{,j}^{t+1}$ that minimizes $\:{L}_{FB}\left({\widehat{F}}_{,j}^{t+1}\right)$ by searching through $\:{\widehat{F}}_{i,j}^{t+1}$ within a certain distance to $\:{\mathcal{F}}_{i}\left({D}_{,j},{{\Theta\:}}_{i}^{t}\right)$ and (2) Supervised Learning (SL) step: Updating $\:{{\Theta\:}}_{i}^{t+1}$ by conducting a supervised training of $\:{\mathcal{F}}_{i}\left({D}_{,j},{{\Theta\:}}_{i}^{t+1}\right)$ to estimate $\:{\widehat{F}}_{i,j}^{t+1}$ , where $\:t$ and $\:t+1$ denote successive rounds of iterations. A key challenge of PINN is to balance fit to both physical models and data when using neural networks to approximate the non-linear dependency. The MPSL algorithm addresses this challenge by splitting the fitting process into two iterative steps: fitting the physical model and fitting the data. This makes the data-fitting step a classic supervised learning problem that can be implemented with additional regularization terms for variable selection or better fitting robustness. The MPSL algorithm was validated on an extensive set of simulated data (see details in Supplementary Methods ). The detailed mathematical formulation of the MPSL optimization algorithm is given in Methods . Downstream functionalities of PSAA We also developed a series of downstream functionalities of PSAA. The outputs of PSAA, i.e., sample-wise activity levels for each step in the MHC-I and MHC-II pathways can be directly compared across different contexts. Because $\:{\mathcal{F}}_{i}\left({D}_{,j},{{\Theta\:}}^{i}\right)$ directly utilizes neural networks to approximate sample-wise reaction rate, its partial derivative $\:\frac{\partial\:{\mathcal{F}}_{i}}{\partial\:D}$ could be computed to evaluate the impact of each gene in each sample. In-silico perturbation analysis can be conducted to identify genes that, when perturbed, may increase or decrease antigen presentation levels. In spatial transcriptomics data, the predicted MHC-I and MHC-II activity level can be directly utilized to characterize the interaction between APCs and T cells, enabling an APC-T cell interaction-based spatial dissection. Additionally, the coherence between the MPO and SL steps can be used to evaluate the goodness of fitting of the data to the systems biology assumption and pathway structure. This coherence can be further used to infer the pathway structure variation. We further validated PSAA and its downstream functions using public data and experimental approaches. The MPSL optimization was validated using a set of synthetic data-based experiments. We also demonstrated the application of PSAA and its functions for different antigen-presentation disease scenarios including cancer, bacterial infection, and neurodegeneration. PSAA accurately quantifies antigen presentation activity using transcriptomics or proteomics data. To benchmark PSAA, we first tested the Pearson correlation coefficients (PCC) between the predicted antigen presentation level and the cell surface protein of MHC-I molecules in three independent CITE-seq and TEA-seq data sets (CITE-seq of B-cell malignancies: GSE249542, CITE-seq of melanoma: SCP1064 28 , and TEA-seq of T cells: GSE200417 29 ). Both CITE-seq and TEA-seq offer simultaneous measurement of gene expression and protein levels. We found that PSAA-predicted antigen presentation levels were significantly associated with measured cell surface MHC I molecules in all three datasets: GSE200417, PCC = 0.363 (p-value < 2.2e-16); GSE249542, PCC = 0.527 (p-value < 2.2e-16), and SCP0164, PCC = 0.356 (p-value < 2.2e-16), as shown in Fig. 2 a (first row). Noted, these correlations were much higher than those that use averaged HLA class I or II gene expressions to predict antigen presentation activity Fig. 2 a (second row). Our results demonstrate that PSAA could accurately predict cell surface MHC complex presentation levels. To evaluate the robustness of PSAA, we applied PSAA to the CCLE and TCGA matched bulk RNA-seq and proteomics data sets, respectively, to test the consistency of the predictions made from transcriptomics and proteomics data. We identified a significant consistency between the predictions made from the two data sources for both MHC-I (PCC = 0.43, p-value = 4.1e-06) and MHC-II (PCC = 0.23, p-value = 0.018) antigen presentation pathways in the TCGA data (Fig. 2 b). In CCLE data, we identified a significant consistency of the MHC-I (PCC = 0.37, p-value = 1.3e-13) pathway (Fig. 2 b) and MHC-II pathway (PCC = 0.12, p-value = 0.02) ( Supplementary Fig. 1a ). Noted, a weaker correlation of the predicted MHC-II activity in the CCLE data is expected, since the CCLE data set is derived from pure cancer cells that normally do not present MHC-II molecules. Overall, our observation demonstrated that PSAA could robustly predict MHC-I and MHC-II antigen presentation levels using either transcriptomics or proteomics data. To assess PSAA’s robustness, we conducted an ablation study by removing the “T cell recognition” module at the end of the antigen presentation pathway and compared with results obtained from the original pathway (Fig. 2 c and Supplementary Fig. 1b). While including the “T cell recognition” module intuitively enhances the completeness of the pathway, and could yield more accurate predictions, we observed significant consistency between the predicted flux of each step when using a network with and without the T cell module (PCC = 0.635 in PLC, PCC = 0.728 in proteasome, and PCC = 0.741 in MHC-I). This indicates that PSAA can accurately quantify antigen presentation levels even without considering the “T cell recognition” module. Notably, this analysis demonstrated that PSAA could robustly estimate antigen presentation without considering neighboring T cell infiltration levels, making it suitable for applications in single-cell or spatial transcriptomics data, where individual samples (single cell or spatial spot) may not contain information about neighboring T cells. To further analyze PSAA’s robustness to data sparsity, missing data, and overfitting, we conducted a systematic evaluation on CITE-seq data of peripheral blood mononuclear cells (PBMCs) (GSE249542). We first analyzed the impact of potential dropout events in scRNA-seq or SRT data by simulating different levels of dropout in the input scRNA-seq data. Our results suggested that PCC was highly robust to dropouts ( Supplementary Fig. 1c ). Similarly, we evaluated the robustness of PCC to missing genes ( Supplementary Fig. 1c ). We also conducted statistical analysis of the necessary input sample size for PSAA, evaluated overfitting by permutation test, and conducted robustness tests of PSAA (see details in Supplementary Methods ). Using simulated data, MPSL always projected a high-dimensional flux vector to the closest point in the solution space of flux balance (see Supplementary Methods ). Figure 3. Application of PSAA and downstream functions on matched single cell and spatial transcriptomics data obtained from Haemophilus ducreyi infected skin and uninfected (wounded) skin. (a) tSNE of the distribution of scRNA-seq data and cell types. (b) Distribution of PSAA predicted MHC-II antigen presentation level in each single cell over the tSNE of the scRNA-seq data. (c) Distribution of PSAA predicted cell surface MHC-II antigen presentation level (y-axis) in each cell type (x-axis) on scRNA-seq data. Cell types are colored as (a) and (d). (d) Proportion of each cell type in the pustules and wounds. (e) tSNE plot of APCs and the three APC cell clusters derived using the first-order partial derivative of each gene in the MHC-II pathway. (f) Proportion of APC cell types in the three clusters (left) and PSAA predicted MHC-II whole pathway activity level of each cell type in the three clusters (right). (g) T cell level (left) and PSAA predicted MHC-II whole pathway activity level (right) in the spatial spots of the SRT data. (h) Varied dependency between T cell level and MHC-II antigen presentation level in the SRT data of pustules (blue) vs wounded skin (orange) in patient sample 1 and 2. (i) Distribution of T cells and predicted MHC-II antigen presentation level on the spatial slides. PSAA accurately estimates MHC II antigen presentation activity and APC-T cell interactions in a bacterial infection model of human volunteers. To further validate PSAA and demonstrate its application in studying the variation of MHC-II pathway in antigen-presenting cells (APCs), we generated a matched scRNA-seq and SRT data sets derived from skin biopsies of four human volunteers. The volunteers provided paired biopsies of skin sites that were wounded (uninfected controls) or inoculated via puncture wounds with Haemophilus ducreyi until a pustule developed at that site 6–8 days later (see details in Supplementary Methods ). In the scRNA-seq data, there were four types of APCs (macrophages, pDCs, mPCs, and B cells), two types of skin cells (keratinocytes and melanocytes), three types of stromal cells (endothelial cells, fibroblast cells, and smooth muscle cells), three other immune cell types (ISG-expressing cells, T cells and mast cells), and one unknown cell group (Fig. 3a). PSAA was applied to approximate cell surface MHC-II antigen presentation activity in each cell except for those in the unknown cluster (Fig. 3b). PSAA identified that the APCs have the highest level of MHC-II antigen presentation activity, followed by endothelial cells and melanocytes, which have an intermediate level of MHC-II antigen presentation activity, and the other immune cell types that have relatively low antigen presentation activity (Fig. 3c). We also found that the pDCs in pustules have a higher level of MHC-II presentation compared to wounds (p = 4e-12) while the other cell types do not have a significant difference (Fig. 3c). In addition, we see a consistent increase of cell proportions of APCs in pustules vs wounds (Fig. 3d). The predicted lack of antigen presentation by pDCs and decreased population of macrophages in pustules may reflect the failure of the immune response to clear infection and inability to detect H. ducreyi -specific antibodies after infection 30 . The increased MHC-II antigen presentation in pustules compared to wounds likely reflects the presence of exogenous antigen and consequent influx of immune cells to the site of infection in pustules; the immune influx is largely absent in wounds. We also applied the in-silico perturbation analysis to study which genes contribute most to MHC-II antigen presentation pathways in APCs. The first-order partial derivative of each gene in the PSAA model was computed for each sample, which evaluates the impact of the gene to the pathway activity level if its expression level is disrupted. The partial derivatives of each gene in each reaction step were analyzed to identify if the genes contribute to MHC-II antigen presentation differently (Fig. 3e, see details in Methods ). Cell clustering analysis using the partial derivatives identified three APC subgroups that potentially have different MHC-II antigen presentation mechanisms: namely, APC-G1 , represented by mDCs in both infected and wounded sites; APC-G2 , consisting of APC cells specifically in infected sites; and APC-G3 , represented by macrophages in both infected and wounded sites (Fig. 3f). Perturbation analysis suggested that the whole activity level of the MHC-II pathway in APC-G1 mostly determined by HLA-DR , HLA-DP , and HLA-DQ . MHC II in APC-G2 is mostly determined by CD74 , HLA-DM , and HLA-DR , and the C1 peptidases CTSZ and CTSC , and RAB14 , and MHC-II in APC-G3 specifically depends on the C1 peptidases CTSZ , CTSB , CTSS , CTSD , CTSH , and CTSL ( Supplementary Table S3 ). Only the cells in the APC-G1 group had a significantly increased antigen presentation level in pustules versus wounds (Fig. 3f). Based on differential gene expression and perturbation analysis, the varied expression in the MHC II genes HLA-DR , HLA-DP , and HLA-DQ drove the differences between cells in the infected and wounded sites. By PSAA to the SRT data, we observed again that pustules have significantly increased MHC-II antigen presentation activity and CD4 + helper T cells than wounded tissues (Fig. 3g). Spatial dependent regression analysis revealed that the CD4 + helper T cells level significantly depends on MHC-II activity in both conditions (Fig. 3h and Supplementary Fig. 2 ), and donors showed a higher level of activation dependency (larger slope) between MHC-II antigen presentation and CD4 + T cells in pustules than in wounded skin 31 . The strong linear dependency suggests a strong colocalization of APCs with CD4 + T cells in both conditions while the varied dependencies suggest a stronger CD4 + T cell activation and increased adaptive immune response in pustules than in wounded skin (Fig. 3h). In pustules, MHC-II antigen presentation was more diffused throughout the skin; in wounds, high MHC-II activity spots were enriched in the surface regions (epidermis) (Fig. 3i). A similar pattern was observed for CD4 + helper T cells (Fig. 3i). Our analysis demonstrates the potential utility of PSAA in the integrative analysis of scRNA-seq and SRT data. This approach facilitates the assessment of MHC-II activity and further subtyping of APCs, as well as the interpretation of biological functional variations such as antigen presentation mechanisms, spatial distribution of MHC-II antigen presentation, and spatial dependent MHC-II – T cell interactions. PSAA captured MHC-I antigen presentation level in cancer tumor microenvironment and identified potential targets to improve CD8 + T cell’s recognition and cytotoxicity. Immunotherapy has shown remarkable efficacy in treating multiple cancer types 4 – 6 . However, mechanisms underlying non-responsiveness, particularly in solid tumors, remain poorly understood 32 – 36 . In the context of CD8 + cytotoxic T lymphocytes (CTL) – mediated immune responses, recognition of tumor associated antigen occurs through its presentation via MHC-I molecule on tumor cells and their interaction with T cell receptor (TCR) on the CD8 + T cells 7 – 9 . Impairing this event will ultimately reduce or prevent CD8 + T cell mediated tumor cytotoxicity. However, reduction or loss of antigen presentation is a frequent mechanism used by tumor cells to escape immune recognition and destruction 1 – 3 . Little is known regarding how and what variations of gene expression or key biological steps in the MHC-I pathway affect the level of antigen presentation on the surface of cancer cells and their downstream recognition by T cells. Therefore, we applied PSAA to predict how MHC-I antigen presentation affects the immune responses in cancer. We first validated the application of PSAA on TCGA RNA-seq data from nine cancer types. The T cell level could be well explained by the activity level of MHC-I antigen presentation and its recycling (Fig. 4 a). However, the T cell level alone poorly recapitulates MHC-I activity (Fig. 4 b). Further analysis suggested that T cell activation via MHC-I is increased in the cancer TME compared to the adjacent normal tissues although both cancer and normal tissues have a similar amount of the MHC-I presentation ( Supplementary Fig. 3a ). Our analysis revealed that the majority of the presented MHC-I tend to be recycled in the TME. This observation is consistent with the presentation and salvage of antigen presentation on cancer cells being a key factor for T cell recognition 37 . We further applied PSAA to five SRT datasets of cancer TME. Using the PSAA predicted MHC-I level and expression level of CD8 + cytotoxic T cell markers, we identified spatial regions of high-/low-MHC-I level and high-/low- CTL cell level (See details in Methods ). Strong consistencies between MHC-I presentation and CTL infiltration, evaluated by Moran’s I correlation, was identified in the TME of different cancer types ( Supplementary Table S4 ). In addition, we identified a substantial number of distinct spatial regions of high MHC-I and low CTL infiltration (Fig. 4 c and Supplementary Fig. 3b ). However, we rarely observed spatial regions of low MHC-I and high CTL infiltration ( Supplementary Fig. 3b ). All the identified spatial spots of low MHC-I and high CTL infiltration are on the boundary of the high MHC-I and high CTL regions. Moreover, the numbers of spatial spots of high MHC-I and low CTL infiltration are consistently higher than the spots of low MHC-I and high CTL infiltration in all analyzed data ( Supplementary Table S4 ). Our observations suggested that there were additional factors in these regions, such as stromal variations or metabolic shifts, inhibited the infiltration of CTLs. To identify the biological functions that are associated with low T cell infiltration, we utilized a generalized linear model to identify the genes that are consistently and specifically expressed in the regions of high MHC-I and low CTL versus the other regions in the analyzed data and downstream pathway enrichment analysis (see Methods ). We observed upregulated CTL-related pathways in high MHC-I and high CTL vs high MHC-I and low CTL regions, upregulated MHC-I genes in high MHC-I and low CTL vs low MHC-I and high CTL regions, and upregulated general adaptive immune responses in high MHC-I and high CTL vs low MHC-I and low CTL regions (Fig. 4 d). We also identified new stromal, TME, and metabolic changes that may be related to low CTL-infiltration in high MHC-I regions. Upregulated ECM formation, coagulation cascades, myeloid cells including monocytes and granulocytes, and chemokine receptors were seen in high MHC-I and high CTL vs high MHC-I and low CTL regions. In addition, increased metabolic activities including glycolysis, TCA cycle, oxidative phosphorylation, electron transport chain (ETC), glycan and steroid synthesis were seen in (1) high MHC-I and low CTL vs low MHC-I and high CTL regions and (2) high MHC-I and high CTL vs low MHC-I and low CTL regions, suggesting possible roles of metabolic activity related to MHC-I presentation (Fig. 4 d). We also observed down regulation of the TGBF-beta, JAK-Stat, glucose transport, glutathione metabolism, and histone deacetylase III pathways in high MHC-I and high CTL vs high MHC-I and low CTL regions. To demonstrate the clinical implication of PSAA, we applied the method to the bulk RNA-seq data of a melanoma data set (GSE91061) collected from patients under anti-PD1 therapy using Nivolumab 38 . In total, we obtained 105 samples from this data set, including 48 partial response (PR), 34 stable disease (SD), and 23 progressive disease (PD) patients. We applied PSAA to compute sample-wise activity level of the eight steps in the MHC-I pathway. Biologically, we expect the higher level of antigen presentation activity is associated with better response. We adopted multi-variate logistic regression with a L1-penalty to identify the top variables and best model in predicting patients’ response to the ant-PD1 therapy. Considering that the quality of cancer associated antigen also determines CTL recognition, we included predicted microsatellite stability (MSS) or microsatellite instability (MSI) status as a confounding factor (see details in Supplementary Methods ). In addition, we also introduced total T cell level and cytotoxic CD8 + T cell level predicted by deconvolution analysis and the on-/off-treatment status provided in the data as additional factors. The final selected model is: $$\:Responsiveness\:\sim\:logistic\:(3.02\bullet\:Peptide\:loading+\:1.12\bullet\:MSI\:status\:-1.55)$$ , where Responsiveness is a binary variable that takes values in “responder” and “non-responder”, and the p -value of the peptide loading and MSI status are 6.4e-4 and 0.02, respectively, suggesting the activity of the peptide loading step has a higher power predicting the outcome of immuno-therapy than MSI status and T cell abundance. We further checked how the predicted rate of peptide loading varies with respect to responsiveness, MSI status, and treatment status. We observed that the MHC-I antigen presentation level in the PR group is consistently higher than the SD and PD group, and the SD group is also higher than the PD group, in both MSS and MSI, and on-treatment/pre-treatment patient groups. Limited by sample size, a significant difference of the MHC-I antigen presentation level is only observed between the PR and SD groups vs the PD group in the on-treatment patient of MSS (Fig. 4 e). Specifically, the MSS of PR and SD on treatment patients have a significant increase of predicted rate of peptide loading compared to (1) the MSS of PR patients on-treatment patients (p = 0.0026) and (2) the MSI of all PD and SD on-treatment patients (p = 0.022). Our observation suggests that PD-1 inhibitor has a better efficacy in the melanoma patients of low mutation load or low cancer-associated antigen quality if the cancer cells have a higher level of MHC-I presentation. Complete predicted activity level of MHC-I reactions and clinical information of each sample are given in Supplementary Table S5 . To validate our observations, we also analyzed three additional datasets collected from melanoma (GSE115821 39 ), lung cancer (GSE126043 40 ), and cutaneous T cell lymphoma (GSE162137 41 ) patients treated by PD-1 inhibitor. Increased MHC-I antigen presentation levels were detected in the responders compared to non-responding patients in all the datasets (Fig. 4 f). We further utilized the in-silico perturbation function of PSAA to identify the genes that could be targeted to improve the presentation level of MHC-I complex on the surface of a cell. Previous studies reported that MAL2 in the salvage pathway of MHC-I is a critical negative regulator of MHC-I presentation 37 . A leave-one-out statistical test evaluates the significance of change of the predicted flux balance when including or excluding MAL2 in the recycling step of the MHC-I pathway (see details in Methods and Supplementary Table S6 ). Application of the test on TCGA data revealed that MAL2 is significantly involved in the recycling step of the MHC-I pathway in seven cancer types. To experimentally validate the PSAA predictions, we used in-silico perturbation analysis to predict genes that consistently negatively impact MHC-I antigen presentation using pan-cancer data from TCGA. Specifically, we added each gene into the recycling module to train the PSAA model and rank the genes according to their PSAA-predicted cell surface MHC-I level when perturbing their gene expression (Fig. 4 g). To further validate the PSAA analysis, we knocked down the top predicted genes in mouse breast cancer cells and evaluated the antigen presentation level and T cell killing between knockdown vs control for the top 11 predicted genes. We observed that knocking down each of the predicted genes significantly increased MHC-I antigen presentation and T cell killing effect (range: 22–148%) (Fig. 4 h). See experimental details in Methods . PSAA identified unaligned MHC-II antigen presentation steps in microglia and other cell types in Alzheimer’s disease brain. Although the brain was initially considered as an immune-privileged site where antigen presentation would not occur, both microglia and astrocytes present antigens via MHC class II molecules to activate CD4 + T cells and stimulate immune responses in the central nervous system (CNS) 42 – 44 . However, the role of the immune system and the antigen presentation process is not well understood in neurodegenerative diseases, such as Alzheimer’s disease and Parkinson’s disease 45 , 46 . To understand the cell type-specific MHC-II antigen presentation status in the AD brain, we applied PSAA on the ROSMAP AD scRNA-seq data to evaluate the MHC class II antigen presentation levels in different cell types. Cell type annotation was provided for the 172,659 cells, including seven cell types: microglia (Mic), astrocytes (Ast), endothelial cells (End), excitatory neurons (Exc), inhibitory neurons (Inh), oligodendrocytes (Oli), and oligodendrocyte precursor cells (OPC). For a better visualization, we randomly sample 500 cells of high total UMI from each cell type in the AD samples (Fig. 5 a) and applied PSAA to estimate the MHC class II antigen presentation level in each cell (Fig. 5 b, 5 c). PSAA identified that microglia have the highest activities of the reactions involved in the 'Processing of internalized proteins in endosomal/lysosomal' and 'Biosynthesis of MHC-II complex' (Fig. 5 d). However, microglia and astrocytes exhibit lower levels of further processing modules of MHC class II molecules compared to Exc and Inh (Fig. 5 d). Compared to the antigen presentation cells in infectious disease and the TME of cancer, microglia in the AD brain have a much higher inconsistency between the MHC-II complex biosynthesis step and its processing and presentation onto cell surface. Thus, we hypothesize that even though microglia can express and produce MHC class II molecules, they still have a low rate of MHC-II complex presentation on the cell surface. To further test this hypothesis, we adjusted the weight for the imbalance loss of each reaction step when applying the optimization algorithm, MPSL. Modulating these weights facilitates an extensive search for the potential solution space of the functions that could enhance the flux balance across the entire pathway. We observed that the iterations between the message passing optimization and supervised learning steps failed to yield a consistent distribution of the flux when the algorithm reached convergence (burns-in) across all tested hyperparameters (Fig. 5 e, 5 f). We also examined how the flux distribution responded to the changes of the hyperparameters. A higher level of MHC-II activity in microglia was predicted by the message passing step when using a higher weight of the flux balance loss for the "Biosynthesis of MHC-II complex" reaction (Fig. 5 f). However, the flux predicted by the supervised learning step did not align with the message-passing-derived flux. This inconsistency suggests that there is no function that could be identified by PSAA to map the ROSMAP AD brain scRNA-seq data onto the flux balance solution space of the MHC-II pathway, implying that brain cells do not adhere to the classic MHC-II pathway characteristic of immune cells. Our analysis indicates the potential existence of an alternative mechanism for peptide processing and loading in the MHC-II pathways, or a mismatch between peptide processing and MHC-II complex expression in the microglia in AD brains. Discussion Systems biology characterizes the motion of molecules within a complex biological process as differential equation-based dynamic systems 47 , 48 . Recently, physics-informed neural networks (PINNs) have emerged as a powerful approach in this field, integrating neural network architectures with physical laws to model complex biological systems. A reliable systems biology model provides explicit quantification and interpretations of the system 48 , 49 , and enables simulation and perturbation analysis to study the impact of each biological feature and their interactive effects in the system 48 , 50 , 51 . Despite a plethora of knowledge on the differential equation-based systems biology model, dynamic models are difficult to establish within specific biological contexts, especially when only static data is available. Although a challenge, the large amount of biological omics data has the potential to characterize complex biological systems. Here we presented the PSAA framework that approximates the activity level and dynamics of MHC-I and MHC-II pathways by using multi-omics data. Compared to conventional systems biology and data-driven computational biology approaches, PSAA bridges omics data with explicit systems biology models by integrating neural network frameworks. Crucially, we demonstrate that for systems with steady-state equilibrium—such as metabolic pathways and macromolecule processing—dynamic behaviors can be inferred from non-time course omics data, expanding the applicability of PSAA to a broader range of biological data contexts. PSAA demonstrated that reaction rates within the system having equilibrium steady states can be approximated by properly designed neural networks, which forms a new type of physics informed neural network. We call this type of analysis PINN-empowered and data-driven systems biology and the underlying learning paradigm as constrained learning. Constrained learning is defined by (1) approximating the non-linear dependency between the dynamics of biological reactions and observed omics data by AI-based non-linear solvers such neural networks and (2) constraining the solution space (functional space) of the non-linear solver by the dynamic properties of the systems, i.e, a type of PINN. We provided a mathematical formulation of the general constrained learning problem (see details in Supplementary Methods ). PSAA illustrates a newly defined machine learning paradigm and PINN architecture, which falls outside of traditional supervised and unsupervised learning. We refer to this learning paradigm as constrained learning for data-driven systems biology. This approach is characterized by identifying the mathematical model to quantify the reaction rates within a given omics data set, while enforcing coherence between the mathematical property of the model and the systems biological property of the reactions being studied. Empowered by this idea, the PSAA framework provides the following unmet capabilities: (1) quantify sample-wise activity level of the whole process and individual biological steps of MHC-I and MHC-II pathways using bulk, single-cell, or spatially resolved transcriptomics or proteomics data; (2) compute the dependency of MHC-I antigen presentation with T cell infiltration and activity level, and other biological processes or clinical features; (3) dissection of spatial regions of varied T cell infiltration and antigen presentation levels in SRT data; (4) assess the impact of the expression change of each gene on the MHC-I and MHC-II pathways in each sample; (5) prediction of possible drugs to perturb the level of MHC-I and MHC-II antigen presentation; and (6) inference of the goodness of fit of the data to prior assumed biological pathway structure. Selected analyses including prediction accuracy and targets predicted by in-silico perturbations have been validated on our in-house generated and public domain data sets. Figure 3h, 4 a and Supplementary Fig. 3a suggested that different samples may have varied levels of T cell activation. It is noteworthy that PSAA only models the flux rate of MHC-I/II pathway in a cell or tissue sample. T cell activation depends on the interaction of a specific TCR with the peptide-MHC complex and engagement with other co-stimulatory molecules. However, the TCR activation reaction, which is a signaling procedure, does not satisfy the condition of steady-state equilibrium. Thus, PSAA cannot effectively handle variations in TCR differences. In each training of the PSAA model, it is assumed that all samples share the same rate of T cell recognition and activation. One future direction is to extend the current PSAA by including parameters of TCR recognition and T cell activation to better characterize the interactions between APCs and T cells. Methods Public Data used in the analysis. Reaction information of the genes involved in each step of MHC-I and II pathways were collected from KEGG, Reactome, GO databases and literature data. Detailed information of the pathway reconstruction was given in Supplementary Methods . We collected three CITE-seq data sets, GSE249542, GSE200417 29 , and SCP1064 28 , from the GEO database to validate the predication accuracy of PSAA. Bulk RNA-seq and proteomics data of cancer cell lines from CCLE and cancer tissue data from TCGA were utilized to validate the PSAA predictions. RPKM normalized RNA-seq data and normalized proteomics data were downloaded from cBioPortal. Cancer types were selected based on the availability of normal samples in TCGA. We retrieved five spatial transcriptomics slides of cancer tissue from the 10x Genomics website and GSE206522 to validate the application of PSAA to cancer data. Four transcriptomics data of patient samples collected from clinical trials of anti-PD1/CTLA-4 therapies, GSE91061 38 , GSE115821 39 , GSE126043 40 , and GSE162137 41 , were collected from the GEO database to demonstrate the clinical implications of PSAA predicted antigen presentation levels. ROSMAP scRNA-seq data of AD brain was retrieved from ROSMAP data portal. Detailed information of data retrieval, processing, and normalization were given in Supplementary Methods . Detailed experimental procedures of the scRNA-seq data of H. ducreyi infection from four human volunteers and matched spatial transcriptomics data, including sample collection, processing, library construction, and sequencing, were given in Supplementary Methods . The data was deposited to dbGaP (phs003754). Mathematical formulation of PSAA PSAA utilizes a directed factor graph base representation of the MHC-I or II pathway and a PINN model for reaction rate estimation. We first reconstruct the MHC-I or II pathway into a directed factor graph, in which each reaction $\:\left(R\right)$ is a variable, and each intermediate molecule such as peptides and MHC complex $\:\left(C\right)$ are factors. Denote $\:FG\left(C,R,\:{E}_{C\to\:R},{E}_{R\to\:C}\right)$ as the factor graph, where $\:C$ is the set of intermediate molecules, $\:R$ is the set of reactions, $\:{E}_{C\to\:R}$ and $\:{E}_{R\to\:C}$ are direct edges represent $\:R$ consumes or produces $\:C$ . Denote $\:{\mathcal{R}}_{in}^{{C}_{k}}=\left\{{R}_{m}\right|{(R}_{m}\to\:{C}_{k})\in\:{E}_{C\to\:R}\}$ and $\:{\mathcal{R}}_{out}^{{C}_{k}}=\left\{{R}_{m}\right|\left({C}_{k}\to\:{R}_{m}\right)\in\:{E}_{R\to\:C}\:\}$ as the sets of the reactions that produce $\:{C}_{k}$ or consume $\:{C}_{k}$ . For an omics data of $\:N$ samples, denote $\:{X}_{j}^{m}=\left\{{x}_{1,j}^{m},\dots\:,{x}_{{i}_{m},j}^{m}\right\}$ as the expression of the genes or proteins involved in the reaction $\:{R}_{m}$ and $\:{F}_{m,j}$ as the rate of $\:{R}_{m}$ in sample $\:j$ . We model $\:{F}_{m,j}=\mathcal{F}\left({X}_{j}^{m},\:{{\Theta\:}}_{m}\right)$ as a multi-layer neural network with the input $\:{X}_{j}^{m}$ , here $\:{{\Theta\:}}_{m}$ denotes the parameter of the neural network. $\:{{\Theta\:}}_{m}$ and cell-wise flux $\:{F}_{m,j}$ are solved by minimizing the loss: $$\:\:\:\:\:\:\:\:\:\:\:\:\:\:\:\:\:\:\:\:\:\:\:\:\:\:\:\:\:\sum\:_{j=1}^{N}\sum\:_{k=1}^{K}{{\gamma\:}_{k}\left({\sum\:}_{{m\in\:\mathcal{R}}_{in}^{{C}_{k}}}{F}_{m,j}-{\sum\:}_{{{m}^{{\prime\:}}\in\:\mathcal{R}}_{out}^{{C}_{k}}}{F}_{{m}^{{\prime\:}},j}\right)}^{2}+{L}_{p}\left(\mathcal{F},{\Theta\:}{\prime\:}\right)\:\:\:\:①\:\:\:\:\:\:\:\:\:\:\:\:\:\:$$ Here the first loss term regularizes the coherency to the flux balance condition of the biological system on the observed data. A quadratic loss was used to enable certain imbalances for the samples under quasi-steady state or non-steady state. Parameter $\:{\gamma\:}_{k}$ is introduced in PSAA to enable different weights for the flux balance of different steps. Noted, the whole network of antigen processing, presentation, and recognition is composed by three main branches, namely (1) peptide generation (ubiquitination and proteosome), (2) peptide loading, and (3) processing and exocytosis of MHC class I or II complexes, where only the step (2) is antigen presentation specific while the activity level of (1) and (3) could be regulated for other biological activities that the presentation of MHC complexes. Thus, $\:{\gamma\:}_{k}$ is introduced to leverage the influence of varied expression levels of different steps in antigen presentation. We demonstrated the proper value range of $\:{\gamma\:}_{k}$ can be identified by maximizing the best fitting of the data to the underlying systems biology model of the antigen presentation pathways. Detailed systems biology consideration is provided in Supplementary Methods . MPSL optimization algorithm The solution of $\:①$ is non-trivial because it needs to search over the functional space of $\:{\mathcal{F}}_{m}({X}_{j}^{m},{{\Theta\:}}^{m})$ and parameter space of $\:{{\Theta\:}}^{m}$ for each reaction $\:m$ . In PSAA, we utilize neural networks to approximate the non-linear dependency between omics data and reaction rate 27 . Noted, minimization of the loss term $\:①$ neither falls into the paradigm of supervised learning nor unsupervised learning. Instead, the loss term characterizes the biochemical dependencies among $\:\mathcal{F}$ . Furthermore, the sparse learning penalty term $\:{L}_{p}\left(\mathcal{F},{\Theta\:}{\prime\:}\right)\:$ cannot be directly co-optimized with the flux balance term by using existing methods. Thus, we developed a new optimization algorithm for $\:①$ , namely Message Passing and Supervised Learning (MPSL). Specifically, considering the flux balance term as additional constraints of the solution space of $\:{\mathcal{F}}_{m}$ , we proposed an iterative approach to effectively search for the solution space. Because the flux balance constraint could be represented as factors over an directed factor graph, in which each function $\:\mathcal{F}$ is a variable and Information balance over a factor can be achieved by belief propagation 52 . Based on this idea, we develop a three-step optimizer to minimize $\:①$ : Step 1 (Initialization) : Classic gradient decent is first utilized to generate an initial solution $\:\mathcal{F}({D}_{0},{{\Theta\:}}_{0}^{{\prime\:}})$ . Step 2 (Message Passing Optimization, MPO) : for any given $\:\mathcal{F}({D}_{0},{{\Theta\:}}_{0}^{{\prime\:}})$ with fixed $\:{D}_{0}$ and $\:{{\Theta\:}}_{0}^{{\prime\:}}$ , the optimization strategy adopts the idea of belief propagation (BP) to identify a solution $\:\mathcal{F}\mathcal{{\prime\:}}$ , which is on the solution space of $\:{\Phi\:}\left(\mathcal{R}\left({\Theta\:}\right)\right)$ and close to $\:\mathcal{F}({D}_{0},{{\Theta\:}}_{0}^{{\prime\:}})$ . Noted, here BP only searches $\:\mathcal{F}\mathcal{{\prime\:}}$ only based on the flux rates $\:\mathcal{F}({D}_{0},{{\Theta\:}}_{0})$ and does not rely on $\:{D}_{0}$ . $\:\mathcal{F}\mathcal{{\prime\:}}$ can be considered as projecting the values of $\:\mathcal{F}({D}_{0},{{\Theta\:}}_{0})$ to the solution space that minimizes the flux balance term. Step 3 (Supervised Learning) : Then $\:\mathcal{F}\mathcal{{\prime\:}}$ could serve as known labels of flux rate of each reaction. $\:\mathcal{F}$ and $\:{{\Theta\:}}^{{\prime\:}}$ could be further updated by a supervised fitting of $\:\mathcal{F}({D}_{0},{{\Theta\:}}^{{\prime\:}})$ to $\:\mathcal{F}\mathcal{{\prime\:}}$ for all data points $\:{D}_{0}$ . Under default setting, $\:\mathcal{F}\left({D}_{0},{{\Theta\:}}^{{\prime\:}}\right)$ will be trained as a fully connected deep neural network to predict $\:\mathcal{F}\mathcal{{\prime\:}}$ and $\:{L}_{1}$ penalty will be used for $\:L(\mathcal{F},{\Theta\:}{\prime\:})$ . Step 2 and 3 could be iteratively conducted to iteratively optimize $\:\mathcal{F}\mathcal{{\prime\:}}$ and $\:\{\mathcal{F},{{\Theta\:}}^{{\prime\:}}\}$ . It is noteworthy that BP can effectively handle linear constraints such as flux balance under quasi-steady state. By using this approach, the two terms could be iteratively and effectively handled, and the $\:{L}_{p}\left(\mathcal{F},{\Theta\:}{\prime\:}\right)\:$ can be directly minimized in the supervised learning form in Step 3. In-silico perturbation analysis to evaluate the impact of each gene To study which gene contributes most to MHC class II antigen presentation pathways in APCs, we conduct an in-silico perturbation analysis. The first order partial derivative of each gene in the PSAA model was computed for each sample. We first got the absolute value of the first order partial derivative and then used log transformation to make sure that all the gradients are positive values. After that, we performed principal component analysis (PCA) in a gradient matrix and selected the top 6 PCs to represent the importance of each gene. Then we used k-means clustering to cluster the importance matrix into three clusters as elaborated in the main text. Finally, we ranked genes based on their importance within each cluster and identified the top 10 genes in each cluster for further analysis. Identification of gene targets to improve the antigen presentation level To identify the genes that could be targeted to improve the presentation level of MHC-I complex on the surface of a cell, we evaluate if adding a certain gene into the “MHC-I complex salvage” module could significantly increase the balance of the fitting. Specifically, we performed a paired Wald test using TCGA data of TNBC and other cancer types. We define the flux imbalance loss with target gene as $\:{L}_{X}=({l}_{1}^{X},{l}_{2}^{X}\dots\:{l}_{n}^{X})$ and the flux imbalance loss without target gene as $\:{L}_{Y}=({l}_{1}^{Y},{l}_{2}^{Y}\dots\:{l}_{n}^{Y})$ . The null hypothesis test is $\:{H}_{0}$ : $\:{\mu\:}_{X}-{\mu\:}_{Y}=0$ . The alternative hypothesis is $\:{H}_{a}$ : $\:{\mu\:}_{X}-{\mu\:}_{Y}<0$ . We tested a list of target genes and reported their p-values in Supplementary Table S6 . The CRISPR knock experiment of the genes with most significant p-values were conducted on a TNBC cell line system as detailed below. Cell culture and Generation of stable knockdown cell lines of the CRISPR knockdown experiment The mouse breast cancer cell line with endogenous expression of ovalbumin, EO771-OVA, was maintained in DMEM medium supplemented with 10% fetal bovine serum and 1% penicillin/streptomycin at 37°C with 5% CO2. For the generation of stable shRNA knockdown cell lines, shRNA clone sets targeting different mouse genes were purchased from Sigma and transduced into EO771-OVA cells via lentivirus, followed by 2 µg/ml puromycin selection for 5 days. qPCR was used to examine the knockdown efficacy. Antigen presentation and cytotoxicity assay to validate the predicted genes Antigen presentation levels on EO771-OVA and knockdown cell lines were examined using APC-conjugated anti-mouse H-2Kb bound to SIINFEKL antibody (BioLegend, Dilution 1:50) and evaluated by flow cytometry. EO771-WT cells were used as isotype control for gating strategy. To assess the CD8 + T cell cytotoxicity against EO771-OVA cell lines with gene knockdown, CD8 + T cells were isolated from the splenocytes of OT-I mice and stimulated with CD3/CD28 dynabeads (Gibco™ #11131D) in the presence of 5 ng/mL IL-2 for 2 days. EO771-OVA cells were stained with IncuCyte Cytolight Rapid Red (Sartorius, #4705) and seeded in 96-well plate (5 x 10 3 cell/well). OT-1 T cells were co-cultured with EO771-OVA cells at effector/tumor-cell ratio of 3:1. The 16h time point was used for final readout. Signal amplification of spatial transcriptomics data. To reduce the discontinuities inherent caused by the low signal level, we employed a Gaussian smoothing filter to amplify the spatial dependent signals in SRT data, such as the predicted activity level of antigen presentations or T cell abundance. Specifically, the following smoothing model was used to amplify spatial dependent signals: $$\:{f}_{i}^{amp}=\sum\:_{j\in\:N\left(i\right)}g\left(d\left(i,j\right)\right)*{f}_{i}$$ , where $\:g\left(x\right)=\frac{1}{\sigma\:\surd\:2\pi\:}{e}^{-\frac{{\left(x-\mu\:\right)}^{2}}{2{\sigma\:}^{2}}}$ is a gaussian kernel and $\:{f}_{i}$ represents the PSAA predicted antigen presentation level or original T cell expression level of spot $\:i$ . $\:N\left(i\right)$ is the neighborhood of $\:i$ defined as the spatial spots whose distance to $\:i$ are smaller than a certain threshold, and $\:d(i,j)$ is the Euclidean distance between two spatial spots $\:i$ and $\:j$ , and $\:{f}_{i}^{amp}$ is the amplified signal of spot $\:i$ . Identify spatial regions of varied dependency between antigen presentation and T cell infiltration. To evaluate the spatial dependency between antigen presentation and T cell infiltrations and identify the spatial regions show varied dependencies, we computed local bivariate Moran’s I correlation 53 for each spatial spot $\:i$ : $$\:{I}_{i}=\frac{{\sum\:}_{j}{w}_{ij}{y}_{j}\times\:{x}_{i}}{{\sum\:}_{i}{x}_{i}^{2}}$$ , where $\:{x}_{i}$ represents the normalized antigen presentation level after amplification at spot $\:i$ , $\:{y}_{j}$ represents the normalized T cell level after amplification at spot $\:j$ , and $\:{w}_{ij}$ is a weight indexing location of spot $\:i$ relative to $\:j$ . In this study, we used the gaussian kernel distance to compute $\:{w}_{ij}$ . For each spot $\:i$ , $\:{I}_{i}$ measures the correlation between its antigen presentation level ( $\:{x}_{i}$ ) with the level of T cell abundance ( $\:{y}_{i}$ ) in its neighborhood ( $\:{\sum\:}_{j\in\:N\left(i\right)}{w}_{ij}{y}_{j}$ ). The significance of $\:{I}_{i}$ was evaluated using a permutation test, where the $\:{y}_{}$ values were randomly permuted across all spots. A pseudo p-value was then calculated by determining the proportion of Local Moran's I values generated from permutations that are greater than or equal to the observed Local Moran's I values from original data. We set significant level $\:\alpha\:=0.05$ and the spots with pseudo p-value $\:p<0.05$ were considered statistically significant, indicating significant a spatial dependence between antigen presentation level and T cell infiltration level. Then we segment the regions of significant dependencies into four regions based on the level of antigen presentation and T cell signals – “High antigen, High T cell”, “High antigen, Low T cell”, “Low antigen, High T cell” and “Low antigen, Low T cell”. To conduct this segmentation, we first normalize the antigen presentation level of each spot by computing the z-score of $\:{x}_{i}$ , denoted as $\:{z(x}_{i})$ , and the T cell infiltration level of its neighbors by the weighted average z-score of $\:{y}_{j}$ , denoted as $\:{{w}_{ij}z(y}_{j})$ . The “High” or “Low” antigen and “High” or “Low” T cell regions were identified by positive or negative $\:{z(x}_{i})$ and positive or negative $\:{{w}_{ij}z(y}_{j})$ ), respectively. Declarations ACKNOWLEDGEMENTS AND FUNDING SUPPORTS The project was supported by NIGMS 1R35GM150971 (CZ), R35GM155028 (CS), NLM 1R01LM014720 (CZ), NSF DBI-2047631 (CZ), NSF-IIS- 2145314 (CS), American Cancer Society RSG-22-062-01-MM (CZ), RSG-24-1321371 (CS). The generation of the H. ducreyi data sets were supported by R01AI137116 from the National Institute of Allergy and Infectious Diseases to S.M.S. References Jhunjhunwala S, Hammer C, Delamarre L. Antigen presentation in cancer: insights into tumour immunogenicity and immune evasion. Nat Rev Cancer. 2021;21(5):298–312. Epub 20210309. doi: 10.1038/s41568-021-00339-z . PubMed PMID: 33750922. Dhatchinamoorthy K, Colbert JD, Rock KL. Cancer Immune Evasion Through Loss of MHC Class I Antigen Presentation. Front Immunol. 2021;12:636568. Epub 20210309. doi: 10.3389/fimmu.2021.636568. PubMed PMID: 33767702; PMCID: PMC7986854. Vinay DS, Ryan EP, Pawelec G, Talib WH, Stagg J, Elkord E, Lichtor T, Decker WK, Whelan RL, Kumara H, Signori E, Honoki K, Georgakilas AG, Amin A, Helferich WG, Boosani CS, Guha G, Ciriolo MR, Chen S, Mohammed SI, Azmi AS, Keith WN, Bilsland A, Bhakta D, Halicka D, Fujii H, Aquilano K, Ashraf SS, Nowsheen S, Yang X, Choi BK, Kwon BS. Immune evasion in cancer: Mechanistic basis and therapeutic strategies. Semin Cancer Biol. 2015;35 Suppl:S185-S98. Epub 20150325. doi: 10.1016/j.semcancer.2015.03.004 . PubMed PMID: 25818339. Mellman I, Coukos G, Dranoff G. Cancer immunotherapy comes of age. Nature. 2011;480(7378):480–9. Epub 20111221. doi: 10.1038/nature10673. PubMed PMID: 22193102; PMCID: PMC3967235. Schuster M, Nechansky A, Kircheis R. Cancer immunotherapy. Biotechnol J. 2006;1(2):138–47. doi: 10.1002/biot.200500044 . PubMed PMID: 16892244. Esfahani K, Roudaia L, Buhlaiga N, Del Rincon SV, Papneja N, Miller WH, Jr. A review of cancer immunotherapy: from the past, to the present, to the future. Curr Oncol. 2020;27(Suppl 2):S87-S97. Epub 20200401. doi: 10.3747/co.27.5223 . PubMed PMID: 32368178; PMCID: PMC7194005. Blum JS, Wearsch PA, Cresswell P. Pathways of antigen processing. Annu Rev Immunol. 2013;31:443–73. Epub 20130103. doi: 10.1146/annurev-immunol-032712-095910 . PubMed PMID: 23298205; PMCID: PMC4026165. Leone P, Shin EC, Perosa F, Vacca A, Dammacco F, Racanelli V. MHC class I antigen processing and presenting machinery: organization, function, and defects in tumor cells. J Natl Cancer Inst. 2013;105(16):1172–87. Epub 20130712. doi: 10.1093/jnci/djt184 . PubMed PMID: 23852952. Lee MY, Jeon JW, Sievers C, Allen CT. Antigen processing and presentation in cancer immunotherapy. J Immunother Cancer. 2020;8(2). doi: 10.1136/jitc-2020-001111 . PubMed PMID: 32859742; PMCID: PMC7454179. Neefjes J, Jongsma ML, Paul P, Bakke O. Towards a systems understanding of MHC class I and MHC class II antigen presentation. Nature reviews immunology. 2011;11(12):823–36. Holling TM, Schooten E, van Den Elsen PJ. Function and regulation of MHC class II molecules in T-lymphocytes: of mice and men. Human immunology. 2004;65(4):282–90. Bolon B. Cellular and molecular mechanisms of autoimmune disease. Toxicologic pathology. 2012;40(2):216–29. Brodsky FM, Lem L, Solache A, Bennett EM. Human pathogen subversion of antigen presentation. Immunological reviews. 1999;168(1):199–215. Thibodeau J, Bourgeois-Daigneault M-C, Lapointe R. Targeting the MHC Class II antigen presentation pathway in cancer immunotherapy. Oncoimmunology. 2012;1(6):908–16. Loschko J, Schreiber HA, Rieke GJ, Esterházy D, Meredith MM, Pedicord VA, Yao K-H, Caballero S, Pamer EG, Mucida D. Absence of MHC class II on cDCs results in microbial-dependent intestinal inflammation. Journal of experimental medicine. 2016;213(4):517–34. Pishesha N, Harmand TJ, Ploegh HL. A guide to antigen processing and presentation. Nature Reviews Immunology. 2022;22(12):751–64. Lagergren JH, Nardini JT, Baker RE, Simpson MJ, Flores KB. Biologically-informed neural networks guide mechanistic modeling from sparse experimental data. PLoS computational biology. 2020;16(12):e1008462. Raissi M, Perdikaris P, Karniadakis GE. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. Journal of Computational physics. 2019;378:686–707. Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28(1):27–30. doi: 10.1093/nar/28.1.27 . PubMed PMID: 10592173; PMCID: PMC102409. Fabregat A, Jupe S, Matthews L, Sidiropoulos K, Gillespie M, Garapati P, Haw R, Jassal B, Korninger F, May B. The reactome pathway knowledgebase. Nucleic acids research. 2018;46(D1):D649-D55. Consortium GO. The Gene Ontology (GO) database and informatics resource. Nucleic acids research. 2004;32(suppl_1):D258-D61. Montealegre S, Van Endert PM. Endocytic recycling of MHC class I molecules in non-professional antigen presenting and dendritic cells. Frontiers in immunology. 2019;9:3098. Lindenbergh MF, Stoorvogel W. Antigen presentation by extracellular vesicles from professional antigen-presenting cells. Annual review of immunology. 2018;36(1):435–59. Roche PA, Furuta K. The ins and outs of MHC class II-mediated antigen processing and presentation. Nature Reviews Immunology. 2015;15(4):203–16. Damiani C, Maspero D, Di Filippo M, Colombo R, Pescini D, Graudenzi A, Westerhoff HV, Alberghina L, Vanoni M, Mauri G. Integration of single-cell RNA-seq data into population models to characterize cancer metabolism. PLoS computational biology. 2019;15(2):e1006733. Zhang Y, Kim MS, Nguyen E, Taylor DM. Modeling metabolic variation with single-cell expression data. bioRxiv. 2020:2020.01. 28.923680. Alghamdi N, Chang W, Dang P, Lu X, Wan C, Gampala S, Huang Z, Wang J, Ma Q, Zang Y, Fishel M, Cao S, Zhang C. A graph neural network model to estimate cell-wise metabolic flux using single-cell RNA-seq data. Genome Res. 2021;31(10):1867–84. Epub 20210722. doi: 10.1101/gr.271205.120 . PubMed PMID: 34301623; PMCID: PMC8494226. Frangieh CJ, Melms JC, Thakore PI, Geiger-Schuller KR, Ho P, Luoma AM, Cleary B, Jerby-Arnon L, Malu S, Cuoco MS. Multimodal pooled Perturb-CITE-seq screens in patient models define mechanisms of cancer immune evasion. Nature genetics. 2021;53(3):332–41. Xu Z, Heidrich-O’Hare E, Chen W, Duerr RH. Comprehensive benchmarking of CITE-seq versus DOGMA-seq single cell multimodal omics. Genome Biology. 2022;23(1):135. Chen CY, Mertz KJ, Spinola SM, Morse SA. Comparison of enzyme immunoassays for antibodies to Haemophilus ducreyi in a community outbreak of chancroid in the United States. J Infect Dis. 1997;175(6):1390–5. doi: 10.1086/516471. PubMed PMID: 9180178. Chang W, Dang P, Wan C, Lu X, Fang Y, Zhao T, Zang Y, Li B, Zhang C, Cao S, editors. Spatially and Robustly Hybrid Mixture Regression Model for Inference of Spatial Dependence. 2021 IEEE International Conference on Data Mining (ICDM); 2021: IEEE. Martinez-Bosch N, Vinaixa J, Navarro P. Immune Evasion in Pancreatic Cancer: From Mechanisms to Therapy. Cancers (Basel). 2018;10(1). Epub 20180103. doi: 10.3390/cancers10010006 . PubMed PMID: 29301364; PMCID: PMC5789356. Kabacaoglu D, Ciecielski KJ, Ruess DA, Algul H. Immune Checkpoint Inhibition for Pancreatic Ductal Adenocarcinoma: Current Limitations and Future Options. Front Immunol. 2018;9:1878. Epub 20180815. doi: 10.3389/fimmu.2018.01878 . PubMed PMID: 30158932; PMCID: PMC6104627. Sun Q, Hong Z, Zhang C, Wang L, Han Z, Ma D. Immune checkpoint therapy for solid tumours: clinical dilemmas and future trends. Signal Transduct Target Ther. 2023;8(1):320. Epub 20230828. doi: 10.1038/s41392-023-01522-4 . PubMed PMID: 37635168; PMCID: PMC10460796. Pei L, Liu Y, Liu L, Gao S, Gao X, Feng Y, Sun Z, Zhang Y, Wang C. Roles of cancer-associated fibroblasts (CAFs) in anti- PD-1/PD-L1 immunotherapy for solid cancers. Mol Cancer. 2023;22(1):29. Epub 20230210. doi: 10.1186/s12943-023-01731-z . PubMed PMID: 36759842; PMCID: PMC9912573. Lu C, Liu Z, Klement JD, Yang D, Merting AD, Poschel D, Albers T, Waller JL, Shi H, Liu K. WDR5-H3K4me3 epigenetic axis regulates OPN expression to compensate PD-L1 function to promote pancreatic cancer immune escape. J Immunother Cancer. 2021;9(7). doi: 10.1136/jitc-2021-002624 . PubMed PMID: 34326167; PMCID: PMC8323468. Fang Y, Wang L, Wan C, Sun Y, Van der Jeught K, Zhou Z, Dong T, So KM, Yu T, Li Y, Eyvani H, Colter AB, Dong E, Cao S, Wang J, Schneider BP, Sandusky GE, Liu Y, Zhang C, Lu X, Zhang X. MAL2 drives immune evasion in breast cancer by suppressing tumor antigen presentation. J Clin Invest. 2021;131(1). doi: 10.1172/JCI140837 . PubMed PMID: 32990678; PMCID: PMC7773365. Riaz N, Havel JJ, Makarov V, Desrichard A, Urba WJ, Sims JS, Hodi FS, Martín-Algarra S, Mandal R, Sharfman WH. Tumor and microenvironment evolution during immunotherapy with nivolumab. Cell. 2017;171(4):934–49. e16. Auslander N, Zhang G, Lee JS, Frederick DT, Miao B, Moll T, Tian T, Wei Z, Madan S, Sullivan RJ. Robust prediction of response to immune checkpoint blockade therapy in metastatic melanoma. Nature medicine. 2018;24(10):1545–9. Cho J-W, Hong MH, Ha S-J, Kim Y-J, Cho BC, Lee I, Kim HR. Genome-wide identification of differentially methylated promoters and enhancers associated with response to anti-PD-1 therapy in non-small cell lung cancer. Experimental & molecular medicine. 2020;52(9):1550–63. Phillips D, Matusiak M, Gutierrez BR, Bhate SS, Barlow GL, Jiang S, Demeter J, Smythe KS, Pierce RH, Fling SP. Immune cell topography predicts response to PD-1 blockade in cutaneous T cell lymphoma. Nature Communications. 2021;12(1):6726. Collawn JF, Benveniste EN. Regulation of MHC class II expression in the central nervous system. Microbes and infection. 1999;1(11):893–902. Korn T, Kallies A. T cell responses in the central nervous system. Nature Reviews Immunology. 2017;17(3):179–94. Aloisi F. The role of microglia and astrocytes in CNS immune surveillance and immunopathology. The Functional Roles of Glial Cells in Health and Disease: Dialogue between Glia and Neurons. 1999:123 – 33. Nott A, Holtman IR. Genetic insights into immune mechanisms of Alzheimer’s and Parkinson’s disease. Frontiers in immunology. 2023;14:1168539. Bachiller S, Jiménez-Ferrer I, Paulus A, Yang Y, Swanberg M, Deierborg T, Boza-Serrano A. Microglia in neurological diseases: a road map to brain-disease dependent-inflammatory response. Frontiers in cellular neuroscience. 2018;12:488. Kitano H. Systems biology: a brief overview. Science. 2002;295(5560):1662-4. doi: 10.1126/science.1069492 . PubMed PMID: 11872829. Kamimoto K, Stringa B, Hoffmann CM, Jindal K, Solnica-Krezel L, Morris SA. Dissecting cell identity via network inference and in silico gene perturbation. Nature. 2023;614(7949):742–51. Epub 20230208. doi: 10.1038/s41586-022-05688-9 . PubMed PMID: 36755098; PMCID: PMC9946838. Eissing T, Kuepfer L, Becker C, Block M, Coboeken K, Gaub T, Goerlitz L, Jaeger J, Loosen R, Ludewig B, Meyer M, Niederalt C, Sevestre M, Siegmund HU, Solodenko J, Thelen K, Telle U, Weiss W, Wendl T, Willmann S, Lippert J. A computational systems biology software platform for multiscale modeling and simulation: integrating whole-body physiology, disease biology, and molecular reaction networks. Front Physiol. 2011;2:4. Epub 20110224. doi: 10.3389/fphys.2011.00004 . PubMed PMID: 21483730; PMCID: PMC3070480. Konur S, Mierla L, Fellermann H, Ladroue C, Brown B, Wipat A, Twycross J, Dun BP, Kalvala S, Gheorghe M, Krasnogor N. Toward Full-Stack In Silico Synthetic Biology: Integrating Model Specification, Simulation, Verification, and Biological Compilation. ACS Synth Biol. 2021;10(8):1931–45. Epub 20210802. doi: 10.1021/acssynbio.1c00143 . PubMed PMID: 34339602. Keller R, Dorr A, Tabira A, Funahashi A, Ziller MJ, Adams R, Rodriguez N, Le Novere N, Hiroi N, Planatscher H, Zell A, Drager A. The systems biology simulation core algorithm. BMC Syst Biol. 2013;7:55. Epub 20130705. doi: 10.1186/1752-0509-7-55 . PubMed PMID: 23826941; PMCID: PMC3707837. Satorras VG, Welling M, editors. Neural enhanced belief propagation on factor graphs. International Conference on Artificial Intelligence and Statistics; 2021: PMLR. Lee S-I. Developing a bivariate spatial association measure: an integration of Pearson's r and Moran's I. Journal of geographical systems. 2001;3:369–85. Additional Declarations There is NO Competing Interest. Supplementary Files Supplementarymethodssubmission.docx Supplementary Methods SupplementaryTableS1.xlsx Supplementary Table S1 SupplementaryTableS2.xlsx Supplementary Table S2 SupplementaryTableS3.xlsx Supplementary Table S3 SupplementaryTableS4.xlsx Supplementary Table S4 SupplementaryTableS5.csv Supplementary Table S5 SupplementaryTableS6.xlsx Supplementary Table S6 SupplementaryFigure1.pdf Supplementary Figure S1 SupplementaryFigure2.pdf Supplementary Figure S2 SupplementaryFigure3.pdf Supplementary Figure S3 Cite Share Download PDF Status: Under Review Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-5629379","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Article","associatedPublications":[],"authors":[{"id":392054909,"identity":"e0f9464d-feb2-428b-a1ea-4017a2727ad5","order_by":0,"name":"Chi Zhang","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAAzUlEQVRIiWNgGAWjYDACCTB5gIefgQfMYmwgWotkA6laGAwOEKtFfnbzs4dfau7IGB/vPfyZh8FGdsMBAloY5xwzN5Y59ozH7My5NGkehjRjglqYJRLMpCUbDvOY3cgxY+ZhOJxIUAubRPo3sBbjGTnGQIf9J6yFRyLHTPIjUIuBRI4B0GEHCGuRkMgpk2Y4dphH4swZM8k5BsnGMwlpkZ+Rvk3yR81he/72HuMPbyrsZPsIaQEBZh4404AI5SDA+INIhaNgFIyCUTBCAQAvRUBH4lCZOAAAAABJRU5ErkJggg==","orcid":"","institution":"Indiana University School of Medicine","correspondingAuthor":true,"prefix":"","firstName":"Chi","middleName":"","lastName":"Zhang","suffix":""},{"id":392054910,"identity":"0cb0d9e1-baff-4cfe-9d14-6477fb5829b7","order_by":1,"name":"Jia Wang","email":"","orcid":"","institution":"Indiana University","correspondingAuthor":false,"prefix":"","firstName":"Jia","middleName":"","lastName":"Wang","suffix":""},{"id":392054911,"identity":"16e9c9a6-23c4-4bf2-a62e-ef9cbf55c320","order_by":2,"name":"Pengtao Dang","email":"","orcid":"","institution":"Oregon Health \u0026 Science University","correspondingAuthor":false,"prefix":"","firstName":"Pengtao","middleName":"","lastName":"Dang","suffix":""},{"id":392054912,"identity":"3350302b-1c0e-4370-8d18-c44dfa41fe87","order_by":3,"name":"Yuhui Wei","email":"","orcid":"","institution":"Oregon Health \u0026 Science University","correspondingAuthor":false,"prefix":"","firstName":"Yuhui","middleName":"","lastName":"Wei","suffix":""},{"id":392054913,"identity":"94b4ded0-54b6-45dd-87fa-47d3f59e3e25","order_by":4,"name":"Xiao Wang","email":"","orcid":"","institution":"Indiana University","correspondingAuthor":false,"prefix":"","firstName":"Xiao","middleName":"","lastName":"Wang","suffix":""},{"id":392054914,"identity":"5d8f8f8b-236a-4b08-b3fe-866bc33a3839","order_by":5,"name":"Julie Brothwell","email":"","orcid":"","institution":"Indiana University School of Medicine","correspondingAuthor":false,"prefix":"","firstName":"Julie","middleName":"","lastName":"Brothwell","suffix":""},{"id":392054915,"identity":"5fdc1136-e780-4ba1-b536-0894c0298330","order_by":6,"name":"Yifan Sun","email":"","orcid":"","institution":"Indiana University School of Medicine","correspondingAuthor":false,"prefix":"","firstName":"Yifan","middleName":"","lastName":"Sun","suffix":""},{"id":392054916,"identity":"45c43864-6b4f-4d4f-9cb9-8a28a19a78a9","order_by":7,"name":"Haiqi Zhu","email":"","orcid":"","institution":"Indiana University","correspondingAuthor":false,"prefix":"","firstName":"Haiqi","middleName":"","lastName":"Zhu","suffix":""},{"id":392054917,"identity":"2bd3650c-0991-471a-b618-515faac58465","order_by":8,"name":"Kaman So","email":"","orcid":"","institution":"Indiana University School of Medicine","correspondingAuthor":false,"prefix":"","firstName":"Kaman","middleName":"","lastName":"So","suffix":""},{"id":392054918,"identity":"27c53376-4e20-4c58-af6c-26d39f564ce4","order_by":9,"name":"Jing Liu","email":"","orcid":"https://orcid.org/0000-0002-4912-4560","institution":"Purdue University","correspondingAuthor":false,"prefix":"","firstName":"Jing","middleName":"","lastName":"Liu","suffix":""},{"id":392054919,"identity":"0c7075ef-2519-4dcf-8d41-62307c69ce86","order_by":10,"name":"Yijie Wang","email":"","orcid":"","institution":"Computer Science Department, Indiana University","correspondingAuthor":false,"prefix":"","firstName":"Yijie","middleName":"","lastName":"Wang","suffix":""},{"id":392054920,"identity":"d29d3d38-0a1b-45b8-8f9f-c9c0c99454f8","order_by":11,"name":"Xiongbin Lu","email":"","orcid":"","institution":"Zhejiang University","correspondingAuthor":false,"prefix":"","firstName":"Xiongbin","middleName":"","lastName":"Lu","suffix":""},{"id":392054921,"identity":"e1da16ad-301d-4da8-99d2-848682618598","order_by":12,"name":"Stanley Spinola","email":"","orcid":"","institution":"Indiana University School of Medicine","correspondingAuthor":false,"prefix":"","firstName":"Stanley","middleName":"","lastName":"Spinola","suffix":""},{"id":392054922,"identity":"c67e6a3d-47ed-429b-89c3-a611769830c2","order_by":13,"name":"Xinna Zhang","email":"","orcid":"https://orcid.org/0000-0003-4118-2245","institution":"Indiana University School of Medicine","correspondingAuthor":false,"prefix":"","firstName":"Xinna","middleName":"","lastName":"Zhang","suffix":""},{"id":392054923,"identity":"13238b52-2406-4e29-ae0f-aa5807c1ba75","order_by":14,"name":"Sha Cao","email":"","orcid":"","institution":"Oregon Health \u0026 Science University","correspondingAuthor":false,"prefix":"","firstName":"Sha","middleName":"","lastName":"Cao","suffix":""}],"badges":[],"createdAt":"2024-12-12 07:45:16","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-5629379/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-5629379/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":74019538,"identity":"07d8ead9-078c-48fd-9b06-528e6f4ad548","added_by":"auto","created_at":"2025-01-17 04:52:15","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":3019610,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eOverview of the PSAA framework\u003c/strong\u003e. (a) The analysis pipeline of PSAA. The initial setup of PSAA includes input data and selected antigen presentation pathway. As validated in this study, the input data can be bulk transcriptomics data, scRNA-seq data, spatial transcriptomics data, or proteomics data and the selected pathway can be MHC-I or MHC-II pathways. The core algorithm of PSAA consists of four steps: (1) computing initial flux; (2) projecting the current flux into the flux balance solution space (MPO step); (3) training a neural network to predict the MPO-balanced flux using the input data (SL step); and (4) iteratively conducting steps (2) and (3) until convergence. The output of PSAA is sample-wise activity level of each step in the MHC-I and MHC-II pathways. Downstream applications of PSAA include analyses of differential flux, in-silico perturbation, spatial segmentation, cell-cell interactions, and pathway structure. (b) Reconstructed MHC-I antigen presentation pathway. Rounded boxes represent reactions, and blue dots represent intermediate products. (c) Reconstructed MHC-II antigen presentation pathway.\u003c/p\u003e","description":"","filename":"floatimage1.png","url":"https://assets-eu.researchsquare.com/files/rs-5629379/v1/f0043f9da8f5a27bec56297c.png"},{"id":74019415,"identity":"f382f8e3-10c4-4e1a-8bd8-5e372e8696ed","added_by":"auto","created_at":"2025-01-17 04:44:15","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":864193,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eValidation of the prediction accuracy of PSAA\u003c/strong\u003e. (a) 1\u003csup\u003est\u003c/sup\u003e Row: Consistency of the PSAA predicted MHC-I antigen presentation level (x-axis) vs experimentally measured cell surface MHC-I complex level (y-axis) in three CITE-seq data and TEA-seq data. 2\u003csup\u003end\u003c/sup\u003e Row: Consistency of the expression level of MHC-I genes (x-axis) vs experimentally measured cell surface MHC-I complex level (y-axis). (b) Consistency of the PSAA predicted MHC-I and II level using RNA-seq data (x-axis) and proteomics data (y-axis) in TCGA and CCLE. (c) Consistency of the predicted activity level of different steps in MHC-I pathway predicted using TCGA data by including (x-axis) and excluding (y-axis) the T cell recognition step in the input pathway.\u003c/p\u003e","description":"","filename":"floatimage2.png","url":"https://assets-eu.researchsquare.com/files/rs-5629379/v1/24eb62268f94b6e7c749a959.png"},{"id":74021163,"identity":"33548093-a6fb-49b5-98a6-058c719e1363","added_by":"auto","created_at":"2025-01-17 05:08:20","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":2435727,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eApplication of PSAA and downstream functions on matched single cell and spatial transcriptomics data obtained from \u003c/strong\u003e\u003cem\u003e\u003cstrong\u003eHaemophilus ducreyi\u003c/strong\u003e\u003c/em\u003e\u003cstrong\u003e infected skin and uninfected (wounded) skin. \u003c/strong\u003e(a) tSNE of the distribution of scRNA-seq data and cell types. (b) Distribution of PSAA predicted MHC-II antigen presentation level in each single cell over the tSNE of the scRNA-seq data. (c) Distribution of PSAA predicted cell surface MHC-II antigen presentation level (y-axis) in each cell type (x-axis) on scRNA-seq data. Cell types are colored as (a) and (d). (d) Proportion of each cell type in the pustules and wounds. (e) tSNE plot of APCs and the three APC cell clusters derived using the first-order partial derivative of each gene in the MHC-II pathway. (f) Proportion of APC cell types in the three clusters (left) and PSAA predicted MHC-II whole pathway activity level of each cell type in the three clusters (right). (g) T cell level (left) and PSAA predicted MHC-II whole pathway activity level (right) in the spatial spots of the SRT data. (h) Varied dependency between T cell level and MHC-II antigen presentation level in the SRT data of pustules (blue) vs wounded skin (orange) in patient sample 1 and 2. (i) Distribution of T cells and predicted MHC-II antigen presentation level on the spatial slides.\u003c/p\u003e","description":"","filename":"floatimage3.png","url":"https://assets-eu.researchsquare.com/files/rs-5629379/v1/177a3ee2b5aa43ddb7078acd.png"},{"id":74019418,"identity":"c4571099-d596-4996-85e9-409414d036b0","added_by":"auto","created_at":"2025-01-17 04:44:15","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":1123059,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eApplication of PSAA and downstream functions on TME data. \u003c/strong\u003e(a) PSAA predicted MHC-I antigen presentation on the cell surface vs CTL level + recycling of MHC-I across nine cancer types. (b) Mean CTL cell gene expression level vs MHC-I level - recycling of MHC-I (left) and CTL cell flux prediction level vs MHC-I level - recycling of MHC-I (right). (c) Spatial dissection conducted by using PSAA predicted MHC-I level and T cell level. (d) Pathway enrichment of the differentially expressed genes between different spatial regions. Pathways that directly relate to our collected MHC-I antigen presentation pathway are marked as dark blue. (e) MHC-I antigen presentation level in patients of different responses to PD-1 inhibitors. Progressive Disease (PD), Stable Disease (SD) and Partial Response (PR) groups are colored green, orange, and blue, respectively. Group 1: MSS and pre-treatment patients; Group 2: MSI and pre-treatment patients; Group 3: MSS and on-treatment patients; and Group 4: MSI and on-treatment patients. (f) PSA predicted antigen presentation levels in response vs non-response patients in four independent data sets. (g) p-values of \u003cem\u003eMAL2 \u003c/em\u003eand the genes that are predicted to have the most negative impacts on the MHC-I antigen presentation in 8 TCGA cancer types. (h) Increase of antigen presentation level (y-axis, left) and T cell killing effect (y-axis, right) by CRISPR knocking-down the genes predicted to have the most negative impact on MHC-I antigen presentation. Here x-axis is the -log(p-value) of the impact predicted by using TCGA breast cancer type.\u003c/p\u003e","description":"","filename":"floatimage4.png","url":"https://assets-eu.researchsquare.com/files/rs-5629379/v1/85ae394151c40b58eb316c1c.png"},{"id":74019428,"identity":"75166291-692c-4c8a-8e4d-e5d2439077b3","added_by":"auto","created_at":"2025-01-17 04:44:16","extension":"png","order_by":5,"title":"Figure 5","display":"","copyAsset":false,"role":"figure","size":2182193,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eApplication of PSAA on ROSMAP AD data. \u003c/strong\u003e(a) tSNE plot of selected cells in ROSMAP scRNA-seq data. (b-c) Distribution of predicted activity level of (b) biosynthesis of MHC-II in ER and (c) expression of MHC-II complex on the surface of each cell. (d) Predicted activity level of each reaction step in each cell type using the default optimizer. (e-f) Predicted activity level of biosynthesis of MHC-II in ER and expression of MHC-II complex on the surface by the message passing step after and supervised learning step after burning-in using hyperparameters \u003cem\u003eγ\u003c/em\u003e\u003csub\u003e\u003cem\u003eM1\u003c/em\u003e\u003c/sub\u003e= …=\u003cem\u003eγ\u003c/em\u003e\u003csub\u003e\u003cem\u003eM7\u003c/em\u003e\u003c/sub\u003e=0.5 (e) and \u003cem\u003eγ\u003c/em\u003e\u003csub\u003e\u003cem\u003eM3\u003c/em\u003e\u003c/sub\u003e=0.8, \u003cem\u003eγ\u003c/em\u003e\u003csub\u003e\u003cem\u003eM1\u003c/em\u003e\u003c/sub\u003e\u003cem\u003e \u003c/em\u003e=\u003cem\u003eγ\u003c/em\u003e\u003csub\u003e\u003cem\u003eM2\u003c/em\u003e\u003c/sub\u003e=\u003cem\u003eγ\u003c/em\u003e\u003csub\u003e\u003cem\u003eM4\u003c/em\u003e\u003c/sub\u003e=⋯=\u003cem\u003eγ\u003c/em\u003e\u003csub\u003e\u003cem\u003eM7\u003c/em\u003e\u003c/sub\u003e=0.2 (f).\u003c/p\u003e","description":"","filename":"floatimage5.png","url":"https://assets-eu.researchsquare.com/files/rs-5629379/v1/d17a7d8708abded51dd25033.png"},{"id":74021615,"identity":"b9af8948-9920-44ee-89b2-46a910278543","added_by":"auto","created_at":"2025-01-17 05:08:42","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":8268890,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-5629379/v1/b68d5db9-ab98-4c56-9f13-c7b8e5d93cbc.pdf"},{"id":74020702,"identity":"34bbccbc-bf3c-4537-8b87-435fe3018f04","added_by":"auto","created_at":"2025-01-17 05:00:15","extension":"docx","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":6567245,"visible":true,"origin":"","legend":"Supplementary Methods","description":"","filename":"Supplementarymethodssubmission.docx","url":"https://assets-eu.researchsquare.com/files/rs-5629379/v1/0b59971aa9185bb96bfcea4e.docx"},{"id":74019412,"identity":"68c50ac7-36dc-4920-95c6-823c3d497a63","added_by":"auto","created_at":"2025-01-17 04:44:15","extension":"xlsx","order_by":2,"title":"","display":"","copyAsset":false,"role":"supplement","size":15591,"visible":true,"origin":"","legend":"Supplementary Table S1","description":"","filename":"SupplementaryTableS1.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-5629379/v1/cebe4d90ed3ce49dc7cb26c9.xlsx"},{"id":74020701,"identity":"ac87f27d-0d85-4e5a-a729-aa195eddc4af","added_by":"auto","created_at":"2025-01-17 05:00:15","extension":"xlsx","order_by":3,"title":"","display":"","copyAsset":false,"role":"supplement","size":12582,"visible":true,"origin":"","legend":"\u003cp\u003eSupplementary Table S2\u003c/p\u003e","description":"","filename":"SupplementaryTableS2.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-5629379/v1/a5bb2ebb4bca46e1312c862d.xlsx"},{"id":74019423,"identity":"2c483520-e9be-4075-9907-74929737f2de","added_by":"auto","created_at":"2025-01-17 04:44:16","extension":"xlsx","order_by":4,"title":"","display":"","copyAsset":false,"role":"supplement","size":12498,"visible":true,"origin":"","legend":"Supplementary Table S3","description":"","filename":"SupplementaryTableS3.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-5629379/v1/93770ea25c51f051a044d3c1.xlsx"},{"id":74021154,"identity":"aa88032f-50c0-460d-bc5f-aa5336797999","added_by":"auto","created_at":"2025-01-17 05:08:19","extension":"xlsx","order_by":5,"title":"","display":"","copyAsset":false,"role":"supplement","size":9213,"visible":true,"origin":"","legend":"Supplementary Table S4","description":"","filename":"SupplementaryTableS4.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-5629379/v1/f061a5b197b5215dd5169c12.xlsx"},{"id":74019544,"identity":"261a8991-7d95-4cbb-8fec-59b8a1348cce","added_by":"auto","created_at":"2025-01-17 04:52:16","extension":"csv","order_by":6,"title":"","display":"","copyAsset":false,"role":"supplement","size":20165,"visible":true,"origin":"","legend":"Supplementary Table S5","description":"","filename":"SupplementaryTableS5.csv","url":"https://assets-eu.researchsquare.com/files/rs-5629379/v1/d553627debca4bbf939bccf3.csv"},{"id":74019426,"identity":"e46dea09-398a-47e8-ad61-2ba6427a423a","added_by":"auto","created_at":"2025-01-17 04:44:16","extension":"xlsx","order_by":7,"title":"","display":"","copyAsset":false,"role":"supplement","size":10861,"visible":true,"origin":"","legend":"Supplementary Table S6","description":"","filename":"SupplementaryTableS6.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-5629379/v1/7f37adedf066a0c0b496587e.xlsx"},{"id":74019427,"identity":"446371aa-a1c9-4bad-8ac6-3ab46e930499","added_by":"auto","created_at":"2025-01-17 04:44:16","extension":"pdf","order_by":8,"title":"","display":"","copyAsset":false,"role":"supplement","size":3235499,"visible":true,"origin":"","legend":"Supplementary Figure S1","description":"","filename":"SupplementaryFigure1.pdf","url":"https://assets-eu.researchsquare.com/files/rs-5629379/v1/40f4085596ea7d615310982d.pdf"},{"id":74019432,"identity":"5f88b54b-01a2-41f5-84fa-cb644e48a7ac","added_by":"auto","created_at":"2025-01-17 04:44:16","extension":"pdf","order_by":9,"title":"","display":"","copyAsset":false,"role":"supplement","size":174345,"visible":true,"origin":"","legend":"\u003cp\u003eSupplementary Figure S2\u003c/p\u003e","description":"","filename":"SupplementaryFigure2.pdf","url":"https://assets-eu.researchsquare.com/files/rs-5629379/v1/6db834c5e1b3ee509883b303.pdf"},{"id":74019429,"identity":"bea885a1-1681-41ad-8139-a08ce6f19120","added_by":"auto","created_at":"2025-01-17 04:44:16","extension":"pdf","order_by":10,"title":"","display":"","copyAsset":false,"role":"supplement","size":2021973,"visible":true,"origin":"","legend":"Supplementary Figure S3","description":"","filename":"SupplementaryFigure3.pdf","url":"https://assets-eu.researchsquare.com/files/rs-5629379/v1/daeb78c52d455e8d2806c49e.pdf"}],"financialInterests":"There is \u003cb\u003eNO\u003c/b\u003e Competing Interest.","formattedTitle":"A physics informed neural network approach to quantify antigen presentation activities at single cell level using omics data","fulltext":[{"header":"Introduction","content":"\u003cp\u003eAntigen processing and presentation are fundamental mechanisms within the immune system, wherein intact protein antigens undergo degradation to produce peptide fragments that are loaded onto major histocompatibility complex (MHC) molecules and presented on the cell surface of antigen-presenting cells (APCs), further facilitating recognition by T cells. Humans have two different classes of antigen presentation molecules: MHC class I (MHC-I) molecules mainly bind to endogenous antigens. MHC-I is expressed by all nucleated cells throughout the body and recognized by CD8\u0026thinsp;+\u0026thinsp;cytotoxic T lymphocytes (CTLs). MHC class II (MHC-II) is primarily expressed by professional APCs, mainly bind to exogenous antigens, and are recognized by CD4\u0026thinsp;+\u0026thinsp;cells.\u003c/p\u003e \u003cp\u003eFunctional variations in MHC-I and MHC-II have been reported in different disease conditions and cell types. Immune evasion is a hallmark feature of cancer that enables the cancer cells to escape immune recognition and destruction and is often achieved through the reduction or loss of antigen presentation \u003csup\u003e\u003cspan additionalcitationids=\"CR2\" citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e\u003c/sup\u003e. Immunotherapy, such as checkpoint blockade treatments, boosts T cell-mediated immune responses \u003csup\u003e\u003cspan additionalcitationids=\"CR5\" citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e\u003c/sup\u003e. Immunotherapy relies on the recognition of tumor cells by CTLs, which recognize tumor cells based on tumor associated antigens (TAA) presented on surface of cancer cells via MHC-I \u003csup\u003e\u003cspan additionalcitationids=\"CR8\" citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e\u003c/sup\u003e. Impairing this process will ultimately diminish or eliminate CD8\u0026thinsp;+\u0026thinsp;T cell-mediated tumor cytotoxicity and cause treatment resistance. The MHC complex also plays a significant role in various other diseases, primarily those related to immune system dysfunction\u003csup\u003e\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e, \u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e\u003c/sup\u003e. In autoimmune diseases such as rheumatoid arthritis, multiple sclerosis, and lupus, MHC-II antigen presentation dysregulation leads to the presentation of self-antigens to T cells, triggering an inappropriate immune response against the body's own tissues\u003csup\u003e\u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e12\u003c/span\u003e\u003c/sup\u003e. Some intracellular pathogens can evade immune responses by interfering with pathways involved in MHC-II antigen presentation\u003csup\u003e\u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e13\u003c/span\u003e\u003c/sup\u003e. Diseases characterized by chronic inflammation, such as inflammatory bowel disease (IBD), psoriasis, or even cancer, also involve dysregulated MHC-II antigen presentation, contributing to sustained immune activation and tissue damage\u003csup\u003e\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e, \u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e, \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e\u003c/sup\u003e.\u003c/p\u003e \u003cp\u003eWhile substantial efforts have been focused on studying the source and quality of antigens, the activity level and impairment of the MHC-I and MHC-II pathways have been understudied in disease-specific contexts\u003csup\u003e\u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e\u003c/sup\u003e. To the best of our knowledge, there is a lack of a method that can accurately characterize the dyanamics of the MHC-I and MHC-II antigen presentations and their regulators using omics data. To fill this gap, we developed a computational method named \u003cspan type=\"BoldUnderline\" class=\"BoldUnderline\" name=\"Emphasis\"\u003eP\u003c/span\u003e\u003cb\u003ehysics Informed Neural Network (PINN)-empowered\u003c/b\u003e \u003cspan type=\"BoldUnderline\" class=\"BoldUnderline\" name=\"Emphasis\"\u003eS\u003c/span\u003e\u003cb\u003eystems Biology\u003c/b\u003e \u003cspan type=\"BoldUnderline\" class=\"BoldUnderline\" name=\"Emphasis\"\u003eA\u003c/span\u003e\u003cb\u003enalysis of\u003c/b\u003e \u003cspan type=\"BoldUnderline\" class=\"BoldUnderline\" name=\"Emphasis\"\u003eA\u003c/span\u003e\u003cb\u003entigen Presentation Activity\u003c/b\u003e (\u003cb\u003ePSAA\u003c/b\u003e) to estimate the flux of MHC class I and II antigen presentation pathways at the individual sample or cell resolution using transcriptomics or proteomics data.\u003c/p\u003e \u003cp\u003ePINN is a novel learning framework that combines data-driven machine learning models with governing physical laws, encoded as differential equations, to enhance predictions in complex biological systems \u003csup\u003e\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e, \u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e\u003c/sup\u003e. By incorporating known biological constraints, PINNs allow for more accurate and interpretable modeling of biological processes than conventional data driven approaches using omics data. However, directly applying PINNs to estimate the dynamics of antigen presentation presents challenges due to the static nature of transcriptomics and proteomics data, which lack temporal dynamics and may not capture real-time regulatory fluctuations within MHC pathways. This limitation necessitates careful modeling assumptions to infer pathway activity from such static snapshots.\u003c/p\u003e \u003cp\u003ePSAA was empowered by a new PINN framework. One neural network was trained for each reaction step in the MHC-I or -II to approximate its sample-wise flux rate by leveraging the goodness of fit of their underlying systems biology model with observed omics data. Specifically, the MHC-I and MHC-II pathways consist of large molecular processing reactions that can be treated as enzyme-catalyzed reactions, in which proteins, peptides, antigens, and MHC complexes across different steps and subcellular localizations act as intermediate substrates. In this context, the antigen source serves as the input, while the MHC complexes presented on the cell surface, or further T cell recognition levels, can be considered as the outcome of the pathway. Under this framework, the system operates under steady-state equilibrium, which PSAA leverages to estimate sample-wise MHC-I and MHC-II antigen presentation levels, as well as the impact of each gene involved in the antigen presentation approach for each individual sample.\u003c/p\u003e \u003cp\u003eTo develop PSAA, we first curated and reconstructed the MHC-I and MHC-II pathways by collecting genes involved in each reaction step from multiple pathway databases and literature. Based on the steady-state equilibrium hypothesis, we developed a new flux estimation and optimization method to assess the activity level of each step of the MHC-I and MHC-II pathways in each sample using omics data. PSAA, along with its downstream functions, offers several capabilities: (1) predicting the activity level of the entire process and individual biological steps of the MHC-I and MHC-II pathways in each sample using diverse types of transcriptomics or proteomics data, (2) assessing the impact of each individual gene or step on MHC-I and MHC-II antigen presentation, (3) evaluating the association of MHC-I and MHC-II antigen presentation with T cell activity and other biological or clinical features, and (4) inferring potential regulators or varied network structures of the MHC-I and MHC-II pathways in specific disease or tissue contexts. We rigorously benchmarked the prediction accuracy, robustness, and utility of PSAA using a broad spectrum of public omics data including bulk, single-cell, and spatially resolved transcriptomics or proteomics data of cancer, inflammatory, and neurodegenerative conditions. We also generated matched scRNA-seq and SRT data of bacterial infected human skins vs wound to further demonstrate the application of PSAA in analyzing varied antigen presentation mechanisms under different inflammatory conditions. To validate the prediction of PSAA, we conducted CRISPR knock-down experiments of key genes predicted by PSAA to amplify MHC-I antigen presentations in cancer.\u003c/p\u003e"},{"header":"Results","content":"\u003cp\u003e \u003cem\u003eThe overall framework of PSAA.\u003c/em\u003e \u003c/p\u003e \u003cp\u003ePSAA is a new PINN-based model to quantify sample-wise MHC class I and II antigen presentation activity using transcriptomics or proteomics data. As illustrated in Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003ea, PSAA takes bulk, single-cell, or spatially resolved data as the input to approximate the reaction rate of each step in the MHC-I or MHC-II antigen presentation pathways in each sample. PSAA utilizes a new optimization approach, namely message passing and supervised learning (MPSL), to compute the non-linear dependency between gene expression and reaction rate by leveraging (i) the flux balance of the reaction flows and (ii) the goodness of fit to the data. The output of PSAA provides the sample-wise reaction rates for each step in the selected MHC-I or MHC-II pathway. These rates could be directly used to explore potential associations with other molecular or clinical features, to study the functional role or clinical implications of antigen presentation in the disease context. PSAA also enables downstream analysis, including: (1) evaluating the differences in MHC-I or MHC-II antigen presentation across different disease contexts and their correlations with other features, (2) conducting in-silico perturbation analysis to identify genes with the greatest context-specific impact on the antigen presentation pathway to highlight potential drug targets, (3) performing antigen presentation and recognition-informed spatial segmentation for spatially resolved transcriptomics (SRT) data to analyze interactions between APCs and T cells, and (4) inferring with network structure and potential regulators of antigen presentation and T cell recognition.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003e \u003cem\u003eReconstruction of MHC-I and MHC-II antigen presentation pathways.\u003c/em\u003e \u003c/p\u003e \u003cp\u003eTo accurately estimate the activity level of antigen processing and presentation, we first reconstructed the MHC-I and MHC-II pathways and related branches, ensuring they recapitulate all molecules or their complexes serving as sources, products, and facilitators of all the involved reactions (Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003eb, c). Although databases such as the Kyoto Encyclopedia of Genes and Genomes\u003csup\u003e\u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e19\u003c/span\u003e\u003c/sup\u003e (KEGG), REACTOME\u003csup\u003e\u003cspan citationid=\"CR20\" class=\"CitationRef\"\u003e20\u003c/span\u003e\u003c/sup\u003e, and GO\u003csup\u003e\u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e21\u003c/span\u003e\u003c/sup\u003e provide sets of genes related to antigen presentation, to the best of our knowledge, there is a lack of fully annotated MHC-I and MHC-II pathways that comprehensively cover the individual antigen processing, presentation, and salvage steps for systems biology analysis. Thus, we first collected genes that are involved in the MHC-I and MHC-II pathways and related biological processes via integrating the pathways annotated in these databases and further curated the integrated pathway by an extensive literature search \u003csup\u003e\u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e, \u003cspan additionalcitationids=\"CR23\" citationid=\"CR22\" class=\"CitationRef\"\u003e22\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR24\" class=\"CitationRef\"\u003e24\u003c/span\u003e\u003c/sup\u003e.\u003c/p\u003e \u003cp\u003eOur reconstructed MHC-I pathway consists of eight reaction steps, namely ubiquitination, proteasomal degradation, the action of the transporter associated with antigen processing (TAP), chaperone-assisted assembly of the MHC class I complex, transit from the endoplasmic reticulum (ER) to the Golgi apparatus, and subsequent exocytosis from the Golgi to the cell membrane. Additionally, auxiliary pathways such as de-ubiquitination, verification of MHC class I complex integrity, and recycling (salvage) of cell surface MHC-I complex through endocytosis were also added as branches in this pathway. The final MHC-I antigen presentation pathway includes 284 genes in eight reaction modules.\u003c/p\u003e \u003cp\u003eThe final MHC-II antigen presentation pathway consists of seven reaction modules, including uptake of extracellular proteins into cells, processing of internalized proteins in endosomal/lysosomal compartments, biosynthesis of MHC-II complex in the ER, transport of MHC-II from the ER to the Golgi, transport of MHC-II from the Golgi to Endosomes, association of MHC-II with antigen, and expression of peptide-MHC-II complexes on the cell surface. The reconstructed MHC-II pathway includes 124 genes in seven reaction modules.\u003c/p\u003e \u003cp\u003eNotably, the reconstructed MHC-I and MHC-II pathways were optimized for a systems biology-model based estimation of pathway activity level, which have the following features: (1) the pathways cover the main antigen processing and presentation steps as well as side branches such as de-ubiquitination and recycling of cell surface MHC complexes; (2) the reaction steps within the reconstructed networks were optimized to reduce duplicated genes in adjacent steps; and (3) the reconstructed pathways were represented as directed factor graphs where each intermediate state of antigen or MHC complexes is a factor and each reaction step is a variable. The PINN model was further developed over the directed factor graphs to estimate the activity level of each reaction step using expression changes of the genes or proteins involved in the steps. Detailed pathway reconstructions are provided in \u003cb\u003eSupplementary Methods\u003c/b\u003e, and the reconstructed MHC-I and MHC-II pathways are given in \u003cb\u003eSupplemental Tables S1\u003c/b\u003e and \u003cb\u003eS2\u003c/b\u003e.\u003c/p\u003e \u003cp\u003e \u003cem\u003eSystems biology considerations, mathematical model, and solution of PSAA\u003c/em\u003e.\u003c/p\u003e \u003cp\u003ePSAA is built upon the PINN framework that approximates the sample-wise activity level of MHC-I and MHC-II pathways using transcriptomics or proteomics data. PSAA first hypothesizes that the rate of each reaction of the MHC-I and MHC-II pathway could be inferred from the transcriptomics or proteomics changes of the involved genes via unknown non-linear functions. To train these functions, we first analyzed the systems properties of the MHC pathways. All reactions in the MHC-I and MHC-II pathways are large molecule processing reactions, in which proteins, peptides, MHC class I or II complexes, and their binding forms are substrates or products of each reaction step. The whole pathway satisfies the law of conservation of mass under steady or quasi-steady state, known as the flux balance condition, which has been widely utilized in metabolic flux analysis\u003csup\u003e\u003cspan additionalcitationids=\"CR26\" citationid=\"CR25\" class=\"CitationRef\"\u003e25\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e27\u003c/span\u003e\u003c/sup\u003e. Thus, PSAA further assumes that the predicted activities of each reaction in the MHC-I and MHC-II pathway should satisfy a quasi-flux balance condition.\u003c/p\u003e \u003cp\u003ePSAA approximates reaction rates using non-time course omics data, which differs from traditional PINN models. To enable a robust and explainable prediction, we developed a new learning paradigm\u003cb\u003e\u0026mdash;constrained learning\u003c/b\u003e\u0026mdash;and a new optimization algorithm, as detailed below and in \u003cb\u003eMethods\u003c/b\u003e. Given an omics data set \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:D\$\u003c/span\u003e\u003c/span\u003e and a pathway of \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:m\$\u003c/span\u003e\u003c/span\u003e reactions, denote the flux rate of the \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:ith\$\u003c/span\u003e\u003c/span\u003e reaction (\u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:i=1,\\dots\\:,m\$\u003c/span\u003e\u003c/span\u003e) in the MHC-I or -II pathways in the \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:jth\$\u003c/span\u003e\u003c/span\u003e sample in \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:D\$\u003c/span\u003e\u003c/span\u003e as \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{F}_{i,j}\$\u003c/span\u003e\u003c/span\u003e. PSAA identifies functions \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\mathcal{F}}_{i}\$\u003c/span\u003e\u003c/span\u003e to estimate \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{F}_{i,j}\$\u003c/span\u003e\u003c/span\u003e by \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{F}_{i,j}={\\mathcal{F}}_{i}({D}_{,j},{{\\Theta\\:}}^{i})\$\u003c/span\u003e\u003c/span\u003e and minimizes a loss function \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:L={L}_{FB}\\left({\\widehat{F}}_{,j}^{}\\right)+{L}_{p}({\\mathcal{F}}_{i},{{\\Theta\\:}}_{i}|i=1,\\dots\\:,m)\$\u003c/span\u003e\u003c/span\u003e, where \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\widehat{F}}_{,j}^{}\\triangleq\\:{\\{\\widehat{F}}_{i,j}^{}|i=1,\\dots\\:,m\\}\$\u003c/span\u003e\u003c/span\u003e denotes the predicted reaction rates in sample \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:j\$\u003c/span\u003e\u003c/span\u003e, \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{{\\Theta\\:}}_{i}\$\u003c/span\u003e\u003c/span\u003e denotes parameters of \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\mathcal{F}}_{i}\$\u003c/span\u003e\u003c/span\u003e, \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{L}_{FB}\$\u003c/span\u003e\u003c/span\u003e is a quadratic loss term that regularizes the imbalance of predicted reaction rate, and \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{L}_{p}\$\u003c/span\u003e\u003c/span\u003e denotes an aggregated computational loss term to avoid trivial solutions and to conduct variable selection.\u003c/p\u003e \u003cp\u003eRecognizing that the PSAA framework is neither supervised nor unsupervised learning, we developed a new optimization approach called MPSL (Message Passing enhanced Supervised Learning) to enable a robust and efficient solution of \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\mathcal{F}}_{i}\$\u003c/span\u003e\u003c/span\u003e and \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{{\\Theta\\:}}^{i}\$\u003c/span\u003e\u003c/span\u003e. As illustrated in Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003ea, a two-step optimization approach is introduced to iteratively (1) Message Passing Optimization (MPO) step: Computing \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\widehat{F}}_{,j}^{t+1}\$\u003c/span\u003e\u003c/span\u003e that minimizes \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{L}_{FB}\\left({\\widehat{F}}_{,j}^{t+1}\\right)\$\u003c/span\u003e\u003c/span\u003e by searching through \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\widehat{F}}_{i,j}^{t+1}\$\u003c/span\u003e\u003c/span\u003e within a certain distance to \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\mathcal{F}}_{i}\\left({D}_{,j},{{\\Theta\\:}}_{i}^{t}\\right)\$\u003c/span\u003e\u003c/span\u003e and (2) Supervised Learning (SL) step: Updating \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{{\\Theta\\:}}_{i}^{t+1}\$\u003c/span\u003e\u003c/span\u003e by conducting a supervised training of \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\mathcal{F}}_{i}\\left({D}_{,j},{{\\Theta\\:}}_{i}^{t+1}\\right)\$\u003c/span\u003e\u003c/span\u003e to estimate \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\widehat{F}}_{i,j}^{t+1}\$\u003c/span\u003e\u003c/span\u003e, where \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:t\$\u003c/span\u003e\u003c/span\u003e and \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:t+1\$\u003c/span\u003e\u003c/span\u003e denote successive rounds of iterations. A key challenge of PINN is to balance fit to both physical models and data when using neural networks to approximate the non-linear dependency. The MPSL algorithm addresses this challenge by splitting the fitting process into two iterative steps: fitting the physical model and fitting the data. This makes the data-fitting step a classic supervised learning problem that can be implemented with additional regularization terms for variable selection or better fitting robustness. The MPSL algorithm was validated on an extensive set of simulated data (see details in \u003cb\u003eSupplementary Methods\u003c/b\u003e). The detailed mathematical formulation of the MPSL optimization algorithm is given in \u003cb\u003eMethods\u003c/b\u003e.\u003c/p\u003e \u003cdiv id=\"Sec3\" class=\"Section2\"\u003e \u003ch2\u003eDownstream functionalities of PSAA\u003c/h2\u003e \u003cp\u003eWe also developed a series of downstream functionalities of PSAA. The outputs of PSAA, i.e., sample-wise activity levels for each step in the MHC-I and MHC-II pathways can be directly compared across different contexts. Because \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\mathcal{F}}_{i}\\left({D}_{,j},{{\\Theta\\:}}^{i}\\right)\$\u003c/span\u003e\u003c/span\u003e directly utilizes neural networks to approximate sample-wise reaction rate, its partial derivative \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\frac{\\partial\\:{\\mathcal{F}}_{i}}{\\partial\\:D}\$\u003c/span\u003e\u003c/span\u003e could be computed to evaluate the impact of each gene in each sample. In-silico perturbation analysis can be conducted to identify genes that, when perturbed, may increase or decrease antigen presentation levels. In spatial transcriptomics data, the predicted MHC-I and MHC-II activity level can be directly utilized to characterize the interaction between APCs and T cells, enabling an APC-T cell interaction-based spatial dissection. Additionally, the coherence between the MPO and SL steps can be used to evaluate the goodness of fitting of the data to the systems biology assumption and pathway structure. This coherence can be further used to infer the pathway structure variation.\u003c/p\u003e \u003cp\u003eWe further validated PSAA and its downstream functions using public data and experimental approaches. The MPSL optimization was validated using a set of synthetic data-based experiments. We also demonstrated the application of PSAA and its functions for different antigen-presentation disease scenarios including cancer, bacterial infection, and neurodegeneration.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003e \u003cem\u003ePSAA accurately quantifies antigen presentation activity using transcriptomics or proteomics data.\u003c/em\u003e \u003c/p\u003e \u003cp\u003eTo benchmark PSAA, we first tested the Pearson correlation coefficients (PCC) between the predicted antigen presentation level and the cell surface protein of MHC-I molecules in three independent CITE-seq and TEA-seq data sets (CITE-seq of B-cell malignancies: GSE249542, CITE-seq of melanoma: SCP1064\u003csup\u003e28\u003c/sup\u003e, and TEA-seq of T cells: GSE200417\u003csup\u003e29\u003c/sup\u003e). Both CITE-seq and TEA-seq offer simultaneous measurement of gene expression and protein levels. We found that PSAA-predicted antigen presentation levels were significantly associated with measured cell surface MHC I molecules in all three datasets: GSE200417, PCC\u0026thinsp;=\u0026thinsp;0.363 (p-value\u0026thinsp;\u0026lt;\u0026thinsp;2.2e-16); GSE249542, PCC\u0026thinsp;=\u0026thinsp;0.527 (p-value\u0026thinsp;\u0026lt;\u0026thinsp;2.2e-16), and SCP0164, PCC\u0026thinsp;=\u0026thinsp;0.356 (p-value\u0026thinsp;\u0026lt;\u0026thinsp;2.2e-16), as shown in Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003ea (first row). Noted, these correlations were much higher than those that use averaged HLA class I or II gene expressions to predict antigen presentation activity Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003ea (second row). Our results demonstrate that PSAA could accurately predict cell surface MHC complex presentation levels.\u003c/p\u003e \u003cp\u003eTo evaluate the robustness of PSAA, we applied PSAA to the CCLE and TCGA matched bulk RNA-seq and proteomics data sets, respectively, to test the consistency of the predictions made from transcriptomics and proteomics data. We identified a significant consistency between the predictions made from the two data sources for both MHC-I (PCC\u0026thinsp;=\u0026thinsp;0.43, p-value\u0026thinsp;=\u0026thinsp;4.1e-06) and MHC-II (PCC\u0026thinsp;=\u0026thinsp;0.23, p-value\u0026thinsp;=\u0026thinsp;0.018) antigen presentation pathways in the TCGA data (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eb). In CCLE data, we identified a significant consistency of the MHC-I (PCC\u0026thinsp;=\u0026thinsp;0.37, p-value\u0026thinsp;=\u0026thinsp;1.3e-13) pathway (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eb) and MHC-II pathway (PCC\u0026thinsp;=\u0026thinsp;0.12, p-value\u0026thinsp;=\u0026thinsp;0.02) (\u003cb\u003eSupplementary Fig.\u0026nbsp;1a\u003c/b\u003e). Noted, a weaker correlation of the predicted MHC-II activity in the CCLE data is expected, since the CCLE data set is derived from pure cancer cells that normally do not present MHC-II molecules. Overall, our observation demonstrated that PSAA could robustly predict MHC-I and MHC-II antigen presentation levels using either transcriptomics or proteomics data.\u003c/p\u003e \u003cp\u003eTo assess PSAA\u0026rsquo;s robustness, we conducted an ablation study by removing the \u0026ldquo;T cell recognition\u0026rdquo; module at the end of the antigen presentation pathway and compared with results obtained from the original pathway (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003ec and \u003cb\u003eSupplementary Fig.\u0026nbsp;1b).\u003c/b\u003e While including the \u0026ldquo;T cell recognition\u0026rdquo; module intuitively enhances the completeness of the pathway, and could yield more accurate predictions, we observed significant consistency between the predicted flux of each step when using a network with and without the T cell module (PCC\u0026thinsp;=\u0026thinsp;0.635 in PLC, PCC\u0026thinsp;=\u0026thinsp;0.728 in proteasome, and PCC\u0026thinsp;=\u0026thinsp;0.741 in MHC-I). This indicates that PSAA can accurately quantify antigen presentation levels even without considering the \u0026ldquo;T cell recognition\u0026rdquo; module. Notably, this analysis demonstrated that PSAA could robustly estimate antigen presentation without considering neighboring T cell infiltration levels, making it suitable for applications in single-cell or spatial transcriptomics data, where individual samples (single cell or spatial spot) may not contain information about neighboring T cells.\u003c/p\u003e \u003cp\u003eTo further analyze PSAA\u0026rsquo;s robustness to data sparsity, missing data, and overfitting, we conducted a systematic evaluation on CITE-seq data of peripheral blood mononuclear cells (PBMCs) (GSE249542). We first analyzed the impact of potential dropout events in scRNA-seq or SRT data by simulating different levels of dropout in the input scRNA-seq data. Our results suggested that PCC was highly robust to dropouts (\u003cb\u003eSupplementary Fig.\u0026nbsp;1c\u003c/b\u003e). Similarly, we evaluated the robustness of PCC to missing genes (\u003cb\u003eSupplementary Fig.\u0026nbsp;1c\u003c/b\u003e). We also conducted statistical analysis of the necessary input sample size for PSAA, evaluated overfitting by permutation test, and conducted robustness tests of PSAA (see details in \u003cb\u003eSupplementary Methods\u003c/b\u003e). Using simulated data, MPSL always projected a high-dimensional flux vector to the closest point in the solution space of flux balance (see \u003cb\u003eSupplementary Methods\u003c/b\u003e).\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003e \u003cb\u003eFigure 3. Application of PSAA and downstream functions on matched single cell and spatial transcriptomics data obtained from\u003c/b\u003e \u003cb\u003eHaemophilus ducreyi\u003c/b\u003e \u003cb\u003einfected skin and uninfected (wounded) skin.\u003c/b\u003e (a) tSNE of the distribution of scRNA-seq data and cell types. (b) Distribution of PSAA predicted MHC-II antigen presentation level in each single cell over the tSNE of the scRNA-seq data. (c) Distribution of PSAA predicted cell surface MHC-II antigen presentation level (y-axis) in each cell type (x-axis) on scRNA-seq data. Cell types are colored as (a) and (d). (d) Proportion of each cell type in the pustules and wounds. (e) tSNE plot of APCs and the three APC cell clusters derived using the first-order partial derivative of each gene in the MHC-II pathway. (f) Proportion of APC cell types in the three clusters (left) and PSAA predicted MHC-II whole pathway activity level of each cell type in the three clusters (right). (g) T cell level (left) and PSAA predicted MHC-II whole pathway activity level (right) in the spatial spots of the SRT data. (h) Varied dependency between T cell level and MHC-II antigen presentation level in the SRT data of pustules (blue) vs wounded skin (orange) in patient sample 1 and 2. (i) Distribution of T cells and predicted MHC-II antigen presentation level on the spatial slides.\u003c/p\u003e \u003cp\u003e \u003cem\u003ePSAA accurately estimates MHC II antigen presentation activity and APC-T cell interactions in a bacterial infection model of human volunteers.\u003c/em\u003e \u003c/p\u003e \u003cp\u003eTo further validate PSAA and demonstrate its application in studying the variation of MHC-II pathway in antigen-presenting cells (APCs), we generated a matched scRNA-seq and SRT data sets derived from skin biopsies of four human volunteers. The volunteers provided paired biopsies of skin sites that were wounded (uninfected controls) or inoculated via puncture wounds with \u003cem\u003eHaemophilus ducreyi\u003c/em\u003e until a pustule developed at that site 6\u0026ndash;8 days later (see details in \u003cb\u003eSupplementary Methods\u003c/b\u003e). In the scRNA-seq data, there were four types of APCs (macrophages, pDCs, mPCs, and B cells), two types of skin cells (keratinocytes and melanocytes), three types of stromal cells (endothelial cells, fibroblast cells, and smooth muscle cells), three other immune cell types (ISG-expressing cells, T cells and mast cells), and one unknown cell group (Fig.\u0026nbsp;3a). PSAA was applied to approximate cell surface MHC-II antigen presentation activity in each cell except for those in the unknown cluster (Fig.\u0026nbsp;3b). PSAA identified that the APCs have the highest level of MHC-II antigen presentation activity, followed by endothelial cells and melanocytes, which have an intermediate level of MHC-II antigen presentation activity, and the other immune cell types that have relatively low antigen presentation activity (Fig.\u0026nbsp;3c). We also found that the pDCs in pustules have a higher level of MHC-II presentation compared to wounds (p\u0026thinsp;=\u0026thinsp;4e-12) while the other cell types do not have a significant difference (Fig.\u0026nbsp;3c). In addition, we see a consistent increase of cell proportions of APCs in pustules vs wounds (Fig.\u0026nbsp;3d). The predicted lack of antigen presentation by pDCs and decreased population of macrophages in pustules may reflect the failure of the immune response to clear infection and inability to detect \u003cem\u003eH. ducreyi\u003c/em\u003e-specific antibodies after infection \u003csup\u003e\u003cspan citationid=\"CR30\" class=\"CitationRef\"\u003e30\u003c/span\u003e\u003c/sup\u003e. The increased MHC-II antigen presentation in pustules compared to wounds likely reflects the presence of exogenous antigen and consequent influx of immune cells to the site of infection in pustules; the immune influx is largely absent in wounds.\u003c/p\u003e \u003cp\u003eWe also applied the in-silico perturbation analysis to study which genes contribute most to MHC-II antigen presentation pathways in APCs. The first-order partial derivative of each gene in the PSAA model was computed for each sample, which evaluates the impact of the gene to the pathway activity level if its expression level is disrupted. The partial derivatives of each gene in each reaction step were analyzed to identify if the genes contribute to MHC-II antigen presentation differently (Fig.\u0026nbsp;3e, see details in \u003cb\u003eMethods\u003c/b\u003e). Cell clustering analysis using the partial derivatives identified three APC subgroups that potentially have different MHC-II antigen presentation mechanisms: namely, \u003cem\u003eAPC-G1\u003c/em\u003e, represented by mDCs in both infected and wounded sites; \u003cem\u003eAPC-G2\u003c/em\u003e, consisting of APC cells specifically in infected sites; and \u003cem\u003eAPC-G3\u003c/em\u003e, represented by macrophages in both infected and wounded sites (Fig.\u0026nbsp;3f). Perturbation analysis suggested that the whole activity level of the MHC-II pathway in \u003cem\u003eAPC-G1\u003c/em\u003e mostly determined by \u003cem\u003eHLA-DR\u003c/em\u003e, \u003cem\u003eHLA-DP\u003c/em\u003e, and \u003cem\u003eHLA-DQ\u003c/em\u003e. MHC II in APC-G2 is mostly determined by \u003cem\u003eCD74\u003c/em\u003e, \u003cem\u003eHLA-DM\u003c/em\u003e, and \u003cem\u003eHLA-DR\u003c/em\u003e, and the C1 peptidases \u003cem\u003eCTSZ\u003c/em\u003e and \u003cem\u003eCTSC\u003c/em\u003e, and \u003cem\u003eRAB14\u003c/em\u003e, and MHC-II in APC-G3 specifically depends on the C1 peptidases \u003cem\u003eCTSZ\u003c/em\u003e, \u003cem\u003eCTSB\u003c/em\u003e, \u003cem\u003eCTSS\u003c/em\u003e, \u003cem\u003eCTSD\u003c/em\u003e, \u003cem\u003eCTSH\u003c/em\u003e, and \u003cem\u003eCTSL\u003c/em\u003e (\u003cb\u003eSupplementary Table \u003cspan refid=\"MOESM3\" class=\"InternalRef\"\u003eS3\u003c/span\u003e\u003c/b\u003e). Only the cells in the APC-G1 group had a significantly increased antigen presentation level in pustules versus wounds (Fig.\u0026nbsp;3f). Based on differential gene expression and perturbation analysis, the varied expression in the MHC II genes \u003cem\u003eHLA-DR\u003c/em\u003e, \u003cem\u003eHLA-DP\u003c/em\u003e, and \u003cem\u003eHLA-DQ\u003c/em\u003e drove the differences between cells in the infected and wounded sites.\u003c/p\u003e \u003cp\u003eBy PSAA to the SRT data, we observed again that pustules have significantly increased MHC-II antigen presentation activity and CD4\u0026thinsp;+\u0026thinsp;helper T cells than wounded tissues (Fig.\u0026nbsp;3g). Spatial dependent regression analysis revealed that the CD4\u0026thinsp;+\u0026thinsp;helper T cells level significantly depends on MHC-II activity in both conditions (Fig.\u0026nbsp;3h and \u003cb\u003eSupplementary Fig.\u0026nbsp;2\u003c/b\u003e), and donors showed a higher level of activation dependency (larger slope) between MHC-II antigen presentation and CD4\u0026thinsp;+\u0026thinsp;T cells in pustules than in wounded skin \u003csup\u003e\u003cspan citationid=\"CR31\" class=\"CitationRef\"\u003e31\u003c/span\u003e\u003c/sup\u003e. The strong linear dependency suggests a strong colocalization of APCs with CD4\u0026thinsp;+\u0026thinsp;T cells in both conditions while the varied dependencies suggest a stronger CD4\u0026thinsp;+\u0026thinsp;T cell activation and increased adaptive immune response in pustules than in wounded skin (Fig.\u0026nbsp;3h). In pustules, MHC-II antigen presentation was more diffused throughout the skin; in wounds, high MHC-II activity spots were enriched in the surface regions (epidermis) (Fig.\u0026nbsp;3i). A similar pattern was observed for CD4\u0026thinsp;+\u0026thinsp;helper T cells (Fig.\u0026nbsp;3i).\u003c/p\u003e \u003cp\u003eOur analysis demonstrates the potential utility of PSAA in the integrative analysis of scRNA-seq and SRT data. This approach facilitates the assessment of MHC-II activity and further subtyping of APCs, as well as the interpretation of biological functional variations such as antigen presentation mechanisms, spatial distribution of MHC-II antigen presentation, and spatial dependent MHC-II \u0026ndash; T cell interactions.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003e \u003cem\u003ePSAA captured MHC-I antigen presentation level in cancer tumor microenvironment and identified potential targets to improve CD8\u0026thinsp;+\u0026thinsp;T cell\u0026rsquo;s recognition and cytotoxicity.\u003c/em\u003e \u003cdiv class=\"BlockQuote\"\u003e \u003cp\u003eImmunotherapy has shown remarkable efficacy in treating multiple cancer types\u003csup\u003e\u003cspan additionalcitationids=\"CR5\" citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e\u003c/sup\u003e. However, mechanisms underlying non-responsiveness, particularly in solid tumors, remain poorly understood\u003csup\u003e\u003cspan additionalcitationids=\"CR33 CR34 CR35\" citationid=\"CR32\" class=\"CitationRef\"\u003e32\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR36\" class=\"CitationRef\"\u003e36\u003c/span\u003e\u003c/sup\u003e. In the context of CD8\u0026thinsp;+\u0026thinsp;cytotoxic T lymphocytes (CTL) \u0026ndash; mediated immune responses, recognition of tumor associated antigen occurs through its presentation via MHC-I molecule on tumor cells and their interaction with T cell receptor (TCR) on the CD8\u0026thinsp;+\u0026thinsp;T cells \u003csup\u003e\u003cspan additionalcitationids=\"CR8\" citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e\u003c/sup\u003e. Impairing this event will ultimately reduce or prevent CD8\u0026thinsp;+\u0026thinsp;T cell mediated tumor cytotoxicity. However, reduction or loss of antigen presentation is a frequent mechanism used by tumor cells to escape immune recognition and destruction \u003csup\u003e\u003cspan additionalcitationids=\"CR2\" citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e\u003c/sup\u003e. Little is known regarding how and what variations of gene expression or key biological steps in the MHC-I pathway affect the level of antigen presentation on the surface of cancer cells and their downstream recognition by T cells. Therefore, we applied PSAA to predict how MHC-I antigen presentation affects the immune responses in cancer. We first validated the application of PSAA on TCGA RNA-seq data from nine cancer types. The T cell level could be well explained by the activity level of MHC-I antigen presentation and its recycling (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e4\u003c/span\u003ea). However, the T cell level alone poorly recapitulates MHC-I activity (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e4\u003c/span\u003eb). Further analysis suggested that T cell activation via MHC-I is increased in the cancer TME compared to the adjacent normal tissues although both cancer and normal tissues have a similar amount of the MHC-I presentation (\u003cb\u003eSupplementary Fig.\u0026nbsp;3a\u003c/b\u003e). Our analysis revealed that the majority of the presented MHC-I tend to be recycled in the TME. This observation is consistent with the presentation and salvage of antigen presentation on cancer cells being a key factor for T cell recognition\u003csup\u003e\u003cspan citationid=\"CR37\" class=\"CitationRef\"\u003e37\u003c/span\u003e\u003c/sup\u003e.\u003c/p\u003e \u003cp\u003eWe further applied PSAA to five SRT datasets of cancer TME. Using the PSAA predicted MHC-I level and expression level of CD8\u0026thinsp;+\u0026thinsp;cytotoxic T cell markers, we identified spatial regions of high-/low-MHC-I level and high-/low- CTL cell level (See details in \u003cb\u003eMethods\u003c/b\u003e). Strong consistencies between MHC-I presentation and CTL infiltration, evaluated by Moran\u0026rsquo;s I correlation, was identified in the TME of different cancer types (\u003cb\u003eSupplementary Table \u003cspan refid=\"MOESM4\" class=\"InternalRef\"\u003eS4\u003c/span\u003e\u003c/b\u003e). In addition, we identified a substantial number of distinct spatial regions of high MHC-I and low CTL infiltration (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e4\u003c/span\u003ec and \u003cb\u003eSupplementary Fig.\u0026nbsp;3b\u003c/b\u003e). However, we rarely observed spatial regions of low MHC-I and high CTL infiltration (\u003cb\u003eSupplementary Fig.\u0026nbsp;3b\u003c/b\u003e). All the identified spatial spots of low MHC-I and high CTL infiltration are on the boundary of the high MHC-I and high CTL regions. Moreover, the numbers of spatial spots of high MHC-I and low CTL infiltration are consistently higher than the spots of low MHC-I and high CTL infiltration in all analyzed data (\u003cb\u003eSupplementary Table \u003cspan refid=\"MOESM4\" class=\"InternalRef\"\u003eS4\u003c/span\u003e\u003c/b\u003e). Our observations suggested that there were additional factors in these regions, such as stromal variations or metabolic shifts, inhibited the infiltration of CTLs. To identify the biological functions that are associated with low T cell infiltration, we utilized a generalized linear model to identify the genes that are consistently and specifically expressed in the regions of high MHC-I and low CTL versus the other regions in the analyzed data and downstream pathway enrichment analysis (see \u003cb\u003eMethods\u003c/b\u003e).\u003c/p\u003e \u003cp\u003eWe observed upregulated CTL-related pathways in high MHC-I and high CTL vs high MHC-I and low CTL regions, upregulated MHC-I genes in high MHC-I and low CTL vs low MHC-I and high CTL regions, and upregulated general adaptive immune responses in high MHC-I and high CTL vs low MHC-I and low CTL regions (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e4\u003c/span\u003ed). We also identified new stromal, TME, and metabolic changes that may be related to low CTL-infiltration in high MHC-I regions. Upregulated ECM formation, coagulation cascades, myeloid cells including monocytes and granulocytes, and chemokine receptors were seen in high MHC-I and high CTL vs high MHC-I and low CTL regions. In addition, increased metabolic activities including glycolysis, TCA cycle, oxidative phosphorylation, electron transport chain (ETC), glycan and steroid synthesis were seen in (1) high MHC-I and low CTL vs low MHC-I and high CTL regions and (2) high MHC-I and high CTL vs low MHC-I and low CTL regions, suggesting possible roles of metabolic activity related to MHC-I presentation (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e4\u003c/span\u003ed). We also observed down regulation of the TGBF-beta, JAK-Stat, glucose transport, glutathione metabolism, and histone deacetylase III pathways in high MHC-I and high CTL vs high MHC-I and low CTL regions.\u003c/p\u003e \u003cp\u003eTo demonstrate the clinical implication of PSAA, we applied the method to the bulk RNA-seq data of a melanoma data set (GSE91061) collected from patients under anti-PD1 therapy using Nivolumab\u003csup\u003e\u003cspan citationid=\"CR38\" class=\"CitationRef\"\u003e38\u003c/span\u003e\u003c/sup\u003e. In total, we obtained 105 samples from this data set, including 48 partial response (PR), 34 stable disease (SD), and 23 progressive disease (PD) patients. We applied PSAA to compute sample-wise activity level of the eight steps in the MHC-I pathway. Biologically, we expect the higher level of antigen presentation activity is associated with better response. We adopted multi-variate logistic regression with a L1-penalty to identify the top variables and best model in predicting patients\u0026rsquo; response to the ant-PD1 therapy. Considering that the quality of cancer associated antigen also determines CTL recognition, we included predicted microsatellite stability (MSS) or microsatellite instability (MSI) status as a confounding factor (see details in \u003cb\u003eSupplementary Methods\u003c/b\u003e). In addition, we also introduced total T cell level and cytotoxic CD8\u0026thinsp;+\u0026thinsp;T cell level predicted by deconvolution analysis and the on-/off-treatment status provided in the data as additional factors. The final selected model is:\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Equa\" class=\"Equation\"\u003e \u003cdiv format=\"TEX\" class=\"mathdisplay\" id=\"FileID_Equa\" name=\"EquationSource\"\u003e\n$$\\:Responsiveness\\:\\sim\\:logistic\\:(3.02\\bullet\\:Peptide\\:loading+\\:1.12\\bullet\\:MSI\\:status\\:-1.55)$$\u003c/div\u003e \u003c/div\u003e \u003c/p\u003e \u003cp\u003e, where \u003cem\u003eResponsiveness\u003c/em\u003e is a binary variable that takes values in \u0026ldquo;responder\u0026rdquo; and \u0026ldquo;non-responder\u0026rdquo;, and the \u003cem\u003ep\u003c/em\u003e-value of the peptide loading and MSI status are 6.4e-4 and 0.02, respectively, suggesting the activity of the peptide loading step has a higher power predicting the outcome of immuno-therapy than MSI status and T cell abundance.\u003c/p\u003e \u003cp\u003eWe further checked how the predicted rate of peptide loading varies with respect to responsiveness, MSI status, and treatment status. We observed that the MHC-I antigen presentation level in the PR group is consistently higher than the SD and PD group, and the SD group is also higher than the PD group, in both MSS and MSI, and on-treatment/pre-treatment patient groups. Limited by sample size, a significant difference of the MHC-I antigen presentation level is only observed between the PR and SD groups vs the PD group in the on-treatment patient of MSS (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e4\u003c/span\u003ee). Specifically, the MSS of PR and SD on treatment patients have a significant increase of predicted rate of peptide loading compared to (1) the MSS of PR patients on-treatment patients (p\u0026thinsp;=\u0026thinsp;0.0026) and (2) the MSI of all PD and SD on-treatment patients (p\u0026thinsp;=\u0026thinsp;0.022). Our observation suggests that PD-1 inhibitor has a better efficacy in the melanoma patients of low mutation load or low cancer-associated antigen quality if the cancer cells have a higher level of MHC-I presentation. Complete predicted activity level of MHC-I reactions and clinical information of each sample are given in \u003cb\u003eSupplementary Table S5\u003c/b\u003e. To validate our observations, we also analyzed three additional datasets collected from melanoma (GSE115821\u003csup\u003e39\u003c/sup\u003e), lung cancer (GSE126043\u003csup\u003e40\u003c/sup\u003e), and cutaneous T cell lymphoma (GSE162137\u003csup\u003e41\u003c/sup\u003e) patients treated by PD-1 inhibitor. Increased MHC-I antigen presentation levels were detected in the responders compared to non-responding patients in all the datasets (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e4\u003c/span\u003ef).\u003c/p\u003e \u003cp\u003eWe further utilized the in-silico perturbation function of PSAA to identify the genes that could be targeted to improve the presentation level of MHC-I complex on the surface of a cell. Previous studies reported that \u003cem\u003eMAL2\u003c/em\u003e in the salvage pathway of MHC-I is a critical negative regulator of MHC-I presentation \u003csup\u003e\u003cspan citationid=\"CR37\" class=\"CitationRef\"\u003e37\u003c/span\u003e\u003c/sup\u003e. A leave-one-out statistical test evaluates the significance of change of the predicted flux balance when including or excluding \u003cem\u003eMAL2\u003c/em\u003e in the recycling step of the MHC-I pathway (see details in \u003cb\u003eMethods and Supplementary Table S6\u003c/b\u003e). Application of the test on TCGA data revealed that \u003cem\u003eMAL2\u003c/em\u003e is significantly involved in the recycling step of the MHC-I pathway in seven cancer types.\u003c/p\u003e \u003cp\u003eTo experimentally validate the PSAA predictions, we used in-silico perturbation analysis to predict genes that consistently negatively impact MHC-I antigen presentation using pan-cancer data from TCGA. Specifically, we added each gene into the recycling module to train the PSAA model and rank the genes according to their PSAA-predicted cell surface MHC-I level when perturbing their gene expression (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e4\u003c/span\u003eg). To further validate the PSAA analysis, we knocked down the top predicted genes in mouse breast cancer cells and evaluated the antigen presentation level and T cell killing between knockdown vs control for the top 11 predicted genes. We observed that knocking down each of the predicted genes significantly increased MHC-I antigen presentation and T cell killing effect (range: 22\u0026ndash;148%) (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e4\u003c/span\u003eh). See experimental details in \u003cb\u003eMethods\u003c/b\u003e.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003e \u003cem\u003ePSAA identified unaligned MHC-II antigen presentation steps in microglia and other cell types in Alzheimer\u0026rsquo;s disease brain.\u003c/em\u003e \u003c/p\u003e \u003cp\u003eAlthough the brain was initially considered as an immune-privileged site where antigen presentation would not occur, both microglia and astrocytes present antigens via MHC class II molecules to activate CD4\u0026thinsp;+\u0026thinsp;T cells and stimulate immune responses in the central nervous system (CNS)\u003csup\u003e\u003cspan additionalcitationids=\"CR43\" citationid=\"CR42\" class=\"CitationRef\"\u003e42\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR44\" class=\"CitationRef\"\u003e44\u003c/span\u003e\u003c/sup\u003e. However, the role of the immune system and the antigen presentation process is not well understood in neurodegenerative diseases, such as Alzheimer\u0026rsquo;s disease and Parkinson\u0026rsquo;s disease\u003csup\u003e\u003cspan citationid=\"CR45\" class=\"CitationRef\"\u003e45\u003c/span\u003e, \u003cspan citationid=\"CR46\" class=\"CitationRef\"\u003e46\u003c/span\u003e\u003c/sup\u003e.\u003c/p\u003e \u003cp\u003eTo understand the cell type-specific MHC-II antigen presentation status in the AD brain, we applied PSAA on the ROSMAP AD scRNA-seq data to evaluate the MHC class II antigen presentation levels in different cell types. Cell type annotation was provided for the 172,659 cells, including seven cell types: microglia (Mic), astrocytes (Ast), endothelial cells (End), excitatory neurons (Exc), inhibitory neurons (Inh), oligodendrocytes (Oli), and oligodendrocyte precursor cells (OPC). For a better visualization, we randomly sample 500 cells of high total UMI from each cell type in the AD samples (Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e5\u003c/span\u003ea) and applied PSAA to estimate the MHC class II antigen presentation level in each cell (Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e5\u003c/span\u003eb, \u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e5\u003c/span\u003ec). PSAA identified that microglia have the highest activities of the reactions involved in the 'Processing of internalized proteins in endosomal/lysosomal' and 'Biosynthesis of MHC-II complex' (Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e5\u003c/span\u003ed). However, microglia and astrocytes exhibit lower levels of further processing modules of MHC class II molecules compared to Exc and Inh (Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e5\u003c/span\u003ed). Compared to the antigen presentation cells in infectious disease and the TME of cancer, microglia in the AD brain have a much higher inconsistency between the MHC-II complex biosynthesis step and its processing and presentation onto cell surface. Thus, we hypothesize that even though microglia can express and produce MHC class II molecules, they still have a low rate of MHC-II complex presentation on the cell surface.\u003c/p\u003e \u003cp\u003eTo further test this hypothesis, we adjusted the weight for the imbalance loss of each reaction step when applying the optimization algorithm, MPSL. Modulating these weights facilitates an extensive search for the potential solution space of the functions that could enhance the flux balance across the entire pathway. We observed that the iterations between the message passing optimization and supervised learning steps failed to yield a consistent distribution of the flux when the algorithm reached convergence (burns-in) across all tested hyperparameters (Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e5\u003c/span\u003ee, \u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e5\u003c/span\u003ef). We also examined how the flux distribution responded to the changes of the hyperparameters. A higher level of MHC-II activity in microglia was predicted by the message passing step when using a higher weight of the flux balance loss for the \"Biosynthesis of MHC-II complex\" reaction (Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e5\u003c/span\u003ef). However, the flux predicted by the supervised learning step did not align with the message-passing-derived flux.\u003c/p\u003e \u003cp\u003eThis inconsistency suggests that there is no function that could be identified by PSAA to map the ROSMAP AD brain scRNA-seq data onto the flux balance solution space of the MHC-II pathway, implying that brain cells do not adhere to the classic MHC-II pathway characteristic of immune cells. Our analysis indicates the potential existence of an alternative mechanism for peptide processing and loading in the MHC-II pathways, or a mismatch between peptide processing and MHC-II complex expression in the microglia in AD brains.\u003c/p\u003e \u003c/div\u003e"},{"header":"Discussion","content":"\u003cp\u003eSystems biology characterizes the motion of molecules within a complex biological process as differential equation-based dynamic systems \u003csup\u003e\u003cspan citationid=\"CR47\" class=\"CitationRef\"\u003e47\u003c/span\u003e, \u003cspan citationid=\"CR48\" class=\"CitationRef\"\u003e48\u003c/span\u003e\u003c/sup\u003e. Recently, physics-informed neural networks (PINNs) have emerged as a powerful approach in this field, integrating neural network architectures with physical laws to model complex biological systems. A reliable systems biology model provides explicit quantification and interpretations of the system \u003csup\u003e\u003cspan citationid=\"CR48\" class=\"CitationRef\"\u003e48\u003c/span\u003e, \u003cspan citationid=\"CR49\" class=\"CitationRef\"\u003e49\u003c/span\u003e\u003c/sup\u003e, and enables simulation and perturbation analysis to study the impact of each biological feature and their interactive effects in the system \u003csup\u003e\u003cspan citationid=\"CR48\" class=\"CitationRef\"\u003e48\u003c/span\u003e, \u003cspan citationid=\"CR50\" class=\"CitationRef\"\u003e50\u003c/span\u003e, \u003cspan citationid=\"CR51\" class=\"CitationRef\"\u003e51\u003c/span\u003e\u003c/sup\u003e. Despite a plethora of knowledge on the differential equation-based systems biology model, dynamic models are difficult to establish within specific biological contexts, especially when only static data is available. Although a challenge, the large amount of biological omics data has the potential to characterize complex biological systems.\u003c/p\u003e \u003cp\u003eHere we presented the PSAA framework that approximates the activity level and dynamics of MHC-I and MHC-II pathways by using multi-omics data. Compared to conventional systems biology and data-driven computational biology approaches, PSAA bridges omics data with explicit systems biology models by integrating neural network frameworks. Crucially, we demonstrate that for systems with steady-state equilibrium\u0026mdash;such as metabolic pathways and macromolecule processing\u0026mdash;dynamic behaviors can be inferred from non-time course omics data, expanding the applicability of PSAA to a broader range of biological data contexts. PSAA demonstrated that reaction rates within the system having equilibrium steady states can be approximated by properly designed neural networks, which forms a new type of physics informed neural network. We call this type of analysis PINN-empowered and data-driven systems biology and the underlying learning paradigm as constrained learning. Constrained learning is defined by (1) approximating the non-linear dependency between the dynamics of biological reactions and observed omics data by AI-based non-linear solvers such neural networks and (2) constraining the solution space (functional space) of the non-linear solver by the dynamic properties of the systems, i.e, a type of PINN. We provided a mathematical formulation of the general constrained learning problem (see details in \u003cb\u003eSupplementary Methods\u003c/b\u003e).\u003cdiv class=\"BlockQuote\"\u003e\u003cp\u003ePSAA illustrates a newly defined machine learning paradigm and PINN architecture, which falls outside of traditional supervised and unsupervised learning. We refer to this learning paradigm as \u003cb\u003econstrained learning\u003c/b\u003e for data-driven systems biology. This approach is characterized by identifying the mathematical model to quantify the reaction rates within a given omics data set, while enforcing coherence between the mathematical property of the model and the systems biological property of the reactions being studied. Empowered by this idea, the PSAA framework provides the following unmet capabilities: (1) quantify sample-wise activity level of the whole process and individual biological steps of MHC-I and MHC-II pathways using bulk, single-cell, or spatially resolved transcriptomics or proteomics data; (2) compute the dependency of MHC-I antigen presentation with T cell infiltration and activity level, and other biological processes or clinical features; (3) dissection of spatial regions of varied T cell infiltration and antigen presentation levels in SRT data; (4) assess the impact of the expression change of each gene on the MHC-I and MHC-II pathways in each sample; (5) prediction of possible drugs to perturb the level of MHC-I and MHC-II antigen presentation; and (6) inference of the goodness of fit of the data to prior assumed biological pathway structure. Selected analyses including prediction accuracy and targets predicted by in-silico perturbations have been validated on our in-house generated and public domain data sets.\u003c/p\u003e\u003cp\u003eFigure 3h, \u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e4\u003c/span\u003ea and \u003cb\u003eSupplementary Fig.\u0026nbsp;3a\u003c/b\u003e suggested that different samples may have varied levels of T cell activation. It is noteworthy that PSAA only models the flux rate of MHC-I/II pathway in a cell or tissue sample. T cell activation depends on the interaction of a specific TCR with the peptide-MHC complex and engagement with other co-stimulatory molecules. However, the TCR activation reaction, which is a signaling procedure, does not satisfy the condition of steady-state equilibrium. Thus, PSAA cannot effectively handle variations in TCR differences. In each training of the PSAA model, it is assumed that all samples share the same rate of T cell recognition and activation. One future direction is to extend the current PSAA by including parameters of TCR recognition and T cell activation to better characterize the interactions between APCs and T cells.\u003c/p\u003e\u003c/div\u003e\u003c/p\u003e"},{"header":"Methods","content":"\u003cp\u003e \u003cb\u003ePublic Data used in the analysis.\u003c/b\u003e \u003c/p\u003e \u003cp\u003eReaction information of the genes involved in each step of MHC-I and II pathways were collected from KEGG, Reactome, GO databases and literature data. Detailed information of the pathway reconstruction was given in \u003cb\u003eSupplementary Methods\u003c/b\u003e.\u003c/p\u003e \u003cp\u003eWe collected three CITE-seq data sets, GSE249542, GSE200417\u003csup\u003e29\u003c/sup\u003e, and SCP1064\u003csup\u003e28\u003c/sup\u003e, from the GEO database to validate the predication accuracy of PSAA. Bulk RNA-seq and proteomics data of cancer cell lines from CCLE and cancer tissue data from TCGA were utilized to validate the PSAA predictions. RPKM normalized RNA-seq data and normalized proteomics data were downloaded from cBioPortal. Cancer types were selected based on the availability of normal samples in TCGA. We retrieved five spatial transcriptomics slides of cancer tissue from the 10x Genomics website and GSE206522 to validate the application of PSAA to cancer data. Four transcriptomics data of patient samples collected from clinical trials of anti-PD1/CTLA-4 therapies, GSE91061\u003csup\u003e38\u003c/sup\u003e, GSE115821\u003csup\u003e39\u003c/sup\u003e, GSE126043\u003csup\u003e40\u003c/sup\u003e, and GSE162137\u003csup\u003e41\u003c/sup\u003e, were collected from the GEO database to demonstrate the clinical implications of PSAA predicted antigen presentation levels. ROSMAP scRNA-seq data of AD brain was retrieved from ROSMAP data portal. Detailed information of data retrieval, processing, and normalization were given in \u003cb\u003eSupplementary Methods\u003c/b\u003e.\u003c/p\u003e \u003cp\u003eDetailed experimental procedures of the scRNA-seq data of \u003cem\u003eH. ducreyi\u003c/em\u003e infection from four human volunteers and matched spatial transcriptomics data, including sample collection, processing, library construction, and sequencing, were given in \u003cb\u003eSupplementary Methods\u003c/b\u003e. The data was deposited to dbGaP (phs003754).\u003c/p\u003e\n\u003ch3\u003eMathematical formulation of PSAA\u003c/h3\u003e\n\u003cp\u003e \u003c/p\u003e\u003cdiv class=\"BlockQuote\"\u003e \u003cp\u003ePSAA utilizes a directed factor graph base representation of the MHC-I or II pathway and a PINN model for reaction rate estimation. We first reconstruct the MHC-I or II pathway into a directed factor graph, in which each reaction \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\left(R\\right)\$\u003c/span\u003e\u003c/span\u003e is a variable, and each intermediate molecule such as peptides and MHC complex \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\left(C\\right)\$\u003c/span\u003e\u003c/span\u003e are factors.\u003c/p\u003e \u003cp\u003eDenote \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:FG\\left(C,R,\\:{E}_{C\\to\\:R},{E}_{R\\to\\:C}\\right)\$\u003c/span\u003e\u003c/span\u003e as the factor graph, where \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:C\$\u003c/span\u003e\u003c/span\u003e is the set of intermediate molecules, \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:R\$\u003c/span\u003e\u003c/span\u003e is the set of reactions, \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{E}_{C\\to\\:R}\$\u003c/span\u003e\u003c/span\u003e and \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{E}_{R\\to\\:C}\$\u003c/span\u003e\u003c/span\u003e are direct edges represent \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:R\$\u003c/span\u003e\u003c/span\u003e consumes or produces \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:C\$\u003c/span\u003e\u003c/span\u003e. Denote \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\mathcal{R}}_{in}^{{C}_{k}}=\\left\\{{R}_{m}\\right|{(R}_{m}\\to\\:{C}_{k})\\in\\:{E}_{C\\to\\:R}\\}\$\u003c/span\u003e\u003c/span\u003e and \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\mathcal{R}}_{out}^{{C}_{k}}=\\left\\{{R}_{m}\\right|\\left({C}_{k}\\to\\:{R}_{m}\\right)\\in\\:{E}_{R\\to\\:C}\\:\\}\$\u003c/span\u003e\u003c/span\u003e as the sets of the reactions that produce \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{C}_{k}\$\u003c/span\u003e\u003c/span\u003e or consume \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{C}_{k}\$\u003c/span\u003e\u003c/span\u003e. For an omics data of \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:N\$\u003c/span\u003e\u003c/span\u003e samples, denote \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{X}_{j}^{m}=\\left\\{{x}_{1,j}^{m},\\dots\\:,{x}_{{i}_{m},j}^{m}\\right\\}\$\u003c/span\u003e\u003c/span\u003e as the expression of the genes or proteins involved in the reaction \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{R}_{m}\$\u003c/span\u003e\u003c/span\u003e and \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{F}_{m,j}\$\u003c/span\u003e\u003c/span\u003e as the rate of \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{R}_{m}\$\u003c/span\u003e\u003c/span\u003e in sample \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:j\$\u003c/span\u003e\u003c/span\u003e. We model \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{F}_{m,j}=\\mathcal{F}\\left({X}_{j}^{m},\\:{{\\Theta\\:}}_{m}\\right)\$\u003c/span\u003e\u003c/span\u003e as a multi-layer neural network with the input \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{X}_{j}^{m}\$\u003c/span\u003e\u003c/span\u003e, here \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{{\\Theta\\:}}_{m}\$\u003c/span\u003e\u003c/span\u003e denotes the parameter of the neural network. \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{{\\Theta\\:}}_{m}\$\u003c/span\u003e\u003c/span\u003e and cell-wise flux \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{F}_{m,j}\$\u003c/span\u003e\u003c/span\u003e are solved by minimizing the loss:\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Equb\" class=\"Equation\"\u003e \u003cdiv format=\"TEX\" class=\"mathdisplay\" id=\"FileID_Equb\" name=\"EquationSource\"\u003e\n$$\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\sum\\:_{j=1}^{N}\\sum\\:_{k=1}^{K}{{\\gamma\\:}_{k}\\left({\\sum\\:}_{{m\\in\\:\\mathcal{R}}_{in}^{{C}_{k}}}{F}_{m,j}-{\\sum\\:}_{{{m}^{{\\prime\\:}}\\in\\:\\mathcal{R}}_{out}^{{C}_{k}}}{F}_{{m}^{{\\prime\\:}},j}\\right)}^{2}+{L}_{p}\\left(\\mathcal{F},{\\Theta\\:}{\\prime\\:}\\right)\\:\\:\\:\\:①\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:\\:$$\u003c/div\u003e \u003c/div\u003e \u003cp\u003e\u003c/p\u003e \u003cp\u003e \u003c/p\u003e\u003cdiv class=\"BlockQuote\"\u003e \u003cp\u003eHere the first loss term regularizes the coherency to the flux balance condition of the biological system on the observed data. A quadratic loss was used to enable certain imbalances for the samples under quasi-steady state or non-steady state. Parameter \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\gamma\\:}_{k}\$\u003c/span\u003e\u003c/span\u003e is introduced in PSAA to enable different weights for the flux balance of different steps. Noted, the whole network of antigen processing, presentation, and recognition is composed by three main branches, namely (1) peptide generation (ubiquitination and proteosome), (2) peptide loading, and (3) processing and exocytosis of MHC class I or II complexes, where only the step (2) is antigen presentation specific while the activity level of (1) and (3) could be regulated for other biological activities that the presentation of MHC complexes. Thus, \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\gamma\\:}_{k}\$\u003c/span\u003e\u003c/span\u003e is introduced to leverage the influence of varied expression levels of different steps in antigen presentation. We demonstrated the proper value range of \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\gamma\\:}_{k}\$\u003c/span\u003e\u003c/span\u003e can be identified by maximizing the best fitting of the data to the underlying systems biology model of the antigen presentation pathways. Detailed systems biology consideration is provided in \u003cb\u003eSupplementary Methods\u003c/b\u003e.\u003c/p\u003e \u003c/div\u003e \u003cp\u003e\u003c/p\u003e\n\u003ch3\u003eMPSL optimization algorithm\u003c/h3\u003e\n\u003cp\u003e \u003c/p\u003e\u003cdiv class=\"BlockQuote\"\u003e \u003cp\u003eThe solution of \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:①\$\u003c/span\u003e\u003c/span\u003e is non-trivial because it needs to search over the functional space of \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\mathcal{F}}_{m}({X}_{j}^{m},{{\\Theta\\:}}^{m})\$\u003c/span\u003e\u003c/span\u003e and parameter space of \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{{\\Theta\\:}}^{m}\$\u003c/span\u003e\u003c/span\u003e for each reaction \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:m\$\u003c/span\u003e\u003c/span\u003e. In PSAA, we utilize neural networks to approximate the non-linear dependency between omics data and reaction rate \u003csup\u003e\u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e27\u003c/span\u003e\u003c/sup\u003e. Noted, minimization of the loss term \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:①\$\u003c/span\u003e\u003c/span\u003e neither falls into the paradigm of supervised learning nor unsupervised learning. Instead, the loss term characterizes the biochemical dependencies among \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\mathcal{F}\$\u003c/span\u003e\u003c/span\u003e. Furthermore, the sparse learning penalty term \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{L}_{p}\\left(\\mathcal{F},{\\Theta\\:}{\\prime\\:}\\right)\\:\$\u003c/span\u003e\u003c/span\u003ecannot be directly co-optimized with the flux balance term by using existing methods. Thus, we developed a new optimization algorithm for \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:①\$\u003c/span\u003e\u003c/span\u003e, namely Message Passing and Supervised Learning (MPSL). Specifically, considering the flux balance term as additional constraints of the solution space of \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\mathcal{F}}_{m}\$\u003c/span\u003e\u003c/span\u003e, we proposed an iterative approach to effectively search for the solution space. Because the flux balance constraint could be represented as factors over an directed factor graph, in which each function \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\mathcal{F}\$\u003c/span\u003e\u003c/span\u003e is a variable and Information balance over a factor can be achieved by belief propagation \u003csup\u003e\u003cspan citationid=\"CR52\" class=\"CitationRef\"\u003e52\u003c/span\u003e\u003c/sup\u003e. Based on this idea, we develop a three-step optimizer to minimize \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:①\$\u003c/span\u003e\u003c/span\u003e:\u003c/p\u003e \u003c/div\u003e \u003cp\u003e\u003c/p\u003e \u003cp\u003e \u003c/p\u003e\u003col\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003e \u003cb\u003eStep 1 (Initialization)\u003c/b\u003e: Classic gradient decent is first utilized to generate an initial solution \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\mathcal{F}({D}_{0},{{\\Theta\\:}}_{0}^{{\\prime\\:}})\$\u003c/span\u003e\u003c/span\u003e.\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003e \u003cb\u003eStep 2 (Message Passing Optimization, MPO)\u003c/b\u003e: for any given \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\mathcal{F}({D}_{0},{{\\Theta\\:}}_{0}^{{\\prime\\:}})\$\u003c/span\u003e\u003c/span\u003e with fixed \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{D}_{0}\$\u003c/span\u003e\u003c/span\u003e and \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{{\\Theta\\:}}_{0}^{{\\prime\\:}}\$\u003c/span\u003e\u003c/span\u003e, the optimization strategy adopts the idea of belief propagation (BP) to identify a solution \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\mathcal{F}\\mathcal{{\\prime\\:}}\$\u003c/span\u003e\u003c/span\u003e, which is on the solution space of \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\Phi\\:}\\left(\\mathcal{R}\\left({\\Theta\\:}\\right)\\right)\$\u003c/span\u003e\u003c/span\u003e and close to \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\mathcal{F}({D}_{0},{{\\Theta\\:}}_{0}^{{\\prime\\:}})\$\u003c/span\u003e\u003c/span\u003e. Noted, here BP only searches \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\mathcal{F}\\mathcal{{\\prime\\:}}\$\u003c/span\u003e\u003c/span\u003e only based on the flux rates \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\mathcal{F}({D}_{0},{{\\Theta\\:}}_{0})\$\u003c/span\u003e\u003c/span\u003e and does not rely on \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{D}_{0}\$\u003c/span\u003e\u003c/span\u003e. \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\mathcal{F}\\mathcal{{\\prime\\:}}\$\u003c/span\u003e\u003c/span\u003e can be considered as projecting the values of \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\mathcal{F}({D}_{0},{{\\Theta\\:}}_{0})\$\u003c/span\u003e\u003c/span\u003e to the solution space that minimizes the flux balance term.\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003e \u003cb\u003eStep 3 (Supervised Learning)\u003c/b\u003e: Then \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\mathcal{F}\\mathcal{{\\prime\\:}}\$\u003c/span\u003e\u003c/span\u003e could serve as known labels of flux rate of each reaction. \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\mathcal{F}\$\u003c/span\u003e\u003c/span\u003e and \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{{\\Theta\\:}}^{{\\prime\\:}}\$\u003c/span\u003e\u003c/span\u003e could be further updated by a supervised fitting of \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\mathcal{F}({D}_{0},{{\\Theta\\:}}^{{\\prime\\:}})\$\u003c/span\u003e\u003c/span\u003e to \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\mathcal{F}\\mathcal{{\\prime\\:}}\$\u003c/span\u003e\u003c/span\u003e for all data points \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{D}_{0}\$\u003c/span\u003e\u003c/span\u003e. Under default setting, \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\mathcal{F}\\left({D}_{0},{{\\Theta\\:}}^{{\\prime\\:}}\\right)\$\u003c/span\u003e\u003c/span\u003e will be trained as a fully connected deep neural network to predict \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\mathcal{F}\\mathcal{{\\prime\\:}}\$\u003c/span\u003e\u003c/span\u003e and \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{L}_{1}\$\u003c/span\u003e\u003c/span\u003e penalty will be used for \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:L(\\mathcal{F},{\\Theta\\:}{\\prime\\:})\$\u003c/span\u003e\u003c/span\u003e.\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003c/ol\u003e \u003cdiv class=\"BlockQuote\"\u003e \u003cp\u003eStep 2 and 3 could be iteratively conducted to iteratively optimize \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\mathcal{F}\\mathcal{{\\prime\\:}}\$\u003c/span\u003e\u003c/span\u003e and \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\{\\mathcal{F},{{\\Theta\\:}}^{{\\prime\\:}}\\}\$\u003c/span\u003e\u003c/span\u003e. It is noteworthy that BP can effectively handle linear constraints such as flux balance under quasi-steady state. By using this approach, the two terms could be iteratively and effectively handled, and the \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{L}_{p}\\left(\\mathcal{F},{\\Theta\\:}{\\prime\\:}\\right)\\:\$\u003c/span\u003e\u003c/span\u003e can be directly minimized in the supervised learning form in Step 3.\u003c/p\u003e \u003c/div\u003e \u003cp\u003e\u003c/p\u003e \u003cdiv id=\"Sec8\" class=\"Section2\"\u003e \u003ch2\u003eIn-silico perturbation analysis to evaluate the impact of each gene\u003c/h2\u003e \u003cp\u003eTo study which gene contributes most to MHC class II antigen presentation pathways in APCs, we conduct an in-silico perturbation analysis. The first order partial derivative of each gene in the PSAA model was computed for each sample. We first got the absolute value of the first order partial derivative and then used log transformation to make sure that all the gradients are positive values. After that, we performed principal component analysis (PCA) in a gradient matrix and selected the top 6 PCs to represent the importance of each gene. Then we used k-means clustering to cluster the importance matrix into three clusters as elaborated in the main text. Finally, we ranked genes based on their importance within each cluster and identified the top 10 genes in each cluster for further analysis.\u003c/p\u003e \u003c/div\u003e\n\u003ch3\u003eIdentification of gene targets to improve the antigen presentation level\u003c/h3\u003e\n\u003cp\u003eTo identify the genes that could be targeted to improve the presentation level of MHC-I complex on the surface of a cell, we evaluate if adding a certain gene into the “MHC-I complex salvage” module could significantly increase the balance of the fitting. Specifically, we performed a paired Wald test using TCGA data of TNBC and other cancer types. We define the flux imbalance loss with target gene as \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{L}_{X}=({l}_{1}^{X},{l}_{2}^{X}\\dots\\:{l}_{n}^{X})\$\u003c/span\u003e\u003c/span\u003e and the flux imbalance loss without target gene as \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{L}_{Y}=({l}_{1}^{Y},{l}_{2}^{Y}\\dots\\:{l}_{n}^{Y})\$\u003c/span\u003e\u003c/span\u003e. The null hypothesis test is \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{H}_{0}\$\u003c/span\u003e\u003c/span\u003e: \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\mu\\:}_{X}-{\\mu\\:}_{Y}=0\$\u003c/span\u003e\u003c/span\u003e. The alternative hypothesis is \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{H}_{a}\$\u003c/span\u003e\u003c/span\u003e: \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\mu\\:}_{X}-{\\mu\\:}_{Y}\u0026lt;0\$\u003c/span\u003e\u003c/span\u003e. We tested a list of target genes and reported their p-values in \u003cb\u003eSupplementary Table S6\u003c/b\u003e. The CRISPR knock experiment of the genes with most significant p-values were conducted on a TNBC cell line system as detailed below.\u003c/p\u003e\n\u003ch3\u003eCell culture and Generation of stable knockdown cell lines of the CRISPR knockdown experiment\u003c/h3\u003e\n\u003cp\u003eThe mouse breast cancer cell line with endogenous expression of ovalbumin, EO771-OVA, was maintained in DMEM medium supplemented with 10% fetal bovine serum and 1% penicillin/streptomycin at 37°C with 5% CO2. For the generation of stable shRNA knockdown cell lines, shRNA clone sets targeting different mouse genes were purchased from Sigma and transduced into EO771-OVA cells via lentivirus, followed by 2 µg/ml puromycin selection for 5 days. qPCR was used to examine the knockdown efficacy.\u003c/p\u003e \u003cdiv id=\"Sec11\" class=\"Section2\"\u003e \u003ch2\u003eAntigen presentation and cytotoxicity assay to validate the predicted genes\u003c/h2\u003e \u003cp\u003eAntigen presentation levels on EO771-OVA and knockdown cell lines were examined using APC-conjugated anti-mouse H-2Kb bound to SIINFEKL antibody (BioLegend, Dilution 1:50) and evaluated by flow cytometry. EO771-WT cells were used as isotype control for gating strategy.\u003c/p\u003e \u003cp\u003eTo assess the CD8 + T cell cytotoxicity against EO771-OVA cell lines with gene knockdown, CD8 + T cells were isolated from the splenocytes of OT-I mice and stimulated with CD3/CD28 dynabeads (Gibco™ #11131D) in the presence of 5 ng/mL IL-2 for 2 days. EO771-OVA cells were stained with IncuCyte Cytolight Rapid Red (Sartorius, #4705) and seeded in 96-well plate (5 x 10\u003csup\u003e3\u003c/sup\u003e cell/well). OT-1 T cells were co-cultured with EO771-OVA cells at effector/tumor-cell ratio of 3:1. The 16h time point was used for final readout.\u003c/p\u003e \u003cp\u003e \u003cb\u003eSignal amplification of spatial transcriptomics data.\u003c/b\u003e \u003c/p\u003e \u003cp\u003eTo reduce the discontinuities inherent caused by the low signal level, we employed a Gaussian smoothing filter to amplify the spatial dependent signals in SRT data, such as the predicted activity level of antigen presentations or T cell abundance. Specifically, the following smoothing model was used to amplify spatial dependent signals:\u003c/p\u003e\u003cdiv id=\"Equc\" class=\"Equation\"\u003e\u003cdiv format=\"TEX\" class=\"mathdisplay\" id=\"FileID_Equc\" name=\"EquationSource\"\u003e\n$$\\:{f}_{i}^{amp}=\\sum\\:_{j\\in\\:N\\left(i\\right)}g\\left(d\\left(i,j\\right)\\right)*{f}_{i}$$\u003c/div\u003e\u003c/div\u003e\u003cp\u003e\u003c/p\u003e \u003cp\u003e, where \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:g\\left(x\\right)=\\frac{1}{\\sigma\\:\\surd\\:2\\pi\\:}{e}^{-\\frac{{\\left(x-\\mu\\:\\right)}^{2}}{2{\\sigma\\:}^{2}}}\$\u003c/span\u003e\u003c/span\u003e is a gaussian kernel and \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{f}_{i}\$\u003c/span\u003e\u003c/span\u003e represents the PSAA predicted antigen presentation level or original T cell expression level of spot \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:i\$\u003c/span\u003e\u003c/span\u003e. \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:N\\left(i\\right)\$\u003c/span\u003e\u003c/span\u003e is the neighborhood of \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:i\$\u003c/span\u003e\u003c/span\u003e defined as the spatial spots whose distance to \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:i\$\u003c/span\u003e\u003c/span\u003e are smaller than a certain threshold, and \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:d(i,j)\$\u003c/span\u003e\u003c/span\u003e is the Euclidean distance between two spatial spots \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:i\$\u003c/span\u003e\u003c/span\u003e and \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:j\$\u003c/span\u003e\u003c/span\u003e, and \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{f}_{i}^{amp}\$\u003c/span\u003e\u003c/span\u003e is the amplified signal of spot \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:i\$\u003c/span\u003e\u003c/span\u003e.\u003c/p\u003e \u003cp\u003e \u003cb\u003eIdentify spatial regions of varied dependency between antigen presentation and T cell infiltration.\u003c/b\u003e \u003c/p\u003e \u003cp\u003eTo evaluate the spatial dependency between antigen presentation and T cell infiltrations and identify the spatial regions show varied dependencies, we computed local bivariate Moran’s I correlation\u003csup\u003e\u003cspan citationid=\"CR53\" class=\"CitationRef\"\u003e53\u003c/span\u003e\u003c/sup\u003e for each spatial spot \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:i\$\u003c/span\u003e\u003c/span\u003e:\u003c/p\u003e\u003cdiv id=\"Equd\" class=\"Equation\"\u003e\u003cdiv format=\"TEX\" class=\"mathdisplay\" id=\"FileID_Equd\" name=\"EquationSource\"\u003e\n$$\\:{I}_{i}=\\frac{{\\sum\\:}_{j}{w}_{ij}{y}_{j}\\times\\:{x}_{i}}{{\\sum\\:}_{i}{x}_{i}^{2}}$$\u003c/div\u003e\u003c/div\u003e\u003cp\u003e\u003c/p\u003e \u003cp\u003e, where \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{x}_{i}\$\u003c/span\u003e\u003c/span\u003e represents the normalized antigen presentation level after amplification at spot \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:i\$\u003c/span\u003e\u003c/span\u003e, \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{y}_{j}\$\u003c/span\u003e\u003c/span\u003e represents the normalized T cell level after amplification at spot \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:j\$\u003c/span\u003e\u003c/span\u003e, and \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{w}_{ij}\$\u003c/span\u003e\u003c/span\u003e is a weight indexing location of spot \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:i\$\u003c/span\u003e\u003c/span\u003e relative to \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:j\$\u003c/span\u003e\u003c/span\u003e. In this study, we used the gaussian kernel distance to compute \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{w}_{ij}\$\u003c/span\u003e\u003c/span\u003e.\u003c/p\u003e \u003cp\u003eFor each spot \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:i\$\u003c/span\u003e\u003c/span\u003e, \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{I}_{i}\$\u003c/span\u003e\u003c/span\u003e measures the correlation between its antigen presentation level (\u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{x}_{i}\$\u003c/span\u003e\u003c/span\u003e) with the level of T cell abundance (\u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{y}_{i}\$\u003c/span\u003e\u003c/span\u003e) in its neighborhood (\u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{\\sum\\:}_{j\\in\\:N\\left(i\\right)}{w}_{ij}{y}_{j}\$\u003c/span\u003e\u003c/span\u003e). The significance of \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{I}_{i}\$\u003c/span\u003e\u003c/span\u003e was evaluated using a permutation test, where the \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{y}_{}\$\u003c/span\u003e\u003c/span\u003e values were randomly permuted across all spots. A pseudo p-value was then calculated by determining the proportion of Local Moran's I values generated from permutations that are greater than or equal to the observed Local Moran's I values from original data. We set significant level \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:\\alpha\\:=0.05\$\u003c/span\u003e\u003c/span\u003e and the spots with pseudo p-value \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:p\u0026lt;0.05\$\u003c/span\u003e\u003c/span\u003e were considered statistically significant, indicating significant a spatial dependence between antigen presentation level and T cell infiltration level.\u003c/p\u003e \u003cp\u003eThen we segment the regions of significant dependencies into four regions based on the level of antigen presentation and T cell signals – “High antigen, High T cell”, “High antigen, Low T cell”, “Low antigen, High T cell” and “Low antigen, Low T cell”. To conduct this segmentation, we first normalize the antigen presentation level of each spot by computing the z-score of \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{x}_{i}\$\u003c/span\u003e\u003c/span\u003e, denoted as \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{z(x}_{i})\$\u003c/span\u003e\u003c/span\u003e, and the T cell infiltration level of its neighbors by the weighted average z-score of \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{y}_{j}\$\u003c/span\u003e\u003c/span\u003e, denoted as \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{{w}_{ij}z(y}_{j})\$\u003c/span\u003e\u003c/span\u003e. The “High” or “Low” antigen and “High” or “Low” T cell regions were identified by positive or negative \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{z(x}_{i})\$\u003c/span\u003e\u003c/span\u003e and positive or negative \u003cspan class=\"InlineEquation\"\u003e\u003cspan class=\"mathinline\"\u003e\$\\:{{w}_{ij}z(y}_{j})\$\u003c/span\u003e\u003c/span\u003e), respectively.\u003c/p\u003e \u003c/div\u003e"},{"header":"Declarations","content":"\n\u003ch3\u003eACKNOWLEDGEMENTS AND FUNDING SUPPORTS\u003c/h3\u003e\n\u003cp\u003eThe project was supported by NIGMS 1R35GM150971 (CZ), R35GM155028 (CS), NLM 1R01LM014720 (CZ), NSF DBI-2047631 (CZ), NSF-IIS- 2145314 (CS), American Cancer Society RSG-22-062-01-MM (CZ), RSG-24-1321371 (CS). The generation of the \u003cem\u003eH. ducreyi\u003c/em\u003e data sets were supported by R01AI137116 from the National Institute of Allergy and Infectious Diseases to S.M.S.\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\u003cli\u003e\u003cspan\u003eJhunjhunwala S, Hammer C, Delamarre L. Antigen presentation in cancer: insights into tumour immunogenicity and immune evasion. Nat Rev Cancer. 2021;21(5):298\u0026ndash;312. Epub 20210309. doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1038/s41568-021-00339-z\u003c/span\u003e\u003cspan address=\"10.1038/s41568-021-00339-z\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 33750922.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eDhatchinamoorthy K, Colbert JD, Rock KL. Cancer Immune Evasion Through Loss of MHC Class I Antigen Presentation. Front Immunol. 2021;12:636568. Epub 20210309. doi: 10.3389/fimmu.2021.636568. PubMed PMID: 33767702; PMCID: PMC7986854.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eVinay DS, Ryan EP, Pawelec G, Talib WH, Stagg J, Elkord E, Lichtor T, Decker WK, Whelan RL, Kumara H, Signori E, Honoki K, Georgakilas AG, Amin A, Helferich WG, Boosani CS, Guha G, Ciriolo MR, Chen S, Mohammed SI, Azmi AS, Keith WN, Bilsland A, Bhakta D, Halicka D, Fujii H, Aquilano K, Ashraf SS, Nowsheen S, Yang X, Choi BK, Kwon BS. Immune evasion in cancer: Mechanistic basis and therapeutic strategies. Semin Cancer Biol. 2015;35 Suppl:S185-S98. Epub 20150325. doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1016/j.semcancer.2015.03.004\u003c/span\u003e\u003cspan address=\"10.1016/j.semcancer.2015.03.004\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 25818339.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMellman I, Coukos G, Dranoff G. Cancer immunotherapy comes of age. Nature. 2011;480(7378):480\u0026ndash;9. Epub 20111221. doi: 10.1038/nature10673. PubMed PMID: 22193102; PMCID: PMC3967235.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSchuster M, Nechansky A, Kircheis R. Cancer immunotherapy. Biotechnol J. 2006;1(2):138\u0026ndash;47. doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1002/biot.200500044\u003c/span\u003e\u003cspan address=\"10.1002/biot.200500044\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 16892244.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eEsfahani K, Roudaia L, Buhlaiga N, Del Rincon SV, Papneja N, Miller WH, Jr. A review of cancer immunotherapy: from the past, to the present, to the future. Curr Oncol. 2020;27(Suppl 2):S87-S97. Epub 20200401. doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.3747/co.27.5223\u003c/span\u003e\u003cspan address=\"10.3747/co.27.5223\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 32368178; PMCID: PMC7194005.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBlum JS, Wearsch PA, Cresswell P. Pathways of antigen processing. Annu Rev Immunol. 2013;31:443\u0026ndash;73. Epub 20130103. doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1146/annurev-immunol-032712-095910\u003c/span\u003e\u003cspan address=\"10.1146/annurev-immunol-032712-095910\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 23298205; PMCID: PMC4026165.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLeone P, Shin EC, Perosa F, Vacca A, Dammacco F, Racanelli V. MHC class I antigen processing and presenting machinery: organization, function, and defects in tumor cells. J Natl Cancer Inst. 2013;105(16):1172\u0026ndash;87. Epub 20130712. doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1093/jnci/djt184\u003c/span\u003e\u003cspan address=\"10.1093/jnci/djt184\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 23852952.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLee MY, Jeon JW, Sievers C, Allen CT. Antigen processing and presentation in cancer immunotherapy. J Immunother Cancer. 2020;8(2). doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1136/jitc-2020-001111\u003c/span\u003e\u003cspan address=\"10.1136/jitc-2020-001111\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 32859742; PMCID: PMC7454179.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eNeefjes J, Jongsma ML, Paul P, Bakke O. Towards a systems understanding of MHC class I and MHC class II antigen presentation. Nature reviews immunology. 2011;11(12):823\u0026ndash;36.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHolling TM, Schooten E, van Den Elsen PJ. Function and regulation of MHC class II molecules in T-lymphocytes: of mice and men. Human immunology. 2004;65(4):282\u0026ndash;90.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBolon B. Cellular and molecular mechanisms of autoimmune disease. Toxicologic pathology. 2012;40(2):216\u0026ndash;29.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBrodsky FM, Lem L, Solache A, Bennett EM. Human pathogen subversion of antigen presentation. Immunological reviews. 1999;168(1):199\u0026ndash;215.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eThibodeau J, Bourgeois-Daigneault M-C, Lapointe R. Targeting the MHC Class II antigen presentation pathway in cancer immunotherapy. Oncoimmunology. 2012;1(6):908\u0026ndash;16.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLoschko J, Schreiber HA, Rieke GJ, Esterh\u0026aacute;zy D, Meredith MM, Pedicord VA, Yao K-H, Caballero S, Pamer EG, Mucida D. Absence of MHC class II on cDCs results in microbial-dependent intestinal inflammation. Journal of experimental medicine. 2016;213(4):517\u0026ndash;34.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePishesha N, Harmand TJ, Ploegh HL. A guide to antigen processing and presentation. Nature Reviews Immunology. 2022;22(12):751\u0026ndash;64.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLagergren JH, Nardini JT, Baker RE, Simpson MJ, Flores KB. Biologically-informed neural networks guide mechanistic modeling from sparse experimental data. PLoS computational biology. 2020;16(12):e1008462.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eRaissi M, Perdikaris P, Karniadakis GE. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. Journal of Computational physics. 2019;378:686\u0026ndash;707.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28(1):27\u0026ndash;30. doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1093/nar/28.1.27\u003c/span\u003e\u003cspan address=\"10.1093/nar/28.1.27\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 10592173; PMCID: PMC102409.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eFabregat A, Jupe S, Matthews L, Sidiropoulos K, Gillespie M, Garapati P, Haw R, Jassal B, Korninger F, May B. The reactome pathway knowledgebase. Nucleic acids research. 2018;46(D1):D649-D55.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eConsortium GO. The Gene Ontology (GO) database and informatics resource. Nucleic acids research. 2004;32(suppl_1):D258-D61.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMontealegre S, Van Endert PM. Endocytic recycling of MHC class I molecules in non-professional antigen presenting and dendritic cells. Frontiers in immunology. 2019;9:3098.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLindenbergh MF, Stoorvogel W. Antigen presentation by extracellular vesicles from professional antigen-presenting cells. Annual review of immunology. 2018;36(1):435\u0026ndash;59.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eRoche PA, Furuta K. The ins and outs of MHC class II-mediated antigen processing and presentation. Nature Reviews Immunology. 2015;15(4):203\u0026ndash;16.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eDamiani C, Maspero D, Di Filippo M, Colombo R, Pescini D, Graudenzi A, Westerhoff HV, Alberghina L, Vanoni M, Mauri G. Integration of single-cell RNA-seq data into population models to characterize cancer metabolism. PLoS computational biology. 2019;15(2):e1006733.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eZhang Y, Kim MS, Nguyen E, Taylor DM. Modeling metabolic variation with single-cell expression data. bioRxiv. 2020:2020.01. 28.923680.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eAlghamdi N, Chang W, Dang P, Lu X, Wan C, Gampala S, Huang Z, Wang J, Ma Q, Zang Y, Fishel M, Cao S, Zhang C. A graph neural network model to estimate cell-wise metabolic flux using single-cell RNA-seq data. Genome Res. 2021;31(10):1867\u0026ndash;84. Epub 20210722. doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1101/gr.271205.120\u003c/span\u003e\u003cspan address=\"10.1101/gr.271205.120\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 34301623; PMCID: PMC8494226.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eFrangieh CJ, Melms JC, Thakore PI, Geiger-Schuller KR, Ho P, Luoma AM, Cleary B, Jerby-Arnon L, Malu S, Cuoco MS. Multimodal pooled Perturb-CITE-seq screens in patient models define mechanisms of cancer immune evasion. Nature genetics. 2021;53(3):332\u0026ndash;41.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eXu Z, Heidrich-O\u0026rsquo;Hare E, Chen W, Duerr RH. Comprehensive benchmarking of CITE-seq versus DOGMA-seq single cell multimodal omics. Genome Biology. 2022;23(1):135.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eChen CY, Mertz KJ, Spinola SM, Morse SA. Comparison of enzyme immunoassays for antibodies to Haemophilus ducreyi in a community outbreak of chancroid in the United States. J Infect Dis. 1997;175(6):1390\u0026ndash;5. doi: 10.1086/516471. PubMed PMID: 9180178.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eChang W, Dang P, Wan C, Lu X, Fang Y, Zhao T, Zang Y, Li B, Zhang C, Cao S, editors. Spatially and Robustly Hybrid Mixture Regression Model for Inference of Spatial Dependence. 2021 IEEE International Conference on Data Mining (ICDM); 2021: IEEE.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMartinez-Bosch N, Vinaixa J, Navarro P. Immune Evasion in Pancreatic Cancer: From Mechanisms to Therapy. Cancers (Basel). 2018;10(1). Epub 20180103. doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.3390/cancers10010006\u003c/span\u003e\u003cspan address=\"10.3390/cancers10010006\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 29301364; PMCID: PMC5789356.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKabacaoglu D, Ciecielski KJ, Ruess DA, Algul H. Immune Checkpoint Inhibition for Pancreatic Ductal Adenocarcinoma: Current Limitations and Future Options. Front Immunol. 2018;9:1878. Epub 20180815. doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.3389/fimmu.2018.01878\u003c/span\u003e\u003cspan address=\"10.3389/fimmu.2018.01878\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 30158932; PMCID: PMC6104627.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSun Q, Hong Z, Zhang C, Wang L, Han Z, Ma D. Immune checkpoint therapy for solid tumours: clinical dilemmas and future trends. Signal Transduct Target Ther. 2023;8(1):320. Epub 20230828. doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1038/s41392-023-01522-4\u003c/span\u003e\u003cspan address=\"10.1038/s41392-023-01522-4\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 37635168; PMCID: PMC10460796.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePei L, Liu Y, Liu L, Gao S, Gao X, Feng Y, Sun Z, Zhang Y, Wang C. Roles of cancer-associated fibroblasts (CAFs) in anti- PD-1/PD-L1 immunotherapy for solid cancers. Mol Cancer. 2023;22(1):29. Epub 20230210. doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1186/s12943-023-01731-z\u003c/span\u003e\u003cspan address=\"10.1186/s12943-023-01731-z\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 36759842; PMCID: PMC9912573.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLu C, Liu Z, Klement JD, Yang D, Merting AD, Poschel D, Albers T, Waller JL, Shi H, Liu K. WDR5-H3K4me3 epigenetic axis regulates OPN expression to compensate PD-L1 function to promote pancreatic cancer immune escape. J Immunother Cancer. 2021;9(7). doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1136/jitc-2021-002624\u003c/span\u003e\u003cspan address=\"10.1136/jitc-2021-002624\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 34326167; PMCID: PMC8323468.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eFang Y, Wang L, Wan C, Sun Y, Van der Jeught K, Zhou Z, Dong T, So KM, Yu T, Li Y, Eyvani H, Colter AB, Dong E, Cao S, Wang J, Schneider BP, Sandusky GE, Liu Y, Zhang C, Lu X, Zhang X. MAL2 drives immune evasion in breast cancer by suppressing tumor antigen presentation. J Clin Invest. 2021;131(1). doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1172/JCI140837\u003c/span\u003e\u003cspan address=\"10.1172/JCI140837\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 32990678; PMCID: PMC7773365.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eRiaz N, Havel JJ, Makarov V, Desrichard A, Urba WJ, Sims JS, Hodi FS, Mart\u0026iacute;n-Algarra S, Mandal R, Sharfman WH. Tumor and microenvironment evolution during immunotherapy with nivolumab. Cell. 2017;171(4):934\u0026ndash;49. e16.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eAuslander N, Zhang G, Lee JS, Frederick DT, Miao B, Moll T, Tian T, Wei Z, Madan S, Sullivan RJ. Robust prediction of response to immune checkpoint blockade therapy in metastatic melanoma. Nature medicine. 2018;24(10):1545\u0026ndash;9.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCho J-W, Hong MH, Ha S-J, Kim Y-J, Cho BC, Lee I, Kim HR. Genome-wide identification of differentially methylated promoters and enhancers associated with response to anti-PD-1 therapy in non-small cell lung cancer. Experimental \u0026amp; molecular medicine. 2020;52(9):1550\u0026ndash;63.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePhillips D, Matusiak M, Gutierrez BR, Bhate SS, Barlow GL, Jiang S, Demeter J, Smythe KS, Pierce RH, Fling SP. Immune cell topography predicts response to PD-1 blockade in cutaneous T cell lymphoma. Nature Communications. 2021;12(1):6726.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCollawn JF, Benveniste EN. Regulation of MHC class II expression in the central nervous system. Microbes and infection. 1999;1(11):893\u0026ndash;902.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKorn T, Kallies A. T cell responses in the central nervous system. Nature Reviews Immunology. 2017;17(3):179\u0026ndash;94.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eAloisi F. The role of microglia and astrocytes in CNS immune surveillance and immunopathology. The Functional Roles of Glial Cells in Health and Disease: Dialogue between Glia and Neurons. 1999:123\u0026thinsp;\u0026ndash;\u0026thinsp;33.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eNott A, Holtman IR. Genetic insights into immune mechanisms of Alzheimer\u0026rsquo;s and Parkinson\u0026rsquo;s disease. Frontiers in immunology. 2023;14:1168539.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBachiller S, Jim\u0026eacute;nez-Ferrer I, Paulus A, Yang Y, Swanberg M, Deierborg T, Boza-Serrano A. Microglia in neurological diseases: a road map to brain-disease dependent-inflammatory response. Frontiers in cellular neuroscience. 2018;12:488.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKitano H. Systems biology: a brief overview. Science. 2002;295(5560):1662-4. doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1126/science.1069492\u003c/span\u003e\u003cspan address=\"10.1126/science.1069492\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 11872829.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKamimoto K, Stringa B, Hoffmann CM, Jindal K, Solnica-Krezel L, Morris SA. Dissecting cell identity via network inference and in silico gene perturbation. Nature. 2023;614(7949):742\u0026ndash;51. Epub 20230208. doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1038/s41586-022-05688-9\u003c/span\u003e\u003cspan address=\"10.1038/s41586-022-05688-9\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 36755098; PMCID: PMC9946838.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eEissing T, Kuepfer L, Becker C, Block M, Coboeken K, Gaub T, Goerlitz L, Jaeger J, Loosen R, Ludewig B, Meyer M, Niederalt C, Sevestre M, Siegmund HU, Solodenko J, Thelen K, Telle U, Weiss W, Wendl T, Willmann S, Lippert J. A computational systems biology software platform for multiscale modeling and simulation: integrating whole-body physiology, disease biology, and molecular reaction networks. Front Physiol. 2011;2:4. Epub 20110224. doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.3389/fphys.2011.00004\u003c/span\u003e\u003cspan address=\"10.3389/fphys.2011.00004\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 21483730; PMCID: PMC3070480.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKonur S, Mierla L, Fellermann H, Ladroue C, Brown B, Wipat A, Twycross J, Dun BP, Kalvala S, Gheorghe M, Krasnogor N. Toward Full-Stack In Silico Synthetic Biology: Integrating Model Specification, Simulation, Verification, and Biological Compilation. ACS Synth Biol. 2021;10(8):1931\u0026ndash;45. Epub 20210802. doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1021/acssynbio.1c00143\u003c/span\u003e\u003cspan address=\"10.1021/acssynbio.1c00143\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 34339602.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKeller R, Dorr A, Tabira A, Funahashi A, Ziller MJ, Adams R, Rodriguez N, Le Novere N, Hiroi N, Planatscher H, Zell A, Drager A. The systems biology simulation core algorithm. BMC Syst Biol. 2013;7:55. Epub 20130705. doi: \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003e10.1186/1752-0509-7-55\u003c/span\u003e\u003cspan address=\"10.1186/1752-0509-7-55\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. PubMed PMID: 23826941; PMCID: PMC3707837.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSatorras VG, Welling M, editors. Neural enhanced belief propagation on factor graphs. International Conference on Artificial Intelligence and Statistics; 2021: PMLR.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLee S-I. Developing a bivariate spatial association measure: an integration of Pearson's r and Moran's I. Journal of geographical systems. 2001;3:369\u0026ndash;85.\u003c/span\u003e\u003c/li\u003e\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":true,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"nature-portfolio","isNatureJournal":true,"hasQc":false,"allowDirectSubmit":false,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"","title":"Nature Portfolio","twitterHandle":"","acdcEnabled":false,"dfaEnabled":false,"editorialSystem":"ejp","reportingPortfolio":"","inReviewEnabled":true,"inReviewRevisionsEnabled":false},"keywords":"","lastPublishedDoi":"10.21203/rs.3.rs-5629379/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-5629379/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eAntigen processing and presentation via major histocompatibility complex (MHC) molecules are central to immune surveillance. Yet, quantifying the dynamic activity of MHC class I and II antigen presentation remains a critical challenge, particularly in diseases like cancer, infection and autoimmunity where these pathways are often disrupted. Current methods fall short in providing precise, sample-specific insights into antigen presentation, limiting our understanding of immune evasion and therapeutic responses. Here, we present PSAA (PINN-empowered Systems Biology Analysis of Antigen Presentation Activity), which is designed to estimate sample-wise MHC class I and class II antigen presentation activity using bulk, single-cell, and spatially resolved transcriptomics or proteomics data. By reconstructing MHC pathways and employing pathway flux estimation, PSAA offers a detailed, stepwise quantification of MHC pathway activity, enabling predictions of gene-specific impacts and their downstream effects on immune interactions. Benchmarked across diverse omics datasets and experimental validations, PSAA demonstrates a robust prediction accuracy and utility across various disease contexts. In conclusion, PSAA and its downstream functions provide a comprehensive framework for analyzing the dynamics of MHC antigen presentation using omics data. By linking antigen presentation to immune cell activity and clinical outcomes, PSAA both elucidates key mechanisms driving disease progression and uncovers potential therapeutic targets.\u003c/p\u003e","manuscriptTitle":"A physics informed neural network approach to quantify antigen presentation activities at single cell level using omics data","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2025-01-17 04:44:11","doi":"10.21203/rs.3.rs-5629379/v1","editorialEvents":[],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"nature-communications","isNatureJournal":true,"hasQc":false,"allowDirectSubmit":false,"externalIdentity":"NCOMMS","sideBox":"Learn more about [Nature Communications](http://www.nature.com/ncomms/)","snPcode":"","submissionUrl":"https://mts-ncomms.nature.com/","title":"Nature Communications","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"ejp","reportingPortfolio":"Nature Communications","inReviewEnabled":true,"inReviewRevisionsEnabled":false}}],"origin":"","ownerIdentity":"a9b105bd-ceab-450e-bc89-1c8e62536996","owner":[],"postedDate":"January 17th, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"under-review","subjectAreas":[{"id":41792098,"name":"Biological sciences/Computational biology and bioinformatics/Machine learning"},{"id":41792099,"name":"Biological sciences/Computational biology and bioinformatics/Computational models"},{"id":41792100,"name":"Biological sciences/Immunology/Antigen processing and presentation"},{"id":41792101,"name":"Biological sciences/Cancer/Cancer microenvironment"}],"tags":[],"updatedAt":"2025-01-17T04:44:11+00:00","versionOfRecord":[],"versionCreatedAt":"2025-01-17 04:44:11","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-5629379","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-5629379","identity":"rs-5629379","version":["v1"]},"buildId":"8U1c8b4HqxoKbykW_rLl7","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

⚙ Ask this paper AI returns verbatim quotes from the full text · source: preprint-html ⓘ

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc: last seen: 2026-05-20T01:45:00.602351+00:00
unpaywall: last seen: 2026-05-23T02:00:01.238055+00:00

License: CC-BY-4.0