Towards Cross-Sample Alignment for Multi-Modal Representation Learning in Spatial Transcriptomics
preprint
OA: closed
CC-BY-4.0
Full text
2,241 characters
· extracted from
oa-html
· click to expand
Abstract
The growing number of spatial transcriptomics (ST) datasets enables comprehensive multi-modal characterization of cell types across diverse biological and clinical contexts. However, integration across patient cohorts remains challenging, as local microenvironment, patient-specific variability, and technical batch effects can dominate signals. Here, we hypothesize that combining specialized transcriptomics correction methods with deep representation learning can jointly align morphology, transcriptomics, and spatial information across multiple tissue samples. This approach benefits from recent transcriptomics and pathology foundation models, projecting cells into a shared embedding space where they cluster by cell type rather than dataset-specific conditions. Applying this framework to 18 skin melanoma, 12 human brain, and 4 lung cancer datasets, we demonstrate that it outperforms conventional batch-correction approaches by 58%, 38%, and 2-fold, respectively. Together, this framework enables efficient integration of multi-modal ST data across modalities and samples, facilitating the systematic discovery of conserved cellular programs and spatial niches while remaining robust to cohort-specific batch effects.
Code availability https://github.com/ratschlab/aestetik
Competing Interest Statement
V.H.K reports being an invited speaker for Sharing Progress in Cancer Care (SPCC) and Indica Labs; advisory board of Takeda; and sponsored research agreements with Roche and IAG, all unrelated to the current study. V.H.K. is a participant in a patent application on the assessment of cancer immunotherapy biomarkers by digital pathology; a patent application on multimodal deep learning for the prediction of recurrence risk in cancer patients, and a patent application on predicting the efficacy of cancer treatment using deep learning all unrelated to the current work. GR is a participant in a patent application on matching cells from different measurement modalities which is not directly related to the current work. Moreover, G.R. is a cofounder of Computomics GmbH, Germany, and one of its shareholders.
Footnotes
jusdai{at}ethz.ch, kalin.nonchev{at}inf.ethz.ch, viktor.koelzer{at}usb.ch, raetsch{at}inf.ethz.ch
Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.
My notes (saved in your browser only)
Ask this paper
Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works
Funding
- funders
- [{'doi': None, 'name': None, 'awards': ['220127']}, {'doi': None, 'name': None, 'awards': ['201656']}]
Citation neighborhood (sparse)
Too few in-corpus citations on either side for a chart; here are the lists.
Cites (3)
References (28)
- CancerFoundation: A single-cell RNA sequencing foundation model to decipher drug resistance in cancer via crossref
- DeepSpot: Leveraging Spatial Context for Enhanced Spatial Transcriptomics Prediction from H&E Images via crossref
- Representation learning for multi-modal spatially resolved transcriptomics data via crossref
- doi:10.1038/s41467-025-58989-8 via crossref
- doi:10.1038/s41591-024-02857-3 via crossref
- doi:10.1038/s41592-024-02201-0 via crossref
- doi:10.1109/icassp48485.2024.10448360 via crossref
- doi:10.1016/j.ydbio.2018.02.003 via crossref
- doi:10.1038/s41592-024-02305-7 via crossref
- doi:10.1038/s41576-023-00586-w via crossref
- doi:10.1038/s41587-019-0113-3 via crossref
- doi:10.1186/s13059-024-03361-0 via crossref
- doi:10.1038/s41592-019-0619-0 via crossref
- doi:10.1109/bibm62325.2024.10822280 via crossref
- doi:10.1038/s41592-018-0229-2 via crossref
- doi:10.1038/s41592-021-01336-8 via crossref
- doi:10.1109/isbi.2009.5193250 via crossref
- doi:10.1038/s41593-020-00787-0 via crossref
- doi:10.1038/s41586-021-03634-9 via crossref
- doi:10.1126/science.aaf2403 via crossref
- doi:10.1186/s13059-017-1382-0 via crossref
- doi:10.1093/nar/gkac901 via crossref
- doi:10.1093/nar/gkad933 via crossref
- doi:10.1186/s13058-019-1242-9 via crossref
- doi:10.1038/s42003-021-02118-w via crossref
- doi:10.1038/s41587-021-00935-2 via crossref
- doi:10.1093/bioadv/vbac016 via crossref
- doi:10.1038/s41587-022-01251-z via crossref
Source provenance
- crossref
- last seen: 2026-06-24T06:27:42.533072+00:00
- europepmc
- last seen: 2026-05-20T01:45:00.602351+00:00
- unpaywall
- last seen: 2026-05-22T02:00:06.705733+00:00
License: CC-BY-4.0