Multi-sample, multi-platform isoform quantification using empirical Bayes

doi:10.1101/2025.02.08.637184

Multi-sample, multi-platform isoform quantification using empirical Bayes

2025 · doi:10.1101/2025.02.08.637184

preprint OA: closed

📄 Open PDF Full text JSON View at publisher

⚙ AI-generated deep summary by claude@2026-06, 2026-06-24 · read from full text ⓘ

This paper introduces JOLI, a hierarchical empirical Bayes model for quantifying RNA isoform abundances by jointly integrating short-read (SR) and long-read (LR) sequencing data across multiple samples. The authors evaluate JOLI on simulated and real RNA-seq datasets, finding that multi-sample learning improves accuracy and reproducibility, particularly for low- to moderate-abundance isoforms, by capturing shared transcript structure and correcting systematic biases. A stated limitation is that read-to-transcript ambiguity and platform-specific challenges motivate the method’s dependence on having both SR and LR data and multiple samples to realize its benefits. This paper does not explicitly discuss endometriosis or adenomyosis; it was included in the corpus via a keyword match in the upstream search index.

Read from the paper's body, not the abstract. Not a substitute for reading the paper. No clinical advice. How this works

Full text 2,409 characters · extracted from oa-doi-fallback · click to expand

Abstract Accurate quantification of RNA isoform abundance is crucial for understanding gene regulation, cellular behavior, and disease mechanisms. While short-read (SR) sequencing provides high-throughput and cost-effective transcript quantification, it suffers from read-to-transcript ambiguity. Long-read (LR) sequencing reduces this ambiguity but faces challenges such as high error rates, biases, and lower throughput. Existing methods rely on either SR or LR data and operate on single or merged samples, failing to leverage the variability across multiple samples and the complementary strengths of both technologies. As a result, they struggle to accurately quantify low-abundance and moderate-expressed isoforms and often require complex models for sample-specific bias correction. To address these limitations, we introduce JOLI, a hierarchical model that leverages multi-sample learning to enhance transcript quantification by jointly integrating SR and LR sequencing data. By incorporating multi-sample learning, JOLI captures shared transcript structures, corrects for systematic biases, and enhances statistical power, particularly for low- and moderate-abundance isoforms. Our model applies an empirical Bayes framework, learning a shared prior across samples to improve inference consistency. By jointly modeling SR and LR data, it integrates the strengths of both technologies, achieving higher accuracy and reproducibility in transcript quantification. Through benchmarking on simulated and real RNA-seq datasets, we show that JOLI consistently outperforms single-sample EM method by improving ranking consistency, proportional agreement, and estimation accuracy while enhancing reproducibility. Specifically, in simulations, JOLI multi-sample improves Spearman correlation by 9.8% for LR and 7.7% for SR data compared to single-sample method, while for real data, the improvements are 2.56% (LR) and 1.28% (SR), respectively. Multi-sample learning further improves the quantification of isoforms with low to moderate expression levels. Furthermore, JOLI performs competitively with state-of-the-art methods, highlighting its robustness in transcript quantification. Competing Interest Statement The authors have declared no competing interest. Footnotes {at3836{at}columbia.edu,dak2173{at}columbia.edu} Funding information has been updated. No other changes have been made to the manuscript.

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

⚙ Ask this paper AI returns verbatim quotes from the full text · source: oa-doi-fallback ⓘ

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc: last seen: 2026-05-20T01:45:00.602351+00:00