A hierarchical Bayesian framework for inferring mitochondrial clonal selection from single-cell data | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Article A hierarchical Bayesian framework for inferring mitochondrial clonal selection from single-cell data Qianqian Song, Xiaobo Zhou, Aoqi Wang, Yanfei Wang, Xiaona Liu, and 3 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-8490828/v1 This work is licensed under a CC BY 4.0 License Status: Under Review Version 1 posted You are reading this latest preprint version Abstract Mitochondrial genetic heterogeneity arises from the accumulation of somatic mitochondrial DNA (mtDNA) mutations within individual cells, generating intracellular clonal populations whose selective dynamics in disease remain poorly characterized. Here, we present MitoBayes, a hierarchical Bayesian framework that jointly models mitochondrial clonal lineage structure, allele frequency variation, and single-cell disease-relevant phenotypic burdens to infer clone-specific selection pressures. Extensive benchmarking demonstrates that MitoBayes accurately recovers ground-truth selection coefficients across a wide range of genetic heterogeneity, data sparsity, and lineage complexity scenarios. Application of MitoBayes to single-cell atlases of Alzheimer’s disease (AD) cortex, treatment-naïve non–small-cell lung cancer (NSCLC), and chemotherapy-resistant small-cell lung cancer (SCLC) revealed distinct, disease-specific patterns of mitochondrial clonal selection. These include selective expansion of high-risk mitochondrial clones associated with disruption of PVALB interneuron homeostasis in AD; disease-driven clonal remodeling in cycling T/NK cells from NSCLC tumors characterized by increased mitochondrial biogenesis and impaired immune regulatory programs; and preferential enrichment of a tumor-associated MT-ATP6 (m.8859A > G) clone linked to metabolic adaptation and platinum resistance in SCLC. Pan-cancer survival analyses further confirmed the clinical relevance of elevated MT-ATP6 activity, which was associated with inferior chemotherapy outcomes. Additionally, in hepatocellular carcinoma (HCC), a dominant m.2356C > G clone correlated with POLR2A activation and widespread transcriptional amplification, consistent with a mitochondria–nucleus signaling axis contributing to adverse prognosis in this cancer type. Collectively, these findings establish MitoBayes as a robust statistical framework linking mitochondrial genetic diversity to disease phenotypes and highlight mitochondrial clonal selection as a mechanistically and clinically actionable target for therapeutic and diagnostic development. Biological sciences/Genetics/Mutation Biological sciences/Computational biology and bioinformatics/Computational models mitochondrial genetic heterogeneity mitochondrial clonal selection hierarchical bayesian modeling selection pressure estimation genotype–phenotype relationships mitochondrial-driven pathogenesis Full Text Additional Declarations There is NO Competing Interest. Supplementary Files SupplementaryMaterial.docx The complete derivation process of the MitoBayes Cite Share Download PDF Status: Under Review Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-8490828","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Article","associatedPublications":[],"authors":[{"id":586442013,"identity":"223eeb02-08fb-49b0-80ae-ba4870149e18","order_by":0,"name":"Qianqian Song","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA7klEQVRIiWNgGAWjYHACNoYEGPMDiMtOihbGGSAuMzFaYICZB0wSUC/f3nvswYMKa9n57b2HX9v82ibPx8zA+OFjDm4tBmfOpRsknEk33nDmXJp1bt9twzZmBmbJmdvwaJHIMZNIbDucuAHIMM7tuc0I1MLGzItHi/wMkJZ/hxPnAxnGlj237QlqYbgB0tJwOLHhRo7xY4YftxMJagH6JU0i4RjIL2fMGHsbbie3MTM24/ULKMQkf9SAQqzH+MOPP7dt57c3H/zwEZ/DGCBxwdgAjCAJxjYQB8TGCxBamD8w/CGgeBSMglEwCkYkAAARoFMjkierIQAAAABJRU5ErkJggg==","orcid":"https://orcid.org/0000-0002-4455-5302","institution":"University of Florida","correspondingAuthor":true,"prefix":"","firstName":"Qianqian","middleName":"","lastName":"Song","suffix":""},{"id":586442014,"identity":"a445dae0-84ee-41a6-aeaa-d8065aafaf55","order_by":1,"name":"Xiaobo Zhou","email":"","orcid":"https://orcid.org/0000-0001-7191-6495","institution":"Center for Computational Systems Medicine, McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston","correspondingAuthor":false,"prefix":"","firstName":"Xiaobo","middleName":"","lastName":"Zhou","suffix":""},{"id":586442015,"identity":"d2e01f66-8264-4fa5-a4a0-709be93338cc","order_by":2,"name":"Aoqi Wang","email":"","orcid":"","institution":"West China Biomedical Big Data Centre, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, PR China","correspondingAuthor":false,"prefix":"","firstName":"Aoqi","middleName":"","lastName":"Wang","suffix":""},{"id":586442016,"identity":"54264123-efd0-4e89-a361-259c7c1f2613","order_by":3,"name":"Yanfei Wang","email":"","orcid":"","institution":"Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, FL, 32611, USA","correspondingAuthor":false,"prefix":"","firstName":"Yanfei","middleName":"","lastName":"Wang","suffix":""},{"id":586442017,"identity":"b326eba1-10c9-478b-8d6c-9335e1ffb62d","order_by":4,"name":"Xiaona Liu","email":"","orcid":"","institution":"Center for Computational Systems Medicine, McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA","correspondingAuthor":false,"prefix":"","firstName":"Xiaona","middleName":"","lastName":"Liu","suffix":""},{"id":586442018,"identity":"90981b5e-91fc-4467-8226-2d7554c3eda4","order_by":5,"name":"Qing Wang","email":"","orcid":"https://orcid.org/0000-0003-3511-4107","institution":"Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, FL, 32611, USA","correspondingAuthor":false,"prefix":"","firstName":"Qing","middleName":"","lastName":"Wang","suffix":""},{"id":586442019,"identity":"e3444724-bf73-406d-8bd7-95dedbd09849","order_by":6,"name":"Sen Guo","email":"","orcid":"","institution":"Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, FL, 32611, USA","correspondingAuthor":false,"prefix":"","firstName":"Sen","middleName":"","lastName":"Guo","suffix":""},{"id":586442020,"identity":"b15aff95-4e67-4d5d-8ae8-a97c088201a6","order_by":7,"name":"Jianguo Wen","email":"","orcid":"","institution":"The University of Texas Health Science Center at Houston","correspondingAuthor":false,"prefix":"","firstName":"Jianguo","middleName":"","lastName":"Wen","suffix":""}],"badges":[],"createdAt":"2025-12-31 15:25:11","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-8490828/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-8490828/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":102746591,"identity":"97a17fca-50f8-4c97-8029-9529d2ed8d2d","added_by":"auto","created_at":"2026-02-16 08:58:27","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":15899067,"visible":true,"origin":"","legend":"","description":"","filename":"Manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-8490828/v1_covered_5d3b6df3-041c-4708-a5fd-94a6470b60f1.pdf"},{"id":102534207,"identity":"46bfaa6f-2bef-454a-b9ab-bbd3e5e53e19","added_by":"auto","created_at":"2026-02-12 17:03:24","extension":"docx","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":54010,"visible":true,"origin":"","legend":"The complete derivation process of the MitoBayes","description":"","filename":"SupplementaryMaterial.docx","url":"https://assets-eu.researchsquare.com/files/rs-8490828/v1/e67853ce4cf4bbe2f4046812.docx"}],"financialInterests":"There is \u003cb\u003eNO\u003c/b\u003e Competing Interest.","formattedTitle":"A hierarchical Bayesian framework for inferring mitochondrial clonal selection from single-cell data","fulltext":[],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":false,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":true,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":true,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":true,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"nature-portfolio","isNatureJournal":true,"hasQc":false,"allowDirectSubmit":false,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"","title":"Nature Portfolio","twitterHandle":"","acdcEnabled":false,"dfaEnabled":false,"editorialSystem":"ejp","reportingPortfolio":"","inReviewEnabled":true,"inReviewRevisionsEnabled":false},"keywords":"mitochondrial genetic heterogeneity, mitochondrial clonal selection, hierarchical bayesian modeling, selection pressure estimation, genotype–phenotype relationships, mitochondrial-driven pathogenesis","lastPublishedDoi":"10.21203/rs.3.rs-8490828/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-8490828/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eMitochondrial genetic heterogeneity arises from the accumulation of somatic mitochondrial DNA (mtDNA) mutations within individual cells, generating intracellular clonal populations whose selective dynamics in disease remain poorly characterized. Here, we present MitoBayes, a hierarchical Bayesian framework that jointly models mitochondrial clonal lineage structure, allele frequency variation, and single-cell disease-relevant phenotypic burdens to infer clone-specific selection pressures. Extensive benchmarking demonstrates that MitoBayes accurately recovers ground-truth selection coefficients across a wide range of genetic heterogeneity, data sparsity, and lineage complexity scenarios. Application of MitoBayes to single-cell atlases of Alzheimer\u0026rsquo;s disease (AD) cortex, treatment-na\u0026iuml;ve non\u0026ndash;small-cell lung cancer (NSCLC), and chemotherapy-resistant small-cell lung cancer (SCLC) revealed distinct, disease-specific patterns of mitochondrial clonal selection. These include selective expansion of high-risk mitochondrial clones associated with disruption of PVALB interneuron homeostasis in AD; disease-driven clonal remodeling in cycling T/NK cells from NSCLC tumors characterized by increased mitochondrial biogenesis and impaired immune regulatory programs; and preferential enrichment of a tumor-associated MT-ATP6 (m.8859A\u0026thinsp;\u0026gt;\u0026thinsp;G) clone linked to metabolic adaptation and platinum resistance in SCLC. Pan-cancer survival analyses further confirmed the clinical relevance of elevated MT-ATP6 activity, which was associated with inferior chemotherapy outcomes. Additionally, in hepatocellular carcinoma (HCC), a dominant m.2356C\u0026thinsp;\u0026gt;\u0026thinsp;G clone correlated with POLR2A activation and widespread transcriptional amplification, consistent with a mitochondria\u0026ndash;nucleus signaling axis contributing to adverse prognosis in this cancer type. Collectively, these findings establish MitoBayes as a robust statistical framework linking mitochondrial genetic diversity to disease phenotypes and highlight mitochondrial clonal selection as a mechanistically and clinically actionable target for therapeutic and diagnostic development.\u003c/p\u003e","manuscriptTitle":"A hierarchical Bayesian framework for inferring mitochondrial clonal selection from single-cell data","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2026-02-12 17:03:19","doi":"10.21203/rs.3.rs-8490828/v1","editorialEvents":[],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"nature-communications","isNatureJournal":true,"hasQc":false,"allowDirectSubmit":false,"externalIdentity":"NCOMMS","sideBox":"Learn more about [Nature Communications](http://www.nature.com/ncomms/)","snPcode":"","submissionUrl":"https://mts-ncomms.nature.com/","title":"Nature Communications","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"ejp","reportingPortfolio":"Nature Communications","inReviewEnabled":true,"inReviewRevisionsEnabled":false}}],"origin":"","ownerIdentity":"114741ac-57ad-4baf-b0df-100bf02f82ab","owner":[],"postedDate":"February 12th, 2026","published":true,"recentEditorialEvents":[{"type":"editorInvitedReview","content":"This content is not available.","date":"2026-05-12T18:38:52+00:00","index":4,"fulltext":"This content is not available."},{"type":"reviewerAgreed","content":"This content is not available.","date":"2026-05-12T18:35:12+00:00","index":4,"fulltext":"This content is not available."},{"type":"editorInvitedReview","content":"This content is not available.","date":"2026-05-11T18:53:56+00:00","index":3,"fulltext":"This content is not available."},{"type":"editorInvitedReview","content":"This content is not available.","date":"2026-05-07T08:39:28+00:00","index":2,"fulltext":"This content is not available."},{"type":"reviewerAgreed","content":"This content is not available.","date":"2026-05-04T09:18:39+00:00","index":3,"fulltext":"This content is not available."},{"type":"reviewerAgreed","content":"This content is not available.","date":"2026-05-01T02:40:31+00:00","index":2,"fulltext":"This content is not available."}],"rejectedJournal":[],"revision":"","amendment":"","status":"under-review","subjectAreas":[{"id":62395154,"name":"Biological sciences/Genetics/Mutation"},{"id":62395155,"name":"Biological sciences/Computational biology and bioinformatics/Computational models"}],"tags":[],"updatedAt":"2026-02-12T17:03:20+00:00","versionOfRecord":[],"versionCreatedAt":"2026-02-12 17:03:19","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-8490828","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-8490828","identity":"rs-8490828","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.