A statistical model for tuberculosis diagnosis to guide multi-test strategies and clinical decision-making | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Article A statistical model for tuberculosis diagnosis to guide multi-test strategies and clinical decision-making Marc-Olivier Boldi, Nada Abdulghafor, René Brouillet, Ludovico Cobuccio, and 9 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-7067339/v1 This work is licensed under a CC BY 4.0 License Status: Posted Version 1 posted You are reading this latest preprint version Abstract Objectives The diagnosis of tuberculosis (TB) remains complex, requiring multiple microbiological tests - microscopy, PCR, and culture - across various clinical specimen types. A central challenge is determining the optimal combination of test, specimen, and strategy for a given patient. We developed a predictive scoring system that integrates patient characteristics (e.g., age and sex) with test performance data to guide personalized, data-driven diagnostic decisions in multi-test settings. Methods We retrospectively analyzed data from 4,179 patients evaluated for respiratory TB between 2008 and 2018. Variables included age, sex, specimen type and diagnostic test results: smear microscopy, Xpert MTB/RIF, in-house real-time PCR for Mycobacterium tuberculosis complex (MTB), and mycobacterial culture. A multivariate logistic regression model was used to estimate test-specific performance between subgroups. These outputs informed a Hidden Markov Model (HMM) to simulate sequential testing strategies and calculate diagnostic probabilities. The entropy analysis quantified the information gained with each successive test. Results Multivariate logistic regression identified that age significantly influences TB prevalence, peaking in the 21–40 age group. Culture showed the highest sensitivity, followed by PCR, and microscopy performed the poorest. HMM-based simulations revealed that sequential testing improves diagnostic yield, but with diminishing returns. Entropy analysis confirmed that most diagnostic uncertainty is resolved by the first few tests, while subsequent tests contribute marginal additional value. Conclusions Mathematical modeling, combining HMM and entropy analysis, offers a framework for optimizing the diagnostic pathways for TB. Our approach enables test selection to be tailored to patient characteristics, potentially improving diagnostic efficiency and reducing unnecessary tests. Future work should integrate broader clinical and epidemiological variables as well as cost-effectiveness analysis to be able to inform diagnostic stewardship strategies for TB and other complex syndromes. Biological sciences/Computational biology and bioinformatics Health sciences/Diseases Health sciences/Health care Health sciences/Medical research Biological sciences/Microbiology Full Text Additional Declarations No competing interests reported. Cite Share Download PDF Status: Posted Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-7067339","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Article","associatedPublications":[],"authors":[{"id":482917520,"identity":"0bd53a34-9aee-48e6-8805-32bc17af69e8","order_by":0,"name":"Marc-Olivier Boldi","email":"","orcid":"","institution":"Department of Operations, Faculty of Business and Economics, University of Lausanne","correspondingAuthor":false,"prefix":"","firstName":"Marc-Olivier","middleName":"","lastName":"Boldi","suffix":""},{"id":482917521,"identity":"3cc59a0c-1db9-4b38-b473-a77d22609ba0","order_by":1,"name":"Nada Abdulghafor","email":"","orcid":"","institution":"Department of Operations, Faculty of Business and Economics, University of Lausanne","correspondingAuthor":false,"prefix":"","firstName":"Nada","middleName":"","lastName":"Abdulghafor","suffix":""},{"id":482917522,"identity":"93e82030-17e4-4b4d-a7a3-33b216701249","order_by":2,"name":"René Brouillet","email":"","orcid":"","institution":"Institute of Microbiology, Lausanne University and University Hospital of Lausanne","correspondingAuthor":false,"prefix":"","firstName":"René","middleName":"","lastName":"Brouillet","suffix":""},{"id":482917523,"identity":"83376202-c252-48da-8533-a229914260d8","order_by":3,"name":"Ludovico Cobuccio","email":"","orcid":"","institution":"Infectious Diseases Service, Lausanne University and University Hospital of Lausanne","correspondingAuthor":false,"prefix":"","firstName":"Ludovico","middleName":"","lastName":"Cobuccio","suffix":""},{"id":482917524,"identity":"04114c58-6ae0-4e97-9910-14b42c7f5d6d","order_by":4,"name":"Véronique Suttels","email":"","orcid":"","institution":"Infectious Diseases Service, Lausanne University and University Hospital of Lausanne","correspondingAuthor":false,"prefix":"","firstName":"Véronique","middleName":"","lastName":"Suttels","suffix":""},{"id":482917525,"identity":"48c36927-c10a-4ced-b469-ba290bf743f3","order_by":5,"name":"Frédéric Lamoth","email":"","orcid":"","institution":"Institute of Microbiology, Lausanne University and University Hospital of Lausanne","correspondingAuthor":false,"prefix":"","firstName":"Frédéric","middleName":"","lastName":"Lamoth","suffix":""},{"id":482917526,"identity":"7818336a-ba2e-49ba-b522-f7403b4728cf","order_by":6,"name":"Laurence Senn","email":"","orcid":"","institution":"Infection Prevention and Control Unit, Service of Infectious Diseases, Lausanne University and University Hospital of Lausanne","correspondingAuthor":false,"prefix":"","firstName":"Laurence","middleName":"","lastName":"Senn","suffix":""},{"id":482917527,"identity":"a11f3c4c-a728-4550-a863-5cce09e42747","order_by":7,"name":"Valérie Chavez-Demoulin","email":"","orcid":"","institution":"Department of Operations, Faculty of Business and Economics, University of Lausanne","correspondingAuthor":false,"prefix":"","firstName":"Valérie","middleName":"","lastName":"Chavez-Demoulin","suffix":""},{"id":482917528,"identity":"41ccb020-5a12-494c-baf2-7195dc8c9eaf","order_by":8,"name":"Christophe Von-Garnier","email":"","orcid":"","institution":"Division of Pulmonology, Department of Medicine, Lausanne University Hospital, University of Lausanne, Lausanne, Switzerland","correspondingAuthor":false,"prefix":"","firstName":"Christophe","middleName":"","lastName":"Von-Garnier","suffix":""},{"id":482917529,"identity":"8c58c279-ed8e-46a6-a701-c28b90c9f2b6","order_by":9,"name":"Noémie Boillat","email":"","orcid":"","institution":"Infectious Diseases Service, Lausanne University and University Hospital of Lausanne","correspondingAuthor":false,"prefix":"","firstName":"Noémie","middleName":"","lastName":"Boillat","suffix":""},{"id":482917531,"identity":"11d2b134-46e0-42bc-bb29-3227b7856772","order_by":10,"name":"Jesica Mazza-Stalder","email":"","orcid":"","institution":"Division of Pulmonology, Department of Medicine, Lausanne University Hospital, University of Lausanne, Lausanne, Switzerland","correspondingAuthor":false,"prefix":"","firstName":"Jesica","middleName":"","lastName":"Mazza-Stalder","suffix":""},{"id":482917532,"identity":"c541b7d4-ff66-4567-8d5f-5f61b594349b","order_by":11,"name":"Gilbert Greub","email":"","orcid":"","institution":"Institute of Microbiology, Lausanne University and University Hospital of Lausanne","correspondingAuthor":false,"prefix":"","firstName":"Gilbert","middleName":"","lastName":"Greub","suffix":""},{"id":482917533,"identity":"a23e23c0-7a22-497a-9fc9-bc6ba0f85244","order_by":12,"name":"Onya Opota","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAAyElEQVRIiWNgGAWjYHAD5gPMYFqCWA08DGwJJGvhMSBOC790j+EHhoo7cvbsZz5+Lmw7zMA/uwG/Fsk5Z4wlGM48M+bhyd0sPROoReLOAfxaDG7kGEgwth1O7GHI3cbM25bGYCCRQFCL8Q+wFv43z4jWYgaxRSKHDajFhrAWyRlpZRYJIL/ceGYszXPOhkfiBgEt/BLJm298AIYYe3/yw888ZRJy/DMIaGFg4DBgSGA4AOfyEFIPBOwPgMQBvEpGwSgYBaNghAMA/Ro9QPwbxR8AAAAASUVORK5CYII=","orcid":"","institution":"Institute of Microbiology, Lausanne University and University Hospital of Lausanne","correspondingAuthor":true,"prefix":"","firstName":"Onya","middleName":"","lastName":"Opota","suffix":""}],"badges":[],"createdAt":"2025-07-07 16:13:22","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-7067339/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-7067339/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":87172112,"identity":"2881c0e8-0470-4c93-a141-223b830ffeca","added_by":"auto","created_at":"2025-07-21 07:39:23","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":954120,"visible":true,"origin":"","legend":"","description":"","filename":"TuberculosisDiagnosisnpjDigitalMedicine.pdf","url":"https://assets-eu.researchsquare.com/files/rs-7067339/v1_covered_1f546f2f-5e5c-4d13-990a-28d1455c860d.pdf"}],"financialInterests":"No competing interests reported.","formattedTitle":"A statistical model for tuberculosis diagnosis to guide multi-test strategies and clinical decision-making","fulltext":[],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":false,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":true,"hideJournal":true,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":true,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":true,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":"","lastPublishedDoi":"10.21203/rs.3.rs-7067339/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-7067339/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003e\u003cstrong\u003eObjectives\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe diagnosis of tuberculosis (TB) remains complex, requiring multiple microbiological tests - microscopy, PCR, and culture - across various clinical specimen types. A central challenge is determining the optimal combination of test, specimen, and strategy for a given patient. We developed a predictive scoring system that integrates patient characteristics (e.g., age and sex) with test performance data to guide personalized, data-driven diagnostic decisions in multi-test settings.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eMethods\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eWe retrospectively analyzed data from 4,179 patients evaluated for respiratory TB between 2008 and 2018. Variables included age, sex, specimen type and diagnostic test results: smear microscopy, Xpert MTB/RIF, in-house real-time PCR for Mycobacterium tuberculosis complex (MTB), and mycobacterial culture. A multivariate logistic regression model was used to estimate test-specific performance between subgroups. These outputs informed a Hidden Markov Model (HMM) to simulate sequential testing strategies and calculate diagnostic probabilities. The entropy analysis quantified the information gained with each successive test.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eResults\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eMultivariate logistic regression identified that age significantly influences TB prevalence, peaking in the 21–40 age group. Culture showed the highest sensitivity, followed by PCR, and microscopy performed the poorest. HMM-based simulations revealed that sequential testing improves diagnostic yield, but with diminishing returns. Entropy analysis confirmed that most diagnostic uncertainty is resolved by the first few tests, while subsequent tests contribute marginal additional value.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eConclusions\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eMathematical modeling, combining HMM and entropy analysis, offers a framework for optimizing the diagnostic pathways for TB. Our approach enables test selection to be tailored to patient characteristics, potentially improving diagnostic efficiency and reducing unnecessary tests. Future work should integrate broader clinical and epidemiological variables as well as cost-effectiveness analysis to be able to inform diagnostic stewardship strategies for TB and other complex syndromes.\u003c/p\u003e","manuscriptTitle":"A statistical model for tuberculosis diagnosis to guide multi-test strategies and clinical decision-making","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2025-07-15 17:04:52","doi":"10.21203/rs.3.rs-7067339/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"bf6a382a-f9c0-4902-b9ff-a03804e1e2ea","owner":[],"postedDate":"July 15th, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"posted","subjectAreas":[{"id":51276462,"name":"Biological sciences/Computational biology and bioinformatics"},{"id":51276463,"name":"Health sciences/Diseases"},{"id":51276464,"name":"Health sciences/Health care"},{"id":51276465,"name":"Health sciences/Medical research"},{"id":51276466,"name":"Biological sciences/Microbiology"}],"tags":[],"updatedAt":"2025-07-21T07:39:07+00:00","versionOfRecord":[],"versionCreatedAt":"2025-07-15 17:04:52","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-7067339","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-7067339","identity":"rs-7067339","version":["v1"]},"buildId":"8U1c8b4HqxoKbykW_rLl7","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.