Long COVID Diagnostic with Differentiation from Chronic Lyme Disease using Machine Learning and Cytokine Hubs | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Long COVID Diagnostic with Differentiation from Chronic Lyme Disease using Machine Learning and Cytokine Hubs Bruce Patterson, Jose Guevara-Coto, Javier Mora, Edgar B. Francisco, and 7 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-3873244/v1 This work is licensed under a CC BY 4.0 License Status: Posted Version 1 posted You are reading this latest preprint version Abstract The absence of a diagnostic for long COVID (LC) or post-acute sequelae of COVID-19 (PASC) has profound implications for research and potential therapeutics. Further, symptom-based identification of patients with long-term COVID-19 lacks the specificity to serve as a diagnostic because of the overlap of symptoms with other chronic inflammatory conditions like chronic Lymedisease (CLD), myalgic encephalomyelitis-chronic fatigue syndrome (ME-CFS), and others. Here, we report a machine-learning approach to long COVID diagnosis using cytokine hubs that are also capable of differentiating long COVID from chronic Lyme. We constructed three tree-based classifiers: decision tree, random forest, and gradient-boosting machine (GBM) and compared their diagnostic capabilities. A 223 patient dataset was partitioned into training (178 patients) and evaluation (45 patients) sets. The GBM model was selected based on performance (89% Sensitivity and 96% Specificity for LC) with no evidence of overfitting. We tested the GBM on a random dataset of 124 individuals (106 PASC and 18 Lyme), resulting in high sensitivity (97%) and specificity 90% for LC). A Lyme Index composed of two features ((TNF-alpha +IL-4)/(IFN-gamma + IL-2) and (TNF-alpha *IL-4)/(IFN-gamma + IL-2 + CCL3) was constructed as a confirmatory algorithm to discriminate between LC and CLD. Immunology COVID-19 PASC long COVID cytokines chronic lyme disease (CLD) myalgic encephalomyelitis-chronic fatigue syndrome (ME-CFS) Machine Learning/AI Full Text Additional Declarations The authors declare potential competing interests as follows: Competing interests: B.K.P, C.B., J.G-M, G.L. and E.B.F. are employees of IncellDx, Inc Cite Share Download PDF Status: Posted Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-3873244","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":267652249,"identity":"08c16341-c970-44f6-8b14-c891ca8501e4","order_by":0,"name":"Bruce Patterson","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAAyElEQVRIiWNgGAWjYBACAwhlwwMkGA88YCNeSxpIC8OBBBK0HGYgXou59BnDDx93nJcxZz984EBC2WEG/vbuBLxaLPtyjCVnnrnNY9mTlnAg4Vwag8SZsxvwO+wMjxkzb9ttHoMbPAYHEttsGAwkconQ8rftHFAL/wegFgkitTC2HQDZwkCcLZY9bMWSvWeSeQzOpBmA/MJD0C/mPMwbP/zcYWdvcPzwwwcfyg7L8bf34tcCBowNCDYPYeXoWkbBKBgFo2AUYAAAlABHpEkPAOUAAAAASUVORK5CYII=","orcid":"","institution":"IncellDx","correspondingAuthor":true,"prefix":"","firstName":"Bruce","middleName":"","lastName":"Patterson","suffix":""},{"id":267655181,"identity":"be4bd0eb-32e9-43dc-abef-24cb459d3fe4","order_by":1,"name":"Jose Guevara-Coto","email":"","orcid":"","institution":"IncellDx Inc","correspondingAuthor":false,"prefix":"","firstName":"Jose","middleName":"","lastName":"Guevara-Coto","suffix":""},{"id":267655182,"identity":"9766e730-4419-46ed-a35f-f237482d12af","order_by":2,"name":"Javier Mora","email":"","orcid":"","institution":"Lab of Tumor Chemosensitivity, CIET / CICICA, Faculty of Microbiology, Universidad de Costa Rica","correspondingAuthor":false,"prefix":"","firstName":"Javier","middleName":"","lastName":"Mora","suffix":""},{"id":267655183,"identity":"d9665b00-bb66-4d2e-a64c-c88fd2f02124","order_by":3,"name":"Edgar B. Francisco","email":"","orcid":"","institution":"Incelldx Inc","correspondingAuthor":false,"prefix":"","firstName":"Edgar","middleName":"B.","lastName":"Francisco","suffix":""},{"id":267655184,"identity":"f0d132ab-7490-48e8-aaec-9f55ff9d6e3a","order_by":4,"name":"Ram Yogendra","email":"","orcid":"","institution":"Lawrence General Hospital, Lawrence, MA","correspondingAuthor":false,"prefix":"","firstName":"Ram","middleName":"","lastName":"Yogendra","suffix":""},{"id":267655185,"identity":"251f79a5-fdf3-4a17-8cea-592894c28ea1","order_by":5,"name":"Rodrigo A Mora-Rodríguez","email":"","orcid":"","institution":"Lab of Tumor Chemosensitivity, CIET / CICICA, Faculty of Microbiology, Universidad de Costa Rica.","correspondingAuthor":false,"prefix":"","firstName":"Rodrigo","middleName":"A","lastName":"Mora-Rodríguez","suffix":""},{"id":267655186,"identity":"17cd848a-6169-481c-a150-ebac06cb1bed","order_by":6,"name":"Christopher Beaty","email":"","orcid":"","institution":"incelldx Inc","correspondingAuthor":false,"prefix":"","firstName":"Christopher","middleName":"","lastName":"Beaty","suffix":""},{"id":267655187,"identity":"30883d7d-81ff-4996-a3b8-1d40dd6f91d5","order_by":7,"name":"Gwyneth Lemaster","email":"","orcid":"","institution":"IncellDx Inc","correspondingAuthor":false,"prefix":"","firstName":"Gwyneth","middleName":"","lastName":"Lemaster","suffix":""},{"id":267655188,"identity":"a9827c3b-affa-431a-b2b2-76bad7a16bcb","order_by":8,"name":"Gary Kaplan","email":"","orcid":"","institution":"Clinical Associate Professor (Adjunct), Department of Community and Family Medicine, Georgetown University School of Medicine, Washington, DC","correspondingAuthor":false,"prefix":"","firstName":"Gary","middleName":"","lastName":"Kaplan","suffix":""},{"id":267655189,"identity":"cd409730-069c-42e5-b719-18c03adf0003","order_by":9,"name":"Amiram Katz","email":"","orcid":"","institution":"Neurology Specialist Affiliated with Norwalk Hospital. Orange, CT","correspondingAuthor":false,"prefix":"","firstName":"Amiram","middleName":"","lastName":"Katz","suffix":""},{"id":267655190,"identity":"b8d2cbca-00bc-4c2d-9493-0408ff82fd77","order_by":10,"name":"Joseph A. Bellanti","email":"","orcid":"","institution":"Departments of Pediatrics and Microbiology-Immunology, and the International Center for Interdisciplinary Studies of Immunology, Georgetown University Medical Center, Washington DC.","correspondingAuthor":false,"prefix":"","firstName":"Joseph","middleName":"A.","lastName":"Bellanti","suffix":""}],"badges":[],"createdAt":"2024-01-17 15:19:56","currentVersionCode":1,"declarations":{"humanSubjects":true,"vertebrateSubjects":false,"conflictsOfInterestStatement":true,"humanSubjectEthicalGuidelines":true,"humanSubjectConsent":true,"humanSubjectClinicalTrial":false,"humanSubjectCaseReport":false,"vertebrateSubjectEthicalGuidelines":false},"doi":"10.21203/rs.3.rs-3873244/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-3873244/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":49783878,"identity":"85b1178f-33bc-4b53-8c50-7ec21d82dc59","added_by":"auto","created_at":"2024-01-18 02:31:47","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":431087,"visible":true,"origin":"","legend":"","description":"","filename":"CytokinmehubspaperupdatedmethodsresultsJGCBP01162024.pdf","url":"https://assets-eu.researchsquare.com/files/rs-3873244/v1_covered_d684b138-308c-47ec-abfa-c5194ee2133a.pdf"}],"financialInterests":"The authors declare potential competing interests as follows: Competing interests: \nB.K.P, C.B., J.G-M, G.L. and E.B.F. are employees of IncellDx, Inc \n\n","formattedTitle":"\u003cp\u003e\u003cem\u003e\u003cstrong\u003eLong COVID Diagnostic with Differentiation from Chronic Lyme Disease using Machine Learning and Cytokine Hubs\u003c/strong\u003e\u003c/em\u003e\u003c/p\u003e","fulltext":[],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":false,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":true,"hideJournal":true,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":true,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":true,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":"COVID-19, PASC, long COVID, cytokines, chronic lyme disease (CLD), myalgic encephalomyelitis-chronic fatigue syndrome (ME-CFS), Machine Learning/AI","lastPublishedDoi":"10.21203/rs.3.rs-3873244/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-3873244/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eThe absence of a diagnostic for long COVID (LC) or post-acute sequelae of COVID-19 (PASC) has profound implications for research and potential therapeutics. Further, symptom-based identification of patients with long-term COVID-19 lacks the specificity to serve as a diagnostic because of the overlap of symptoms with other chronic inflammatory conditions like chronic Lymedisease (CLD), myalgic encephalomyelitis-chronic fatigue syndrome (ME-CFS), and others. Here, we report a machine-learning approach to long COVID diagnosis using cytokine hubs that are also capable of differentiating long COVID from chronic Lyme. We constructed three tree-based classifiers: decision tree, random forest, and gradient-boosting machine (GBM) and compared their diagnostic capabilities. A 223 patient dataset was partitioned into training (178 patients) and evaluation (45 patients) sets. The GBM model was selected based on performance (89% Sensitivity and 96% Specificity for LC) with no evidence of overfitting. We tested the GBM on a random dataset of 124 individuals (106 PASC and 18 Lyme), resulting in high sensitivity (97%) and specificity 90% for LC). A Lyme Index composed of two features ((TNF-alpha +IL-4)/(IFN-gamma + IL-2) and (TNF-alpha *IL-4)/(IFN-gamma + IL-2 + CCL3) was constructed as a confirmatory algorithm to discriminate between LC and CLD.\u0026nbsp;\u003c/p\u003e","manuscriptTitle":"Long COVID Diagnostic with Differentiation from Chronic Lyme Disease using Machine Learning and Cytokine Hubs","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2024-01-18 02:23:36","doi":"10.21203/rs.3.rs-3873244/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"ccf3cc0a-ab54-433e-a677-fc6ae0dc79e5","owner":[],"postedDate":"January 18th, 2024","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"posted","subjectAreas":[{"id":28201534,"name":"Immunology"}],"tags":[],"updatedAt":"2024-03-30T04:59:29+00:00","versionOfRecord":[],"versionCreatedAt":"2024-01-18 02:23:36","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-3873244","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-3873244","identity":"rs-3873244","version":["v1"]},"buildId":"qtupq5eGEP_6zYnWcrvyt","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.