ViraHInter: a dual-modal artificial intelligence framework for predicting virus-host interactions | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Article ViraHInter: a dual-modal artificial intelligence framework for predicting virus-host interactions Siqi Sun, Weiqiang Bai, Fei wang, Sheng Xu, Jialin Wang, Lifeng Qiao, and 7 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-9329453/v1 This work is licensed under a CC BY 4.0 License Status: Under Review Version 1 posted You are reading this latest preprint version Abstract Protein-protein interactions (PPIs) between a virus and its host govern infection, replication, and pathogenesis. While high-throughput mapping has identified thousands of virus-host associations, much of the virus-host interactome remains uncharacterized due to the labor-intensive nature of experimental screens, the inherent difficulty in capturing transient interactions, and the limited sequence homology across divergent viral families. Here, we introduce ViraHInter, a dual-modal deep learning framework for the precise prediction of virus-host interactions and large-scale inference of interaction landscapes. ViraHInter couples a structure-generation branch with a sequence-representation branch, integrating structure-informed pair representations with ESM-derived embeddings to learn generalizable interaction rules across unseen viruses. We benchmark ViraHInter on pathogenic coronaviruses and influenza A viruses and show that it consistently outperforms RoseTTAFold2-PPI, AlphaFold 3 and RoseTTAFold2-Lite in prioritizing high-confidence candidates even under severe class imbalance and across diverse interface regimes. Notably, ViraHInter successfully identifies novel functionally relevant host factors and recapitulates the structural plasticity of the complex interfaces. By intersecting predictions across multiple influenza subtypes, ViraHInter reveals 33 shared host factors, offering a roadmap for broad-spectrum antiviral discovery. ViraHInter therefore serves as a robust computational approach for studying virus-host interactions, enabling systematic screening of host factors for all known human-infecting viruses, providing new insights into the shared mechanisms of viral pathogenesis, and accelerating the discovery of novel therapeutic targets and the development of broad-spectrum antivirals. Biological sciences/Computational biology and bioinformatics/Protein structure predictions Biological sciences/Computational biology and bioinformatics/Computational models Biological sciences/Microbiology/Virology Virus-host interactions ViraHInter Protein-protein interaction prediction Coronaviruses Influenza A viruses Full Text Additional Declarations There is NO Competing Interest. Supplementary Files SupplementaryTable.xlsx Supplementary information of this manuscript Cite Share Download PDF Status: Under Review Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-9329453","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Article","associatedPublications":[],"authors":[{"id":620225172,"identity":"16b3947f-da25-4bc8-ab18-fd8c9d46fa1f","order_by":0,"name":"Siqi Sun","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA30lEQVRIie2PMQrCUAyGIw/qEvcHUr1CpKCD4lkshbooCII4CkJdiq56i4oH8EkGNy+gg1N3cVEctIpTkdeODu9bkkA+8gfAYPhHBIBKSqU4AXrPhUlexUGVW/nihrmV6kysd/fg1A2lt74OoGVHSsRnnUJsDbgUxP1Q+sPyEnwnUlaDtIpA4kLA/a3sURmB3UihJbXBpkhJMO6i7DkPhGe2AoykSgF3EqWeXFHZyucXPHAtxHjURPKcFVt1fbAFby63EVex6G2OOG7b8/001gcDoFQvMvZTisFgMBh+8AJyKEbz18BEhAAAAABJRU5ErkJggg==","orcid":"https://orcid.org/0000-0001-7240-8724","institution":"Fudan university","correspondingAuthor":true,"prefix":"","firstName":"Siqi","middleName":"","lastName":"Sun","suffix":""},{"id":620225173,"identity":"9113e9c7-5bdf-4255-839f-3d20f13381c4","order_by":1,"name":"Weiqiang Bai","email":"","orcid":"","institution":"Fudan university;Shanghai Artificial Intelligence Laboratory","correspondingAuthor":false,"prefix":"","firstName":"Weiqiang","middleName":"","lastName":"Bai","suffix":""},{"id":620225174,"identity":"b169f173-fa70-43de-8ca0-fc19ff0165dc","order_by":2,"name":"Fei wang","email":"","orcid":"","institution":"Ruijin Hospital, Shanghai Jiao Tong University School of Medicine","correspondingAuthor":false,"prefix":"","firstName":"Fei","middleName":"","lastName":"wang","suffix":""},{"id":620225175,"identity":"09ad2122-96f0-4172-89e0-a9029839d7b4","order_by":3,"name":"Sheng Xu","email":"","orcid":"","institution":"Fudan university","correspondingAuthor":false,"prefix":"","firstName":"Sheng","middleName":"","lastName":"Xu","suffix":""},{"id":620225176,"identity":"05428550-4bf6-4e97-b01f-ffd6448fd34a","order_by":4,"name":"Jialin Wang","email":"","orcid":"","institution":"Fudan university;Shanghai Artificial Intelligence Laboratory","correspondingAuthor":false,"prefix":"","firstName":"Jialin","middleName":"","lastName":"Wang","suffix":""},{"id":620225177,"identity":"af25a119-2d8e-40d7-89f1-2d6806409326","order_by":5,"name":"Lifeng Qiao","email":"","orcid":"","institution":"Shanghai Artificial Intelligence Laboratory;School of Artificial Intelligence, Shanghai Jiao Tong University","correspondingAuthor":false,"prefix":"","firstName":"Lifeng","middleName":"","lastName":"Qiao","suffix":""},{"id":620225178,"identity":"3e06cb1c-75b4-441e-8cea-66d514f8923f","order_by":6,"name":"Juan Li","email":"","orcid":"","institution":"Shandong First Medical University \u0026 Shandong Academy of Medical Sciences;The Second Affiliated Hospital of Shandong First Medical University","correspondingAuthor":false,"prefix":"","firstName":"Juan","middleName":"","lastName":"Li","suffix":""},{"id":620225179,"identity":"d7222887-a4cb-4b98-905e-6a9e0831ce80","order_by":7,"name":"Zhuyi Guo","email":"","orcid":"","institution":"Shanghai Institute of Virology, Shanghai Jiao Tong University School of Medicine","correspondingAuthor":false,"prefix":"","firstName":"Zhuyi","middleName":"","lastName":"Guo","suffix":""},{"id":620225180,"identity":"1a52cf59-6f29-48e1-ac8f-8d2c1afaebde","order_by":8,"name":"Xiangyun Hou","email":"","orcid":"","institution":"Ruijin Hospital, Shanghai Jiao Tong University School of Medicine","correspondingAuthor":false,"prefix":"","firstName":"Xiangyun","middleName":"","lastName":"Hou","suffix":""},{"id":620225181,"identity":"d2c5b8a3-96a8-4cf8-9f69-ee716674afaf","order_by":9,"name":"Lei Bai","email":"","orcid":"https://orcid.org/0000-0003-3378-7201","institution":"Shanghai AI Laboratory","correspondingAuthor":false,"prefix":"","firstName":"Lei","middleName":"","lastName":"Bai","suffix":""},{"id":620225182,"identity":"bbae57ae-b3f3-41b1-8820-09fa01086339","order_by":10,"name":"Bowen Zhou","email":"","orcid":"","institution":"Shanghai Artificial Intelligence Laboratory","correspondingAuthor":false,"prefix":"","firstName":"Bowen","middleName":"","lastName":"Zhou","suffix":""},{"id":620225183,"identity":"915e0eea-0201-4f7b-be72-cd8e6b157ee8","order_by":11,"name":"Edward Holmes","email":"","orcid":"https://orcid.org/0000-0001-9596-3552","institution":"University of Sydney","correspondingAuthor":false,"prefix":"","firstName":"Edward","middleName":"","lastName":"Holmes","suffix":""},{"id":620225184,"identity":"37bafe36-d583-474d-b6c8-9437a43aab5b","order_by":12,"name":"Weifeng Shi","email":"","orcid":"","institution":"Ruijin Hospital, Shanghai Jiao Tong University School of Medicine;Shanghai Institute of Virology, Shanghai Jiao Tong University School of Medicine","correspondingAuthor":false,"prefix":"","firstName":"Weifeng","middleName":"","lastName":"Shi","suffix":""}],"badges":[],"createdAt":"2026-04-06 03:40:06","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-9329453/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-9329453/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":106960484,"identity":"a9ae487a-8196-49f4-8146-0328f0ace4f0","added_by":"auto","created_at":"2026-04-15 09:21:23","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":1348612,"visible":true,"origin":"","legend":"","description":"","filename":"ViraHinter4.5.pdf","url":"https://assets-eu.researchsquare.com/files/rs-9329453/v1_covered_f2143926-d073-45b1-b266-44e519795681.pdf"},{"id":106765800,"identity":"cc78e209-a791-4f0a-b30e-ce447e2653ae","added_by":"auto","created_at":"2026-04-13 09:31:54","extension":"xlsx","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":187110,"visible":true,"origin":"","legend":"Supplementary information of this manuscript","description":"","filename":"SupplementaryTable.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-9329453/v1/94d7a2b172377eaee71f9929.xlsx"}],"financialInterests":"There is \u003cb\u003eNO\u003c/b\u003e Competing Interest.","formattedTitle":"ViraHInter: a dual-modal artificial intelligence framework for predicting virus-host interactions","fulltext":[],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":false,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":true,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":true,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":true,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"nature-portfolio","isNatureJournal":true,"hasQc":false,"allowDirectSubmit":false,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"","title":"Nature Portfolio","twitterHandle":"","acdcEnabled":false,"dfaEnabled":false,"editorialSystem":"ejp","reportingPortfolio":"","inReviewEnabled":true,"inReviewRevisionsEnabled":false},"keywords":"Virus-host interactions, ViraHInter, Protein-protein interaction prediction, Coronaviruses, Influenza A viruses","lastPublishedDoi":"10.21203/rs.3.rs-9329453/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-9329453/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"Protein-protein interactions (PPIs) between a virus and its host govern infection, replication, and pathogenesis. While high-throughput mapping has identified thousands of virus-host associations, much of the virus-host interactome remains uncharacterized due to the labor-intensive nature of experimental screens, the inherent difficulty in capturing transient interactions, and the limited sequence homology across divergent viral families. Here, we introduce ViraHInter, a dual-modal deep learning framework for the precise prediction of virus-host interactions and large-scale inference of interaction landscapes. ViraHInter couples a structure-generation branch with a sequence-representation branch, integrating structure-informed pair representations with ESM-derived embeddings to learn generalizable interaction rules across unseen viruses. We benchmark ViraHInter on pathogenic coronaviruses and influenza A viruses and show that it consistently outperforms RoseTTAFold2-PPI, AlphaFold 3 and RoseTTAFold2-Lite in prioritizing high-confidence candidates even under severe class imbalance and across diverse interface regimes. Notably, ViraHInter successfully identifies novel functionally relevant host factors and recapitulates the structural plasticity of the complex interfaces. By intersecting predictions across multiple influenza subtypes, ViraHInter reveals 33 shared host factors, offering a roadmap for broad-spectrum antiviral discovery. ViraHInter therefore serves as a robust computational approach for studying virus-host interactions, enabling systematic screening of host factors for all known human-infecting viruses, providing new insights into the shared mechanisms of viral pathogenesis, and accelerating the discovery of novel therapeutic targets and the development of broad-spectrum antivirals. ","manuscriptTitle":"ViraHInter: a dual-modal artificial intelligence framework for predicting virus-host interactions","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2026-04-13 09:31:50","doi":"10.21203/rs.3.rs-9329453/v1","editorialEvents":[],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"nature-biotechnology","isNatureJournal":true,"hasQc":false,"allowDirectSubmit":false,"externalIdentity":"nbt","sideBox":"Learn more about [Nature Biotechnology](http://www.nature.com/nbt/)","snPcode":"","submissionUrl":"","title":"Nature Biotechnology","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"ejp","reportingPortfolio":"Nature Research","inReviewEnabled":true,"inReviewRevisionsEnabled":false}}],"origin":"","ownerIdentity":"90f1110c-9d29-4267-bcee-9d33d2dd9f4f","owner":[],"postedDate":"April 13th, 2026","published":true,"recentEditorialEvents":[{"type":"editorInvitedReview","content":"This content is not available.","date":"2026-05-13T10:03:01+00:00","index":4,"fulltext":"This content is not available."},{"type":"editorInvitedReview","content":"This content is not available.","date":"2026-05-13T09:57:47+00:00","index":3,"fulltext":"This content is not available."},{"type":"editorInvitedReview","content":"This content is not available.","date":"2026-04-29T22:25:59+00:00","index":2,"fulltext":"This content is not available."}],"rejectedJournal":[],"revision":"","amendment":"","status":"under-review","subjectAreas":[{"id":66002301,"name":"Biological sciences/Computational biology and bioinformatics/Protein structure predictions"},{"id":66002302,"name":"Biological sciences/Computational biology and bioinformatics/Computational models"},{"id":66002303,"name":"Biological sciences/Microbiology/Virology"}],"tags":[],"updatedAt":"2026-04-13T09:31:50+00:00","versionOfRecord":[],"versionCreatedAt":"2026-04-13 09:31:50","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-9329453","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-9329453","identity":"rs-9329453","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.