Structure Alignment-driven Cross-Graph Modeling for Functional RNA Design | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Article Structure Alignment-driven Cross-Graph Modeling for Functional RNA Design Xiaoyong Pan, Shengfan Wang, Jun Wang, Xiaojian Liu, Weimin Zhu, and 5 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-7169944/v1 This work is licensed under a CC BY 4.0 License Status: Under Review Version 1 posted You are reading this latest preprint version Abstract RNAs are critical for biological processes, with their biological functions closely tied to their three-dimensional structures. RNA inverse folding, the design of RNA sequences that fold into target 3D structures, is a complex challenge due to the dynamic and unstable nature of RNA structures. Motivated by evolutionary conservation concepts from structure prediction, we present a novel RNA design approach AlignIF, which leverages multiple structure alignment (MStA) with cross-graph modeling to capture evolutionarily conserved structural patterns at the structural level, facilitating RNA sequence design. AlignIF outperforms existing state-of-the-art methods by a large margin in key metrics, including sequence recovery, perplexity, and foldability. Notably, it enables the design of entire RNA families rather than being restricted to recapitulating native sequences. Furthermore, AlignIF effectively generates functional RNA fluorescent aptamers and self-cleaving ribozymes, as experimentally validated by their respective fluorescent signals or cleavage activity. Importantly, two of the ten designed aptamers show enhanced fluorescence compared to the wild-type aptamer due to the increased fluorophore binding capacity, and other two exhibit improved binding affinity toward the target molecule, highlighting the potential of AlignIF for engineering novel functional RNAs. Biological sciences/Computational biology and bioinformatics/Machine learning Biological sciences/Systems biology/Synthetic biology Full Text Additional Declarations There is NO Competing Interest. Supplementary Files SupportInformation.docx Support Information Cite Share Download PDF Status: Under Review Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-7169944","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Article","associatedPublications":[],"authors":[{"id":489660081,"identity":"3b991f63-0d74-4ad4-bfb2-cb4cd7bae758","order_by":0,"name":"Xiaoyong Pan","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAABAklEQVRIie2RsWrDMBRFnxA4i5quMoHkF2QMydKSX1EIZNIQKIRuTTCov+DQn9DazUFQLyazSzM4BNw5ZO1QWe0sZSxUZ7rDO0+6PIBA4A9yi9Gmsam3BmZD4VHi50z/TJICWHGNwqq3BbWJ8t9pnwK1SNMbeRhN4s/m4Sxh2K85uiwdBspFctzKNnl9EYztJKRxzfEgdyiYipSdpUbqQ0CnzFTNI0wcSkTFmM6knqr3sumUJ69CiKlvXjHLwX6MM59Ce5lO8n07V5XpUu1psq2O2cClTDXanMjqcK/KsmWPq7tRv5zvLi7FgqLC9jI37W6E1j7B8GUV3FwxGggEAv+Qb/f9Vql2R0lnAAAAAElFTkSuQmCC","orcid":"https://orcid.org/0000-0001-5010-464X","institution":"Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University","correspondingAuthor":true,"prefix":"","firstName":"Xiaoyong","middleName":"","lastName":"Pan","suffix":""},{"id":489660082,"identity":"f58f36a6-8978-4b89-82a2-8b57e445ac6f","order_by":1,"name":"Shengfan Wang","email":"","orcid":"","institution":"Shanghai Jiao Tong University","correspondingAuthor":false,"prefix":"","firstName":"Shengfan","middleName":"","lastName":"Wang","suffix":""},{"id":489660083,"identity":"ae8e7e5f-4547-4951-962a-272cd73b4b93","order_by":2,"name":"Jun Wang","email":"","orcid":"https://orcid.org/0009-0006-9213-8331","institution":"Shanghai Jiao Tong University","correspondingAuthor":false,"prefix":"","firstName":"Jun","middleName":"","lastName":"Wang","suffix":""},{"id":489660084,"identity":"84c337fb-f507-41db-8a11-8a6fda54e15f","order_by":3,"name":"Xiaojian Liu","email":"","orcid":"","institution":"Shanghai Jiao Tong University","correspondingAuthor":false,"prefix":"","firstName":"Xiaojian","middleName":"","lastName":"Liu","suffix":""},{"id":489660085,"identity":"c48f43f0-ce25-4e21-b07c-c0d22d94584e","order_by":4,"name":"Weimin Zhu","email":"","orcid":"","institution":"Shanghai Jiao Tong University","correspondingAuthor":false,"prefix":"","firstName":"Weimin","middleName":"","lastName":"Zhu","suffix":""},{"id":489660086,"identity":"174561e3-c51e-47d6-a9cb-5a09e59f5df1","order_by":5,"name":"Jiaqi Huang","email":"","orcid":"","institution":"Shanghai Jiao Tong University","correspondingAuthor":false,"prefix":"","firstName":"Jiaqi","middleName":"","lastName":"Huang","suffix":""},{"id":489660087,"identity":"8fa3c922-1aa9-4365-9e45-79ea1c866d24","order_by":6,"name":"Yiming Xue","email":"","orcid":"","institution":"Shanghai Jiao Tong University","correspondingAuthor":false,"prefix":"","firstName":"Yiming","middleName":"","lastName":"Xue","suffix":""},{"id":489660088,"identity":"57d7e5e7-752c-4a37-82a6-0cfba6b3ef3b","order_by":7,"name":"Junchi Yan","email":"","orcid":"","institution":"Shanghai Jiao Tong University","correspondingAuthor":false,"prefix":"","firstName":"Junchi","middleName":"","lastName":"Yan","suffix":""},{"id":489660089,"identity":"ffa939e2-d7c0-473a-ad22-ecc65aa2a9fa","order_by":8,"name":"Hong-Bin Shen","email":"","orcid":"https://orcid.org/0000-0002-4029-3325","institution":"Shanghai Jiao Tong University","correspondingAuthor":false,"prefix":"","firstName":"Hong-Bin","middleName":"","lastName":"Shen","suffix":""},{"id":489660090,"identity":"07018865-8055-4e64-a4f5-ed20a1ff504e","order_by":9,"name":"Jie Song","email":"","orcid":"","institution":"Hangzhou Institute of Medicine Chinese Academy of Sciences","correspondingAuthor":false,"prefix":"","firstName":"Jie","middleName":"","lastName":"Song","suffix":""}],"badges":[],"createdAt":"2025-07-20 13:35:12","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-7169944/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-7169944/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":91146003,"identity":"0928e3c0-5147-494e-aff6-2f638f1fd4b9","added_by":"auto","created_at":"2025-09-12 06:13:56","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":2463640,"visible":true,"origin":"","legend":"","description":"","filename":"AlignIFMS.pdf","url":"https://assets-eu.researchsquare.com/files/rs-7169944/v1_covered_0b12d848-f70b-4365-90fc-ff95f78643be.pdf"},{"id":91145161,"identity":"bf26a4f1-000d-4785-9884-131c80aa0e4b","added_by":"auto","created_at":"2025-09-12 06:05:47","extension":"docx","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":2957240,"visible":true,"origin":"","legend":"Support Information","description":"","filename":"SupportInformation.docx","url":"https://assets-eu.researchsquare.com/files/rs-7169944/v1/7cba0286ff5060c7af3be0da.docx"}],"financialInterests":"There is \u003cb\u003eNO\u003c/b\u003e Competing Interest.","formattedTitle":"Structure Alignment-driven Cross-Graph Modeling for Functional RNA Design","fulltext":[],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":false,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":true,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":true,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":true,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"nature-portfolio","isNatureJournal":true,"hasQc":false,"allowDirectSubmit":false,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"","title":"Nature Portfolio","twitterHandle":"","acdcEnabled":false,"dfaEnabled":false,"editorialSystem":"ejp","reportingPortfolio":"","inReviewEnabled":true,"inReviewRevisionsEnabled":false},"keywords":"","lastPublishedDoi":"10.21203/rs.3.rs-7169944/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-7169944/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"RNAs are critical for biological processes, with their biological functions closely tied to their three-dimensional structures. RNA inverse folding, the design of RNA sequences that fold into target 3D structures, is a complex challenge due to the dynamic and unstable nature of RNA structures. Motivated by evolutionary conservation concepts from structure prediction, we present a novel RNA design approach AlignIF, which leverages multiple structure alignment (MStA) with cross-graph modeling to capture evolutionarily conserved structural patterns at the structural level, facilitating RNA sequence design. AlignIF outperforms existing state-of-the-art methods by a large margin in key metrics, including sequence recovery, perplexity, and foldability. Notably, it enables the design of entire RNA families rather than being restricted to recapitulating native sequences. Furthermore, AlignIF effectively generates functional RNA fluorescent aptamers and self-cleaving ribozymes, as experimentally validated by their respective fluorescent signals or cleavage activity. Importantly, two of the ten designed aptamers show enhanced fluorescence compared to the wild-type aptamer due to the increased fluorophore binding capacity, and other two exhibit improved binding affinity toward the target molecule, highlighting the potential of AlignIF for engineering novel functional RNAs.","manuscriptTitle":"Structure Alignment-driven Cross-Graph Modeling for Functional RNA Design","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2025-09-12 06:05:43","doi":"10.21203/rs.3.rs-7169944/v1","editorialEvents":[],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"nature-computational-science","isNatureJournal":true,"hasQc":false,"allowDirectSubmit":false,"externalIdentity":"natcomputsci","sideBox":"Learn more about [Nature Computational Science](http://www.nature.com/natcomputsci/)","snPcode":"","submissionUrl":"","title":"Nature Computational Science","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"ejp","reportingPortfolio":"Nature Research","inReviewEnabled":true,"inReviewRevisionsEnabled":false}}],"origin":"","ownerIdentity":"c413b436-bf70-4757-837a-e50af6ee5540","owner":[],"postedDate":"September 12th, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"under-review","subjectAreas":[{"id":51985649,"name":"Biological sciences/Computational biology and bioinformatics/Machine learning"},{"id":51985650,"name":"Biological sciences/Systems biology/Synthetic biology"}],"tags":[],"updatedAt":"2025-09-12T06:05:43+00:00","versionOfRecord":[],"versionCreatedAt":"2025-09-12 06:05:43","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-7169944","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-7169944","identity":"rs-7169944","version":["v1"]},"buildId":"8U1c8b4HqxoKbykW_rLl7","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.