Deep Learning Deciphers the Related Role of Master Regulators and G-Quadruplexes in Tissue Specification

preprint OA: closed
Full text JSON View at publisher
Full text 18,334 characters · extracted from preprint-html · click to expand
Deep Learning Deciphers the Related Role of Master Regulators and G-Quadruplexes in Tissue Specification | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Article Deep Learning Deciphers the Related Role of Master Regulators and G-Quadruplexes in Tissue Specification Artem Bashkatov, Andrey Andreasyan, Dmitry Konovalov, Alan Herbert, and 1 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-6165621/v1 This work is licensed under a CC BY 4.0 License Status: Published Journal Publication published 02 Jul, 2025 Read the published version in Scientific Reports → Version 1 posted 10 You are reading this latest preprint version Abstract G-quadruplexes (GQs) are non-canonical DNA structures encoded by G-flipons that may play critical roles in gene regulation and chromatin structure. Here, we explore the role of G-flipons in tissue specification. We present a deep learning-based framework for the genome-wide G-flipon predictions across 14 human tissue types. The model was trained with high confidence experimental maps of GQ forming sequences and ATAC-seq peaks, conjoined with the location of RNA polymerase, histones, and transcription factor binding sites. The training set was based on level 4-6 Endoquad GQ annotated sequences with predictions validated using the complete level 1-6 dataset. To identify tissue-specific regulatory patterns, we classified GQ promoter predictions as either 'core' or ‘tissue-specific’. The predicted GQ-dependent, tissue-specific expression of genes was confirmed using DAVID gene ontology tools and validated using the TissueEnrich and GTEx datasets. We further explored interactions between GQ structures and master regulator genes (MRGs) in promoter regions, revealing a notable overlap of MRGs and predicted sites of GQ formation, with colocation of both features within the same tissue-specific DNase hypersensitivity site and with proteins that modulate R-loop formation. Collectively, the findings highlight the transactions between G-flipons with MRG during development that underlie tissue specification. Biological sciences/Biological techniques/Bioinformatics Biological sciences/Biological techniques/Biological models/Genetic models Biological sciences/Molecular biology/Transcription/Transcriptional regulatory elements Biological sciences/Biological techniques/Software G-quadruplex Flipons R-loops Tissue Differentiation Chromatin Deep Learning Full Text Additional Declarations No competing interests reported. Supplementary Files SupplementaryTable1.xlsx SupplementaryTable2.xlsx SupplementaryTable3.xlsx SupplementaryTable4.xlsx SupplementaryTable5.xlsx SupplementaryTable6.pdf SupplementaryTable7.xlsx SupplementaryFigure1.pdf SupplementaryMethods.pdf SupplementaryMaterial1.zip Cite Share Download PDF Status: Published Journal Publication published 02 Jul, 2025 Read the published version in Scientific Reports → Version 1 posted Editorial decision: Revision requested 07 May, 2025 Reviews received at journal 02 May, 2025 Reviewers agreed at journal 23 Apr, 2025 Reviews received at journal 19 Apr, 2025 Reviewers agreed at journal 02 Apr, 2025 Reviewers invited by journal 27 Mar, 2025 Editor assigned by journal 27 Mar, 2025 Editor invited by journal 11 Mar, 2025 Submission checks completed at journal 07 Mar, 2025 First submitted to journal 05 Mar, 2025 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-6165621","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Article","associatedPublications":[],"authors":[{"id":425559925,"identity":"0601b5ee-e118-49d5-a40e-3ee1ad7b47e7","order_by":0,"name":"Artem Bashkatov","email":"","orcid":"","institution":"HSE university","correspondingAuthor":false,"prefix":"","firstName":"Artem","middleName":"","lastName":"Bashkatov","suffix":""},{"id":425559926,"identity":"b075d313-a44f-4ddb-9e71-e7ce890e5607","order_by":1,"name":"Andrey Andreasyan","email":"","orcid":"","institution":"HSE university","correspondingAuthor":false,"prefix":"","firstName":"Andrey","middleName":"","lastName":"Andreasyan","suffix":""},{"id":425559927,"identity":"565176b8-1958-48f8-8e9f-018c683823e1","order_by":2,"name":"Dmitry Konovalov","email":"","orcid":"","institution":"HSE university","correspondingAuthor":false,"prefix":"","firstName":"Dmitry","middleName":"","lastName":"Konovalov","suffix":""},{"id":425559928,"identity":"fe5cfb9b-1135-4d23-bc1e-2288bc977e4f","order_by":3,"name":"Alan Herbert","email":"","orcid":"","institution":"InsideOutBio","correspondingAuthor":false,"prefix":"","firstName":"Alan","middleName":"","lastName":"Herbert","suffix":""},{"id":425559929,"identity":"e302ff66-6903-4df5-bc0d-03eb2b3619b7","order_by":4,"name":"Maria Popstova","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAAsUlEQVRIiWNgGAWjYFAD9h5SVB8AETxnSNYikUOkat3Zzccef/hjk9g/8+3hDx8Y7iU2ENJidudYusHBtrTEGbfz0iRnMBQToeVGjpnEwYbDxgy3c8yYeRgSiNRy4M9hY/mbZ4w//yFeC9thOYMbPAbSDERpAfnlbFuanOGZHDPJHoMEY8Jabjcfe1Dxx4ZH7vgZ4w8/KhJkCWphkGBgQ+IZEFSPoWUUjIJRMApGARYAAPhZQRnSYojKAAAAAElFTkSuQmCC","orcid":"","institution":"HSE university","correspondingAuthor":true,"prefix":"","firstName":"Maria","middleName":"","lastName":"Popstova","suffix":""}],"badges":[],"createdAt":"2025-03-05 21:53:13","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-6165621/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-6165621/v1","draftVersion":[],"editorialEvents":[{"content":"https://doi.org/10.1038/s41598-025-07579-1","type":"published","date":"2025-07-02T15:57:50+00:00"}],"editorialNote":"","failedWorkflow":false,"files":[{"id":86180153,"identity":"3c31887b-4932-4c3a-9c11-dbdcb9696d9c","added_by":"auto","created_at":"2025-07-07 16:21:26","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":1128086,"visible":true,"origin":"","legend":"","description":"","filename":"DeepGQ070325.pdf","url":"https://assets-eu.researchsquare.com/files/rs-6165621/v1_covered_bb0efc32-c2bf-40cd-b1c2-8bf0c423e45f.pdf"},{"id":78272088,"identity":"0f3ecd37-85b4-4faa-88af-cabd2ece0961","added_by":"auto","created_at":"2025-03-11 13:25:36","extension":"xlsx","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":97470,"visible":true,"origin":"","legend":"","description":"","filename":"SupplementaryTable1.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-6165621/v1/2d8bb3b1cdb59de4c6d94cea.xlsx"},{"id":78271070,"identity":"1b3849c0-1ce5-4fb9-9cc7-5d152ec1225e","added_by":"auto","created_at":"2025-03-11 13:17:36","extension":"xlsx","order_by":2,"title":"","display":"","copyAsset":false,"role":"supplement","size":9015,"visible":true,"origin":"","legend":"","description":"","filename":"SupplementaryTable2.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-6165621/v1/463da00a12342293e2d77abb.xlsx"},{"id":78272491,"identity":"35ddd862-e3de-487d-9fc2-9dc57af3b5f1","added_by":"auto","created_at":"2025-03-11 13:33:36","extension":"xlsx","order_by":3,"title":"","display":"","copyAsset":false,"role":"supplement","size":1219091,"visible":true,"origin":"","legend":"","description":"","filename":"SupplementaryTable3.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-6165621/v1/a6b9fd05f939fb5f29f04c5e.xlsx"},{"id":78272087,"identity":"151df02f-0805-43cc-b60f-42435d2baffd","added_by":"auto","created_at":"2025-03-11 13:25:36","extension":"xlsx","order_by":4,"title":"","display":"","copyAsset":false,"role":"supplement","size":30403,"visible":true,"origin":"","legend":"","description":"","filename":"SupplementaryTable4.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-6165621/v1/84bd2c38a7b9a7370f0b3633.xlsx"},{"id":78271073,"identity":"ddfc623d-19c7-4bdc-9132-9e39bf51794c","added_by":"auto","created_at":"2025-03-11 13:17:36","extension":"xlsx","order_by":5,"title":"","display":"","copyAsset":false,"role":"supplement","size":15996,"visible":true,"origin":"","legend":"","description":"","filename":"SupplementaryTable5.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-6165621/v1/901e11145dc4c4450674c1ce.xlsx"},{"id":78272492,"identity":"1d0df3e1-554e-481f-b2a5-9ba24709055a","added_by":"auto","created_at":"2025-03-11 13:33:36","extension":"pdf","order_by":7,"title":"","display":"","copyAsset":false,"role":"supplement","size":140852,"visible":true,"origin":"","legend":"","description":"","filename":"SupplementaryTable6.pdf","url":"https://assets-eu.researchsquare.com/files/rs-6165621/v1/a0ccd565971d8f5ddde0c68a.pdf"},{"id":78271077,"identity":"70a8413f-ad34-4c0d-a857-0fd1dd96ee92","added_by":"auto","created_at":"2025-03-11 13:17:36","extension":"xlsx","order_by":8,"title":"","display":"","copyAsset":false,"role":"supplement","size":255976,"visible":true,"origin":"","legend":"","description":"","filename":"SupplementaryTable7.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-6165621/v1/7896a1ce1d8ce6afca643d9c.xlsx"},{"id":78271082,"identity":"6d5bfd9c-3c4b-459f-bad5-89e2b16da4aa","added_by":"auto","created_at":"2025-03-11 13:17:37","extension":"pdf","order_by":9,"title":"","display":"","copyAsset":false,"role":"supplement","size":7961014,"visible":true,"origin":"","legend":"","description":"","filename":"SupplementaryFigure1.pdf","url":"https://assets-eu.researchsquare.com/files/rs-6165621/v1/2a2583e08b15023d28cfb602.pdf"},{"id":78271068,"identity":"09414694-482b-42a5-9453-24b8b48b5987","added_by":"auto","created_at":"2025-03-11 13:17:36","extension":"pdf","order_by":10,"title":"","display":"","copyAsset":false,"role":"supplement","size":48164,"visible":true,"origin":"","legend":"","description":"","filename":"SupplementaryMethods.pdf","url":"https://assets-eu.researchsquare.com/files/rs-6165621/v1/09fd87f3c0cedf0b41fc5c7e.pdf"},{"id":78271085,"identity":"2ca6478e-efda-4afb-8430-46e9b1df40ea","added_by":"auto","created_at":"2025-03-11 13:17:39","extension":"zip","order_by":11,"title":"","display":"","copyAsset":false,"role":"supplement","size":38221245,"visible":true,"origin":"","legend":"","description":"","filename":"SupplementaryMaterial1.zip","url":"https://assets-eu.researchsquare.com/files/rs-6165621/v1/4d408c5c8b2cdf05fb9b69a7.zip"}],"financialInterests":"No competing interests reported.","formattedTitle":"Deep Learning Deciphers the Related Role of Master Regulators and G-Quadruplexes in Tissue Specification","fulltext":[],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":false,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":true,"isAuthorSuppliedPdf":true,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":true,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"scientific-reports","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"scirep","sideBox":"Learn more about [Scientific Reports](http://www.nature.com/srep/)","snPcode":"","submissionUrl":"","title":"Scientific Reports","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"Scientific Reports","inReviewEnabled":true,"inReviewRevisionsEnabled":true},"keywords":"G-quadruplex, Flipons, R-loops, Tissue Differentiation, Chromatin, Deep Learning","lastPublishedDoi":"10.21203/rs.3.rs-6165621/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-6165621/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"G-quadruplexes (GQs) are non-canonical DNA structures encoded by G-flipons that may play critical roles in gene regulation and chromatin structure. Here, we explore the role of G-flipons in tissue specification. We present a deep learning-based framework for the genome-wide G-flipon predictions across 14 human tissue types. The model was trained with high confidence experimental maps of GQ forming sequences and ATAC-seq peaks, conjoined with the location of RNA polymerase, histones, and transcription factor binding sites. The training set was based on level 4-6 Endoquad GQ annotated sequences with predictions validated using the complete level 1-6 dataset. To identify tissue-specific regulatory patterns, we classified GQ promoter predictions as either 'core' or ‘tissue-specific’. The predicted GQ-dependent, tissue-specific expression of genes was confirmed using DAVID gene ontology tools and validated using the TissueEnrich and GTEx datasets. We further explored interactions between GQ structures and master regulator genes (MRGs) in promoter regions, revealing a notable overlap of MRGs and predicted sites of GQ formation, with colocation of both features within the same tissue-specific DNase hypersensitivity site and with proteins that modulate R-loop formation. Collectively, the findings highlight the transactions between G-flipons with MRG during development that underlie tissue specification.","manuscriptTitle":"Deep Learning Deciphers the Related Role of Master Regulators and G-Quadruplexes in Tissue Specification","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2025-03-11 13:17:31","doi":"10.21203/rs.3.rs-6165621/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"decision","content":"Revision requested","date":"2025-05-07T09:09:53+00:00","index":"","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2025-05-02T18:23:45+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"6212471552525896364615085582842567525","date":"2025-04-23T06:36:47+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2025-04-19T05:48:50+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"146201014647017300649116715062423824520","date":"2025-04-02T22:10:06+00:00","index":"hide","fulltext":""},{"type":"reviewersInvited","content":"","date":"2025-03-27T11:57:05+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2025-03-27T11:42:43+00:00","index":"","fulltext":""},{"type":"editorInvited","content":"","date":"2025-03-11T06:02:18+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2025-03-07T11:07:16+00:00","index":"","fulltext":""},{"type":"submitted","content":"Scientific Reports","date":"2025-03-05T21:45:39+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"scientific-reports","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"scirep","sideBox":"Learn more about [Scientific Reports](http://www.nature.com/srep/)","snPcode":"","submissionUrl":"","title":"Scientific Reports","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"Scientific Reports","inReviewEnabled":true,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"8bf69395-5156-43fc-ba7d-8331052bda99","owner":[],"postedDate":"March 11th, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"published-in-journal","subjectAreas":[{"id":45356684,"name":"Biological sciences/Biological techniques/Bioinformatics"},{"id":45356685,"name":"Biological sciences/Biological techniques/Biological models/Genetic models"},{"id":45356686,"name":"Biological sciences/Molecular biology/Transcription/Transcriptional regulatory elements"},{"id":45356687,"name":"Biological sciences/Biological techniques/Software"}],"tags":[],"updatedAt":"2025-07-07T16:15:06+00:00","versionOfRecord":{"articleIdentity":"rs-6165621","link":"https://doi.org/10.1038/s41598-025-07579-1","journal":{"identity":"scientific-reports","isVorOnly":false,"title":"Scientific Reports"},"publishedOn":"2025-07-02 15:57:50","publishedOnDateReadable":"July 2nd, 2025"},"versionCreatedAt":"2025-03-11 13:17:31","video":"","vorDoi":"10.1038/s41598-025-07579-1","vorDoiUrl":"https://doi.org/10.1038/s41598-025-07579-1","workflowStages":[]},"version":"v1","identity":"rs-6165621","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-6165621","identity":"rs-6165621","version":["v1"]},"buildId":"8U1c8b4HqxoKbykW_rLl7","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: preprint-html

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00