An assessment of the genomic structural variation landscape in Sub-Saharan African populations

preprint OA: gold CC-BY-4.0
📄 Open PDF Full text JSON View at publisher
Full text 15,829 characters · extracted from preprint-html · click to expand
An assessment of the genomic structural variation landscape in Sub-Saharan African populations | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Article An assessment of the genomic structural variation landscape in Sub-Saharan African populations Zané Lombard, Emma Wiener, Laura Cottino, Gerrit Botha, Oscar Nyangiri, and 16 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-4485126/v1 This work is licensed under a CC BY 4.0 License Status: Under Review Version 1 posted You are reading this latest preprint version Abstract Structural variants are responsible for a large part of genomic variation between individuals and play a role in both common and rare diseases. Databases cataloguing structural variants notably do not represent the full spectrum of global diversity, particularly missing information from most African populations. To address this representation gap, we analysed 1,091 high-coverage African genomes, 545 of which are public data sets, and 546 which have been analysed for structural variants for the first time. Variants were called using five different tools and datasets merged and jointly called using SURVIVOR. We identified 67,795 structural variants throughout the genome, with 10,421 genes having at least one variant. Using a conservative overlap in merged data, 6,414 of the structural variants (9.5%) are novel compared to the Database of Genomic Variants. This study contributes to knowledge of the landscape of structural variant diversity in Africa and presents a reliable dataset for potential applications in population genetics and health-related research. Biological sciences/Genetics/Population genetics/Genetic variation/Structural variation Biological sciences/Genetics/Genome/Genetic variation/Structural variation Structural variants African diversity copy number variants genomic variation Full Text Additional Declarations There is NO Competing Interest. Cite Share Download PDF Status: Under Review Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-4485126","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Article","associatedPublications":[],"authors":[{"id":313168848,"identity":"eb1c19c4-6f2d-404f-b057-6eb9cd86b8ac","order_by":0,"name":"Zané Lombard","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA+klEQVRIiWNgGAWjYNCCAgYGAyAl8YFBglgtBhAtkjMYJIjVA9UizcNAhDXy7WcPPmAwsIk2Zz/78LZtm0WdfAPzww8Mf+xwamHsyUs2YDBIy93Zk25sndsmIWFwgM1YgoEnGacWZoYcMwkGg8O5Gw6ksUnnbgNqYWAwA7qPGacWNv435j/AWs4/Y5O2BGqRb2D/BvRaPU4tPBI5QDNBWm4AbWEEamE4wAMUSTiMU4uExBtjiQSQX2Y8Y7bs/SchueEwT7FEwoHjOLXI9+cYfvhQYZO7nT+N8caPM3X88u3tGz98+FONUwsYJKDwmDFERsEoGAWjYBSQCgBqOUcUyZRPMQAAAABJRU5ErkJggg==","orcid":"https://orcid.org/0000-0002-7997-2616","institution":"University of the Witwatersrand","correspondingAuthor":true,"prefix":"","firstName":"Zané","middleName":"","lastName":"Lombard","suffix":""},{"id":313168849,"identity":"25c01a2e-c877-4eda-8639-b0dc4f92d06e","order_by":1,"name":"Emma Wiener","email":"","orcid":"https://orcid.org/0000-0001-9184-0264","institution":"University of the Witwatersrand","correspondingAuthor":false,"prefix":"","firstName":"Emma","middleName":"","lastName":"Wiener","suffix":""},{"id":313168850,"identity":"f28eb5a3-7612-4a8b-9a01-4452a2c2803a","order_by":2,"name":"Laura Cottino","email":"","orcid":"","institution":"University of the Witwatersrand","correspondingAuthor":false,"prefix":"","firstName":"Laura","middleName":"","lastName":"Cottino","suffix":""},{"id":313168851,"identity":"972a0146-9880-405d-bfb9-17292de9ad00","order_by":3,"name":"Gerrit Botha","email":"","orcid":"","institution":"University of Cape Town","correspondingAuthor":false,"prefix":"","firstName":"Gerrit","middleName":"","lastName":"Botha","suffix":""},{"id":313168852,"identity":"d6197859-ed6b-4330-9118-1ecb98f547d7","order_by":4,"name":"Oscar Nyangiri","email":"","orcid":"https://orcid.org/0000-0003-2741-2921","institution":"Makerere University","correspondingAuthor":false,"prefix":"","firstName":"Oscar","middleName":"","lastName":"Nyangiri","suffix":""},{"id":313168853,"identity":"7894975b-5c03-42ac-9590-98bdf5f54691","order_by":5,"name":"Harry Noyes","email":"","orcid":"","institution":"University of Liverpool","correspondingAuthor":false,"prefix":"","firstName":"Harry","middleName":"","lastName":"Noyes","suffix":""},{"id":313168854,"identity":"ca785fa8-97c3-443c-817d-d00ff2ea6591","order_by":6,"name":"Annette MacLeod","email":"","orcid":"https://orcid.org/0000-0002-0150-5049","institution":"University of Glasgow","correspondingAuthor":false,"prefix":"","firstName":"Annette","middleName":"","lastName":"MacLeod","suffix":""},{"id":313168855,"identity":"b29d05d2-e245-492c-ab4c-712d45a6c770","order_by":7,"name":"David Jakubosky","email":"","orcid":"","institution":"University of California, San Diego","correspondingAuthor":false,"prefix":"","firstName":"David","middleName":"","lastName":"Jakubosky","suffix":""},{"id":313168856,"identity":"dd0d0241-ec1f-455b-a021-047929ca67ef","order_by":8,"name":"Clement Adebamowo","email":"","orcid":"","institution":"University of Maryland, Baltimore","correspondingAuthor":false,"prefix":"","firstName":"Clement","middleName":"","lastName":"Adebamowo","suffix":""},{"id":313168857,"identity":"2932afa1-e9ce-4c8e-8993-1a58ce7215b2","order_by":9,"name":"Philip Awadalla","email":"","orcid":"https://orcid.org/0000-0001-9946-6393","institution":"Ontario Institute for Cancer Research","correspondingAuthor":false,"prefix":"","firstName":"Philip","middleName":"","lastName":"Awadalla","suffix":""},{"id":313168858,"identity":"aa6d6147-583e-4965-8b30-0dba31a0daa8","order_by":10,"name":"Guida Landouré","email":"","orcid":"","institution":"USTTB","correspondingAuthor":false,"prefix":"","firstName":"Guida","middleName":"","lastName":"Landouré","suffix":""},{"id":313168859,"identity":"b33f006a-5547-4d90-92f2-4ef0c7e2fa84","order_by":11,"name":"Mogomotsi Matshaba","email":"","orcid":"","institution":"Baylor College of Medicine","correspondingAuthor":false,"prefix":"","firstName":"Mogomotsi","middleName":"","lastName":"Matshaba","suffix":""},{"id":313168860,"identity":"df0075c2-b90d-4c9f-98f9-76625dfb4872","order_by":12,"name":"Enock Matovu","email":"","orcid":"","institution":"Makerere University","correspondingAuthor":false,"prefix":"","firstName":"Enock","middleName":"","lastName":"Matovu","suffix":""},{"id":313168861,"identity":"46a01647-9f54-45ca-8a7d-f72210a003b1","order_by":13,"name":"Michele Ramsay","email":"","orcid":"https://orcid.org/0000-0002-4156-4801","institution":"University of the Witwatersrand","correspondingAuthor":false,"prefix":"","firstName":"Michele","middleName":"","lastName":"Ramsay","suffix":""},{"id":313168862,"identity":"06637cf1-5ca0-4db6-877c-b28e210aa8ef","order_by":14,"name":"Gustave Simo","email":"","orcid":"","institution":"University of Dschang","correspondingAuthor":false,"prefix":"","firstName":"Gustave","middleName":"","lastName":"Simo","suffix":""},{"id":313168863,"identity":"bbe6db2d-1e02-4eeb-a70b-53a3545faa9b","order_by":15,"name":"Martin Simuunza","email":"","orcid":"https://orcid.org/0000-0001-6621-7470","institution":"University of Zambia","correspondingAuthor":false,"prefix":"","firstName":"Martin","middleName":"","lastName":"Simuunza","suffix":""},{"id":313168864,"identity":"1d7da297-c824-4edb-b336-8cfa3b2dd407","order_by":16,"name":"Caroline Tiemessen","email":"","orcid":"https://orcid.org/0000-0002-0991-1690","institution":"National Institute for Communicable Diseases","correspondingAuthor":false,"prefix":"","firstName":"Caroline","middleName":"","lastName":"Tiemessen","suffix":""},{"id":313168865,"identity":"d79ae2f4-a168-4721-bc43-7fd25c7a168f","order_by":17,"name":"Ambroise Wonkam","email":"","orcid":"https://orcid.org/0000-0003-1420-9051","institution":"Johns Hopkins University","correspondingAuthor":false,"prefix":"","firstName":"Ambroise","middleName":"","lastName":"Wonkam","suffix":""},{"id":313168866,"identity":"33365ee8-a2cc-4d84-878d-514a3d2e19bb","order_by":18,"name":"Venesa Sahibdeen","email":"","orcid":"","institution":"University of the Witwatersrand","correspondingAuthor":false,"prefix":"","firstName":"Venesa","middleName":"","lastName":"Sahibdeen","suffix":""},{"id":313168867,"identity":"3b44bf37-84b6-4517-a0e3-7c6dd9504eb4","order_by":19,"name":"Amanda Krause","email":"","orcid":"","institution":"University of the Witwatersrand","correspondingAuthor":false,"prefix":"","firstName":"Amanda","middleName":"","lastName":"Krause","suffix":""},{"id":313168868,"identity":"0e70f635-b365-4add-86e8-2d09264822c7","order_by":20,"name":"Scott Hazelhurst","email":"","orcid":"https://orcid.org/0000-0002-0581-149X","institution":"Sydney Brenner Institute for Molecular Bioscience, Faculty of Health Sciences \u0026 School of Electrical \u0026 Information Engineering, University of the Witwatersrand","correspondingAuthor":false,"prefix":"","firstName":"Scott","middleName":"","lastName":"Hazelhurst","suffix":""}],"badges":[],"createdAt":"2024-05-27 12:50:39","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-4485126/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-4485126/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":59825752,"identity":"bb2955a0-03b3-4a85-a9af-b06e4bbf0d3a","added_by":"auto","created_at":"2024-07-08 05:48:01","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":434881,"visible":true,"origin":"","legend":"","description":"","filename":"h3asvFINAL20240524.pdf","url":"https://assets-eu.researchsquare.com/files/rs-4485126/v1_covered_394ea8c6-af68-4bdf-b8fb-873038fa9fa3.pdf"}],"financialInterests":"There is \u003cb\u003eNO\u003c/b\u003e Competing Interest.","formattedTitle":"An assessment of the genomic structural variation landscape in Sub-Saharan African populations","fulltext":[],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":false,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":true,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":true,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"nature-portfolio","isNatureJournal":true,"hasQc":false,"allowDirectSubmit":false,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"","title":"Nature Portfolio","twitterHandle":"","acdcEnabled":false,"dfaEnabled":false,"editorialSystem":"ejp","reportingPortfolio":"","inReviewEnabled":true,"inReviewRevisionsEnabled":false},"keywords":"Structural variants, African diversity, copy number variants, genomic variation","lastPublishedDoi":"10.21203/rs.3.rs-4485126/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-4485126/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"Structural variants are responsible for a large part of genomic variation between individuals and play a role in both common and rare diseases. Databases cataloguing structural variants notably do not represent the full spectrum of global diversity, particularly missing information from most African populations. To address this representation gap, we analysed 1,091 high-coverage African genomes, 545 of which are public data sets, and 546 which have been analysed for structural variants for the first time. Variants were called using five different tools and datasets merged and jointly called using SURVIVOR. We identified 67,795 structural variants throughout the genome, with 10,421 genes having at least one variant. Using a conservative overlap in merged data, 6,414 of the structural variants (9.5%) are novel compared to the Database of Genomic Variants. This study contributes to knowledge of the landscape of structural variant diversity in Africa and presents a reliable dataset for potential applications in population genetics and health-related research.","manuscriptTitle":"An assessment of the genomic structural variation landscape in Sub-Saharan African populations","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2024-07-08 05:39:54","doi":"10.21203/rs.3.rs-4485126/v1","editorialEvents":[],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"communications-biology","isNatureJournal":true,"hasQc":false,"allowDirectSubmit":false,"externalIdentity":"commsbio","sideBox":"Learn more about [Communications Biology](http://www.nature.com/commsbio/)","snPcode":"","submissionUrl":"","title":"Communications Biology","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"ejp","reportingPortfolio":"Communications Series","inReviewEnabled":true,"inReviewRevisionsEnabled":false}}],"origin":"","ownerIdentity":"2b6933a8-5e45-437e-81aa-4bc5f5d3fdd6","owner":[],"postedDate":"July 8th, 2024","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"under-review","subjectAreas":[{"id":33109032,"name":"Biological sciences/Genetics/Population genetics/Genetic variation/Structural variation"},{"id":33109033,"name":"Biological sciences/Genetics/Genome/Genetic variation/Structural variation"}],"tags":[],"updatedAt":"2024-07-08T05:39:54+00:00","versionOfRecord":[],"versionCreatedAt":"2024-07-08 05:39:54","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-4485126","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-4485126","identity":"rs-4485126","version":["v1"]},"buildId":"qtupq5eGEP_6zYnWcrvyt","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: preprint-html

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2024) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00
unpaywall
last seen: 2026-05-21T05:10:58.409756+00:00
License: CC-BY-4.0