Barcoding Gaps and Sequencing Prioritisation in a Global Biodiversity Stronghold | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Article Barcoding Gaps and Sequencing Prioritisation in a Global Biodiversity Stronghold Thomas Luypaert, Izeni Farias, Tomas Hrbek, Carlos Peres, Torbjørn Haugaasen This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-9096952/v1 This work is licensed under a CC BY 4.0 License Status: Under Review Version 1 posted You are reading this latest preprint version Abstract Effective ecological monitoring increasingly depends on eDNA metabarcoding, yet the accuracy of these approaches hinges on the completeness and quality of reference databases. In tropical biodiversity strongholds, where conservation urgency is greatest, systematic gaps and biases in these databases remain poorly characterised. Here, we develop an integrated framework to quantify, explain, and prioritise barcode reference gaps for mammals from Brazil – the world’s largest tropical forest country. Evaluating barcode presence for 731 species across ten primer combinations spanning four mitochondrial loci, we find that fewer than half of species are represented in any single primer database, with interspecific barcode sharing reducing effective taxonomic resolution by up to 50% in recently radiated clades. Database gaps are strongly phylogenetically structured: coverage is highest in charismatic, species-poor lineages and lowest in species-rich clades dominated by small-bodied taxa. Critically, conservation status does not predict barcode inclusion, revealing a fundamental misalignment between molecular infrastructure and conservation need. Using hierarchical Bayesian models, we show that range size and time since scientific description are the strongest predictors of inclusion, and that gaps are driven primarily by clade-level research traditions rather than individual species traits - implying that closing them requires targeted investment across entire neglected lineages. We integrate these drivers with evolutionary distinctiveness and extinction risk into a Barcoding Priority Score that identifies sequencing priorities at global and national scales, and map these priorities across Brazilian biomes. Our transferable framework provides a scalable roadmap for guiding sequencing investments in hyperdiverse regions where biodiversity loss is outpacing molecular documentation. Earth and environmental sciences/Ecology/Ecological genetics Earth and environmental sciences/Ecology/Biodiversity Earth and environmental sciences/Ecology/Tropical ecology eDNA metabarcoding DNA barcoding gaps reference database completeness sequencing prioritisation evolutionary distinctiveness Barcoding Priority Score tropical biodiversity monitoring conservation genomics Full Text Additional Declarations There is NO Competing Interest. Supplementary Files Luypaertetal2026SupplementaryMaterials.pdf Luypaert_et_al_2026_Supplementary_Materials Cite Share Download PDF Status: Under Review Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-9096952","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Article","associatedPublications":[],"authors":[{"id":609595931,"identity":"475cd36c-2b02-4b9c-b465-0b8caa51cb99","order_by":0,"name":"Thomas Luypaert","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA6UlEQVRIiWNgGAWjYDACCQglA+PLMTDw4NfBA9UCU2ZgTLqWxAZCWuylm49u+PGHgcfgRu7BDx/3/EnfLnb2mATjDhvctsgcS7vZ2wbSkpcsOeOZQe7O2XlpEoxn0vA4LMfsBm8DSEuOgTTPAYPcDbdzzCQY2w7j1XLzD9hhOca//xwwSDcgRsttHjawFjNphgMGCYS13EhLuy3bJsEjeeZdmmXPAWPDDbfzki0S23D7hX1G8rGbb/7YyPEdzz1848cBOXmD27kHb3xswx1iUACMHYEcJH4CIQ1gwH+GKGWjYBSMglEwAgEA085TIdexP/0AAAAASUVORK5CYII=","orcid":"https://orcid.org/0000-0001-7491-7418","institution":"Norwegian University of Life Sciences (NMBU)","correspondingAuthor":true,"prefix":"","firstName":"Thomas","middleName":"","lastName":"Luypaert","suffix":""},{"id":609595932,"identity":"bbcccf6e-18d5-4364-bf04-9d7e5011b459","order_by":1,"name":"Izeni Farias","email":"","orcid":"https://orcid.org/0000-0002-1416-4351","institution":"","correspondingAuthor":false,"prefix":"","firstName":"Izeni","middleName":"","lastName":"Farias","suffix":""},{"id":609595933,"identity":"cd0f71aa-6a26-439d-9bc7-1e961d4e9c4c","order_by":2,"name":"Tomas Hrbek","email":"","orcid":"","institution":"Universidade Federal do Amazonas (UFAM)","correspondingAuthor":false,"prefix":"","firstName":"Tomas","middleName":"","lastName":"Hrbek","suffix":""},{"id":609595934,"identity":"830843e4-f882-45b6-9f85-748e2f3d73c5","order_by":3,"name":"Carlos Peres","email":"","orcid":"","institution":"University of East Anglia (UEA)","correspondingAuthor":false,"prefix":"","firstName":"Carlos","middleName":"","lastName":"Peres","suffix":""},{"id":609595935,"identity":"feb03734-4768-43e7-bd57-708d1ba19121","order_by":4,"name":"Torbjørn Haugaasen","email":"","orcid":"https://orcid.org/0000-0003-0901-5324","institution":"Norwegian University of Life Sciences","correspondingAuthor":false,"prefix":"","firstName":"Torbjørn","middleName":"","lastName":"Haugaasen","suffix":""}],"badges":[],"createdAt":"2026-03-11 17:20:14","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-9096952/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-9096952/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":105751806,"identity":"2998d7df-ef3e-4c69-bbbf-6e354fc9e5dc","added_by":"auto","created_at":"2026-03-30 15:45:06","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":5906155,"visible":true,"origin":"","legend":"","description":"","filename":"Luypaertetal2026.pdf","url":"https://assets-eu.researchsquare.com/files/rs-9096952/v1_covered_7ec718be-eb9e-4b68-b2f2-8a31263f2cc2.pdf"},{"id":105182054,"identity":"dd575099-1d69-4fcc-bf5f-0c4e423560a5","added_by":"auto","created_at":"2026-03-23 07:43:27","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":38672399,"visible":true,"origin":"","legend":"Luypaert_et_al_2026_Supplementary_Materials","description":"","filename":"Luypaertetal2026SupplementaryMaterials.pdf","url":"https://assets-eu.researchsquare.com/files/rs-9096952/v1/497768a1ade7355c148bdb09.pdf"}],"financialInterests":"There is \u003cb\u003eNO\u003c/b\u003e Competing Interest.","formattedTitle":"Barcoding Gaps and Sequencing Prioritisation in a Global Biodiversity Stronghold","fulltext":[],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":false,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":true,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":true,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":true,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"nature-portfolio","isNatureJournal":true,"hasQc":false,"allowDirectSubmit":false,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"","title":"Nature Portfolio","twitterHandle":"","acdcEnabled":false,"dfaEnabled":false,"editorialSystem":"ejp","reportingPortfolio":"","inReviewEnabled":true,"inReviewRevisionsEnabled":false},"keywords":"eDNA metabarcoding, DNA barcoding gaps, reference database completeness, sequencing prioritisation, evolutionary distinctiveness, Barcoding Priority Score, tropical biodiversity monitoring, conservation genomics","lastPublishedDoi":"10.21203/rs.3.rs-9096952/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-9096952/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"Effective ecological monitoring increasingly depends on eDNA metabarcoding, yet the accuracy of these approaches hinges on the completeness and quality of reference databases. In tropical biodiversity strongholds, where conservation urgency is greatest, systematic gaps and biases in these databases remain poorly characterised. Here, we develop an integrated framework to quantify, explain, and prioritise barcode reference gaps for mammals from Brazil – the world’s largest tropical forest country. Evaluating barcode presence for 731 species across ten primer combinations spanning four mitochondrial loci, we find that fewer than half of species are represented in any single primer database, with interspecific barcode sharing reducing effective taxonomic resolution by up to 50% in recently radiated clades. Database gaps are strongly phylogenetically structured: coverage is highest in charismatic, species-poor lineages and lowest in species-rich clades dominated by small-bodied taxa. Critically, conservation status does not predict barcode inclusion, revealing a fundamental misalignment between molecular infrastructure and conservation need. Using hierarchical Bayesian models, we show that range size and time since scientific description are the strongest predictors of inclusion, and that gaps are driven primarily by clade-level research traditions rather than individual species traits - implying that closing them requires targeted investment across entire neglected lineages. We integrate these drivers with evolutionary distinctiveness and extinction risk into a Barcoding Priority Score that identifies sequencing priorities at global and national scales, and map these priorities across Brazilian biomes. Our transferable framework provides a scalable roadmap for guiding sequencing investments in hyperdiverse regions where biodiversity loss is outpacing molecular documentation.","manuscriptTitle":"Barcoding Gaps and Sequencing Prioritisation in a Global Biodiversity Stronghold","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2026-03-23 07:43:02","doi":"10.21203/rs.3.rs-9096952/v1","editorialEvents":[],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"nature-communications","isNatureJournal":true,"hasQc":false,"allowDirectSubmit":false,"externalIdentity":"NCOMMS","sideBox":"Learn more about [Nature Communications](http://www.nature.com/ncomms/)","snPcode":"","submissionUrl":"https://mts-ncomms.nature.com/","title":"Nature Communications","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"ejp","reportingPortfolio":"Nature Communications","inReviewEnabled":true,"inReviewRevisionsEnabled":false}}],"origin":"","ownerIdentity":"4c19e41b-188e-4c55-a3f2-25a7121e8cd3","owner":[],"postedDate":"March 23rd, 2026","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"under-review","subjectAreas":[{"id":64868523,"name":"Earth and environmental sciences/Ecology/Ecological genetics"},{"id":64868524,"name":"Earth and environmental sciences/Ecology/Biodiversity"},{"id":64868525,"name":"Earth and environmental sciences/Ecology/Tropical ecology"}],"tags":[],"updatedAt":"2026-03-23T07:43:02+00:00","versionOfRecord":[],"versionCreatedAt":"2026-03-23 07:43:02","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-9096952","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-9096952","identity":"rs-9096952","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.