Draft genome of the Cuban painted land snail Polymita picta, International Mollusc of the Year 2022

preprint OA: closed
Full text JSON View at publisher
Full text 49,622 characters · extracted from preprint-html · click to expand
Draft genome of the Cuban painted land snail Polymita picta, International Mollusc of the Year 2022 | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Data Note Draft genome of the Cuban painted land snail Polymita picta, International Mollusc of the Year 2022 Bernardo Reyes-Tur, Zeyuan Chen, Mario Juan Gordillo-Pérez, Alexander Ben Hamadou, and 3 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-6464041/v1 This work is licensed under a CC BY 4.0 License Status: Published Journal Publication published 03 Sep, 2025 Read the published version in BMC Genomic Data → Version 1 posted 11 You are reading this latest preprint version Abstract Objective: The Cuban painted land snail is an iconic endemic forest snail species with distinctive colourful shells used in traditional handicrafts. This species won the International Mollusc of the Year 2022 competition in an open public vote. As the competition prize, we have assembled the draft genome of this species. Data description: Genomic DNA from Polymita picta (Born, 1778) was sequenced using PacBio HiFi sequencing with a yield of 5.3 million reads and an N50 of 8.1 Kbp. The genome size of P. picta was estimated to be 2.9 Gbp, and the final assembly was 1.85 Gbp, with a total of 22,628 contigs and a contig N50 of 124.2 Kbp. BUSCO analysis of the genome assembly indicated a genome completeness of 88.4%, with 7% complete duplicated BUSCOs. The draft genome will be a valuable resource for work on the endangered Cuban painted land snail including monitoring genetic diversity and establishing captive breeding for conservation. Mollusca adaptive radiation Gastropoda Stylommatophora Cuba conservation genomics Objective The Cuban painted land snail, Polymita picta (Born, 1778), is a visually iconic species recognised for its dramatic shell colour polymorphism [ 1 , 2 ]. Polymita picta is one member of a range-restricted genus, it is endemic to eastern Cuba, and threatened by habitat loss and illegal collection [ 3 , 4 ]. This species is a focal subject for evolutionary biology, ecology, and conservation [ 4 – 7 ]. Despite its charisma and growing interest from both scientists and collectors [ 4 , 5 ], P. picta remains genetically understudied with only one short DNA fragment previously published [ 8 ], and one recently published mitochondrial genome [ 9 ]. Genomic resources for land snails (Stylommatophora) remain underrepresented in molluscan genomics compared to marine bivalves and cephalopods, despite the high species richness and economic importance of land snails. Many available genomes are highly fragmented drafts, limited by the large genome sizes and high repeat content characteristic of gastropods [ 10 ], which complicate assembly. A high-quality P. picta reference genome will provide a valuable genetic resource to support genetic diversity assessment and conservation efforts, as well as to advance comparative genomic studies of land snails, particularly in the context of speciation, adaptation, and divergence. The International Mollusc of the Year competition has been run annually through the Senckenberg Research Institute and Museum, Frankfurt, Germany, in order to bring more attention to the need for increasing genomic resources for molluscs. The winning species is selected by popular vote from five finalists, and as a prize the genome of the winning species is sequenced and assembled, such as this draft for P. picta . Data description The sample used for genome sequencing was a brown shelled individual with supernumerary spiral bands (juvenile), collected by hand from El Diamante, Maisí municipality, Guantánamo province, Cuba. Genome size was estimated following a propidium iodide-stained nuclei flow cytometry (FCM) protocol [ 11 ], using the same P. picta individual used for PacBio sequencing. Neural tissue from Acheta domesticus (female, 1C = 2 Gb) and chicken erythrocyte nuclei (1C = 1.2 Gb) were used as internal reference standards. The genome size (1C) of P. picta was estimated by averaging measurements from three replicates taken on three different days to reduce random instrumental error. Genomic DNA was extracted using a CTAB-based method [ 12 ]. We prepared one PacBio ultra-low input library including a long-range PCR amplification step using the SMRTbell® gDNA Sample Amplification Kit and the SMRTbell® Express Template Preparation Kit 2.0. In addition, to reduce potential PCR biases of the amplification polymerases, we prepared one further library using the KOD Xtreme™ Hot Start DNA Polymerase (Merck), optimized for amplification of long and GC-rich DNA templates. For this, we combined the buffer, dNTPs and KOD polymerase from the KOD Xtreme Hot Start DNA Polymerase Kit with the ultra-low input primers from the PacBio SMRTbell gDNA Sample Amplification Kit. Otherwise, we followed the PacBio ultra-low input protocol [ 13 , 14 ]. These two ultra-low input libraries were then pooled in equal mass quantities and sequenced on a single Revio SMRTcell. HiFi reads were called using a pipeline, which is running PacBio’s tools ccs 6.4.0 [ 15 ], actc 0.3.1[ 16 ], samtools 1.15 [ 17 ] and DeepConsensus 1.2.0 [ 18 ]. PCR adapter sequences and duplicates were removed using lima v2.9.0 [ 19 ] and pbmarkdup v1.0.3 [ 20 ], resulting in approximately 5.3 million HiFi reads with an N50 of 8.1kb (Table 1 , Data file 1, Data set 1, [ 21 , 22 ]), providing approximately 14-fold coverage of the genome of P. picta that was estimated at 2.9 Gb using flow cytometry. Genome size and heterozygosity were estimated from a k-mer profile of the HiFi reads using Jellyfish 2.3.0 [ 23 ] and GenomeScope v1.0 [ 24 ]. The kmer-based genome size was estimated to be 3.06 Gb, with a heterozygosity about 2.41% (Table 1 , Data file 2, [ 21 ]). The cleaned HiFi reads were assembled with hifiasm v0.16.1 [ 25 ], and Purge_haplotigs was used to remove duplicated contigs from the genome [ 26 ]. The final P. picta genome assembly consists of 22,628 contigs with a total length of 1.85 Gb, a contig N50 of 124.16 Kb (Table 1 , Data file 3, Data set 2, [ 21 ]). Genome completeness was assessed by BUSCO (Benchmarkding Universal Single-Copy Orthologs) v5.4.3 [ 27 ], in euk_genome_met mode using the lineage dataset metazoa_odb10. The results showed 844 of the 954 BUSCO genes (88.4%) were complete, with 777 complete single copy (81.4%), 67 complete duplicates (7.0%), 70 genes were fragmented (7.3%), and 40 BUSCO genes were missing (4.3%) (Table 1 , Data file 3, [ 21 ]). The draft genome presents a valuable resource for monitoring genetic diversity, breeding, resilience to climate change and conservation of the endangered Cuban painted snail. Table 1 Overview of data files/data sets. Label Name of data file/data set File types (file extension) Data repository and identifier (DOI or accession number) Data file 1 HiFi sequencing of P. picta Excel (.xlsx) https://doi.org/10.6084/m9.figshare.28790750.v3[21] Data file 2 Genome survey of P. picta Portable Document Format (.png) https://doi.org/10.6084/m9.figshare.28790750.v3[21] Data file 3 Genome assembly of P. picta Excel (.xlsx) https://doi.org/10.6084/m9.figshare.28790750.v3[21] Data set 1 Pacbio sequence reads of P. picta genomic DNA Fastq files (.fastq) NCBI Sequence Read Archive ( https://www.ncbi.nlm.nih.gov/bioproject/PRJNA1250545 )[ 22 ] Data set 2 Genome assembly of P.picta Fasta file (.fna) https://doi.org/10.6084/m9.figshare.28790750.v3[21] Limitations The polymerase amplification process has yielded a promising amount of HiFi reads for P. picta ; however, it also shortens the average HiFi read length, resulting in a more fragmented assembly. In addition, the relatively low sequencing coverage (14× relative to the genome size) leaves some genomic regions unassembled. The high heterozygosity of P. picta further complicates the assembly process, contributing to a significant proportion of the genome representing alternative haplotypes. In addition, the discrepancy between the assembly and the estimated genome size may be due to highly repetitive sequences. Nonetheless, the assembly quality is comparable to that of other published genomes of land snails and slugs, and it now provides a valuable genetic resource for comparative genomics and the discovery of lineage-specific and adaptive genes. Abbreviations PacBio: Pacific Biosciences Sequel II SMRT: Single Molecule Real Time BUSCO: Benchmarkding Universal Single-Copy Orthologs Declarations Ethics approval and consent to participate Specimens were collected under permits to BRT and exported with authorisation from the relevant authorities in Cuba including prior informed consent and material transfer agreements as addenda to an established collaboration contract between Senckenberg Society and University of Oriente, Santiago de Cuba. Consent for publication Not applicable. Availability of data and materials The data described in this Data note can be freely and openly accessed on NCBI under BioProject ID PRJNA1250545 [22], and figshare (https://doi.org/10.6084/m9.figshare.28790750.v3)[21]. Please see Table 1 for details and links to the data. Competing interests The authors declare they have no competing interests. Funding Mollusc of the Year was a project funded under the LOEWE Translational Biodiversity Genomics initiative. Authors’ contributions BRT, JDS, CGr conceived the project. BRT collected the samples. ABH and CGe performed DNA extraction and library preparation, ZC performed bioinformatic analysis, BRT, MJG performed investigation. ZC, JDS prepared the original draft, all authors reviewed, edited and approved the final manuscript. Acknowledgements We thank the Bioscientia Institut für Medizinische Diagnostik GmbH for providing the PacBio SMRT sequencing service on the PacBio Revio platform. BRT is grateful to J.M. Koene and A. Davison for the first advise about The International Mollusc of the Year competition and the 10092 participants that voted for Polymita picta . MJGP and BRT were partially financed by VLIR-UOS and Global Minds funds from Hasselt University, Belgium. References Torre de la C. El género Polymita. Memorias de la Sociedad Cubana de Historia Natural “Felipe Poey”. 1950; 20:1–20. Gordillo-Pérez MJ, Beenaerts N, Sánchez DL, Smeets K, Arias-Sosa YC, Reyes-Tur B. Shell colour luminance of Cuban painted snails, Polymita picta and Polymita muscarum (Gastropoda: Cepolidae). PLoS ONE. 2025;20:e0314008. Milera J, Martínez JR. Polymita . 1st ed. La Habana: Editorial Científico Técnica; 1987. González-Guillén A. Polymita : the most beautiful land snail of the world. Memphis: Fundcraft Publishing, Collierville; 2021. Berovides V, Valdés G, Milera J. Polimorphismo genetico de Polymita picta roseolimbata Torre, 1950 en la region de Maisi, Cuba. Caribbaean Journal of Science. 1986;22:179–89. Reyes-Tur B, Allen JA, Cuellar-Araujo N, Hernández N, Lodi M, Méndez-Hernández AA, et al. Mating behaviour, dart shape and spermatophore morphology of the Cuban tree snail Polymita picta (Born, 1780). Journal of Molluscan Studies. 2015;81:187–95. Cancio-Martínez E, Alfonso-Sánchez M, Espinosa-López G. Polimorfismo de las esterasas de músculo en las especies de Polymita . Ciencias Biológicas. 1987;18:26–32. Koene JM, Schulenburg H. Shooting darts: co-evolution and counter-adaptation in hermaphroditic snails. BMC Evol Biol. 2005;5:25. Lewis A, Reyes-Tur B, Davison A. The mitochondrial genome and phylogenetic placement of the world’s most beautiful snail, Polymita picta. Journal of Molluscan Studies. 2025; (in press). https://doi.org/10.1093/mollus/eyaf006. Chen Z, Baeza JA, Chen C, Gonzalez MT, González VL, Greve C, et al. A genome-based phylogeny for Mollusca is concordant with fossils and morphology. Science. 2025;387:1001–7. Hare EE, Johnston JS. Genome Size Determination Using Flow Cytometry of Propidium Iodide-Stained Nuclei. In: Orgogozo V, Rockman MV, editors. Molecular Methods for Evolutionary Genetics. Totowa, NJ: Humana Press; 2012. p. 3–12. Murray MG, Thompson WF. Rapid isolation of high molecular weight plant DNA. Nucl Acids Res. 1980;8:4321–6. Männer L, Schell T, Spies J, Galià-Camps C, Baranski D, Ben Hamadou A, et al. Chromosome-level genome assembly of the sacoglossan sea slug Elysia timida (Risso, 1818). BMC Genomics. 2024;25:941. Bein B, Chrysostomakis I, Arantes LS, Brown T, Gerheim C, Schell T, et al. Long-read sequencing and genome assembly of natural history collection samples and challenging specimens. Genome Biol. 2025;26:25. CCS - Generate PacBio HiFi data. https://github.com/PacificBiosciences/ccs. Accessed 14 April 2025. Align clr to ccs reads. https://github.com/PacificBiosciences/actc. Accessed 14 April 2025. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–9. Baid G, Cook DE, Shafin K, Yun T, Llinares-López F, Berthet Q, et al. DeepConsensus improves the accuracy of sequences with a gap-aware sequence transformer. Nat Biotechnol. 2022. https://doi.org/10.1038/s41587-022-01435-7. Lima - Demultiplex PacBio data. https://github.com/PacificBiosciences/barcoding. Accessed 14 April 2025. Mark duplicate reads from PacBio sequencing of an amplified library. https://github.com/PacificBiosciences/pbmarkdup. Accessed 14 April 2025. Reyes-Tur B, Chen Z, Gordillo-Pérez MJ, Hamadou AB, Gerheim C, Greve C, et al. Draft genome of the Cuban painted land snail Polymita picta, International Mollusc of the Year 2022. figshare. Dataset. https://doi.org/10.6084/m9.figshare.28790750. Accessed 14 April 2025. Reyes-Tur B, Chen Z, Gordillo-Pérez MJ, Hamadou AB, Gerheim C, Greve C, et al. NCBI GeneBank. https://identifiers.org/ncbi/bioproject:PRJNA1250545. Accessed 15 April 2025. Marçais G, Kingsford C. A fast, lock-free approach for efficient parallel counting of occurrences of k -mers. Bioinformatics. 2011;27:764–70. Ranallo-Benavidez TR, Jaron KS, Schatz MC. GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes. Nat Commun. 2020;11:1432. Cheng H, Concepcion GT, Feng X, Zhang H, Li H. Haplotype-resolved de novo assembly using phased assembly graphs with Hifiasm. Nat Methods. 2021;18:170–5. Roach MJ, Schmidt SA, Borneman AR. Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies. BMC Bioinformatics. 2018;19:460. Seppey M, Manni M, Zdobnov EM. BUSCO: Assessing Genome Assembly and Annotation Completeness. In: Kollmar M, editor. Gene Prediction. New York, NY: Springer New York; 2019. p. 227–45. Additional Declarations No competing interests reported. Cite Share Download PDF Status: Published Journal Publication published 03 Sep, 2025 Read the published version in BMC Genomic Data → Version 1 posted Editorial decision: Revision requested 23 May, 2025 Reviews received at journal 23 May, 2025 Reviews received at journal 20 May, 2025 Reviewers agreed at journal 19 May, 2025 Reviewers agreed at journal 19 May, 2025 Reviews received at journal 16 May, 2025 Reviewers agreed at journal 06 May, 2025 Reviewers invited by journal 30 Apr, 2025 Editor assigned by journal 18 Apr, 2025 Submission checks completed at journal 18 Apr, 2025 First submitted to journal 16 Apr, 2025 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-6464041","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Data Note","associatedPublications":[],"authors":[{"id":445836917,"identity":"8957d761-fe52-435b-abd0-515aff882d3b","order_by":0,"name":"Bernardo Reyes-Tur","email":"","orcid":"","institution":"Universidad de Oriente","correspondingAuthor":false,"prefix":"","firstName":"Bernardo","middleName":"","lastName":"Reyes-Tur","suffix":""},{"id":445836918,"identity":"47b7204e-8cfb-4c04-bbbc-abf0e381a943","order_by":1,"name":"Zeyuan Chen","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAABGElEQVRIie2RMUsDMRiGv6NwdfjaOcdh8xcShHTQH3NTXW4RF4VaOt0tuV3xT+im4BAI3C2hc6VCb+pUh25XUPFCR1PUTTDPFF7y8OYlAB7PX4YBBLU9EIBwF6nvlQ77tRKSHyk0l4JsH4AOu0V52Vy80CgvynrzdAX9aupUAmlEVBjgj3I2WqBZ8VucnfLrVQWRcdd0SCpIL4OEzVOxCDKdDNokRlUCmydOJaRrEb1bZbkWZ9uPVmmT+M0qy9qpIEER71pQQG+qk9gmoMZti3s+wdH58WFG+J1Jj2IsNb+R6ZBLpTAy7ofRXN8/v2YnlFWGb5qxpqQyom7UZNCv3PMBDuwHki+xxj33W7rukZP9hsfj8fw3PgETel+QzKYyywAAAABJRU5ErkJggg==","orcid":"","institution":"Senckenberg Research Institute and Museum","correspondingAuthor":true,"prefix":"","firstName":"Zeyuan","middleName":"","lastName":"Chen","suffix":""},{"id":445836919,"identity":"9487e7bf-8cc0-463d-a1b5-c5698ff756ba","order_by":2,"name":"Mario Juan Gordillo-Pérez","email":"","orcid":"","institution":"Hasselt University","correspondingAuthor":false,"prefix":"","firstName":"Mario","middleName":"Juan","lastName":"Gordillo-Pérez","suffix":""},{"id":445836920,"identity":"ab29aeff-7570-48a1-a430-a9ca5ed66c6b","order_by":3,"name":"Alexander Ben Hamadou","email":"","orcid":"","institution":"Senckenberg Research Institute and Museum","correspondingAuthor":false,"prefix":"","firstName":"Alexander","middleName":"Ben","lastName":"Hamadou","suffix":""},{"id":445836921,"identity":"15693a59-83b8-417f-8944-ae21ea706d93","order_by":4,"name":"Charlotte Gerheim","email":"","orcid":"","institution":"Senckenberg Research Institute and Museum","correspondingAuthor":false,"prefix":"","firstName":"Charlotte","middleName":"","lastName":"Gerheim","suffix":""},{"id":445836922,"identity":"173cc5f1-3a4a-47e8-94bd-3d7138fe0cdd","order_by":5,"name":"Carola Greve","email":"","orcid":"","institution":"Senckenberg Research Institute and Museum","correspondingAuthor":false,"prefix":"","firstName":"Carola","middleName":"","lastName":"Greve","suffix":""},{"id":445836923,"identity":"7c8557d7-760a-4e0c-8bcb-8a5b0f5f6745","order_by":6,"name":"Julia D. Sigwart","email":"","orcid":"","institution":"Senckenberg Research Institute and Museum","correspondingAuthor":false,"prefix":"","firstName":"Julia","middleName":"D.","lastName":"Sigwart","suffix":""}],"badges":[],"createdAt":"2025-04-16 13:38:21","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-6464041/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-6464041/v1","draftVersion":[],"editorialEvents":[{"content":"https://doi.org/10.1186/s12863-025-01356-9","type":"published","date":"2025-09-03T15:57:00+00:00"}],"editorialNote":"","failedWorkflow":false,"files":[{"id":90828067,"identity":"0c2f938b-5c33-4f41-b552-5d97cda05f76","added_by":"auto","created_at":"2025-09-08 16:05:28","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":458656,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-6464041/v1/3d0af8ba-4af6-42ff-b874-c49af0a126f5.pdf"}],"financialInterests":"No competing interests reported.","formattedTitle":"Draft genome of the Cuban painted land snail Polymita picta, International Mollusc of the Year 2022","fulltext":[{"header":"Objective","content":"\u003cp\u003eThe Cuban painted land snail, \u003cem\u003ePolymita picta\u003c/em\u003e (Born, 1778), is a visually iconic species recognised for its dramatic shell colour polymorphism [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e, \u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e]. \u003cem\u003ePolymita picta\u003c/em\u003e is one member of a range-restricted genus, it is endemic to eastern Cuba, and threatened by habitat loss and illegal collection [\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e, \u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e]. This species is a focal subject for evolutionary biology, ecology, and conservation [\u003cspan additionalcitationids=\"CR5 CR6\" citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e]. Despite its charisma and growing interest from both scientists and collectors [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e, \u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e], \u003cem\u003eP. picta\u003c/em\u003e remains genetically understudied with only one short DNA fragment previously published [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e], and one recently published mitochondrial genome [\u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e]. Genomic resources for land snails (Stylommatophora) remain underrepresented in molluscan genomics compared to marine bivalves and cephalopods, despite the high species richness and economic importance of land snails. Many available genomes are highly fragmented drafts, limited by the large genome sizes and high repeat content characteristic of gastropods [\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e], which complicate assembly. A high-quality \u003cem\u003eP. picta\u003c/em\u003e reference genome will provide a valuable genetic resource to support genetic diversity assessment and conservation efforts, as well as to advance comparative genomic studies of land snails, particularly in the context of speciation, adaptation, and divergence.\u003c/p\u003e \u003cp\u003eThe International Mollusc of the Year competition has been run annually through the Senckenberg Research Institute and Museum, Frankfurt, Germany, in order to bring more attention to the need for increasing genomic resources for molluscs. The winning species is selected by popular vote from five finalists, and as a prize the genome of the winning species is sequenced and assembled, such as this draft for \u003cem\u003eP. picta\u003c/em\u003e.\u003c/p\u003e"},{"header":"Data description","content":"\u003cp\u003eThe sample used for genome sequencing was a brown shelled individual with supernumerary spiral bands (juvenile), collected by hand from El Diamante, Mais\u0026iacute; municipality, Guant\u0026aacute;namo province, Cuba. Genome size was estimated following a propidium iodide-stained nuclei flow cytometry (FCM) protocol [\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e], using the same \u003cem\u003eP. picta\u003c/em\u003e individual used for PacBio sequencing. Neural tissue from \u003cem\u003eAcheta domesticus\u003c/em\u003e (female, 1C\u0026thinsp;=\u0026thinsp;2 Gb) and chicken erythrocyte nuclei (1C\u0026thinsp;=\u0026thinsp;1.2 Gb) were used as internal reference standards. The genome size (1C) of \u003cem\u003eP. picta\u003c/em\u003e was estimated by averaging measurements from three replicates taken on three different days to reduce random instrumental error. Genomic DNA was extracted using a CTAB-based method [\u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e12\u003c/span\u003e]. We prepared one PacBio ultra-low input library including a long-range PCR amplification step using the SMRTbell\u0026reg; gDNA Sample Amplification Kit and the SMRTbell\u0026reg; Express Template Preparation Kit 2.0. In addition, to reduce potential PCR biases of the amplification polymerases, we prepared one further library using the KOD Xtreme\u0026trade; Hot Start DNA Polymerase (Merck), optimized for amplification of long and GC-rich DNA templates. For this, we combined the buffer, dNTPs and KOD polymerase from the KOD Xtreme Hot Start DNA Polymerase Kit with the ultra-low input primers from the PacBio SMRTbell gDNA Sample Amplification Kit. Otherwise, we followed the PacBio ultra-low input protocol [\u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e13\u003c/span\u003e, \u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e]. These two ultra-low input libraries were then pooled in equal mass quantities and sequenced on a single Revio SMRTcell.\u003c/p\u003e \u003cp\u003eHiFi reads were called using a pipeline, which is running PacBio\u0026rsquo;s tools ccs 6.4.0 [\u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e], actc 0.3.1[\u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e], samtools 1.15 [\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e] and DeepConsensus 1.2.0 [\u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e]. PCR adapter sequences and duplicates were removed using lima v2.9.0 [\u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e19\u003c/span\u003e] and pbmarkdup v1.0.3 [\u003cspan citationid=\"CR20\" class=\"CitationRef\"\u003e20\u003c/span\u003e], resulting in approximately 5.3\u0026nbsp;million HiFi reads with an N50 of 8.1kb (Table\u0026nbsp;\u003cspan refid=\"Tab1\" class=\"InternalRef\"\u003e1\u003c/span\u003e, Data file 1, Data set 1, [\u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e21\u003c/span\u003e, \u003cspan citationid=\"CR22\" class=\"CitationRef\"\u003e22\u003c/span\u003e]), providing approximately 14-fold coverage of the genome of \u003cem\u003eP. picta\u003c/em\u003e that was estimated at 2.9 Gb using flow cytometry. Genome size and heterozygosity were estimated from a k-mer profile of the HiFi reads using Jellyfish 2.3.0 [\u003cspan citationid=\"CR23\" class=\"CitationRef\"\u003e23\u003c/span\u003e] and GenomeScope v1.0 [\u003cspan citationid=\"CR24\" class=\"CitationRef\"\u003e24\u003c/span\u003e]. The kmer-based genome size was estimated to be 3.06 Gb, with a heterozygosity about 2.41% (Table\u0026nbsp;\u003cspan refid=\"Tab1\" class=\"InternalRef\"\u003e1\u003c/span\u003e, Data file 2, [\u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e21\u003c/span\u003e]). The cleaned HiFi reads were assembled with hifiasm v0.16.1 [\u003cspan citationid=\"CR25\" class=\"CitationRef\"\u003e25\u003c/span\u003e], and Purge_haplotigs was used to remove duplicated contigs from the genome [\u003cspan citationid=\"CR26\" class=\"CitationRef\"\u003e26\u003c/span\u003e]. The final \u003cem\u003eP. picta\u003c/em\u003e genome assembly consists of 22,628 contigs with a total length of 1.85 Gb, a contig N50 of 124.16 Kb (Table\u0026nbsp;\u003cspan refid=\"Tab1\" class=\"InternalRef\"\u003e1\u003c/span\u003e, Data file 3, Data set 2, [\u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e21\u003c/span\u003e]). Genome completeness was assessed by BUSCO (Benchmarkding Universal Single-Copy Orthologs) v5.4.3 [\u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e27\u003c/span\u003e], in euk_genome_met mode using the lineage dataset metazoa_odb10. The results showed 844 of the 954 BUSCO genes (88.4%) were complete, with 777 complete single copy (81.4%), 67 complete duplicates (7.0%), 70 genes were fragmented (7.3%), and 40 BUSCO genes were missing (4.3%) (Table\u0026nbsp;\u003cspan refid=\"Tab1\" class=\"InternalRef\"\u003e1\u003c/span\u003e, Data file 3, [\u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e21\u003c/span\u003e]).\u003c/p\u003e \u003cp\u003eThe draft genome presents a valuable resource for monitoring genetic diversity, breeding, resilience to climate change and conservation of the endangered Cuban painted snail.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab1\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 1\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eOverview of data files/data sets.\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"4\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eLabel\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eName of data file/data set\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003eFile types\u003c/p\u003e \u003cp\u003e(file extension)\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003eData repository and identifier (DOI or accession number)\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003e\u003cem\u003eData file 1\u003c/em\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eHiFi sequencing of \u003cem\u003eP. picta\u003c/em\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eExcel (.xlsx)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.6084/m9.figshare.28790750.v3[21]\u003c/span\u003e\u003cspan address=\"10.6084/m9.figshare.28790750.v3[21]\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eData file 2\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eGenome survey of \u003cem\u003eP. picta\u003c/em\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003ePortable Document Format (.png)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.6084/m9.figshare.28790750.v3[21]\u003c/span\u003e\u003cspan address=\"10.6084/m9.figshare.28790750.v3[21]\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eData file 3\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eGenome assembly of \u003cem\u003eP. picta\u003c/em\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eExcel (.xlsx)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.6084/m9.figshare.28790750.v3[21]\u003c/span\u003e\u003cspan address=\"10.6084/m9.figshare.28790750.v3[21]\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eData set 1\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003ePacbio sequence reads of \u003cem\u003eP. picta\u003c/em\u003e genomic DNA\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eFastq files (.fastq)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003eNCBI Sequence Read Archive (\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://www.ncbi.nlm.nih.gov/bioproject/PRJNA1250545\u003c/span\u003e\u003cspan address=\"https://www.ncbi.nlm.nih.gov/bioproject/PRJNA1250545\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e)[\u003cspan citationid=\"CR22\" class=\"CitationRef\"\u003e22\u003c/span\u003e]\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eData set 2\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eGenome assembly of \u003cem\u003eP.picta\u003c/em\u003e\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eFasta file (.fna)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003e\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.6084/m9.figshare.28790750.v3[21]\u003c/span\u003e\u003cspan address=\"10.6084/m9.figshare.28790750.v3[21]\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cdiv id=\"Sec3\" class=\"Section2\"\u003e \u003ch2\u003eLimitations\u003c/h2\u003e \u003cp\u003eThe polymerase amplification process has yielded a promising amount of HiFi reads for \u003cem\u003eP. picta\u003c/em\u003e; however, it also shortens the average HiFi read length, resulting in a more fragmented assembly. In addition, the relatively low sequencing coverage (14\u0026times; relative to the genome size) leaves some genomic regions unassembled. The high heterozygosity of \u003cem\u003eP. picta\u003c/em\u003e further complicates the assembly process, contributing to a significant proportion of the genome representing alternative haplotypes. In addition, the discrepancy between the assembly and the estimated genome size may be due to highly repetitive sequences. Nonetheless, the assembly quality is comparable to that of other published genomes of land snails and slugs, and it now provides a valuable genetic resource for comparative genomics and the discovery of lineage-specific and adaptive genes.\u003c/p\u003e \u003c/div\u003e"},{"header":"Abbreviations","content":"\u003cp\u003ePacBio: Pacific Biosciences Sequel II\u003c/p\u003e\n\u003cp\u003eSMRT: Single Molecule Real Time\u003c/p\u003e\n\u003cp\u003eBUSCO: Benchmarkding Universal Single-Copy Orthologs\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eEthics approval and consent to participate\u003cbr\u003e\u003c/strong\u003eSpecimens were collected under permits to BRT and exported with authorisation from the relevant authorities in Cuba including prior informed consent and material transfer agreements as addenda to an established collaboration contract between Senckenberg Society and University of Oriente, Santiago de Cuba.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eConsent for publication\u003cbr\u003e\u003c/strong\u003eNot applicable.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAvailability of data and materials\u003c/strong\u003e\u0026nbsp;\u003cbr\u003e\u0026nbsp;The data described in this Data note can be freely and openly accessed on NCBI under BioProject ID PRJNA1250545 [22], and figshare (https://doi.org/10.6084/m9.figshare.28790750.v3)[21]. Please see Table 1 for details and links to the data.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eCompeting interests\u003cbr\u003e\u003c/strong\u003eThe authors declare they have no competing interests.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eFunding\u003cbr\u003e\u003c/strong\u003eMollusc of the Year was a project funded under the LOEWE Translational Biodiversity Genomics initiative.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAuthors\u0026rsquo; contributions\u003cbr\u003e\u003c/strong\u003eBRT, JDS, CGr conceived the project. BRT collected the samples. ABH and CGe performed DNA extraction and library preparation, ZC performed bioinformatic analysis, BRT, MJG performed investigation. ZC, JDS prepared the original draft, all authors reviewed, edited and approved the final manuscript.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAcknowledgements\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eWe thank the Bioscientia Institut f\u0026uuml;r Medizinische Diagnostik GmbH for providing the PacBio SMRT sequencing service on the PacBio Revio platform. BRT is grateful to J.M. Koene and A. Davison for the first advise about The International Mollusc of the Year competition and the 10092 participants that voted for \u003cem\u003ePolymita picta\u003c/em\u003e. MJGP and BRT were partially financed by VLIR-UOS and Global Minds funds from Hasselt University, Belgium.\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\n\u003cli\u003eTorre de la C. El g\u0026eacute;nero Polymita. Memorias de la Sociedad Cubana de Historia Natural \u0026ldquo;Felipe Poey\u0026rdquo;. 1950; 20:1\u0026ndash;20.\u003c/li\u003e\n\u003cli\u003eGordillo-P\u0026eacute;rez MJ, Beenaerts N, S\u0026aacute;nchez DL, Smeets K, Arias-Sosa YC, Reyes-Tur B. Shell colour luminance of Cuban painted snails, \u003cem\u003ePolymita picta\u003c/em\u003e and \u003cem\u003ePolymita muscarum\u003c/em\u003e (Gastropoda: Cepolidae). PLoS ONE. 2025;20:e0314008.\u003c/li\u003e\n\u003cli\u003eMilera J, Mart\u0026iacute;nez JR. \u003cem\u003ePolymita\u003c/em\u003e. 1st ed. La Habana: Editorial Cient\u0026iacute;fico T\u0026eacute;cnica; 1987.\u003c/li\u003e\n\u003cli\u003eGonz\u0026aacute;lez-Guill\u0026eacute;n A. \u003cem\u003ePolymita\u003c/em\u003e: the most beautiful land snail of the world. Memphis: Fundcraft Publishing, Collierville; 2021.\u003c/li\u003e\n\u003cli\u003eBerovides V, Vald\u0026eacute;s G, Milera J. Polimorphismo genetico de \u003cem\u003ePolymita picta\u003c/em\u003e roseolimbata Torre, 1950 en la region de Maisi, Cuba. Caribbaean Journal of Science. 1986;22:179\u0026ndash;89.\u003c/li\u003e\n\u003cli\u003eReyes-Tur B, Allen JA, Cuellar-Araujo N, Hern\u0026aacute;ndez N, Lodi M, M\u0026eacute;ndez-Hern\u0026aacute;ndez AA, et al. Mating behaviour, dart shape and spermatophore morphology of the Cuban tree snail \u003cem\u003ePolymita picta\u003c/em\u003e (Born, 1780). Journal of Molluscan Studies. 2015;81:187\u0026ndash;95.\u003c/li\u003e\n\u003cli\u003eCancio-Mart\u0026iacute;nez E, Alfonso-S\u0026aacute;nchez M, Espinosa-L\u0026oacute;pez G. Polimorfismo de las esterasas de m\u0026uacute;sculo en las especies de \u003cem\u003ePolymita\u003c/em\u003e. Ciencias Biol\u0026oacute;gicas. 1987;18:26\u0026ndash;32.\u003c/li\u003e\n\u003cli\u003eKoene JM, Schulenburg H. Shooting darts: co-evolution and counter-adaptation in hermaphroditic snails. BMC Evol Biol. 2005;5:25.\u003c/li\u003e\n\u003cli\u003eLewis A, Reyes-Tur B, Davison A. The mitochondrial genome and phylogenetic placement of the world\u0026rsquo;s most beautiful snail, Polymita picta. Journal of Molluscan Studies. 2025; (in press). https://doi.org/10.1093/mollus/eyaf006.\u003c/li\u003e\n\u003cli\u003eChen Z, Baeza JA, Chen C, Gonzalez MT, Gonz\u0026aacute;lez VL, Greve C, et al. A genome-based phylogeny for Mollusca is concordant with fossils and morphology. Science. 2025;387:1001\u0026ndash;7.\u003c/li\u003e\n\u003cli\u003eHare EE, Johnston JS. Genome Size Determination Using Flow Cytometry of Propidium Iodide-Stained Nuclei. In: Orgogozo V, Rockman MV, editors. Molecular Methods for Evolutionary Genetics. Totowa, NJ: Humana Press; 2012. p. 3\u0026ndash;12.\u003c/li\u003e\n\u003cli\u003eMurray MG, Thompson WF. Rapid isolation of high molecular weight plant DNA. Nucl Acids Res. 1980;8:4321\u0026ndash;6.\u003c/li\u003e\n\u003cli\u003eM\u0026auml;nner L, Schell T, Spies J, Gali\u0026agrave;-Camps C, Baranski D, Ben Hamadou A, et al. Chromosome-level genome assembly of the sacoglossan sea slug Elysia timida (Risso, 1818). BMC Genomics. 2024;25:941.\u003c/li\u003e\n\u003cli\u003eBein B, Chrysostomakis I, Arantes LS, Brown T, Gerheim C, Schell T, et al. Long-read sequencing and genome assembly of natural history collection samples and challenging specimens. Genome Biol. 2025;26:25.\u003c/li\u003e\n\u003cli\u003eCCS - Generate PacBio HiFi data. https://github.com/PacificBiosciences/ccs. Accessed 14 April 2025.\u003c/li\u003e\n\u003cli\u003eAlign clr to ccs reads. https://github.com/PacificBiosciences/actc. Accessed 14 April 2025.\u003c/li\u003e\n\u003cli\u003eLi H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078\u0026ndash;9.\u003c/li\u003e\n\u003cli\u003eBaid G, Cook DE, Shafin K, Yun T, Llinares-L\u0026oacute;pez F, Berthet Q, et al. DeepConsensus improves the accuracy of sequences with a gap-aware sequence transformer. Nat Biotechnol. 2022. https://doi.org/10.1038/s41587-022-01435-7.\u003c/li\u003e\n\u003cli\u003eLima - Demultiplex PacBio data. https://github.com/PacificBiosciences/barcoding. Accessed 14 April 2025.\u003c/li\u003e\n\u003cli\u003eMark duplicate reads from PacBio sequencing of an amplified library. https://github.com/PacificBiosciences/pbmarkdup. Accessed 14 April 2025.\u003c/li\u003e\n\u003cli\u003eReyes-Tur B, Chen Z, Gordillo-P\u0026eacute;rez MJ, Hamadou AB, Gerheim C, Greve C, et al. Draft genome of the Cuban painted land snail Polymita picta, International Mollusc of the Year 2022. figshare. Dataset. https://doi.org/10.6084/m9.figshare.28790750. Accessed 14 April 2025.\u003c/li\u003e\n\u003cli\u003eReyes-Tur B, Chen Z, Gordillo-P\u0026eacute;rez MJ, Hamadou AB, Gerheim C, Greve C, et al. NCBI GeneBank. https://identifiers.org/ncbi/bioproject:PRJNA1250545. Accessed 15 April 2025.\u003c/li\u003e\n\u003cli\u003eMar\u0026ccedil;ais G, Kingsford C. A fast, lock-free approach for efficient parallel counting of occurrences of \u003cem\u003ek\u003c/em\u003e -mers. Bioinformatics. 2011;27:764\u0026ndash;70.\u003c/li\u003e\n\u003cli\u003eRanallo-Benavidez TR, Jaron KS, Schatz MC. GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes. Nat Commun. 2020;11:1432.\u003c/li\u003e\n\u003cli\u003eCheng H, Concepcion GT, Feng X, Zhang H, Li H. Haplotype-resolved de novo assembly using phased assembly graphs with Hifiasm. Nat Methods. 2021;18:170\u0026ndash;5.\u003c/li\u003e\n\u003cli\u003eRoach MJ, Schmidt SA, Borneman AR. Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies. BMC Bioinformatics. 2018;19:460.\u003c/li\u003e\n\u003cli\u003eSeppey M, Manni M, Zdobnov EM. BUSCO: Assessing Genome Assembly and Annotation Completeness. In: Kollmar M, editor. Gene Prediction. New York, NY: Springer New York; 2019. p. 227\u0026ndash;45.\u003c/li\u003e\n\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":true,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":true,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"bmc-genomic-data","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"gtic","sideBox":"Learn more about [BMC Genomic Data](http://bmcgenet.biomedcentral.com/)","snPcode":"","submissionUrl":"https://www.editorialmanager.com/gtic/default.aspx","title":"BMC Genomic Data","twitterHandle":"BMC_series","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"em","reportingPortfolio":"BMC Series","inReviewEnabled":true,"inReviewRevisionsEnabled":true},"keywords":"Mollusca, adaptive radiation, Gastropoda, Stylommatophora, Cuba, conservation genomics","lastPublishedDoi":"10.21203/rs.3.rs-6464041/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-6464041/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003ch2\u003eObjective:\u003c/h2\u003e \u003cp\u003eThe Cuban painted land snail is an iconic endemic forest snail species with distinctive colourful shells used in traditional handicrafts. This species won the International Mollusc of the Year 2022 competition in an open public vote. As the competition prize, we have assembled the draft genome of this species.\u003c/p\u003e\u003ch2\u003eData description:\u003c/h2\u003e \u003cp\u003eGenomic DNA from \u003cem\u003ePolymita picta\u003c/em\u003e (Born, 1778) was sequenced using PacBio HiFi sequencing with a yield of 5.3\u0026nbsp;million reads and an N50 of 8.1 Kbp. The genome size of \u003cem\u003eP. picta\u003c/em\u003e was estimated to be 2.9 Gbp, and the final assembly was 1.85 Gbp, with a total of 22,628 contigs and a contig N50 of 124.2 Kbp. BUSCO analysis of the genome assembly indicated a genome completeness of 88.4%, with 7% complete duplicated BUSCOs. The draft genome will be a valuable resource for work on the endangered Cuban painted land snail including monitoring genetic diversity and establishing captive breeding for conservation.\u003c/p\u003e","manuscriptTitle":"Draft genome of the Cuban painted land snail Polymita picta, International Mollusc of the Year 2022","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2025-04-29 10:26:22","doi":"10.21203/rs.3.rs-6464041/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"decision","content":"Revision requested","date":"2025-05-23T09:34:36+00:00","index":"","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2025-05-23T06:26:26+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2025-05-21T00:26:33+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"17203634154669776730080548396208799068","date":"2025-05-19T15:13:52+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"199178505187179921869824892593221494126","date":"2025-05-19T10:47:06+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2025-05-16T09:58:40+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"36398391589869338990497233400431490784","date":"2025-05-06T13:32:45+00:00","index":"hide","fulltext":""},{"type":"reviewersInvited","content":"","date":"2025-04-30T15:08:01+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2025-04-18T10:21:02+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2025-04-18T10:19:25+00:00","index":"","fulltext":""},{"type":"submitted","content":"BMC Genomic Data","date":"2025-04-16T13:31:39+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"bmc-genomic-data","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"gtic","sideBox":"Learn more about [BMC Genomic Data](http://bmcgenet.biomedcentral.com/)","snPcode":"","submissionUrl":"https://www.editorialmanager.com/gtic/default.aspx","title":"BMC Genomic Data","twitterHandle":"BMC_series","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"em","reportingPortfolio":"BMC Series","inReviewEnabled":true,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"bdd6f8ab-cdb2-4676-96bb-bd2383d7d63f","owner":[],"postedDate":"April 29th, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"published-in-journal","subjectAreas":[],"tags":[],"updatedAt":"2025-09-08T16:03:38+00:00","versionOfRecord":{"articleIdentity":"rs-6464041","link":"https://doi.org/10.1186/s12863-025-01356-9","journal":{"identity":"bmc-genomic-data","isVorOnly":false,"title":"BMC Genomic Data"},"publishedOn":"2025-09-03 15:57:00","publishedOnDateReadable":"September 3rd, 2025"},"versionCreatedAt":"2025-04-29 10:26:22","video":"","vorDoi":"10.1186/s12863-025-01356-9","vorDoiUrl":"https://doi.org/10.1186/s12863-025-01356-9","workflowStages":[]},"version":"v1","identity":"rs-6464041","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-6464041","identity":"rs-6464041","version":["v1"]},"buildId":"8U1c8b4HqxoKbykW_rLl7","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: preprint-html

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00