Analysis of genetic diversity and population Structure of Gleditsia sinensis based on SLAF-Seq

preprint OA: closed
Full text JSON View at publisher
Full text 59,994 characters · extracted from preprint-html · click to expand
Analysis of genetic diversity and population Structure of Gleditsia sinensis based on SLAF-Seq | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Analysis of genetic diversity and population Structure of Gleditsia sinensis based on SLAF-Seq Bing Yang, Jun Luo, Xiaoyong Dai This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-6259139/v1 This work is licensed under a CC BY 4.0 License Status: Published Journal Publication published 09 Dec, 2025 Read the published version in Genetic Resources and Crop Evolution → Version 1 posted 11 You are reading this latest preprint version Abstract Gleditsia sinensis Lam. has significant medicinal and economic value. By utilizing SLAF-Seq technology, we developed single nucleotide polymorphism (SNP) markers to analyze the genetic diversity of this species, providing a basis for the conservation and utilization of germplasm resources. In this study, we conducted SNP marker development on 159 G. sinensis samples from Guizhou Province, identifying a total of 132,709 population SNPs. The number of SNPs per sample ranged from 27,292 to 76,870, with heterozygosity rates ranging from 4.17–15.92%. The analysis of population genetic structure indicated that these 159 G. sinensis samples could be divided into five groups. The genetic distances among samples from the same provenance were close, while gene flow was observed among germplasm resources from different geographical populations. The nucleotide diversity level of population Q5 was relatively high, and the Tajima’s D test suggested that rare alleles were scarce within the populations, with the majority of genes being neutral, indicating a state of equilibrium. This study provides fundamental data for the collection of G. sinensis germplasm resources and the breeding of superior varieties. Gleditsia sinensis SLAF-seq SNP genetic diversity Figures Figure 1 Figure 2 Figure 3 Introduction Gleditsia sinensis Lam. is a species that belongs to the genus Gleditsia within the subfamily Caesalpinioideae of the family Leguminosae. This species is valued for its medicinal and culinary applications.The primary utilizable components of the plant include thorns, pods, and seeds (commonly referred to as "soap bean kernels") Each of these components possesses considerable medicinal and economic value. (Xiao et al., 2023). The thorns are used in traditional medicine (Shu et al., 2023), the pods are rich in saponins and serve as natural detergents, while the seeds are edible. Additionally, the durable wood of G. sinensis is iparticularly well-suited for both domestic and artisanal applications.This species is widely distributed across 19 provinces in China. MaxEnt and ArcGIS analyses suggest that G. sinensis possesses a wide range of potentially suitable habitats across China, with medium- and high-suitability zones covering approximately 13.76% of the country's total area (Yang et al., 2022). Investigating and collecting elite germplasm resources is essential for promoting the development of the G. sinensis industry and enhancing genetic improvement. The genetic diversity of G. sinensis populations is intricately linked to their geographic distribution as well as anthropogenic activities (Li et al., 2017).Research on the germplasm of Guizhou has demonstrated significant phenotypic variation both among and within populations (Tian et al., 2022).Pod morphological indices exhibit an increasing trend with elevation, whereas seed size shows a decreasing pattern. Additionally, seeds demonstrate a gradual enlargement trend from the south to the north and from the east to the west across Guizhou (Tian, 2022 ). Seedling height growth across ten populations from six southern provinces of China exhibits a significant negative correlation with both latitude and annual average temperature, indicating the presence of a latitudinal cline.(Li, 2013).SRAP (sequence-related amplified polymorphism) marker analysis of 203 samples from 14 natural populations revealed genetic distances ranging from 0.0504 to 0.3596, with significant positive correlations between geographic and genetic distances (Tian et al., 2023).Amplification techniques such as RASP (Fan et al., 2017), SRAP (Zhang et al., 2017), and SCoT (Zhang et al., 2017) have demonstrated a high level of polymorphism across 18 germplasm resources of G. sinensis . Simplified genome sequencing provides a technical basis for elucidating patterns of species population distribution and facilitating genetic enhancement.SLAF-Seq (Specific-Locus Amplified Fragment Sequencing) identifies approximately one tag per 10 kb with a uniform distribution while avoiding repetitive sequences. This method is more cost-effective compared to Genotyping-by-Sequencing (GBS) or Restriction-site Associated DNA Sequencing (RAD-seq) (Chen et al., 2022 ; Sun et al., 2013 ). Due to its high efficiency in the development of SNP markers, SLAF-Seq has been extensively utilized across a variety of plant species.(Shen et al., 2024; Li et al., 2024). In this study, we utilized SLAF-Seq to generate SNP markers for 159 Gleditsia sinensis germplasms collected from Guizhou Province. This analysis aimed to assess their genetic diversity and provides a foundational basis for the selection of germplasm resources and the identification of cultivars. Materials and Methods Plant Materials A total of 159 leaf samples of G. sinensis were collected from eight distinct regions across Guizhou Province (Table 1 ). Healthy leaves were immediately flash-frozen in liquid nitrogen and subsequently stored at -80°C. DNA extraction was performed using the CTAB method, followed by quantification via spectrophotometry and verification through 1.5% agarose gel electrophoresis. Qualified DNA samples were preserved at -20°C for subsequent experiments. Table 1 Leaf collection information of G. sinensis population in Guizhou Province Group Region Collection Sites Longitude (°E) Latitude (°N) Altitude (m) No. of Individuals QN Qiannan Guiding,Huishui 106.18 ~ 106.93 25.89 ~ 26.73 928.83 ~ 1390.4 13 QDN Qiandongnan Leishan,Majiang,Taijiang 106.88 ~ 108.07 26.40 ~ 28.73 613.45 ~ 1330.2 28 AS Anshun Pingba 106.22 ~ 106.89 26.35 ~ 26.47 1295.3 ~ 1398.0 12 QXN Qianxinan Puan,Qinglong,Anlong,Ceheng,Xingren 105.00 ~ 105.98 24.91 ~ 25.96 751.28 ~ 1582.2 20 BJ Bijie Zhijin,Dafang 105.38 ~ 106.03 26.49 ~ 27.31 1199.0 ~ 1703.0 22 LPS Liupanshui Liupanshui 104.62 ~ 105.63 26.17 ~ 26.81 1215.0 ~ 1935.7 19 TR Tongren Sinan,Dejiang,Yinjiang,Shiqian 107.97 ~ 108.61 27.48 ~ 28.30 414.7 ~ 1012.0 21 ZY Zunyi Huichuan,Honghuagang,Meitan,Wuchuan 106.67 ~ 107.90 27.56 ~ 28.80 712.0 ~ 1330.2 24 Restriction Digestion and High-Throughput Sequencing Genomic DNA was digested using RsaI and HaeIII, resulting in the selection of fragments ranging from 364 to 414 bp as SLAF tags. Libraries were constructed through A-tailing, adapter ligation, PCR amplification, and size selection (Kozich et al., 2013 ), followed by sequencing on an Illumina platform. The data were processed with FASTQ and BMK_Control for quality assessment. SNP Marker Statistics and Genetic Diversity Analysis SNP markers were developed using the highest-depth sequence from each SLAF tag as a reference. Reads were aligned with BWA, and SNPs were identified using GATK (v3.8) and SAMtools (v1.9). Genome-wide SNPs were filtered based on minor allele frequency (MAF: 0.05) and site integrity (INT: 0.5) to retain high-confidence SNPs for subsequent analyses. For phylogenetic analysis, a neighbor-joining tree was constructed using MEGA X (Tamura et al., 2011 ), employing the Kimura two-parameter model with 1,000 bootstrap replicates. Population structure was inferred through ADMIXTURE, testing subgroup numbers (K values) ranging from 1 to 10, followed by clustering and cross-validation. Principal component analysis (PCA) was performed on the SNP data using EIGENSOFT software (version 6.0; Price et al., 2006 ). Genetic diversity parameters were assessed utilizing VCFtools. Results and Analysis SLAF Marker Development The SLAF-Seq analysis of 159 G. sinensis samples generated a total of 805 Mb of sequencing data, achieving an average Q30 score of 93.50% and a GC content of 38.42%. A comprehensive set of 844,847 SLAF tags was obtained, with an average sequencing depth of 12.22x. Utilizing the Genome Analysis Toolkit (GATK), we identified 132,709 high-quality SNPs across the samples, with individual sample counts ranging from 27,292 to 76,870 SNPs and heterozygosity rates varying between 4.17% and 15.92%. Population Genetic Structure Analysis Population structure analysis of 159 individuals, with K values preset from 1 to 10, revealed the lowest cross-validation error at K = 5 (Fig. 4a), indicating five distinct genetic groups. Group Q1 included CH-1, CH-2, CH-7, HC-34, HC-39, MT-8, MT-8-1, PA-6, TZ-1, and TZ-6; Q2 comprised DF-5, HC-5, HS-1, HS-15, HS-17, LPS-1, LPS-20, LPS-23, and LPS-69; Q3 encompassed DF-7, DJ-11, DJ-22, DJ-24, DJ-5, DJ-6, and HHG-15; Q4 included AL-2, DF-2, GD-12, GD-17, GD-3, HC-36, and HHG-13; Q5 consisted of DF-12, DF-9, DQ, GD-11, HC-9, HHG-03, HS-12, HS-4, HS-6, HS-9, LPS-12, LPS-21, LPS-35, LPS-4, LPS-56, LPS-61, LPS-62, LS-11, LS-14, MJ-1, MJ-21, MJ-27, MJ-32, MJ-56, MT-9, PA-3, PB-12, PB-3, PB-5, PB-9, QL-6, QL-9, SN-16, ZJ-14, ZJ-17, ZJ-2, ZJ-28, ZJ-38, and ZJ-6. Phylogenetic Tree Analysis A neighbor-joining phylogenetic tree constructed using MEGA X (Fig. 2 a) revealed close genetic distances among samples originating from the same geographic locations. Conversely, principal component analysis (PCA) of population structure (Fig. 2 b) indicated potential gene flow between populations. Genetic Diversity Analysis Individual heterozygosity ranged from − 0.41758 to 0.86718, with mean values for Q1–Q5 being 0.2047, 0.3696, 0.1933, 0.1600, and 0.1168, respectively. Nucleotide diversity (Pi) analysis (Fig. 3 a) indicated a slightly higher diversity in Q5 (mean Pi = 0.0285), which included individuals from Zhijin (ZJ-14, ZJ-17, ZJ-2, ZJ-28, ZJ-38, ZJ-6), Liupanshui (LPS-12, LPS-21, LPS-35, LPS-4, LPS-56, LPS-61, LPS-62), and Majiang (MJ-1, MJ-21, MJ-27, MJ-32, MJ-56). Tajima’s D values (Fig. 3 b) averaged at 0.359 for Q5, surpassing those of the other groups examined. Pairwise Fst values Q1, Q2, Q3, Q4, and Q5 were 0.07, 0.026, 0.032, and 0.032, respectively, suggesting moderate genetic differentiation among these groups. Discussion With the advancement of high-throughput sequencing technologies, SLAF-Seq, which utilizes second-generation sequencing methods, has demonstrated its efficacy in developing reliable SNP and InDel markers. This approach offers significant advantages for germplasm classification, studies on population evolution, and gene association analyses. For example, simplified genome sequencing has been employed to differentiate 12 clones of Camellia oleifera and to construct SNP fingerprint profiles (Liao et al., 2024). Similarly, SLAF-Seq facilitated the classification of 195 pepper germplasms into three distinct subgroups (Chen et al., 2023). In this study, SLAF-Seq was utilized for the first time to evaluate the genetic diversity of 159 Gleditsia sinensis samples collected from Guizhou Province. Using GATK software, a total of 132,709 high-quality SNPs were identified; within individual samples, SNP counts ranged from 27,292 to 76,870 with heterozygosity rates varying between 4.17% and 15.92%. Population genetic structure analysis indicated that the 159 G. sinensis individuals could be categorized into five distinct groups. Genetic distances were found to be closer among individuals sharing the same geographic origin, suggesting potential gene flow between populations. The mean heterozygosity values for groups Q1 through Q5 were 0.2047, 0.3696, 0.1933, 0.1600, and 0.1168, respectively.Population Q5 exhibited a marginally elevated level of nucleotide diversity (Pi = 0.0285) in comparison to the other groups. Furthermore, Tajima’s D, which quantifies intraspecific polymorphism and serves as an indicator for neutral evolution, was significantly higher in Q5 than in the other populations.All groups exhibited positive Tajima’s D values, indicating an excess of intermediate-frequency alleles. This phenomenon may be attributed to bottleneck effects, population structure dynamics, or balancing selection (Li et al., 2020 ). Simplified genome sequencing of the 159 G. sinensis samples revealed moderate to strong genetic differentiation among them, with pairwise Fst values ranging from 0.07 to 0.80 (Yin et al., 2023). Notably, the average Fst values recorded at 0.07, 0.026, 0.032, and 0.032 between Q5 and the other groups (Q1–Q4), respectively. this indicates relatively low levels of genetic differentiation overall. Declarations Acknowledgements We extend our gratitude to the researchers involved in the Project for their dedication and hard work. Additionally,We thank Biomarker Technologies Co.,Ltd for assisting in sequencing and/or bioinformatics analysis. Funding This project was supported by The Guizhou Provincial Basic Research Program (No.ZK[2022]-242) and The Guizhou Forestry Seedling Cultivation Project (No. 2024-ZM-19). Competing Interests The authors have no relevant financial or non-financial interests to disclose. Author Contributions Bing Yang and Jun Luo designed and executed the experiments, analyzed the data, and drafted the manuscript. Yayan Zhu contributed to both the experimental design and data analysis. Bing Yang and Xiaoyong Dai supervised the project, provided guidance on analyses, and revised the manuscript. All authors reviewed and approved the final version. Data Availability The raw sequence data reported in this paper have been deposited in the Genome Sequence Archive in National Genomics Data Center, China National Center for Bioinformation/Beijing Institute of Genomics, Chinese Academy of Sciences (GSA:CRA023840) that are publicly accessible at https://ngdc.cncb.ac.cn/gsa, (accessed on 18 March 2026). References Chen Xiaocui,Zhang Xiaowei,Li Hui,Qiu Huarong,Tang Xiangqun,Luo Xirong,Yang Hong,Qin Cheng.SNP Sites Development by Specific Length Amplification Fragment Sequencing (SLAF-seq) and Genetic Analysis in Pepper.Molecular Plant Breeding.1-212 Chen Z, He Y, Iqbal Y, Shi Y, Huang H, Yi Z: Investigation of genetic relationships within three Miscanthus species using SNP markers identified with SLAF-seq. BMC genomics 2022, 23:1–11. Fan Dingchen,Zhang Anshi,Liu Ying,Luo YangGenetic Diversity and Fingerprints of Glediysia sinensis Germplasms Based on RSAP.Journal of Henan Agricultural Sciences 2017, 46(11):103–107. Kozich JJ, Westcott SL, Baxter NT, Highlander SK, Schloss PD: Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform. Applied and environmental microbiology 2013, 79(17):5112–5120. Li W, Liu L, Wang Y, Zhang Q, Fan G, Zhang S, Wang Y, Liao K: Genetic diversity, population structure, and relationships of apricot (Prunus) based on restriction site-associated DNA sequencing. Horticulture Research 2020, 7. Li Wei,2013,Research on Genetic Resources Evaluation of Gleditsia sinensis in Sourth China,Doctoral dissertation,Chinese Academy Forestry,Mentor:Zheng Yongqi LI Wei,LIN Fu-rong,ZHENG Yong-qi,SUN Sheng.Genetic Diversity of Ten Gleditsia sinensis Populations from Southern China.Forest Research 2017,30(01):46–52. Li Xue-jun,Wang Bu-tian,Zhao Meng,Shi Xue-dong,Chen Zhi-hua,Xie Chun,Lu Yun-feng,Dong Xiang-shu,Ma Yin-bo,Ge Yu.Genetic diversity of Coffea arabica based on SLAF-Seq technology.Journal of Southern Agriculture 2024, 55(06):1583–1593. Liao Hongze,Sun Manman,Huang Xiaojuan,Hao Bingqing,Sun Jiaxing,Jiang Zepeng,Wang Dongxue,Liu Kai.Development of SNP markers and construction of DNA fingerprint in teaoil Camellia germplasms based on simplified genome sequencing.Journal of Central South University of Forestry & Technology 2024, 44(04):128–137. Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D: Principal components analysis corrects for stratification in genome-wide association studies. Nature genetics 2006, 38(8):904–909. Shen Chaogui,Lai Ruilian,Chen Jin,Feng Xin,Chen Yiting,Wei Xiaoxia,Wu Rujian.SNP Markers Development and Genetic Relationship Identification of Germplasm Resources of Canarium album Based on SLAF-Seq Technology.Fujian Journal of Agricultural Sciences2024, 39(05):557–562. Shu Chengjie,Li Zhuohang,Chen Diansong,Ma Ling,Zhou Xinyu,Zhao Jing,Wang Miao,Ma Shihong.Total Flavonoids, Anti-mite Activity and Antioxidant Activity ofthe Extract of Gleditsia Sinensis Thron.Chinese Wild Plant Resources 2022, 33(05):1201–1204. Sun X, Liu D, Zhang X, Li W, Liu H, Hong W, Jiang C, Guan N, Ma C, Zeng H: SLAF-seq: an efficient method of large-scale de novo SNP discovery and genotyping using high-throughput sequencing. PloS one 2013, 8(3):e58700. Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Molecular biology and evolution 2011, 28(10):2731–2739. Tian Honghong,2022,Study on the genetic diversity of wild soap pod resources in Guizhou Province,Master's Thesis,Mentor:Zhao Yang Tian Honghong,Yang Ju,Lu Chunyun,XiaoO Feng,Zhao Yang.Phenotypic Diversity Analysis of Provenances of Natural Gleditsia sinensis in Guizhou.Acta Botanica Boreali-occidentalis 2022, 42(11):1927–1935 Yang Ye,Liu Xiao-long,Lu Xiang,Sun Qing-wen.Ecological suitability regionalization of Gleditsia sinensis Lam.in China based on MaxEnt and ArcGIS.Lishizhen Medicine and Materia Medica Research 2022, 33(05):1201–1204. Yin Qingfeng,Wang Yi,Huyan Li,Hao Jiabo,Meng Jinfang,Lu Bin.Genetic Diversity of Wild Zanthoxylum Armatum by ddRAD-seq.Molecular Plant Breeding.1–22· Zhang AnShi,Luo Yang,Fan Ding-Chen,Zhang Zhong-Hai.Genetic diversity and fingerprints of Gleditsia sinensis germplasm resource based on SCoT.Guihaia 2017, 37(11):1378–1385. Zhang Anshi,Zhang Sumin,Fan Dingchen,Liu Ying.Genetic diversity and fingerprints of Gleditsia sinensis germplasm based on SRAP.Acta Agriculturae Zhejiangensis2017, 29(09):1524–153 Additional Declarations No competing interests reported. Cite Share Download PDF Status: Published Journal Publication published 09 Dec, 2025 Read the published version in Genetic Resources and Crop Evolution → Version 1 posted Editorial decision: Revision requested 14 Apr, 2025 Reviews received at journal 14 Apr, 2025 Reviews received at journal 04 Apr, 2025 Reviews received at journal 01 Apr, 2025 Reviewers agreed at journal 26 Mar, 2025 Reviewers agreed at journal 24 Mar, 2025 Reviewers agreed at journal 24 Mar, 2025 Reviewers invited by journal 24 Mar, 2025 Editor assigned by journal 20 Mar, 2025 Submission checks completed at journal 20 Mar, 2025 First submitted to journal 19 Mar, 2025 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-6259139","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":434518196,"identity":"bb741814-75b6-42cd-a864-d24b1f92bc12","order_by":0,"name":"Bing Yang","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA3UlEQVRIiWNgGAWjYBACNv7mww8+GNjwMDYcPvggoaKGsBY+iWNphjMq0uSYG48lGzw4c4ywFjmGHAVpnjOHjNmbz5hJPmxhJsJhDGcYjHnbDiT2th1Lq0hsYGPgb+9OwK+FuffAw7ltdxJn9hw+diNxhwyDxJmzGwjYci7B4G3bs8SNM46l3Ug8w8ZgIJFLSEuOgQRv2+HE/fffmBUktjETp0WS58xhY8aGM2YMxGmBBTJjw7FkiYQzx3gI+kW+HykqP/6oqJHjb+/FrwUD8JCmfBSMglEwCkYBVgAAFEJVBrMrcWgAAAAASUVORK5CYII=","orcid":"","institution":"","correspondingAuthor":true,"prefix":"","firstName":"Bing","middleName":"","lastName":"Yang","suffix":""},{"id":434518198,"identity":"8c9c562c-d127-46bf-b6b6-207d027eeae3","order_by":1,"name":"Jun Luo","email":"","orcid":"","institution":"","correspondingAuthor":false,"prefix":"","firstName":"Jun","middleName":"","lastName":"Luo","suffix":""},{"id":434518200,"identity":"75726908-de57-4507-90f9-9c02e9ca1143","order_by":2,"name":"Xiaoyong Dai","email":"","orcid":"","institution":"","correspondingAuthor":false,"prefix":"","firstName":"Xiaoyong","middleName":"","lastName":"Dai","suffix":""}],"badges":[],"createdAt":"2025-03-19 07:53:13","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-6259139/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-6259139/v1","draftVersion":[],"editorialEvents":[{"content":"https://doi.org/10.1007/s10722-025-02690-8","type":"published","date":"2025-12-09T15:59:35+00:00"}],"editorialNote":"","failedWorkflow":false,"files":[{"id":79698222,"identity":"ec381ddd-fc05-4fc3-a9a8-0f7c32ef2bd2","added_by":"auto","created_at":"2025-04-01 16:01:17","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":10504993,"visible":true,"origin":"","legend":"\u003cp\u003eadmixture validation error values (a) and Gentic structure cluster (b)\u003c/p\u003e","description":"","filename":"Fig.1.png","url":"https://assets-eu.researchsquare.com/files/rs-6259139/v1/43241aa0f7728d375ee9e9e6.png"},{"id":79698220,"identity":"dea5987d-9aa9-4e19-b313-c4a7e8f44b6c","added_by":"auto","created_at":"2025-04-01 16:01:17","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":794922,"visible":true,"origin":"","legend":"\u003cp\u003ePhylogenetic tree analysis and PCA analysis\u003c/p\u003e\n\u003cp\u003e(a) Phylogenetic tree analysis;(b) PCA analysis\u003c/p\u003e\n\u003cp\u003eNote: The names of the samples in the inner circle of Figure a were mapped according to administrative divisions, while the outer circle was mapped according to the population structure of admixtures; In Figure b, different populations were divided according to their population structure, and the periphery was drawn using geomer_mark'hull.\u003c/p\u003e","description":"","filename":"Fig.2treeandPCA.png","url":"https://assets-eu.researchsquare.com/files/rs-6259139/v1/1b9e9f87b7b8f2db266f6e0f.png"},{"id":79698844,"identity":"4c1db78c-598a-4d28-aa05-09e1bb89b477","added_by":"auto","created_at":"2025-04-01 16:09:17","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":1246399,"visible":true,"origin":"","legend":"\u003cp\u003eComparison of population diversity parameters\u003c/p\u003e\n\u003cp\u003e(a) The nucleotide diversity levels of different populations; (b) Comparison of Tajima’s D values for different populations.\u003c/p\u003e\n\u003cp\u003eNote: The sliding window size was set to 10, and the curve fitting model is based on loess by the geom_smooth.\u003c/p\u003e","description":"","filename":"Fig.3.png","url":"https://assets-eu.researchsquare.com/files/rs-6259139/v1/168a62ecbdb922d8da9e091a.png"},{"id":98245210,"identity":"cb1412c3-b232-4427-b9c7-b1d5f15442ca","added_by":"auto","created_at":"2025-12-15 16:17:10","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":11658455,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-6259139/v1/46d95d21-62b7-4524-8c07-092f175810d3.pdf"}],"financialInterests":"No competing interests reported.","formattedTitle":"Analysis of genetic diversity and population Structure of Gleditsia sinensis based on SLAF-Seq","fulltext":[{"header":"Introduction","content":"\u003cp\u003e \u003cem\u003eGleditsia sinensis\u003c/em\u003e Lam. is a species that belongs to the genus \u003cem\u003eGleditsia\u003c/em\u003e within the subfamily Caesalpinioideae of the family Leguminosae. This species is valued for its medicinal and culinary applications.The primary utilizable components of the plant include thorns, pods, and seeds (commonly referred to as \"soap bean kernels\") Each of these components possesses considerable medicinal and economic value. (Xiao et al., 2023). The thorns are used in traditional medicine (Shu et al., 2023), the pods are rich in saponins and serve as natural detergents, while the seeds are edible. Additionally, the durable wood of \u003cem\u003eG. sinensis\u003c/em\u003e is iparticularly well-suited for both domestic and artisanal applications.This species is widely distributed across 19 provinces in China. MaxEnt and ArcGIS analyses suggest that \u003cem\u003eG. sinensis\u003c/em\u003e possesses a wide range of potentially suitable habitats across China, with medium- and high-suitability zones covering approximately 13.76% of the country's total area (Yang et al., 2022).\u003c/p\u003e \u003cp\u003eInvestigating and collecting elite germplasm resources is essential for promoting the development of the \u003cem\u003eG. sinensis\u003c/em\u003e industry and enhancing genetic improvement. The genetic diversity of \u003cem\u003eG. sinensis\u003c/em\u003e populations is intricately linked to their geographic distribution as well as anthropogenic activities (Li et al., 2017).Research on the germplasm of Guizhou has demonstrated significant phenotypic variation both among and within populations (Tian et al., 2022).Pod morphological indices exhibit an increasing trend with elevation, whereas seed size shows a decreasing pattern. Additionally, seeds demonstrate a gradual enlargement trend from the south to the north and from the east to the west across Guizhou (Tian, \u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e2022\u003c/span\u003e). Seedling height growth across ten populations from six southern provinces of China exhibits a significant negative correlation with both latitude and annual average temperature, indicating the presence of a latitudinal cline.(Li, 2013).SRAP (sequence-related amplified polymorphism) marker analysis of 203 samples from 14 natural populations revealed genetic distances ranging from 0.0504 to 0.3596, with significant positive correlations between geographic and genetic distances (Tian et al., 2023).Amplification techniques such as RASP (Fan et al., 2017), SRAP (Zhang et al., 2017), and SCoT (Zhang et al., 2017) have demonstrated a high level of polymorphism across 18 germplasm resources of \u003cem\u003eG. sinensis\u003c/em\u003e.\u003c/p\u003e \u003cp\u003eSimplified genome sequencing provides a technical basis for elucidating patterns of species population distribution and facilitating genetic enhancement.SLAF-Seq (Specific-Locus Amplified Fragment Sequencing) identifies approximately one tag per 10 kb with a uniform distribution while avoiding repetitive sequences. This method is more cost-effective compared to Genotyping-by-Sequencing (GBS) or Restriction-site Associated DNA Sequencing (RAD-seq) (Chen et al., \u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2022\u003c/span\u003e; Sun et al., \u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e2013\u003c/span\u003e). Due to its high efficiency in the development of SNP markers, SLAF-Seq has been extensively utilized across a variety of plant species.(Shen et al., 2024; Li et al., 2024). In this study, we utilized SLAF-Seq to generate SNP markers for 159 Gleditsia sinensis germplasms collected from Guizhou Province. This analysis aimed to assess their genetic diversity and provides a foundational basis for the selection of germplasm resources and the identification of cultivars.\u003c/p\u003e"},{"header":"Materials and Methods","content":"\u003cdiv id=\"Sec3\" class=\"Section2\"\u003e \u003ch2\u003ePlant Materials\u003c/h2\u003e \u003cp\u003eA total of 159 leaf samples of \u003cem\u003eG. sinensis\u003c/em\u003e were collected from eight distinct regions across Guizhou Province (Table\u0026nbsp;\u003cspan refid=\"Tab1\" class=\"InternalRef\"\u003e1\u003c/span\u003e). Healthy leaves were immediately flash-frozen in liquid nitrogen and subsequently stored at -80\u0026deg;C. DNA extraction was performed using the CTAB method, followed by quantification via spectrophotometry and verification through 1.5% agarose gel electrophoresis. Qualified DNA samples were preserved at -20\u0026deg;C for subsequent experiments.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab1\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 1\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eLeaf collection information of \u003cem\u003eG. sinensis\u003c/em\u003e population in Guizhou Province\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"7\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c5\" colnum=\"5\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c6\" colnum=\"6\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c7\" colnum=\"7\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eGroup\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eRegion\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003eCollection Sites\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003eLongitude (\u0026deg;E)\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c5\"\u003e \u003cp\u003eLatitude (\u0026deg;N)\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c6\"\u003e \u003cp\u003eAltitude (m)\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c7\"\u003e \u003cp\u003eNo. of Individuals\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eQN\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eQiannan\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eGuiding,Huishui\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e106.18\u0026thinsp;~\u0026thinsp;106.93\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e25.89\u0026thinsp;~\u0026thinsp;26.73\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e928.83\u0026thinsp;~\u0026thinsp;1390.4\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e13\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eQDN\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eQiandongnan\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eLeishan,Majiang,Taijiang\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e106.88\u0026thinsp;~\u0026thinsp;108.07\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e26.40\u0026thinsp;~\u0026thinsp;28.73\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e613.45\u0026thinsp;~\u0026thinsp;1330.2\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e28\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eAS\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eAnshun\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003ePingba\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e106.22\u0026thinsp;~\u0026thinsp;106.89\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e26.35\u0026thinsp;~\u0026thinsp;26.47\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e1295.3\u0026thinsp;~\u0026thinsp;1398.0\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e12\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eQXN\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eQianxinan\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003ePuan,Qinglong,Anlong,Ceheng,Xingren\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e105.00\u0026thinsp;~\u0026thinsp;105.98\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e24.91\u0026thinsp;~\u0026thinsp;25.96\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e751.28\u0026thinsp;~\u0026thinsp;1582.2\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e20\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eBJ\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eBijie\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eZhijin,Dafang\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e105.38\u0026thinsp;~\u0026thinsp;106.03\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e26.49\u0026thinsp;~\u0026thinsp;27.31\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e1199.0\u0026thinsp;~\u0026thinsp;1703.0\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e22\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eLPS\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eLiupanshui\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eLiupanshui\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e104.62\u0026thinsp;~\u0026thinsp;105.63\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e26.17\u0026thinsp;~\u0026thinsp;26.81\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e1215.0\u0026thinsp;~\u0026thinsp;1935.7\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e19\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eTR\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eTongren\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eSinan,Dejiang,Yinjiang,Shiqian\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e107.97\u0026thinsp;~\u0026thinsp;108.61\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e27.48\u0026thinsp;~\u0026thinsp;28.30\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e414.7\u0026thinsp;~\u0026thinsp;1012.0\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e21\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eZY\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eZunyi\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eHuichuan,Honghuagang,Meitan,Wuchuan\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e106.67\u0026thinsp;~\u0026thinsp;107.90\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e27.56\u0026thinsp;~\u0026thinsp;28.80\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e712.0\u0026thinsp;~\u0026thinsp;1330.2\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c7\"\u003e \u003cp\u003e24\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003c/div\u003e\n\u003ch3\u003eRestriction Digestion and High-Throughput Sequencing\u003c/h3\u003e\n\u003cp\u003eGenomic DNA was digested using RsaI and HaeIII, resulting in the selection of fragments ranging from 364 to 414 bp as SLAF tags. Libraries were constructed through A-tailing, adapter ligation, PCR amplification, and size selection (Kozich et al., \u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e2013\u003c/span\u003e), followed by sequencing on an Illumina platform. The data were processed with FASTQ and BMK_Control for quality assessment.\u003c/p\u003e\n\u003ch3\u003eSNP Marker Statistics and Genetic Diversity Analysis\u003c/h3\u003e\n\u003cp\u003eSNP markers were developed using the highest-depth sequence from each SLAF tag as a reference. Reads were aligned with BWA, and SNPs were identified using GATK (v3.8) and SAMtools (v1.9). Genome-wide SNPs were filtered based on minor allele frequency (MAF: 0.05) and site integrity (INT: 0.5) to retain high-confidence SNPs for subsequent analyses.\u003c/p\u003e \u003cp\u003eFor phylogenetic analysis, a neighbor-joining tree was constructed using MEGA X (Tamura et al., \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e2011\u003c/span\u003e), employing the Kimura two-parameter model with 1,000 bootstrap replicates. Population structure was inferred through ADMIXTURE, testing subgroup numbers (K values) ranging from 1 to 10, followed by clustering and cross-validation.\u003c/p\u003e \u003cp\u003ePrincipal component analysis (PCA) was performed on the SNP data using EIGENSOFT software (version 6.0; Price et al., \u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e2006\u003c/span\u003e). Genetic diversity parameters were assessed utilizing VCFtools.\u003c/p\u003e"},{"header":"Results and Analysis","content":"\u003cdiv id=\"Sec7\" class=\"Section2\"\u003e\n \u003ch2\u003eSLAF Marker Development\u003c/h2\u003e\n \u003cp\u003eThe SLAF-Seq analysis of 159 \u003cem\u003eG. sinensis\u003c/em\u003e samples generated a total of 805 Mb of sequencing data, achieving an average Q30 score of 93.50% and a GC content of 38.42%. A comprehensive set of 844,847 SLAF tags was obtained, with an average sequencing depth of 12.22x. Utilizing the Genome Analysis Toolkit (GATK), we identified 132,709 high-quality SNPs across the samples, with individual sample counts ranging from 27,292 to 76,870 SNPs and heterozygosity rates varying between 4.17% and 15.92%.\u003c/p\u003e\n\u003c/div\u003e\n\u003cdiv id=\"Sec8\" class=\"Section2\"\u003e\n \u003ch2\u003ePopulation Genetic Structure Analysis\u003c/h2\u003e\n \u003cp\u003ePopulation structure analysis of 159 individuals, with K values preset from 1 to 10, revealed the lowest cross-validation error at K\u0026thinsp;=\u0026thinsp;5 (Fig. 4a), indicating five distinct genetic groups. Group Q1 included CH-1, CH-2, CH-7, HC-34, HC-39, MT-8, MT-8-1, PA-6, TZ-1, and TZ-6; Q2 comprised DF-5, HC-5, HS-1, HS-15, HS-17, LPS-1, LPS-20, LPS-23, and LPS-69; Q3 encompassed DF-7, DJ-11, DJ-22, DJ-24, DJ-5, DJ-6, and HHG-15; Q4 included AL-2, DF-2, GD-12, GD-17, GD-3, HC-36, and HHG-13; Q5 consisted of DF-12, DF-9, DQ, GD-11, HC-9, HHG-03, HS-12, HS-4, HS-6, HS-9, LPS-12, LPS-21, LPS-35, LPS-4, LPS-56, LPS-61, LPS-62, LS-11, LS-14, MJ-1, MJ-21, MJ-27, MJ-32, MJ-56, MT-9, PA-3, PB-12, PB-3, PB-5, PB-9, QL-6, QL-9, SN-16, ZJ-14, ZJ-17, ZJ-2, ZJ-28, ZJ-38, and ZJ-6.\u003c/p\u003e\n\u003c/div\u003e\n\u003ch3\u003ePhylogenetic Tree Analysis\u003c/h3\u003e\n\u003cp\u003eA neighbor-joining phylogenetic tree constructed using MEGA X (Fig. \u003cspan class=\"InternalRef\"\u003e2\u003c/span\u003ea) revealed close genetic distances among samples originating from the same geographic locations. Conversely, principal component analysis (PCA) of population structure (Fig. \u003cspan class=\"InternalRef\"\u003e2\u003c/span\u003eb) indicated potential gene flow between populations.\u003c/p\u003e\n\u003cdiv id=\"Sec11\" class=\"Section2\"\u003e\n \u003ch2\u003eGenetic Diversity Analysis\u003c/h2\u003e\n \u003cp\u003eIndividual heterozygosity ranged from \u0026minus;\u0026thinsp;0.41758 to 0.86718, with mean values for Q1\u0026ndash;Q5 being 0.2047, 0.3696, 0.1933, 0.1600, and 0.1168, respectively. Nucleotide diversity (Pi) analysis (Fig. \u003cspan class=\"InternalRef\"\u003e3\u003c/span\u003ea) indicated a slightly higher diversity in Q5 (mean Pi\u0026thinsp;=\u0026thinsp;0.0285), which included individuals from Zhijin (ZJ-14, ZJ-17, ZJ-2, ZJ-28, ZJ-38, ZJ-6), Liupanshui (LPS-12, LPS-21, LPS-35, LPS-4, LPS-56, LPS-61, LPS-62), and Majiang (MJ-1, MJ-21, MJ-27, MJ-32, MJ-56). Tajima\u0026rsquo;s D values (Fig. \u003cspan class=\"InternalRef\"\u003e3\u003c/span\u003eb) averaged at 0.359 for Q5, surpassing those of the other groups examined. Pairwise Fst values Q1, Q2, Q3, Q4, and Q5 were 0.07, 0.026, 0.032, and 0.032, respectively, suggesting moderate genetic differentiation among these groups.\u003c/p\u003e\n\u003c/div\u003e"},{"header":"Discussion","content":"\u003cp\u003eWith the advancement of high-throughput sequencing technologies, SLAF-Seq, which utilizes second-generation sequencing methods, has demonstrated its efficacy in developing reliable SNP and InDel markers. This approach offers significant advantages for germplasm classification, studies on population evolution, and gene association analyses. For example, simplified genome sequencing has been employed to differentiate 12 clones of Camellia oleifera and to construct SNP fingerprint profiles (Liao et al., 2024). Similarly, SLAF-Seq facilitated the classification of 195 pepper germplasms into three distinct subgroups (Chen et al., 2023). In this study, SLAF-Seq was utilized for the first time to evaluate the genetic diversity of 159 Gleditsia sinensis samples collected from Guizhou Province. Using GATK software, a total of 132,709 high-quality SNPs were identified; within individual samples, SNP counts ranged from 27,292 to 76,870 with heterozygosity rates varying between 4.17% and 15.92%.\u003c/p\u003e \u003cp\u003ePopulation genetic structure analysis indicated that the 159 \u003cem\u003eG. sinensis\u003c/em\u003e individuals could be categorized into five distinct groups. Genetic distances were found to be closer among individuals sharing the same geographic origin, suggesting potential gene flow between populations. The mean heterozygosity values for groups Q1 through Q5 were 0.2047, 0.3696, 0.1933, 0.1600, and 0.1168, respectively.Population Q5 exhibited a marginally elevated level of nucleotide diversity (Pi\u0026thinsp;=\u0026thinsp;0.0285) in comparison to the other groups. Furthermore, Tajima\u0026rsquo;s D, which quantifies intraspecific polymorphism and serves as an indicator for neutral evolution, was significantly higher in Q5 than in the other populations.All groups exhibited positive Tajima\u0026rsquo;s D values, indicating an excess of intermediate-frequency alleles. This phenomenon may be attributed to bottleneck effects, population structure dynamics, or balancing selection (Li et al., \u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e2020\u003c/span\u003e). Simplified genome sequencing of the 159 \u003cem\u003eG. sinensis\u003c/em\u003e samples revealed moderate to strong genetic differentiation among them, with pairwise Fst values ranging from 0.07 to 0.80 (Yin et al., 2023). Notably, the average Fst values recorded at 0.07, 0.026, 0.032, and 0.032 between Q5 and the other groups (Q1\u0026ndash;Q4), respectively. this indicates relatively low levels of genetic differentiation overall.\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eAcknowledgements\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eWe extend our gratitude to the researchers involved in the Project for their dedication and hard work. Additionally,We thank Biomarker Technologies Co.,Ltd for assisting in sequencing and/or bioinformatics analysis.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eFunding\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThis project was supported by The Guizhou Provincial Basic Research Program (No.ZK[2022]-242) and The Guizhou Forestry Seedling Cultivation Project (No. 2024-ZM-19).\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eCompeting Interests\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe authors have no relevant financial or non-financial interests to disclose.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAuthor Contributions\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eBing Yang and Jun Luo designed and executed the experiments, analyzed the data, and drafted the manuscript. Yayan Zhu contributed to both the experimental design and data analysis. Bing Yang and Xiaoyong Dai supervised the project, provided guidance on analyses, and revised the manuscript. All authors reviewed and approved the final version.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eData Availability\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe raw sequence data reported in this paper have been deposited in the Genome Sequence Archive in National Genomics Data Center, China National Center for Bioinformation/Beijing Institute of Genomics, Chinese Academy of Sciences (GSA:CRA023840) that are publicly accessible at https://ngdc.cncb.ac.cn/gsa, (accessed on 18 March 2026).\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\u003cli\u003e\u003cspan\u003eChen Xiaocui,Zhang Xiaowei,Li Hui,Qiu Huarong,Tang Xiangqun,Luo Xirong,Yang Hong,Qin Cheng.SNP Sites Development by Specific Length Amplification Fragment Sequencing (SLAF-seq) and Genetic Analysis in Pepper.Molecular Plant Breeding.1-212\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eChen Z, He Y, Iqbal Y, Shi Y, Huang H, Yi Z: Investigation of genetic relationships within three Miscanthus species using SNP markers identified with SLAF-seq. BMC genomics 2022, 23:1\u0026ndash;11.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eFan Dingchen,Zhang Anshi,Liu Ying,Luo YangGenetic Diversity and Fingerprints of Glediysia sinensis\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGermplasms Based on RSAP.Journal of Henan Agricultural Sciences 2017, 46(11):103\u0026ndash;107.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKozich JJ, Westcott SL, Baxter NT, Highlander SK, Schloss PD: Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform. Applied and environmental microbiology 2013, 79(17):5112\u0026ndash;5120.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLi W, Liu L, Wang Y, Zhang Q, Fan G, Zhang S, Wang Y, Liao K: Genetic diversity, population structure, and relationships of apricot (Prunus) based on restriction site-associated DNA sequencing. Horticulture Research 2020, 7.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLi Wei,2013,Research on Genetic Resources Evaluation of Gleditsia sinensis in Sourth China,Doctoral dissertation,Chinese Academy Forestry,Mentor:Zheng Yongqi\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLI Wei,LIN Fu-rong,ZHENG Yong-qi,SUN Sheng.Genetic Diversity of Ten Gleditsia sinensis Populations from Southern China.Forest Research 2017,30(01):46\u0026ndash;52.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLi Xue-jun,Wang Bu-tian,Zhao Meng,Shi Xue-dong,Chen Zhi-hua,Xie Chun,Lu Yun-feng,Dong Xiang-shu,Ma Yin-bo,Ge Yu.Genetic diversity of Coffea arabica based on SLAF-Seq technology.Journal of Southern Agriculture 2024, 55(06):1583\u0026ndash;1593.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLiao Hongze,Sun Manman,Huang Xiaojuan,Hao Bingqing,Sun Jiaxing,Jiang Zepeng,Wang Dongxue,Liu Kai.Development of SNP markers and construction of DNA fingerprint in teaoil Camellia germplasms based on simplified genome sequencing.Journal of Central South University of Forestry \u0026amp; Technology 2024, 44(04):128\u0026ndash;137.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePrice AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D: Principal components analysis corrects for stratification in genome-wide association studies. Nature genetics 2006, 38(8):904\u0026ndash;909.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eShen Chaogui,Lai Ruilian,Chen Jin,Feng Xin,Chen Yiting,Wei Xiaoxia,Wu Rujian.SNP Markers Development and Genetic Relationship Identification of Germplasm Resources of Canarium album Based on SLAF-Seq Technology.Fujian Journal of Agricultural Sciences2024, 39(05):557\u0026ndash;562.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eShu Chengjie,Li Zhuohang,Chen Diansong,Ma Ling,Zhou Xinyu,Zhao Jing,Wang Miao,Ma Shihong.Total Flavonoids, Anti-mite Activity and Antioxidant Activity ofthe Extract of Gleditsia Sinensis Thron.Chinese Wild Plant Resources 2022, 33(05):1201\u0026ndash;1204.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSun X, Liu D, Zhang X, Li W, Liu H, Hong W, Jiang C, Guan N, Ma C, Zeng H: SLAF-seq: an efficient method of large-scale de novo SNP discovery and genotyping using high-throughput sequencing. PloS one 2013, 8(3):e58700.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eTamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Molecular biology and evolution 2011, 28(10):2731\u0026ndash;2739.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eTian Honghong,2022,Study on the genetic diversity of wild soap pod resources in Guizhou Province,Master's Thesis,Mentor:Zhao Yang\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eTian Honghong,Yang Ju,Lu Chunyun,XiaoO Feng,Zhao Yang.Phenotypic Diversity Analysis of Provenances of Natural Gleditsia sinensis in Guizhou.Acta Botanica Boreali-occidentalis 2022, 42(11):1927\u0026ndash;1935\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eYang Ye,Liu Xiao-long,Lu Xiang,Sun Qing-wen.Ecological suitability regionalization of Gleditsia sinensis Lam.in China based on MaxEnt and ArcGIS.Lishizhen Medicine and Materia Medica Research 2022, 33(05):1201\u0026ndash;1204.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eYin Qingfeng,Wang Yi,Huyan Li,Hao Jiabo,Meng Jinfang,Lu Bin.Genetic Diversity of Wild Zanthoxylum Armatum by ddRAD-seq.Molecular Plant Breeding.1\u0026ndash;22\u0026middot;\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eZhang AnShi,Luo Yang,Fan Ding-Chen,Zhang Zhong-Hai.Genetic diversity and fingerprints of Gleditsia sinensis germplasm resource based on SCoT.Guihaia 2017, 37(11):1378\u0026ndash;1385.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eZhang Anshi,Zhang Sumin,Fan Dingchen,Liu Ying.Genetic diversity and fingerprints of Gleditsia sinensis germplasm based on SRAP.Acta Agriculturae Zhejiangensis2017, 29(09):1524\u0026ndash;153\u003c/span\u003e\u003c/li\u003e\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":true,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"genetic-resources-and-crop-evolution","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"gres","sideBox":"Learn more about [Genetic Resources and Crop Evolution](https://www.springer.com/journal/10722)","snPcode":"10722","submissionUrl":"https://submission.nature.com/new-submission/10722/3","title":"Genetic Resources and Crop Evolution","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"Springer Hybrid","inReviewEnabled":true,"inReviewRevisionsEnabled":false},"keywords":"Gleditsia sinensis, SLAF-seq, SNP, genetic diversity","lastPublishedDoi":"10.21203/rs.3.rs-6259139/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-6259139/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003e \u003cem\u003eGleditsia sinensis\u003c/em\u003e Lam. has significant medicinal and economic value. By utilizing SLAF-Seq technology, we developed single nucleotide polymorphism (SNP) markers to analyze the genetic diversity of this species, providing a basis for the conservation and utilization of germplasm resources. In this study, we conducted SNP marker development on 159 \u003cem\u003eG. sinensis\u003c/em\u003e samples from Guizhou Province, identifying a total of 132,709 population SNPs. The number of SNPs per sample ranged from 27,292 to 76,870, with heterozygosity rates ranging from 4.17\u0026ndash;15.92%. The analysis of population genetic structure indicated that these 159 \u003cem\u003eG. sinensis\u003c/em\u003e samples could be divided into five groups. The genetic distances among samples from the same provenance were close, while gene flow was observed among germplasm resources from different geographical populations. The nucleotide diversity level of population Q5 was relatively high, and the Tajima\u0026rsquo;s D test suggested that rare alleles were scarce within the populations, with the majority of genes being neutral, indicating a state of equilibrium. This study provides fundamental data for the collection of \u003cem\u003eG. sinensis\u003c/em\u003e germplasm resources and the breeding of superior varieties.\u003c/p\u003e","manuscriptTitle":"Analysis of genetic diversity and population Structure of Gleditsia sinensis based on SLAF-Seq","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2025-04-01 16:01:12","doi":"10.21203/rs.3.rs-6259139/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"decision","content":"Revision requested","date":"2025-04-14T17:00:40+00:00","index":"","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2025-04-14T15:02:39+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2025-04-04T07:20:42+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2025-04-01T07:35:50+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"244803093498441617162963571049844066232","date":"2025-03-26T22:47:07+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"125362733724695895967908218581225035563","date":"2025-03-24T21:06:28+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"12173757725200834396050400771871107903","date":"2025-03-24T16:23:22+00:00","index":"hide","fulltext":""},{"type":"reviewersInvited","content":"","date":"2025-03-24T12:29:46+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2025-03-20T07:04:31+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2025-03-20T07:04:16+00:00","index":"","fulltext":""},{"type":"submitted","content":"Genetic Resources and Crop Evolution","date":"2025-03-19T07:42:26+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"genetic-resources-and-crop-evolution","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"gres","sideBox":"Learn more about [Genetic Resources and Crop Evolution](https://www.springer.com/journal/10722)","snPcode":"10722","submissionUrl":"https://submission.nature.com/new-submission/10722/3","title":"Genetic Resources and Crop Evolution","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"Springer Hybrid","inReviewEnabled":true,"inReviewRevisionsEnabled":false}}],"origin":"","ownerIdentity":"6ea5109a-3e19-455e-88a4-957468aae369","owner":[],"postedDate":"April 1st, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"published-in-journal","subjectAreas":[],"tags":[],"updatedAt":"2025-12-15T16:12:44+00:00","versionOfRecord":{"articleIdentity":"rs-6259139","link":"https://doi.org/10.1007/s10722-025-02690-8","journal":{"identity":"genetic-resources-and-crop-evolution","isVorOnly":false,"title":"Genetic Resources and Crop Evolution"},"publishedOn":"2025-12-09 15:59:35","publishedOnDateReadable":"December 9th, 2025"},"versionCreatedAt":"2025-04-01 16:01:12","video":"","vorDoi":"10.1007/s10722-025-02690-8","vorDoiUrl":"https://doi.org/10.1007/s10722-025-02690-8","workflowStages":[]},"version":"v1","identity":"rs-6259139","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-6259139","identity":"rs-6259139","version":["v1"]},"buildId":"8U1c8b4HqxoKbykW_rLl7","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: preprint-html

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00