An enhanced genome assembly and functional, high-throughput molecular markers enable genomics-assisted breeding of waxy sorghum [Sorghum bicolor (L.) Moench] | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article An enhanced genome assembly and functional, high-throughput molecular markers enable genomics-assisted breeding of waxy sorghum [Sorghum bicolor (L.) Moench] Melinda Yerka, Zhiyuan Liu, Scott Bean, Deepti Nigam, Chad Hayes, and 6 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-4883126/v1 This work is licensed under a CC BY 4.0 License Status: Published Journal Publication published 11 Aug, 2025 Read the published version in Journal of Applied Genetics → Version 1 posted 5 You are reading this latest preprint version Abstract Several mutations of the sorghum [ Sorghum bicolor (L.) Moench] GRANULE-BOUND STARCH SYNTHASE ( GBSS ) gene [ Sobic.010G022600 ] result in a low amylose:amylopectin starch ratio in the endosperm and confer a glutinous, “waxy” texture; hence, the wild-type gene is commonly referred to as Waxy ( Wx ). Recessive waxy ( wx ) alleles improve starch digestibility in ethanol production, human foods and beverages, and animal feed. However, breeding waxy sorghum can be time-consuming due to the need for grain to reach physiological maturity before the trait can be phenotyped and ongoing reliance on PCR markers for genotyping, which are not amenable to next-generation sequencing (NGS). Modern genomics-assisted breeding requires conducing high-throughput genotyping and selection in large, segregating populations prior to flowering. This study provides the first published NGS markers for the two mostly commonly used waxy ( wx ) alleles of sorghum and is the first to fully characterize the large insertion that is causal of the wx a allele. An enhanced genome assembly was constructed from the B.Tx623 reference genome (v3.1.1) to include the 5.6 kb la rge r etrotransposon d erivative (LARD) in the wx a allele. This improved read mapping at Sobic.010G022600 in wx a individuals, identified 78 new uniquely mapped reads, and made it possible to distinguish different Waxy genotypes using short-read sequencing data. Functional PACE-PCR markers, suitable for genomic selection, were developed for Wx , wx a , and wx b alleles and validated in three public and private breeding programs. These new molecular breeding resources will improve the efficiency of developing commercial waxy sorghum hybrids. sorghum waxy grain quality molecular markers breeding Figures Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 Introduction Sorghum [ Sorghum bicolor (L.) Moench] is the fifth-most produced grain crop in the world. It is undergoing major development as a climate-smart species due to its wide genetic diversity and its ability to tolerate hot, dry, low-input environments. Sorghum can be marketed with gluten-free, non-GMO, and ancient grain labels in specialty foods, food ingredients, and alcoholic beverages. As a diploid (2 n = 2 x = 20) with a genome size of ~ 700 Mbp, several published reference genomes (Deschamps et al. 2018 , McCormick et al. 2018 , Cooper et al. 2019 , Tao et al. 2021 ) and pangenomes (Deschamps et al. 2018 , Ruperao et al. 2021 ), publicly available mutant and mapping populations (Kumar et al., Xin et al. 2009 , Yerka et al. 2015 , Brenton et al. 2016 , Boyles et al. 2019 , Mace et al. 2021 , Xin et al. 2023 ), and large whole-genome sequencing data of diversity populations (Mace et al. 2013 , Boatwright et al. 2022 ), it is also an emerging model grass species, particularly for sugarcane [ Saccharum oficinarum L.]. Certain mutations of the sorghum GRANULE-BOUND STARCH SYNTHASE ( GBSS ) gene [ Sobic.010G022600 ] result in a low amylose:amylopectin starch ratio. In seeds, low-amylose endosperms have a sticky, “waxy” texture; hence, the wild-type gene is commonly referred to as Waxy . GBSS is highly conserved in the Poaceae and has been extensively reviewed for its role in cereal starch synthesis (Tetlow and Emes 2017 , Tetlow 2018 , Nakamura 2023 ), including in waxy maize [ Zea mays L.] (Talukder et al. 2023 , Wu et al. 2023 ), glutinous rice [ Oryza sativa L.] (Okpala et al. 2022 , Jin et al. 2023 , Nakamura 2023 ), and sorghum (Kang et al. 2023 ). Among sorghum breeders, the waxy grain phenotype is generally considered to be ~ 95–100% amylopectin. A dosage effect on amylopectin content depends on how many recessive waxy alleles are inherited by the triploid endosperm (Karper 1933 , Yerka et al. 2016 ). In 1933, R.E. Karper first documented the Mendelian inheritance of individual alleles of wild-type ( Wx ) and waxy ( wx ) grain phenotypes in sorghum (Karper 1933 ). Pedersen et al. (Pedersen et al. 2005 ) were first to characterize the waxy a ( wx a ) and waxy b ( wx b ) alleles with respect to the production of GBSS: wx a is a knock-out allele producing no GBSS, whereas wx b is “leaky” and produces low levels of non-functional GBSS. Sattler et al. (Sattler et al. 2009 ) performed the first molecular characterization of wx a and wx b alleles and identified the causal mutations of each one: the wx a allele contains a large insertion in the third exon, whereas the wx b allele contains a C to T transition converting amino acid 268 from glutamine to histidine. Since then, two additional alleles have been reported in East Asia, although their nomenclature is inconsistent across publications (Kawahigashi et al. 2013 , Lu et al. 2013 ). The wx c allele described by Lu et al. ( 2013 ) has a G deletion at the 5' splicing site of the ninth intron, causing a frameshift mutation and early translation termination. The wx d allele described by the same authors contains a mutation at the 3' splicing site of the tenth intron, causing a splicing site shift and deletion of five amino acids (GTGKK) in the predicted translated protein. While sorghum is a key subsistence crop for human food and animal feed in Sub-Saharan African and Southeast Asia, it has historically been bred for grain yield and animal feed in North and South America, Europe, and Australia. Waxy sorghum grain has long been associated with improved whole-grain digestibility by animals (Sherrod et al. 1969 , Walker and Lichtenw.Re 1974, Myer and Gorbet 1985 , Schroeder et al. 1998 ), rapid fermentation into ethanol for biofuel production (Yan et al. 2011 , Wu et al. 2013 ), the production of alcoholic beverages (Hsieh and Pi 1979 , Mugode et al. 2011 , Mezgebe et al. 2018 ), and improved baking properties (Sang et al. 2008 , Elhassan et al. 2015 ). Despite its potential uses, sorghum breeders have historically reported conflicting results with waxy sorghum grain yield and germination (Jampala et al. 2012 ), causing a delay in the adoption of this trait by seed companies. In 2015 the first near-isogenic waxy/wild-type lines and hybrids that overcame these challenges were publicly released, leading to current commercial interest (Yerka et al. 2015 , Yerka et al. 2015 ). Consequently, waxy grain is the first value-added food and beverage quality trait in sorghum to be commercially developed in the U.S. The first generation of commercial waxy hybrids are now in the final stages of development by several seed companies and should be released within the next few years. Nevertheless, most of these efforts are limited to Mendelian approaches that rely on PCR-based molecular markers specific to alleles of the Waxy locus. The genomic relationship between yield, germination, and the waxy trait has not been resolved, and continues to hinder the efficiency of developing high-yielding hybrids. Improved genomics-assisted breeding tools would greatly enhance these efforts. The present study was conducted to develop an updated molecular breeding toolkit for waxy sorghum that can be combined with genomic selection for rapid crop improvement. The full-length wx a allele was integrated into chromosome 10 of the B.Tx623 reference genome (v3.1.1) to improve read mapping in diverse waxy materials at Sobic.010G022600 . Functional competitive allele-specific PCR (PACE-PCR) markers were developed for Wx , wx a , and wx b alleles (the most frequently used alleles in U.S. breeding programs) and validated in numerous pedigrees in public and private breeding programs. This new molecular toolkit should enhance the efficiency of waxy sorghum breeding for the production of climate-smart foods, animal feeds, and biofuels. Materials and Methods Plant Materials . Three sets of sorghum genotypes contributed to this analysis: publicly available materials in the University of Nevada, Reno (UNR; Reno, NV) sorghum breeding program, publicly available materials at the USDA-ARS Plant Stress and Germplasm Development Research Unit (Lubbock, TX), and pre-commercial materials (experimental hybrid parents) at Richardson Seeds (Vega, TX), a subsidiary of Nuseed (Victoria, Australia) (Supplementary Table 1). The UNR materials included B.Tx623 and a selection of previously reported waxy and wild-type near-isogenic lines (referred to hereafter as waxy NILs) (Yerka et al. 2015 a, Yerka et al. 2015 b, Yerka et al. 2016 ) upon which additional phenotypic selection has been imposed for uniformity. The USDA materials included 98 experimental breeding lines in the F 4 generation from various population structures (biparental, multiparental, backcross). The pre-commercial Richardson Seeds materials were coded for privacy and include 19 inbred lines in the F 7 generation or later from biparental populations. All plant materials were grown in Lubbock, TX in 2022. Panicles were mechanically threshed to obtain seeds for analysis. Aliquots (20 g) of bulk seed from 2–3 panicles per entry were evaluated for % amylose content on a whole-grain, dry weight basis (dwb) using published near-infrared spectroscopy (NIRS) protocols at the USDA-ARS Grain Quality and Structure Research Unit in Manhattan, KS (Peiris et al. 2021 ). NIR data quality was assured by only reporting those values with Mahalanobis distances < 4.0 (Whitfield et al. 1987 ). The use of both public and private materials from three independent sorghum breeding programs was a strategy to ensure that new molecular markers are robust and will work across diverse populations and molecular breeding programs. Sequencing the full-length wx a insertion . Full-length sequence of the wx a allele was needed to develop an enhanced assembly inclusive of both sequence and structural variations present at the Waxy locus of chromosome 10. Due to the repetitive nature of the transposon insertion in wx a genotypes, a contiguous assembly of the wx a allele was not possible using PE150 “short” reads from our WGS data, nor in published WGS data from the Sorghum Association Panel (SAP). Instead, a de novo full-length sequence of the wx a allele of ‘Tx2907’ (a common trait donor) was generated using long-range PCR, as follows. Genomic DNA was extracted from immature sorghum leaves using the CTAB method (Porebski et al. 1997 ). A Multiskan SkyHigh Microplate Spectrophotometer (Thermo Fisher Scientific, USA) was used to measure the quality and concentration of DNA. Forward (AF: 5′-GGCCTGGATTCAATGTTCTT-3′) and referees (AR: 5′-GCAGCTGGTTGTCCTTGTAG-3′) primers were used with Q5 High-Fidelity 2X Master Mix (NEB, UK) under the following conditions: DNA was denatured at 98°C for 30 sec followed by 33 cycles of 98°C for 10 sec, annealing at 64°C for 30 sec, extension at 72°C for 5 min, and a final extension for 2 min at 72°C. Samples were held at 4°C and stored at -20°C for future use. The amplicon was purified with Monarch PCR & DNA Cleanup Kit (NEB, UK). Cloning was conducted using the Zero Blunt TOPO PCR Cloning Kit. Briefly, the purified PCR product (60–80 ng) was cloned into the TOPO vector and transformed into competent cells of E. coli . After 1-h propagating, the E. coli was streaked onto a 50 µg mL − 1 kanamycin plate and incubated overnight before colony selection and plasmid extraction. Sanger sequencing of the plasmid DNA was conducted by a commercial service provider. The conserved domain of the sequence was annotated at NCBI ( https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi ). These efforts were used to enhance the sorghum reference genome (v3.1.1) to contain the full-length wx a allele on chromosome 10 (referred to hereafter as the “enhanced genome assembly”). Specifically, an in-house Biopython program was developed to add the wx a insertion to the reference genome (v3.1.1) by substituting it and its flanking regions for chr10 positions 1862384 to 1862906. Allele-specific PACE marker development . The principle of PACE-PCR is that primers having unique tail sequences target known SNPs and bind to specific alleles (von Maydell 2023 ). Subsequently, during amplification, allele-specific fluorescent tags are added and detected spectrophotometrically so that different alleles can be identified (von Maydell 2023 ). In this study, primers were designed with the tails FAM (5′-GAAGGTGACCAAGTTCATGCT-3′) and HEX (5′-GAAGGTCGGAGTCAACGGATT-3′). The detailed sequences of each marker are shown in Table S1 . Validation of the PACE-PCR markers. The wx a allele was validated using fragment length polymorphisms amplified in PCR and visualized in gel electrophoresis. The primers and PCR conditions were the same as those above for amplifying the full-length wx a insertion, despite differences in fragment lengths. The wx b allele was validated with Sanger sequencing using the forward primer BF (5′-CCTCTGTCATGCTACCTCAAG-3′) and the reverse primer BR (5′-CATTCATTGCATACCCGTCG-3′) in the same PCR conditions as above. Fragments were separated in a 1% agarose gel. Purified PCR products were used for Sanger sequencing validation. PACE-PCR genotyping was conducted on all plant materials following manufacturer instructions (3CR Bioscience, Harlow, United Kingdom) of a CFX96 Touch Deep Well Real-Time PCR Detection System (BIO-RAD, USA). Specifically, 0.15 µL of the assay mix (12 µM for each allele-specific forward primer and 30 µM for a single common reverse primer in the final PCR reaction), 1 µL gDNA (100–200 ng), 3.85 µL ultra-pure distilled water, and 5 µL PACE™ 2.0 Genotyping Master Mix (3CR, Bioscience) were added to a total reaction volume of 10 µL. DNA was denatured at 94°C, then a touch down (65 − 57°C) step was performed for 10 cycles for primer annealing and initial extension. An additional 30 cycles were used for further extension to generate more tail-specific sequences. Fluorescence was visualized and digitalized for automated analysis with Bio-Rad CFX Maestro software. Whole-genome sequencing (WGS) . WGS data was obtained to generate whole-genome coverage at greater depth and coverage than GBS in selected plant materials to enhance our ability to assess read mapping in Wx versus wx a materials using the enhanced genome assembly. These materials included the grain sorghum parent lines ‘KS115’ ( Wx ) and ‘R.Tx2907’ ( wx a ). PE150 libraries were prepared for Illumina sequencing of gDNA (24 Gb/sample, ~ 30x coverage). To evaluate the quality of the raw reads, FastQC ( https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ ) (Andrews 2010 ) was utilized. Trimming was conducted by removing nucleotide calls with Phred scores below 20 (Q20) from the 3' ends of the reads. Subsequently, high-quality reads were selected for alignments using Bowtie2 (Langmead and Salzberg 2012 ) with default parameters. Whole-genome sequence of R.Tx2907 was retrieved from publicly available source (Boatwright et al. 2022 ). SAM/BAM alignments obtained were analyzed using the SAMtools (Li et al. 2009 ) and BAMtools (Barnett et al. 2011 ) software suites. The aligned reads mapped to the Waxy locus were visualized and compared using the IGV tool (Thorvaldsdóttir et al. 2013 ). Results The full-length wx a insertion . The causal mutation of the wx a allele is a 5,174 bp insertion containing RNase H (RNH), Reverse Transcriptase (RT), gap-pre-integrase, zf (Zinc knuckle) and Transposase domains; a long terminal repeat (LTR) region, and a GAG (gag-polypeptide) domain (Fig. 1 ). It is a large retrotransposon derivative (LARD) element that belongs to LTR retrotransposons (Kalendar et al. 2004 ). LARD elements are nonautonomous retrotransposons, relying on other autonomous TEs to transpose. LARDs consist of non-coding internal domains and LTR (Havecker et al. 2004 ). Within the LARD, the LTR region is a pair of identical 422 bp sequences at each end of the insertion, which serve as regulatory elements for GAG and POL domains needed for gene expression (Finnegan et al., 2012). RNH protein degrades the 5’ LTR to transcribe RNA and promotes priming of the 3’LTR for RT to synthesize a new cDNA molecule. A target region in the genome is cut by integrase and the newly synthesized cDNA is inserted. The GAG domain contains coding sequences for two structural proteins which are believed to play an important, but as-yet undetermined role (Chaparro et al. 2015 ). PACE-PCR markers . The initial PACE-PCR markers provided excellent discrimination of Wx , wx a , and wx b alleles in homozygous genotypes (Figs. 2 A and 2 C). Primer pairs AF/AR, used to discriminate between B.Tx623 ( Wx ) and R.Tx2907 ( wx a ), amplified bands of 523 bp and 5,702 bp, respectively (Fig. 2 B). Sanger sequencing confirmed the presence of the G-T SNP in wx b genotypes (Fig. 2 D). To validate the performance of the markers in breeding, genotyping was conducted blind (not in the breeders’ labs), without a priori knowledge of pedigrees or Waxy allele(s) in each line. The markers were used to screen a total of 258 genotypes in the UNR, USDA, and Richardson Seeds breeding programs, including individuals that were homozygous and heterozygous for various combinations of Waxy alleles (Fig. 3 ). To test whether the genotyping results matched the amylose contents expected by the breeders, we compared the Waxy alleles of plants (determined from young leaf tissue) to the amylose content of mature grain (Table S2 ). The amylose contents of the UNR and Richardson Seeds programs largely agreed with the genotyping results. Out of 98 USDA-ARS lines genotyped, ten were still segregating for different Waxy alleles (heterozygous) and eliminated from the analysis. Twenty-four out of 26 expected homozygous lines were determined to be homozygous for the correct allele (either wx a or wx b ) and had grain amylose contents below 5%, which is the threshold typically considered to define the waxy phenotype. However, two of those 26 lines were not in fact homozygous and were still segregating for wild-type or heterowaxy ( Wx / wx a or Wx / wx b ) plants that were not genotyped but were harvested and included in the grain quality analysis. Alternatively, some homozygous waxy plants could have been adventitiously cross-pollinated by wild-type pollen despite every attempt to cover them prior to flowering. Nevertheless, the average amylose content of the USDA-ARS waxy lines was 2.12%. Four heterowaxy lines had amylose contents between 10.6% and 17.3%, and 58 of 60 wild-type lines had amylose contents over 15%, consistent with the recessive nature of wx alleles and their dose response in the triploid endosperm (Fig. 4 ). Our results demonstrate the importance of high-throughput molecular markers when breeding with waxy materials, as the phenotype is not visible in the grain and adventitious outcrossing with wild-type pollen carrying dominant alleles can easily contaminate breeding stocks after genotyping is conducted early in the season to raise amylose content through xenia (Karper 1933 , Gorbet and Weibel 1972 ). Overall, the PACE genotypes were consistent with expected grain amylose contents and were able to rapidly identify off-types. They are therefore suitable for detecting adventitious outcrossing (through analysis of harvested grain or leaf tissues in the following generation) as a routine quality control measure leading up to commercialization. As the parental lines are from different breeding programs using different genetic backgrounds, the results indicate that these markers can be broadly used in waxy sorghum breeding. Read-mapping at the Waxy locus . The presence of the large insertion in wx a genotypes results in broken reads when flanking sequences that partially cover both wild-type and insertion sequences fail to align to the wild-type reference genome. This limits the efficiency of local short read mapping during molecular breeding and makes linkage disequilibrium analyses more difficult. The enhanced genome assembly was able to overcome this challenge by providing reference sequences for read mapping across the full-length insertion. When WGS data for the inbred lines KS115 ( Wx ) and R.Tx2907 ( wx a ) were mapped to the wild-type (B.Tx623, v3.1.1) and enhanced reference genome (containing the wx a insertion), the same number of reads were detected between Chr. 10: 1,860,965-1,865,278 for KS115, but an additional 78 uniquely mapped reads covering the insertion were mapped for R.Tx2907 (Fig. 5 ). Discussion Breeding for grain quality traits of sorghum . Historic breeding efforts in U.S. sorghum have focused on grain yield more than quality because phenotyping for quality is typically more time-consuming and expensive; and because producers were paid primarily for yield. However, advances in high-throughput phenotyping and genotyping methods have made it easier than ever to phenotype for quality traits and their interactions with whole-plant (grain + biomass) productivity and per-area yield. There is also a small but rapidly growing world-wide market for grain sorghum with grain quality traits, like low amylose content, so breeding programs are beginning to address them. The global value of the gluten-free products market was estimated at USD 6.3 billion in 2022 and projected to reach USD 11.8 billion by 2030 at a compound annual growth rate of 9.5%, based on a Vantage Market Research report. Highlights of the same report noted that a considerable amount of the increase in value could be attributed to more health-conscious consumers who were interested in methods to extend the shelf life of baked goods, especially breads, without the addition of preservatives – a known property of waxy sorghum flour. Waxy sorghum grain has been studied for many years in the food sciences is a high-value endosperm trait associated with high digestibility by animals (Sherrod et al. 1969 , Walker and Lichtenw.Re 1974, Myer and Gorbet 1985 , Schroeder et al. 1998 ), rapid fermentation into ethanol (Yan et al. 2011 , Wu et al. 2013 ), the production of alcoholic beverages (Hsieh and Pi 1979 , Mugode et al. 2011 , Mezgebe et al. 2018 ), and improved baking properties (Sang et al. 2008 , Elhassan et al. 2015 ) due to high-amylopectin starch content. While the waxy trait has been identified since the 1930s (Karper 1933 ), recent hybrids are available with no yield drag or poor agronomic performance compared to near-isogenic or closely-related wild-type hybrids (Jampala et al. 2012 , Yerka et al. 2016 ). It has also been linked to lower resistant starch content, an important factor in food functionality that must be balanced with diet because, when consumed in excess, it could result in unhealthy gut microbiomes, based on human stool microbiomes and mouse feeding studies (Yang et al. 2023 ). This is because the increased digestibility of waxy starches can raise the glycemic index (Frei et al. 2003 , Fitzgerald et al. 2011 ). Sorghum breeders are interested in combining waxy grain with additional target traits such as larger seed size, improved protein content and digestibility, improved germination, sugarcane aphid resistance, earlier maturity, etc. to improve their commercial value, expand production into additional regions, and help open underexplored markets where value-added, climate-smart crops are urgently needed. In particular, the western U.S. may be ideal for the development of new grain quality traits due to minimal precipitation throughout the growing season and harvest period, which can reduce grain quality in more central and eastern regions of the country. Developing waxy sorghum hybrids with improved trait combinations for deployment in new areas will be most efficiently achieved using advanced molecular breeding techniques. The PACE-PCR markers and updated chromosome 10 assembly presented herein provide a solid foundation for those efforts. Molecular breeding resources for adapting sorghum yield and quality to local environments . Genome-to-phenome (G2P) modeling combines physiological information about plant developmental processes with genomic, agronomic, and environomic data. These methods, which often use artificial intelligence (AI) to optimize the predictive capacity of genetic models for improved gene discovery, have emerged as the most effective method for clarifying relationships between inputs and outputs in cropping systems (Harfouche et al. 2019 , Xu et al. 2022 ). Hence, to secure both the abundance and quality of food systems, machine learning will become increasingly important to bring phenotyping for quality traits up to the speed and efficiency of phenotyping for yield for rapid improvements to the sustainable production of microscale, biochemical traits (Yan and Wang 2023 ), like amylose or amylopectin content in climate-smart species like sorghum. The current study reports functional markers for waxy grain, which is not amenable to high-throughput phenotyping in field environments prior to the time of flowering. However, genotyping young plants with NGS markers in genomic selection is becoming standard in many companies, so the reported markers will provide immediate improvements to breeding efficiency by shortening the commercial breeding cycle. They will also assist with ongoing quality control by identifying progeny that are outcrosses derived from adventitious cross-pollination of maternal plants in the previous generation. The improved read mapping achieved in wx a genotypes using the enhanced genome assembly is consistent with what other researchers have demonstrated when incorporating presence-absence and copy number variation into the reference genomes used for making marker-trait associations (Bayer et al. 2020 , Jayakodi et al. 2020 , Tao et al. 2021 , Li et al. 2022 , Sun et al. 2022 ). Future work in the sorghum grain quality space should focus on creating pangenomes that account for even more quality traits not expressed in existing reference genomes. Breeding for human food and animal feed end uses will depend upon identifying functional markers for micro- and macronutrient contents (e.g. starch, protein, oil, amylose, lysine, beta carotene, zinc, iron, etc.), percent vitreosity, kernel hardness, resistant starch content, fiber, seed size, and the interactions of diverse grain physiochemistry profiles with grain weathering and molding, as no in-field, high-throughput phenotyping platform currently exists for them. The ongoing reliance of most breeding programs on laboratory-based quality testing lengthens the breeding cycle when grain must first be harvested and analyzed before superior genotypes can be identified and advanced to the next generation. In addition, the interaction between functional grain qualities and production environments can be impacted by presence-absence variation that plays a role in other aspects of plant physiology and adaptation, such as loci controlling plant height, maturity, and thermal or photoperiod sensitivity. As reference genomes improve, plant breeders will be increasingly able to develop plant populations with improved combinations of yield and quality traits that express well in target environments. Conclusions The waxy trait is one of the first specialty grain quality traits of U.S. sorghum to be commercially produced. The validated markers in our study will be beneficial for various practical applications, such as germplasm characterization, allele mining, and marker assisted selection during backcrossing and forward breeding programs to improve sorghum food, feed, and fuel applications. We have built upon previous efforts to genetically characterize the Waxy locus by developing an enhanced chromosome 10 assembly for read mapping and functional NGS markers. These new resources provide an updated molecular toolkit for sorghum breeding that was validated in three public and private programs. This toolkit will improve the efficiency of breeding waxy sorghum using genomic selection and genome-to-phenome methodologies for rapid functional trait adaptation to target production environments. Declarations Data Availability All datasets generated during and/or analyzed during the current study are available in the supplementary files associated with this manuscript. STATEMENTS AND DECLARATIONS Funding. Funding for the development and analysis of the UNR breeding lines was provided by a United States Department of Agriculture Multi-State Hatch Award (NEV00763) and a United States Department of Agriculture Award (2019-67014-29174) to MKY. Funding for PACE-PCR assays, genomic analyses, and field studies at Texas Tech University (TTU) was provided by new faculty start-up funding to YJ. ZL was supported by the Graduate Support Stipend fellowship from the TTU Davis college of Agricultural Science and Natural Resources obtained by YJ and GP. Funding for the development of the USDA experimental breeding lines was provided by United States Department of Agriculture, Agricultural Research Service Project Plan 3096-21000-024-000-D. Richardson Seeds funded the development of their own experimental breeding lines and provided in-kind funding to MKY to support field studies conducted in Vega, TX during the development of the UNR breeding lines. Competing Interests. The authors have no relevant financial or non-financial interests to disclose. Author Contributions MY and ZL are co-first authors. MY and YJ are co-corresponding authors who designed the studies and analyzed data. MY, YJ, and GP obtained funding to conduct the work. ZL and DN developed the PACE-PCR markers and analyzed data. MY developed the UNR plant materials and led field studies using the UNR and Richardson Seeds plant materials with the assistance of YJ and ZL in Lubbock, TX. CH developed and conducted field studies with the USDA sorghum materials. DD, GK, SM, and GC developed the pre-commercial materials and analyzed data. SB conducted grain quality analyses. All authors read and approved the final manuscript. References Sorghum bicolor BTx642 v1.1, DOE-JGI, http://phytozome.jgi.doe.gov/ Sorghum bicolor RTx430 v2.1, DOE-JGI, http://phytozome.jgi.doe.gov/ Sorghum bicolor SC187 v1.1, DOE-JGI, http://phytozome-next.jgi.doe.gov/info/SbicolorSC187_v1_1 Sorghum bicolor v5.1 DOE-JGI, https://phytozome.jgi.doe.gov/info/Sbicolor_v5_1 Sorghum bicolor Wray v1.1 DOE-JGI, https://phytozome.jgi.doe.gov/info/SbicolorWray_v1_1 Andrews S (2010) FastQC: a quality control tool for high throughput sequence data. Babraham Bioinformatics, Babraham Institute, Cambridge, United Kingdom Barnett DW, Garrison EK, Quinlan AR, Strömberg MP, Marth GT (2011) BamTools: a C + + API and toolkit for analyzing and managing BAM files. Bioinformatics 27(12):1691–1692. https://doi.org/10.1093/bioinformatics/btr174 Bayer PE, Golicz AA, Scheben A, Batley J, Edwards D (2020) Plant pan-genomes are the new reference. Nat plants 6(8):914–920. https://doi.org/10.1038/s41477-020-0733-0 Boatwright JL, Sapkota S, Jin HY, Schnable JC, Brenton Z, Boyles R, Kresovich S (2022) Sorghum Association Panel whole-genome sequencing establishes cornerstone resource for dissecting genomic diversity. Plant J 111(3):888–904. https://doi.org/10.1111/tpj.15853 Boyles RE, Brenton ZW, Kresovich S (2019) Genetic and genomic resources of sorghum to connect genotype with phenotype in contrasting environments. Plant J 97(1):19–39. https://doi.org/10.1111/tpj.14113 Brenton ZW, Cooper EA, Myers MT, Boyles RE, Shakoor N, Zielinski KJ, Rauh BL, Bridges WC, Morris GP, Kresovich S (2016) A genomic resource for the development, improvement, and exploitation of sorghum for bioenergy. Genetics 204:21–33. https://doi.org/10.1534/genetics.115.183947 Chaparro C, Gayraud T, de Souza RF, Domingues DS, Akaffou S, Vanzela ALL, de Kochko A, Rigoreau M, Crouzillat D, Hamon S, Hamon P, Guyot R (2015) Terminal-Repeat Retrotransposons with GAG Domain in Plant Genomes: A New Testimony on the Complex World of Transposable Elements. Genome Biol Evol 7(2):493–504. https://doi.org/10.1093/gbe/evv001 Cooper EA, Brenton ZW, Flinn BS, Jenkins J, Shu SQ, Flowers D, Luo F, Wang YS, Xia P, Barry K, Daum C, Lipzen A, Yoshinaga Y, Schmutz J, Saski C, Vermerris W, Kresovich S (2019) A new reference genome for Sorghum bicolor reveals high levels of sequence similarity between sweet and grain genotypes: implications for the genetics of sugar metabolism. BMC Genomics 20:420. https://doi.org/10.1186/s12864-019-5734-x Deschamps S, Zhang Y, Llaca V, Ye L, Sanyal A, King M, May G, Lin HN (2018) A chromosome-scale assembly of the sorghum genome using nanopore sequencing and optical mapping. Nat Commun 9:4844. https://doi.org/10.1038/s41467-018-07271-1 Elhassan MSM, Emmambux MN, Hays DB, Peterson GC, Taylor JRN (2015) Novel biofortified sorghum lines with combined waxy (high amylopectin) starch and high protein digestibility traits: Effects on endosperm and flour properties. J Cereal Sci 65:132–139. https://doi.org/10.1016/j.jcs.2015.06.017 Fitzgerald MA, Rahman S, Resurreccion AP, Concepcion J, Daygon VD, Dipti SS, Kabir KA, Klingner B, Morell MK, Bird AR (2011) Identification of a Major Genetic Determinant of Glycaemic Index in Rice. Rice 4(2):66–74. https://doi.org/10.1007/s12284-011-9073-z Frei M, Siddhuraju P, Becker K (2003) Studies on the in vitro starch digestibility and the glycemic index of six different indigenous rice cultivars from the Philippines. Food Chem 83(3):395–402. https://doi.org/10.1016/S0308-8146(03)00101-8 Gorbet D, Weibel D (1972) Inheritance and genetic relationships of six endosperm types in sorghum. Crop Sci 12(3):378–382. https://doi.org/10.2135/cropsci1972.0011183X001200030037x Harfouche AL, Jacobson DA, Kainer D, Romero JC, Harfouche AH, Mugnozza GS, Moshelion M, Tuskan GA, Keurentjes JJ, Altman A (2019) Accelerating climate resilient plant breeding by applying next-generation artificial intelligence. Trends Biotechnol 37(11):1217–1235. https://doi.org/10.1016/j.tibtech.2019.05.007 Havecker ER, Gao X, Voytas DF (2004) The diversity of LTR retrotransposons. Genome Biol 5(6):1–6. https://doi.org/10.1186/gb-2004-5-6-225 Hsieh JS, Pi CP (1979) Effect of the waxy gene on grain quality and wine production in sorghum. J Agricultural Association China (106): 30–51 Jampala B, Rooney WL, Peterson GC, Bean S, Hays DB (2012) Estimating the relative effects of the endosperm traits of waxy and high protein digestibility on yield in grain sorghum. Field Crops Res 139:57–62. https://doi.org/10.1016/j.fcr.2012.09.021 Jayakodi M, Padmarasu S, Haberer G, Bonthala VS, Gundlach H, Monat C, Lux T, Kamal N, Lang D, Himmelbach A, Ens J, Zhang X-Q, Angessa TT, Zhou G, Tan C, Hill C, Wang P, Schreiber M, Boston LB, Plott C, Jenkins J, Guo Y, Fiebig A, Budak H, Xu D, Zhang J, Wang C, Grimwood J, Schmutz J, Guo G, Zhang G, Mochida K, Hirayama T, Sato K, Chalmers KJ, Langridge P, Waugh R, Pozniak CJ, Scholz U, Mayer KFX, Spannagl M, Li C, Mascher M, Stein N (2020) The barley pan-genome reveals the hidden legacy of mutation breeding. Nature 588(7837):284–289. https://doi.org/10.1038/s41586-020-2947-8 Jin SK, Xu LN, Leng YJ, Zhang MQ, Yang QQ, Wang SL, Jia SW, Song T, Wang RA, Tao T, Liu QQ, Cai XL, Gao JP (2023) The OsNAC24-OsNAP protein complex activates OsGBSSI and OsSBEI expression to fine-tune starch biosynthesis in rice endosperm. Plant Biotechnol J 21(11):2224–2240. https://doi.org/10.1111/pbi.14124 Kalendar R, Vicient CM, Peleg O, Anamthawat-Jonsson K, Bolshoy A, Schulman AH (2004) Large retrotransposon derivatives: abundant, conserved but nonautonomous retroelements of barley and related genomes. Genetics 166(3):1437–1450. https://doi.org/10.1534/genetics.166.3.1437 Kang XM, Gao W, Cui B, Abd El-Aty AM (2023) Structure and genetic regulation of starch formation in sorghum ( Sorghum bicolor (L.) Moench) endosperm: A review. Int J Biol Macromol 239:124315. https://doi.org/10.1016/j.ijbiomac.2023.124315 Karper R (1933) Inheritance of waxy endosperm in sorghum. J Hered 24(6):257–262. https://doi.org/10.1093/oxfordjournals.jhered.a103794 Kawahigashi H, Oshima M, Nishikawa T, Okuizumi H, Kasuga S, Yonemaru J (2013) A novel waxy allele in sorghum landraces in East Asia. Plant Breeding 132(3):305–310. https://doi.org/10.1111/pbr.12054 Kumar N, Brenton Z, Myers MT, Boyles RE, Sapkota S, Boatwright JL, Cox WA K. E. Jordan and S. Kresovich Registration of the sorghum carbon-partitioning nested association mapping (CP-NAM) population. J Plant Registrations. 15(3):656–663. https://doi.org/10.1002/plr2.20229 Langmead B, Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nat Methods 9(4):357–359. https://doi.org/10.1038/nmeth.1923 Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, R. Durbin and G. P. D. P., Subgroup (2009) The sequence alignment/map format and SAMtools. Bioinformatics 25(16): 2078–2079. https://doi.org/10.1093/bioinformatics/btp352 Li H, Wang S, Chai S, Yang Z, Zhang Q, Xin H, Xu Y, Lin S, Chen X, Yao Z (2022) Graph-based pan-genome reveals structural and sequence variations related to agronomic traits and domestication in cucumber. Nature Communications 13(1): 682. 682 (2022). https://doi.org/10.1038/s41467-022-28362-0 Lu YG, Zhao GL, Li Y, Fan J, Ding GX, Zhao JQ, Ni XL, Xu YJ, Wang WM (2013) Identification of two novel waxy alleles and development of their molecular markers in sorghum. Genome 56(5):283–288 https://doi.org/10.1139/gen-2013-0047 Mace ES, Cruickshank AW, Tao YF, Hunt CH, Jordan DR (2021) A global resource for exploring and exploiting genetic variation in sorghum crop wild relatives. Crop Sci 61(1):150–162. https://doi.org/10.1002/csc2.20332 Mace ES, Tai S, Gilding EK, Li Y, Prentis PJ, Bian L, Campbell BC, Hu W, Innes DJ, Han X, Cruickshank A, Dai C, Frere C, Zhang H, Hunt CH, Wang X, Shatte T, Wang M, Su Z, Li J, Lin X, Godwin ID, Jordan DR, Wang J (2013) Whole-genome sequencing reveals untapped genetic potential in Africa's indigenous cereal crop sorghum. Nat Commun 4:2320–2329. https://doi.org/10.1038/ncomms3320 McCormick RF, Truong SK, Sreedasyam A, Jenkins J, Shu SQ, Sims D, Kennedy M, Amirebrahimi M, Weers BD, McKinley B, Mattison A, Morishige DT, Grimwood J, Schmutz J, Mullet JE (2018) The Sorghum bicolor reference genome: improved assembly, gene annotations, a transcriptome atlas, and signatures of genome organization. Plant J 93(2):338–354. https://doi.org/10.1111/tpj.13781 Mezgebe AG, Abegaz K, Taylor JRN (2018) Relationship between waxy (high amylopectin) and high protein digestibility traits in sorghum and malting quality. J Cereal Sci 79:319–327. https://doi.org/10.1016/j.jcs.2017.11.015 Mugode L, Portillo OR, Hays DB, Rooney LW, Taylor JRN (2011) Influence of high protein digestibility sorghums on free amino nitrogen (FAN) production during malting and mashing. J Inst Brew 117:422–426. https://doi.org/10.1002/j.2050-0416.2011.tb00488.x Myer RO, Gorbet DW (1985) Waxy and normal grain sorghums with varying tannin contents in diets for young pigs. Anim Feed Sci Technol 12(3):179–186. https://doi.org/10.1016/0377-8401(85)90011-2 Nakamura Y (2023) A model for the reproduction of amylopectin cluster by coordinated actions of starch branching enzyme isoforms. Plant Mol Biol 112:199–212. https://doi.org/10.1007/s11103-023-01352-6 Okpala NE, Aloryi KD, An TY, He LX, Tang XR (2022) The roles of starch branching enzymes and starch synthase in the biosynthesis of amylose in rice. J Cereal Sci 104:103393. https://doi.org/10.1016/j.jcs.2021.103393 Pedersen JF, Bean SR, Graybosch RA, Park SH, Tilley M (2005) Characterization of waxy grain sorghum lines in relation to granule-bound starch synthase. Euphytica 144(1–2):151–156. https://doi.org/10.1007/s10681-005-5298-5 Peiris KHS, Wu XR, Bean SR, Perez-Fajardo M, Hayes C, Yerka MK, Jagadish SVK, Ostmeyer T, Aramouni FM, Tesso T, Perumal R, Rooney WL, Kent MA, Bean B (2021) Near infrared spectroscopic evaluation of starch properties of diverse sorghum populations. Processes 9(11). https://doi.org/10.3390/pr9111942 Porebski S, Bailey LG, Baum BR (1997) Modification of a CTAB DNA extraction protocol for plants containing high polysaccharide and polyphenol components. Plant Mol Biology Report 15(1):8–15. https://doi.org/10.1007/BF02772108 Ruperao P, Thirunavukkarasu N, Gandham P, Selvanayagam S, Govindaraj M, Nebie B, Manyasa E, Gupta R, Das RR, Odeny DA, Gandhi H, Edwards D, Deshpande SP, Rathore A (2021) Sorghum pan-genome explores the functional utility for genomic-assisted breeding to accelerate the genetic gain. Front Plant Sci 12:666342. https://doi.org/10.3389/fpls.2021.666342 Sang YJ, Bean S, Seib PA, Pedersen J, Shi YC (2008) Structure and functional properties of sorghum starches differing in amylose content. J Agric Food Chem 56(15):6680–6685. https://doi.org/10.1021/jf800577x Sattler SE, Singh J, Haas EJ, Guo LN, Sarath G, Pedersen JF (2009) Two distinct waxy alleles impact the granule-bound starch synthase in sorghum. Mol Breeding 24(4):349–359. https://doi.org/10.1007/s11032-009-9296-5 Schroeder JW, Marx GD, Park CS (1998) Waxy corn as a replacement for dent corn for lactating dairy cows. Anim Feed Sci Technol 72(1–2):111–120. https://doi.org/10.1016/S0377-8401(97)00171-5 Sherrod LB, Albin RC, Furr RD (1969) Net energy of regular and waxy sorghum grains for finishing steers. J Anim Sci 29(6):997. https://doi.org/10.2527/jas1969.296997x Sun Y, Wang J, Li Y, Jiang B, Wang X, Xu W-H, Wang Y-Q, Zhang P-T, Zhang Y-J, Kong X-D (2022) Pan-genome analysis reveals the abundant gene presence/absence variations among different varieties of melon and their influence on traits. Front Plant Sci 13:835496. https://doi.org/10.3389/fpls.2022.835496 Talukder ZA, Chhabra R, Muthusamy V, Gain N, Katral A, Mishra SJ, Zunjare RU, Hossain F (2023) Recessive waxy1 and opaque2 genes synergistically regulate accumulation of amylopectin, lysine and tryptophan in maize. J Food Compos Anal 121:105392. https://doi.org/10.1016/j.jfca.2023.105392 Tao Y, Luo H, Xu J, Cruickshank A, Zhao X, Teng F, Hathorn A, Wu X, Liu Y, Shatte T (2021) Extensive variation within the pan-genome of cultivated and wild sorghum. Nat Plants 7(6):766–773. https://doi.org/10.1038/s41477-021-00925-x Tetlow IJ (2018) Starch Biosynthesis in Crop Plants. Agronomy-Basel 8(6):81. https://doi.org/10.3390/agronomy8060081 Tetlow IJ, Emes MJ (2017) Starch biosynthesis in the developing endosperms of grasses and cereals. Agronomy-Basel 7(4):81. https://doi.org/10.3390/agronomy7040081 Thorvaldsdóttir H, Robinson JT, Mesirov JP (2013) Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform 14(2):178–192. https://doi.org/10.1093/bib/bbs017 von Maydell D (2023) PCR Allele Competitive Extension (PACE). Plant Genotyping: Methods and Protocols. Y. Shavrukov. Springer US, New York, NY, pp 263–271 Walker RD and Lichtenw.Re (1974) Ruminal digestibility of waxy grain sorghum. J Anim Sci 39(1):256–256 Whitfield RG, Gerger ME, Sharp RL (1987) Near-infrared spectrum Qualification via Mahalanobis Distance determination. Appl Spectrosc 41(7):1204–1213 Wu WH, Zhong YY, Liu YL, Xu RY, Zhang XD, Liu N, Guo DW (2023) A new insight into the biosynthesis, structure, and functionality of waxy maize starch under drought stress. J Sci Food Agric 103(11):5270–5276. https://doi.org/10.1002/jsfa.12599 Wu XR, Maier A, Wu TT, Li NB, Pedersen JR, Wang DH (2013) Evaluation of Nebraska waxy sorghum hybrids for ethanol production. Cereal Chem 90(3):198–203. https://doi.org/10.1094/CCHEM-06-12-0067-R Xin ZG, Jiao YP, Burow G, Hayes C, Chen JP, Burke J, Pugh NA, Ware D (2023) Registration of 252 sequenced sorghum mutants as a community reverse genetic resource. J Plant Registrations. https://doi.org/10.1002/plr2.20296 Xin ZG, Wang ML, Burow G, Burke J (2009) An induced sorghum mutant population suitable for bioenergy research. Bioenergy Res 2(1–2):10–16. https://doi.org/10.1007/s12155-008-9029-3 Xu Y, Zhang X, Li H, Zheng H, Zhang J, Olsen MS, Varshney RK, Prasanna BM, Qian Q (2022) Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction. Mol Plant 15(11):1664–1695. https://doi.org/10.1016/j.molp.2022.09.001 Yan J, Wang X (2023) Machine learning bridges omics sciences and plant breeding. Trends Plant Sci 28(2):199–210. https://doi.org/10.1016/j.tplants.2022.08.018 Yan SP, Wu XR, Bean SR, Pedersen JF, Tesso T, Chen YHR, Wang DH (2011) Evaluation of waxy grain sorghum for ethanol production. Cereal Chem 88(6):589–595. https://doi.org/10.1094/CCHEM-04-11-0056 Yang QN, Van Haute M, Korth N, Sattler S, Rose D, Juritsch A, Shao J, Beede K, Schmaltz R, Price J, Toy J, Ramer-Tait AE, Benson AK (2023) The waxy mutation in sorghum and other cereal grains reshapes the gut microbiome by reducing levels of multiple beneficial species. Gut Microbes 15(1):2178799. https://doi.org/10.1080/19490976.2023.2178799 Yerka MK, Toy JJ, Funnell-Harris DL, Sattler SE, Pedersen JF (2015) Registration of A/BN641 and RN642 waxy grain sorghum genetic stocks. J Plant Registrations 9(2):258–261. https://doi.org/10.3198/jpr2014.10.0075crgs Yerka MK, Toy JJ, Funnell-Harris DL, Sattler SE, Pedersen JF (2015) Registration of N619 to N640 grain sorghum lines with waxy or wild-type endosperm. J Plant Registrations 9(2):249–253. https://doi.org/10.3198/jpr2014.06.0043crgs Yerka MK, Toy JJ, Funnell-Harris DL, Sattler SE, Pedersen JF (2016) Evaluation of interallelic waxy, heterowaxy, and wild-type grain sorghum hybrids. Crop Sci 56(1):113–121. https://doi.org/10.2135/cropsci2015.03.0151 Supplementary Files SupplementaryFile1WaxyLocusAssemblywithInsertion.docx SupplementaryTables1and2.xlsx Cite Share Download PDF Status: Published Journal Publication published 11 Aug, 2025 Read the published version in Journal of Applied Genetics → Version 1 posted Editorial decision: Major Revisions Needed 04 Feb, 2025 Reviewers agreed at journal 12 Sep, 2024 Reviewers invited by journal 05 Sep, 2024 Editor assigned by journal 12 Aug, 2024 First submitted to journal 08 Aug, 2024 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-4883126","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":349861822,"identity":"f915d61f-9697-4aea-a80c-6744411f5925","order_by":0,"name":"Melinda Yerka","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA9UlEQVRIiWNgGAWjYFAC5oYDDGwgBpD4AKLYCWphRGhhnAGimInQwgDTwswDtpaABt32xsbDBWV2if3SxxI/2/zaJs/HzMD44WMObi1mZw42HJ5xLjlxZl/aYencvtuGbcwMzJIzt+HRciOx4TBvG3PihjPsDdK5PbcZgVrYmHnxabn/EKSlPnH/Gfbm35Y9t+0Ja7nBCNJyOHEDD9sxaYYftxMJazkDdBjPuePGM86wpVn2NtxObmNmbMbvl+OHD3/mKauW7e9hM77x489t2/ntzQc/fMSjBQYcG0AkYxuYbCCsHgjsIdQfohSPglEwCkbBCAMA1uxVwy7WfpwAAAAASUVORK5CYII=","orcid":"https://orcid.org/0000-0001-7210-8684","institution":"University of Nevada Reno","correspondingAuthor":true,"prefix":"","firstName":"Melinda","middleName":"","lastName":"Yerka","suffix":""},{"id":349861823,"identity":"1f3f7f05-64f7-49c5-9f29-1b85ab02bcc1","order_by":1,"name":"Zhiyuan Liu","email":"","orcid":"","institution":"Texas Tech University","correspondingAuthor":false,"prefix":"","firstName":"Zhiyuan","middleName":"","lastName":"Liu","suffix":""},{"id":349861824,"identity":"cfaee194-c358-4d4f-8e7c-7b3965d929c8","order_by":2,"name":"Scott Bean","email":"","orcid":"","institution":"USDA-ARS: USDA Agricultural Research Service","correspondingAuthor":false,"prefix":"","firstName":"Scott","middleName":"","lastName":"Bean","suffix":""},{"id":349861825,"identity":"ee899147-a045-4041-b975-715ea10df060","order_by":3,"name":"Deepti Nigam","email":"","orcid":"","institution":"Texas Tech University","correspondingAuthor":false,"prefix":"","firstName":"Deepti","middleName":"","lastName":"Nigam","suffix":""},{"id":349861826,"identity":"3d01a004-20b1-4877-b07d-f60cced88a75","order_by":4,"name":"Chad Hayes","email":"","orcid":"","institution":"USDA ARS: USDA Agricultural Research Service","correspondingAuthor":false,"prefix":"","firstName":"Chad","middleName":"","lastName":"Hayes","suffix":""},{"id":349861827,"identity":"dfd8abdb-40dc-4f35-8457-cbfb34e9280a","order_by":5,"name":"Diego Druetto","email":"","orcid":"","institution":"Richardson Seeds","correspondingAuthor":false,"prefix":"","firstName":"Diego","middleName":"","lastName":"Druetto","suffix":""},{"id":349861828,"identity":"2c042a5b-753b-4f09-9d2d-e5f601597423","order_by":6,"name":"Gabriel Krishnamoorthy","email":"","orcid":"","institution":"Richardson Seeds","correspondingAuthor":false,"prefix":"","firstName":"Gabriel","middleName":"","lastName":"Krishnamoorthy","suffix":""},{"id":349861829,"identity":"75526067-065e-470e-842f-16bc9d300080","order_by":7,"name":"Shelley Meiwes","email":"","orcid":"","institution":"Richardson Seeds","correspondingAuthor":false,"prefix":"","firstName":"Shelley","middleName":"","lastName":"Meiwes","suffix":""},{"id":349861830,"identity":"05964860-523b-4284-98c1-358d6b6c3b28","order_by":8,"name":"Gonzalo Cucit","email":"","orcid":"","institution":"NuSeed","correspondingAuthor":false,"prefix":"","firstName":"Gonzalo","middleName":"","lastName":"Cucit","suffix":""},{"id":349861831,"identity":"b63c2da6-5612-40e3-94a2-8d12a37ab35b","order_by":9,"name":"Gunvant B. Patil","email":"","orcid":"","institution":"Texas Tech University","correspondingAuthor":false,"prefix":"","firstName":"Gunvant","middleName":"B.","lastName":"Patil","suffix":""},{"id":349861832,"identity":"a5ed40a2-19ae-4347-a3db-83c48ec149f0","order_by":10,"name":"Yinping Jiao","email":"","orcid":"","institution":"Texas Tech University","correspondingAuthor":false,"prefix":"","firstName":"Yinping","middleName":"","lastName":"Jiao","suffix":""}],"badges":[],"createdAt":"2024-08-08 20:30:59","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-4883126/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-4883126/v1","draftVersion":[],"editorialEvents":[{"content":"https://doi.org/10.1007/s13353-025-00993-1","type":"published","date":"2025-08-11T15:57:48+00:00"}],"editorialNote":"","failedWorkflow":false,"files":[{"id":66101688,"identity":"fd42b2e1-24eb-4d11-ae63-67042a0d3b25","added_by":"auto","created_at":"2024-10-07 17:17:29","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":106265,"visible":true,"origin":"","legend":"\u003cp\u003eThe genetic architecture of three alleles of the sorghum \u003cem\u003eWaxy\u003c/em\u003e locus, \u003cem\u003eSobic.010G022600\u003c/em\u003e, including the site of the full-length insertion that is causal of the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e allele and the SNP (G-T) that is causal of the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e allele. Exons are black. The primers AF/AR and BF/BR are forward/reverse primers that were used to amplify the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e insertion and the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e SNP, respectively. LTR, Long Terminal Repeats; RT, Reverse Transcriptase; gag_p, GAG-pre-integrase domain; zf, Zinc knuckle; GAG, gag-polypeptide\u003c/p\u003e","description":"","filename":"floatimage1.png","url":"https://assets-eu.researchsquare.com/files/rs-4883126/v1/a675b119fad5f3a52ac44fba.png"},{"id":66101460,"identity":"cbac85dc-d3ef-46b6-8a9d-03f1f8e34026","added_by":"auto","created_at":"2024-10-07 17:09:29","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":314497,"visible":true,"origin":"","legend":"\u003cp\u003eOriginal PACE-PCR marker design for \u003cem\u003eWx\u003c/em\u003e, \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e and \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e alleles was based on the inbred lines R.Tx2907 (\u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e), B.TxARG1 (\u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e), and B.Tx623 (\u003cem\u003eWx\u003c/em\u003e). (A) Genotyping results for the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e marker. Three individuals were tested for each genotype. One no-template control (NTC) and one positive control (\u003cem\u003eWx\u003c/em\u003e) were included. Clustered blue and orange dots represent \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e and \u003cem\u003eWx\u003c/em\u003e genotypes, respectively, and the black dot represents NTC. (B) PCR validation of the full-length 5,174 bp \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e insertion. Lanes 1-4 are individual plants of R.Tx2907 (\u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e) and lanes 5-6 are individual plants of B.Tx623 (\u003cem\u003eWx\u003c/em\u003e). All fragments were amplified with the primer pair AF1/AR1. The band sizes were 5702 bp and 523 bp for the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e and \u003cem\u003eWx\u003c/em\u003e alleles, respectively. (C) Original PACE-PCR marker design for \u003cem\u003eWx\u003c/em\u003e and \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e alleles. Three individuals were tested for each genotype. One NTC and one positive control (\u003cem\u003eWx\u003c/em\u003e) were included. Clustered blue and orange dots represent \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e and \u003cem\u003eWx\u003c/em\u003e genotypes, respectively, and the black dot is the NTC. (D) The primer pair BF1/BR1 was used to amplify the SNP that is causal of the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e allele using two individuals each from B.TxARG1, B.N631, and B.N633. A G-T SNP is causal of the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e allele\u003c/p\u003e","description":"","filename":"floatimage2.png","url":"https://assets-eu.researchsquare.com/files/rs-4883126/v1/c507845b17204a81297dfcfc.png"},{"id":66100959,"identity":"b838604b-1dc7-4942-89cd-b7cf70fc34df","added_by":"auto","created_at":"2024-10-07 17:01:29","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":120835,"visible":true,"origin":"","legend":"\u003cp\u003ePACE-PCR genotyping results for the three breeding programs, including both homozygotes and heterozygotes. (A) Genotyping results using the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e marker. Clustered blue dots represent \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e\u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e genotypes, green dots represent \u003cem\u003eWxwx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e genotypes, and orange dots represent \u003cem\u003eWxWx\u003c/em\u003e genotypes. (B) Genotyping results using the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e marker. Clustered blue dots represent \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e\u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e genotypes, green dots represent \u003cem\u003eWxwx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e genotypes, and orange dots represent \u003cem\u003eWxWx\u003c/em\u003e genotypes\u003c/p\u003e","description":"","filename":"floatimage3.png","url":"https://assets-eu.researchsquare.com/files/rs-4883126/v1/6ea272c536ea6c8297c7e254.png"},{"id":66100962,"identity":"2ba6cace-048c-4980-9fe9-f1ce5d85c278","added_by":"auto","created_at":"2024-10-07 17:01:29","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":96981,"visible":true,"origin":"","legend":"\u003cp\u003eAmylose contents (% whole grain, dwb) of individuals from waxy, heterowaxy, and wild-type breeding lines, as reported by the USDA-ARS breeding program. A subsample of these individuals was genotyped with PACE-PCR markers using leaf tissue. The red dotted line represents the phenotypic threshold that is conventionally used to define a line as being waxy (5% amylose). Black dots represent the amylose contents of single plants; more individuals per line were evaluated for amylose content than were genotyped. Each column is the expected \u003cem\u003eWaxy \u003c/em\u003ehaplotype reported by the breeder who contributed seeds, with the diploid \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e genotype first and the diploid \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e genotype second: \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e\u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e\u003cem\u003e \u003c/em\u003e(homozygous \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e), \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e\u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e\u003cem\u003e \u003c/em\u003e(homozygous \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e), \u003cem\u003eWxwx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e\u003cem\u003e \u003c/em\u003e(heterozygous for \u003cem\u003eWx\u003c/em\u003e and \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e), \u003cem\u003eWxwx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e\u003cem\u003e \u003c/em\u003e(heterozygous for \u003cem\u003eWx\u003c/em\u003e and \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e), \u003cem\u003eWxWx \u003c/em\u003e(wild-type). The low (\u0026lt; 5%) and intermediate (10-15%) amylose contents in individuals in the wild-type (\u003cem\u003eWxWxWxWx\u003c/em\u003e) column indicate that the breeding line was still segregating for recessive \u003cem\u003ewaxy\u003c/em\u003e alleles in ungenotyped plants that contributed to grain quality analysis, or xenia from adventitious outcrossing with wild-type materials in the previous generation.\u0026nbsp; The intermediate amylose contents (10-11%) in the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e\u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e\u003cem\u003eWxWx\u003c/em\u003e and \u003cem\u003eWxWxWxWx\u003c/em\u003e columns indicate that some plants were heterowaxy (\u003cem\u003eWxwx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e\u003cem\u003eWxWx\u003c/em\u003e, \u003cem\u003eWxwx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e\u003cem\u003eWx\u003c/em\u003ewxb, or \u003cem\u003eWxWxWx wx\u003c/em\u003e), also due to segregation or xenia in ungenotyped individuals. These results demonstrate the importance of a rapid and high-throughput genotyping assay while working with the waxy trait to prevent undesirable \u003cem\u003ewaxy\u003c/em\u003e or wild-type alleles from contributing to consecutive generations\u003c/p\u003e","description":"","filename":"floatimage4.png","url":"https://assets-eu.researchsquare.com/files/rs-4883126/v1/71483bf1448e6d1d93ec5e70.png"},{"id":66100964,"identity":"a0aa9194-1bed-417a-bc51-18847c9df032","added_by":"auto","created_at":"2024-10-07 17:01:29","extension":"png","order_by":5,"title":"Figure 5","display":"","copyAsset":false,"role":"figure","size":463480,"visible":true,"origin":"","legend":"\u003cp\u003eWGS reads from \u003cem\u003eWx\u003c/em\u003e (KS115) and \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e (R.Tx2907) inbred lines mapped to the wild-type (\u003cem\u003eWx\u003c/em\u003e) B.Tx623 reference genome (v3.1.1) and the enhanced genome assembly including the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e insertion. The \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e primer pair AF/AR was used to sequence the full-length insertion containing a LARD retrotransposon. An additional 78 unique reads of R.Tx2907 mapped to the enhanced genome assembly compared to the wild-type genome\u003c/p\u003e","description":"","filename":"floatimage5.png","url":"https://assets-eu.researchsquare.com/files/rs-4883126/v1/f666c904ed10641bb1e9d306.png"},{"id":89310650,"identity":"1f05bcc4-6541-4632-b6fc-b25dfae7fc6e","added_by":"auto","created_at":"2025-08-18 16:09:10","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":1763557,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-4883126/v1/72f822e4-f1cc-4948-8c70-5cb07281e4f1.pdf"},{"id":66100963,"identity":"aa7e0fd3-348f-4371-b647-c8625d429253","added_by":"auto","created_at":"2024-10-07 17:01:29","extension":"docx","order_by":5,"title":"","display":"","copyAsset":false,"role":"supplement","size":16493,"visible":true,"origin":"","legend":"","description":"","filename":"SupplementaryFile1WaxyLocusAssemblywithInsertion.docx","url":"https://assets-eu.researchsquare.com/files/rs-4883126/v1/c8371a8369870db1a5d49910.docx"},{"id":66101689,"identity":"7824d688-b47e-4a5f-a6a2-cd3c0b1e4327","added_by":"auto","created_at":"2024-10-07 17:17:29","extension":"xlsx","order_by":6,"title":"","display":"","copyAsset":false,"role":"supplement","size":24040,"visible":true,"origin":"","legend":"","description":"","filename":"SupplementaryTables1and2.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-4883126/v1/6756dd46e6306c6d9e943e52.xlsx"}],"financialInterests":"","formattedTitle":"An enhanced genome assembly and functional, high-throughput molecular markers enable genomics-assisted breeding of waxy sorghum [Sorghum bicolor (L.) Moench]","fulltext":[{"header":"Introduction","content":"\u003cp\u003eSorghum [\u003cem\u003eSorghum bicolor\u003c/em\u003e (L.) Moench] is the fifth-most produced grain crop in the world. It is undergoing major development as a climate-smart species due to its wide genetic diversity and its ability to tolerate hot, dry, low-input environments. Sorghum can be marketed with gluten-free, non-GMO, and ancient grain labels in specialty foods, food ingredients, and alcoholic beverages. As a diploid (2\u003cem\u003en\u003c/em\u003e\u0026thinsp;=\u0026thinsp;2\u003cem\u003ex\u003c/em\u003e\u0026thinsp;=\u0026thinsp;20) with a genome size of ~\u0026thinsp;700 Mbp, several published reference genomes (Deschamps et al. \u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e2018\u003c/span\u003e, McCormick et al. \u003cspan citationid=\"CR37\" class=\"CitationRef\"\u003e2018\u003c/span\u003e, Cooper et al. \u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e2019\u003c/span\u003e, Tao et al. \u003cspan citationid=\"CR53\" class=\"CitationRef\"\u003e2021\u003c/span\u003e) and pangenomes (Deschamps et al. \u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e2018\u003c/span\u003e, Ruperao et al. \u003cspan citationid=\"CR46\" class=\"CitationRef\"\u003e2021\u003c/span\u003e), publicly available mutant and mapping populations (Kumar et al., Xin et al. \u003cspan citationid=\"CR63\" class=\"CitationRef\"\u003e2009\u003c/span\u003e, Yerka et al. \u003cspan citationid=\"CR68\" class=\"CitationRef\"\u003e2015\u003c/span\u003e, Brenton et al. \u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e2016\u003c/span\u003e, Boyles et al. \u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e2019\u003c/span\u003e, Mace et al. \u003cspan citationid=\"CR35\" class=\"CitationRef\"\u003e2021\u003c/span\u003e, Xin et al. \u003cspan citationid=\"CR62\" class=\"CitationRef\"\u003e2023\u003c/span\u003e), and large whole-genome sequencing data of diversity populations (Mace et al. \u003cspan citationid=\"CR36\" class=\"CitationRef\"\u003e2013\u003c/span\u003e, Boatwright et al. \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e2022\u003c/span\u003e), it is also an emerging model grass species, particularly for sugarcane [\u003cem\u003eSaccharum oficinarum\u003c/em\u003e L.].\u003c/p\u003e \u003cp\u003eCertain mutations of the sorghum \u003cem\u003eGRANULE-BOUND STARCH SYNTHASE\u003c/em\u003e (\u003cem\u003eGBSS\u003c/em\u003e) gene [\u003cem\u003eSobic.010G022600\u003c/em\u003e] result in a low amylose:amylopectin starch ratio. In seeds, low-amylose endosperms have a sticky, \u0026ldquo;waxy\u0026rdquo; texture; hence, the wild-type gene is commonly referred to as \u003cem\u003eWaxy\u003c/em\u003e. GBSS is highly conserved in the Poaceae and has been extensively reviewed for its role in cereal starch synthesis (Tetlow and Emes \u003cspan citationid=\"CR55\" class=\"CitationRef\"\u003e2017\u003c/span\u003e, Tetlow \u003cspan citationid=\"CR54\" class=\"CitationRef\"\u003e2018\u003c/span\u003e, Nakamura \u003cspan citationid=\"CR41\" class=\"CitationRef\"\u003e2023\u003c/span\u003e), including in waxy maize [\u003cem\u003eZea mays\u003c/em\u003e L.] (Talukder et al. \u003cspan citationid=\"CR52\" class=\"CitationRef\"\u003e2023\u003c/span\u003e, Wu et al. \u003cspan citationid=\"CR60\" class=\"CitationRef\"\u003e2023\u003c/span\u003e), glutinous rice [\u003cem\u003eOryza sativa\u003c/em\u003e L.] (Okpala et al. \u003cspan citationid=\"CR42\" class=\"CitationRef\"\u003e2022\u003c/span\u003e, Jin et al. \u003cspan citationid=\"CR24\" class=\"CitationRef\"\u003e2023\u003c/span\u003e, Nakamura \u003cspan citationid=\"CR41\" class=\"CitationRef\"\u003e2023\u003c/span\u003e), and sorghum (Kang et al. \u003cspan citationid=\"CR26\" class=\"CitationRef\"\u003e2023\u003c/span\u003e). Among sorghum breeders, the waxy grain phenotype is generally considered to be ~\u0026thinsp;95\u0026ndash;100% amylopectin. A dosage effect on amylopectin content depends on how many recessive \u003cem\u003ewaxy\u003c/em\u003e alleles are inherited by the triploid endosperm (Karper \u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e1933\u003c/span\u003e, Yerka et al. \u003cspan citationid=\"CR70\" class=\"CitationRef\"\u003e2016\u003c/span\u003e). In 1933, R.E. Karper first documented the Mendelian inheritance of individual alleles of wild-type (\u003cem\u003eWx\u003c/em\u003e) and waxy (\u003cem\u003ewx\u003c/em\u003e) grain phenotypes in sorghum (Karper \u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e1933\u003c/span\u003e). Pedersen et al. (Pedersen et al. \u003cspan citationid=\"CR43\" class=\"CitationRef\"\u003e2005\u003c/span\u003e) were first to characterize the \u003cem\u003ewaxy a\u003c/em\u003e (\u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e) and \u003cem\u003ewaxy b\u003c/em\u003e (\u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e) alleles with respect to the production of GBSS: \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e is a knock-out allele producing no GBSS, whereas \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e is \u0026ldquo;leaky\u0026rdquo; and produces low levels of non-functional GBSS. Sattler et al. (Sattler et al. \u003cspan citationid=\"CR48\" class=\"CitationRef\"\u003e2009\u003c/span\u003e) performed the first molecular characterization of \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e and \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e alleles and identified the causal mutations of each one: the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e allele contains a large insertion in the third exon, whereas the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e allele contains a C to T transition converting amino acid 268 from glutamine to histidine. Since then, two additional alleles have been reported in East Asia, although their nomenclature is inconsistent across publications (Kawahigashi et al. \u003cspan citationid=\"CR28\" class=\"CitationRef\"\u003e2013\u003c/span\u003e, Lu et al. \u003cspan citationid=\"CR33\" class=\"CitationRef\"\u003e2013\u003c/span\u003e). The \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ec\u003c/em\u003e\u003c/sup\u003e allele described by Lu et al. (\u003cspan citationid=\"CR33\" class=\"CitationRef\"\u003e2013\u003c/span\u003e) has a G deletion at the 5' splicing site of the ninth intron, causing a frameshift mutation and early translation termination. The \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ed\u003c/em\u003e\u003c/sup\u003e allele described by the same authors contains a mutation at the 3' splicing site of the tenth intron, causing a splicing site shift and deletion of five amino acids (GTGKK) in the predicted translated protein.\u003c/p\u003e \u003cp\u003eWhile sorghum is a key subsistence crop for human food and animal feed in Sub-Saharan African and Southeast Asia, it has historically been bred for grain yield and animal feed in North and South America, Europe, and Australia. Waxy sorghum grain has long been associated with improved whole-grain digestibility by animals (Sherrod et al. \u003cspan citationid=\"CR50\" class=\"CitationRef\"\u003e1969\u003c/span\u003e, Walker and Lichtenw.Re 1974, Myer and Gorbet \u003cspan citationid=\"CR40\" class=\"CitationRef\"\u003e1985\u003c/span\u003e, Schroeder et al. \u003cspan citationid=\"CR49\" class=\"CitationRef\"\u003e1998\u003c/span\u003e), rapid fermentation into ethanol for biofuel production (Yan et al. \u003cspan citationid=\"CR66\" class=\"CitationRef\"\u003e2011\u003c/span\u003e, Wu et al. \u003cspan citationid=\"CR61\" class=\"CitationRef\"\u003e2013\u003c/span\u003e), the production of alcoholic beverages (Hsieh and Pi \u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e1979\u003c/span\u003e, Mugode et al. \u003cspan citationid=\"CR39\" class=\"CitationRef\"\u003e2011\u003c/span\u003e, Mezgebe et al. \u003cspan citationid=\"CR38\" class=\"CitationRef\"\u003e2018\u003c/span\u003e), and improved baking properties (Sang et al. \u003cspan citationid=\"CR47\" class=\"CitationRef\"\u003e2008\u003c/span\u003e, Elhassan et al. \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e2015\u003c/span\u003e). Despite its potential uses, sorghum breeders have historically reported conflicting results with waxy sorghum grain yield and germination (Jampala et al. \u003cspan citationid=\"CR22\" class=\"CitationRef\"\u003e2012\u003c/span\u003e), causing a delay in the adoption of this trait by seed companies. In 2015 the first near-isogenic waxy/wild-type lines and hybrids that overcame these challenges were publicly released, leading to current commercial interest (Yerka et al. \u003cspan citationid=\"CR68\" class=\"CitationRef\"\u003e2015\u003c/span\u003e, Yerka et al. \u003cspan citationid=\"CR68\" class=\"CitationRef\"\u003e2015\u003c/span\u003e). Consequently, waxy grain is the first value-added food and beverage quality trait in sorghum to be commercially developed in the U.S. The first generation of commercial waxy hybrids are now in the final stages of development by several seed companies and should be released within the next few years. Nevertheless, most of these efforts are limited to Mendelian approaches that rely on PCR-based molecular markers specific to alleles of the \u003cem\u003eWaxy\u003c/em\u003e locus. The genomic relationship between yield, germination, and the waxy trait has not been resolved, and continues to hinder the efficiency of developing high-yielding hybrids. Improved genomics-assisted breeding tools would greatly enhance these efforts.\u003c/p\u003e \u003cp\u003eThe present study was conducted to develop an updated molecular breeding toolkit for waxy sorghum that can be combined with genomic selection for rapid crop improvement. The full-length \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e allele was integrated into chromosome 10 of the B.Tx623 reference genome (v3.1.1) to improve read mapping in diverse waxy materials at \u003cem\u003eSobic.010G022600\u003c/em\u003e. Functional competitive allele-specific PCR (PACE-PCR) markers were developed for \u003cem\u003eWx\u003c/em\u003e, \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e, and \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e alleles (the most frequently used alleles in U.S. breeding programs) and validated in numerous pedigrees in public and private breeding programs. This new molecular toolkit should enhance the efficiency of waxy sorghum breeding for the production of climate-smart foods, animal feeds, and biofuels.\u003c/p\u003e"},{"header":"Materials and Methods","content":"\u003cp\u003e \u003cem\u003ePlant Materials\u003c/em\u003e. Three sets of sorghum genotypes contributed to this analysis: publicly available materials in the University of Nevada, Reno (UNR; Reno, NV) sorghum breeding program, publicly available materials at the USDA-ARS Plant Stress and Germplasm Development Research Unit (Lubbock, TX), and pre-commercial materials (experimental hybrid parents) at Richardson Seeds (Vega, TX), a subsidiary of Nuseed (Victoria, Australia) (Supplementary Table\u0026nbsp;1). The UNR materials included B.Tx623 and a selection of previously reported waxy and wild-type near-isogenic lines (referred to hereafter as waxy NILs) (Yerka et al. \u003cspan citationid=\"CR68\" class=\"CitationRef\"\u003e2015\u003c/span\u003ea, Yerka et al. \u003cspan citationid=\"CR68\" class=\"CitationRef\"\u003e2015\u003c/span\u003eb, Yerka et al. \u003cspan citationid=\"CR70\" class=\"CitationRef\"\u003e2016\u003c/span\u003e) upon which additional phenotypic selection has been imposed for uniformity. The USDA materials included 98 experimental breeding lines in the F\u003csub\u003e4\u003c/sub\u003e generation from various population structures (biparental, multiparental, backcross). The pre-commercial Richardson Seeds materials were coded for privacy and include 19 inbred lines in the F\u003csub\u003e7\u003c/sub\u003e generation or later from biparental populations. All plant materials were grown in Lubbock, TX in 2022. Panicles were mechanically threshed to obtain seeds for analysis. Aliquots (20 g) of bulk seed from 2\u0026ndash;3 panicles per entry were evaluated for % amylose content on a whole-grain, dry weight basis (dwb) using published near-infrared spectroscopy (NIRS) protocols at the USDA-ARS Grain Quality and Structure Research Unit in Manhattan, KS (Peiris et al. \u003cspan citationid=\"CR44\" class=\"CitationRef\"\u003e2021\u003c/span\u003e). NIR data quality was assured by only reporting those values with Mahalanobis distances\u0026thinsp;\u0026lt;\u0026thinsp;4.0 (Whitfield et al. \u003cspan citationid=\"CR59\" class=\"CitationRef\"\u003e1987\u003c/span\u003e). The use of both public and private materials from three independent sorghum breeding programs was a strategy to ensure that new molecular markers are robust and will work across diverse populations and molecular breeding programs.\u003c/p\u003e \u003cp\u003e \u003cem\u003eSequencing the full-length\u003c/em\u003e wx\u003csup\u003ea\u003c/sup\u003e \u003cem\u003einsertion\u003c/em\u003e. Full-length sequence of the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e allele was needed to develop an enhanced assembly inclusive of both sequence and structural variations present at the \u003cem\u003eWaxy\u003c/em\u003e locus of chromosome 10. Due to the repetitive nature of the transposon insertion in \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e genotypes, a contiguous assembly of the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e allele was not possible using PE150 \u0026ldquo;short\u0026rdquo; reads from our WGS data, nor in published WGS data from the Sorghum Association Panel (SAP). Instead, a \u003cem\u003ede novo\u003c/em\u003e full-length sequence of the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e allele of \u0026lsquo;Tx2907\u0026rsquo; (a common trait donor) was generated using long-range PCR, as follows. Genomic DNA was extracted from immature sorghum leaves using the CTAB method (Porebski et al. \u003cspan citationid=\"CR45\" class=\"CitationRef\"\u003e1997\u003c/span\u003e). A Multiskan SkyHigh Microplate Spectrophotometer (Thermo Fisher Scientific, USA) was used to measure the quality and concentration of DNA. Forward (AF: 5\u0026prime;-GGCCTGGATTCAATGTTCTT-3\u0026prime;) and referees (AR: 5\u0026prime;-GCAGCTGGTTGTCCTTGTAG-3\u0026prime;) primers were used with Q5 High-Fidelity 2X Master Mix (NEB, UK) under the following conditions: DNA was denatured at 98\u0026deg;C for 30 sec followed by 33 cycles of 98\u0026deg;C for 10 sec, annealing at 64\u0026deg;C for 30 sec, extension at 72\u0026deg;C for 5 min, and a final extension for 2 min at 72\u0026deg;C. Samples were held at 4\u0026deg;C and stored at -20\u0026deg;C for future use. The amplicon was purified with Monarch PCR \u0026amp; DNA Cleanup Kit (NEB, UK). Cloning was conducted using the Zero Blunt TOPO PCR Cloning Kit. Briefly, the purified PCR product (60\u0026ndash;80 ng) was cloned into the TOPO vector and transformed into competent cells of \u003cem\u003eE. coli\u003c/em\u003e. After 1-h propagating, the \u003cem\u003eE. coli\u003c/em\u003e was streaked onto a 50 \u0026micro;g mL\u003csup\u003e\u0026minus;\u0026thinsp;1\u003c/sup\u003e kanamycin plate and incubated overnight before colony selection and plasmid extraction. Sanger sequencing of the plasmid DNA was conducted by a commercial service provider. The conserved domain of the sequence was annotated at NCBI (\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi\u003c/span\u003e\u003cspan address=\"https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e). These efforts were used to enhance the sorghum reference genome (v3.1.1) to contain the full-length \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e allele on chromosome 10 (referred to hereafter as the \u0026ldquo;enhanced genome assembly\u0026rdquo;). Specifically, an in-house Biopython program was developed to add the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e insertion to the reference genome (v3.1.1) by substituting it and its flanking regions for chr10 positions 1862384 to 1862906.\u003c/p\u003e \u003cp\u003e \u003cem\u003eAllele-specific PACE marker development\u003c/em\u003e. The principle of PACE-PCR is that primers having unique tail sequences target known SNPs and bind to specific alleles (von Maydell \u003cspan citationid=\"CR57\" class=\"CitationRef\"\u003e2023\u003c/span\u003e). Subsequently, during amplification, allele-specific fluorescent tags are added and detected spectrophotometrically so that different alleles can be identified (von Maydell \u003cspan citationid=\"CR57\" class=\"CitationRef\"\u003e2023\u003c/span\u003e). In this study, primers were designed with the tails FAM (5\u0026prime;-GAAGGTGACCAAGTTCATGCT-3\u0026prime;) and HEX (5\u0026prime;-GAAGGTCGGAGTCAACGGATT-3\u0026prime;). The detailed sequences of each marker are shown in Table \u003cspan refid=\"MOESM1\" class=\"InternalRef\"\u003eS1\u003c/span\u003e.\u003c/p\u003e \u003cp\u003e \u003cem\u003eValidation of the PACE-PCR markers.\u003c/em\u003e The \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e allele was validated using fragment length polymorphisms amplified in PCR and visualized in gel electrophoresis. The primers and PCR conditions were the same as those above for amplifying the full-length \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e insertion, despite differences in fragment lengths. The \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e allele was validated with Sanger sequencing using the forward primer BF (5\u0026prime;-CCTCTGTCATGCTACCTCAAG-3\u0026prime;) and the reverse primer BR (5\u0026prime;-CATTCATTGCATACCCGTCG-3\u0026prime;) in the same PCR conditions as above. Fragments were separated in a 1% agarose gel. Purified PCR products were used for Sanger sequencing validation.\u003c/p\u003e \u003cp\u003ePACE-PCR genotyping was conducted on all plant materials following manufacturer instructions (3CR Bioscience, Harlow, United Kingdom) of a CFX96 Touch Deep Well Real-Time PCR Detection System (BIO-RAD, USA). Specifically, 0.15 \u0026micro;L of the assay mix (12 \u0026micro;M for each allele-specific forward primer and 30 \u0026micro;M for a single common reverse primer in the final PCR reaction), 1 \u0026micro;L gDNA (100\u0026ndash;200 ng), 3.85 \u0026micro;L ultra-pure distilled water, and 5 \u0026micro;L PACE\u0026trade; 2.0 Genotyping Master Mix (3CR, Bioscience) were added to a total reaction volume of 10 \u0026micro;L. DNA was denatured at 94\u0026deg;C, then a touch down (65\u0026thinsp;\u0026minus;\u0026thinsp;57\u0026deg;C) step was performed for 10 cycles for primer annealing and initial extension. An additional 30 cycles were used for further extension to generate more tail-specific sequences. Fluorescence was visualized and digitalized for automated analysis with Bio-Rad CFX Maestro software.\u003c/p\u003e \u003cp\u003e \u003cem\u003eWhole-genome sequencing (WGS)\u003c/em\u003e. WGS data was obtained to generate whole-genome coverage at greater depth and coverage than GBS in selected plant materials to enhance our ability to assess read mapping in \u003cem\u003eWx\u003c/em\u003e versus \u003cem\u003ewx\u003c/em\u003e\u003csub\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sub\u003e materials using the enhanced genome assembly. These materials included the grain sorghum parent lines \u0026lsquo;KS115\u0026rsquo; (\u003cem\u003eWx\u003c/em\u003e) and \u0026lsquo;R.Tx2907\u0026rsquo; (\u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e). PE150 libraries were prepared for Illumina sequencing of gDNA (24 Gb/sample, ~\u0026thinsp;30x coverage). To evaluate the quality of the raw reads, FastQC (\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://www.bioinformatics.babraham.ac.uk/projects/fastqc/\u003c/span\u003e\u003cspan address=\"https://www.bioinformatics.babraham.ac.uk/projects/fastqc/\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e) (Andrews \u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e2010\u003c/span\u003e) was utilized. Trimming was conducted by removing nucleotide calls with Phred scores below 20 (Q20) from the 3' ends of the reads. Subsequently, high-quality reads were selected for alignments using Bowtie2 (Langmead and Salzberg \u003cspan citationid=\"CR30\" class=\"CitationRef\"\u003e2012\u003c/span\u003e) with default parameters. Whole-genome sequence of R.Tx2907 was retrieved from publicly available source (Boatwright et al. \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e2022\u003c/span\u003e). SAM/BAM alignments obtained were analyzed using the SAMtools (Li et al. \u003cspan citationid=\"CR31\" class=\"CitationRef\"\u003e2009\u003c/span\u003e) and BAMtools (Barnett et al. \u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e2011\u003c/span\u003e) software suites. The aligned reads mapped to the \u003cem\u003eWaxy\u003c/em\u003e locus were visualized and compared using the IGV tool (Thorvaldsd\u0026oacute;ttir et al. \u003cspan citationid=\"CR56\" class=\"CitationRef\"\u003e2013\u003c/span\u003e).\u003c/p\u003e"},{"header":"Results","content":"\u003cp\u003e \u003cem\u003eThe full-length\u003c/em\u003e wx\u003csup\u003ea\u003c/sup\u003e \u003cem\u003einsertion\u003c/em\u003e. The causal mutation of the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e allele is a 5,174 bp insertion containing RNase H (RNH), Reverse Transcriptase (RT), gap-pre-integrase, zf (Zinc knuckle) and Transposase domains; a long terminal repeat (LTR) region, and a GAG (gag-polypeptide) domain (Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003e). It is a large retrotransposon derivative (LARD) element that belongs to LTR retrotransposons (Kalendar et al. \u003cspan citationid=\"CR25\" class=\"CitationRef\"\u003e2004\u003c/span\u003e). LARD elements are nonautonomous retrotransposons, relying on other autonomous TEs to transpose. LARDs consist of non-coding internal domains and LTR (Havecker et al. \u003cspan citationid=\"CR20\" class=\"CitationRef\"\u003e2004\u003c/span\u003e). Within the LARD, the LTR region is a pair of identical 422 bp sequences at each end of the insertion, which serve as regulatory elements for GAG and POL domains needed for gene expression (Finnegan et al., 2012). RNH protein degrades the 5\u0026rsquo; LTR to transcribe RNA and promotes priming of the 3\u0026rsquo;LTR for RT to synthesize a new cDNA molecule. A target region in the genome is cut by integrase and the newly synthesized cDNA is inserted. The GAG domain contains coding sequences for two structural proteins which are believed to play an important, but as-yet undetermined role (Chaparro et al. \u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e2015\u003c/span\u003e).\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003e \u003cem\u003ePACE-PCR markers\u003c/em\u003e.\u003c/p\u003e \u003cp\u003eThe initial PACE-PCR markers provided excellent discrimination of \u003cem\u003eWx\u003c/em\u003e, \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e, and \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e alleles in homozygous genotypes (Figs.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eA and \u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eC). Primer pairs AF/AR, used to discriminate between B.Tx623 (\u003cem\u003eWx\u003c/em\u003e) and R.Tx2907 (\u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e), amplified bands of 523 bp and 5,702 bp, respectively (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eB). Sanger sequencing confirmed the presence of the G-T SNP in \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e genotypes (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eD).\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eTo validate the performance of the markers in breeding, genotyping was conducted blind (not in the breeders\u0026rsquo; labs), without \u003cem\u003ea priori\u003c/em\u003e knowledge of pedigrees or \u003cem\u003eWaxy\u003c/em\u003e allele(s) in each line. The markers were used to screen a total of 258 genotypes in the UNR, USDA, and Richardson Seeds breeding programs, including individuals that were homozygous and heterozygous for various combinations of \u003cem\u003eWaxy\u003c/em\u003e alleles (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003e).\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eTo test whether the genotyping results matched the amylose contents expected by the breeders, we compared the \u003cem\u003eWaxy\u003c/em\u003e alleles of plants (determined from young leaf tissue) to the amylose content of mature grain (Table \u003cspan refid=\"MOESM2\" class=\"InternalRef\"\u003eS2\u003c/span\u003e). The amylose contents of the UNR and Richardson Seeds programs largely agreed with the genotyping results. Out of 98 USDA-ARS lines genotyped, ten were still segregating for different \u003cem\u003eWaxy\u003c/em\u003e alleles (heterozygous) and eliminated from the analysis. Twenty-four out of 26 expected homozygous lines were determined to be homozygous for the correct allele (either \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e or \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e) and had grain amylose contents below 5%, which is the threshold typically considered to define the waxy phenotype. However, two of those 26 lines were not in fact homozygous and were still segregating for wild-type or heterowaxy (\u003cem\u003eWx\u003c/em\u003e/\u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e or \u003cem\u003eWx\u003c/em\u003e/\u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e) plants that were not genotyped but were harvested and included in the grain quality analysis. Alternatively, some homozygous waxy plants could have been adventitiously cross-pollinated by wild-type pollen despite every attempt to cover them prior to flowering. Nevertheless, the average amylose content of the USDA-ARS waxy lines was 2.12%. Four heterowaxy lines had amylose contents between 10.6% and 17.3%, and 58 of 60 wild-type lines had amylose contents over 15%, consistent with the recessive nature of \u003cem\u003ewx\u003c/em\u003e alleles and their dose response in the triploid endosperm (Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003e).\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eOur results demonstrate the importance of high-throughput molecular markers when breeding with waxy materials, as the phenotype is not visible in the grain and adventitious outcrossing with wild-type pollen carrying dominant alleles can easily contaminate breeding stocks after genotyping is conducted early in the season to raise amylose content through xenia (Karper \u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e1933\u003c/span\u003e, Gorbet and Weibel \u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e1972\u003c/span\u003e). Overall, the PACE genotypes were consistent with expected grain amylose contents and were able to rapidly identify off-types. They are therefore suitable for detecting adventitious outcrossing (through analysis of harvested grain or leaf tissues in the following generation) as a routine quality control measure leading up to commercialization. As the parental lines are from different breeding programs using different genetic backgrounds, the results indicate that these markers can be broadly used in waxy sorghum breeding.\u003c/p\u003e \u003cp\u003e \u003cem\u003eRead-mapping at the\u003c/em\u003e Waxy \u003cem\u003elocus\u003c/em\u003e. The presence of the large insertion in \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e genotypes results in broken reads when flanking sequences that partially cover both wild-type and insertion sequences fail to align to the wild-type reference genome. This limits the efficiency of local short read mapping during molecular breeding and makes linkage disequilibrium analyses more difficult. The enhanced genome assembly was able to overcome this challenge by providing reference sequences for read mapping across the full-length insertion. When WGS data for the inbred lines KS115 (\u003cem\u003eWx\u003c/em\u003e) and R.Tx2907 (\u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e) were mapped to the wild-type (B.Tx623, v3.1.1) and enhanced reference genome (containing the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e insertion), the same number of reads were detected between Chr. 10: 1,860,965-1,865,278 for KS115, but an additional 78 uniquely mapped reads covering the insertion were mapped for R.Tx2907 (Fig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003e).\u003c/p\u003e \u003cp\u003e \u003c/p\u003e"},{"header":"Discussion","content":"\u003cp\u003e \u003cem\u003eBreeding for grain quality traits of sorghum\u003c/em\u003e. Historic breeding efforts in U.S. sorghum have focused on grain yield more than quality because phenotyping for quality is typically more time-consuming and expensive; and because producers were paid primarily for yield. However, advances in high-throughput phenotyping and genotyping methods have made it easier than ever to phenotype for quality traits and their interactions with whole-plant (grain\u0026thinsp;+\u0026thinsp;biomass) productivity and per-area yield. There is also a small but rapidly growing world-wide market for grain sorghum with grain quality traits, like low amylose content, so breeding programs are beginning to address them.\u003c/p\u003e \u003cp\u003eThe global value of the gluten-free products market was estimated at USD 6.3\u0026nbsp;billion in 2022 and projected to reach USD 11.8\u0026nbsp;billion by 2030 at a compound annual growth rate of 9.5%, based on a Vantage Market Research report. Highlights of the same report noted that a considerable amount of the increase in value could be attributed to more health-conscious consumers who were interested in methods to extend the shelf life of baked goods, especially breads, without the addition of preservatives \u0026ndash; a known property of waxy sorghum flour. Waxy sorghum grain has been studied for many years in the food sciences is a high-value endosperm trait associated with high digestibility by animals (Sherrod et al. \u003cspan citationid=\"CR50\" class=\"CitationRef\"\u003e1969\u003c/span\u003e, Walker and Lichtenw.Re 1974, Myer and Gorbet \u003cspan citationid=\"CR40\" class=\"CitationRef\"\u003e1985\u003c/span\u003e, Schroeder et al. \u003cspan citationid=\"CR49\" class=\"CitationRef\"\u003e1998\u003c/span\u003e), rapid fermentation into ethanol (Yan et al. \u003cspan citationid=\"CR66\" class=\"CitationRef\"\u003e2011\u003c/span\u003e, Wu et al. \u003cspan citationid=\"CR61\" class=\"CitationRef\"\u003e2013\u003c/span\u003e), the production of alcoholic beverages (Hsieh and Pi \u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e1979\u003c/span\u003e, Mugode et al. \u003cspan citationid=\"CR39\" class=\"CitationRef\"\u003e2011\u003c/span\u003e, Mezgebe et al. \u003cspan citationid=\"CR38\" class=\"CitationRef\"\u003e2018\u003c/span\u003e), and improved baking properties (Sang et al. \u003cspan citationid=\"CR47\" class=\"CitationRef\"\u003e2008\u003c/span\u003e, Elhassan et al. \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e2015\u003c/span\u003e) due to high-amylopectin starch content. While the waxy trait has been identified since the 1930s (Karper \u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e1933\u003c/span\u003e), recent hybrids are available with no yield drag or poor agronomic performance compared to near-isogenic or closely-related wild-type hybrids (Jampala et al. \u003cspan citationid=\"CR22\" class=\"CitationRef\"\u003e2012\u003c/span\u003e, Yerka et al. \u003cspan citationid=\"CR70\" class=\"CitationRef\"\u003e2016\u003c/span\u003e). It has also been linked to lower resistant starch content, an important factor in food functionality that must be balanced with diet because, when consumed in excess, it could result in unhealthy gut microbiomes, based on human stool microbiomes and mouse feeding studies (Yang et al. \u003cspan citationid=\"CR67\" class=\"CitationRef\"\u003e2023\u003c/span\u003e). This is because the increased digestibility of waxy starches can raise the glycemic index (Frei et al. \u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e2003\u003c/span\u003e, Fitzgerald et al. \u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e2011\u003c/span\u003e).\u003c/p\u003e \u003cp\u003eSorghum breeders are interested in combining waxy grain with additional target traits such as larger seed size, improved protein content and digestibility, improved germination, sugarcane aphid resistance, earlier maturity, etc. to improve their commercial value, expand production into additional regions, and help open underexplored markets where value-added, climate-smart crops are urgently needed. In particular, the western U.S. may be ideal for the development of new grain quality traits due to minimal precipitation throughout the growing season and harvest period, which can reduce grain quality in more central and eastern regions of the country. Developing waxy sorghum hybrids with improved trait combinations for deployment in new areas will be most efficiently achieved using advanced molecular breeding techniques. The PACE-PCR markers and updated chromosome 10 assembly presented herein provide a solid foundation for those efforts.\u003c/p\u003e \u003cp\u003e \u003cem\u003eMolecular breeding resources for adapting sorghum yield and quality to local environments\u003c/em\u003e. Genome-to-phenome (G2P) modeling combines physiological information about plant developmental processes with genomic, agronomic, and environomic data. These methods, which often use artificial intelligence (AI) to optimize the predictive capacity of genetic models for improved gene discovery, have emerged as the most effective method for clarifying relationships between inputs and outputs in cropping systems (Harfouche et al. \u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e2019\u003c/span\u003e, Xu et al. \u003cspan citationid=\"CR64\" class=\"CitationRef\"\u003e2022\u003c/span\u003e). Hence, to secure both the abundance and quality of food systems, machine learning will become increasingly important to bring phenotyping for quality traits up to the speed and efficiency of phenotyping for yield for rapid improvements to the sustainable production of microscale, biochemical traits (Yan and Wang \u003cspan citationid=\"CR65\" class=\"CitationRef\"\u003e2023\u003c/span\u003e), like amylose or amylopectin content in climate-smart species like sorghum. The current study reports functional markers for waxy grain, which is not amenable to high-throughput phenotyping in field environments prior to the time of flowering. However, genotyping young plants with NGS markers in genomic selection is becoming standard in many companies, so the reported markers will provide immediate improvements to breeding efficiency by shortening the commercial breeding cycle. They will also assist with ongoing quality control by identifying progeny that are outcrosses derived from adventitious cross-pollination of maternal plants in the previous generation.\u003c/p\u003e \u003cp\u003eThe improved read mapping achieved in \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e genotypes using the enhanced genome assembly is consistent with what other researchers have demonstrated when incorporating presence-absence and copy number variation into the reference genomes used for making marker-trait associations (Bayer et al. \u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e2020\u003c/span\u003e, Jayakodi et al. \u003cspan citationid=\"CR23\" class=\"CitationRef\"\u003e2020\u003c/span\u003e, Tao et al. \u003cspan citationid=\"CR53\" class=\"CitationRef\"\u003e2021\u003c/span\u003e, Li et al. \u003cspan citationid=\"CR32\" class=\"CitationRef\"\u003e2022\u003c/span\u003e, Sun et al. \u003cspan citationid=\"CR51\" class=\"CitationRef\"\u003e2022\u003c/span\u003e). Future work in the sorghum grain quality space should focus on creating pangenomes that account for even more quality traits not expressed in existing reference genomes. Breeding for human food and animal feed end uses will depend upon identifying functional markers for micro- and macronutrient contents (e.g. starch, protein, oil, amylose, lysine, beta carotene, zinc, iron, etc.), percent vitreosity, kernel hardness, resistant starch content, fiber, seed size, and the interactions of diverse grain physiochemistry profiles with grain weathering and molding, as no in-field, high-throughput phenotyping platform currently exists for them. The ongoing reliance of most breeding programs on laboratory-based quality testing lengthens the breeding cycle when grain must first be harvested and analyzed before superior genotypes can be identified and advanced to the next generation. In addition, the interaction between functional grain qualities and production environments can be impacted by presence-absence variation that plays a role in other aspects of plant physiology and adaptation, such as loci controlling plant height, maturity, and thermal or photoperiod sensitivity. As reference genomes improve, plant breeders will be increasingly able to develop plant populations with improved combinations of yield and quality traits that express well in target environments.\u003c/p\u003e"},{"header":"Conclusions","content":"\u003cp\u003eThe waxy trait is one of the first specialty grain quality traits of U.S. sorghum to be commercially produced. The validated markers in our study will be beneficial for various practical applications, such as germplasm characterization, allele mining, and marker assisted selection during backcrossing and forward breeding programs to improve sorghum food, feed, and fuel applications. We have built upon previous efforts to genetically characterize the \u003cem\u003eWaxy\u003c/em\u003e locus by developing an enhanced chromosome 10 assembly for read mapping and functional NGS markers. These new resources provide an updated molecular toolkit for sorghum breeding that was validated in three public and private programs. This toolkit will improve the efficiency of breeding waxy sorghum using genomic selection and genome-to-phenome methodologies for rapid functional trait adaptation to target production environments.\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eData Availability\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eAll datasets generated during and/or analyzed during the current study are available in the supplementary files associated with this manuscript.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eSTATEMENTS AND DECLARATIONS\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eFunding.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eFunding for the development and analysis of the UNR breeding lines was provided by a United States Department of Agriculture Multi-State Hatch Award (NEV00763) and a United States Department of Agriculture Award (2019-67014-29174) to MKY. Funding for PACE-PCR assays, genomic analyses, and field studies at Texas Tech University (TTU) was provided by new faculty start-up funding to YJ. ZL was supported by the Graduate Support Stipend fellowship from the TTU Davis college of Agricultural Science and Natural Resources obtained by YJ and GP. Funding for the development of the USDA experimental breeding lines was provided by United States Department of Agriculture, Agricultural Research Service Project Plan 3096-21000-024-000-D. Richardson Seeds funded the development of their own experimental breeding lines and provided in-kind funding to MKY to support field studies conducted in Vega, TX during the development of the UNR breeding lines.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eCompeting Interests.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe authors have no relevant financial or non-financial interests to disclose.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAuthor Contributions\u003c/strong\u003e\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eMY and ZL are co-first authors. MY and YJ are co-corresponding authors who designed the studies and analyzed data. MY, YJ, and GP obtained funding to conduct the work. ZL and DN developed the PACE-PCR markers and analyzed data. MY developed the UNR plant materials and led field studies using the UNR and Richardson Seeds plant materials with the assistance of YJ and ZL in Lubbock, TX. CH developed and conducted field studies with the USDA sorghum materials. DD, GK, SM, and GC developed the pre-commercial materials and analyzed data. SB conducted grain quality analyses. All authors read and approved the final manuscript.\u0026nbsp;\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\u003cli\u003e\u003cspan\u003eSorghum \u003cem\u003ebicolor\u003c/em\u003e BTx642 v1.1, DOE-JGI, \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttp://phytozome.jgi.doe.gov/\u003c/span\u003e\u003cspan address=\"http://phytozome.jgi.doe.gov/\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSorghum \u003cem\u003ebicolor\u003c/em\u003e RTx430 v2.1, DOE-JGI, \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttp://phytozome.jgi.doe.gov/\u003c/span\u003e\u003cspan address=\"http://phytozome.jgi.doe.gov/\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSorghum \u003cem\u003ebicolor\u003c/em\u003e SC187 v1.1, DOE-JGI, \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttp://phytozome-next.jgi.doe.gov/info/SbicolorSC187_v1_1\u003c/span\u003e\u003cspan address=\"http://phytozome-next.jgi.doe.gov/info/SbicolorSC187_v1_1\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSorghum \u003cem\u003ebicolor\u003c/em\u003e v5.1 DOE-JGI, \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://phytozome.jgi.doe.gov/info/Sbicolor_v5_1\u003c/span\u003e\u003cspan address=\"https://phytozome.jgi.doe.gov/info/Sbicolor_v5_1\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003e\u003cem\u003eSorghum bicolor\u003c/em\u003e Wray v1.1 DOE-JGI, \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://phytozome.jgi.doe.gov/info/SbicolorWray_v1_1\u003c/span\u003e\u003cspan address=\"https://phytozome.jgi.doe.gov/info/SbicolorWray_v1_1\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eAndrews S (2010) FastQC: a quality control tool for high throughput sequence data. Babraham Bioinformatics, Babraham Institute, Cambridge, United Kingdom\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBarnett DW, Garrison EK, Quinlan AR, Str\u0026ouml;mberg MP, Marth GT (2011) BamTools: a C\u0026thinsp;+\u0026thinsp;+\u0026thinsp;API and toolkit for analyzing and managing BAM files. Bioinformatics 27(12):1691\u0026ndash;1692. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1093/bioinformatics/btr174\u003c/span\u003e\u003cspan address=\"10.1093/bioinformatics/btr174\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBayer PE, Golicz AA, Scheben A, Batley J, Edwards D (2020) Plant pan-genomes are the new reference. Nat plants 6(8):914\u0026ndash;920. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1038/s41477-020-0733-0\u003c/span\u003e\u003cspan address=\"10.1038/s41477-020-0733-0\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBoatwright JL, Sapkota S, Jin HY, Schnable JC, Brenton Z, Boyles R, Kresovich S (2022) Sorghum Association Panel whole-genome sequencing establishes cornerstone resource for dissecting genomic diversity. Plant J 111(3):888\u0026ndash;904. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1111/tpj.15853\u003c/span\u003e\u003cspan address=\"10.1111/tpj.15853\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBoyles RE, Brenton ZW, Kresovich S (2019) Genetic and genomic resources of sorghum to connect genotype with phenotype in contrasting environments. Plant J 97(1):19\u0026ndash;39. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1111/tpj.14113\u003c/span\u003e\u003cspan address=\"10.1111/tpj.14113\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBrenton ZW, Cooper EA, Myers MT, Boyles RE, Shakoor N, Zielinski KJ, Rauh BL, Bridges WC, Morris GP, Kresovich S (2016) A genomic resource for the development, improvement, and exploitation of sorghum for bioenergy. Genetics 204:21\u0026ndash;33. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1534/genetics.115.183947\u003c/span\u003e\u003cspan address=\"10.1534/genetics.115.183947\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eChaparro C, Gayraud T, de Souza RF, Domingues DS, Akaffou S, Vanzela ALL, de Kochko A, Rigoreau M, Crouzillat D, Hamon S, Hamon P, Guyot R (2015) Terminal-Repeat Retrotransposons with GAG Domain in Plant Genomes: A New Testimony on the Complex World of Transposable Elements. Genome Biol Evol 7(2):493\u0026ndash;504. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1093/gbe/evv001\u003c/span\u003e\u003cspan address=\"10.1093/gbe/evv001\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCooper EA, Brenton ZW, Flinn BS, Jenkins J, Shu SQ, Flowers D, Luo F, Wang YS, Xia P, Barry K, Daum C, Lipzen A, Yoshinaga Y, Schmutz J, Saski C, Vermerris W, Kresovich S (2019) A new reference genome for \u003cem\u003eSorghum bicolor\u003c/em\u003e reveals high levels of sequence similarity between sweet and grain genotypes: implications for the genetics of sugar metabolism. BMC Genomics 20:420. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1186/s12864-019-5734-x\u003c/span\u003e\u003cspan address=\"10.1186/s12864-019-5734-x\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eDeschamps S, Zhang Y, Llaca V, Ye L, Sanyal A, King M, May G, Lin HN (2018) A chromosome-scale assembly of the sorghum genome using nanopore sequencing and optical mapping. Nat Commun 9:4844. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1038/s41467-018-07271-1\u003c/span\u003e\u003cspan address=\"10.1038/s41467-018-07271-1\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eElhassan MSM, Emmambux MN, Hays DB, Peterson GC, Taylor JRN (2015) Novel biofortified sorghum lines with combined waxy (high amylopectin) starch and high protein digestibility traits: Effects on endosperm and flour properties. J Cereal Sci 65:132\u0026ndash;139. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.jcs.2015.06.017\u003c/span\u003e\u003cspan address=\"10.1016/j.jcs.2015.06.017\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eFitzgerald MA, Rahman S, Resurreccion AP, Concepcion J, Daygon VD, Dipti SS, Kabir KA, Klingner B, Morell MK, Bird AR (2011) Identification of a Major Genetic Determinant of Glycaemic Index in Rice. Rice 4(2):66\u0026ndash;74. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1007/s12284-011-9073-z\u003c/span\u003e\u003cspan address=\"10.1007/s12284-011-9073-z\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eFrei M, Siddhuraju P, Becker K (2003) Studies on the in vitro starch digestibility and the glycemic index of six different indigenous rice cultivars from the Philippines. Food Chem 83(3):395\u0026ndash;402. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/S0308-8146(03)00101-8\u003c/span\u003e\u003cspan address=\"10.1016/S0308-8146(03)00101-8\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGorbet D, Weibel D (1972) Inheritance and genetic relationships of six endosperm types in sorghum. Crop Sci 12(3):378\u0026ndash;382. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.2135/cropsci1972.0011183X001200030037x\u003c/span\u003e\u003cspan address=\"10.2135/cropsci1972.0011183X001200030037x\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHarfouche AL, Jacobson DA, Kainer D, Romero JC, Harfouche AH, Mugnozza GS, Moshelion M, Tuskan GA, Keurentjes JJ, Altman A (2019) Accelerating climate resilient plant breeding by applying next-generation artificial intelligence. Trends Biotechnol 37(11):1217\u0026ndash;1235. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.tibtech.2019.05.007\u003c/span\u003e\u003cspan address=\"10.1016/j.tibtech.2019.05.007\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHavecker ER, Gao X, Voytas DF (2004) The diversity of LTR retrotransposons. Genome Biol 5(6):1\u0026ndash;6. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1186/gb-2004-5-6-225\u003c/span\u003e\u003cspan address=\"10.1186/gb-2004-5-6-225\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHsieh JS, Pi CP (1979) Effect of the waxy gene on grain quality and wine production in sorghum. J Agricultural Association China (106): 30\u0026ndash;51\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eJampala B, Rooney WL, Peterson GC, Bean S, Hays DB (2012) Estimating the relative effects of the endosperm traits of waxy and high protein digestibility on yield in grain sorghum. Field Crops Res 139:57\u0026ndash;62. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.fcr.2012.09.021\u003c/span\u003e\u003cspan address=\"10.1016/j.fcr.2012.09.021\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eJayakodi M, Padmarasu S, Haberer G, Bonthala VS, Gundlach H, Monat C, Lux T, Kamal N, Lang D, Himmelbach A, Ens J, Zhang X-Q, Angessa TT, Zhou G, Tan C, Hill C, Wang P, Schreiber M, Boston LB, Plott C, Jenkins J, Guo Y, Fiebig A, Budak H, Xu D, Zhang J, Wang C, Grimwood J, Schmutz J, Guo G, Zhang G, Mochida K, Hirayama T, Sato K, Chalmers KJ, Langridge P, Waugh R, Pozniak CJ, Scholz U, Mayer KFX, Spannagl M, Li C, Mascher M, Stein N (2020) The barley pan-genome reveals the hidden legacy of mutation breeding. Nature 588(7837):284\u0026ndash;289. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1038/s41586-020-2947-8\u003c/span\u003e\u003cspan address=\"10.1038/s41586-020-2947-8\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eJin SK, Xu LN, Leng YJ, Zhang MQ, Yang QQ, Wang SL, Jia SW, Song T, Wang RA, Tao T, Liu QQ, Cai XL, Gao JP (2023) The OsNAC24-OsNAP protein complex activates \u003cem\u003eOsGBSSI\u003c/em\u003e and \u003cem\u003eOsSBEI\u003c/em\u003e expression to fine-tune starch biosynthesis in rice endosperm. Plant Biotechnol J 21(11):2224\u0026ndash;2240. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1111/pbi.14124\u003c/span\u003e\u003cspan address=\"10.1111/pbi.14124\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKalendar R, Vicient CM, Peleg O, Anamthawat-Jonsson K, Bolshoy A, Schulman AH (2004) Large retrotransposon derivatives: abundant, conserved but nonautonomous retroelements of barley and related genomes. Genetics 166(3):1437\u0026ndash;1450. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1534/genetics.166.3.1437\u003c/span\u003e\u003cspan address=\"10.1534/genetics.166.3.1437\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKang XM, Gao W, Cui B, Abd El-Aty AM (2023) Structure and genetic regulation of starch formation in sorghum (\u003cem\u003eSorghum bicolor\u003c/em\u003e (L.) Moench) endosperm: A review. Int J Biol Macromol 239:124315. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.ijbiomac.2023.124315\u003c/span\u003e\u003cspan address=\"10.1016/j.ijbiomac.2023.124315\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKarper R (1933) Inheritance of waxy endosperm in sorghum. J Hered 24(6):257\u0026ndash;262. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1093/oxfordjournals.jhered.a103794\u003c/span\u003e\u003cspan address=\"10.1093/oxfordjournals.jhered.a103794\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKawahigashi H, Oshima M, Nishikawa T, Okuizumi H, Kasuga S, Yonemaru J (2013) A novel \u003cem\u003ewaxy\u003c/em\u003e allele in sorghum landraces in East Asia. Plant Breeding 132(3):305\u0026ndash;310. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1111/pbr.12054\u003c/span\u003e\u003cspan address=\"10.1111/pbr.12054\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eKumar N, Brenton Z, Myers MT, Boyles RE, Sapkota S, Boatwright JL, Cox WA K. E. Jordan and S. Kresovich Registration of the sorghum carbon-partitioning nested association mapping (CP-NAM) population. J Plant Registrations. 15(3):656\u0026ndash;663. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1002/plr2.20229\u003c/span\u003e\u003cspan address=\"10.1002/plr2.20229\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLangmead B, Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nat Methods 9(4):357\u0026ndash;359. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1038/nmeth.1923\u003c/span\u003e\u003cspan address=\"10.1038/nmeth.1923\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLi H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, R. Durbin and G. P. D. P., Subgroup (2009) The sequence alignment/map format and SAMtools. Bioinformatics 25(16): 2078\u0026ndash;2079. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1093/bioinformatics/btp352\u003c/span\u003e\u003cspan address=\"10.1093/bioinformatics/btp352\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLi H, Wang S, Chai S, Yang Z, Zhang Q, Xin H, Xu Y, Lin S, Chen X, Yao Z (2022) Graph-based pan-genome reveals structural and sequence variations related to agronomic traits and domestication in cucumber. Nature Communications 13(1): 682. 682 (2022). \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1038/s41467-022-28362-0\u003c/span\u003e\u003cspan address=\"10.1038/s41467-022-28362-0\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLu YG, Zhao GL, Li Y, Fan J, Ding GX, Zhao JQ, Ni XL, Xu YJ, Wang WM (2013) Identification of two novel \u003cem\u003ewaxy\u003c/em\u003e alleles and development of their molecular markers in sorghum. Genome 56(5):283\u0026ndash;288\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003e\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1139/gen-2013-0047\u003c/span\u003e\u003cspan address=\"10.1139/gen-2013-0047\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMace ES, Cruickshank AW, Tao YF, Hunt CH, Jordan DR (2021) A global resource for exploring and exploiting genetic variation in sorghum crop wild relatives. Crop Sci 61(1):150\u0026ndash;162. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1002/csc2.20332\u003c/span\u003e\u003cspan address=\"10.1002/csc2.20332\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMace ES, Tai S, Gilding EK, Li Y, Prentis PJ, Bian L, Campbell BC, Hu W, Innes DJ, Han X, Cruickshank A, Dai C, Frere C, Zhang H, Hunt CH, Wang X, Shatte T, Wang M, Su Z, Li J, Lin X, Godwin ID, Jordan DR, Wang J (2013) Whole-genome sequencing reveals untapped genetic potential in Africa's indigenous cereal crop sorghum. Nat Commun 4:2320\u0026ndash;2329. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1038/ncomms3320\u003c/span\u003e\u003cspan address=\"10.1038/ncomms3320\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMcCormick RF, Truong SK, Sreedasyam A, Jenkins J, Shu SQ, Sims D, Kennedy M, Amirebrahimi M, Weers BD, McKinley B, Mattison A, Morishige DT, Grimwood J, Schmutz J, Mullet JE (2018) The \u003cem\u003eSorghum bicolor\u003c/em\u003e reference genome: improved assembly, gene annotations, a transcriptome atlas, and signatures of genome organization. Plant J 93(2):338\u0026ndash;354. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1111/tpj.13781\u003c/span\u003e\u003cspan address=\"10.1111/tpj.13781\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMezgebe AG, Abegaz K, Taylor JRN (2018) Relationship between waxy (high amylopectin) and high protein digestibility traits in sorghum and malting quality. J Cereal Sci 79:319\u0026ndash;327. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.jcs.2017.11.015\u003c/span\u003e\u003cspan address=\"10.1016/j.jcs.2017.11.015\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMugode L, Portillo OR, Hays DB, Rooney LW, Taylor JRN (2011) Influence of high protein digestibility sorghums on free amino nitrogen (FAN) production during malting and mashing. J Inst Brew 117:422\u0026ndash;426. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1002/j.2050-0416.2011.tb00488.x\u003c/span\u003e\u003cspan address=\"10.1002/j.2050-0416.2011.tb00488.x\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMyer RO, Gorbet DW (1985) Waxy and normal grain sorghums with varying tannin contents in diets for young pigs. Anim Feed Sci Technol 12(3):179\u0026ndash;186. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/0377-8401(85)90011-2\u003c/span\u003e\u003cspan address=\"10.1016/0377-8401(85)90011-2\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eNakamura Y (2023) A model for the reproduction of amylopectin cluster by coordinated actions of starch branching enzyme isoforms. Plant Mol Biol 112:199\u0026ndash;212. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1007/s11103-023-01352-6\u003c/span\u003e\u003cspan address=\"10.1007/s11103-023-01352-6\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eOkpala NE, Aloryi KD, An TY, He LX, Tang XR (2022) The roles of starch branching enzymes and starch synthase in the biosynthesis of amylose in rice. J Cereal Sci 104:103393. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.jcs.2021.103393\u003c/span\u003e\u003cspan address=\"10.1016/j.jcs.2021.103393\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePedersen JF, Bean SR, Graybosch RA, Park SH, Tilley M (2005) Characterization of waxy grain sorghum lines in relation to granule-bound starch synthase. Euphytica 144(1\u0026ndash;2):151\u0026ndash;156. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1007/s10681-005-5298-5\u003c/span\u003e\u003cspan address=\"10.1007/s10681-005-5298-5\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePeiris KHS, Wu XR, Bean SR, Perez-Fajardo M, Hayes C, Yerka MK, Jagadish SVK, Ostmeyer T, Aramouni FM, Tesso T, Perumal R, Rooney WL, Kent MA, Bean B (2021) Near infrared spectroscopic evaluation of starch properties of diverse sorghum populations. Processes 9(11). \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.3390/pr9111942\u003c/span\u003e\u003cspan address=\"10.3390/pr9111942\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePorebski S, Bailey LG, Baum BR (1997) Modification of a CTAB DNA extraction protocol for plants containing high polysaccharide and polyphenol components. Plant Mol Biology Report 15(1):8\u0026ndash;15. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1007/BF02772108\u003c/span\u003e\u003cspan address=\"10.1007/BF02772108\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eRuperao P, Thirunavukkarasu N, Gandham P, Selvanayagam S, Govindaraj M, Nebie B, Manyasa E, Gupta R, Das RR, Odeny DA, Gandhi H, Edwards D, Deshpande SP, Rathore A (2021) Sorghum pan-genome explores the functional utility for genomic-assisted breeding to accelerate the genetic gain. Front Plant Sci 12:666342. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.3389/fpls.2021.666342\u003c/span\u003e\u003cspan address=\"10.3389/fpls.2021.666342\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSang YJ, Bean S, Seib PA, Pedersen J, Shi YC (2008) Structure and functional properties of sorghum starches differing in amylose content. J Agric Food Chem 56(15):6680\u0026ndash;6685. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1021/jf800577x\u003c/span\u003e\u003cspan address=\"10.1021/jf800577x\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSattler SE, Singh J, Haas EJ, Guo LN, Sarath G, Pedersen JF (2009) Two distinct waxy alleles impact the granule-bound starch synthase in sorghum. Mol Breeding 24(4):349\u0026ndash;359. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1007/s11032-009-9296-5\u003c/span\u003e\u003cspan address=\"10.1007/s11032-009-9296-5\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSchroeder JW, Marx GD, Park CS (1998) Waxy corn as a replacement for dent corn for lactating dairy cows. Anim Feed Sci Technol 72(1\u0026ndash;2):111\u0026ndash;120. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/S0377-8401(97)00171-5\u003c/span\u003e\u003cspan address=\"10.1016/S0377-8401(97)00171-5\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSherrod LB, Albin RC, Furr RD (1969) Net energy of regular and waxy sorghum grains for finishing steers. J Anim Sci 29(6):997. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.2527/jas1969.296997x\u003c/span\u003e\u003cspan address=\"10.2527/jas1969.296997x\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSun Y, Wang J, Li Y, Jiang B, Wang X, Xu W-H, Wang Y-Q, Zhang P-T, Zhang Y-J, Kong X-D (2022) Pan-genome analysis reveals the abundant gene presence/absence variations among different varieties of melon and their influence on traits. Front Plant Sci 13:835496. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.3389/fpls.2022.835496\u003c/span\u003e\u003cspan address=\"10.3389/fpls.2022.835496\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eTalukder ZA, Chhabra R, Muthusamy V, Gain N, Katral A, Mishra SJ, Zunjare RU, Hossain F (2023) Recessive \u003cem\u003ewaxy1\u003c/em\u003e and \u003cem\u003eopaque2\u003c/em\u003e genes synergistically regulate accumulation of amylopectin, lysine and tryptophan in maize. J Food Compos Anal 121:105392. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.jfca.2023.105392\u003c/span\u003e\u003cspan address=\"10.1016/j.jfca.2023.105392\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eTao Y, Luo H, Xu J, Cruickshank A, Zhao X, Teng F, Hathorn A, Wu X, Liu Y, Shatte T (2021) Extensive variation within the pan-genome of cultivated and wild sorghum. Nat Plants 7(6):766\u0026ndash;773. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1038/s41477-021-00925-x\u003c/span\u003e\u003cspan address=\"10.1038/s41477-021-00925-x\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eTetlow IJ (2018) Starch Biosynthesis in Crop Plants. Agronomy-Basel 8(6):81. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.3390/agronomy8060081\u003c/span\u003e\u003cspan address=\"10.3390/agronomy8060081\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eTetlow IJ, Emes MJ (2017) Starch biosynthesis in the developing endosperms of grasses and cereals. Agronomy-Basel 7(4):81. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.3390/agronomy7040081\u003c/span\u003e\u003cspan address=\"10.3390/agronomy7040081\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eThorvaldsd\u0026oacute;ttir H, Robinson JT, Mesirov JP (2013) Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform 14(2):178\u0026ndash;192. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1093/bib/bbs017\u003c/span\u003e\u003cspan address=\"10.1093/bib/bbs017\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003evon Maydell D (2023) PCR Allele Competitive Extension (PACE). Plant Genotyping: Methods and Protocols. Y. Shavrukov. Springer US, New York, NY, pp 263\u0026ndash;271\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWalker RD and Lichtenw.Re (1974) Ruminal digestibility of waxy grain sorghum. J Anim Sci 39(1):256\u0026ndash;256\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWhitfield RG, Gerger ME, Sharp RL (1987) Near-infrared spectrum Qualification via Mahalanobis Distance determination. Appl Spectrosc 41(7):1204\u0026ndash;1213\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWu WH, Zhong YY, Liu YL, Xu RY, Zhang XD, Liu N, Guo DW (2023) A new insight into the biosynthesis, structure, and functionality of waxy maize starch under drought stress. J Sci Food Agric 103(11):5270\u0026ndash;5276. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1002/jsfa.12599\u003c/span\u003e\u003cspan address=\"10.1002/jsfa.12599\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWu XR, Maier A, Wu TT, Li NB, Pedersen JR, Wang DH (2013) Evaluation of Nebraska waxy sorghum hybrids for ethanol production. Cereal Chem 90(3):198\u0026ndash;203. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1094/CCHEM-06-12-0067-R\u003c/span\u003e\u003cspan address=\"10.1094/CCHEM-06-12-0067-R\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eXin ZG, Jiao YP, Burow G, Hayes C, Chen JP, Burke J, Pugh NA, Ware D (2023) Registration of 252 sequenced sorghum mutants as a community reverse genetic resource. J Plant Registrations. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1002/plr2.20296\u003c/span\u003e\u003cspan address=\"10.1002/plr2.20296\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eXin ZG, Wang ML, Burow G, Burke J (2009) An induced sorghum mutant population suitable for bioenergy research. Bioenergy Res 2(1\u0026ndash;2):10\u0026ndash;16. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1007/s12155-008-9029-3\u003c/span\u003e\u003cspan address=\"10.1007/s12155-008-9029-3\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eXu Y, Zhang X, Li H, Zheng H, Zhang J, Olsen MS, Varshney RK, Prasanna BM, Qian Q (2022) Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction. Mol Plant 15(11):1664\u0026ndash;1695. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.molp.2022.09.001\u003c/span\u003e\u003cspan address=\"10.1016/j.molp.2022.09.001\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eYan J, Wang X (2023) Machine learning bridges omics sciences and plant breeding. Trends Plant Sci 28(2):199\u0026ndash;210. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.tplants.2022.08.018\u003c/span\u003e\u003cspan address=\"10.1016/j.tplants.2022.08.018\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eYan SP, Wu XR, Bean SR, Pedersen JF, Tesso T, Chen YHR, Wang DH (2011) Evaluation of waxy grain sorghum for ethanol production. Cereal Chem 88(6):589\u0026ndash;595. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1094/CCHEM-04-11-0056\u003c/span\u003e\u003cspan address=\"10.1094/CCHEM-04-11-0056\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eYang QN, Van Haute M, Korth N, Sattler S, Rose D, Juritsch A, Shao J, Beede K, Schmaltz R, Price J, Toy J, Ramer-Tait AE, Benson AK (2023) The waxy mutation in sorghum and other cereal grains reshapes the gut microbiome by reducing levels of multiple beneficial species. Gut Microbes 15(1):2178799. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1080/19490976.2023.2178799\u003c/span\u003e\u003cspan address=\"10.1080/19490976.2023.2178799\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eYerka MK, Toy JJ, Funnell-Harris DL, Sattler SE, Pedersen JF (2015) Registration of A/BN641 and RN642 waxy grain sorghum genetic stocks. J Plant Registrations 9(2):258\u0026ndash;261. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.3198/jpr2014.10.0075crgs\u003c/span\u003e\u003cspan address=\"10.3198/jpr2014.10.0075crgs\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eYerka MK, Toy JJ, Funnell-Harris DL, Sattler SE, Pedersen JF (2015) Registration of N619 to N640 grain sorghum lines with waxy or wild-type endosperm. J Plant Registrations 9(2):249\u0026ndash;253. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.3198/jpr2014.06.0043crgs\u003c/span\u003e\u003cspan address=\"10.3198/jpr2014.06.0043crgs\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eYerka MK, Toy JJ, Funnell-Harris DL, Sattler SE, Pedersen JF (2016) Evaluation of interallelic waxy, heterowaxy, and wild-type grain sorghum hybrids. Crop Sci 56(1):113\u0026ndash;121. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.2135/cropsci2015.03.0151\u003c/span\u003e\u003cspan address=\"10.2135/cropsci2015.03.0151\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":true,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":true,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"journal-of-applied-genetics","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"joag","sideBox":"Learn more about [Journal of Applied Genetics](https://www.springer.com/journal/13353)","snPcode":"","submissionUrl":"https://www.editorialmanager.com/joag/default.aspx","title":"Journal of Applied Genetics","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"em","reportingPortfolio":"Springer Hybrid","inReviewEnabled":true,"inReviewRevisionsEnabled":false},"keywords":"sorghum, waxy, grain quality, molecular markers, breeding","lastPublishedDoi":"10.21203/rs.3.rs-4883126/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-4883126/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eSeveral mutations of the sorghum [\u003cem\u003eSorghum bicolor\u003c/em\u003e (L.) Moench] \u003cem\u003eGRANULE-BOUND STARCH SYNTHASE\u003c/em\u003e (\u003cem\u003eGBSS\u003c/em\u003e) gene [\u003cem\u003eSobic.010G022600\u003c/em\u003e] result in a low amylose:amylopectin starch ratio in the endosperm and confer a glutinous, \u0026ldquo;waxy\u0026rdquo; texture; hence, the wild-type gene is commonly referred to as \u003cem\u003eWaxy\u003c/em\u003e (\u003cem\u003eWx\u003c/em\u003e). Recessive \u003cem\u003ewaxy\u003c/em\u003e (\u003cem\u003ewx\u003c/em\u003e) alleles improve starch digestibility in ethanol production, human foods and beverages, and animal feed. However, breeding waxy sorghum can be time-consuming due to the need for grain to reach physiological maturity before the trait can be phenotyped and ongoing reliance on PCR markers for genotyping, which are not amenable to next-generation sequencing (NGS). Modern genomics-assisted breeding requires conducing high-throughput genotyping and selection in large, segregating populations prior to flowering. This study provides the first published NGS markers for the two mostly commonly used \u003cem\u003ewaxy\u003c/em\u003e (\u003cem\u003ewx\u003c/em\u003e) alleles of sorghum and is the first to fully characterize the large insertion that is causal of the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e allele. An enhanced genome assembly was constructed from the B.Tx623 reference genome (v3.1.1) to include the 5.6 kb \u003cem\u003ela\u003c/em\u003erge \u003cem\u003er\u003c/em\u003eetrotransposon \u003cem\u003ed\u003c/em\u003eerivative (LARD) in the \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e allele. This improved read mapping at \u003cem\u003eSobic.010G022600\u003c/em\u003e in \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e individuals, identified 78 new uniquely mapped reads, and made it possible to distinguish different \u003cem\u003eWaxy\u003c/em\u003e genotypes using short-read sequencing data. Functional PACE-PCR markers, suitable for genomic selection, were developed for \u003cem\u003eWx\u003c/em\u003e, \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003ea\u003c/em\u003e\u003c/sup\u003e, and \u003cem\u003ewx\u003c/em\u003e\u003csup\u003e\u003cem\u003eb\u003c/em\u003e\u003c/sup\u003e alleles and validated in three public and private breeding programs. These new molecular breeding resources will improve the efficiency of developing commercial waxy sorghum hybrids.\u003c/p\u003e","manuscriptTitle":"An enhanced genome assembly and functional, high-throughput molecular markers enable genomics-assisted breeding of waxy sorghum [Sorghum bicolor (L.) Moench]","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2024-10-07 17:01:24","doi":"10.21203/rs.3.rs-4883126/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"decision","content":"Major Revisions Needed","date":"2025-02-05T01:54:17+00:00","index":"","fulltext":""},{"type":"reviewerAgreed","content":"","date":"2024-09-12T09:56:19+00:00","index":0,"fulltext":""},{"type":"reviewersInvited","content":"","date":"2024-09-05T07:42:48+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2024-08-12T07:22:19+00:00","index":"","fulltext":""},{"type":"submitted","content":"Journal of Applied Genetics","date":"2024-08-08T16:30:26+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"journal-of-applied-genetics","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"joag","sideBox":"Learn more about [Journal of Applied Genetics](https://www.springer.com/journal/13353)","snPcode":"","submissionUrl":"https://www.editorialmanager.com/joag/default.aspx","title":"Journal of Applied Genetics","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"em","reportingPortfolio":"Springer Hybrid","inReviewEnabled":true,"inReviewRevisionsEnabled":false}}],"origin":"","ownerIdentity":"897728dc-ff2c-4794-afb5-1bb1bfc03e85","owner":[],"postedDate":"October 7th, 2024","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"published-in-journal","subjectAreas":[],"tags":[],"updatedAt":"2025-08-18T16:05:43+00:00","versionOfRecord":{"articleIdentity":"rs-4883126","link":"https://doi.org/10.1007/s13353-025-00993-1","journal":{"identity":"journal-of-applied-genetics","isVorOnly":false,"title":"Journal of Applied Genetics"},"publishedOn":"2025-08-11 15:57:48","publishedOnDateReadable":"August 11th, 2025"},"versionCreatedAt":"2024-10-07 17:01:24","video":"","vorDoi":"10.1007/s13353-025-00993-1","vorDoiUrl":"https://doi.org/10.1007/s13353-025-00993-1","workflowStages":[]},"version":"v1","identity":"rs-4883126","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-4883126","identity":"rs-4883126","version":["v1"]},"buildId":"qtupq5eGEP_6zYnWcrvyt","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.