Comparative Analysis of orf182 Widespread in Chinese Wild Rice Species for Hybrid Rice Breeding

preprint OA: closed
Full text JSON View at publisher
Full text 126,504 characters · extracted from preprint-html · click to expand
Comparative Analysis of orf182 Widespread in Chinese Wild Rice Species for Hybrid Rice Breeding | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Comparative Analysis of orf182 Widespread in Chinese Wild Rice Species for Hybrid Rice Breeding Weixiong Long, Jie Wang, Lihua Luo, Lujian Zhou, Wei Chen, Laiyang Luo, and 5 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-7649942/v1 This work is licensed under a CC BY 4.0 License Status: Posted Version 1 posted You are reading this latest preprint version Abstract The utilization of hybrid rice has successfully addressed food security challenges in China. However, the predominant use of wild-abortive (WA) cytoplasmic male sterility (CMS) systems in hybrid rice cultivation raises significant safety concerns. In this study, we have elucidated the distribution patterns of orf182 , a novel CMS gene identified in our previous work, in Chinese common wild rice. Mitochondrial sequencing and assembly of 11 representative wild rice species harboring orf182 revealed 16 novel genes. A total of 469 mitochondrial genes were classified into 36 gene families, with nine families containing single-copy homologous genes, indicating significant gene duplication in mitochondria. We observed a strong positive correlation between mitogenome size and the quantity and size of repetitive sequences. Collinearity analysis revealed extensive mitochondrial variation and large-scale inversions in Guangdong wild rice. Comparative genome analysis uncovered inversions, translocations, and several variations surrounding orf182 , with a 71 bp repeat sequence mediating the formation of the orf182 - nad6 chimeric gene. Gene copy number analysis (GCNV) revealed variable orf182 gene copy counts (1, 2, and 3) in wild rice species. Additionally, successful transformation of orf182 from various sources into sterile lines was achieved. These findings provide valuable resources for advancing hybrid rice development in China, thus contributing to enhanced food security. Oryza rufipogon orf182 wild rice CMS structural variation Figures Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 Figure 6 Introduction Mitochondria, often termed the powerhouses of eukaryotic cells, are semi-autonomous organelles believed to have originated from prokaryotic ancestors via endosymbiosis. They harbor their own genetic material, which is indispensable for mitochondrial and cellular functions [1]. The mitochondrial genome (mitogenome or mtDNA) is typically maternally inherited and follows evolutionary trajectories distinct from those of nuclear genomes. Outbreeding-mediated reshuffling of nuclear and cytoplasmic genomes occurs widely and frequently, potentially reconfiguring the functional and structural interplay between these genetic systems. Rearrangements, mutations, and inheritance patterns in mitochondrial genomes can provoke evolutionary conflicts that significantly influence plant diversification, domestication, and hybridization [2, 3]. Nevertheless, the evolutionary mechanisms shaping cytoplasmic genomes—especially among wild relatives of crops—remain inadequately explored. Structural variation (SV) is considerably more prevalent in mitochondrial genomes than in chloroplast genomes, where SVs are rare and occur at low frequency [4], whereas mitochondrial genomes exhibit extensive SVs driven by frequent repeat-mediated recombination[5]. In contrast, plant mitochondrial genomes undergo extensive recombination mediated by repetitive sequences, leading to pronounced structural diversity [6] and can disrupt gene expression through the reshuffling of regulatory elements or the creation of chimeric open reading frames (ORFs). Many such chimeric ORFs have been associated with cytoplasmic male sterility (CMS), a phenotype characterized by aberrant floral development and abortion of male gametophytes [7, 8]. Recombination events are often facilitated by short repeats (SRs; 50–500 bp) and micro-repeats (MRs; <50 bp), which promote asymmetric and largely irreversible recombination, yielding recombinant molecules present at low (sub-stoichiometric) levels [9, 10]. These molecules, termed sublimons , can rapidly amplify under certain conditions through a mechanism known as sub-stoichiometric shifting (SSS), thereby altering the mitochondrial genotype (mitotype) and driving mtDNA evolution [11]. Collectively, recombination processes mediated by SRs, MRs, intracellular gene transfer (IGT), and horizontal gene transfer (HGT) generate diverse SVs—including rearrangements, duplications, insertions, and deletions [5, 10, 12]. Although SVs within gene regions are generally deleterious and selected against, those in intergenic regions may persist, contributing to the high sequence divergence observed in mitochondrial intergenic spacers [13-15]. Wild rice ( Oryza rufipogon ), the direct progenitor of cultivated rice ( Oryza sativa ), was domesticated approximately 8,000–10,000 years ago. It represents a critical genetic resource for improving stress tolerance, yield, and grain quality in modern rice breeding [16]. While the nuclear genome of O. rufipogon has been extensively studied, providing key insights into rice evolution and domestication [17, 18], research on its mitochondrial genome remains limited. Early studies using restriction fragment length polymorphism (RFLP) analysis revealed mtDNA variation and distinct mitotypes among wild and cultivated rice populations [19]. Furthermore, several CMS types—including the widely used WA-, BT-, HL-, and D1-CMS systems—have been identified in cultivated rice, many tracing back to O. rufipogon or closely related accessions [7, 20-22]. However, only a few complete mitogenomes of O. rufipogon have been reported to date, and a comprehensive comparative analysis is still lacking. As a result, the extent of mitochondrial genomic diversity and its evolutionary relationship with cultivated rice remain poorly understood. In this study, we employed D1-CMS-type wild rice as a model to investigate cytoplasmic genome evolution. We constructed a pan-mitogenome based on high-quality long-read assemblies from 11 representative accessions of common wild rice. Our comprehensive analyses revealed extensive mitochondrial rearrangements and the presence of chimeric ORFs. We generated genome-wide mitochondrial variation maps from diverse Oryza accessions for population-level genomic studies. The potential role of chimeric ORFs—classified into distinct phylogenetic clades based on mitotype-specific markers (MSS)—in CMS was evaluated using cytoplasmic substitution lines. Furthermore, we examined recombination dynamics around the mitochondrial gene orf182 to elucidate its structural complexity and explore the mechanisms underlying rice mtDNA evolution. Finally, we characterized structural variations across the mitochondrial genome to uncover general architectural features and evolutionary constraints. Results Phylogenetic tree constructed by Mitotype-specific sequence (MSS) Previous studies identified the orf182 gene exclusively in 18 Oryza rufipogon accessions from China, distributed among 17 species across nine genomic types, based on a screen of 446 wild rice samples [22]. To further elucidate the evolutionary origin and distribution of this gene, we collected an additional 320 common wild rice accessions from three provinces in China: Guangdong, Jiangxi, and Guangxi. Among these, 76 accessions (23.75%) exhibiting the D1-type cytoplasmic male sterility (CMS) phenotype were selected for subsequent genetic and genomic analyses (Table S1). From a total of 57 mitochondrial structural sequence (MSS) profiles obtained from these accessions [23], 18 representative samples showing pronounced polymorphisms were used to reconstruct a phylogenetic tree, aiming to clarify the evolutionary relationships within the D1-type wild rice group (Fig. 1, Tables S2, S3). Furthermore, 11 representative wild rice accessions spanning nine distinct D1-type phylogenetic clades were subjected to high-quality mitochondrial genome assembly using a hybrid sequencing strategy combining long-read and short-read technologies. These high-resolution assemblies facilitated detailed comparative and evolutionary analyses of the mitochondrial genome, with particular emphasis on structural variation, recombination events, and the potential functional implications of CMS-associated open reading frames such as orf182 . Based on three Oryza rufipogon accessions carrying orf182 fromdifferent geographical origins and phylogenetic tree branch, we successfully converted Mingxiangsi into male sterile lines after only five backcross generations(Fig. 1B). These results demonstrates that the D1-type CMS system can be readily introduced into diverse genetic backgrounds, highlighting its efficiency and broad applicability in hybrid breeding programs. General features of the Oryza rufipogon mitochondrial genome A total of 69 Gb of raw Illumina reads (150 bp in length) from 11 samples, along with 506 Mb of Nanopore PromethION data from 9 wild rice varieties and 572 Mb of PacBio long reads (average length ~10 kb), were generated for mitochondrial genome assembly (Table S4). The assembled mitogenomes of the 11 wild rice accessions ranged in size from 438,632 bp to 718,357 bp. Functional annotations and physical locations of genes across these mitochondrial genomes are summarized in Fig. 2 and Fig. S1. In the JX2 mitogenome, 77 genes were annotated, including 46 protein-coding genes, 28 tRNAs, and 3 rRNAs. These encode 37 distinct proteins, with atp4 , atp6 , atp8 , atp9 , cob-like , orf182 , and orfX present in two copies each, and nad6 in three copies. The protein-coding genes were classified into 11 functional categories: ATP synthases (5 genes), cytochrome C biogenesis (4 genes), ubiquinol cytochrome c reductase (1 gene), cytochrome c oxidase (3 genes), maturases (1 gene), NADH dehydrogenase (9 genes), cytoplasmic male sterility-related (1 gene), open reading frames (1 gene), large ribosomal subunit proteins (3 genes), and small ribosomal subunit proteins (8 genes). On average, seven chimeric ORFs derived from fragments of known and conserved protein-coding genes (excluding tRNAs and rRNAs) were identified in each mitogenome. Notably, nad6 (from the nad family) and rpl2 and rpl16 (from the rpl family) frequently participated in the formation of prominent chimeric ORFs across all wild rice accessions examined (Table S5). To assess structural variation (SV) and its functional implications, we performed gene copy number analysis. A core set of 37 protein-coding genes was conserved across all 11 mitochondrial genomes. Among these, nad6 exhibited the highest copy number. The accession GX34 showed the most extensive gene loss, affecting atp4 , atp8 , rps7 , cox3 , nad5 , nad7 , rps13 , ccmFc , rpl5 , and cox1 . Additionally, genes such as cob-like and orfX were lost in two other accessions. Gene duplication was widespread: cox1 was replicated in nearly all wild rice mitogenomes, and nad6 duplication occurred in all 11 accessions (Fig. S2). Repeat sequence variation and correlations between genome size and repeat sequences Mitogenome repeats were classified into class I (repeats ≤ 100 bp), class II (100–500 bp), class III (500–2000 bp), and IV (> 2000 bp) based on repeat length. The number of class I repeat sequences ranged from 239 (GX44 and GX51) to 724 (GX4), account for 1.53-1.68% of mitochondrial genome size (Fig. 3A; Table S6). Among the samples, the number of class I repeats in GX4 samples was significantly higher than samples from wild rice in same province and other wild rice, while the number of repeats longer than 100 bp was not significantly increased (t-test, P-value = 0.11) (Table S6). Meanwhile the total repeat number (864) of GX4 was higher than that of other sample (total repeat number: 316–490) (Fig. 3), and the total repeat length show the similar pattern, while the mitochondrial size of GX4 greater than other contributed by the class IV type repeat. These were followed by repeats in class II, which accounted for approximately 15% of the total repeats, with 56–90 in number. There were most least repeats length located in class III and only accounted for 0.92%. In contrast, a total of 12–38 repeats in class IV were identified, with lengths ranging from 114.38 to 574.38kb, contributing 26.04–79.96% of the mitogenome size. Despite the relatively low number of repeats in class IV, they were the major contributors to the total mitogenome size (Fig. 3B). For all of the 11 wild rice samples, mitogenomes size show significantly high correlations (P-value < 0.001) with total repeat number and length (Fig. S3, Fig. 3c, d). All the count of the four repeat categories showed significant correlation with variations in genome size (R2 adj ranged from 0.63 to 0.99, P-value < 0.003) (Fig. S3). While the Class II repeat size show low correlations with the mitochondrial genome size (Fig. 3d). In addition, several mitogenomes may have an overrepresentation of repeat content (Table S6). For example, GX46 (total repeat length: 466 kb) had a similar genome size to GX51 and GX44 (438 kb), but the total repeat length was about 1.46-fold of GX51 (131 kb). Identified the structural variation in D1-wild rice mitogenomes Previous studies have indicated that repeat sequences can mediate rearrangements of varying extent in the mitochondrial genome. By aligning the 11 assembled mitogenomes to the Nipponbare reference genome, we identified syntenic regions accounting for 12% to 30% of each genome (Table S7). Structural variations across assemblies included one to three inversions, two to six translocations, 27 to 68 insertions, and 32 to 61 deletions (Fig. S4). Wild rice accessions with D1-CMS exhibited the highest abundance of SVs (Fig. S5). Notably, although GX34 ranked fourth lowest in terms of SV quantity, it displayed the greatest total SV length. This apparent discrepancy is largely attributable to the absence of large-scale translocations in GX34, resulting in a notably low minimal SV length. Mitogenomes of wild rice from Guangdong province appeared relatively conserved, a pattern that may reflect either geographic sampling concentration or limited underlying genetic differentiation. In contrast, the two Jiangxi accessions exhibited markedly distinct profiles in both the number and size of SVs (Fig. S5). Whole-genome alignment using Mauve further characterized large-scale structural rearrangements, revealing 17 locally collinear blocks (LCBs) with variable ordering across accessions (Fig. S6). Three major rearrangement events were detected in Jiangxi Dongxiang wild rice, whereas only five LCBs were identified in GX4 and GX34. These results suggest that the mitochondrial genomes of Guangxi D1-CMS wild rice have lost numerous conserved genomic modules during evolution, consistent with previous reports of extensive structural variation in accessions from this region. Pan-mitochondrial genome of 11 selected D1 CMS wild rice and Nipponbare To further investigate mitochondrial genomic diversity across different origins, we selected 11 D1-CMS wild rice accessions representing distinct groups, along with the previously published Nipponbare mitochondrial genome, for pan-genome construction and comparative analysis. The gene accumulation curve (Fig. 5C) indicated an open pan-genome, suggesting that new genes are likely to be identified as more mitochondrial genomes from rice are sequenced. Comparative analysis of the eleven D1-CMS wild rice mitogenomes revealed no private genes among these accessions. Mitochondrial genomes from Guangdong accessions were generally the most conserved. However, one accession, GD4, exhibited a reduced number of core genes and a higher proportion of variable genes within the pan-genome. In contrast, accessions from Guangxi showed greater variability in pan-genome composition. Among these, GX34 contained fewer variable genes, implying a more stable evolutionary history and possible adaptation to a narrower ecological niche. Further comparison between Jiangxi accessions indicated that JX5 possessed more variable genes than JX2. Notably, private genes were exclusively identified in the Nipponbare reference genome, highlighting substantial divergence in mitochondrial gene content between the reference and the D1-CMS wild rice lineages examined in this study (Fig. 5A). We also quantified the proportional distribution of different genomic components. Core gene families accounted for the largest share (44.23%), followed by private (30.77%) and variable (25%) gene families. In terms of copy number, core genes constituted 66.55% of all genes, while private genes represented only 2.89%. Discussion In this study, we identified the chimeric mitochondrial gene orf182 as a unique genetic element present exclusively in a subset of Chinese wild rice accessions. This finding offers significant insights into the evolutionary dynamics of plant mitochondrial genomes, the origin of cytoplasmic male sterility (CMS)-associated genes, and the potential utility of orf182 in hybrid rice breeding. The restricted distribution of orf182 suggests that it likely originated from a localized mitochondrial recombination event. This observation supports the hypothesis that CMS-related genes often emerge independently in different populations through rapid and extensive mitochondrial restructuring. Similar to orf79 and orfH79 , orf182 exemplifies the high structural plasticity of plant mitochondrial genomes, wherein recurrent gene fusion events give rise to chimeric open reading frames. The occurrence of orf182 solely in Chinese wild rice further indicates that CMS genes are not universally conserved but rather represent region-specific evolutionary innovations. Its discovery highlights the distinctive genetic value of Chinese wild rice germplasm within the global rice gene pool. As a naturally occurring CMS-inducing gene, orf182 provides direct evidence of functional diversity preserved in wild populations. Among the collected Chinese Oryza rufipogon accessions, 23.75% were found to carry orf182 . Based on these findings, we propose that the mitochondrial genome harboring orf182 has not evolved through conservative mechanisms but rather via dynamic structural rearrangements. Mitochondrial genome structural plasticity and chimeric gene formation Our results demonstrate that large-scale structural variations (LSVs) are prevalent in the mitochondrial genomes of wild rice, including among accessions from the same geographic region. Accessions such as GD4 and GX4 exhibit pronounced structural divergence compared to others. Notably, the physical scale of SVs did not correlate with their quantity; for instance, GX34 displayed the largest SV size but ranked among the lowest in terms of SV number. Substantial variation in mitochondrial genome size was observed across the wild rice panel (Fig. 3 ). The assembly and annotation of 11 mitochondrial genomes provide new insights into this variability. None of the wild rice accessions shared a common presence–absence variation (PAV) region associated with orf182 , indicating that this gene did not originate from a recent insertion event. Instead, analysis of repetitive sequences revealed that some orf182 chimeric genes likely arose from duplication events, while others were generated through recombination mediated by a 71-bp repetitive element, which appears to have undergone multiple recombination cycles (Fig. 6 ). CMS gene diversity and implications for domestication and hybrid breeding Dynamic structural changes, including gene duplications, have facilitated the emergence of new gene copies in the mitochondrial genome. Nucleotide alignment of orf182 sequences identified several SNP sites, which were categorized into three haplotypes (Fig. S7 ). All detected SNPs were synonymous, suggesting neutral evolution at the sequence level. CMS genes such as orf182 may have historically functioned as genetic barriers that contributed to reproductive isolation, thereby influencing domestication pathways and exhibiting geographic specificity. The D1-CMS system, characterized by orf182 , represents a promising resource for the development of novel hybrid rice breeding systems. Material and methods Plant materials Oryza rufipogon collected from three province, Guangdong(GD), Guangxi(GX) and Jiangxi (JX), were used in the present research. The Guangdong common wild rice materials were generously contributed and identified by professor Yonggen Lu (South China Agricultural University), Guangxi wild rice were obtained from Researcher Yuntao Liang (Guangxi Academy of Agricultural Sciences), while Jiangxi wild rice were kindly provided by the Dongxiang Wild Rice Resource Nursery and identified by Hongwei Xie. Oryza spontanea (N104625) was provided by international rice research institute, Philippines. The voucher specimens (refer to Table S1) are stored in the herbarium of Jiangxi Academy of Agricultural Sciences, relevant data are stored in the Germplasm Resource Database of the Jiangxi Academy of Agricultural Sciences (currently an internal website, and are available from the corresponding author upon reasonable request). All the wild rice and O. spontanea were grown in the Wild Rice Germplasm Resource Nursery of the Jiangxi Academy of Agricultural Sciences (28°56′20″N, 115°39′59″E). Construction of maximum likelihood tree based on the Mitotype-specific sequences (MSSs) To selected the representative D1-CMS relatives among the 76 common wild rice collected from worldwide. The previous developed 20 pairs of evenly distribution D1-CMS mitochondrial specific sequences were used (Table S2). The 0/1 matrix was generated by MTTs gel results (Table S3). The phylogenetic tree was constructed using the maximum likelihood method in IQ-TREE software v1.6.12 [24], and the resulting maximum likelihood tree was visualized using ITOL software [25]. Samples, DNA extraction and Mitochondrial genome sequencing A total of 11 representative wild rice were selected based on the phylogenetic tree constructed by MSSs in this study (Table 1). Fresh leaves from the wild rice species were collected from a local filed experiment field of Jiangxi Academy of Agricultural Science, Nanchang, China. High-molecular-weight genomic DNA was extracted from 1-month-old leaves following the standard CTAB method. The mitochondrial genome of 11 wild rice was sequenced using the Illimina, Pacbio and Nanopore methods. Paired-end libraries with insert sizes of ∼300 bp were prepared following Illumina’s standard genomic DNA library preparation procedure. The qualified Illumina pair-end library would be used for Illumina NovaSeq 2500 sequencing (150 bp × 2, Shanghai BIOZERON Co., Ltd., Shanghai, China). Mitochondrial genome assembly and annotation Raw sequencing reads were trimmed using Btrim software to filter out low-quality bases and sequences. The high quality reads were assembled into bp using SOAP de novo v2.04 ( http://soap.genomics.org.cn/) and Celera Assembler 8.0. Gaps in the SSPACE scaffolds were filled using GapCloser v.1.12 [26] with default parameters. Finally, redundant sequences were manually removed to obtain the final mitochondrial genome. The denovo annotation strategy and homology-based comparison were conducted for Genome annotation. We utilized the protein sequences of the reference genome for homology alignment prediction. First, we rapidly aligned the protein sequences to each genome sequence, filtered out the poor alignment results, removed redundancies, and then using Genewise for precise alignment. The AUGUSTUS software (http://bioinf.uni-greifswald.de/augustus/) was performed to de novo gene prediction for plant mitochondrial genomes. Finally, we integrated the gene sets using the EVidenceModeler v1.1.1 software [27] to obtain the comprehensive annotation file. The output genebank format file was manually confirmed, and the mitochondrial circular map was drawn using Organellar Genome DRAW (MPI-MP CHLOROBOX - OGDRAW (mpg.de)). Analysis of repeated sequences BlastN was used to determine whether there were intragenomic duplications of large fragments or interspersed repeats in the 11 wild rice mitogenomes. Microsatellite identification tool was used to detect simple sequence repeats. The repeats of 1, 2, 3, 4, 5, and 6 bases with 8, 4, 4, 3, 3, and 3 repeats numbers, repeatively, were identified in this analysis. The tandem repeats with > 7p repeats units were detected using Tandem Repeats Finder v4.10 [28]. The direct and inverted repeats were identified using REPuter software (BiBiServ - Temporariliy not available (uni-bielefeld.de)) with the minimal repeat size set to 20 bp. Collinear analysis and comparative analysis among the D1-CMS wild rice The mitogenome of Nipponbare was characterized by Notsu [29] were used for comparision. Collinear analysis between the two mitogenomes were conducted in Mummer [30]. Collinear regions were identified using the nucmer module with an identity threshold of 95% under the many-to-many alignment mode. The resulting collinearity maps were visualized with Plotsr implemented in the SYRI software[31]. The genome sizes, GC contents, base compositions, start and stop codons, gene counts, and intron numbers of 11wild rice mitogenomes were compared to assess their variations and similarities. The copy number of protein coding genes among the mitogenome were identified. Rearrangement event identification in D1-type wild rice mitogenome Mauve[32] genome alignments serve as a powerful tool for identify conserved genomic regions and structural rearrangements across different samples. In this study, we conducted a series of Mauve alignments under default parameters to compare mitochondrial genomes of GD, GX and JX, representing three distinct province region of China. To infer the rearrangement rate between eleven wild rice, multiple alignment of all pairwise combinations of the mitogenomes were conducted using Mauve (version 20150226) to identify locally collinear blocks (LCBs) in each mitogenome with default parameters. Structural variation between the wild rice mitochondrial genome and cultivate rice To construct the D1-type wild rice mitochondrial SV maps, we divided the SVs into four categories-insertions (INSs), deletions (DELs), inversions (INVs) and duplications (DUPs). A whole genome alignment by using mummer combined with SYRI software based on the NIP reference mitochondrial genome. Construction of mitochondrial genome variant maps The SNP and INDEL calling of 11 D1-wild rice accessions was also conducted using NIP (BA000029.3) as a reference mitochondrial genome. Raw data of 11 wild rice were trimmed by Trimmomatic v0.39 [33]. Clean data was mapped to the reference genome using BWA[34]. Samtools [35] was used to convert the SAM into BAM format file. Picards (Picard Tools - By Broad Institute) was used to remove the duplicated reads. Variation identification and filtering were conducted by using GATK 4.1[36]. Finally, all SNP and INDELs with MAF large than 0.05 and max-missing rate less than 0.1 was kept for subsequent analysis. Analyses of the protein-coding-gene based Pan-mitochondrial genome The all-to-all comparison of the peptide sequence of protein-coding genes was carried out using BLASTP with an E-value of e-10, and followed by clustering of the blast results using OrthoFinder [37] with default parameters. The clusters were categorized into three groups: the core genome shared by all 12 genomes, the dispensable genome containing genes present in 2 to 11 genomes, and specific gene families unique to only one genome. Declarations Data Availability The mitochondrial genome sequences of long reads and short reads supporting in this study have been submitted to the National Center for Biotechnology Information (NCBI) with BioProject ID: PRJNA1338361 and PRJNA1338456, the material that support the findings in this study are available from the corresponding author, Hongwei Xie, upon reasonable request. Acknowledgements We appreciate professor Yonggen Lu, and Researcher Yuntao Liang for kindly providing the wild rice materials. Funding This work was supported by Major Talent Development Program of the Jiangxi Academy of Agricultural Sciences (JXSNKYJCRC202501), Jiangxi Province Science Foundation for Youths (20224BAB215008), Major Discipline Academic and Technical Leaders Training Program of Jiangxi Province (20213BCJL22044), China Agriculture Research System (CARS-01-08). Author information Authors and Affiliations Jiangxi Super-Rice Research and Development Center, Jiangxi Provincial Key Laboratory of Rice Germplasm Innovation and Breeding, National Engineering Research Center for Rice, Jiangxi Academy of Agricultural Sciences, Nanchang, 330200, China Weixiong Long, Jie Wang, Lihua Luo, Lujian Zhou, Wei Chen, Laiyang Luo, Weibiao Xu, Yonghui Li, Longan Yan, Yaohui Cai & Hongwei Xie Contributions Weixiong Long, Longan Yan, Yaohui Cai, and Hongwei Xie conceived and designed the project. Jie wang, Lihua Luo, Lujian Zhou and Wei Chen were responsible for the management of the wild rice materials. Laiyang Luo, Weibiao Xu, and Yonghui Li helped in the preparation of the manuscript. Weixiong Long and Hongwei Xie wrote and supervised the manuscript. All authors read and approved the manuscript. Corresponding authors Correspondence to Hongwei Xie. Ethics declarations Ethics approval and consent to participate Oryza rufipogon seedings collected from Guangdong province in China and were identified by professor Yonggen Lu, while the Guangxi wild rice ( Oryza rufipogon ) seedlings were collected from Guangxi Academy of Agricultural Sciences and were identified by researcher Yuntao Liang, the Jiangxi common wild rice were collected from Dongxiang, Jiangxi, China and were identified by researcher Hongwei Xie. The methods involved in the study were carried out in compliance with local and national regulations. Consent for publication Not applicable Competing interests The authors declare that they have no competing interests. References Milani L, Ghiselli F: Mitochondrial activity in gametes and transmission of viable mtDNA . Biol Direct 2015, 10 :22. Eyre-Walker A, Gaut RL, Hilton H, Feldman DL, Gaut BS: Investigation of the bottleneck leading to the domestication of maize . Proc Natl Acad Sci U S A 1998, 95 (8):4441-4446. Wang N, Li C, Kuang L, Wu X, Xie K, Zhu A, Xu Q, Larkin RM, Zhou Y, Deng X et al : Pan-mitogenomics reveals the genetic basis of cytonuclear conflicts in citrus hybridization, domestication, and diversification . Proc Natl Acad Sci U S A 2022, 119 (43):e2206076119. Zhou Y, Duvaux L, Ren G, Zhang L, Savolainen O, Liu J: Importance of incomplete lineage sorting and introgression in the origin of shared genetic variation between two closely related pines with overlapping distributions . Heredity (Edinb) 2017, 118 (3):211-220. Cole LW, Guo W, Mower JP, Palmer JD: High and Variable Rates of Repeat-Mediated Mitochondrial Genome Rearrangement in a Genus of Plants . Mol Biol Evol 2018, 35 (11):2773-2785. Kozik A, Rowan BA, Lavelle D, Berke L, Schranz ME, Michelmore RW, Christensen AC: The alternative reality of plant mitochondrial DNA: One ring does not rule them all . PLoS Genet 2019, 15 (8):e1008373. Tang H, Luo D, Zhou D, Zhang Q, Tian D, Zheng X, Chen L, Liu YG: The rice restorer Rf4 for wild-abortive cytoplasmic male sterility encodes a mitochondrial-localized PPR protein that functions in reduction of WA352 transcripts . Mol Plant 2014, 7 (9):1497-1500. Fujii S, Bond CS, Small ID: Selection patterns on restorer-like genes reveal a conflict between nuclear and mitochondrial genomes throughout angiosperm evolution . Proc Natl Acad Sci U S A 2011, 108 (4):1723-1728. Shedge V, Davila J, Arrieta-Montiel MP, Mohammed S, Mackenzie SA: Extensive rearrangement of the Arabidopsis mitochondrial genome elicits cellular conditions for thermotolerance . Plant Physiol 2010, 152 (4):1960-1970. Gualberto JM, Newton KJ: Plant Mitochondrial Genomes: Dynamics and Mechanisms of Mutation . Annu Rev Plant Biol 2017, 68 :225-252. Zhao N, Wang Y, Hua J: The Roles of Mitochondrion in Intergenomic Gene Transfer in Plants: A Source and a Pool . Int J Mol Sci 2018, 19 (2). Kong J, Wang J, Nie L, Tembrock LR, Zou C, Kan S, Ma X, Wendel JF, Wu Z: Evolutionary dynamics of mitochondrial genomes and intracellular transfers among diploid and allopolyploid cotton species . BMC Biol 2025, 23 (1):9. Bentolila S, Stefanov S: A reevaluation of rice mitochondrial evolution based on the complete sequence of male-fertile and male-sterile mitochondrial genomes . Plant Physiol 2012, 158 (2):996-1017. Clifton DR, Fields KA, Grieshaber SS, Dooley CA, Fischer ER, Mead DJ, Carabeo RA, Hackstadt T: A chlamydial type III translocated protein is tyrosine-phosphorylated at the site of entry and associated with recruitment of actin . Proc Natl Acad Sci U S A 2004, 101 (27):10166-10171. Sloan DB, Alverson AJ, Chuckalovcak JP, Wu M, McCauley DE, Palmer JD, Taylor DR: Rapid evolution of enormous, multichromosomal genomes in flowering plant mitochondria with exceptionally high mutation rates . PLoS Biol 2012, 10 (1):e1001241. Long W, He Q, Wang Y, Wang Y, Wang J, Yuan Z, Wang M, Chen W, Luo L, Luo L et al : Genome evolution and diversity of wild and cultivated rice species . Nat Commun 2024, 15 (1):9994. Huang X, Kurata N, Wei X, Wang ZX, Wang A, Zhao Q, Zhao Y, Liu K, Lu H, Li W et al : A map of rice genome variation reveals the origin of cultivated rice . Nature 2012, 490 (7421):497-501. Guo D, Li Y, Lu H, Zhao Y, Kurata N, Wei X, Wang A, Wang Y, Zhan Q, Fan D et al : A pangenome reference of wild and cultivated rice . Nature 2025, 642 (8068):662-671. Kadowaki K, Suzuki T, Kazama S: A chimeric gene containing the 5' portion of atp6 is associated with cytoplasmic male-sterility of rice . Mol Gen Genet 1990, 224 (1):10-16. Luo D, Xu H, Liu Z, Guo J, Li H, Chen L, Fang C, Zhang Q, Bai M, Yao N et al : A detrimental mitochondrial-nuclear interaction causes cytoplasmic male sterility in rice . Nat Genet 2013, 45 (5):573-577. Akagi H, Sakamoto M, Shinjyo C, Shimada H, Fujimura T: A unique sequence located downstream from the rice mitochondrial atp6 may cause male sterility . Curr Genet 1994, 25 (1):52-58. Xie H, Peng X, Qian M, Cai Y, Ding X, Chen Q, Cai Q, Zhu Y, Yan L, Cai Y: The chimeric mitochondrial gene orf182 causes non-pollen-type abortion in Dongxiang cytoplasmic male-sterile rice . Plant J 2018. Xie H, Wang J, Qian M, Li N, Zhu Y, Li S: Mitotype-specific sequences related to cytoplasmic male sterility in Oryza species . Molecular Breeding 2013, 33 (4):803-811. Trifinopoulos J, Nguyen LT, von Haeseler A, Minh BQ: W-IQ-TREE: a fast online phylogenetic tool for maximum likelihood analysis . Nucleic Acids Res 2016, 44 (W1):W232-235. Letunic I, Bork P: Interactive Tree of Life (iTOL) v6: recent updates to the phylogenetic tree display and annotation tool . Nucleic Acids Res 2024, 52 (W1):W78-W82. Xu M, Guo L, Gu S, Wang O, Zhang R, Peters BA, Fan G, Liu X, Xu X, Deng L et al : TGS-GapCloser: A fast and accurate gap closer for large genomes with low coverage of error-prone long reads . Gigascience 2020, 9 (9). Haas BJ, Salzberg SL, Zhu W, Pertea M, Allen JE, Orvis J, White O, Buell CR, Wortman JR: Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments . Genome Biol 2008, 9 (1):R7. Benson G: Tandem repeats finder: a program to analyze DNA sequences . Nucleic Acids Res 1999, 27 (2):573-580. Notsu Y, Masood S, Nishikawa T, Kubo N, Akiduki G, Nakazono M, Hirai A, Kadowaki K: The complete sequence of the rice (Oryza sativa L.) mitochondrial genome: frequent DNA sequence acquisition and loss during the evolution of flowering plants . Mol Genet Genomics 2002, 268 (4):434-445. Marcais G, Delcher AL, Phillippy AM, Coston R, Salzberg SL, Zimin A: MUMmer4: A fast and versatile genome alignment system . PLoS Comput Biol 2018, 14 (1):e1005944. Goel M, Sun H, Jiao WB, Schneeberger K: SyRI: finding genomic rearrangements and local sequence differences from whole-genome assemblies . Genome Biol 2019, 20 (1):277. Darling AC, Mau B, Blattner FR, Perna NT: Mauve: multiple alignment of conserved genomic sequence with rearrangements . Genome Res 2004, 14 (7):1394-1403. Bolger AM, Lohse M, Usadel B: Trimmomatic: a flexible trimmer for Illumina sequence data . Bioinformatics 2014, 30 (15):2114-2120. Li H, Durbin R: Fast and accurate short read alignment with Burrows-Wheeler transform . Bioinformatics 2009, 25 (14):1754-1760. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, Genome Project Data Processing S: The Sequence Alignment/Map format and SAMtools . Bioinformatics 2009, 25 (16):2078-2079. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M et al : The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data . Genome Res 2010, 20 (9):1297-1303. Emms DM, Kelly S: OrthoFinder: phylogenetic orthology inference for comparative genomics . Genome Biol 2019, 20 (1):238. Additional Declarations No competing interests reported. Supplementary Files TableS1Alistofwildricespecieswhichcontainorf182inthisstudy.xlsx TableS2Theinformationoforf182andMSSmarkersusedinthisstudy.xlsx TableS3Thegenotypegenerateby18MSSmarker.xlsx TableS4Shortandlongreadsandtotalsize.xlsx TableS5chimericgene.xlsx TableS6wildricemitochondrialgenomerepeat.csv TableS7syri.xlsx FigS1genomeassemble.tif figS2generepeat.tif FigS3Repeatcount.tif figS4syrisummary.tif figS5.SV.tif FigS6mauve.tif FigS7orf182snp.jpg Cite Share Download PDF Status: Posted Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-7649942","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":539485080,"identity":"1c4b7e65-e91d-49c5-ae8d-0283b23f69ac","order_by":0,"name":"Weixiong Long","email":"","orcid":"","institution":"Jiangxi Academy of Agricultural Sciences","correspondingAuthor":false,"prefix":"","firstName":"Weixiong","middleName":"","lastName":"Long","suffix":""},{"id":539485081,"identity":"01ea4d5a-3b49-4ea1-ba67-736ff8971704","order_by":1,"name":"Jie Wang","email":"","orcid":"","institution":"Jiangxi Academy of Agricultural Sciences","correspondingAuthor":false,"prefix":"","firstName":"Jie","middleName":"","lastName":"Wang","suffix":""},{"id":539485082,"identity":"2eb22838-0857-450a-98b3-2985e8ca519f","order_by":2,"name":"Lihua Luo","email":"","orcid":"","institution":"Jiangxi Academy of Agricultural Sciences","correspondingAuthor":false,"prefix":"","firstName":"Lihua","middleName":"","lastName":"Luo","suffix":""},{"id":539485083,"identity":"3bde25dd-07ff-49dc-8981-d4989a77e05d","order_by":3,"name":"Lujian Zhou","email":"","orcid":"","institution":"Jiangxi Academy of Agricultural Sciences","correspondingAuthor":false,"prefix":"","firstName":"Lujian","middleName":"","lastName":"Zhou","suffix":""},{"id":539485084,"identity":"c80e406c-8b17-44a0-a9fa-e2bacca6dc65","order_by":4,"name":"Wei Chen","email":"","orcid":"","institution":"Jiangxi Academy of Agricultural Sciences","correspondingAuthor":false,"prefix":"","firstName":"Wei","middleName":"","lastName":"Chen","suffix":""},{"id":539485085,"identity":"add3cbf8-8576-4ac1-b243-abf2b2dca1ca","order_by":5,"name":"Laiyang Luo","email":"","orcid":"","institution":"Jiangxi Academy of Agricultural Sciences","correspondingAuthor":false,"prefix":"","firstName":"Laiyang","middleName":"","lastName":"Luo","suffix":""},{"id":539485086,"identity":"4eb04aad-a930-4b7f-958c-b0fe7ac0ecfe","order_by":6,"name":"Weibiao Xu","email":"","orcid":"","institution":"Jiangxi Academy of Agricultural Sciences","correspondingAuthor":false,"prefix":"","firstName":"Weibiao","middleName":"","lastName":"Xu","suffix":""},{"id":539485087,"identity":"cf658764-d992-45cf-9ec2-94aeb730ab35","order_by":7,"name":"Yonghui Li","email":"","orcid":"","institution":"Jiangxi Academy of Agricultural Sciences","correspondingAuthor":false,"prefix":"","firstName":"Yonghui","middleName":"","lastName":"Li","suffix":""},{"id":539485088,"identity":"d75904fa-866b-418f-8278-8ea194740426","order_by":8,"name":"Longan Yan","email":"","orcid":"","institution":"Jiangxi Academy of Agricultural Sciences","correspondingAuthor":false,"prefix":"","firstName":"Longan","middleName":"","lastName":"Yan","suffix":""},{"id":539485089,"identity":"68bb705d-873d-4d63-b887-fa47a0a4f6c2","order_by":9,"name":"Yaohui Cai","email":"","orcid":"","institution":"Jiangxi Academy of Agricultural Sciences","correspondingAuthor":false,"prefix":"","firstName":"Yaohui","middleName":"","lastName":"Cai","suffix":""},{"id":539485090,"identity":"e56d752e-dc5a-4894-b816-999f74bd48c3","order_by":10,"name":"Hongwei Xie","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAAv0lEQVRIiWNgGAWjYDACCQbGBwkVNswkaWE2+HAmjTQtbJIzWw6ToMPgdo+BNG/DeXaD44cff2CouUNYi+ScMwbGvDtuM0v2pJlJMBx7RlgLv0TuhmTeM7eZ+RkSzBgYG4hwIRtQy2HetnPMbPzPP38gSgvQlo2NM9sOMPNL5BhIEKVFckb+Z4YPZ5KZJWe8KZNIOEaEFoMbaek/Eirskg3Op2/+8KGGhNBOBpMJxGtgYLAjRfEoGAWjYBSMMAAAJOw6qXuHtKwAAAAASUVORK5CYII=","orcid":"","institution":"Jiangxi Academy of Agricultural Sciences","correspondingAuthor":true,"prefix":"","firstName":"Hongwei","middleName":"","lastName":"Xie","suffix":""}],"badges":[],"createdAt":"2025-09-18 13:08:15","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-7649942/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-7649942/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":95232643,"identity":"9ddd00b5-6789-49c3-a732-df11d67a5436","added_by":"auto","created_at":"2025-11-05 16:40:32","extension":"tif","order_by":0,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":17057860,"visible":true,"origin":"","legend":"","description":"","filename":"Figure2mitogenomelargerepeat.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/7827e99dbb94b60089436b22.tif"},{"id":95232745,"identity":"4d4cab2e-61c2-4292-a11a-6692c3001e06","added_by":"auto","created_at":"2025-11-05 16:40:38","extension":"docx","order_by":1,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":4826059,"visible":true,"origin":"","legend":"","description":"","filename":"orf182manuscriptrevised3.docx","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/f573d0fcdb121c4333034a8c.docx"},{"id":95232762,"identity":"66ee906b-118c-48a6-824e-8092ca54c69a","added_by":"auto","created_at":"2025-11-05 16:40:38","extension":"tif","order_by":2,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":4138356,"visible":true,"origin":"","legend":"","description":"","filename":"fig1.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/a92f2cd502785c4e52571661.tif"},{"id":95232458,"identity":"2d29b404-3996-47db-bb24-ba413a6457d2","added_by":"auto","created_at":"2025-11-05 16:40:28","extension":"tif","order_by":3,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":1462648,"visible":true,"origin":"","legend":"","description":"","filename":"fig3.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/82774f51cc5fc02da4e6ca47.tif"},{"id":95232642,"identity":"e59bde19-8501-4afa-acef-f4173434a423","added_by":"auto","created_at":"2025-11-05 16:40:32","extension":"tif","order_by":4,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":1219444,"visible":true,"origin":"","legend":"","description":"","filename":"fig4repeat.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/d2b428dd4d8b4df0fde3e78b.tif"},{"id":95232649,"identity":"cc36970c-c121-455d-ae53-519f9b921ad3","added_by":"auto","created_at":"2025-11-05 16:40:33","extension":"tif","order_by":5,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":937460,"visible":true,"origin":"","legend":"","description":"","filename":"fig5.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/5a155810e54af5fdff962881.tif"},{"id":95232453,"identity":"aedff932-4d01-48b6-b64e-fb5613f1bbe0","added_by":"auto","created_at":"2025-11-05 16:40:27","extension":"tif","order_by":6,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":464020,"visible":true,"origin":"","legend":"","description":"","filename":"fig6.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/b3690fcb8339982abc2bbaec.tif"},{"id":95232647,"identity":"11ffa18b-b4e4-4e0d-bb31-f9732a5f210a","added_by":"auto","created_at":"2025-11-05 16:40:33","extension":"json","order_by":7,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":11370,"visible":true,"origin":"","legend":"","description":"","filename":"2ae97bd3540e4816b5de5eac187bc88c.json","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/8e39231910249b3bc3957758.json"},{"id":95232459,"identity":"ea1bf78b-f809-441a-8160-be44905834dc","added_by":"auto","created_at":"2025-11-05 16:40:28","extension":"tif","order_by":8,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":365460,"visible":true,"origin":"","legend":"","description":"","filename":"FigS1genomeassemble.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/088a900c556fe8a50346d855.tif"},{"id":95232450,"identity":"98afc65b-2efc-4f1c-a910-38a04bc76cc1","added_by":"auto","created_at":"2025-11-05 16:40:26","extension":"tif","order_by":9,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":559640,"visible":true,"origin":"","legend":"","description":"","filename":"FigS3Repeatcount.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/82caa36888691cff14a4ac2e.tif"},{"id":95312562,"identity":"5dbe01d0-2146-433f-aeef-7af59989f0d4","added_by":"auto","created_at":"2025-11-06 15:49:42","extension":"tif","order_by":10,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":3583308,"visible":true,"origin":"","legend":"","description":"","filename":"FigS6mauve.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/2a2c42f725d2e22e7e7cab97.tif"},{"id":95232500,"identity":"32745dd9-7f9f-42cc-8164-bf65d571682e","added_by":"auto","created_at":"2025-11-05 16:40:30","extension":"jpg","order_by":11,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":7739951,"visible":true,"origin":"","legend":"","description":"","filename":"FigS7orf182snp.jpg","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/1cb1af7ca92a80ad15807c7c.jpg"},{"id":95312822,"identity":"142df2c4-ba20-47ac-9b3a-dc83662680e9","added_by":"auto","created_at":"2025-11-06 15:50:23","extension":"xlsx","order_by":12,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":14061,"visible":true,"origin":"","legend":"","description":"","filename":"TableS1Alistofwildricespecieswhichcontainorf182inthisstudy.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/ea78005f1e99c42123484cc5.xlsx"},{"id":95232492,"identity":"657a2b2d-91cc-47c6-8f10-3032925adff3","added_by":"auto","created_at":"2025-11-05 16:40:29","extension":"xlsx","order_by":13,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":10667,"visible":true,"origin":"","legend":"","description":"","filename":"TableS2Theinformationoforf182andMSSmarkersusedinthisstudy.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/cf88a20b8209f2b8e8d8ca39.xlsx"},{"id":95232446,"identity":"e3935f76-b0c1-433b-b39f-28172331ddf2","added_by":"auto","created_at":"2025-11-05 16:40:26","extension":"xlsx","order_by":14,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":15249,"visible":true,"origin":"","legend":"","description":"","filename":"TableS3Thegenotypegenerateby18MSSmarker.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/45a9ee5539352e682d89057f.xlsx"},{"id":95232648,"identity":"7ae9320e-5c82-4177-ae15-561f6ff41da7","added_by":"auto","created_at":"2025-11-05 16:40:33","extension":"xlsx","order_by":15,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":10408,"visible":true,"origin":"","legend":"","description":"","filename":"TableS4Shortandlongreadsandtotalsize.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/057518e1ecb1713affbaee3f.xlsx"},{"id":95312399,"identity":"c8e98516-77a1-43ad-9f5c-fdb19a993a9c","added_by":"auto","created_at":"2025-11-06 15:49:10","extension":"xlsx","order_by":16,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":10393,"visible":true,"origin":"","legend":"","description":"","filename":"TableS5chimericgene.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/6acc8a8e6f43864167dd690b.xlsx"},{"id":95232670,"identity":"64cf99dd-8cb5-4acd-bc88-08d96a3e06af","added_by":"auto","created_at":"2025-11-05 16:40:34","extension":"csv","order_by":17,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":1598,"visible":true,"origin":"","legend":"","description":"","filename":"TableS6wildricemitochondrialgenomerepeat.csv","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/b15dc58a35709e005d6667cb.csv"},{"id":95232457,"identity":"d3e98d1a-f144-4576-b274-87f83cd76218","added_by":"auto","created_at":"2025-11-05 16:40:28","extension":"xlsx","order_by":18,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":11657,"visible":true,"origin":"","legend":"","description":"","filename":"TableS7syri.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/baacc858c48a5dae3936894b.xlsx"},{"id":95232445,"identity":"6c6f7c36-d78a-42e6-9546-fce37ca11844","added_by":"auto","created_at":"2025-11-05 16:40:25","extension":"tif","order_by":19,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":8998400,"visible":true,"origin":"","legend":"","description":"","filename":"figS2generepeat.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/ce8bdb6b73f8713f1477145d.tif"},{"id":95232676,"identity":"fcdb637b-d97b-4dd0-9d0f-816bc913dfe2","added_by":"auto","created_at":"2025-11-05 16:40:34","extension":"tif","order_by":20,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":986168,"visible":true,"origin":"","legend":"","description":"","filename":"figS4syrisummary.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/bf375a39ea00e196f0534f0c.tif"},{"id":95312894,"identity":"202a52ef-7153-4be4-80b4-9958196d391d","added_by":"auto","created_at":"2025-11-06 15:50:32","extension":"tif","order_by":21,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":273528,"visible":true,"origin":"","legend":"","description":"","filename":"figS5.SV.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/125da2d6caf39196c046b427.tif"},{"id":95232447,"identity":"99950e3e-46a4-4492-a31f-b48627df1970","added_by":"auto","created_at":"2025-11-05 16:40:26","extension":"xml","order_by":22,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":74675,"visible":true,"origin":"","legend":"","description":"","filename":"2ae97bd3540e4816b5de5eac187bc88c1enriched.xml","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/21e664fe1cf17b4d2f8fbe38.xml"},{"id":95232530,"identity":"54031e05-d7b1-4208-b4e8-2f81430df90e","added_by":"auto","created_at":"2025-11-05 16:40:30","extension":"tif","order_by":23,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":17057860,"visible":true,"origin":"","legend":"","description":"","filename":"Figure2mitogenomelargerepeat.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/1c0b7a6e752c6b079874a37c.tif"},{"id":95232580,"identity":"492e64e9-18e4-4748-8f9e-c418826aa74e","added_by":"auto","created_at":"2025-11-05 16:40:31","extension":"tif","order_by":24,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":4138356,"visible":true,"origin":"","legend":"","description":"","filename":"fig1.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/b448b803c4a5e32f2ba67518.tif"},{"id":95232576,"identity":"e5183ebc-3aa3-4bed-b428-cf54cf983484","added_by":"auto","created_at":"2025-11-05 16:40:31","extension":"tif","order_by":25,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":1462648,"visible":true,"origin":"","legend":"","description":"","filename":"fig3.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/1488a01fd9724cee68568827.tif"},{"id":95312166,"identity":"836239e1-39c3-44d3-8ec1-57f1591fe096","added_by":"auto","created_at":"2025-11-06 15:47:45","extension":"tif","order_by":26,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":1219444,"visible":true,"origin":"","legend":"","description":"","filename":"fig4repeat.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/126404a8e60b2a712255a9df.tif"},{"id":95232719,"identity":"faa2659e-b96c-47cb-a5f1-a1aa26c435d1","added_by":"auto","created_at":"2025-11-05 16:40:35","extension":"tif","order_by":27,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":937460,"visible":true,"origin":"","legend":"","description":"","filename":"fig5.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/12650529a21c14939fa615a7.tif"},{"id":95312722,"identity":"95022d51-b108-4492-8487-42e8f3b83ff0","added_by":"auto","created_at":"2025-11-06 15:50:08","extension":"tif","order_by":28,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":464020,"visible":true,"origin":"","legend":"","description":"","filename":"fig6.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/e3a26ac3d0cb5a6a8572a719.tif"},{"id":95232655,"identity":"3c8581ac-5471-4965-8704-fdc9b3ca7aa0","added_by":"auto","created_at":"2025-11-05 16:40:33","extension":"jpeg","order_by":29,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":815360,"visible":true,"origin":"","legend":"","description":"","filename":"floatimage1.jpeg","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/8b9c6e6379ac70013b855115.jpeg"},{"id":95232766,"identity":"a09b42ef-5fbc-4f9d-b314-1ab0d64f0b87","added_by":"auto","created_at":"2025-11-05 16:40:39","extension":"jpeg","order_by":30,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":1371398,"visible":true,"origin":"","legend":"","description":"","filename":"floatimage2.jpeg","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/73687847b71999b80f91bd9a.jpeg"},{"id":95232669,"identity":"f5f61a3e-0cd8-41c2-a6d4-dc83c4e153e5","added_by":"auto","created_at":"2025-11-05 16:40:34","extension":"jpeg","order_by":31,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":1462648,"visible":true,"origin":"","legend":"","description":"","filename":"fig3.jpeg","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/e26efce96fa0a16dfd26fbed.jpeg"},{"id":95232532,"identity":"37da72cf-14db-4c64-ad6f-928c416eb171","added_by":"auto","created_at":"2025-11-05 16:40:30","extension":"jpeg","order_by":32,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":324622,"visible":true,"origin":"","legend":"","description":"","filename":"floatimage4.jpeg","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/035bbc07c2d4f7bfbbd196ff.jpeg"},{"id":95232654,"identity":"4ae94893-3bc5-4a52-9659-16b1b9726f9d","added_by":"auto","created_at":"2025-11-05 16:40:33","extension":"jpeg","order_by":33,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":301208,"visible":true,"origin":"","legend":"","description":"","filename":"floatimage5.jpeg","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/585ed041c5f199636ea3f9f0.jpeg"},{"id":95232495,"identity":"68e46c4c-a8dd-4c7b-9214-bf98e350614b","added_by":"auto","created_at":"2025-11-05 16:40:29","extension":"jpeg","order_by":34,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":464020,"visible":true,"origin":"","legend":"","description":"","filename":"fig6.jpeg","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/716752b9af3cdef719319bf0.jpeg"},{"id":95232645,"identity":"5c880c99-10c9-44fb-b62a-c1dd28abb7ef","added_by":"auto","created_at":"2025-11-05 16:40:32","extension":"png","order_by":35,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":459331,"visible":true,"origin":"","legend":"","description":"","filename":"OnlineFigure2mitogenomelargerepeat.png","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/421ad13878de82532fd8aad0.png"},{"id":95312608,"identity":"4d95fd16-1c68-4696-95d6-a7ef35510072","added_by":"auto","created_at":"2025-11-06 15:49:48","extension":"png","order_by":36,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":202671,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefig1.png","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/deebed8b2efb95b9d5efeba3.png"},{"id":95232644,"identity":"27321869-c0c2-42d1-977d-4a89e8fd2f62","added_by":"auto","created_at":"2025-11-05 16:40:32","extension":"png","order_by":37,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":28218,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefig3.png","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/34e04840165a88f15f74af8f.png"},{"id":95232743,"identity":"be1f7c8f-35cc-4928-a2a4-501a17844222","added_by":"auto","created_at":"2025-11-05 16:40:38","extension":"png","order_by":38,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":62023,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefig4repeat.png","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/833983ce3396535914ff93e9.png"},{"id":95232765,"identity":"511119e5-7c1f-46bd-8423-063cb5274917","added_by":"auto","created_at":"2025-11-05 16:40:39","extension":"png","order_by":39,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":73666,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefig5.png","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/46ba4754b865e5b138f58c3d.png"},{"id":95232451,"identity":"ff1a77f1-dd66-471a-be9a-2f1783aa2765","added_by":"auto","created_at":"2025-11-05 16:40:27","extension":"png","order_by":40,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":33622,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefig6.png","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/b488028538ca24a7e1932518.png"},{"id":95232653,"identity":"96276d39-05ef-4e2f-879d-2939ae9e9ec5","added_by":"auto","created_at":"2025-11-05 16:40:33","extension":"png","order_by":41,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":103777,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage1.png","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/02bf7153032101b2dc5f5df2.png"},{"id":95232579,"identity":"584a1da7-4211-457e-a3dc-51499627e05f","added_by":"auto","created_at":"2025-11-05 16:40:31","extension":"png","order_by":42,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":279621,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage2.png","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/0c6ccf603e3a5d6b7a18ff10.png"},{"id":95232744,"identity":"0b2f0fef-de3f-4506-8a28-795ffe306539","added_by":"auto","created_at":"2025-11-05 16:40:38","extension":"png","order_by":43,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":28218,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage3.png","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/46433062161c43c16249d71f.png"},{"id":95232672,"identity":"ac686d97-567e-4eae-8424-36bb24fe7b4f","added_by":"auto","created_at":"2025-11-05 16:40:34","extension":"png","order_by":44,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":39040,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage4.png","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/c5477323f62e650e0f324605.png"},{"id":95232623,"identity":"0bef1be7-a32a-48f1-8b22-28c4be4af368","added_by":"auto","created_at":"2025-11-05 16:40:31","extension":"png","order_by":45,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":45731,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage5.png","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/e8f21e6addbb685d06f6ab5e.png"},{"id":95232742,"identity":"21d49f89-8674-4152-9324-28d815b4a651","added_by":"auto","created_at":"2025-11-05 16:40:37","extension":"png","order_by":46,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":33622,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage6.png","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/05d3f834dc3bfabfee981881.png"},{"id":95232635,"identity":"08d387e2-92a2-442a-a117-6a88cbda2c4f","added_by":"auto","created_at":"2025-11-05 16:40:31","extension":"xml","order_by":47,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":71736,"visible":true,"origin":"","legend":"","description":"","filename":"2ae97bd3540e4816b5de5eac187bc88c1structuring.xml","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/0608f0cbf68024796666ae11.xml"},{"id":95232740,"identity":"7750c96f-05f3-436b-8c4e-0a0d4ab41b3f","added_by":"auto","created_at":"2025-11-05 16:40:36","extension":"html","order_by":48,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":91290,"visible":true,"origin":"","legend":"","description":"","filename":"earlyproof.html","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/b6db52eebb89f9f882b0750b.html"},{"id":95232764,"identity":"883c8140-1507-4979-8eae-68c62ae57295","added_by":"auto","created_at":"2025-11-05 16:40:39","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":3222155,"visible":true,"origin":"","legend":"\u003cp\u003ePhylogenetic and habitat-based applications of \u003cem\u003eOryza rufipogon\u003c/em\u003e (containing \u003cem\u003eorf182\u003c/em\u003e). A. the phylogenetic tree constructed by 20 MSTs, the sample in different clade\u003c/p\u003e\n\u003cp\u003emasked by colors were selected for sequencing. B. Three wild rice (\u003cem\u003eO. rufipogon\u003c/em\u003e) accessions collected from separate provinces in China representing different D1 type variations, served as the cytoplasm sources for breeding male-sterile lines. MXS: Mingxiangsi (WA type restorer line).\u003c/p\u003e","description":"","filename":"fig1.png","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/03400fac52716593673c6ebf.png"},{"id":95232452,"identity":"e0628220-b92f-443d-a87c-dc6fec7905c0","added_by":"auto","created_at":"2025-11-05 16:40:27","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":4043999,"visible":true,"origin":"","legend":"\u003cp\u003eThe features of the wild rice mitochondrial genome. A-K. GD4, GD5, GD7, GD9, GX4, GX34, GX44, GX46, GX51, JX2 and JX5, the outward-facing text on the out circular represents forward-oriented genes, while the inward-facing text denotes reverse-oriented genes. labels in red indicate the chimeric gene \u003cem\u003eorf182\u003c/em\u003e/\u003cem\u003enad6\u003c/em\u003e. The inner circular illustrates segments derived from full length mitochondrial genome duplication.\u003c/p\u003e","description":"","filename":"Figure2mitogenomelargerepeat.png","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/48c5a81b1ad007fcae8cf09f.png"},{"id":95232485,"identity":"4ec7739f-5c4c-4502-8a5b-834b2000b458","added_by":"auto","created_at":"2025-11-05 16:40:29","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":311924,"visible":true,"origin":"","legend":"\u003cp\u003eStructural variation between D1-type wild rice and Nipponbare. The red vertical tags on the mitochondrial genome represent \u003cem\u003eorf182\u003c/em\u003e. Gray lines between two genomes indicate collinearity, while yellow denotes inversions, green represents translocations, and blue corresponds to duplications.\u003c/p\u003e","description":"","filename":"fig3.png","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/fa7fbb5b8bba220239103854.png"},{"id":95232678,"identity":"80534361-b178-4f89-a803-2918301aef11","added_by":"auto","created_at":"2025-11-05 16:40:35","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":673563,"visible":true,"origin":"","legend":"\u003cp\u003eRepeat content in D1-type wild rice mitochondrial genome. A-B.Total number and length of different repeat types in 11 wild rice accessions mitochondrial genomes. class I: \u0026lt;100bp, class II:(100bp, 500bp), class III: (500bp, 2000bp), class IV: \u0026gt;2000bp. C-F: The correlation between genome size and different repeat type number (left), total repeat length. R indicates the correlation coefficient, and the p-value was determined by a two-tailed Student’s t-test.\u003c/p\u003e","description":"","filename":"fig4repeat.png","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/8d885d73afee44dc596e57ac.png"},{"id":95232573,"identity":"6cec626c-fc94-4f27-a78d-7aa29f0c1d00","added_by":"auto","created_at":"2025-11-05 16:40:31","extension":"png","order_by":5,"title":"Figure 5","display":"","copyAsset":false,"role":"figure","size":551774,"visible":true,"origin":"","legend":"\u003cp\u003ePan-mitochondrial genome of the 11 D1-type wild rice and Nipponbare. A.\u003c/p\u003e\n\u003cp\u003eThe annotated protein-coding genes, and composition of core (red), dispensable (green), and private (blue) genes in each individual accession. B. Core gene cluster and pan-mitochondrial genome of D1-type wild rice. C. Modeling of pangenome and core-genome sizes when incorporating additional genomes into clustering and composition of the rice pan mitochondrial genome (pie chart). D. The orthogroup gene count per sample. E. the number and proportion of core, dispensable and private gene families in the pan mitochondrial genome.\u003c/p\u003e","description":"","filename":"fig5.png","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/74e28e7c14605e2115b1c47e.png"},{"id":95232667,"identity":"180b63b1-77a5-4a49-adb3-c3373155b9c1","added_by":"auto","created_at":"2025-11-05 16:40:34","extension":"png","order_by":6,"title":"Figure 6","display":"","copyAsset":false,"role":"figure","size":299706,"visible":true,"origin":"","legend":"\u003cp\u003eThe evolutionary trajectory of \u003cem\u003eorf182\u003c/em\u003e in rice. Green indicates 71 bp repeat sequence shared by mitochondrial and nuclear genome. empty rectangular area means unknown region, blue rectangular shows the sequence derived from \u003cem\u003eSorghum biocolor\u003c/em\u003e mitochondrial genome. blue rectangular with black vertical line (SNP) and white region (InDel) denotes sequences variation compared to \u003cem\u003eSorghum biocolor \u003c/em\u003esequence, yellow rectangular stands for \u003cem\u003enad6\u003c/em\u003e gene, the hairpin arrow indicates a large inverted repeat region.\u003c/p\u003e","description":"","filename":"fig6.png","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/eb00620f2fe4b82ba3ce545d.png"},{"id":100370345,"identity":"de7a09bb-bb36-4f17-8651-5b5028ba9fb9","added_by":"auto","created_at":"2026-01-16 08:05:29","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":11380130,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/5b747bf4-8767-4cca-9285-f9ca868636ee.pdf"},{"id":95232493,"identity":"6f7b35be-0a9d-49d3-bdc1-d767ef964145","added_by":"auto","created_at":"2025-11-05 16:40:29","extension":"xlsx","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":14061,"visible":true,"origin":"","legend":"","description":"","filename":"TableS1Alistofwildricespecieswhichcontainorf182inthisstudy.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/7a73ff41c8254f53d9235b7b.xlsx"},{"id":95232483,"identity":"a995c78b-ee38-4158-8cdd-a56471046a4a","added_by":"auto","created_at":"2025-11-05 16:40:29","extension":"xlsx","order_by":2,"title":"","display":"","copyAsset":false,"role":"supplement","size":10667,"visible":true,"origin":"","legend":"","description":"","filename":"TableS2Theinformationoforf182andMSSmarkersusedinthisstudy.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/b592e0e9c1d3117cc5c62390.xlsx"},{"id":95232677,"identity":"1551d86e-0985-455b-addf-7aa904e51734","added_by":"auto","created_at":"2025-11-05 16:40:35","extension":"xlsx","order_by":3,"title":"","display":"","copyAsset":false,"role":"supplement","size":15249,"visible":true,"origin":"","legend":"","description":"","filename":"TableS3Thegenotypegenerateby18MSSmarker.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/f314116cfcd2605255cc9a28.xlsx"},{"id":95232646,"identity":"2cafd417-0348-42f2-9171-b4e61c0c5c50","added_by":"auto","created_at":"2025-11-05 16:40:33","extension":"xlsx","order_by":4,"title":"","display":"","copyAsset":false,"role":"supplement","size":10408,"visible":true,"origin":"","legend":"","description":"","filename":"TableS4Shortandlongreadsandtotalsize.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/feb16df5d625b9e6dfd08f0d.xlsx"},{"id":95232481,"identity":"bc0672dc-9307-416e-a594-e40229d45ac5","added_by":"auto","created_at":"2025-11-05 16:40:29","extension":"xlsx","order_by":5,"title":"","display":"","copyAsset":false,"role":"supplement","size":10393,"visible":true,"origin":"","legend":"","description":"","filename":"TableS5chimericgene.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/95e10b5ecd18221acf7b1a54.xlsx"},{"id":95232577,"identity":"517fa1b3-7d1d-413b-bcbf-dcf3d7df2150","added_by":"auto","created_at":"2025-11-05 16:40:31","extension":"csv","order_by":6,"title":"","display":"","copyAsset":false,"role":"supplement","size":1598,"visible":true,"origin":"","legend":"","description":"","filename":"TableS6wildricemitochondrialgenomerepeat.csv","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/e3bdac73161251b29bb75264.csv"},{"id":95232497,"identity":"e63706b2-2a63-4c76-a59a-afbe20cd3dcf","added_by":"auto","created_at":"2025-11-05 16:40:29","extension":"xlsx","order_by":7,"title":"","display":"","copyAsset":false,"role":"supplement","size":11657,"visible":true,"origin":"","legend":"","description":"","filename":"TableS7syri.xlsx","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/3307dc5528b2ee6613e3276b.xlsx"},{"id":95232746,"identity":"2998e297-fdbe-48b7-8cb4-c122a0624977","added_by":"auto","created_at":"2025-11-05 16:40:38","extension":"tif","order_by":8,"title":"","display":"","copyAsset":false,"role":"supplement","size":365460,"visible":true,"origin":"","legend":"","description":"","filename":"FigS1genomeassemble.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/f3a8e7371889a07cfcb3c1d3.tif"},{"id":95232533,"identity":"6813b9ee-3057-451c-889f-0f367fc4bed5","added_by":"auto","created_at":"2025-11-05 16:40:30","extension":"tif","order_by":9,"title":"","display":"","copyAsset":false,"role":"supplement","size":8998400,"visible":true,"origin":"","legend":"","description":"","filename":"figS2generepeat.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/4825504feedb18e6a25d6ed2.tif"},{"id":95232675,"identity":"e06b1c62-da2f-4f74-9a3b-b504445c9cac","added_by":"auto","created_at":"2025-11-05 16:40:34","extension":"tif","order_by":10,"title":"","display":"","copyAsset":false,"role":"supplement","size":559640,"visible":true,"origin":"","legend":"","description":"","filename":"FigS3Repeatcount.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/690b7a5f032a12778b2d3d8c.tif"},{"id":95232571,"identity":"16c9983f-34e9-4d73-b7a4-e44c96a7eddb","added_by":"auto","created_at":"2025-11-05 16:40:31","extension":"tif","order_by":11,"title":"","display":"","copyAsset":false,"role":"supplement","size":986168,"visible":true,"origin":"","legend":"","description":"","filename":"figS4syrisummary.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/4fba756e2487535292620ee6.tif"},{"id":95232572,"identity":"03a5dce8-71b5-44a5-ac58-0f9b34e337d8","added_by":"auto","created_at":"2025-11-05 16:40:31","extension":"tif","order_by":12,"title":"","display":"","copyAsset":false,"role":"supplement","size":273528,"visible":true,"origin":"","legend":"","description":"","filename":"figS5.SV.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/38d9faf321a680a12a5c49c6.tif"},{"id":95232747,"identity":"41d871c1-8eda-4b79-aef4-713aad1c7911","added_by":"auto","created_at":"2025-11-05 16:40:38","extension":"tif","order_by":13,"title":"","display":"","copyAsset":false,"role":"supplement","size":3583308,"visible":true,"origin":"","legend":"","description":"","filename":"FigS6mauve.tif","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/8596519d9d2d8dcdb653aec5.tif"},{"id":95232651,"identity":"827e2aa6-a4f5-4784-bb24-fc103313c8fb","added_by":"auto","created_at":"2025-11-05 16:40:33","extension":"jpg","order_by":14,"title":"","display":"","copyAsset":false,"role":"supplement","size":7739951,"visible":true,"origin":"","legend":"","description":"","filename":"FigS7orf182snp.jpg","url":"https://assets-eu.researchsquare.com/files/rs-7649942/v1/755035957aa0c5280380257f.jpg"}],"financialInterests":"No competing interests reported.","formattedTitle":"Comparative Analysis of orf182 Widespread in Chinese Wild Rice Species for Hybrid Rice Breeding","fulltext":[{"header":"Introduction","content":"\u003cp\u003eMitochondria, often termed the powerhouses of eukaryotic cells, are semi-autonomous organelles believed to have originated from prokaryotic ancestors via endosymbiosis. They harbor their own genetic material, which is indispensable for mitochondrial and cellular functions [1]. The mitochondrial genome (mitogenome or mtDNA) is typically maternally inherited and follows evolutionary trajectories distinct from those of nuclear genomes. Outbreeding-mediated reshuffling of nuclear and cytoplasmic genomes occurs widely and frequently, potentially reconfiguring the functional and structural interplay between these genetic systems. Rearrangements, mutations, and inheritance patterns in mitochondrial genomes can provoke evolutionary conflicts that significantly influence plant diversification, domestication, and hybridization [2, 3]. Nevertheless, the evolutionary mechanisms shaping cytoplasmic genomes\u0026mdash;especially among wild relatives of crops\u0026mdash;remain inadequately explored.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eStructural variation (SV) is considerably more prevalent in mitochondrial genomes than in chloroplast genomes, where SVs are rare and occur at low frequency [4], whereas mitochondrial genomes exhibit extensive SVs driven by frequent repeat-mediated recombination[5]. In contrast, plant mitochondrial genomes undergo extensive recombination mediated by repetitive sequences, leading to pronounced structural diversity [6] and can disrupt gene expression through the reshuffling of regulatory elements or the creation of chimeric open reading frames (ORFs). Many such chimeric ORFs have been associated with cytoplasmic male sterility (CMS), a phenotype characterized by aberrant floral development and abortion of male gametophytes [7, 8].\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eRecombination events are often facilitated by short repeats (SRs; 50\u0026ndash;500 bp) and micro-repeats (MRs; \u0026lt;50 bp), which promote asymmetric and largely irreversible recombination, yielding recombinant molecules present at low (sub-stoichiometric) levels [9, 10]. These molecules, termed \u003cem\u003esublimons\u003c/em\u003e, can rapidly amplify under certain conditions through a mechanism known as sub-stoichiometric shifting (SSS), thereby altering the mitochondrial genotype (mitotype) and driving mtDNA evolution [11].\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eCollectively, recombination processes mediated by SRs, MRs, intracellular gene transfer (IGT), and horizontal gene transfer (HGT) generate diverse SVs\u0026mdash;including rearrangements, duplications, insertions, and deletions [5, 10, 12]. Although SVs within gene regions are generally deleterious and selected against, those in intergenic regions may persist, contributing to the high sequence divergence observed in mitochondrial intergenic spacers [13-15].\u003c/p\u003e\n\u003cp\u003eWild rice (\u003cem\u003eOryza rufipogon\u003c/em\u003e), the direct progenitor of cultivated rice (\u003cem\u003eOryza sativa\u003c/em\u003e), was domesticated approximately 8,000\u0026ndash;10,000 years ago. It represents a critical genetic resource for improving stress tolerance, yield, and grain quality in modern rice breeding [16]. While the nuclear genome of \u003cem\u003eO. rufipogon\u003c/em\u003e has been extensively studied, providing key insights into rice evolution and domestication [17, 18], \u0026nbsp;research on its mitochondrial genome remains limited. Early studies using restriction fragment length polymorphism (RFLP) analysis revealed mtDNA variation and distinct mitotypes among wild and cultivated rice populations [19]. Furthermore, several CMS types\u0026mdash;including the widely used WA-, BT-, HL-, and D1-CMS systems\u0026mdash;have been identified in cultivated rice, many tracing back to \u003cem\u003eO. rufipogon\u003c/em\u003e or closely related accessions [7, 20-22]. However, only a few complete mitogenomes of \u003cem\u003eO. rufipogon\u003c/em\u003e have been reported to date, and a comprehensive comparative analysis is still lacking. As a result, the extent of mitochondrial genomic diversity and its evolutionary relationship with cultivated rice remain poorly understood.\u003c/p\u003e\n\u003cp\u003eIn this study, we employed D1-CMS-type wild rice as a model to investigate cytoplasmic genome evolution. We constructed a pan-mitogenome based on high-quality long-read assemblies from 11 representative accessions of common wild rice. Our comprehensive analyses revealed extensive mitochondrial rearrangements and the presence of chimeric ORFs. We generated genome-wide mitochondrial variation maps from diverse \u003cem\u003eOryza\u003c/em\u003e accessions for population-level genomic studies. The potential role of chimeric ORFs\u0026mdash;classified into distinct phylogenetic clades based on mitotype-specific markers (MSS)\u0026mdash;in CMS was evaluated using cytoplasmic substitution lines. Furthermore, we examined recombination dynamics around the mitochondrial gene \u003cem\u003eorf182\u003c/em\u003e to elucidate its structural complexity and explore the mechanisms underlying rice mtDNA evolution. Finally, we characterized structural variations across the mitochondrial genome to uncover general architectural features and evolutionary constraints.\u0026nbsp;\u003c/p\u003e"},{"header":"Results","content":"\u003cp\u003e\u003cstrong\u003ePhylogenetic tree constructed by Mitotype-specific sequence (MSS)\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003ePrevious studies identified the \u003cem\u003eorf182\u003c/em\u003e gene exclusively in 18 \u003cem\u003eOryza rufipogon\u003c/em\u003e accessions from China, distributed among 17 species across nine genomic types, based on a screen of 446 wild rice samples [22]. To further elucidate the evolutionary origin and distribution of this gene, we collected an additional 320 common wild rice accessions from three provinces in China: Guangdong, Jiangxi, and Guangxi. Among these, 76 accessions (23.75%) exhibiting the D1-type cytoplasmic male sterility (CMS) phenotype were selected for subsequent genetic and genomic analyses (Table S1).\u003c/p\u003e\n\u003cp\u003eFrom a total of 57 mitochondrial structural sequence (MSS) profiles obtained from these accessions [23], 18 representative samples showing pronounced polymorphisms were used to reconstruct a phylogenetic tree, aiming to clarify the evolutionary relationships within the D1-type wild rice group (Fig. 1, Tables S2, S3). Furthermore, 11 representative wild rice accessions spanning nine distinct D1-type phylogenetic clades were subjected to high-quality mitochondrial genome assembly using a hybrid sequencing strategy combining long-read and short-read technologies. These high-resolution assemblies facilitated detailed comparative and evolutionary analyses of the mitochondrial genome, with particular emphasis on structural variation, recombination events, and the potential functional implications of CMS-associated open reading frames such as \u003cem\u003eorf182\u003c/em\u003e.\u003c/p\u003e\n\u003cp\u003eBased on three \u003cem\u003eOryza rufipogon\u003c/em\u003e accessions carrying \u003cem\u003eorf182\u0026nbsp;\u003c/em\u003efromdifferent geographical origins and phylogenetic tree branch, we successfully converted Mingxiangsi into male sterile lines after only five backcross generations(Fig. 1B). These results demonstrates that the D1-type CMS system can be readily introduced into diverse genetic backgrounds, highlighting its efficiency and broad applicability in hybrid breeding programs.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eGeneral features of the \u003cem\u003eOryza rufipogon\u003c/em\u003e mitochondrial genome\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eA total of 69 Gb of raw Illumina reads (150 bp in length) from 11 samples, along with 506 Mb of Nanopore PromethION data from 9 wild rice varieties and 572 Mb of PacBio long reads (average length ~10 kb), were generated for mitochondrial genome assembly (Table S4). The assembled mitogenomes of the 11 wild rice accessions ranged in size from 438,632 bp to 718,357 bp. Functional annotations and physical locations of genes across these mitochondrial genomes are summarized in Fig. 2 and Fig. S1.\u003c/p\u003e\n\u003cp\u003eIn the JX2 mitogenome, 77 genes were annotated, including 46 protein-coding genes, 28 tRNAs, and 3 rRNAs. These encode 37 distinct proteins, with \u003cem\u003eatp4\u003c/em\u003e, \u003cem\u003eatp6\u003c/em\u003e, \u003cem\u003eatp8\u003c/em\u003e, \u003cem\u003eatp9\u003c/em\u003e, \u003cem\u003ecob-like\u003c/em\u003e, \u003cem\u003eorf182\u003c/em\u003e, and \u003cem\u003eorfX\u003c/em\u003e present in two copies each, and \u003cem\u003enad6\u003c/em\u003e in three copies. The protein-coding genes were classified into 11 functional categories: ATP synthases (5 genes), cytochrome C biogenesis (4 genes), ubiquinol cytochrome c reductase (1 gene), cytochrome c oxidase (3 genes), maturases (1 gene), NADH dehydrogenase (9 genes), cytoplasmic male sterility-related (1 gene), open reading frames (1 gene), large ribosomal subunit proteins (3 genes), and small ribosomal subunit proteins (8 genes).\u003c/p\u003e\n\u003cp\u003eOn average, seven chimeric ORFs derived from fragments of known and conserved protein-coding genes (excluding tRNAs and rRNAs) were identified in each mitogenome. Notably, \u003cem\u003enad6\u003c/em\u003e (from the \u003cem\u003enad\u003c/em\u003e family) and \u003cem\u003erpl2\u003c/em\u003e and \u003cem\u003erpl16\u003c/em\u003e (from the \u003cem\u003erpl\u003c/em\u003e family) frequently participated in the formation of prominent chimeric ORFs across all wild rice accessions examined (Table S5).\u003c/p\u003e\n\u003cp\u003eTo assess structural variation (SV) and its functional implications, we performed gene copy number analysis. A core set of 37 protein-coding genes was conserved across all 11 mitochondrial genomes. Among these, \u003cem\u003enad6\u003c/em\u003e exhibited the highest copy number. The accession GX34 showed the most extensive gene loss, affecting \u003cem\u003eatp4\u003c/em\u003e, \u003cem\u003eatp8\u003c/em\u003e, \u003cem\u003erps7\u003c/em\u003e, \u003cem\u003ecox3\u003c/em\u003e, \u003cem\u003enad5\u003c/em\u003e, \u003cem\u003enad7\u003c/em\u003e, \u003cem\u003erps13\u003c/em\u003e, \u003cem\u003eccmFc\u003c/em\u003e, \u003cem\u003erpl5\u003c/em\u003e, and \u003cem\u003ecox1\u003c/em\u003e. Additionally, genes such as \u003cem\u003ecob-like\u003c/em\u003e and \u003cem\u003eorfX\u003c/em\u003e were lost in two other accessions. Gene duplication was widespread: \u003cem\u003ecox1\u003c/em\u003e was replicated in nearly all wild rice mitogenomes, and \u003cem\u003enad6\u003c/em\u003e duplication occurred in all 11 accessions (Fig. S2).\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eRepeat sequence variation and\u0026nbsp;correlations between\u0026nbsp;genome size and\u0026nbsp;repeat sequences\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eMitogenome repeats were classified into class I (repeats ≤ 100 bp), class II (100–500 bp), class III (500–2000 bp), and IV (\u0026gt; 2000 bp) based on repeat length. The number of class I repeat sequences ranged from 239 (GX44 and GX51) to 724 (GX4), account for 1.53-1.68% of mitochondrial genome size (Fig. 3A; Table S6). Among the samples, the number of class I repeats in GX4 samples was significantly higher than samples from wild rice in same province and other wild rice, while the number of repeats longer than 100 bp was not significantly increased (t-test, P-value = 0.11) (Table S6). Meanwhile the total repeat number (864) of GX4 was higher than that of other sample (total repeat number: 316–490) (Fig. 3), and the total repeat length show the similar pattern, while the mitochondrial size of GX4 greater than other contributed by the class IV type repeat. These were followed by repeats in class II, which accounted for approximately 15% of the total repeats, with 56–90 in number. There were most least repeats length located in class III and only accounted for 0.92%. In contrast, a total of 12–38 repeats in class IV were identified, with lengths ranging from 114.38 to 574.38kb, contributing 26.04–79.96% of the mitogenome size. Despite the relatively low number of repeats in class IV, they were the major contributors to the total mitogenome size (Fig. 3B).\u003c/p\u003e\n\u003cp\u003eFor all of the 11 wild rice samples, mitogenomes size show significantly high correlations (P-value \u0026lt; 0.001) with total repeat number and length (Fig. S3, Fig. 3c, d). All the count of the four repeat categories showed significant correlation with variations in genome size (R2 adj ranged from 0.63 to 0.99, P-value \u0026lt; 0.003) (Fig. S3). While the Class II repeat size show low correlations with the mitochondrial genome size (Fig. 3d). In addition, several mitogenomes may have an overrepresentation of repeat content (Table S6). For example, GX46 (total repeat length: 466 kb) had a similar genome size to GX51 and GX44 (438 kb), but the total repeat length was about 1.46-fold of GX51 (131 kb).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eIdentified the structural variation in D1-wild rice mitogenomes\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003ePrevious studies have indicated that repeat sequences can mediate rearrangements of varying extent in the mitochondrial genome. By aligning the 11 assembled mitogenomes to the Nipponbare reference genome, we identified syntenic regions accounting for 12% to 30% of each genome (Table S7). Structural variations across assemblies included one to three inversions, two to six translocations, 27 to 68 insertions, and 32 to 61 deletions (Fig. S4). Wild rice accessions with D1-CMS exhibited the highest abundance of SVs (Fig. S5). Notably, although GX34 ranked fourth lowest in terms of SV quantity, it displayed the greatest total SV length. This apparent discrepancy is largely attributable to the absence of large-scale translocations in GX34, resulting in a notably low minimal SV length.\u003c/p\u003e\n\u003cp\u003eMitogenomes of wild rice from Guangdong province appeared relatively conserved, a pattern that may reflect either geographic sampling concentration or limited underlying genetic differentiation. In contrast, the two Jiangxi accessions exhibited markedly distinct profiles in both the number and size of SVs (Fig. S5).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eWhole-genome alignment using Mauve further characterized large-scale structural rearrangements, revealing 17 locally collinear blocks (LCBs) with variable ordering across accessions (Fig. S6). Three major rearrangement events were detected in Jiangxi Dongxiang wild rice, whereas only five LCBs were identified in GX4 and GX34. These results suggest that the mitochondrial genomes of Guangxi D1-CMS wild rice have lost numerous conserved genomic modules during evolution, consistent with previous reports of extensive structural variation in accessions from this region.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003ePan-mitochondrial genome of 11 selected D1 CMS wild rice and Nipponbare\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eTo further investigate mitochondrial genomic diversity across different origins, we selected 11 D1-CMS wild rice accessions representing distinct groups, along with the previously published Nipponbare mitochondrial genome, for pan-genome construction and comparative analysis. The gene accumulation curve (Fig. 5C) indicated an open pan-genome, suggesting that new genes are likely to be identified as more mitochondrial genomes from rice are sequenced.\u003c/p\u003e\n\u003cp\u003eComparative analysis of the eleven D1-CMS wild rice mitogenomes revealed no private genes among these accessions. Mitochondrial genomes from Guangdong accessions were generally the most conserved. However, one accession, GD4, exhibited a reduced number of core genes and a higher proportion of variable genes within the pan-genome.\u003c/p\u003e\n\u003cp\u003eIn contrast, accessions from Guangxi showed greater variability in pan-genome composition. Among these, GX34 contained fewer variable genes, implying a more stable evolutionary history and possible adaptation to a narrower ecological niche. Further comparison between Jiangxi accessions indicated that JX5 possessed more variable genes than JX2. Notably, private genes were exclusively identified in the Nipponbare reference genome, highlighting substantial divergence in mitochondrial gene content between the reference and the D1-CMS wild rice lineages examined in this study (Fig. 5A).\u003c/p\u003e\n\u003cp\u003eWe also quantified the proportional distribution of different genomic components. Core gene families accounted for the largest share (44.23%), followed by private (30.77%) and variable (25%) gene families. In terms of copy number, core genes constituted 66.55% of all genes, while private genes represented only 2.89%.\u003c/p\u003e"},{"header":"Discussion","content":"\u003cp\u003eIn this study, we identified the chimeric mitochondrial gene \u003cem\u003eorf182\u003c/em\u003e as a unique genetic element present exclusively in a subset of Chinese wild rice accessions. This finding offers significant insights into the evolutionary dynamics of plant mitochondrial genomes, the origin of cytoplasmic male sterility (CMS)-associated genes, and the potential utility of \u003cem\u003eorf182\u003c/em\u003e in hybrid rice breeding. The restricted distribution of \u003cem\u003eorf182\u003c/em\u003e suggests that it likely originated from a localized mitochondrial recombination event. This observation supports the hypothesis that CMS-related genes often emerge independently in different populations through rapid and extensive mitochondrial restructuring. Similar to \u003cem\u003eorf79\u003c/em\u003e and \u003cem\u003eorfH79\u003c/em\u003e, \u003cem\u003eorf182\u003c/em\u003e exemplifies the high structural plasticity of plant mitochondrial genomes, wherein recurrent gene fusion events give rise to chimeric open reading frames. The occurrence of \u003cem\u003eorf182\u003c/em\u003e solely in Chinese wild rice further indicates that CMS genes are not universally conserved but rather represent region-specific evolutionary innovations. Its discovery highlights the distinctive genetic value of Chinese wild rice germplasm within the global rice gene pool. As a naturally occurring CMS-inducing gene, \u003cem\u003eorf182\u003c/em\u003e provides direct evidence of functional diversity preserved in wild populations. Among the collected Chinese \u003cem\u003eOryza rufipogon\u003c/em\u003e accessions, 23.75% were found to carry \u003cem\u003eorf182\u003c/em\u003e. Based on these findings, we propose that the mitochondrial genome harboring \u003cem\u003eorf182\u003c/em\u003e has not evolved through conservative mechanisms but rather via dynamic structural rearrangements.\u003c/p\u003e\n\u003ch3\u003eMitochondrial genome structural plasticity and chimeric gene formation\u003c/h3\u003e\n\u003cp\u003eOur results demonstrate that large-scale structural variations (LSVs) are prevalent in the mitochondrial genomes of wild rice, including among accessions from the same geographic region. Accessions such as GD4 and GX4 exhibit pronounced structural divergence compared to others. Notably, the physical scale of SVs did not correlate with their quantity; for instance, GX34 displayed the largest SV size but ranked among the lowest in terms of SV number. Substantial variation in mitochondrial genome size was observed across the wild rice panel (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003e). The assembly and annotation of 11 mitochondrial genomes provide new insights into this variability. None of the wild rice accessions shared a common presence\u0026ndash;absence variation (PAV) region associated with \u003cem\u003eorf182\u003c/em\u003e, indicating that this gene did not originate from a recent insertion event. Instead, analysis of repetitive sequences revealed that some \u003cem\u003eorf182\u003c/em\u003e chimeric genes likely arose from duplication events, while others were generated through recombination mediated by a 71-bp repetitive element, which appears to have undergone multiple recombination cycles (Fig.\u0026nbsp;\u003cspan refid=\"Fig6\" class=\"InternalRef\"\u003e6\u003c/span\u003e).\u003c/p\u003e\u003cp\u003e\u003c/p\u003e\u003cdiv id=\"Sec8\" class=\"Section2\"\u003e\u003ch2\u003eCMS gene diversity and implications for domestication and hybrid breeding\u003c/h2\u003e\u003cp\u003eDynamic structural changes, including gene duplications, have facilitated the emergence of new gene copies in the mitochondrial genome. Nucleotide alignment of \u003cem\u003eorf182\u003c/em\u003e sequences identified several SNP sites, which were categorized into three haplotypes (Fig. \u003cspan refid=\"MOESM7\" class=\"InternalRef\"\u003eS7\u003c/span\u003e). All detected SNPs were synonymous, suggesting neutral evolution at the sequence level. CMS genes such as \u003cem\u003eorf182\u003c/em\u003e may have historically functioned as genetic barriers that contributed to reproductive isolation, thereby influencing domestication pathways and exhibiting geographic specificity. The D1-CMS system, characterized by \u003cem\u003eorf182\u003c/em\u003e, represents a promising resource for the development of novel hybrid rice breeding systems.\u003c/p\u003e\u003c/div\u003e"},{"header":"Material and methods","content":"\u003cp\u003e\u003cstrong\u003ePlant materials\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003e\u003cem\u003eOryza rufipogon\u003c/em\u003e collected from three province, Guangdong(GD), Guangxi(GX) and Jiangxi (JX), were used in the present research. The Guangdong common wild rice materials were generously contributed and identified by professor Yonggen Lu (South China Agricultural University), Guangxi wild rice were obtained from Researcher Yuntao Liang (Guangxi Academy of Agricultural Sciences), while Jiangxi wild rice were kindly provided by the Dongxiang Wild Rice Resource Nursery and identified by Hongwei Xie. \u003cem\u003eOryza spontanea\u003c/em\u003e (N104625) was provided by international rice research institute, Philippines. The voucher specimens (refer to Table S1) are stored in the herbarium of Jiangxi Academy of Agricultural Sciences, relevant data are stored in the Germplasm Resource Database of the Jiangxi Academy of Agricultural Sciences (currently an internal website, and are available from the corresponding author upon reasonable request). All the wild rice and \u003cem\u003eO. spontanea\u003c/em\u003e were grown in the Wild Rice Germplasm Resource Nursery of the Jiangxi Academy of Agricultural Sciences (28\u0026deg;56\u0026prime;20\u0026Prime;N, 115\u0026deg;39\u0026prime;59\u0026Prime;E).\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eConstruction of maximum likelihood tree based on the Mitotype-specific sequences (MSSs)\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eTo selected the representative D1-CMS relatives among the 76 common wild rice collected from worldwide. The previous developed 20 pairs of evenly distribution D1-CMS mitochondrial specific sequences were used (Table S2). The 0/1 matrix was generated by MTTs gel results (Table S3). The phylogenetic tree was constructed using the maximum likelihood method in IQ-TREE software v1.6.12 [24], and the resulting maximum likelihood tree was visualized using ITOL software [25].\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eSamples, DNA extraction and Mitochondrial genome sequencing\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eA total of 11 representative wild rice were selected based on the phylogenetic tree constructed by MSSs in this study (Table 1). Fresh leaves from the wild rice species were collected from a local filed experiment field of Jiangxi Academy of Agricultural Science, Nanchang, China. High-molecular-weight genomic DNA was extracted from 1-month-old leaves following the standard CTAB method. The mitochondrial genome of 11 wild rice was sequenced using the Illimina, Pacbio and Nanopore methods. Paired-end libraries with insert sizes of \u0026sim;300\u0026thinsp;bp were prepared following Illumina\u0026rsquo;s standard genomic DNA library preparation procedure. The qualified Illumina pair-end library would be used for Illumina NovaSeq 2500 sequencing (150\u0026thinsp;bp \u0026times; 2, Shanghai BIOZERON Co., Ltd., Shanghai, China). \u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eMitochondrial genome assembly and annotation\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eRaw sequencing reads were trimmed using Btrim software to filter out low-quality bases and sequences. The high quality reads were assembled into bp using SOAP de novo v2.04 ( http://soap.genomics.org.cn/) and Celera Assembler 8.0. Gaps in the SSPACE scaffolds were filled using GapCloser v.1.12 [26] with default parameters. Finally, redundant sequences were manually removed to obtain the final mitochondrial genome. \u003c/p\u003e\n\u003cp\u003eThe denovo annotation strategy and homology-based comparison were conducted for Genome annotation. We utilized the protein sequences of the reference genome for homology alignment prediction. First, we rapidly aligned the protein sequences to each genome sequence, filtered out the poor alignment results, removed redundancies, and then using Genewise for precise alignment. The AUGUSTUS software (http://bioinf.uni-greifswald.de/augustus/) was performed to de novo gene prediction for plant mitochondrial genomes. Finally, we integrated the gene sets using the EVidenceModeler v1.1.1 software [27] to obtain the comprehensive annotation file. The output genebank format file was manually confirmed, and the mitochondrial circular map was drawn using Organellar Genome DRAW (MPI-MP CHLOROBOX - OGDRAW (mpg.de)).\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAnalysis of repeated sequences\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eBlastN was used to determine whether there were intragenomic duplications of large fragments or interspersed repeats in the 11 wild rice mitogenomes. Microsatellite identification tool was used to detect simple sequence repeats. The repeats of 1, 2, 3, 4, 5, and 6 bases with 8, 4, 4, 3, 3, and 3 repeats numbers, repeatively, were identified in this analysis. The tandem repeats with \u0026gt; 7p repeats units were detected using Tandem Repeats Finder v4.10 [28]. The direct and inverted repeats were identified using REPuter software (BiBiServ - Temporariliy not available (uni-bielefeld.de)) with the minimal repeat size set to 20 bp.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eCollinear analysis and comparative analysis among the D1-CMS wild rice\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe mitogenome of Nipponbare was characterized by Notsu [29] were used for comparision. Collinear analysis between the two mitogenomes were conducted in Mummer [30]. Collinear regions were identified using the \u003cem\u003enucmer\u003c/em\u003e module with an identity threshold of 95% under the many-to-many alignment mode. The resulting collinearity maps were visualized with Plotsr implemented in the SYRI software[31].\u003c/p\u003e\n\u003cp\u003eThe genome sizes, GC contents, base compositions, start and stop codons, gene counts, and intron numbers of 11wild rice mitogenomes were compared to assess their variations and similarities. The copy number of protein coding genes among the mitogenome were identified.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eRearrangement event identification in D1-type wild rice mitogenome\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eMauve[32] genome alignments serve as a powerful tool for identify conserved genomic regions and structural rearrangements across different samples. In this study, we conducted a series of Mauve alignments under default parameters to compare mitochondrial genomes of GD, GX and JX, representing three distinct province region of China.\u003c/p\u003e\n\u003cp\u003eTo infer the rearrangement rate between eleven wild rice, multiple alignment of all pairwise combinations of the mitogenomes were conducted using Mauve (version 20150226) to identify locally collinear blocks (LCBs) in each mitogenome with default parameters. \u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eStructural variation between the wild rice mitochondrial genome and cultivate rice\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eTo construct the D1-type wild rice mitochondrial SV maps, we divided the SVs into four categories-insertions (INSs), deletions (DELs), inversions (INVs) and duplications (DUPs). A whole genome alignment by using mummer combined with SYRI software based on the NIP reference mitochondrial genome. \u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eConstruction of mitochondrial genome variant maps \u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe SNP and INDEL calling of 11 D1-wild rice accessions was also conducted using NIP (BA000029.3) as a reference mitochondrial genome. Raw data of 11 wild rice were trimmed by Trimmomatic v0.39 [33]. Clean data was mapped to the reference genome using BWA[34]. Samtools [35] was used to convert the SAM into BAM format file. Picards (Picard Tools - By Broad Institute) was used to remove the duplicated reads. Variation identification and filtering were conducted by using GATK 4.1[36]. Finally, all SNP and INDELs with MAF large than 0.05 and max-missing rate less than 0.1 was kept for subsequent analysis. \u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAnalyses of the protein-coding-gene based \u003c/strong\u003e\u003cstrong\u003ePan-mitochondrial genome\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe all-to-all comparison of the peptide sequence of protein-coding genes was carried out using BLASTP with an E-value of e-10, and followed by clustering of the blast results using OrthoFinder [37] with default parameters. The clusters were categorized into three groups: the core genome shared by all 12 genomes, the dispensable genome containing genes present in 2 to 11 genomes, and specific gene families unique to only one genome.\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eData Availability\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe mitochondrial genome sequences of long reads and short reads supporting in this study have been submitted to the National Center for Biotechnology Information (NCBI) with BioProject ID: PRJNA1338361 and PRJNA1338456, the material that support the findings in this study are available from the corresponding author, Hongwei Xie, upon reasonable request.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAcknowledgements\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eWe appreciate professor Yonggen Lu, and Researcher Yuntao Liang for\u0026nbsp;kindly providing the wild rice materials.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eFunding\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThis work was supported by Major Talent Development Program of the Jiangxi Academy of Agricultural Sciences (JXSNKYJCRC202501), Jiangxi Province Science Foundation for Youths (20224BAB215008), Major Discipline Academic and Technical Leaders Training Program of Jiangxi Province (20213BCJL22044), China Agriculture Research System (CARS-01-08).\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAuthor information\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eAuthors and Affiliations\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eJiangxi Super-Rice Research and Development Center, Jiangxi Provincial Key Laboratory of Rice Germplasm Innovation and Breeding, National Engineering Research Center for Rice, Jiangxi Academy of Agricultural Sciences, Nanchang, 330200, China\u003c/p\u003e\n\u003cp\u003eWeixiong Long, Jie Wang, Lihua Luo, Lujian Zhou, Wei Chen, Laiyang Luo, Weibiao Xu, Yonghui Li, Longan Yan, Yaohui Cai \u0026amp; Hongwei Xie\u003c/p\u003e\n\u003cp\u003eContributions\u003c/p\u003e\n\u003cp\u003eWeixiong Long, Longan Yan, Yaohui Cai, and Hongwei Xie conceived and designed the project. Jie wang, Lihua Luo, Lujian Zhou and Wei Chen were responsible for the management of the wild rice materials. Laiyang Luo, Weibiao Xu, and Yonghui Li helped in the preparation of the manuscript. Weixiong Long and Hongwei Xie wrote and supervised the manuscript. All authors read and approved the manuscript.\u003c/p\u003e\n\u003cp\u003eCorresponding authors\u003c/p\u003e\n\u003cp\u003eCorrespondence to Hongwei Xie.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eEthics declarations\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eEthics approval and consent to participate\u003c/p\u003e\n\u003cp\u003e\u003cem\u003eOryza rufipogon\u003c/em\u003e seedings collected from Guangdong province in China and were identified by professor Yonggen Lu, while the Guangxi wild rice (\u003cem\u003eOryza rufipogon\u003c/em\u003e) seedlings were collected from Guangxi Academy of Agricultural Sciences and were identified by researcher Yuntao Liang, the Jiangxi common wild rice were collected from Dongxiang, Jiangxi, China and were identified by researcher Hongwei Xie. The methods involved in the study were carried out in compliance with local and national regulations.\u003c/p\u003e\n\u003cp\u003eConsent for publication\u003c/p\u003e\n\u003cp\u003eNot applicable\u003c/p\u003e\n\u003cp\u003eCompeting interests\u003c/p\u003e\n\u003cp\u003eThe authors declare that they have no competing interests.\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\n\u003cli\u003eMilani L, Ghiselli F: \u003cstrong\u003eMitochondrial activity in gametes and transmission of viable mtDNA\u003c/strong\u003e. \u003cem\u003eBiol Direct \u003c/em\u003e2015, \u003cstrong\u003e10\u003c/strong\u003e:22.\u003c/li\u003e\n\u003cli\u003eEyre-Walker A, Gaut RL, Hilton H, Feldman DL, Gaut BS: \u003cstrong\u003eInvestigation of the bottleneck leading to the domestication of maize\u003c/strong\u003e. \u003cem\u003eProc Natl Acad Sci U S A \u003c/em\u003e1998, \u003cstrong\u003e95\u003c/strong\u003e(8):4441-4446.\u003c/li\u003e\n\u003cli\u003eWang N, Li C, Kuang L, Wu X, Xie K, Zhu A, Xu Q, Larkin RM, Zhou Y, Deng X\u003cem\u003e et al\u003c/em\u003e: \u003cstrong\u003ePan-mitogenomics reveals the genetic basis of cytonuclear conflicts in citrus hybridization, domestication, and diversification\u003c/strong\u003e. \u003cem\u003eProc Natl Acad Sci U S A \u003c/em\u003e2022, \u003cstrong\u003e119\u003c/strong\u003e(43):e2206076119.\u003c/li\u003e\n\u003cli\u003eZhou Y, Duvaux L, Ren G, Zhang L, Savolainen O, Liu J: \u003cstrong\u003eImportance of incomplete lineage sorting and introgression in the origin of shared genetic variation between two closely related pines with overlapping distributions\u003c/strong\u003e. \u003cem\u003eHeredity (Edinb) \u003c/em\u003e2017, \u003cstrong\u003e118\u003c/strong\u003e(3):211-220.\u003c/li\u003e\n\u003cli\u003eCole LW, Guo W, Mower JP, Palmer JD: \u003cstrong\u003eHigh and Variable Rates of Repeat-Mediated Mitochondrial Genome Rearrangement in a Genus of Plants\u003c/strong\u003e. \u003cem\u003eMol Biol Evol \u003c/em\u003e2018, \u003cstrong\u003e35\u003c/strong\u003e(11):2773-2785.\u003c/li\u003e\n\u003cli\u003eKozik A, Rowan BA, Lavelle D, Berke L, Schranz ME, Michelmore RW, Christensen AC: \u003cstrong\u003eThe alternative reality of plant mitochondrial DNA: One ring does not rule them all\u003c/strong\u003e. \u003cem\u003ePLoS Genet \u003c/em\u003e2019, \u003cstrong\u003e15\u003c/strong\u003e(8):e1008373.\u003c/li\u003e\n\u003cli\u003eTang H, Luo D, Zhou D, Zhang Q, Tian D, Zheng X, Chen L, Liu YG: \u003cstrong\u003eThe rice restorer Rf4 for wild-abortive cytoplasmic male sterility encodes a mitochondrial-localized PPR protein that functions in reduction of WA352 transcripts\u003c/strong\u003e. \u003cem\u003eMol Plant \u003c/em\u003e2014, \u003cstrong\u003e7\u003c/strong\u003e(9):1497-1500.\u003c/li\u003e\n\u003cli\u003eFujii S, Bond CS, Small ID: \u003cstrong\u003eSelection patterns on restorer-like genes reveal a conflict between nuclear and mitochondrial genomes throughout angiosperm evolution\u003c/strong\u003e. \u003cem\u003eProc Natl Acad Sci U S A \u003c/em\u003e2011, \u003cstrong\u003e108\u003c/strong\u003e(4):1723-1728.\u003c/li\u003e\n\u003cli\u003eShedge V, Davila J, Arrieta-Montiel MP, Mohammed S, Mackenzie SA: \u003cstrong\u003eExtensive rearrangement of the Arabidopsis mitochondrial genome elicits cellular conditions for thermotolerance\u003c/strong\u003e. \u003cem\u003ePlant Physiol \u003c/em\u003e2010, \u003cstrong\u003e152\u003c/strong\u003e(4):1960-1970.\u003c/li\u003e\n\u003cli\u003eGualberto JM, Newton KJ: \u003cstrong\u003ePlant Mitochondrial Genomes: Dynamics and Mechanisms of Mutation\u003c/strong\u003e. \u003cem\u003eAnnu Rev Plant Biol \u003c/em\u003e2017, \u003cstrong\u003e68\u003c/strong\u003e:225-252.\u003c/li\u003e\n\u003cli\u003eZhao N, Wang Y, Hua J: \u003cstrong\u003eThe Roles of Mitochondrion in Intergenomic Gene Transfer in Plants: A Source and a Pool\u003c/strong\u003e. \u003cem\u003eInt J Mol Sci \u003c/em\u003e2018, \u003cstrong\u003e19\u003c/strong\u003e(2).\u003c/li\u003e\n\u003cli\u003eKong J, Wang J, Nie L, Tembrock LR, Zou C, Kan S, Ma X, Wendel JF, Wu Z: \u003cstrong\u003eEvolutionary dynamics of mitochondrial genomes and intracellular transfers among diploid and allopolyploid cotton species\u003c/strong\u003e. \u003cem\u003eBMC Biol \u003c/em\u003e2025, \u003cstrong\u003e23\u003c/strong\u003e(1):9.\u003c/li\u003e\n\u003cli\u003eBentolila S, Stefanov S: \u003cstrong\u003eA reevaluation of rice mitochondrial evolution based on the complete sequence of male-fertile and male-sterile mitochondrial genomes\u003c/strong\u003e. \u003cem\u003ePlant Physiol \u003c/em\u003e2012, \u003cstrong\u003e158\u003c/strong\u003e(2):996-1017.\u003c/li\u003e\n\u003cli\u003eClifton DR, Fields KA, Grieshaber SS, Dooley CA, Fischer ER, Mead DJ, Carabeo RA, Hackstadt T: \u003cstrong\u003eA chlamydial type III translocated protein is tyrosine-phosphorylated at the site of entry and associated with recruitment of actin\u003c/strong\u003e. \u003cem\u003eProc Natl Acad Sci U S A \u003c/em\u003e2004, \u003cstrong\u003e101\u003c/strong\u003e(27):10166-10171.\u003c/li\u003e\n\u003cli\u003eSloan DB, Alverson AJ, Chuckalovcak JP, Wu M, McCauley DE, Palmer JD, Taylor DR: \u003cstrong\u003eRapid evolution of enormous, multichromosomal genomes in flowering plant mitochondria with exceptionally high mutation rates\u003c/strong\u003e. \u003cem\u003ePLoS Biol \u003c/em\u003e2012, \u003cstrong\u003e10\u003c/strong\u003e(1):e1001241.\u003c/li\u003e\n\u003cli\u003eLong W, He Q, Wang Y, Wang Y, Wang J, Yuan Z, Wang M, Chen W, Luo L, Luo L\u003cem\u003e et al\u003c/em\u003e: \u003cstrong\u003eGenome evolution and diversity of wild and cultivated rice species\u003c/strong\u003e. \u003cem\u003eNat Commun \u003c/em\u003e2024, \u003cstrong\u003e15\u003c/strong\u003e(1):9994.\u003c/li\u003e\n\u003cli\u003eHuang X, Kurata N, Wei X, Wang ZX, Wang A, Zhao Q, Zhao Y, Liu K, Lu H, Li W\u003cem\u003e et al\u003c/em\u003e: \u003cstrong\u003eA map of rice genome variation reveals the origin of cultivated rice\u003c/strong\u003e. \u003cem\u003eNature \u003c/em\u003e2012, \u003cstrong\u003e490\u003c/strong\u003e(7421):497-501.\u003c/li\u003e\n\u003cli\u003eGuo D, Li Y, Lu H, Zhao Y, Kurata N, Wei X, Wang A, Wang Y, Zhan Q, Fan D\u003cem\u003e et al\u003c/em\u003e: \u003cstrong\u003eA pangenome reference of wild and cultivated rice\u003c/strong\u003e. \u003cem\u003eNature \u003c/em\u003e2025, \u003cstrong\u003e642\u003c/strong\u003e(8068):662-671.\u003c/li\u003e\n\u003cli\u003eKadowaki K, Suzuki T, Kazama S: \u003cstrong\u003eA chimeric gene containing the 5\u0026apos; portion of atp6 is associated with cytoplasmic male-sterility of rice\u003c/strong\u003e. \u003cem\u003eMol Gen Genet \u003c/em\u003e1990, \u003cstrong\u003e224\u003c/strong\u003e(1):10-16.\u003c/li\u003e\n\u003cli\u003eLuo D, Xu H, Liu Z, Guo J, Li H, Chen L, Fang C, Zhang Q, Bai M, Yao N\u003cem\u003e et al\u003c/em\u003e: \u003cstrong\u003eA detrimental mitochondrial-nuclear interaction causes cytoplasmic male sterility in rice\u003c/strong\u003e. \u003cem\u003eNat Genet \u003c/em\u003e2013, \u003cstrong\u003e45\u003c/strong\u003e(5):573-577.\u003c/li\u003e\n\u003cli\u003eAkagi H, Sakamoto M, Shinjyo C, Shimada H, Fujimura T: \u003cstrong\u003eA unique sequence located downstream from the rice mitochondrial atp6 may cause male sterility\u003c/strong\u003e. \u003cem\u003eCurr Genet \u003c/em\u003e1994, \u003cstrong\u003e25\u003c/strong\u003e(1):52-58.\u003c/li\u003e\n\u003cli\u003eXie H, Peng X, Qian M, Cai Y, Ding X, Chen Q, Cai Q, Zhu Y, Yan L, Cai Y: \u003cstrong\u003eThe chimeric mitochondrial gene orf182 causes non-pollen-type abortion in Dongxiang cytoplasmic male-sterile rice\u003c/strong\u003e. \u003cem\u003ePlant J \u003c/em\u003e2018.\u003c/li\u003e\n\u003cli\u003eXie H, Wang J, Qian M, Li N, Zhu Y, Li S: \u003cstrong\u003eMitotype-specific sequences related to cytoplasmic male sterility in Oryza species\u003c/strong\u003e. \u003cem\u003eMolecular Breeding \u003c/em\u003e2013, \u003cstrong\u003e33\u003c/strong\u003e(4):803-811.\u003c/li\u003e\n\u003cli\u003eTrifinopoulos J, Nguyen LT, von Haeseler A, Minh BQ: \u003cstrong\u003eW-IQ-TREE: a fast online phylogenetic tool for maximum likelihood analysis\u003c/strong\u003e. \u003cem\u003eNucleic Acids Res \u003c/em\u003e2016, \u003cstrong\u003e44\u003c/strong\u003e(W1):W232-235.\u003c/li\u003e\n\u003cli\u003eLetunic I, Bork P: \u003cstrong\u003eInteractive Tree of Life (iTOL) v6: recent updates to the phylogenetic tree display and annotation tool\u003c/strong\u003e. \u003cem\u003eNucleic Acids Res \u003c/em\u003e2024, \u003cstrong\u003e52\u003c/strong\u003e(W1):W78-W82.\u003c/li\u003e\n\u003cli\u003eXu M, Guo L, Gu S, Wang O, Zhang R, Peters BA, Fan G, Liu X, Xu X, Deng L\u003cem\u003e et al\u003c/em\u003e: \u003cstrong\u003eTGS-GapCloser: A fast and accurate gap closer for large genomes with low coverage of error-prone long reads\u003c/strong\u003e. \u003cem\u003eGigascience \u003c/em\u003e2020, \u003cstrong\u003e9\u003c/strong\u003e(9).\u003c/li\u003e\n\u003cli\u003eHaas BJ, Salzberg SL, Zhu W, Pertea M, Allen JE, Orvis J, White O, Buell CR, Wortman JR: \u003cstrong\u003eAutomated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments\u003c/strong\u003e. \u003cem\u003eGenome Biol \u003c/em\u003e2008, \u003cstrong\u003e9\u003c/strong\u003e(1):R7.\u003c/li\u003e\n\u003cli\u003eBenson G: \u003cstrong\u003eTandem repeats finder: a program to analyze DNA sequences\u003c/strong\u003e. \u003cem\u003eNucleic Acids Res \u003c/em\u003e1999, \u003cstrong\u003e27\u003c/strong\u003e(2):573-580.\u003c/li\u003e\n\u003cli\u003eNotsu Y, Masood S, Nishikawa T, Kubo N, Akiduki G, Nakazono M, Hirai A, Kadowaki K: \u003cstrong\u003eThe complete sequence of the rice (Oryza sativa L.) mitochondrial genome: frequent DNA sequence acquisition and loss during the evolution of flowering plants\u003c/strong\u003e. \u003cem\u003eMol Genet Genomics \u003c/em\u003e2002, \u003cstrong\u003e268\u003c/strong\u003e(4):434-445.\u003c/li\u003e\n\u003cli\u003eMarcais G, Delcher AL, Phillippy AM, Coston R, Salzberg SL, Zimin A: \u003cstrong\u003eMUMmer4: A fast and versatile genome alignment system\u003c/strong\u003e. \u003cem\u003ePLoS Comput Biol \u003c/em\u003e2018, \u003cstrong\u003e14\u003c/strong\u003e(1):e1005944.\u003c/li\u003e\n\u003cli\u003eGoel M, Sun H, Jiao WB, Schneeberger K: \u003cstrong\u003eSyRI: finding genomic rearrangements and local sequence differences from whole-genome assemblies\u003c/strong\u003e. \u003cem\u003eGenome Biol \u003c/em\u003e2019, \u003cstrong\u003e20\u003c/strong\u003e(1):277.\u003c/li\u003e\n\u003cli\u003eDarling AC, Mau B, Blattner FR, Perna NT: \u003cstrong\u003eMauve: multiple alignment of conserved genomic sequence with rearrangements\u003c/strong\u003e. \u003cem\u003eGenome Res \u003c/em\u003e2004, \u003cstrong\u003e14\u003c/strong\u003e(7):1394-1403.\u003c/li\u003e\n\u003cli\u003eBolger AM, Lohse M, Usadel B: \u003cstrong\u003eTrimmomatic: a flexible trimmer for Illumina sequence data\u003c/strong\u003e. \u003cem\u003eBioinformatics \u003c/em\u003e2014, \u003cstrong\u003e30\u003c/strong\u003e(15):2114-2120.\u003c/li\u003e\n\u003cli\u003eLi H, Durbin R: \u003cstrong\u003eFast and accurate short read alignment with Burrows-Wheeler transform\u003c/strong\u003e. \u003cem\u003eBioinformatics \u003c/em\u003e2009, \u003cstrong\u003e25\u003c/strong\u003e(14):1754-1760.\u003c/li\u003e\n\u003cli\u003eLi H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, Genome Project Data Processing S: \u003cstrong\u003eThe Sequence Alignment/Map format and SAMtools\u003c/strong\u003e. \u003cem\u003eBioinformatics \u003c/em\u003e2009, \u003cstrong\u003e25\u003c/strong\u003e(16):2078-2079.\u003c/li\u003e\n\u003cli\u003eMcKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M\u003cem\u003e et al\u003c/em\u003e: \u003cstrong\u003eThe Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data\u003c/strong\u003e. \u003cem\u003eGenome Res \u003c/em\u003e2010, \u003cstrong\u003e20\u003c/strong\u003e(9):1297-1303.\u003c/li\u003e\n\u003cli\u003eEmms DM, Kelly S: \u003cstrong\u003eOrthoFinder: phylogenetic orthology inference for comparative genomics\u003c/strong\u003e. \u003cem\u003eGenome Biol \u003c/em\u003e2019, \u003cstrong\u003e20\u003c/strong\u003e(1):238.\u003c/li\u003e\n\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":true,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":"Oryza rufipogon, orf182, wild rice, CMS, structural variation","lastPublishedDoi":"10.21203/rs.3.rs-7649942/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-7649942/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eThe utilization of hybrid rice has successfully addressed food security challenges in China. However, the predominant use of wild-abortive (WA) cytoplasmic male sterility (CMS) systems in hybrid rice cultivation raises significant safety concerns. In this study, we have elucidated the distribution patterns of \u003cem\u003eorf182\u003c/em\u003e, a novel CMS gene identified in our previous work, in Chinese common wild rice. Mitochondrial sequencing and assembly of 11 representative wild rice species harboring \u003cem\u003eorf182\u003c/em\u003e revealed 16 novel genes. A total of 469 mitochondrial genes were classified into 36 gene families, with nine families containing single-copy homologous genes, indicating significant gene duplication in mitochondria. We observed a strong positive correlation between mitogenome size and the quantity and size of repetitive sequences. Collinearity analysis revealed extensive mitochondrial variation and large-scale inversions in Guangdong wild rice. Comparative genome analysis uncovered inversions, translocations, and several variations surrounding \u003cem\u003eorf182\u003c/em\u003e, with a 71 bp repeat sequence mediating the formation of the \u003cem\u003eorf182\u003c/em\u003e-\u003cem\u003enad6\u003c/em\u003e chimeric gene. Gene copy number analysis (GCNV) revealed variable \u003cem\u003eorf182\u003c/em\u003e gene copy counts (1, 2, and 3) in wild rice species. Additionally, successful transformation of \u003cem\u003eorf182\u003c/em\u003e from various sources into sterile lines was achieved. These findings provide valuable resources for advancing hybrid rice development in China, thus contributing to enhanced food security.\u003c/p\u003e","manuscriptTitle":"Comparative Analysis of orf182 Widespread in Chinese Wild Rice Species for Hybrid Rice Breeding","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2025-11-05 16:35:12","doi":"10.21203/rs.3.rs-7649942/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"94f0a097-8e85-43ce-abcc-ff1dbd12fb5a","owner":[],"postedDate":"November 5th, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"posted","subjectAreas":[],"tags":[],"updatedAt":"2026-01-14T12:40:37+00:00","versionOfRecord":[],"versionCreatedAt":"2025-11-05 16:35:12","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-7649942","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-7649942","identity":"rs-7649942","version":["v1"]},"buildId":"8U1c8b4HqxoKbykW_rLl7","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: preprint-html

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00