Comparative Chloroplast Genomics and Maternal Lineage Diversity in Oil Palm (Elaeis guineensis) | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Comparative Chloroplast Genomics and Maternal Lineage Diversity in Oil Palm (Elaeis guineensis) Ally Juma Mkude, Redi Aditama, Sintho W. Ardie, Willy B. Suwarno, and 3 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-8662850/v1 This work is licensed under a CC BY 4.0 License Status: Under Revision Version 1 posted 16 You are reading this latest preprint version Abstract Background Oil palm ( Elaeis guineensis ) is the world’s most important source of vegetable oil, yet comparative studies of its chloroplast genomes remain limited, particularly for African accessions. Understanding plastome diversity and maternal lineage variation is critical for tracing domestication history and developing molecular markers for breeding and conservation. Results We report the first complete chloroplast genomes from Tanzanian Dura and Pisifera, Cameroonian Dura, and the ancestral Deli Dura from Indonesia, analyzed alongside 44 additional plastomes. Comparative analyses revealed highly conserved genome organization and gene content, with subtle differences in IR–SC boundaries and intergenic spacers. Simple sequence repeat (SSR) profiling confirmed the dominance of A/T mononucleotide repeats but highlighted lineage-specific variation, including a markedly reduced genic SSR density in Pisifera-Tz, while total SSR counts remained stable across accessions (≈ 859–860; Supplementary Table S3 ). Phylogenetic reconstruction clustered Pisifera-Tz with Dura-Cmr, while Dura-Tz and Dura-BBG formed distinct clades, reflecting multiple maternal lineages. Haplotype network analysis identified 14 distinct haplotypes, revealing moderate but structured plastome diversity. Benchmarking of sequencing platforms demonstrated complementary strengths: MGI provided deep coverage, while PacBio improved assembly contiguity and resolution of repetitive regions. Conclusion This study expands plastome resources for E. guineensis , generating the first Tanzanian and Cameroonian chloroplast genomes and uncovering lineage-specific SSR variation. Integration of comparative genomics, phylogenetics, and haplotype analysis advances understanding of oil palm domestication and supports the development of plastome-based molecular markers for breeding, conservation, and genetic resource management. Oil palm chloroplast genome plastome diversity SSR variation phylogenetics maternal lineage Figures Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 Figure 6 Figure 7 Introduction Oil palm ( Elaeis guineensis Jacq.) is the most productive source of vegetable oil worldwide, contributing more than one-third of the global supply (1). Its high oil yield per unit area has made it central to food security, bioenergy, and industrial applications (2,3). However, the rapid expansion of oil palm cultivation has raised concerns about deforestation, biodiversity loss, and sustainability (4,5). In addition, the limited genetic base of current breeding populations constrains progress in developing resilience traits, yield stability, and climate adaptation (6,7). The narrow genetic base of Southeast Asian oil palm populations largely stems from a small number of African introductions during the colonial era (7). This founder effect has restricted genetic variation in commercially important lines, particularly the Deli Dura population in Indonesia (8). In contrast, African wild populations harbor significant allelic richness and represent untapped reservoirs of genetic diversity (8,9). Expanding cytoplasmic and nuclear variation is therefore critical for sustainable cultivation and long-term breeding success (10). Chloroplast genomes (plastomes) are powerful tools for studying plant diversity, evolution, and adaptation (11,12). Plastomes play essential roles in photosynthesis, energy metabolism, and biosynthesis of amino acids, fatty acids, and pigments. Their conserved quadripartite structure, uniparental inheritance, and relatively low mutation rate make them valuable for phylogenetics, species differentiation, and lineage tracing (13). In flowering plants, plastomes typically range from 120 to 160 kb and encode 110 to 130 genes involved in transcription, translation, and photosynthesis (14,15). Despite their conservation, single-nucleotide polymorphisms, indels, and structural variations in non-coding regions provide informative markers for evolutionary research, germplasm authentication, and lineage tracing (16,17). Previous studies of oil palm plastomes revealed highly conserved gene content and organization, consistent with other members of the palm family (Arecaceae) (18). The first complete E. guineensis plastome established a baseline structure of ~ 156,973 bp (18), and subsequent work confirmed its stability across the species (19,20). Nevertheless, comparative analyses identified nucleotide variations concentrated in single-copy regions and intergenic spacers, offering opportunities to differentiate closely related lineages (21). Commercial planting materials are derived from hybridization between Dura (thick-shelled) and Pisifera (shell-less) types to produce the high-yielding Tenera hybrid (22). Tracing maternal lineages is therefore essential for germplasm authentication and understanding cytoplasmic genetic effects (23,24) Given the narrow maternal base of commercially dominant Deli Dura lines (16), resolving plastome diversity across wild and cultivated accessions is critical for breeding programs (25,26). While nuclear genomic studies of E. guineensis and E. oleifera are extensive (27–30), detailed comparative analyses of intraspecific plastome diversity remain limited. Even small numbers of variable sites within plastomes can be powerful for differentiating lineages and mapping cytoplasmic diversity (31). Characterizing plastome variation across diverse African accessions is thus essential for conservation, breeding authentication, and marker development (32). Tanzania and Cameroon represent centers of origin and diversity for oil palm, harboring ancient lineages and wild populations that could significantly enrich global breeding programs (9,32). In contrast, Indonesian Dura populations descend from a narrow genetic base introduced from Africa in the early 20th century (16). Comparative plastome analyses across African regions, therefore, offer unique insights into domestication history, adaptation mechanisms, and maternal lineage divergence (27) Recent advances in sequencing technologies have transformed plastome research (33). Short-read platforms such as Illumina and MGI DNBSEQ provide cost-effective accuracy for sequencing and variant detection but struggle with repetitive regions and structural variations (34). Long-read platforms such as PacBio HiFi and Oxford Nanopore generate reads spanning thousands of bases, enabling resolution of repetitive elements and structural variants (35). Long reads improve assembly contiguity, facilitate haplotype phasing, and enable precise detection of complex rearrangements (35). Comparing short- and long-read assemblies provides critical validation for plastome reconstruction, particularly in complex plant lineages such as oil palm (34). However, systematic comparisons of sequencing platform performance in oil palm plastomes remain scarce. This study addresses these gaps by sequencing and analyzing chloroplast genomes of Tanzanian Dura (Dura-Tz) and Pisifera (Pisifera-Tz) accessions. We integrate these with previously sequenced Cameroonian Dura (Dura-Cmr), the ancestral Deli Dura from Indonesia (Dura-BBG), 29 Indonesian breeding lines, and 15 plastome accessions from public databases. By benchmarking MGI and PacBio sequencing platforms, we highlight their complementary strengths for plastome assembly and variant detection. Collectively, this work provides new insights into plastome diversity in E. guineensis, advances understanding of domestication history, and supports the development of plastome-based molecular markers for breeding, conservation, and genetic resource management. Methods Oil palm chloroplast genome samples Two chloroplast genomes were sequenced in this study, including one Tanzanian Dura type (Dura-Tz) and one Tanzanian Pisifera type (Pisifera-Tz). The selected plant materials was identified by Mr. Basil E. Kavishe, Research Assistant at Tanzania Agricultural Research Institute (TARI). No voucher specimen was deposited in publicly accessible herbarium. However, the plant materials were photographed during sample collection, and they are now monitored by TARI for reference and future studies. Young leaves were collected from the Kwitanga plantation, Kigoma, Tanzania. Samples were cleaned, cut into ~ 20 cm pieces, sterilized with 70% ethanol, air-dried, sealed in airtight plastic bags, and transported in a cold box to PT Genetika Science, Indonesia, for sequencing. Two representative wild oil palm samples, one Cameroonian Dura-type (Dura-Cmr) and one Deli Dura progenitor from Bogor Botanical Garden, Indonesia (Dura-BBG), were sequenced in a previous project at PMB Lab, IPB University (NCBI accession numbers are on Progress). Moreover, as many as twenty-nine (29) samples were also from previously sequenced chloroplast genomes and obtained from Dr. Azis Natawijaya (PT Bumitama Gunajaya Agro Company). Inquiries for these 29 sequences should be directly forwarded to Dr. Azis Natawijaya. Other cpDNAs were obtained from NCBI GenBank DNA Database ( https://ncbi.nlm.nih.gov/ ). All chloroplast genome sequences generated in this study, including their associated genome annotations, are publicly available in the NCBI database under BioProject accession number PRJNA1416287 . Chloroplast genome sequencing and assembly Short-read sequencing of Dura-Tz and Pisifera-Tz total genomes was performed using the MGI DNBSEQ-G400 platform (34) ( https://mgi-tech.eu/sequencing-products/dnbseq-g400 ) (PE150) with MGIEasy FS DNA Library Prep Set. Long-read sequencing of Dura-Tz total genomes was conducted using the PacBio Vega platform with HiFi Plex Prep Kit 96 (PacBio, 103-122-800) (36) ( https://www.pacb.com/wp-content/uploads/Insert-HiFi-plex-prep-kit-96 ). Raw reads of Dura-Tz and Pisifera-Tz were processed with Fastp v0.20.1 (37) to trim low-quality sequences and adaptor contamination. De novo assembly was performed using NOVOPlasty v4.3.5 (38), and the assemblies were polished using EnsemblPlants56 ( https://plants.ensembl.org ). Sequencing depth and coverage of the Dura-Tz and Pisifera-Tz chloroplast genomes were evaluated by mapping raw reads back to the assembled plastomes. Depth profiles were generated across the entire genome length (0–156 kb) for both Dura-Tz and Pisifera-Tz accessions. Coverage plots were inspected to confirm uniformity across large single copy (LSC), small single copy (SSC), and inverted repeat (IR) regions. Chloroplast genome annotation Assembled genomes in FASTA format were annotated using GeSeq within CHLOROBOX (39) and the Dual Organellar Genome Annotator (DOGMA) with default parameters—annotation employed 95% sequence similarity thresholds for protein-coding genes, rRNAs, and tRNAs. BLASTX and BLASTN were used to identify coding genes and rRNAs, while tRNAscan-SE v2.0.7 was used for tRNA annotation (40). Intron boundaries, start, and stop codons were manually curated by comparison with a combination of many related species of Arecaceae plastomes. Genome maps were visualized using OGDRAW (41). Comparative analysis of Plastome structure and Synteny Using assembled chloroplast genomes, the base composition and frequencies of the LSC, SSC, and IR regions were determined in Geneious Prime v2024.0.7 (42). Two multiple progressive sequence alignments were conducted; the first one included the two new plastomes plus Dura-BBG and Dura-Cmr. Multiple sequence alignments of the plastomes were generated using MAFFT v7.490 (43). IR boundary variation was visualized using IRscope (44). The second analysis was carried out using the two new plastomes plus Dura-BBG and Dura-Cmr and six Indonesian breeding lines (Dura-Id_574, Dura-Id_101128179, Pisifera-Id_221, Pisifera-Id_1169, Tenera-Id_1093, Tenera-Id_15730) and six plastomes that were available on NCBI (ON248756, NC_017602, OR125034, OR125040, OR125048, OR125054) in Mauve v2.4.0 (45). Simple sequence repeats (SSRs) Pairwise multiple alignments of cp genomes were performed using MAFFT v7.490 (43) in Geneious Prime. Simple sequence repeats (SSRs) containing 1–6 nucleotides were identified using the Phobos v3.3.12 (46). The following configuration was adopted to search for SSR motifs: SSR of one to six nucleotides long, with a minimum repeat number of 10, 5, and 4 units for mono-, di-, and trinucleotide SSRs, respectively, and three units for tetra-, penta-, and hexanucleotide SSRs. Phylogenetic and haplotype analysis Phylogenetic analysis included 48 E. guineensis cp genomes. Multiple sequence alignments were generated using MAFFT v7.490 (43). Phylogenetic trees were constructed using the Maximum Likelihood (ML) method with the Tamura-Nei (TN93/TNG93 + G + I) genetic distance model, and 1000 bootstrap replications (47,48). Haplotype networks were constructed from 48 E. guineensis cp genomes, including 19 PT Bumitama Gunajaya Agro accessions, 25 NCBI accessions, and 4 genomes sequenced in the PMB lab at IPB University. Alignments were performed in Geneious Prime, and haplotype networks were generated using NETWORK software (49). Results Chloroplast Genome Sequencing Sequencing output is summarized in Table 1 . For the Dura‑Tz and Pisifera‑Tz accessions sequenced with MGI, approximately 242 million reads were generated per sample, corresponding to ~ 36 Gb of total bases. In contrast, PacBio sequencing of Dura‑Tz produced only 5.95 million reads but yielded a substantially higher total base output of 63.33 Gb (Table 1 ). Table 1 Sequencing data summary for Tanzanian Dura (Dura-Tz) and Pisifera (Pisifera-Tz) Sample name Sequencing Platform Total reads (M.) Total bases (Gb) Dura-Tz MGI 241.78 36.27 PacBio 5.95 63.33 Pisifera-Tz MGI 242.00 36.30 These results highlight the complementary strengths of the two platforms. MGI provides deep coverage through massive read counts, ensuring reliable variant detection and uniform base representation. PacBio, despite generating fewer reads, delivers long sequences that enhance assembly contiguity and resolve repetitive regions more effectively. Together, the datasets demonstrate that combining short‑read and long‑read technologies offers both depth and structural accuracy for plastome reconstruction. Sequencing Depth and Coverage Robust sequencing coverage was achieved across the chloroplast genomes of both Dura-Tz and Pisifera-Tz accessions. The Dura-Tz assembly reached depths of up to ~ 3,500 reads (Fig. 1 A), while Pisifera-Tz attained nearly 7,000 reads (Fig. 1 B), reflecting its substantially higher sequencing throughput. Coverage profiles were generally uniform across the plastome, with expected peaks in the inverted repeat (IR) regions due to duplication and troughs in AT-rich or non-coding regions that are inherently more challenging to sequence. The overall consistency of coverage confirms the completeness and reliability of both assemblies. Significantly, the greater depth observed in Pisifera-Tz enhances confidence in variant detection and structural resolution, thereby providing a stronger foundation for downstream comparative and phylogenetic analyses. Chloroplast Genome Assembly and Structure The complete chloroplast genomes of Dura‑Tz (PX957236) and Pisifera‑Tz (PX957237) were assembled into circular maps of 156,883 bp, each displaying the typical quadripartite organization comprising large single‑copy (LSC), small single‑copy (SSC), and inverted repeat (IR) regions (Fig. 2 A, 2 B). Comparative analysis with previously sequenced accessions (Dura‑Cmr and Dura‑BBG) revealed highly conserved structural and compositional features (Table 2 ). These findings are consistent with earlier oil palm plastome characterizations (23,50) and broader plastid genome studies in seed plants (11,13,14, 23). Table 2 Summary of general features of 4 accessions of Elaeis guineensis chloroplast-genomes Attribute Dura-Tz Pisifera-Tz Dura-Cmr Dura-BBG Genome Size (bp) 156,983 156,983 156,982 156,990 GC content (%) 37.4 37.4 37.4 37.4 LSC size (bp) 85,194 85,194 85,193 85,198 SSC size (bp) 17,643 17,643 17,643 17,644 IRs size (bp) 27,073 27,073 27,073 27,074 LSC GC content (%) 35.5 35.5 35.5 35.5 SSC GC content (%) 31.0 30.9 31.0 31.0 IRs GC content (%) 42.5 42.5 42.5 42.5 Number of genes 115 115 115 115 Number of protein-coding genes 79 80 80 80 Number of tRNA 32 31 31 31 Number of rRNA 4 4 4 4 Chloroplast Genome Annotation and Gene Content Annotation identified a consistent set of 115 genes across all accessions, comprising 79–80 protein‑coding genes, 31–32 tRNAs, and 4 rRNAs. Dura-Tz contained 79 protein-coding genes and 32 tRNAs, compared with 80/31 in others (Table 2 ). This subtle difference may reflect lineage-specific variation in gene annotation or minor events of tRNA duplication or loss, possibly involving ycf1 or related loci. Overall, gene composition remains highly stable, underscoring the structural conservation of E. guineensis plastomes. Such uniformity highlights the evolutionary stability of the chloroplast genome, while slight differences in gene counts may serve as useful markers for lineage differentiation and comparative genomics. Functional categorization revealed a conserved organization across all four accessions, including genes encoding photosystem components, ribosomal proteins, ATP synthase, NADH dehydrogenase, Rubisco, RNA polymerase, and proteins involved in DNA replication and repair. Transfer RNA (tRNA) and ribosomal RNA (rRNA) genes were distributed throughout the plastome, ensuring its translational capacity. Hypothetical chloroplast reading frames (ycf) were also identified, consistent with their conserved presence in angiosperm plastomes. Gene transcription orientation was bidirectional, with genes on the outer ring transcribed clockwise and those on the inner ring transcribed counterclockwise. Core functional categories—photosynthesis, transcription, translation, and genome maintenance—were consistently represented, in line with previous annotations of oil palm plastomes (10,17,19). Together, these findings confirm the high degree of conservation in gene content and organization among oil palm plastomes. The stability observed between Dura‑Tz and Pisifera‑Tz, along with other accessions, provides a reliable foundation for comparative and evolutionary analyses. At the same time, minor lineage‑specific variations can serve as potential markers of genetic differentiation. SSR Distribution and Motif Composition Analysis of simple sequence repeats (SSRs) across the four accessions—Dura‑BBG, Dura‑CMR, Pisifera‑Tz, and Dura‑Tz—revealed a highly conserved repeat landscape (Fig. 3 , Supplementary Table 1). Total SSR counts ranged narrowly from 859 to 860 per plastome, indicating strong structural stability across lineages. Mononucleotide repeats dominated the SSR profile, averaging 338.5 per accession, with a pronounced bias toward A/T motifs. Each accession contained approximately 317 A/T mononucleotide repeats, accounting for over 93% of the total, while G/C repeats were rare (22 per plastome). This A/T enrichment reflects the low GC content and mutational bias typical of plastid genomes (10,17) Dinucleotide repeats were equally conserved, with 58 per accession. AT and TA motifs were the most abundant, comprising 144 of the 232 total dinucleotide repeats (62%). Other motifs such as AG, GA, TC, and CT were evenly distributed across accessions, with no lineage-specific enrichment. Trinucleotide and tetranucleotide motifs also showed uniform distribution, with 71 and 95 repeats per accession, respectively. Trinucleotide motifs were dominated by A/T-rich sequences such as TAT, AAG, and AAC, while tetranucleotide repeats included AAAT, AATA, ATTA, and TTTA. These patterns reinforce the A/T bias observed in shorter motifs and suggest that even longer SSRs retain compositional consistency across accessions. Pentanucleotide repeats were similarly conserved, with 134 motifs per accession. Dominant sequences included AAAAT, AATTT, TTTTA, and TTAAA, all of which are rich in A/T bases. The recurrence of these motifs across all accessions further supports the plastome’s mutational bias and structural stability. No accession-specific expansion or contraction of pentanucleotide motifs was observed. Hexanucleotide repeats showed slightly more diversity, with counts ranging from 157 to 158 per accession. Despite the broader sequence range, dominant motifs remained A/T-rich, including TTCTTT, TTTTAT, and TTTTTT. The consistent counts and motif composition across accessions suggest that even the longest SSRs are evolutionarily stable and not subject to lineage-specific expansion. Hexanucleotide repeats showed minor variation (157–158), likely reflecting annotation differences or sequencing noise rather than true biological divergence. Although Pisifera-Tz exhibited a marginally higher total SSR count (860 vs. 859 in other accessions), its genic SSR density was substantially reduced (277 vs. ~681–682). This indicates that while overall SSR abundance remains stable, Pisifera-Tz plastomes have undergone lineage-specific contraction in genic regions (see Supplementary Table S3 ).” Overall, the uniformity of SSR types and motif composition across all accessions underscores the evolutionary stability of the oil palm plastome. While SSR variability is limited, conserved tetranucleotide, pentanucleotide, and hexanucleotide motifs—particularly those rich in A/T bases—may still serve as neutral molecular markers for population genetics, germplasm authentication, and comparative genomics. This conserved SSR profile highlights plastome stability and mirrors SSR distributions reported in other palm species (8,51,52). SSR marker development in oil palm has similarly emphasized A/T-rich motifs (53,54). Genic SSRs Across Accessions Comparative analysis of genic SSRs revealed both broad conservation and striking lineage‑specific differences among the four E. guineensis chloroplast genomes (Table 3 ). Dura‑Cmr and Dura‑BBG each contained 682 genic SSRs, while Dura‑Tz harbored 681, reflecting near‑identical SSR abundance and distribution across genes associated with photosynthesis, replication, and genome maintenance. This consistency underscores the evolutionary stability of genic SSRs in most oil palm accessions, with plastome function preserved through balanced representation across essential pathways. Table 3 Summary of genic SSRs in four Elaeis guineensis chloroplast genomes Genome Total genic SSRs Gene categories with SSRs Dura-Cmr/Dura-BBG/Dura-Tz 681–682 Photosynthesis : atp (B/E/F/I), ccsA , cemA , ndH (A/B/D/E/H/I/K), paf (I/II), pet (A/B/D), psa (A/B/J), psb (A/B/C/F/I/K/L/M/T/Z), rbcL. Maintenance : accD , clpP1 , matK , rpl (2/16/22/32/33), rps (3/4/7/11/12/14/15/16/18/19), ycf (1/2) Replication : rpo A/B/C1/C2 Pisifera-Tz 277 Photosynthesis : atp (B/E/F/I), cemA , ndhK , paf I/II, pet A/B/D, psa A/B/I/J, psb (A/B/C/F/I/K/L/M/T/Z), rbcL . Maintenance : accD , clpP1 , matK , rpl (16/22/33), rps (3/4/11/12/14/16/18) Replication : rpo A/B/C1/C2 In contrast, Pisifera‑Tz exhibited a markedly reduced genic SSR count (277), representing a substantial contraction relative to the other accessions. Despite this reduction, SSRs in Pisifera‑Tz were still distributed across key functional genes, including rbcL, matK, psa, psb, and rpo, ensuring that core photosynthetic and transcriptional processes remained represented. However, the absence of SSRs in several maintenance‑related loci—including ycf1, ycf2, and members of the ndh family—suggests lineage‑specific loss or reduced SSR density in regions typically associated with genome stability and stress response. This pattern highlights two complementary insights: first, the conserved SSR profiles in Dura accessions reflect plastome resilience and functional redundancy across diverse lineages; second, the reduced SSR density in Pisifera‑Tz may indicate either structural streamlining or selective pressures that minimized repetitive elements in its plastome. From a practical perspective, these differences provide opportunities for marker development: conserved genic SSRs in Dura accessions can serve as stable lineage markers. At the same time, the unique contraction in Pisifera‑Tz offers diagnostic motifs for distinguishing maternal lineages in breeding programs. Despite this contraction, SSRs remained present in key functional genes, including rbcL , matK , psa , psb , and rpo . The absence of SSRs in ycf1 , ycf2 , and ndh family genes suggests lineage-specific streamlining, consistent with plastome variation observed in Pisifera lineages (55,56) Plastome Structure and Synteny Multiple genome alignment of 16 Elaeis guineensis chloroplast genomes—representing Dura, Tenera, and Pisifera—revealed a high degree of structural conservation across accessions (Fig. 4 ). Using the Mauve algorithm, plastome sizes ranged narrowly from 153,000 to 158,000 bp, with most accessions exhibiting uniform syntenic block arrangements and stable genome architecture. This consistency reflects strong evolutionary constraints on plastome organization, likely driven by the essential roles of chloroplast genes in photosynthesis and cellular metabolism. Notably, two Indonesian Pisifera accessions (Pisifera‑Id_221 and Pisifera‑Id_101128179) displayed structural rearrangements in the 140–145 kb region, characterized by translocations and/or inversions (highlighted in red blocks). These localized disruptions suggest lineage‑specific plastome remodeling, potentially arising from recombination events or selective pressures unique to the Pisifera background. While the overall plastome structure remains conserved, such rearrangements may influence gene regulation or plastome stability, warranting further investigation into their functional consequences. Complementary mapping of the 16 plastomes confirmed the conservation of junction architecture between the large single copy (LSC), small single copy (SSC), and inverted repeat (IR) regions (Fig. 5 ). All accessions shared nearly identical genome sizes (~ 156.98 kb) and consistent placement of key genes at junctions, including rps19, rpl2, psbA, ndhF, ycf1, and trnN. Minor differences in junction lengths (25–65 bp) were observed but did not affect gene orientation or overall structural layout. These findings reinforce the evolutionary stability of plastome boundaries in E. guineensis and validate their utility for comparative and phylogenetic analyses. The conserved junctions and syntenic blocks across most accessions provide a robust framework for plastome‑based lineage tracing. In contrast, the structural variation in select Pisifera accessions offers a unique opportunity to explore plastome dynamics and their potential role in fruit form differentiation or adaptation. Such rearrangements have been linked to recombination events in plastomes of palms and other monocots (20,24,45,57) Comparative mapping of LSC, SSC, and IR junctions confirmed consistent placement of key genes ( rps19 , rpl2 , psbA , ndhF , ycf1 , trnN ) and minor junction length variation (25–65 bp), reinforcing plastome boundary stability [54](44). Phylogenetic Relationships Phylogenetic analysis based on complete chloroplast genome sequences revealed distinct clustering patterns among the four E. guineensis accessions (Fig. 6 ). Pisifera-Tz and Dura-Cmr formed a closely related clade, suggesting high genetic similarity and a likely shared maternal lineage. In contrast, Dura-Tz and Dura-BBG grouped separately, indicating moderate divergence that may reflect differences in evolutionary origin, geographic adaptation, or breeding history. Despite these variations, all accessions clustered within a unified species group, confirming their overall genomic relatedness and supporting their classification within E. guineensis . This pattern underscores the utility of chloroplast genomes for resolving maternal lineages while reinforcing the evolutionary cohesion of the species. These results align with previous phylogenetic studies that resolved maternal lineages in oil palm using plastome data (12,17,18). Haplotype Diversity Median-joining haplotype network analysis identified 14 distinct plastome haplotypes (HAP1–HAP14) among 48 accessions (Supplementary table 2 ). The network displayed short mutational distances and interconnected haplotypes, reflecting low divergence and shared evolutionary origin. Median vectors (mv1–mv5) represented inferred ancestral or intermediate haplotypes, providing insight into mutational pathways and lineage continuity. This moderate but structured plastome diversity supports the evolutionary stability of the plastome and offers practical utility for germplasm management and conservation of cytoplasmic diversity (Fig. 7 ). The presence of median vectors (mv1–mv5) suggests inferred ancestral haplotypes, consistent with haplotype diversity patterns reported in oil palm germplasm collections (9,28,49,51,52) Discussion The assemblies confirm the evolutionary stability of oil palm plastomes, with conserved genome sizes and GC content across accessions. This mirrors findings from broader palm plastome studies (20,24,58) and emphasizes strong selective constraints on chloroplast architecture (10,11,59). Annotation results reinforce this resilience, with highly conserved gene repertoires across accessions, consistent with previous genome sequencing efforts (10,17,60) SSR analysis revealed conserved motif landscapes across accessions, dominated by A/T mononucleotide repeats. This pattern reflects mutational bias typical of plastid genomes (14,15) and parallels SSR distributions reported in oil palm (53,54). Genic SSR analysis highlighted Pisifera-Tz as an outlier, with reduced SSR density and absence in ycf and ndh loci, paralleling reports of cytoplasmic variation in oil palm (61,62). Comparative synteny analysis confirmed conserved plastome architecture across most accessions, with structural rearrangements observed only in Indonesian Pisifera genomes. These rearrangements are consistent with recombination-driven plastome variation in palms (20,45,57). Phylogenetic analysis resolved maternal lineages, clustering Pisifera-Tz with Dura-Cmr, while haplotype network analysis (Fig. 7 ) revealed moderate diversity but strong connectivity, consistent with maternal inheritance and evolutionary stability (12,17,49). Together, these findings highlight a dual narrative: deep conservation of plastome structure and gene content across E. guineensis, coupled with subtle lineage-specific variation in SSR density, Synteny, and haplotype distribution. This balance of stability and divergence provides a robust framework for phylogenetic inference, germplasm authentication, and breeding strategies aimed at conserving cytoplasmic diversity in oil palm (8,28,52,63). Conclusions Comparative analysis of four Elaeis guineensis chloroplast genomes revealed a highly conserved plastome structure, with only minor variation in junction lengths and genome sizes. Despite this overall stability, Pisifera-Tz exhibited a markedly reduced genic SSR abundance, highlighting its distinct genomic profile. Phylogenetic and haplotype analyses further demonstrated close evolutionary relationships among accessions, while uncovering moderate chloroplast valuable diversity for lineage tracing, population genetics, and applied breeding. Together, these findings provide valuable genomic resources that strengthen evolutionary understanding and support future conservation and crop improvement strategies in oil palm. Declarations Authors' contributions : AJM and SS: Research conceptualization, implementation, data analysis, original manuscript preparation, and reviewing. RA: Data analysis and manuscript data preparation SWA and WBS: Supervising and reviewing the manuscript. AN and GIL: Providing the chloroplast genome of 19 oil palm germplasm accessions from Indonesia and reviewing the manuscript. Data availability All data supporting the findings of this study are available within the paper and its Supplementary Information. All chloroplast genome sequences generated in this study, including their associated genome annotations, are publicly available in the NCBI database under BioProject accession number PRJNA1416287 . Ethics, Consent to Participate, and Consent to Publish declarations Not applicable. Acknowledgments We appreciate the management of Bogor Botanical Gardens and the National Research and Innovation Agency (BRIN) for giving access to the Dura-BBG sample. This manuscript is part of the AJM's master's (MSc) thesis. We acknowledge the support of Revocatus Petro Mushumbusi, PhD, the Director of Tanzania Forestry Research Institute (TAFORI), and his support for AJM's graduate program at IPB University, Bogor, Indonesia. AI was used to polish the English language quality in the manuscript preparation. Funding: Ally Juma Mkude, Rare Forest Profession, Tanzania Forest Fund Redi Aditama, The Oil Palm Research Grant, the Indonesian Palm Oil Fund Management Agency (BPDPKS), GRS_20220227110350 Sudarsono Sudarsono, The Oil Palm Research Grant, the Indonesian Palm Oil Fund Management Agency (BPDPKS), GRS_20220227110350 References Qaim M, Sibhatu KT, Siregar H, Grass I. Environmental, Economic, and Social Consequences of the Oil Palm Boom. Annu Rev Resour Econ [Internet]. 2020 Oct 6;12(1):321–44. Available from: https://www.annualreviews.org/doi/10.1146/annurev-resource-110119-024922 Meijaard E, Brooks TM, Carlson KM, Slade EM, Garcia-Ulloa J, Gaveau DLA, et al. The environmental impacts of palm oil in context. Nat Plants [Internet]. 2020 Dec 7;6(12):1418–26. Available from: https://www.nature.com/articles/s41477-020-00813-w Murphy DJ, Goggin K, Paterson RRM. Oil palm in the 2020s and beyond: challenges and solutions. CABI Agric Biosci [Internet]. 2021 Oct 11;2(1):39. Available from: https://www.cabidigitallibrary.org/doi/10.1186/s43170-021-00058-3 Meijaard E, Sheil D. The Moral Minefield of Ethical Oil Palm and Sustainable Development. Front For Glob Chang [Internet]. 2019 May 28;2. Available from: https://www.frontiersin.org/article/10.3389/ffgc.2019.00022/full Vijay V, Pimm SL, Jenkins CN, Smith SJ. The Impacts of Oil Palm on Recent Deforestation and Biodiversity Loss. Anand M, editor. PLoS One [Internet]. 2016 Jul 27;11(7):e0159668. Available from: https://dx.plos.org/10.1371/journal.pone.0159668 Soh AC, Mayes S, Roberts J. Introduction to the Oil Palm Crop. In: Oil Palm Breeding [Internet]. Boca Raton : Taylor & Francis, 2017.: CRC Press; 2017. p. 1–6. Available from: https://www.taylorfrancis.com/books/9781498715454/chapters/10.1201/9781315119724-1 Corley RHV, Tinker PB. The Oil Palm. Fifth. West Sussex: John Wiley & Sons, Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ , UK; 2016. 1–627 p. Magaña-Álvarez A, Pérez-Brito D, Cortés-Velázquez A, Nexticapan-Garcéz Á, Ortega-Ramírez ME, García-Cámara I, et al. Genetic Variability of Oil Palm in Mexico: An Assessment Based on Microsatellite Markers. Agric. 2023 Sep 1;13(9). Hayati A, Wickneswari R, Maizura I, Rajanaidu N. Genetic diversity of oil palm (Elaeis guineensis Jacq.) germplasm collections from Africa: implications for improvement and conservation of genetic resources. Theor Appl Genet [Internet]. 2004 May 16;108(7):1274–84. Available from: http://link.springer.com/10.1007/s00122-003-1545-0 Daniell H, Lin C sea, Yu M, Chang W jung. Chloroplast genomes : diversity , evolution , and applications in genetic engineering. Genome Biol. 2016;1–29. Wicke S, Schneeweiss GM, DePamphilis CW, Müller KF, Quandt D. The evolution of the plastid chromosome in land plants: gene content, gene order, gene function. Plant Mol Biol [Internet]. 2011 Jul 22;76(3–5):273–97. Available from: http://link.springer.com/10.1007/s11103-011-9762-4 Gitzendanner MA, Soltis PS, Wong GK ‐S., Ruhfel BR, Soltis DE. Plastid phylogenomic analysis of green plants: A billion years of evolutionary history. Am J Bot [Internet]. 2018 Mar 30;105(3):291–301. Available from: https://bsapubs.onlinelibrary.wiley.com/doi/10.1002/ajb2.1048 Jansen RK, Ruhlman TA. Plastid Genomes of Seed Plants. In 2012. p. 103–26. Available from: http://link.springer.com/10.1007/978-94-007-2920-9_5 Palmer JD. COMPARATIVE ORGANIZATION OF CHLOROPLAST GENOMES. Annu Rev Genet [Internet]. 1985 Dec;19(1):325–54. Available from: https://www.annualreviews.org/doi/10.1146/annurev.ge.19.120185.001545 Shinozaki K, Ohme M, Tanaka M, Wakasugi T, Hayashida N, Matsubayashi T, et al. The complete nucleotide sequence of the tobacco chloroplast genome: its gene organization and expression. EMBO J [Internet]. 1986 Sep;5(9):2043–9. Available from: https://link.springer.com/article/10.1002/j.1460-2075.1986.tb04464.x Uthaipaisanwong P, Chanprasert J, Shearman JR, Sangsrakru D, Yoocha T, Jomchai N. Characterization of the chloroplast genome sequence of oil palm ( Elaeis guineensis Jacq .). 2012;500:172–80. Shaw J, Lickey EB, Beck JT, Farmer SB, Liu W, Miller J, et al. The tortoise and the hare II: relative utility of 21 noncoding chloroplast DNA sequences for phylogenetic analysis. Am J Bot [Internet]. 2005 Jan;92(1):142–66. Available from: https://bsapubs.onlinelibrary.wiley.com/doi/10.3732/ajb.92.1.142 Dong W, Liu J, Yu J, Wang L, Zhou S. Highly Variable Chloroplast Markers for Evaluating Plant Phylogeny at Low Taxonomic Levels and for DNA Barcoding. Moustafa A, editor. PLoS One [Internet]. 2012 Apr 12;7(4):e35071. Available from: https://dx.plos.org/10.1371/journal.pone.0035071 Poczai P, Varga I, Laos M, Cseh A, Bell N, Valkonen JP, et al. Advances in plant gene-targeted and functional markers: a review. Plant Methods [Internet]. 2013;9(1):6. Available from: http://plantmethods.biomedcentral.com/articles/10.1186/1746-4811-9-6 Rahmawati A, Volkaert HA, Dinarti D, Maskromo I, Hatta ANNL, Sudarsono S. Complete Chloroplast Genome Sequences of Coconut cv. Kopyor Green Dwarf and Comparative Genome Analysis to Oil Palm, Date Palm, Sago Palm, and Miniature Sugar Palm. In: Oil Crop Genomics [Internet]. Cham: Springer International Publishing; 2021. p. 189–216. Available from: https://link.springer.com/10.1007/978-3-030-70420-9_10 Arshad A, Aditama R, Rahayu MSRI, Natawijaya A. Comparative study of chloroplast genomes across seven Salacca species. 2024;25(11):4043–58. Putu NI, Asih SRI, Pratiwi RA, Prihatini I, Kuswandi PC, Yulianti EVY, et al. Comparative analysis of the complete chloroplast genome of the rare endemic palm Pinanga arinasae Witono from Bali , Indonesia. 2025;49(5):432–46. Daniell H, Lin CS, Yu M, Chang WJ. Chloroplast genomes: Diversity, evolution, and applications in genetic engineering. Vol. 17, Genome Biology. 2016. p. 1–29. Khan AL, Asaf S, Lee IJ, Al-Harrasi A, Al-Rawahi A. First chloroplast genomics study of Phoenix dactylifera (var. Naghal and Khanezi): A comparative analysis. Chen S, editor. PLoS One [Internet]. 2018 Jul 31;13(7):e0200104. Available from: https://dx.plos.org/10.1371/journal.pone.0200104 Chan KL, Tatarinova T V., Rosli R, Amiruddin N, Azizi N, Halim MAA, et al. Evidence-based gene models for structural and functional annotations of the oil palm genome. Biol Direct [Internet]. 2017 Dec 8;12(1):21. Available from: https://biologydirect.biomedcentral.com/articles/10.1186/s13062-017-0191-4 Mohd Talkah NS, Aziz NAKA, Rahim MFA, Hanafi NFF, Ahmad Mokhtar MA, Othman AS. The chloroplast genome inheritance pattern of the Deli-Nigerian prospection material (NPM) × Yangambi population of Elaeis guineensis Jacq. PeerJ [Internet]. 2024 May 27;12:e17335. Available from: https://peerj.com/articles/17335 Malike FA, Marhalil M, Zulkifli Y, Aziz NA, Oyebamiji YO. Genetic Variability of Oil Palm ( Elaeis guineensis Jacq .) Deli Dura Inter-Crosses Population in The MPOB Research Station Hulu Paka ,. 2024;53:125–33. Gan ST, Teo CJ, Manirasa S, Wong WC, Wong CK. Assessment of genetic diversity and population structure of oil palm (Elaeis guineensis Jacq.) field genebank: A step towards molecular-assisted germplasm conservation. Avramidou E V., editor. PLoS One [Internet]. 2021 Jul 29;16(7):e0255418. Available from: https://dx.plos.org/10.1371/journal.pone.0255418 Zaki NM, Singh R, Rosli R, Ismail I. Elaeis oleifera Genomic-SSR Markers : Exploitation in Oil Palm Germplasm Diversity and Cross-Amplification in Arecaceae. 2012;4069–88. Osorio-Guarín JA, Garzón-Martínez GA, Delgadillo-Duran P, Bastidas S, Moreno LP, Enciso-Rodríguez FE, et al. Genome-wide association study (GWAS) for morphological and yield-related traits in an oil palm hybrid (Elaeis oleifera x Elaeis guineensis) population. BMC Plant Biol [Internet]. 2019 Dec 3;19(1):533. Available from: https://bmcplantbiol.biomedcentral.com/articles/10.1186/s12870-019-2153-8 Corley RHV, Tinker PB. The Oil Palm [Internet]. Wiley; 2015. Available from: https://onlinelibrary.wiley.com/doi/book/10.1002/9781118953297 Chung KP. Cytoplasmic inheritance: The transmission of plastid and mitochondrial genomes across cells and generations. Plant Physiol [Internet]. 2025 Apr 30;198(1). Available from: https://academic.oup.com/plphys/article/doi/10.1093/plphys/kiaf168/8123037 Goodwin S, McPherson JD, McCombie WR. Coming of age: ten years of next-generation sequencing technologies. Nat Rev Genet [Internet]. 2016 Jun 17;17(6):333–51. Available from: https://www.nature.com/articles/nrg.2016.49 Sun X, Hu Y, Song Z. Library construction of metabarcoding at DNBSEQ-G400 with MGIEasy universal DNA library prep Kit v1 [Internet]. 2020. Available from: https://www.protocols.io/view/library-construction-of-metabarcoding-at-dnbseq-g4-bn95mh86 Begum G, Albanna A, Bankapur A, Nassir N, Tambi R, Berdiev BK, et al. Long-Read Sequencing Improves the Detection of Structural Variations Impacting Complex Non-Coding Elements of the Genome. Int J Mol Sci [Internet]. 2021 Feb 19;22(4):2060. Available from: https://www.mdpi.com/1422-0067/22/4/2060 Bentz ML, Clines C, Padilla J, Horn C, Lee J, Sheth M. High-throughput PacBio library preparation and sequencing techniques for genomic DNA and TNA. Front Genet [Internet]. 2025 Aug 6;16. Available from: https://www.frontiersin.org/articles/10.3389/fgene.2025.1587691/full Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics [Internet]. 2018 Sep 1;34(17):i884–90. Available from: https://academic.oup.com/bioinformatics/article/34/17/i884/5093234 Dierckxsens N, Mardulyn P, Smits G. NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucleic Acids Res [Internet]. 2016 Oct 24;gkw955. Available from: https://academic.oup.com/nar/article-lookup/doi/10.1093/nar/gkw955 Tillich M, Lehwark P, Pellizzer T, Ulbricht-Jones ES, Fischer A, Bock R, et al. GeSeq – versatile and accurate annotation of organelle genomes. Nucleic Acids Res [Internet]. 2017 Jul 3;45(W1):W6–11. Available from: https://academic.oup.com/nar/article-lookup/doi/10.1093/nar/gkx391 Chan PP, Lin BY, Mak AJ, Lowe TM. tRNAscan-SE 2.0: improved detection and functional classification of transfer RNA genes. Nucleic Acids Res [Internet]. 2021 Sep 20;49(16):9077–96. Available from: https://academic.oup.com/nar/article/49/16/9077/6355886 Lohse M, Drechsel O, Kahlau S, Bock R. OrganellarGenomeDRAW—a suite of tools for generating physical maps of plastid and mitochondrial genomes and visualizing expression data sets. Nucleic Acids Res [Internet]. 2013 Jul 1;41(W1):W575–81. Available from: http://academic.oup.com/nar/article/41/W1/W575/1092342/OrganellarGenomeDRAWa-suite-of-tools-for Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, et al. Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 2012 Jun;28(12):1647–9. Katoh K, Standley DM. MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability. Mol Biol Evol. 2013 Apr;30(4):772–80. Amiryousefi A, Hyvönen J, Poczai P. IRscope: an online program to visualize the junction sites of chloroplast genomes. Hancock J, editor. Bioinformatics [Internet]. 2018 Sep 1;34(17):3030–1. Available from: https://academic.oup.com/bioinformatics/article/34/17/3030/4961430 Darling ACE, Mau B, Blattner FR, Perna NT. Mauve: Multiple Alignment of Conserved Genomic Sequence With Rearrangements. Genome Res [Internet]. 2004 Jul;14(7):1394–403. Available from: http://genome.cshlp.org/lookup/doi/10.1101/gr.2289704 Mayer C, Leese F, Tollrian R. Genome-wide analysis of tandem repeats in Daphnia pulex - a comparative approach. 2010; Saitou N, Nei M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol [Internet]. 1987 Jul;4(4):406–25. Available from: https://academic.oup.com/mbe/article/4/4/406/1029664/The-neighborjoining-method-a-new-method-for Tamura K, Stecher G, Kumar S. MEGA11: Molecular Evolutionary Genetics Analysis Version 11. Battistuzzi FU, editor. Mol Biol Evol [Internet]. 2021 Jun 25;38(7):3022–7. Available from: https://academic.oup.com/mbe/article/38/7/3022/6248099 Bandelt HJ, Forster P, Rohl A. Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol [Internet]. 1999 Jan 1;16(1):37–48. Available from: https://academic.oup.com/mbe/article-lookup/doi/10.1093/oxfordjournals.molbev.a026036 Li DM, Pan YG, Wu XY, Zou SP, Wang L, Zhu GF. Comparative chloroplast genomics , phylogenetic relationships and molecular markers development of Aglaonema commutatum and seven green cultivars of Aglaonema. Sci Rep [Internet]. 2024;1–17. Available from: https://doi.org/10.1038/s41598-024-62586-y Bakoumé C, Wickneswari R, Rajanaidu N, Kushairi A, Amblard P, Billotte N. Allelic diversity of natural oil palm ( Elaeis guineensis Jacq.) populations detected by microsatellite markers: implications for conservation. Plant Genet Resour [Internet]. 2007 Aug 1;5(2):104–7. Available from: https://www.cambridge.org/core/product/identifier/S1479262107710870/type/journal_article Bakoumé C, Wickneswari R, Siju S, Rajanaidu N, Kushairi A, Billotte N. Genetic diversity of the world’s largest oil palm (Elaeis guineensis Jacq.) field genebank accessions using microsatellite markers. Genet Resour Crop Evol [Internet]. 2015 Mar 3;62(3):349–60. Available from: http://link.springer.com/10.1007/s10722-014-0156-8 B. KB, K. L. MR, Sahu S, Mathur RK, P. NK, G. R, et al. Development and validation of whole genome-wide and genic microsatellite markers in oil palm (Elaeis guineensis Jacq.): First microsatellite database (OpSatdb). Sci Rep [Internet]. 2019 Feb 13;9(1):1899. Available from: https://www.nature.com/articles/s41598-018-37737-7 Zhou L, Yarra R, Zhao Z, Jin L, Cao H. Development of SSR markers based on transcriptome data and association mapping analysis for fruit shell thickness associated traits in oil palm (Elaeis guineensis Jacq.). 3 Biotech [Internet]. 2020 Jun;10(6):280. Available from: https://link.springer.com/10.1007/s13205-020-02269-3 Wu H, Li DZ, Ma PF. Unprecedented variation pattern of plastid genomes and the potential role in adaptive evolution in Poales. BMC Biol [Internet]. 2024 Apr 29;22(1):97. Available from: https://bmcbiol.biomedcentral.com/articles/10.1186/s12915-024-01890-5 Tu XD, Lin WJ, Fu HH, Lin YZ, Shen J, Chen S, et al. Comparative Analysis of Six Complete Plastomes of Tripterospermum spp. Int J Mol Sci [Internet]. 2024 Feb 22;25(5):2534. Available from: https://www.mdpi.com/1422-0067/25/5/2534 SRI ASIH NP, PRATIWI RA, PRIHATINI I, KUSWANDI PC, YULIANTI E, PRIYADI A. Comparative analysis of the complete chloroplast genome of the rare endemic palm Pinanga arinasae Witono from Bali, Indonesia. Turk J Botany [Internet]. 2025 Sep 19;49(5):432–46. Available from: https://journals.tubitak.gov.tr/botany/vol49/iss5/7 Id F, Cauz-santos LA, Morales JA, Alencar D, Picanc D, Teresa M, et al. Complete chloroplast genomes and phylogeny in three Euterpe palms ( E . edulis , E . oleracea and E . precatoria ) from different Brazilian biomes. 2022;1–26. Wang J, Kan S, Liao X, Zhou J, Tembrock LR, Daniell H, et al. Plant organellar genomes: much done, much more to do. Trends Plant Sci [Internet]. 2024 Jul;29(7):754–69. Available from: https://linkinghub.elsevier.com/retrieve/pii/S1360138523004120 Singh R, Low ETL, Ooi LCL, Ong-Abdullah M, Ting NC, Nagappan J, et al. The oil palm SHELL gene controls oil yield and encodes a homologue of SEEDSTICK. Nature [Internet]. 2013 Aug 24;500(7462):340–4. Available from: https://www.nature.com/articles/nature12356 Mohd Talkah NS, Aziz NAKA, Rahim MFA, Hanafi NFF, Ahmad Mokhtar MA, Othman AS. The chloroplast genome inheritance pattern of the Deli-Nigerian prospection material (NPM) × Yangambi population of Elaeis guineensis Jacq. PeerJ. 2024 May;12:e17335. Singh R, Ong-Abdullah M, Low ETL, Manaf MAA, Rosli R, Nookiah R, et al. Oil palm genome sequence reveals divergence of interfertile species in Old and New worlds. Nature [Internet]. 2013 Aug 24;500(7462):335–9. Available from: https://www.nature.com/articles/nature12309 Fadila AM, Marhalil M, Zulkifli Y, Shamsudin NAA, Oyebamiji YO. Genetic Variability of Oil Palm (Elaeis guineensis Jacq.) Deli Dura Inter-Crosses Population in The MPOB Research Station Hulu Paka, Terengganu, Malaysia. Malaysian Appl Biol [Internet]. 2024 Jun 30;53(2):125–33. Available from: https://jms.mabjournal.com/index.php/mab/article/view/2975 Additional Declarations No competing interests reported. Supplementary Files AllyMkudeSupplementaryTableS1.docx AllyMkudeSupplementaryTableS2.docx AllyMkudeSupplementaryTableS3.docx AllyMkudeSupplementaryTableS4.docx Cite Share Download PDF Status: Under Revision Version 1 posted Editorial decision: Revision requested 19 Mar, 2026 Reviews received at journal 05 Mar, 2026 Reviews received at journal 01 Mar, 2026 Reviews received at journal 27 Feb, 2026 Reviews received at journal 24 Feb, 2026 Reviews received at journal 24 Feb, 2026 Reviewers agreed at journal 23 Feb, 2026 Reviewers agreed at journal 19 Feb, 2026 Reviewers agreed at journal 19 Feb, 2026 Reviewers agreed at journal 19 Feb, 2026 Reviewers agreed at journal 18 Feb, 2026 Reviewers invited by journal 18 Feb, 2026 Editor assigned by journal 18 Feb, 2026 Editor invited by journal 06 Feb, 2026 Submission checks completed at journal 06 Feb, 2026 First submitted to journal 06 Feb, 2026 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-8662850","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":594827976,"identity":"0b1babfa-d639-4993-97db-f3143c9c214c","order_by":0,"name":"Ally Juma Mkude","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAAv0lEQVRIiWNgGAWjYDACHgYGCQYeGzkwiwQtMmnGpGqxOZTYQLQW3Z4zhjc+5BxI33D87MEHHxjs5HQbCGgxO9tjbDnjzJ3cDWfykg1nMCQbmx0gpOU8j5k0b8+z3A0HcsykeRgOJG4jSsvff4fTDc6/IVbL2R4zaQaewwkGN4i25cyxYssenjTDmTfeGBvOMCDGL2eSN974wWMjz3c+x/DBhwo7OYJa4EABrNKAWOUgIN9AiupRMApGwSgYUQAAWKtEh1Lzu0AAAAAASUVORK5CYII=","orcid":"","institution":"Tanzanian Forestry Research Institute","correspondingAuthor":true,"prefix":"","firstName":"Ally","middleName":"Juma","lastName":"Mkude","suffix":""},{"id":594827977,"identity":"2480a8e5-9eab-4ba6-90b7-8cbed16fa269","order_by":1,"name":"Redi Aditama","email":"","orcid":"","institution":"IPB University","correspondingAuthor":false,"prefix":"","firstName":"Redi","middleName":"","lastName":"Aditama","suffix":""},{"id":594827978,"identity":"76b5086d-83da-4fd6-99f5-13ef8357847d","order_by":2,"name":"Sintho W. Ardie","email":"","orcid":"","institution":"IPB University","correspondingAuthor":false,"prefix":"","firstName":"Sintho","middleName":"W.","lastName":"Ardie","suffix":""},{"id":594827979,"identity":"03ef91c5-f1d8-4942-8be5-66c8eb0c7ed5","order_by":3,"name":"Willy B. Suwarno","email":"","orcid":"","institution":"IPB University","correspondingAuthor":false,"prefix":"","firstName":"Willy","middleName":"B.","lastName":"Suwarno","suffix":""},{"id":594827980,"identity":"d4411abf-2d63-46d7-9ec1-f0e4be05cc40","order_by":4,"name":"Azis Natawijaya","email":"","orcid":"","institution":"PT Bumitama Gunajaya Agro","correspondingAuthor":false,"prefix":"","firstName":"Azis","middleName":"","lastName":"Natawijaya","suffix":""},{"id":594827981,"identity":"bda4e4dc-5dad-481b-a4f2-64ab845081e2","order_by":5,"name":"Indri Gita Lestari","email":"","orcid":"","institution":"IPB University","correspondingAuthor":false,"prefix":"","firstName":"Indri","middleName":"Gita","lastName":"Lestari","suffix":""},{"id":594827982,"identity":"98687eee-3cf4-4834-8f1d-9db92140b117","order_by":6,"name":"Sudarsono Sudarsono","email":"","orcid":"","institution":"IPB University","correspondingAuthor":false,"prefix":"","firstName":"Sudarsono","middleName":"","lastName":"Sudarsono","suffix":""}],"badges":[],"createdAt":"2026-01-21 18:39:40","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-8662850/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-8662850/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":103262446,"identity":"0745f149-2d8e-475f-9dca-894b365c5a79","added_by":"auto","created_at":"2026-02-23 18:32:13","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":166535,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eSequencing depth and coverage profiles of Tanzanian Dura (Dura‑Tz) and Pisifera (Pisifera‑Tz) plastomes.\u003c/strong\u003e (A) Dura‑Tz assembly showing coverage depths up to ~3,500 reads. (B) Pisifera‑Tz assembly reaching ~7,000 reads, reflecting higher sequencing throughput. Coverage is generally uniform, with peaks in inverted repeat (IR) regions and troughs in AT‑rich or non‑coding regions. Greater depth in Pisifera‑Tz enhances confidence in variant detection and structural resolution.\u003c/p\u003e","description":"","filename":"1.png","url":"https://assets-eu.researchsquare.com/files/rs-8662850/v1/9f703fda248ace34ef2c3e4c.png"},{"id":103262447,"identity":"3055a613-e016-4329-bc58-c623adaacf2b","added_by":"auto","created_at":"2026-02-23 18:32:13","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":235241,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eCircular maps of Dura‑Tz (PX957236) and Pisifera‑Tz (PX957237) chloroplast genomes.\u003c/strong\u003e Genome organization showing large single‑copy (LSC), small single‑copy (SSC), and inverted repeat (IR) regions. Gene content and order are highly conserved, with minor differences in intergenic spacers and IR–SC boundaries.\u003c/p\u003e","description":"","filename":"2.png","url":"https://assets-eu.researchsquare.com/files/rs-8662850/v1/5806845c5047248cc025d433.png"},{"id":103505263,"identity":"140a1fee-bd71-480a-aea7-b9f66397618c","added_by":"auto","created_at":"2026-02-26 13:29:09","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":28303,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eDistribution of simple sequence repeats (SSRs) across chloroplast genomes.\u003c/strong\u003e Comparison of SSR types and densities among Dura‑Tz, Pisifera‑Tz, Dura‑Cmr, and Dura‑BBG. Mononucleotide A/T repeats dominate across all plastomes, with Pisifera‑Tz showing markedly reduced genic SSR density.\u003c/p\u003e","description":"","filename":"3.png","url":"https://assets-eu.researchsquare.com/files/rs-8662850/v1/00e1774dff3203a48110b404.png"},{"id":103262449,"identity":"6d66d086-6987-4a75-97b5-2680fe158f9f","added_by":"auto","created_at":"2026-02-23 18:32:13","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":278350,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eMultiple genome alignment of 16 \u003c/strong\u003e\u003cem\u003e\u003cstrong\u003eElaeis guineensis\u003c/strong\u003e\u003c/em\u003e\u003cstrong\u003e plastomes\u003c/strong\u003e. Showing structural conservation and localize rearrangements. The alignment was performed using the Mauve algorithm to identify Local Collinear Blocks (LCBs). Each horizontal track represents an individual accession, including Dura, Pisifera, and Tenera fruit forms, with the reference genome NC-017602 included for comparison. The green LCB (left) corresponds to the Large Single Copy (LSC) region, while the teal LCB (right) represents the Small Single Copy (SSC) and Inverted Repeat regions. The vertical height of the colored profiles indicates the degree of sequence conservation. Colored lined connecting the tracks denote synteny regions; diagonal red lines and displaced red blocks in accessions Pisifera-Id_221 and pisifera-Id_1109 highlight structural variations, including translocations and inversions relative to the reference. Scale bar at the top indicates genomic position in base pairs.\u003c/p\u003e","description":"","filename":"4.png","url":"https://assets-eu.researchsquare.com/files/rs-8662850/v1/6f27cba25ce50feb9848c349.png"},{"id":103506104,"identity":"a1b8f0cd-e9c1-4ec8-a05a-56dad502082c","added_by":"auto","created_at":"2026-02-26 13:34:06","extension":"png","order_by":5,"title":"Figure 5","display":"","copyAsset":false,"role":"figure","size":512500,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eComparison of inverted repeat (IR) boundary regions among16 oil palm (\u003c/strong\u003e\u003cem\u003e\u003cstrong\u003eElaeis guineensis\u003c/strong\u003e\u003c/em\u003e\u003cstrong\u003e) plastomes using IRscope\u003c/strong\u003e\u003c/p\u003e","description":"","filename":"5.png","url":"https://assets-eu.researchsquare.com/files/rs-8662850/v1/ac580e44d2290c3d0dfd2d96.png"},{"id":103262456,"identity":"01028557-7291-4b80-a0ff-bd2a105c8e59","added_by":"auto","created_at":"2026-02-23 18:32:13","extension":"png","order_by":6,"title":"Figure 6","display":"","copyAsset":false,"role":"figure","size":219068,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003ePhylogenetic relationships among oil palm accessions inferred using the Maximum Likelihood method and Tamura -Nei (TN93+G+I) model based on\u003c/strong\u003e \u003cstrong\u003elikelihood tree based on complete plastome sequences\u003c/strong\u003e. Pisifera‑Tz clusters with Dura‑Cmr, while Dura‑Tz and Dura‑BBG form distinct clades, reflecting multiple maternal lineages.\u003c/p\u003e","description":"","filename":"6.png","url":"https://assets-eu.researchsquare.com/files/rs-8662850/v1/d6f180b81534dc097e3c9fd1.png"},{"id":103505636,"identity":"6c4fda53-d3a2-44ed-9a30-aa758f93a145","added_by":"auto","created_at":"2026-02-26 13:32:17","extension":"png","order_by":7,"title":"Figure 7","display":"","copyAsset":false,"role":"figure","size":133246,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cem\u003eThe fourteen haplotypes from 48 Elaeis guineensis\u003c/em\u003eplastome genomes, with circle sizes proportional to haplotype frequency and red squares (mv1–mv5) denoting inferred median vectors.\u003c/p\u003e","description":"","filename":"7.png","url":"https://assets-eu.researchsquare.com/files/rs-8662850/v1/3e8d4e2c7703609c1cc3b24f.png"},{"id":103510940,"identity":"92de3f7d-3745-48b6-b78d-3c6b8b65c38d","added_by":"auto","created_at":"2026-02-26 14:07:52","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":2907215,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-8662850/v1/311e9432-4143-437b-9e8e-bc024af0b895.pdf"},{"id":103506217,"identity":"27148d5c-2393-43a9-99a6-3d78de91426f","added_by":"auto","created_at":"2026-02-26 13:34:39","extension":"docx","order_by":0,"title":"","display":"","copyAsset":false,"role":"supplement","size":15431,"visible":true,"origin":"","legend":"","description":"","filename":"AllyMkudeSupplementaryTableS1.docx","url":"https://assets-eu.researchsquare.com/files/rs-8662850/v1/de6b8a431048ae846f2e655e.docx"},{"id":103506203,"identity":"4389fa8b-e747-4d3a-84f3-a5eaff234457","added_by":"auto","created_at":"2026-02-26 13:34:34","extension":"docx","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":61590,"visible":true,"origin":"","legend":"","description":"","filename":"AllyMkudeSupplementaryTableS2.docx","url":"https://assets-eu.researchsquare.com/files/rs-8662850/v1/1f2a0cef9e29a283b3c7f00d.docx"},{"id":103505122,"identity":"ddacbd8d-e046-465b-bb12-68e1de36603f","added_by":"auto","created_at":"2026-02-26 13:24:30","extension":"docx","order_by":2,"title":"","display":"","copyAsset":false,"role":"supplement","size":18657,"visible":true,"origin":"","legend":"","description":"","filename":"AllyMkudeSupplementaryTableS3.docx","url":"https://assets-eu.researchsquare.com/files/rs-8662850/v1/5b65ddc370e5bcf00fdbf2db.docx"},{"id":103506543,"identity":"366c5e9a-0fa0-442e-8401-bb6a8c20726e","added_by":"auto","created_at":"2026-02-26 13:37:31","extension":"docx","order_by":3,"title":"","display":"","copyAsset":false,"role":"supplement","size":14544,"visible":true,"origin":"","legend":"","description":"","filename":"AllyMkudeSupplementaryTableS4.docx","url":"https://assets-eu.researchsquare.com/files/rs-8662850/v1/e9fc45e252438e05004b3a31.docx"}],"financialInterests":"No competing interests reported.","formattedTitle":"Comparative Chloroplast Genomics and Maternal Lineage Diversity in Oil Palm (Elaeis guineensis)","fulltext":[{"header":"Introduction","content":"\u003cp\u003eOil palm (\u003cem\u003eElaeis guineensis\u003c/em\u003e Jacq.) is the most productive source of vegetable oil worldwide, contributing more than one-third of the global supply (1). Its high oil yield per unit area has made it central to food security, bioenergy, and industrial applications (2,3). However, the rapid expansion of oil palm cultivation has raised concerns about deforestation, biodiversity loss, and sustainability (4,5). In addition, the limited genetic base of current breeding populations constrains progress in developing resilience traits, yield stability, and climate adaptation (6,7).\u003c/p\u003e \u003cp\u003eThe narrow genetic base of Southeast Asian oil palm populations largely stems from a small number of African introductions during the colonial era (7). This founder effect has restricted genetic variation in commercially important lines, particularly the Deli Dura population in Indonesia (8). In contrast, African wild populations harbor significant allelic richness and represent untapped reservoirs of genetic diversity (8,9). Expanding cytoplasmic and nuclear variation is therefore critical for sustainable cultivation and long-term breeding success (10).\u003c/p\u003e \u003cp\u003eChloroplast genomes (plastomes) are powerful tools for studying plant diversity, evolution, and adaptation (11,12). Plastomes play essential roles in photosynthesis, energy metabolism, and biosynthesis of amino acids, fatty acids, and pigments. Their conserved quadripartite structure, uniparental inheritance, and relatively low mutation rate make them valuable for phylogenetics, species differentiation, and lineage tracing (13). In flowering plants, plastomes typically range from 120 to 160 kb and encode 110 to 130 genes involved in transcription, translation, and photosynthesis (14,15). Despite their conservation, single-nucleotide polymorphisms, indels, and structural variations in non-coding regions provide informative markers for evolutionary research, germplasm authentication, and lineage tracing (16,17).\u003c/p\u003e \u003cp\u003ePrevious studies of oil palm plastomes revealed highly conserved gene content and organization, consistent with other members of the palm family (Arecaceae) (18). The first complete \u003cem\u003eE. guineensis\u003c/em\u003e plastome established a baseline structure of ~\u0026thinsp;156,973 bp (18), and subsequent work confirmed its stability across the species (19,20). Nevertheless, comparative analyses identified nucleotide variations concentrated in single-copy regions and intergenic spacers, offering opportunities to differentiate closely related lineages (21). Commercial planting materials are derived from hybridization between Dura (thick-shelled) and Pisifera (shell-less) types to produce the high-yielding Tenera hybrid (22). Tracing maternal lineages is therefore essential for germplasm authentication and understanding cytoplasmic genetic effects (23,24)\u003c/p\u003e \u003cp\u003eGiven the narrow maternal base of commercially dominant Deli Dura lines (16), resolving plastome diversity across wild and cultivated accessions is critical for breeding programs (25,26). While nuclear genomic studies of E. guineensis and E. oleifera are extensive (27\u0026ndash;30), detailed comparative analyses of intraspecific plastome diversity remain limited. Even small numbers of variable sites within plastomes can be powerful for differentiating lineages and mapping cytoplasmic diversity (31). Characterizing plastome variation across diverse African accessions is thus essential for conservation, breeding authentication, and marker development (32).\u003c/p\u003e \u003cp\u003eTanzania and Cameroon represent centers of origin and diversity for oil palm, harboring ancient lineages and wild populations that could significantly enrich global breeding programs (9,32). In contrast, Indonesian Dura populations descend from a narrow genetic base introduced from Africa in the early 20th century (16). Comparative plastome analyses across African regions, therefore, offer unique insights into domestication history, adaptation mechanisms, and maternal lineage divergence (27)\u003c/p\u003e \u003cp\u003eRecent advances in sequencing technologies have transformed plastome research (33). Short-read platforms such as Illumina and MGI DNBSEQ provide cost-effective accuracy for sequencing and variant detection but struggle with repetitive regions and structural variations (34). Long-read platforms such as PacBio HiFi and Oxford Nanopore generate reads spanning thousands of bases, enabling resolution of repetitive elements and structural variants (35). Long reads improve assembly contiguity, facilitate haplotype phasing, and enable precise detection of complex rearrangements (35). Comparing short- and long-read assemblies provides critical validation for plastome reconstruction, particularly in complex plant lineages such as oil palm (34). However, systematic comparisons of sequencing platform performance in oil palm plastomes remain scarce.\u003c/p\u003e \u003cp\u003eThis study addresses these gaps by sequencing and analyzing chloroplast genomes of Tanzanian Dura (Dura-Tz) and Pisifera (Pisifera-Tz) accessions. We integrate these with previously sequenced Cameroonian Dura (Dura-Cmr), the ancestral Deli Dura from Indonesia (Dura-BBG), 29 Indonesian breeding lines, and 15 plastome accessions from public databases. By benchmarking MGI and PacBio sequencing platforms, we highlight their complementary strengths for plastome assembly and variant detection. Collectively, this work provides new insights into plastome diversity in E. guineensis, advances understanding of domestication history, and supports the development of plastome-based molecular markers for breeding, conservation, and genetic resource management.\u003c/p\u003e"},{"header":"Methods","content":"\u003cdiv id=\"Sec3\" class=\"Section2\"\u003e \u003ch2\u003eOil palm chloroplast genome samples\u003c/h2\u003e \u003cp\u003eTwo chloroplast genomes were sequenced in this study, including one Tanzanian Dura type (Dura-Tz) and one Tanzanian Pisifera type (Pisifera-Tz). The selected plant materials was identified by Mr. Basil E. Kavishe, Research Assistant at Tanzania Agricultural Research Institute (TARI). No voucher specimen was deposited in publicly accessible herbarium. However, the plant materials were photographed during sample collection, and they are now monitored by TARI for reference and future studies. Young leaves were collected from the Kwitanga plantation, Kigoma, Tanzania. Samples were cleaned, cut into ~\u0026thinsp;20 cm pieces, sterilized with 70% ethanol, air-dried, sealed in airtight plastic bags, and transported in a cold box to PT Genetika Science, Indonesia, for sequencing. Two representative wild oil palm samples, one Cameroonian Dura-type (Dura-Cmr) and one Deli Dura progenitor from Bogor Botanical Garden, Indonesia (Dura-BBG), were sequenced in a previous project at PMB Lab, IPB University (NCBI accession numbers are on Progress). Moreover, as many as twenty-nine (29) samples were also from previously sequenced chloroplast genomes and obtained from Dr. Azis Natawijaya (PT Bumitama Gunajaya Agro Company). Inquiries for these 29 sequences should be directly forwarded to Dr. Azis Natawijaya. Other cpDNAs were obtained from NCBI GenBank DNA Database (\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://ncbi.nlm.nih.gov/\u003c/span\u003e\u003cspan address=\"https://ncbi.nlm.nih.gov/\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e). All chloroplast genome sequences generated in this study, including their associated genome annotations, are publicly available in the NCBI database under BioProject accession number \u003cb\u003ePRJNA1416287\u003c/b\u003e.\u003c/p\u003e \u003c/div\u003e\n\u003ch3\u003eChloroplast genome sequencing and assembly\u003c/h3\u003e\n\u003cp\u003eShort-read sequencing of Dura-Tz and Pisifera-Tz total genomes was performed using the MGI DNBSEQ-G400 platform (34) (\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://mgi-tech.eu/sequencing-products/dnbseq-g400\u003c/span\u003e\u003cspan address=\"https://mgi-tech.eu/sequencing-products/dnbseq-g400\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e) (PE150) with MGIEasy FS DNA Library Prep Set. Long-read sequencing of Dura-Tz total genomes was conducted using the PacBio Vega platform with HiFi Plex Prep Kit 96 (PacBio, 103-122-800) (36) (\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://www.pacb.com/wp-content/uploads/Insert-HiFi-plex-prep-kit-96\u003c/span\u003e\u003cspan address=\"https://www.pacb.com/wp-content/uploads/Insert-HiFi-plex-prep-kit-96\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e).\u003c/p\u003e \u003cp\u003eRaw reads of Dura-Tz and Pisifera-Tz were processed with \u003cb\u003eFastp v0.20.1\u003c/b\u003e (37) to trim low-quality sequences and adaptor contamination. De novo assembly was performed using \u003cb\u003eNOVOPlasty v4.3.5\u003c/b\u003e (38), and the assemblies were polished using \u003cb\u003eEnsemblPlants56\u003c/b\u003e (\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://plants.ensembl.org\u003c/span\u003e\u003cspan address=\"https://plants.ensembl.org\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e).\u003c/p\u003e \u003cp\u003eSequencing depth and coverage of the Dura-Tz and Pisifera-Tz chloroplast genomes were evaluated by mapping raw reads back to the assembled plastomes. Depth profiles were generated across the entire genome length (0\u0026ndash;156 kb) for both Dura-Tz and Pisifera-Tz accessions. Coverage plots were inspected to confirm uniformity across large single copy (LSC), small single copy (SSC), and inverted repeat (IR) regions.\u003c/p\u003e\n\u003ch3\u003eChloroplast genome annotation\u003c/h3\u003e\n\u003cp\u003eAssembled genomes in FASTA format were annotated using \u003cb\u003eGeSeq\u003c/b\u003e within CHLOROBOX (39) and the \u003cb\u003eDual Organellar Genome Annotator (DOGMA)\u003c/b\u003e with default parameters\u0026mdash;annotation employed 95% sequence similarity thresholds for protein-coding genes, rRNAs, and tRNAs. \u003cb\u003eBLASTX\u003c/b\u003e and \u003cb\u003eBLASTN\u003c/b\u003e were used to identify coding genes and rRNAs, while \u003cb\u003etRNAscan-SE v2.0.7\u003c/b\u003e was used for tRNA annotation (40). Intron boundaries, start, and stop codons were manually curated by comparison with a combination of many related species of \u003cem\u003eArecaceae\u003c/em\u003e plastomes. Genome maps were visualized using \u003cb\u003eOGDRAW\u003c/b\u003e (41).\u003c/p\u003e\n\u003ch3\u003eComparative analysis of Plastome structure and Synteny\u003c/h3\u003e\n\u003cp\u003eUsing assembled chloroplast genomes, the base composition and frequencies of the LSC, SSC, and IR regions were determined in \u003cb\u003eGeneious Prime v2024.0.7\u003c/b\u003e (42). Two multiple progressive sequence alignments were conducted; the first one included the two new plastomes plus Dura-BBG and Dura-Cmr. Multiple sequence alignments of the plastomes were generated using \u003cb\u003eMAFFT v7.490\u003c/b\u003e (43). IR boundary variation was visualized using \u003cb\u003eIRscope\u003c/b\u003e (44). The second analysis was carried out using the two new plastomes plus Dura-BBG and Dura-Cmr and six Indonesian breeding lines (Dura-Id_574, Dura-Id_101128179, Pisifera-Id_221, Pisifera-Id_1169, Tenera-Id_1093, Tenera-Id_15730) and six plastomes that were available on NCBI (ON248756, NC_017602, OR125034, OR125040, OR125048, OR125054) in Mauve v2.4.0 (45).\u003c/p\u003e\n\u003ch3\u003eSimple sequence repeats (SSRs)\u003c/h3\u003e\n\u003cp\u003ePairwise multiple alignments of cp genomes were performed using \u003cb\u003eMAFFT v7.490\u003c/b\u003e (43) in Geneious Prime. Simple sequence repeats (SSRs) containing 1\u0026ndash;6 nucleotides were identified using the \u003cb\u003ePhobos v3.3.12\u003c/b\u003e (46). The following configuration was adopted to search for SSR motifs: SSR of one to six nucleotides long, with a minimum repeat number of 10, 5, and 4 units for mono-, di-, and trinucleotide SSRs, respectively, and three units for tetra-, penta-, and hexanucleotide SSRs.\u003c/p\u003e \u003cdiv id=\"Sec8\" class=\"Section2\"\u003e \u003ch2\u003ePhylogenetic and haplotype analysis\u003c/h2\u003e \u003cp\u003ePhylogenetic analysis included 48 E. guineensis cp genomes. Multiple sequence alignments were generated using \u003cb\u003eMAFFT v7.490\u003c/b\u003e (43). Phylogenetic trees were constructed using the \u003cb\u003eMaximum Likelihood\u003c/b\u003e (ML) method with the \u003cb\u003eTamura-Nei (TN93/TNG93\u0026thinsp;+\u0026thinsp;G\u0026thinsp;+\u0026thinsp;I)\u003c/b\u003e genetic distance model, and \u003cb\u003e1000 bootstrap replications\u003c/b\u003e (47,48). Haplotype networks were constructed from 48 E. guineensis cp genomes, including 19 PT Bumitama Gunajaya Agro accessions, 25 NCBI accessions, and 4 genomes sequenced in the PMB lab at IPB University. Alignments were performed in Geneious Prime, and haplotype networks were generated using \u003cb\u003eNETWORK software\u003c/b\u003e (49).\u003c/p\u003e \u003c/div\u003e"},{"header":"Results","content":"\u003cdiv id=\"Sec10\" class=\"Section2\"\u003e\n \u003ch2\u003eChloroplast Genome Sequencing\u003c/h2\u003e\n \u003cp\u003eSequencing output is summarized in Table \u003cspan class=\"InternalRef\"\u003e1\u003c/span\u003e. For the Dura‑Tz and Pisifera‑Tz accessions sequenced with MGI, approximately 242 million reads were generated per sample, corresponding to ~\u0026thinsp;36 Gb of total bases. In contrast, PacBio sequencing of Dura‑Tz produced only 5.95 million reads but yielded a substantially higher total base output of 63.33 Gb (Table\u0026nbsp;\u003cspan class=\"InternalRef\"\u003e1\u003c/span\u003e).\u003c/p\u003e\n \u003cdiv class=\"gridtable\"\u003e\u0026nbsp;\u003ctable id=\"Tab1\" border=\"1\"\u003e\n \u003ccaption language=\"En\"\u003e\n \u003cdiv class=\"CaptionNumber\"\u003eTable 1\u003c/div\u003e\n \u003cdiv class=\"CaptionContent\"\u003e\n \u003cp\u003eSequencing data summary for Tanzanian Dura (Dura-Tz) and Pisifera (Pisifera-Tz)\u003c/p\u003e\n \u003c/div\u003e\n \u003c/caption\u003e\n \u003ccolgroup cols=\"4\"\u003e\u003c/colgroup\u003e\n \u003cthead\u003e\n \u003ctr\u003e\n \u003cth align=\"left\"\u003e\n \u003cp\u003eSample name\u003c/p\u003e\n \u003c/th\u003e\n \u003cth align=\"left\"\u003e\n \u003cp\u003eSequencing Platform\u003c/p\u003e\n \u003c/th\u003e\n \u003cth align=\"left\"\u003e\n \u003cp\u003eTotal reads (M.)\u003c/p\u003e\n \u003c/th\u003e\n \u003cth align=\"left\"\u003e\n \u003cp\u003eTotal bases (Gb)\u003c/p\u003e\n \u003c/th\u003e\n \u003c/tr\u003e\n \u003c/thead\u003e\n \u003ctbody\u003e\n \u003ctr\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e\u003cem\u003eDura-Tz\u003c/em\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003eMGI\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"char\"\u003e\n \u003cp\u003e241.78\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"char\"\u003e\n \u003cp\u003e36.27\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd align=\"left\"\u003e\u0026nbsp;\u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003ePacBio\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"char\"\u003e\n \u003cp\u003e5.95\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"char\"\u003e\n \u003cp\u003e63.33\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e\u003cem\u003ePisifera-Tz\u003c/em\u003e\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003eMGI\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"char\"\u003e\n \u003cp\u003e242.00\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"char\"\u003e\n \u003cp\u003e36.30\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003c/tbody\u003e\n \u003c/table\u003e\n \u003c/div\u003e\n \u003cp\u003eThese results highlight the complementary strengths of the two platforms. MGI provides deep coverage through massive read counts, ensuring reliable variant detection and uniform base representation. PacBio, despite generating fewer reads, delivers long sequences that enhance assembly contiguity and resolve repetitive regions more effectively. Together, the datasets demonstrate that combining short‑read and long‑read technologies offers both depth and structural accuracy for plastome reconstruction.\u003c/p\u003e\n\u003c/div\u003e\n\u003cdiv id=\"Sec11\" class=\"Section2\"\u003e\n \u003ch2\u003eSequencing Depth and Coverage\u003c/h2\u003e\n \u003cp\u003eRobust sequencing coverage was achieved across the chloroplast genomes of both Dura-Tz and Pisifera-Tz accessions. The Dura-Tz assembly reached depths of up to ~\u0026thinsp;3,500 reads (Fig. \u003cspan class=\"InternalRef\"\u003e1\u003c/span\u003eA), while Pisifera-Tz attained nearly 7,000 reads (Fig. \u003cspan class=\"InternalRef\"\u003e1\u003c/span\u003eB), reflecting its substantially higher sequencing throughput. Coverage profiles were generally uniform across the plastome, with expected peaks in the inverted repeat (IR) regions due to duplication and troughs in AT-rich or non-coding regions that are inherently more challenging to sequence.\u003c/p\u003e\n \u003cp\u003eThe overall consistency of coverage confirms the completeness and reliability of both assemblies. Significantly, the greater depth observed in Pisifera-Tz enhances confidence in variant detection and structural resolution, thereby providing a stronger foundation for downstream comparative and phylogenetic analyses.\u003c/p\u003e\n\u003c/div\u003e\n\u003cdiv id=\"Sec12\" class=\"Section2\"\u003e\n \u003ch2\u003eChloroplast Genome Assembly and Structure\u003c/h2\u003e\n \u003cp\u003eThe complete chloroplast genomes of Dura‑Tz (PX957236) and Pisifera‑Tz (PX957237) were assembled into circular maps of 156,883 bp, each displaying the typical quadripartite organization comprising large single‑copy (LSC), small single‑copy (SSC), and inverted repeat (IR) regions (Fig. \u003cspan class=\"InternalRef\"\u003e2\u003c/span\u003eA, \u003cspan class=\"InternalRef\"\u003e2\u003c/span\u003eB). Comparative analysis with previously sequenced accessions (Dura‑Cmr and Dura‑BBG) revealed highly conserved structural and compositional features (Table\u0026nbsp;\u003cspan class=\"InternalRef\"\u003e2\u003c/span\u003e). These findings are consistent with earlier oil palm plastome characterizations (23,50) and broader plastid genome studies in seed plants (11,13,14, 23).\u003c/p\u003e\n \u003cdiv class=\"gridtable\"\u003e\u0026nbsp;\u003ctable id=\"Tab2\" border=\"1\"\u003e\n \u003ccaption language=\"En\"\u003e\n \u003cdiv class=\"CaptionNumber\"\u003eTable 2\u003c/div\u003e\n \u003cdiv class=\"CaptionContent\"\u003e\n \u003cp\u003eSummary of general features of 4 accessions of \u003cem\u003eElaeis guineensis\u003c/em\u003e chloroplast-genomes\u003c/p\u003e\n \u003c/div\u003e\n \u003c/caption\u003e\n \u003ccolgroup cols=\"5\"\u003e\u003c/colgroup\u003e\n \u003cthead\u003e\n \u003ctr\u003e\n \u003cth align=\"left\"\u003e\n \u003cp\u003eAttribute\u003c/p\u003e\n \u003c/th\u003e\n \u003cth align=\"left\"\u003e\n \u003cp\u003e\u003cem\u003eDura-Tz\u003c/em\u003e\u003c/p\u003e\n \u003c/th\u003e\n \u003cth align=\"left\"\u003e\n \u003cp\u003e\u003cem\u003ePisifera-Tz\u003c/em\u003e\u003c/p\u003e\n \u003c/th\u003e\n \u003cth align=\"left\"\u003e\n \u003cp\u003eDura-Cmr\u003c/p\u003e\n \u003c/th\u003e\n \u003cth align=\"left\"\u003e\n \u003cp\u003eDura-BBG\u003c/p\u003e\n \u003c/th\u003e\n \u003c/tr\u003e\n \u003c/thead\u003e\n \u003ctbody\u003e\n \u003ctr\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003eGenome Size (bp)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e156,983\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e156,983\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e156,982\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e156,990\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003eGC content (%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e37.4\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e37.4\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e37.4\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e37.4\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003eLSC size (bp)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e85,194\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e85,194\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e85,193\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e85,198\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003eSSC size (bp)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e17,643\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e17,643\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e17,643\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e17,644\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003eIRs size (bp)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e27,073\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e27,073\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e27,073\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e27,074\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003eLSC GC content (%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e35.5\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e35.5\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e35.5\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e35.5\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003eSSC GC content (%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e31.0\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e30.9\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e31.0\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e31.0\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003eIRs GC content (%)\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e42.5\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e42.5\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e42.5\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e42.5\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003eNumber of genes\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e115\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e115\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e115\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e115\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003eNumber of protein-coding genes\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e79\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e80\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e80\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e80\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003eNumber of tRNA\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e32\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e31\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e31\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e31\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003eNumber of rRNA\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e4\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e4\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e4\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e4\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003c/tbody\u003e\n \u003c/table\u003e\n \u003c/div\u003e\n\u003c/div\u003e\n\u003cdiv id=\"Sec13\" class=\"Section2\"\u003e\n \u003ch2\u003eChloroplast Genome Annotation and Gene Content\u003c/h2\u003e\n \u003cp\u003eAnnotation identified a consistent set of 115 genes across all accessions, comprising 79\u0026ndash;80 protein‑coding genes, 31\u0026ndash;32 tRNAs, and 4 rRNAs. Dura-Tz contained 79 protein-coding genes and 32 tRNAs, compared with 80/31 in others (Table \u003cspan class=\"InternalRef\"\u003e2\u003c/span\u003e). This subtle difference may reflect lineage-specific variation in gene annotation or minor events of tRNA duplication or loss, possibly involving \u003cem\u003eycf1\u003c/em\u003e or related loci. Overall, gene composition remains highly stable, underscoring the structural conservation of E. guineensis plastomes. Such uniformity highlights the evolutionary stability of the chloroplast genome, while slight differences in gene counts may serve as useful markers for lineage differentiation and comparative genomics.\u003c/p\u003e\n \u003cp\u003eFunctional categorization revealed a conserved organization across all four accessions, including genes encoding photosystem components, ribosomal proteins, ATP synthase, NADH dehydrogenase, Rubisco, RNA polymerase, and proteins involved in DNA replication and repair. Transfer RNA (tRNA) and ribosomal RNA (rRNA) genes were distributed throughout the plastome, ensuring its translational capacity. Hypothetical chloroplast reading frames (ycf) were also identified, consistent with their conserved presence in angiosperm plastomes. Gene transcription orientation was bidirectional, with genes on the outer ring transcribed clockwise and those on the inner ring transcribed counterclockwise. Core functional categories\u0026mdash;photosynthesis, transcription, translation, and genome maintenance\u0026mdash;were consistently represented, in line with previous annotations of oil palm plastomes (10,17,19).\u003c/p\u003e\n \u003cp\u003eTogether, these findings confirm the high degree of conservation in gene content and organization among oil palm plastomes. The stability observed between Dura‑Tz and Pisifera‑Tz, along with other accessions, provides a reliable foundation for comparative and evolutionary analyses. At the same time, minor lineage‑specific variations can serve as potential markers of genetic differentiation.\u003c/p\u003e\n\u003c/div\u003e\n\u003cdiv id=\"Sec14\" class=\"Section2\"\u003e\n \u003ch2\u003eSSR Distribution and Motif Composition\u003c/h2\u003e\n \u003cp\u003eAnalysis of simple sequence repeats (SSRs) across the four accessions\u0026mdash;Dura‑BBG, Dura‑CMR, Pisifera‑Tz, and Dura‑Tz\u0026mdash;revealed a highly conserved repeat landscape (Fig. \u003cspan class=\"InternalRef\"\u003e3\u003c/span\u003e, Supplementary Table 1). Total SSR counts ranged narrowly from 859 to 860 per plastome, indicating strong structural stability across lineages.\u003c/p\u003e\n \u003cp\u003eMononucleotide repeats dominated the SSR profile, averaging 338.5 per accession, with a pronounced bias toward A/T motifs. Each accession contained approximately 317 A/T mononucleotide repeats, accounting for over 93% of the total, while G/C repeats were rare (22 per plastome). This A/T enrichment reflects the low GC content and mutational bias typical of plastid genomes (10,17)\u003c/p\u003e\n \u003cp\u003eDinucleotide repeats were equally conserved, with 58 per accession. AT and TA motifs were the most abundant, comprising 144 of the 232 total dinucleotide repeats (62%). Other motifs such as AG, GA, TC, and CT were evenly distributed across accessions, with no lineage-specific enrichment. Trinucleotide and tetranucleotide motifs also showed uniform distribution, with 71 and 95 repeats per accession, respectively. Trinucleotide motifs were dominated by A/T-rich sequences such as TAT, AAG, and AAC, while tetranucleotide repeats included AAAT, AATA, ATTA, and TTTA. These patterns reinforce the A/T bias observed in shorter motifs and suggest that even longer SSRs retain compositional consistency across accessions.\u003c/p\u003e\n \u003cp\u003ePentanucleotide repeats were similarly conserved, with 134 motifs per accession. Dominant sequences included AAAAT, AATTT, TTTTA, and TTAAA, all of which are rich in A/T bases. The recurrence of these motifs across all accessions further supports the plastome\u0026rsquo;s mutational bias and structural stability. No accession-specific expansion or contraction of pentanucleotide motifs was observed.\u003c/p\u003e\n \u003cp\u003eHexanucleotide repeats showed slightly more diversity, with counts ranging from 157 to 158 per accession. Despite the broader sequence range, dominant motifs remained A/T-rich, including TTCTTT, TTTTAT, and TTTTTT. The consistent counts and motif composition across accessions suggest that even the longest SSRs are evolutionarily stable and not subject to lineage-specific expansion.\u003c/p\u003e\n \u003cp\u003eHexanucleotide repeats showed minor variation (157\u0026ndash;158), likely reflecting annotation differences or sequencing noise rather than true biological divergence. Although Pisifera-Tz exhibited a marginally higher total SSR count (860 vs. 859 in other accessions), its genic SSR density was substantially reduced (277 vs. ~681\u0026ndash;682). This indicates that while overall SSR abundance remains stable, Pisifera-Tz plastomes have undergone lineage-specific contraction in genic regions (see Supplementary Table \u003cspan class=\"InternalRef\"\u003eS3\u003c/span\u003e).\u0026rdquo;\u003c/p\u003e\n \u003cp\u003eOverall, the uniformity of SSR types and motif composition across all accessions underscores the evolutionary stability of the oil palm plastome. While SSR variability is limited, conserved tetranucleotide, pentanucleotide, and hexanucleotide motifs\u0026mdash;particularly those rich in A/T bases\u0026mdash;may still serve as neutral molecular markers for population genetics, germplasm authentication, and comparative genomics. This conserved SSR profile highlights plastome stability and mirrors SSR distributions reported in other palm species (8,51,52). SSR marker development in oil palm has similarly emphasized A/T-rich motifs (53,54).\u003c/p\u003e\n\u003c/div\u003e\n\u003cdiv id=\"Sec15\" class=\"Section2\"\u003e\n \u003ch2\u003eGenic SSRs Across Accessions\u003c/h2\u003e\n \u003cp\u003eComparative analysis of genic SSRs revealed both broad conservation and striking lineage‑specific differences among the four E. guineensis chloroplast genomes (Table\u0026nbsp;\u003cspan class=\"InternalRef\"\u003e3\u003c/span\u003e). Dura‑Cmr and Dura‑BBG each contained 682 genic SSRs, while Dura‑Tz harbored 681, reflecting near‑identical SSR abundance and distribution across genes associated with photosynthesis, replication, and genome maintenance. This consistency underscores the evolutionary stability of genic SSRs in most oil palm accessions, with plastome function preserved through balanced representation across essential pathways.\u003c/p\u003e\n \u003cdiv class=\"gridtable\"\u003e\u0026nbsp;\u003ctable id=\"Tab3\" border=\"1\"\u003e\n \u003ccaption language=\"En\"\u003e\n \u003cdiv class=\"CaptionNumber\"\u003eTable 3\u003c/div\u003e\n \u003cdiv class=\"CaptionContent\"\u003e\n \u003cp\u003eSummary of genic SSRs in four \u003cem\u003eElaeis guineensis\u003c/em\u003e chloroplast genomes\u003c/p\u003e\n \u003c/div\u003e\n \u003c/caption\u003e\n \u003ccolgroup cols=\"3\"\u003e\u003c/colgroup\u003e\n \u003cthead\u003e\n \u003ctr\u003e\n \u003cth align=\"left\"\u003e\n \u003cp\u003eGenome\u003c/p\u003e\n \u003c/th\u003e\n \u003cth align=\"left\"\u003e\n \u003cp\u003eTotal genic SSRs\u003c/p\u003e\n \u003c/th\u003e\n \u003cth align=\"left\"\u003e\n \u003cp\u003eGene categories with SSRs\u003c/p\u003e\n \u003c/th\u003e\n \u003c/tr\u003e\n \u003c/thead\u003e\n \u003ctbody\u003e\n \u003ctr\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003eDura-Cmr/Dura-BBG/Dura-Tz\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e681\u0026ndash;682\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e\u003cstrong\u003ePhotosynthesis\u003c/strong\u003e:\u003c/p\u003e\n \u003cp\u003e\u003cem\u003eatp\u003c/em\u003e (B/E/F/I), \u003cem\u003eccsA\u003c/em\u003e, \u003cem\u003ecemA\u003c/em\u003e, \u003cem\u003endH\u003c/em\u003e (A/B/D/E/H/I/K), \u003cem\u003epaf\u003c/em\u003e (I/II), \u003cem\u003epet\u003c/em\u003e (A/B/D), \u003cem\u003epsa\u003c/em\u003e (A/B/J), psb (A/B/C/F/I/K/L/M/T/Z), \u003cem\u003erbcL.\u003c/em\u003e\u003c/p\u003e\n \u003cp\u003e\u003cstrong\u003eMaintenance\u003c/strong\u003e:\u003c/p\u003e\n \u003cp\u003e\u003cem\u003eaccD\u003c/em\u003e, \u003cem\u003eclpP1\u003c/em\u003e, \u003cem\u003ematK\u003c/em\u003e, \u003cem\u003erpl\u003c/em\u003e (2/16/22/32/33), \u003cem\u003erps\u003c/em\u003e (3/4/7/11/12/14/15/16/18/19), \u003cem\u003eycf\u003c/em\u003e (1/2)\u003c/p\u003e\n \u003cp\u003e\u003cstrong\u003eReplication\u003c/strong\u003e:\u003c/p\u003e\n \u003cp\u003e\u003cem\u003erpo\u003c/em\u003e A/B/C1/C2\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003ctr\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003ePisifera-Tz\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e277\u003c/p\u003e\n \u003c/td\u003e\n \u003ctd align=\"left\"\u003e\n \u003cp\u003e\u003cstrong\u003ePhotosynthesis\u003c/strong\u003e: \u003cem\u003eatp\u003c/em\u003e (B/E/F/I), \u003cem\u003ecemA\u003c/em\u003e, \u003cem\u003endhK\u003c/em\u003e, \u003cem\u003epaf\u003c/em\u003e I/II, \u003cem\u003epet\u003c/em\u003e A/B/D, \u003cem\u003epsa\u003c/em\u003e A/B/I/J, \u003cem\u003epsb\u003c/em\u003e (A/B/C/F/I/K/L/M/T/Z), \u003cem\u003erbcL\u003c/em\u003e.\u003c/p\u003e\n \u003cp\u003e\u003cstrong\u003eMaintenance\u003c/strong\u003e: \u003cem\u003eaccD\u003c/em\u003e, \u003cem\u003eclpP1\u003c/em\u003e, \u003cem\u003ematK\u003c/em\u003e, \u003cem\u003erpl\u003c/em\u003e (16/22/33), \u003cem\u003erps\u003c/em\u003e (3/4/11/12/14/16/18)\u003c/p\u003e\n \u003cp\u003e\u003cstrong\u003eReplication\u003c/strong\u003e:\u003c/p\u003e\n \u003cp\u003e\u003cem\u003erpo\u003c/em\u003e A/B/C1/C2\u003c/p\u003e\n \u003c/td\u003e\n \u003c/tr\u003e\n \u003c/tbody\u003e\n \u003c/table\u003e\n \u003c/div\u003e\n \u003cp\u003eIn contrast, Pisifera‑Tz exhibited a markedly reduced genic SSR count (277), representing a substantial contraction relative to the other accessions. Despite this reduction, SSRs in Pisifera‑Tz were still distributed across key functional genes, including rbcL, matK, psa, psb, and rpo, ensuring that core photosynthetic and transcriptional processes remained represented. However, the absence of SSRs in several maintenance‑related loci\u0026mdash;including ycf1, ycf2, and members of the ndh family\u0026mdash;suggests lineage‑specific loss or reduced SSR density in regions typically associated with genome stability and stress response.\u003c/p\u003e\n \u003cp\u003eThis pattern highlights two complementary insights: first, the conserved SSR profiles in Dura accessions reflect plastome resilience and functional redundancy across diverse lineages; second, the reduced SSR density in Pisifera‑Tz may indicate either structural streamlining or selective pressures that minimized repetitive elements in its plastome. From a practical perspective, these differences provide opportunities for marker development: conserved genic SSRs in Dura accessions can serve as stable lineage markers. At the same time, the unique contraction in Pisifera‑Tz offers diagnostic motifs for distinguishing maternal lineages in breeding programs. Despite this contraction, SSRs remained present in key functional genes, including \u003cem\u003erbcL\u003c/em\u003e, \u003cem\u003ematK\u003c/em\u003e, \u003cem\u003epsa\u003c/em\u003e, \u003cem\u003epsb\u003c/em\u003e, and \u003cem\u003erpo\u003c/em\u003e. The absence of SSRs in \u003cem\u003eycf1\u003c/em\u003e, \u003cem\u003eycf2\u003c/em\u003e, and \u003cem\u003endh\u003c/em\u003e family genes suggests lineage-specific streamlining, consistent with plastome variation observed in Pisifera lineages (55,56)\u003c/p\u003e\n\u003c/div\u003e\n\u003cdiv id=\"Sec16\" class=\"Section2\"\u003e\n \u003ch2\u003ePlastome Structure and Synteny\u003c/h2\u003e\n \u003cp\u003eMultiple genome alignment of 16 Elaeis guineensis chloroplast genomes\u0026mdash;representing Dura, Tenera, and Pisifera\u0026mdash;revealed a high degree of structural conservation across accessions (Fig. \u003cspan class=\"InternalRef\"\u003e4\u003c/span\u003e). Using the Mauve algorithm, plastome sizes ranged narrowly from 153,000 to 158,000 bp, with most accessions exhibiting uniform syntenic block arrangements and stable genome architecture. This consistency reflects strong evolutionary constraints on plastome organization, likely driven by the essential roles of chloroplast genes in photosynthesis and cellular metabolism.\u003c/p\u003e\n \u003cp\u003eNotably, two Indonesian Pisifera accessions (Pisifera‑Id_221 and Pisifera‑Id_101128179) displayed structural rearrangements in the 140\u0026ndash;145 kb region, characterized by translocations and/or inversions (highlighted in red blocks). These localized disruptions suggest lineage‑specific plastome remodeling, potentially arising from recombination events or selective pressures unique to the Pisifera background. While the overall plastome structure remains conserved, such rearrangements may influence gene regulation or plastome stability, warranting further investigation into their functional consequences.\u003c/p\u003e\n \u003cp\u003eComplementary mapping of the 16 plastomes confirmed the conservation of junction architecture between the large single copy (LSC), small single copy (SSC), and inverted repeat (IR) regions (Fig. \u003cspan class=\"InternalRef\"\u003e5\u003c/span\u003e). All accessions shared nearly identical genome sizes (~\u0026thinsp;156.98 kb) and consistent placement of key genes at junctions, including rps19, rpl2, psbA, ndhF, ycf1, and trnN. Minor differences in junction lengths (25\u0026ndash;65 bp) were observed but did not affect gene orientation or overall structural layout.\u003c/p\u003e\n \u003cp\u003eThese findings reinforce the evolutionary stability of plastome boundaries in E. guineensis and validate their utility for comparative and phylogenetic analyses. The conserved junctions and syntenic blocks across most accessions provide a robust framework for plastome‑based lineage tracing. In contrast, the structural variation in select Pisifera accessions offers a unique opportunity to explore plastome dynamics and their potential role in fruit form differentiation or adaptation. Such rearrangements have been linked to recombination events in plastomes of palms and other monocots (20,24,45,57) Comparative mapping of LSC, SSC, and IR junctions confirmed consistent placement of key genes (\u003cem\u003erps19\u003c/em\u003e, \u003cem\u003erpl2\u003c/em\u003e, \u003cem\u003epsbA\u003c/em\u003e, \u003cem\u003endhF\u003c/em\u003e, \u003cem\u003eycf1\u003c/em\u003e, \u003cem\u003etrnN\u003c/em\u003e) and minor junction length variation (25\u0026ndash;65 bp), reinforcing plastome boundary stability [54](44).\u003c/p\u003e\n\u003c/div\u003e\n\u003cdiv id=\"Sec17\" class=\"Section2\"\u003e\n \u003ch2\u003ePhylogenetic Relationships\u003c/h2\u003e\n \u003cp\u003ePhylogenetic analysis based on complete chloroplast genome sequences revealed distinct clustering patterns among the four E. guineensis accessions (Fig. \u003cspan class=\"InternalRef\"\u003e6\u003c/span\u003e). Pisifera-Tz and Dura-Cmr formed a closely related clade, suggesting high genetic similarity and a likely shared maternal lineage. In contrast, Dura-Tz and Dura-BBG grouped separately, indicating moderate divergence that may reflect differences in evolutionary origin, geographic adaptation, or breeding history. Despite these variations, all accessions clustered within a unified species group, confirming their overall genomic relatedness and supporting their classification within \u003cem\u003eE. guineensis\u003c/em\u003e. This pattern underscores the utility of chloroplast genomes for resolving maternal lineages while reinforcing the evolutionary cohesion of the species. These results align with previous phylogenetic studies that resolved maternal lineages in oil palm using plastome data (12,17,18).\u003c/p\u003e\n\u003c/div\u003e\n\u003cdiv id=\"Sec18\" class=\"Section2\"\u003e\n \u003ch2\u003eHaplotype Diversity\u003c/h2\u003e\n \u003cp\u003eMedian-joining haplotype network analysis identified 14 distinct plastome haplotypes (HAP1\u0026ndash;HAP14) among 48 accessions (Supplementary table \u003cspan class=\"InternalRef\"\u003e2\u003c/span\u003e). The network displayed short mutational distances and interconnected haplotypes, reflecting low divergence and shared evolutionary origin. Median vectors (mv1\u0026ndash;mv5) represented inferred ancestral or intermediate haplotypes, providing insight into mutational pathways and lineage continuity. This moderate but structured plastome diversity supports the evolutionary stability of the plastome and offers practical utility for germplasm management and conservation of cytoplasmic diversity (Fig. \u003cspan class=\"InternalRef\"\u003e7\u003c/span\u003e). The presence of median vectors (mv1\u0026ndash;mv5) suggests inferred ancestral haplotypes, consistent with haplotype diversity patterns reported in oil palm germplasm collections (9,28,49,51,52)\u003c/p\u003e\n\u003c/div\u003e"},{"header":"Discussion","content":"\u003cp\u003eThe assemblies confirm the evolutionary stability of oil palm plastomes, with conserved genome sizes and GC content across accessions. This mirrors findings from broader palm plastome studies (20,24,58) and emphasizes strong selective constraints on chloroplast architecture (10,11,59). Annotation results reinforce this resilience, with highly conserved gene repertoires across accessions, consistent with previous genome sequencing efforts (10,17,60)\u003c/p\u003e \u003cp\u003eSSR analysis revealed conserved motif landscapes across accessions, dominated by A/T mononucleotide repeats. This pattern reflects mutational bias typical of plastid genomes (14,15) and parallels SSR distributions reported in oil palm (53,54). Genic SSR analysis highlighted Pisifera-Tz as an outlier, with reduced SSR density and absence in \u003cem\u003eycf\u003c/em\u003e and \u003cem\u003endh\u003c/em\u003e loci, paralleling reports of cytoplasmic variation in oil palm (61,62).\u003c/p\u003e \u003cp\u003eComparative synteny analysis confirmed conserved plastome architecture across most accessions, with structural rearrangements observed only in Indonesian Pisifera genomes. These rearrangements are consistent with recombination-driven plastome variation in palms (20,45,57). Phylogenetic analysis resolved maternal lineages, clustering Pisifera-Tz with Dura-Cmr, while haplotype network analysis (Fig.\u0026nbsp;\u003cspan refid=\"Fig7\" class=\"InternalRef\"\u003e7\u003c/span\u003e) revealed moderate diversity but strong connectivity, consistent with maternal inheritance and evolutionary stability (12,17,49).\u003c/p\u003e \u003cp\u003eTogether, these findings highlight a dual narrative: deep conservation of plastome structure and gene content across E. guineensis, coupled with subtle lineage-specific variation in SSR density, Synteny, and haplotype distribution. This balance of stability and divergence provides a robust framework for phylogenetic inference, germplasm authentication, and breeding strategies aimed at conserving cytoplasmic diversity in oil palm (8,28,52,63).\u003c/p\u003e"},{"header":"Conclusions","content":"\u003cp\u003eComparative analysis of four \u003cem\u003eElaeis guineensis\u003c/em\u003e chloroplast genomes revealed a highly conserved plastome structure, with only minor variation in junction lengths and genome sizes. Despite this overall stability, Pisifera-Tz exhibited a markedly reduced genic SSR abundance, highlighting its distinct genomic profile. Phylogenetic and haplotype analyses further demonstrated close evolutionary relationships among accessions, while uncovering moderate chloroplast valuable diversity for lineage tracing, population genetics, and applied breeding. Together, these findings provide valuable genomic resources that strengthen evolutionary understanding and support future conservation and crop improvement strategies in oil palm.\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eAuthors' contributions\u003c/strong\u003e:\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eAJM and SS: Research conceptualization, implementation, data analysis, original manuscript preparation, and reviewing.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eRA: Data analysis and manuscript data preparation\u003c/p\u003e\n\u003cp\u003eSWA and WBS: Supervising and reviewing the manuscript.\u003c/p\u003e\n\u003cp\u003eAN and GIL: Providing the chloroplast genome of 19 oil palm germplasm accessions from Indonesia and reviewing the manuscript.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eData availability\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eAll data supporting the findings of this study are available within the paper and its Supplementary Information. All chloroplast genome sequences generated in this study, including their associated genome annotations, are publicly available in the NCBI database under BioProject accession number \u003cstrong\u003ePRJNA1416287\u003c/strong\u003e.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eEthics, Consent to Participate, and Consent to Publish declarations\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eNot applicable.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAcknowledgments\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eWe appreciate the management of Bogor Botanical Gardens and the National Research and Innovation Agency (BRIN) for giving access to the Dura-BBG sample. This manuscript is part of the AJM's master's (MSc) thesis. We acknowledge the support of Revocatus Petro Mushumbusi, PhD, the Director of Tanzania Forestry Research Institute (TAFORI), and his support for AJM's graduate program at IPB University, Bogor, Indonesia. AI was used to polish the English language quality in the manuscript preparation.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eFunding:\u003c/strong\u003e\u003c/p\u003e\n\u003cul type=\"disc\"\u003e\n \u003cli\u003e\u003cstrong\u003eAlly Juma Mkude, Rare Forest Profession, Tanzania Forest Fund\u003c/strong\u003e\u003c/li\u003e\n \u003cli\u003e\u003cstrong\u003eRedi Aditama, The Oil Palm Research Grant, the Indonesian Palm Oil Fund Management Agency (BPDPKS), GRS_20220227110350\u003c/strong\u003e\u003c/li\u003e\n \u003cli\u003e\u003cstrong\u003eSudarsono Sudarsono, The Oil Palm Research Grant, the Indonesian Palm Oil Fund Management Agency (BPDPKS), GRS_20220227110350\u003c/strong\u003e\u003c/li\u003e\n\u003c/ul\u003e"},{"header":"References","content":"\u003col\u003e\n\u003cli\u003eQaim M, Sibhatu KT, Siregar H, Grass I. Environmental, Economic, and Social Consequences of the Oil Palm Boom. Annu Rev Resour Econ [Internet]. 2020 Oct 6;12(1):321\u0026ndash;44. Available from: https://www.annualreviews.org/doi/10.1146/annurev-resource-110119-024922\u003c/li\u003e\n\u003cli\u003eMeijaard E, Brooks TM, Carlson KM, Slade EM, Garcia-Ulloa J, Gaveau DLA, et al. The environmental impacts of palm oil in context. Nat Plants [Internet]. 2020 Dec 7;6(12):1418\u0026ndash;26. Available from: https://www.nature.com/articles/s41477-020-00813-w\u003c/li\u003e\n\u003cli\u003eMurphy DJ, Goggin K, Paterson RRM. Oil palm in the 2020s and beyond: challenges and solutions. CABI Agric Biosci [Internet]. 2021 Oct 11;2(1):39. Available from: https://www.cabidigitallibrary.org/doi/10.1186/s43170-021-00058-3\u003c/li\u003e\n\u003cli\u003eMeijaard E, Sheil D. The Moral Minefield of Ethical Oil Palm and Sustainable Development. Front For Glob Chang [Internet]. 2019 May 28;2. Available from: https://www.frontiersin.org/article/10.3389/ffgc.2019.00022/full\u003c/li\u003e\n\u003cli\u003eVijay V, Pimm SL, Jenkins CN, Smith SJ. The Impacts of Oil Palm on Recent Deforestation and Biodiversity Loss. Anand M, editor. PLoS One [Internet]. 2016 Jul 27;11(7):e0159668. Available from: https://dx.plos.org/10.1371/journal.pone.0159668\u003c/li\u003e\n\u003cli\u003eSoh AC, Mayes S, Roberts J. Introduction to the Oil Palm Crop. In: Oil Palm Breeding [Internet]. Boca Raton : Taylor \u0026amp; Francis, 2017.: CRC Press; 2017. p. 1\u0026ndash;6. Available from: https://www.taylorfrancis.com/books/9781498715454/chapters/10.1201/9781315119724-1\u003c/li\u003e\n\u003cli\u003eCorley RHV, Tinker PB. The Oil Palm. Fifth. West Sussex: John Wiley \u0026amp; Sons, Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ , UK; 2016. 1\u0026ndash;627 p. \u003c/li\u003e\n\u003cli\u003eMaga\u0026ntilde;a-\u0026Aacute;lvarez A, P\u0026eacute;rez-Brito D, Cort\u0026eacute;s-Vel\u0026aacute;zquez A, Nexticapan-Garc\u0026eacute;z \u0026Aacute;, Ortega-Ram\u0026iacute;rez ME, Garc\u0026iacute;a-C\u0026aacute;mara I, et al. Genetic Variability of Oil Palm in Mexico: An Assessment Based on Microsatellite Markers. Agric. 2023 Sep 1;13(9). \u003c/li\u003e\n\u003cli\u003eHayati A, Wickneswari R, Maizura I, Rajanaidu N. Genetic diversity of oil palm (Elaeis guineensis Jacq.) germplasm collections from Africa: implications for improvement and conservation of genetic resources. Theor Appl Genet [Internet]. 2004 May 16;108(7):1274\u0026ndash;84. Available from: http://link.springer.com/10.1007/s00122-003-1545-0\u003c/li\u003e\n\u003cli\u003eDaniell H, Lin C sea, Yu M, Chang W jung. Chloroplast genomes : diversity , evolution , and applications in genetic engineering. Genome Biol. 2016;1\u0026ndash;29. \u003c/li\u003e\n\u003cli\u003eWicke S, Schneeweiss GM, DePamphilis CW, M\u0026uuml;ller KF, Quandt D. The evolution of the plastid chromosome in land plants: gene content, gene order, gene function. Plant Mol Biol [Internet]. 2011 Jul 22;76(3\u0026ndash;5):273\u0026ndash;97. Available from: http://link.springer.com/10.1007/s11103-011-9762-4\u003c/li\u003e\n\u003cli\u003eGitzendanner MA, Soltis PS, Wong GK ‐S., Ruhfel BR, Soltis DE. Plastid phylogenomic analysis of green plants: A billion years of evolutionary history. Am J Bot [Internet]. 2018 Mar 30;105(3):291\u0026ndash;301. Available from: https://bsapubs.onlinelibrary.wiley.com/doi/10.1002/ajb2.1048\u003c/li\u003e\n\u003cli\u003eJansen RK, Ruhlman TA. Plastid Genomes of Seed Plants. In 2012. p. 103\u0026ndash;26. Available from: http://link.springer.com/10.1007/978-94-007-2920-9_5\u003c/li\u003e\n\u003cli\u003ePalmer JD. COMPARATIVE ORGANIZATION OF CHLOROPLAST GENOMES. Annu Rev Genet [Internet]. 1985 Dec;19(1):325\u0026ndash;54. Available from: https://www.annualreviews.org/doi/10.1146/annurev.ge.19.120185.001545\u003c/li\u003e\n\u003cli\u003eShinozaki K, Ohme M, Tanaka M, Wakasugi T, Hayashida N, Matsubayashi T, et al. The complete nucleotide sequence of the tobacco chloroplast genome: its gene organization and expression. EMBO J [Internet]. 1986 Sep;5(9):2043\u0026ndash;9. Available from: https://link.springer.com/article/10.1002/j.1460-2075.1986.tb04464.x\u003c/li\u003e\n\u003cli\u003eUthaipaisanwong P, Chanprasert J, Shearman JR, Sangsrakru D, Yoocha T, Jomchai N. Characterization of the chloroplast genome sequence of oil palm ( Elaeis guineensis Jacq .). 2012;500:172\u0026ndash;80. \u003c/li\u003e\n\u003cli\u003eShaw J, Lickey EB, Beck JT, Farmer SB, Liu W, Miller J, et al. The tortoise and the hare II: relative utility of 21 noncoding chloroplast DNA sequences for phylogenetic analysis. Am J Bot [Internet]. 2005 Jan;92(1):142\u0026ndash;66. Available from: https://bsapubs.onlinelibrary.wiley.com/doi/10.3732/ajb.92.1.142\u003c/li\u003e\n\u003cli\u003eDong W, Liu J, Yu J, Wang L, Zhou S. Highly Variable Chloroplast Markers for Evaluating Plant Phylogeny at Low Taxonomic Levels and for DNA Barcoding. Moustafa A, editor. PLoS One [Internet]. 2012 Apr 12;7(4):e35071. Available from: https://dx.plos.org/10.1371/journal.pone.0035071\u003c/li\u003e\n\u003cli\u003ePoczai P, Varga I, Laos M, Cseh A, Bell N, Valkonen JP, et al. Advances in plant gene-targeted and functional markers: a review. Plant Methods [Internet]. 2013;9(1):6. Available from: http://plantmethods.biomedcentral.com/articles/10.1186/1746-4811-9-6\u003c/li\u003e\n\u003cli\u003eRahmawati A, Volkaert HA, Dinarti D, Maskromo I, Hatta ANNL, Sudarsono S. Complete Chloroplast Genome Sequences of Coconut cv. Kopyor Green Dwarf and Comparative Genome Analysis to Oil Palm, Date Palm, Sago Palm, and Miniature Sugar Palm. In: Oil Crop Genomics [Internet]. Cham: Springer International Publishing; 2021. p. 189\u0026ndash;216. Available from: https://link.springer.com/10.1007/978-3-030-70420-9_10\u003c/li\u003e\n\u003cli\u003eArshad A, Aditama R, Rahayu MSRI, Natawijaya A. Comparative study of chloroplast genomes across seven Salacca species. 2024;25(11):4043\u0026ndash;58. \u003c/li\u003e\n\u003cli\u003ePutu NI, Asih SRI, Pratiwi RA, Prihatini I, Kuswandi PC, Yulianti EVY, et al. Comparative analysis of the complete chloroplast genome of the rare endemic palm Pinanga arinasae Witono from Bali , Indonesia. 2025;49(5):432\u0026ndash;46. \u003c/li\u003e\n\u003cli\u003eDaniell H, Lin CS, Yu M, Chang WJ. Chloroplast genomes: Diversity, evolution, and applications in genetic engineering. Vol. 17, Genome Biology. 2016. p. 1\u0026ndash;29. \u003c/li\u003e\n\u003cli\u003eKhan AL, Asaf S, Lee IJ, Al-Harrasi A, Al-Rawahi A. First chloroplast genomics study of Phoenix dactylifera (var. Naghal and Khanezi): A comparative analysis. Chen S, editor. PLoS One [Internet]. 2018 Jul 31;13(7):e0200104. Available from: https://dx.plos.org/10.1371/journal.pone.0200104\u003c/li\u003e\n\u003cli\u003eChan KL, Tatarinova T V., Rosli R, Amiruddin N, Azizi N, Halim MAA, et al. Evidence-based gene models for structural and functional annotations of the oil palm genome. Biol Direct [Internet]. 2017 Dec 8;12(1):21. Available from: https://biologydirect.biomedcentral.com/articles/10.1186/s13062-017-0191-4\u003c/li\u003e\n\u003cli\u003eMohd Talkah NS, Aziz NAKA, Rahim MFA, Hanafi NFF, Ahmad Mokhtar MA, Othman AS. The chloroplast genome inheritance pattern of the Deli-Nigerian prospection material (NPM) \u0026times; Yangambi population of Elaeis guineensis Jacq. PeerJ [Internet]. 2024 May 27;12:e17335. Available from: https://peerj.com/articles/17335\u003c/li\u003e\n\u003cli\u003eMalike FA, Marhalil M, Zulkifli Y, Aziz NA, Oyebamiji YO. Genetic Variability of Oil Palm ( Elaeis guineensis Jacq .) Deli Dura Inter-Crosses Population in The MPOB Research Station Hulu Paka ,. 2024;53:125\u0026ndash;33. \u003c/li\u003e\n\u003cli\u003eGan ST, Teo CJ, Manirasa S, Wong WC, Wong CK. Assessment of genetic diversity and population structure of oil palm (Elaeis guineensis Jacq.) field genebank: A step towards molecular-assisted germplasm conservation. Avramidou E V., editor. PLoS One [Internet]. 2021 Jul 29;16(7):e0255418. Available from: https://dx.plos.org/10.1371/journal.pone.0255418\u003c/li\u003e\n\u003cli\u003eZaki NM, Singh R, Rosli R, Ismail I. Elaeis oleifera Genomic-SSR Markers : Exploitation in Oil Palm Germplasm Diversity and Cross-Amplification in Arecaceae. 2012;4069\u0026ndash;88. \u003c/li\u003e\n\u003cli\u003eOsorio-Guar\u0026iacute;n JA, Garz\u0026oacute;n-Mart\u0026iacute;nez GA, Delgadillo-Duran P, Bastidas S, Moreno LP, Enciso-Rodr\u0026iacute;guez FE, et al. Genome-wide association study (GWAS) for morphological and yield-related traits in an oil palm hybrid (Elaeis oleifera x Elaeis guineensis) population. BMC Plant Biol [Internet]. 2019 Dec 3;19(1):533. Available from: https://bmcplantbiol.biomedcentral.com/articles/10.1186/s12870-019-2153-8\u003c/li\u003e\n\u003cli\u003eCorley RHV, Tinker PB. The Oil Palm [Internet]. Wiley; 2015. Available from: https://onlinelibrary.wiley.com/doi/book/10.1002/9781118953297\u003c/li\u003e\n\u003cli\u003eChung KP. Cytoplasmic inheritance: The transmission of plastid and mitochondrial genomes across cells and generations. Plant Physiol [Internet]. 2025 Apr 30;198(1). Available from: https://academic.oup.com/plphys/article/doi/10.1093/plphys/kiaf168/8123037\u003c/li\u003e\n\u003cli\u003eGoodwin S, McPherson JD, McCombie WR. Coming of age: ten years of next-generation sequencing technologies. Nat Rev Genet [Internet]. 2016 Jun 17;17(6):333\u0026ndash;51. Available from: https://www.nature.com/articles/nrg.2016.49\u003c/li\u003e\n\u003cli\u003eSun X, Hu Y, Song Z. Library construction of metabarcoding at DNBSEQ-G400 with MGIEasy universal DNA library prep Kit v1 [Internet]. 2020. Available from: https://www.protocols.io/view/library-construction-of-metabarcoding-at-dnbseq-g4-bn95mh86\u003c/li\u003e\n\u003cli\u003eBegum G, Albanna A, Bankapur A, Nassir N, Tambi R, Berdiev BK, et al. Long-Read Sequencing Improves the Detection of Structural Variations Impacting Complex Non-Coding Elements of the Genome. Int J Mol Sci [Internet]. 2021 Feb 19;22(4):2060. Available from: https://www.mdpi.com/1422-0067/22/4/2060\u003c/li\u003e\n\u003cli\u003eBentz ML, Clines C, Padilla J, Horn C, Lee J, Sheth M. High-throughput PacBio library preparation and sequencing techniques for genomic DNA and TNA. Front Genet [Internet]. 2025 Aug 6;16. Available from: https://www.frontiersin.org/articles/10.3389/fgene.2025.1587691/full\u003c/li\u003e\n\u003cli\u003eChen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics [Internet]. 2018 Sep 1;34(17):i884\u0026ndash;90. Available from: https://academic.oup.com/bioinformatics/article/34/17/i884/5093234\u003c/li\u003e\n\u003cli\u003eDierckxsens N, Mardulyn P, Smits G. NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucleic Acids Res [Internet]. 2016 Oct 24;gkw955. Available from: https://academic.oup.com/nar/article-lookup/doi/10.1093/nar/gkw955\u003c/li\u003e\n\u003cli\u003eTillich M, Lehwark P, Pellizzer T, Ulbricht-Jones ES, Fischer A, Bock R, et al. GeSeq \u0026ndash; versatile and accurate annotation of organelle genomes. Nucleic Acids Res [Internet]. 2017 Jul 3;45(W1):W6\u0026ndash;11. Available from: https://academic.oup.com/nar/article-lookup/doi/10.1093/nar/gkx391\u003c/li\u003e\n\u003cli\u003eChan PP, Lin BY, Mak AJ, Lowe TM. tRNAscan-SE 2.0: improved detection and functional classification of transfer RNA genes. Nucleic Acids Res [Internet]. 2021 Sep 20;49(16):9077\u0026ndash;96. Available from: https://academic.oup.com/nar/article/49/16/9077/6355886\u003c/li\u003e\n\u003cli\u003eLohse M, Drechsel O, Kahlau S, Bock R. OrganellarGenomeDRAW\u0026mdash;a suite of tools for generating physical maps of plastid and mitochondrial genomes and visualizing expression data sets. Nucleic Acids Res [Internet]. 2013 Jul 1;41(W1):W575\u0026ndash;81. Available from: http://academic.oup.com/nar/article/41/W1/W575/1092342/OrganellarGenomeDRAWa-suite-of-tools-for\u003c/li\u003e\n\u003cli\u003eKearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, et al. Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 2012 Jun;28(12):1647\u0026ndash;9. \u003c/li\u003e\n\u003cli\u003eKatoh K, Standley DM. MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability. Mol Biol Evol. 2013 Apr;30(4):772\u0026ndash;80. \u003c/li\u003e\n\u003cli\u003eAmiryousefi A, Hyv\u0026ouml;nen J, Poczai P. IRscope: an online program to visualize the junction sites of chloroplast genomes. Hancock J, editor. Bioinformatics [Internet]. 2018 Sep 1;34(17):3030\u0026ndash;1. Available from: https://academic.oup.com/bioinformatics/article/34/17/3030/4961430\u003c/li\u003e\n\u003cli\u003eDarling ACE, Mau B, Blattner FR, Perna NT. Mauve: Multiple Alignment of Conserved Genomic Sequence With Rearrangements. Genome Res [Internet]. 2004 Jul;14(7):1394\u0026ndash;403. Available from: http://genome.cshlp.org/lookup/doi/10.1101/gr.2289704\u003c/li\u003e\n\u003cli\u003eMayer C, Leese F, Tollrian R. Genome-wide analysis of tandem repeats in Daphnia pulex - a comparative approach. 2010; \u003c/li\u003e\n\u003cli\u003eSaitou N, Nei M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol [Internet]. 1987 Jul;4(4):406\u0026ndash;25. Available from: https://academic.oup.com/mbe/article/4/4/406/1029664/The-neighborjoining-method-a-new-method-for\u003c/li\u003e\n\u003cli\u003eTamura K, Stecher G, Kumar S. MEGA11: Molecular Evolutionary Genetics Analysis Version 11. Battistuzzi FU, editor. Mol Biol Evol [Internet]. 2021 Jun 25;38(7):3022\u0026ndash;7. Available from: https://academic.oup.com/mbe/article/38/7/3022/6248099\u003c/li\u003e\n\u003cli\u003eBandelt HJ, Forster P, Rohl A. Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol [Internet]. 1999 Jan 1;16(1):37\u0026ndash;48. Available from: https://academic.oup.com/mbe/article-lookup/doi/10.1093/oxfordjournals.molbev.a026036\u003c/li\u003e\n\u003cli\u003eLi DM, Pan YG, Wu XY, Zou SP, Wang L, Zhu GF. Comparative chloroplast genomics , phylogenetic relationships and molecular markers development of Aglaonema commutatum and seven green cultivars of Aglaonema. Sci Rep [Internet]. 2024;1\u0026ndash;17. Available from: https://doi.org/10.1038/s41598-024-62586-y\u003c/li\u003e\n\u003cli\u003eBakoum\u0026eacute; C, Wickneswari R, Rajanaidu N, Kushairi A, Amblard P, Billotte N. Allelic diversity of natural oil palm ( Elaeis guineensis Jacq.) populations detected by microsatellite markers: implications for conservation. Plant Genet Resour [Internet]. 2007 Aug 1;5(2):104\u0026ndash;7. Available from: https://www.cambridge.org/core/product/identifier/S1479262107710870/type/journal_article\u003c/li\u003e\n\u003cli\u003eBakoum\u0026eacute; C, Wickneswari R, Siju S, Rajanaidu N, Kushairi A, Billotte N. Genetic diversity of the world\u0026rsquo;s largest oil palm (Elaeis guineensis Jacq.) field genebank accessions using microsatellite markers. Genet Resour Crop Evol [Internet]. 2015 Mar 3;62(3):349\u0026ndash;60. Available from: http://link.springer.com/10.1007/s10722-014-0156-8\u003c/li\u003e\n\u003cli\u003eB. KB, K. L. MR, Sahu S, Mathur RK, P. NK, G. R, et al. Development and validation of whole genome-wide and genic microsatellite markers in oil palm (Elaeis guineensis Jacq.): First microsatellite database (OpSatdb). Sci Rep [Internet]. 2019 Feb 13;9(1):1899. Available from: https://www.nature.com/articles/s41598-018-37737-7\u003c/li\u003e\n\u003cli\u003eZhou L, Yarra R, Zhao Z, Jin L, Cao H. Development of SSR markers based on transcriptome data and association mapping analysis for fruit shell thickness associated traits in oil palm (Elaeis guineensis Jacq.). 3 Biotech [Internet]. 2020 Jun;10(6):280. Available from: https://link.springer.com/10.1007/s13205-020-02269-3\u003c/li\u003e\n\u003cli\u003eWu H, Li DZ, Ma PF. Unprecedented variation pattern of plastid genomes and the potential role in adaptive evolution in Poales. BMC Biol [Internet]. 2024 Apr 29;22(1):97. Available from: https://bmcbiol.biomedcentral.com/articles/10.1186/s12915-024-01890-5\u003c/li\u003e\n\u003cli\u003eTu XD, Lin WJ, Fu HH, Lin YZ, Shen J, Chen S, et al. Comparative Analysis of Six Complete Plastomes of Tripterospermum spp. Int J Mol Sci [Internet]. 2024 Feb 22;25(5):2534. Available from: https://www.mdpi.com/1422-0067/25/5/2534\u003c/li\u003e\n\u003cli\u003eSRI ASIH NP, PRATIWI RA, PRIHATINI I, KUSWANDI PC, YULIANTI E, PRIYADI A. Comparative analysis of the complete chloroplast genome of the rare endemic palm Pinanga arinasae Witono from Bali, Indonesia. Turk J Botany [Internet]. 2025 Sep 19;49(5):432\u0026ndash;46. Available from: https://journals.tubitak.gov.tr/botany/vol49/iss5/7\u003c/li\u003e\n\u003cli\u003eId F, Cauz-santos LA, Morales JA, Alencar D, Picanc D, Teresa M, et al. Complete chloroplast genomes and phylogeny in three Euterpe palms ( E . edulis , E . oleracea and E . precatoria ) from different Brazilian biomes. 2022;1\u0026ndash;26. \u003c/li\u003e\n\u003cli\u003eWang J, Kan S, Liao X, Zhou J, Tembrock LR, Daniell H, et al. Plant organellar genomes: much done, much more to do. Trends Plant Sci [Internet]. 2024 Jul;29(7):754\u0026ndash;69. Available from: https://linkinghub.elsevier.com/retrieve/pii/S1360138523004120\u003c/li\u003e\n\u003cli\u003eSingh R, Low ETL, Ooi LCL, Ong-Abdullah M, Ting NC, Nagappan J, et al. The oil palm SHELL gene controls oil yield and encodes a homologue of SEEDSTICK. Nature [Internet]. 2013 Aug 24;500(7462):340\u0026ndash;4. Available from: https://www.nature.com/articles/nature12356\u003c/li\u003e\n\u003cli\u003eMohd Talkah NS, Aziz NAKA, Rahim MFA, Hanafi NFF, Ahmad Mokhtar MA, Othman AS. The chloroplast genome inheritance pattern of the Deli-Nigerian prospection material (NPM) \u0026times; Yangambi population of Elaeis guineensis Jacq. PeerJ. 2024 May;12:e17335. \u003c/li\u003e\n\u003cli\u003eSingh R, Ong-Abdullah M, Low ETL, Manaf MAA, Rosli R, Nookiah R, et al. Oil palm genome sequence reveals divergence of interfertile species in Old and New worlds. Nature [Internet]. 2013 Aug 24;500(7462):335\u0026ndash;9. Available from: https://www.nature.com/articles/nature12309\u003c/li\u003e\n\u003cli\u003eFadila AM, Marhalil M, Zulkifli Y, Shamsudin NAA, Oyebamiji YO. Genetic Variability of Oil Palm (Elaeis guineensis Jacq.) Deli Dura Inter-Crosses Population in The MPOB Research Station Hulu Paka, Terengganu, Malaysia. Malaysian Appl Biol [Internet]. 2024 Jun 30;53(2):125\u0026ndash;33. Available from: https://jms.mabjournal.com/index.php/mab/article/view/2975\u003c/li\u003e\n\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"bmc-genomics","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"gics","sideBox":"Learn more about [BMC Genomics](http://bmcgenomics.biomedcentral.com/)","snPcode":"","submissionUrl":"https://www.editorialmanager.com/gics","title":"BMC Genomics","twitterHandle":"#BMCGenomics","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"em","reportingPortfolio":"BMC Series","inReviewEnabled":true,"inReviewRevisionsEnabled":true},"keywords":"Oil palm, chloroplast genome, plastome diversity, SSR variation, phylogenetics, maternal lineage","lastPublishedDoi":"10.21203/rs.3.rs-8662850/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-8662850/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003ch2\u003eBackground\u003c/h2\u003e \u003cp\u003eOil palm (\u003cem\u003eElaeis guineensis\u003c/em\u003e) is the world\u0026rsquo;s most important source of vegetable oil, yet comparative studies of its chloroplast genomes remain limited, particularly for African accessions. Understanding plastome diversity and maternal lineage variation is critical for tracing domestication history and developing molecular markers for breeding and conservation.\u003c/p\u003e\u003ch2\u003eResults\u003c/h2\u003e \u003cp\u003eWe report the first complete chloroplast genomes from Tanzanian Dura and Pisifera, Cameroonian Dura, and the ancestral Deli Dura from Indonesia, analyzed alongside 44 additional plastomes. Comparative analyses revealed highly conserved genome organization and gene content, with subtle differences in IR\u0026ndash;SC boundaries and intergenic spacers. Simple sequence repeat (SSR) profiling confirmed the dominance of A/T mononucleotide repeats but highlighted lineage-specific variation, including a markedly reduced genic SSR density in Pisifera-Tz, while total SSR counts remained stable across accessions (\u0026asymp;\u0026thinsp;859\u0026ndash;860; Supplementary Table \u003cspan refid=\"MOESM3\" class=\"InternalRef\"\u003eS3\u003c/span\u003e). Phylogenetic reconstruction clustered Pisifera-Tz with Dura-Cmr, while Dura-Tz and Dura-BBG formed distinct clades, reflecting multiple maternal lineages. Haplotype network analysis identified 14 distinct haplotypes, revealing moderate but structured plastome diversity. Benchmarking of sequencing platforms demonstrated complementary strengths: MGI provided deep coverage, while PacBio improved assembly contiguity and resolution of repetitive regions.\u003c/p\u003e\u003ch2\u003eConclusion\u003c/h2\u003e \u003cp\u003eThis study expands plastome resources for \u003cem\u003eE. guineensis\u003c/em\u003e, generating the first Tanzanian and Cameroonian chloroplast genomes and uncovering lineage-specific SSR variation. Integration of comparative genomics, phylogenetics, and haplotype analysis advances understanding of oil palm domestication and supports the development of plastome-based molecular markers for breeding, conservation, and genetic resource management.\u003c/p\u003e","manuscriptTitle":"Comparative Chloroplast Genomics and Maternal Lineage Diversity in Oil Palm (Elaeis guineensis)","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2026-02-23 18:32:08","doi":"10.21203/rs.3.rs-8662850/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"decision","content":"Revision requested","date":"2026-03-19T20:09:21+00:00","index":"","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-03-05T10:29:06+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-03-01T21:29:46+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-02-27T14:15:29+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-02-25T03:22:02+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-02-25T03:13:43+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"117843563695271243294070940411679095060","date":"2026-02-24T03:27:11+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"41294436877869304934275901962533582021","date":"2026-02-19T13:07:46+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"280809109341340313500433158698014268886","date":"2026-02-19T12:16:49+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"109284986128218017380306639830053353562","date":"2026-02-19T08:32:51+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"331250881095119248581629524168550007697","date":"2026-02-19T03:35:20+00:00","index":"hide","fulltext":""},{"type":"reviewersInvited","content":"","date":"2026-02-19T03:23:51+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2026-02-19T03:21:24+00:00","index":"","fulltext":""},{"type":"editorInvited","content":"","date":"2026-02-06T19:55:45+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2026-02-06T17:10:04+00:00","index":"","fulltext":""},{"type":"submitted","content":"BMC Genomics","date":"2026-02-06T16:58:23+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"bmc-genomics","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"gics","sideBox":"Learn more about [BMC Genomics](http://bmcgenomics.biomedcentral.com/)","snPcode":"","submissionUrl":"https://www.editorialmanager.com/gics","title":"BMC Genomics","twitterHandle":"#BMCGenomics","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"em","reportingPortfolio":"BMC Series","inReviewEnabled":true,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"e4f42baa-3ac8-4b54-abc1-7910954d4fb8","owner":[],"postedDate":"February 23rd, 2026","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"in-revision","subjectAreas":[],"tags":[],"updatedAt":"2026-03-19T20:23:21+00:00","versionOfRecord":[],"versionCreatedAt":"2026-02-23 18:32:08","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-8662850","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-8662850","identity":"rs-8662850","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.