A ribozyme mass extinction at the RNA-cellular boundary and its potential imprint on the genetic code

preprint OA: closed CC-BY-4.0
📄 Open PDF Full text JSON View at publisher
Full text 38,834 characters · extracted from oa-pdf · 7 sections · click to expand

Abstract

The transition from the RNA world to cellular life remains one of the least understood events in the history of biology. This study proposes that geochemical changes approximately 3.9-3.8 billion years ago triggered a mass extinction of ribozymes - possibly the first extinction event on Earth - and that the genetic code bears the fingerprints of its survivors. The argument draws on parallels with animal mass extinctions, where generalist species survive, dominate post-extinction ecosystems as disaster taxa, and shape subsequent evolutionary trajectories. Among small self-cleaving ribozymes, the hammerhead ribozyme accounts for approximately 91% of all ~221,000 known sequences and is the only ribozyme that is found across all kingdoms of life. The phylogeny of hammerhead ribozymes exhibits a star-like topology consistent with rapid post-bottleneck expansion, and its small size, ability to tolerate diverse chemical conditions, and broad substrate specificity suggest it was a resilient generalist feeder fitting as a disaster taxon. Other families of ribozymes possibly survived in small, taxonomically isolated populations resembling relict species in ecological refugia. It is suggested that the body plans of surviving ribozymes seeded the primitive processes that would later become the genetic code, for example with RNA-degrading trinucleotides becoming stop codons, partitioning the trinucleotide space into signals of termination and translation. This hypothesis proposes a reframing of the origin of the genetic code in part as an ecological legacy rather than a purely chemical inevitability. 1 .CC-BY 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted March 9, 2026. ; https://doi.org/10.64898/2026.03.05.709948doi: bioRxiv preprint

Introduction

The RNA world hypothesis posits a period in Earth's early history during which RNA molecules served simultaneously as genetic material and catalytic agents 1,2. This hypothesis, grounded in the discovery of catalytic RNA 3,4, is now broadly accepted as a foundation of origin-of-life theory. Yet the transition from the RNA world to the cellular world remains one of the least understood events in the history of life 5,6. Most accounts of this transition emphasize its constructive aspects: the synthesis of fatty acid vesicles, the encapsulation of replicating molecules, and the emergence of protocells with rudimentary metabolism 7-9. Less discussed is the possibility that this transition was also destructive - that the geochemical changes enabling cellular life simultaneously devastated the pre-existing RNA ecosystem. In the animal kingdom, the most consequential evolutionary transitions are routinely associated with mass extinction events 10,11. The end-Permian extinction cleared the way for dinosaur radiation 12; the end-Cretaceous extinction enabled the rise of mammals 12,13. In each case, what appears as a creative explosion was preceded by catastrophic destruction. This paper proposes that the RNA-to-cellular transition was triggered by a mass extinction of ribozymes - potentially the first mass extinction event on Earth - and that the genetic code bears the chemical fingerprints of its dominant survivor. The argument proceeds in three steps. First, geochemical evidence is reviewed for environmental changes ~3.9-3.8 Ga that would have been catastrophic for free-living RNA organisms. Second, evidence is presented that extant ribozyme families exhibit every hallmark of a post-mass-extinction ecosystem, with the hammerhead ribozyme as the disaster taxon. Third, computational analysis demonstrates that the trinucleotide composition of the ribozyme body plan regions maps onto the functional architecture of the genetic code. 2 .CC-BY 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted March 9, 2026. ; https://doi.org/10.64898/2026.03.05.709948doi: bioRxiv preprint

Results

To identify candidate triggers for a possible major ecological shift occurring at the RNA-cellular (R-C) boundary, we surveyed the geochemical literature for changes proposed to have taken place between approximately 4.2 and 3.5 Ga. Four interlinked factors converge within the interval ~3.9-3.8 Ga (Fig. 1). The upper bound of this interval is constrained by geochemical modeling showing the onset of rapid environmental change, and the lower bound by the earliest isotopic signatures of biological carbon fixation at ~3.8 Ga 14,15, indicating that cellular life was already emerging by that time. First, the late accretion bombardment declined sharply across this interval 16 (Fig. 1A). While the waning of impact sterilization is typically viewed as permissive for life, it simultaneously reduced the meteoritic delivery of metals and reduced phosphorus species that had sustained prebiotic chemistry 17,18. Second, surface temperatures plunged from over 200°C to near-modern values across this interval (Fig. 1B). While the cooling itself would have expanded the thermal range habitable by RNA, it accelerated silicate weathering of the evolving crust, releasing cations into the ocean and drawing down atmospheric CO₂ - directly driving a rapid pH rise 19,20. Ocean pH rose from approximately 5.0 toward near-neutral values more rapidly than previously estimated 21 (Fig. 1C). The sensitivity of RNA to alkaline hydrolysis is intrinsic to its chemistry 22, hence rising pH would have progressively shortened the half-life of exposed ribozymes. Fourth, the delivery of reduced phosphorus by late accretion declined by orders of magnitude across this interval 17,23 (Fig. 1D), while phosphate sequestration through mineral precipitation further constrained its availability 18,24. Ribozyme populations thus faced a growing scarcity of their fundamental building blocks. These four stressors are not independent: the declining bombardment drove the thermal crash, which in turn accelerated pH equilibration through enhanced weathering, which promoted phosphate mineralization. The

Result

was a synergistic collapse - ribozymes simultaneously lost structural stability, catalytic competence, and RNA building blocks (food) for reproduction. This convergence of stressors mirrors the multi-causal structure of animal mass extinctions, in which synergistic interactions between environmental changes amplify extinction severity beyond what any single trigger could produce 25,26. A key detail in understanding the R-C transition is the distribution of extant ribozyme families, which exhibits a pattern strikingly consistent with a post-mass-extinction ecosystem (Fig. 2), with each hallmark finding a direct parallel in the animal mass extinction record (Table 1). Of approximately 221,000 known self-cleaving ribozyme sequences across all ten families, approximately 199,000 (91%) are hammerhead ribozymes 27,28 (Fig. 2A), a degree of numerical dominance characteristic of disaster taxa 29, exceeding even the post-Permian dominance of Lystrosaurus (70-95% of terrestrial vertebrate individuals) 30,31. The other types of small, self-cleaving ribozymes exist in taxonomically isolated populations (82 to ~11,500 sequences) consistent with relict species in ecological refugia 32,33. The hatchet ribozyme (~166 copies restricted entirely to Crassvirales phage genomes) represents an extreme case of refugial survival, paralleling the coelacanth 34. The hammerhead satisfies many criteria of a disaster taxon: it is a small, adaptive (modular) 35 and versatile ribozyme; it can tolerate and operate in diverse, non-standard chemical conditions 36,37; and it possesses the broadest substrate specificity (NUH↓, covering 12 of 16 dinucleotide combinations) of all small, self-cleaving ribozymes. Disaster taxa expand in a typical pattern. In order to examine whether hammerhead ribozymes exhibit such a pattern, the phylogeny of 80,548 unique complete hammerhead sequences 3 .CC-BY 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted March 9, 2026. ; https://doi.org/10.64898/2026.03.05.709948doi: bioRxiv preprint spanning 97 genera was constructed. This phylogeny exhibits a star-like topology consistent with rapid post-bottleneck expansion (Fig. 2B). Population genetics statistics computed on the full stratified subsample (n = 5,036; S = 40) reveal Tajima's D = +3.79 and Fu's Fs = −690.8; coalescent null testing on a matched subsample (n = 200; S = 37; 10,000 replicates) confirms both values are extreme (D = +2.28, upper-tail p = 0.013; Fs = −618.0, p < 0.0001) (Fig. 2C). The discordance between positive D (reflecting deep hierarchical structure across 97 genera) and extremely negative Fs (reflecting haplotype excess from rapid within-lineage expansion) is diagnostic of ancient divergence preserved through a bottleneck, followed by explosive radiation within surviving host lineages 38. The mass extinction of ribozymes placed the hammerheads that survived it at a unique position that allowed these ribozymes to shape the early exit from a pre-code world. The emergence of the genetic code is one of the most profound questions in biology, and is still under active study 39-42. It has been addressed by various angles, including chemistry and information theory, but rarely as a legacy of ecological circumstance at the R-C boundary. Some of the predictions of this hypothesis were next tested. If the hammerhead's body plan shaped part of the emergence of the genetic code, it would have been likely that the functional organs of the hammerhead partitioned the trinucleotide sequence space in ways corresponding to functional categories of the genetic code. We first analyzed stem and core regions from the entire curated dataset of hammerhead ribozymes to see if such partitions exist. Two results confirmed this hypothesis. Stems and cores contain RNA triplets that clearly segregate by their roles in the genetic code (Fig. 3A). In particular, amino acids that are hypothesized to have entered the genetic code early, are enriched 2.8-fold in stems (0.8-0.9-fold in cores) (Fig. 3B). A Spearman correlation between codon-level stem enrichment and amino acid chronological rank yielded ρ = −0.243 (p = 0.06, permutation p = 0.056): marginal, but in the predicted direction. The correlation is driven by the binary partition between core-resident and non-core trinucleotides rather than a smooth gradient, reflecting the discrete nature of ribozyme architecture. Interestingly, standard stop codons are enriched 3.4-fold in catalytic cores of all ribozymes, and 6.1-fold in the hammerhead core specifically, while being depleted 0.4-fold in stem regions (Fig. 3A, Fig. S1). The rationale for catalytically-active RNA triplets assuming the role of stop signals is twofold: first, they are hazardous to the integrity (and thus reliability) of RNA messages, so they need to be kept as far as possible from the message (at its end); and second, they are by definition natural terminators of RNA strands. In a wide range of hammerhead ribozymes, the most striking feature of the catalytic core is the motif CUGAUGA, which includes two tandem repeats of the UGA (opal/umber) stop codon. The hammerhead is the only of the small self-cleaving ribozymes to contain two stop codons. The unique placement of the hammerhead in the post-extinction world raises the hypothesis that UGA was the first primitive stop codon, and was replaced only later by UAA. UGA's leakiness and frequent reassignments 43,44 could be reinterpreted not as recent recruitment but as evidence of its pre-translational origin; it was never optimized for the release factor system. This is supported by analysis of 492 universally conserved genes across three organisms: universal genes strongly prefer UAA (e.g., E. coli: 88% vs. 64% background, p = 5.4 × 10⁻⁶ ), consistent with UAA as the ancestral translational stop. In S. cerevisiae and H. sapiens, UGA shows less 4 .CC-BY 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted March 9, 2026. ; https://doi.org/10.64898/2026.03.05.709948doi: bioRxiv preprint downstream context bias than UAA (S. cerevisiae : mean hexamer IC = 0.12 vs. 0.19 bits; H. sapiens: 0.06 vs. 0.06 bits), consistent with UGA's function originating in a pre-translational system where downstream context was irrelevant 45,46 (Fig. 3C, 3D) Furthermore, UGA readthrough sites - where UGA is decoded as selenocysteine or other amino acids rather than functioning as a stop - show significantly more stable local RNA structure than efficiently terminating UGA controls (MFE: −30.6 vs. −16.8 kcal/mol, p = 1.2 × 10⁻⁵ ) and more stem-loop structures (p = 0.006), consistent with structural elements shielding UGA from the termination machinery and preserving its pre-translational accessibility to alternative decoding (Fig. 3E). 5 .CC-BY 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted March 9, 2026. ; https://doi.org/10.64898/2026.03.05.709948doi: bioRxiv preprint

Discussion

The central claim of this paper is that the transition from the RNA world to cellular life was triggered by a catastrophic event leading to mass extinction of ribozymes, and that the survivors of this event had left a legible imprint on the genetic code. The evidence presented - geochemical convergence of stressors, ecological signatures of mass extinction among extant ribozymes, and the correspondence between the ribozyme body plan and genetic code primitives - lends support to this hypothesis. Several aspects of this framework warrant further discussion. The mass extinction model makes a specific prediction about the nature of the geochemical trigger: it should have been rapid relative to evolutionary timescales, multi-causal rather than attributable to a single stressor, and selectively destructive of ecologically specialized forms. The convergence of reduced bombardment, temperature shift, pH changes, and depletion of available phosphate documented for the 3.9-3.8 Ga interval satisfies all three criteria. While individual parameters remain subject to considerable uncertainty - early ocean pH estimates range from 5 to 7 depending on the model [Guo & Korenaga, 2025; Krissansen-Totton et al., 2018; Zahnle et al., 2010] - the convergence of multiple independent environmental shifts is robust across models. The distinction between ecological stop (UGA) and translational stop (UAA) proposes a reframing of the traditional view of UGA as a late addition to the code 47. The present model pictures UGA as the primary weapon carried by the ribozymes that survived the extinction, who carried it in two copies each; it was in this capacity as a ribozyme-degrading blade, that UGA functioned as a natural termination signal for any primordial translation process that utilized RNA as its message and occurred in its vicinity. This might be a reason why the first amino acids to be translated by RNA were translated at the stem, physically remote from this hazardous triplet. The apparent unreliability of UGA in the modern code reflects its ancestry - a signal from a fundamentally different world, which was never fully adapted to the modern system it was grandfathered into. A further prediction - that UGA readthrough sites would retain structural signatures of their pre-translational history - was confirmed: readthrough sites show significantly more stable local RNA structure than termination controls (p = 1.2 × 10⁻⁵ ) and more stem-loop structures (p = 0.006). This is consistent with a model in which UGA's transition from ecological stop to translational stop was completed most efficiently at structurally accessible sites, where release factors could recognize UGA without steric interference. At readthrough sites, elaborate local structures (SECIS elements, viral pseudoknots) shield UGA from the termination machinery, preserving its availability for alternative decoding - an echo of the post-extinction world in which UGA first acquired biological meaning. Finally, the present study raises the possibility of a mass extinction event at the R-C boundary, showing multiple parallels to other extinction events from the geological past. Such catastrophic events are known to not only exterminate many clades, but also open new opportunities and trigger rapid evolution and expansion of new branches. It is intriguing to wonder whether, just as the Cambrian explosion followed the Ediacaran extinction, and the mammalian radiation followed the dinosaur extinction, the emergence of cellular life and the genetic code followed the 6 .CC-BY 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted March 9, 2026. ; https://doi.org/10.64898/2026.03.05.709948doi: bioRxiv preprint extinction of ribozymes, of which we might learn from their fossil record within genomes across all kingdoms of life. 7 .CC-BY 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted March 9, 2026. ; https://doi.org/10.64898/2026.03.05.709948doi: bioRxiv preprint

Methods

Trinucleotide enrichment analysis. Trinucleotide frequencies were computed for catalytic core and structural stem regions across all seven small self-cleaving ribozyme families using Rfam seed alignments [Kalvari et al., 2021]. Each sequence was parsed into core and stem regions based on published secondary structure annotations. Enrichment was calculated as observed frequency divided by expected frequency (background trinucleotide composition of the full sequence). A total of 524,824 stem trinucleotides and 563,867 core trinucleotides were analyzed. Pyrimidine-pyrimidine dinucleotide analysis. For each ribozyme family, the frequency of pyrimidine-pyrimidine dinucleotides (CC, CT, TC, TT/UU) was computed in catalytic core sequences and compared to the expected frequency of 0.25 under random composition. Trifonov consensus chronology correlation. Amino acid chronological ranks were taken from Trifonov's consensus chronology [Trifonov, 2004]. For each of the 61 sense codons, stem enrichment was calculated as described above. Spearman rank correlation was computed between codon-level stem enrichment and the chronological rank of the corresponding amino acid. Permutation testing (10,000 permutations) was used to assess significance. Phylogenetic analysis. 80,548 unique complete hammerhead ribozyme sequences were retrieved from Rfam [Kalvari et al., 2021] and aligned using MAFFT v7 [Katoh & Standley, 2013]. An approximate maximum-likelihood tree was constructed using FastTree 2 [Price et al., 2010] with the GTR+CAT model. Population genetics statistics. Tajima's D [Tajima, 1989] and Fu's Fs [Fu, 1997] were computed on a stratified subsample of 5,036 sequences preserving the taxonomic distribution of the full dataset (97 genera). The subsample contained 40 segregating sites and 4,555 distinct haplotypes. P-values were assessed by coalescent simulation (10,000 replicates). UGA contextual analysis. Stop codon usage and flanking nucleotide context were analyzed in 492 universally conserved genes (ribosomal proteins, translation factors, RNA polymerase subunits) across E. coli (93 genes), S. cerevisiae (300 genes), and H. sapiens (99 genes), compared against 10,298 total coding sequences. Information content was computed for the six positions downstream of each stop codon as IC = Σ pᵢ log₂(pᵢ/0.25). UGA readthrough analysis. 25 known UGA readthrough sites (20 human selenoprotein SECIS elements: GPX1-4,6, TXNRD1-3, DIO1-3, SELENOP,W,H,I,K,N,S,T, MSRB1; plus 5 cellular readthrough genes: rabbit beta-globin, TMV , MuLV , Sindbis, VEGFA-Ax) were compared against 26 control UGA terminators from housekeeping genes (8 E. coli, 4 S. cerevisiae, 14 H. sapiens). All sequences were fetched from NCBI. Minimum free energy, base-pairing probability, and hairpin loop count were computed using RNAfold from ViennaRNA 2.7.0 [Lorenz et al., 2011] on 103-nucleotide windows (50 nt upstream + UGA + 50 nt downstream). Statistical comparisons used Mann-Whitney U tests (MFE, pairing probability, hairpin count) and Fisher's exact test (UGA paired/unpaired classification). 8 .CC-BY 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted March 9, 2026. ; https://doi.org/10.64898/2026.03.05.709948doi: bioRxiv preprint Body plan classification. Ribozyme body plan classification follows the framework established in Bachelet [2026], with structural regions designated as bodies (paired stems), cavities (catalytic cores), and limbs (unpaired substrate-interacting strands). Author contributions I.B. designed experiments, performed experiments, analyzed data, and wrote manuscript. Competing interests declaration The author declares no competing interests. 9 .CC-BY 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted March 9, 2026. ; https://doi.org/10.64898/2026.03.05.709948doi: bioRxiv preprint

References

1. Neveu, M., Kim, H.-J. & Benner, S. A. The ‘strong’ RNA world hypothesis: fifty years old. Astrobiology 13, 391-403 (2013). 2. Gilbert, W. Origin of life: The RNA world. Nature Publishing Group UK http://dx.doi.org/10.1038/319618a0 (1986) doi:10.1038/319618a0. 3. Kruger, K. et al. Self-splicing RNA: autoexcision and autocyclization of the ribosomal RNA intervening sequence of Tetrahymena. Cell 31, 147-157 (1982). 4. Guerrier-Takada, C., Gardiner, K., Marsh, T., Pace, N. & Altman, S. The RNA moiety of ribonuclease P is the catalytic subunit of the enzyme. Cell 35, 849-857 (1983). 5. Szostak, J. W., Bartel, D. P. & Luisi, P. L. Synthesizing life. Nature 409, 387-390 (2001). 6. Sutherland, J. D. The Origin of Life--Out of the Blue. Angew Chem Int Ed Engl 55, 104-121 (2016). 7. Deamer, D. The Role of Lipid Membranes in Life’s Origin. Life (Basel) 7, (2017). 8. Mansy, S. S. et al. Template-directed synthesis of a genetic polymer in a model protocell. Nature 454, 122-125 (2008). 9. Szostak, J. W. The eightfold path to non-enzymatic RNA replication. J. Syst. Chem. 3, 2 (2012). 10. Raup, D. M. The role of extinction in evolution. Proc Natl Acad Sci U S A 91, 6758-6763 (1994). 11. Website. https://doi.org/10.1666/0094-8373(2005)031[0192:MEAM]2.0.CO;2[Opens in a new window] doi:10.1666/0094-8373(2005)031[0192:MEAM]2.0.CO;2[Opens in a new window]. 12. Metcalfe, I. BENTON, M. J. 2003. When Life Nearly Died. The Greatest Mass Extinction of All Time. 336 pp. London: Thames & Hudson. Price £16.95 (hard covers). ISBN 0 500 05116 X. Geol. Mag. 140, 613-613 (2003). 13. Alroy, J. The fossil record of North American mammals: evidence for a Paleocene evolutionary radiation. Syst Biol 48, 107-118 (1999). 14. Mojzsis, S. J. et al. Evidence for life on Earth before 3,800 million years ago. Nature 384, 55-59 (1996). 15. Rosing, M. T. 13C-Depleted carbon microparticles in >3700-Ma sea-floor sedimentary rocks from west greenland. Science 283, 674-676 (1999). 16. Bottke, W. F. & Norman, M. D. The Late Heavy Bombardment. Annu. Rev. Earth Planet. Sci. 45, 619-647 (2017). 17. Ritson, D. J., Mojzsis, S. J. & Sutherland, J. D. Supply of phosphate to early Earth by photogeochemistry after meteoritic weathering. Nat Geosci 13, 344-348 (2020). 18. Pasek, M. A., Harnmeijer, J. P., Buick, R., Gull, M. & Atlas, Z. Evidence for reactive reduced phosphorus species in the early Archean ocean. Proc Natl Acad Sci U S A 110, 10089-10094 (2013). 19. Guo, M. & Korenaga, J. Rapid rise of early ocean pH under elevated weathering rates. Nat. Geosci. 18, 260-266 (2025). 20. Krissansen-Totton, J., Arney, G. N. & Catling, D. C. Constraining the climate and ocean pH of the early Earth with a geological carbon cycle model. Proc. Natl. Acad. Sci. U. S. A. 115, 4105-4110 (2018). 21. Website. https://doi.org/10.1029/2023GC011252Digital Object Identifier (DOI) doi:10.1029/2023GC011252Digital Object Identifier (DOI). 22. Li, Y . & Breaker, R. R. Kinetics of RNA degradation by specific base catalysis of 10 .CC-BY 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted March 9, 2026. ; https://doi.org/10.64898/2026.03.05.709948doi: bioRxiv preprint transesterification involving the 2‘-hydroxyl group. J. Am. Chem. Soc. 121, 5364-5372 (1999). 23. Brady, M. P., Tostevin, R. & Tosca, N. J. Marine phosphate availability and the chemical origins of life on Earth. Nat Commun 13, 5162 (2022). 24. Watanabe, Y ., Ozaki, K. & Tajika, E. Marine phosphate level during the Archean constrained by the global redox budget. Geophys. Res. Lett. 51, (2024). 25. Arens, N. C. & West, I. D. Press-pulse: a general theory of mass extinction? Paleobiology 34, 456-471 (2008). 26. Bond, D. P. G. & Grasby, S. E. On the causes of mass extinctions. Palaeogeogr. Palaeoclimatol. Palaeoecol. 478, 3-29 (2017). 27. de la Peña, M. & García-Robles, I. Ubiquitous presence of the hammerhead ribozyme motif along the tree of life. RNA 16, 1943-1950 (2010). 28. Perreault, J. et al. Identification of hammerhead ribozymes in all domains of life reveals novel structural variations. PLoS Comput Biol 7, e1002031 (2011). 29. Hart, M. B. Biotic Recovery from Mass Extinction Events. (Geological Society of London, 1996). 30. Botha-Brink, J., Codron, D., Huttenlocker, A. K., Angielczyk, K. D. & Ruta, M. Breeding Young as a Survival Strategy during Earth’s Greatest Mass Extinction. Sci Rep 6, 24053 (2016). 31. Smith, R. M. H. & Botha-Brink, J. Anatomy of a mass extinction: Sedimentological and taphonomic evidence for drought-induced die-offs at the Permo-Triassic boundary in the main Karoo Basin, South Africa. Palaeogeogr. Palaeoclimatol. Palaeoecol. 396, 99-118 (2014). 32. Kalvari, I. et al. Rfam 14: expanded coverage of metagenomic, viral and microRNA families. Nucleic Acids Res 49, D192-D200 (2021). 33. Weinberg, Z. et al. New classes of self-cleaving ribozymes revealed by comparative genomics analysis. Nature Chemical Biology 11, 606-610 (2015). 34. Smith, J. L. B. A living fish of Mesozoic type. Nature 143, 455-456 (1939). 35. Peña, M. de la, de la Peña, M., García-Robles, I. & Cervera, A. The Hammerhead Ribozyme: A Long History for a Short RNA. Molecules vol. 22 78 Preprint at https://doi.org/10.3390/molecules22010078 (2017). 36. Athavale, S. S. et al. RNA folding and catalysis mediated by iron (II). PLoS One 7, e38024 (2012). 37. Weiss, R. et al. A food chain of ribozymes built on an ancestral strand. bioRxiv 2025.12.22.695801 (2025) doi:10.64898/2025.12.22.695801. 38. Excoffier, L. Patterns of DNA sequence diversity and genetic structure after a range expansion: lessons from the infinite-island model. Mol Ecol 13, 853-864 (2004). 39. Lenstra, R., Lenstra, A. C. & José, M. V . Crick’s ‘Frozen accident theory’ of the genetic code explored by Ising models. J Theor Biol 609, 112146 (2025). 40. Website. https://doi.org/10.1016/0022-2836(68)90392-6 doi:10.1016/0022-2836(68)90392-6. 41. Website. https://doi.org/10.48550/arXiv.1007.4122 doi:10.48550/arXiv.1007.4122. 42. Aharon, A., Polak, P. & Yaari, G. Revealing the inherent design principles of the genetic code via an error correcting code representation. Scientific Reports 1-16 (2026). 43. Barrell, B. G., Bankier, A. T. & Drouin, J. A different genetic code in human mitochondria. Nature 282, 189-194 (1979). 11 .CC-BY 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted March 9, 2026. ; https://doi.org/10.64898/2026.03.05.709948doi: bioRxiv preprint 44. Böck, A. et al. Selenocysteine: the 21st amino acid. Mol Microbiol 5, 515-520 (1991). 45. Cridge, A. G., Crowe-McAuliffe, C., Mathew, S. F. & Tate, W. P. Eukaryotic translational termination efficiency is influenced by the 3’ nucleotides within the ribosomal mRNA channel. Nucleic Acids Res 46, 1927-1944 (2018). 46. Poole, E. S., Brown, C. M. & Tate, W. P. The identity of the base following the stop codon determines the efficiency of in vivo translational termination in Escherichia coli. EMBO J 14, 151-158 (1995). 47. Koonin, E. V . & Novozhilov, A. S. Origin and evolution of the genetic code: the universal enigma. IUBMB Life 61, 99-111 (2009). 12 .CC-BY 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted March 9, 2026. ; https://doi.org/10.64898/2026.03.05.709948doi: bioRxiv preprint Figure 1 Figure 1. Geochemical changes at the RNA-cellular boundary. Composite timeline of environmental variables across the 4.5-3.5 Ga interval, with the proposed R-C boundary interval (~3.9-3.8 Ga) indicated. (A) Declining impact flux (data from Bottke & Norman, 2017). (B) Surface temperature evolution showing the crash from >200 °C to near-modern values (data from Guo & Korenaga, 2025). (C) Ocean pH rise from acidic to neutral (data from Guo & Korenaga, 2025). (D) Declining delivery of reduced phosphorus by late accretion (data from Ritson et al., 2020). Together, these panels document a convergence of environmental stressors - physical (bombardment), thermal (cooling), chemical (pH shift), and nutritional (phosphate depletion) - coinciding with the proposed extinction interval. 13 .CC-BY 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted March 9, 2026. ; https://doi.org/10.64898/2026.03.05.709948doi: bioRxiv preprint Figure 2 Figure 2. Evidence for ribozyme mass extinction. (A) Abundance distribution across all ten known small self-cleaving ribozyme families, showing the hammerhead's 91% dominance, a pattern characteristic of disaster taxa following mass extinction events. (B) Circular phylogeny of 80,548 unique hammerhead sequences colored by host taxonomic group. The star-like topology is consistent with post-bottleneck radiation, and the interspersion of distantly related host phyla around the tree reflects repeated independent colonization events rather than co-speciation, consistent with horizontal transfer of a single dominant survivor lineage. (C) Null distribution tests for Tajima's D (left) and Fu's Fs (right), each based on 10,000 coalescent replicates (n = 200, S = 40). Blue and orange histograms show the null distributions under neutral expectations; red dashed lines indicate the observed values (D = +2.28, upper-tail p = 0.013; Fs = −618.0, p < 0.0001). Observed D falls in the extreme upper tail (98.7% of null replicates below observed), indicating deep hierarchical structure; observed Fs falls far below any null replicate, indicating massive haplotype excess. The discordance between positive D and extremely negative Fs is diagnostic of ancient divergence preserved through a bottleneck followed by rapid within-lineage expansion. Full-sample values (n = 5,036) are D = +3.79, Fs = −690.8. 14 .CC-BY 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted March 9, 2026. ; https://doi.org/10.64898/2026.03.05.709948doi: bioRxiv preprint Figure 3 Figure 3. The body plan's imprint on the genetic code. (A) Trinucleotide enrichment heatmap showing stop codons enriched in catalytic cores (6.1-fold) and early amino acid codons enriched in structural stems (2.8-fold for Val/Asp/Gly). (B) Stem enrichment vs. amino acid chronological rank (Trifonov consensus), showing a marginal but predicted negative correlation at codon level (ρ = −0.243, p = 0.06, permutation p = 0.056). (C) Stop codon usage and downstream context in universally conserved genes across E. coli, S. cerevisiae, and H. sapiens. Left: fractional usage of UAA, UAG, and UGA, showing a progressive shift from UAA dominance in E. coli (88% UAA) to UGA dominance in H. sapiens (51% UGA), consistent with UGA's gradual co-option from a pre-translational cleavage signal into a translational stop codon. (D) Mean information content (IC) of the downstream hexamer for UAA- vs. UGA-terminated universal genes, showing that UAA-terminated genes carry higher downstream sequence constraint in E. coli and S. cerevisiae , consistent with UAA having been under translational selection longer than UGA. (E) UGA readthrough vs. termination structural analysis. Readthrough sites (n = 25; 20 selenoprotein SECIS elements, 5 cellular readthrough genes) show significantly more stable local RNA structure than UGA termination controls (n = 26; MFE: −30.6 vs. −16.8 kcal/mol, p = 1.2 × 10⁻⁵ ) and more stem-loop structures (2.48 vs. 1.85 hairpins, p = 0.006), consistent with local structure shielding UGA from release factors and preserving its accessibility to alternative decoding machinery. 15 .CC-BY 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted March 9, 2026. ; https://doi.org/10.64898/2026.03.05.709948doi: bioRxiv preprint Table 1. Parallels between ribozyme and animal mass extinction signatures. Each row identifies a hallmark signature of mass extinction and recovery, with the ribozyme evidence from this study and its closest paleontological analog. Ribozyme citations refer to independent published studies except where noted; paleontological analogs span the end-Permian, end-Cretaceous, and Late Ordovician mass extinctions. See Supplementary Note S2 for expanded discussion of each parallel. Extinction signature Ribozyme extinction Paleontological analog Disaster taxon dominance Hammerhead: ~91% of ~221,000 sequences; present in all domains of life De la Peña & García-Robles 2010; Perreault et al. 2011 Lystrosaurus: 70-95% of Early Triassic terrestrial vertebrate individuals Botha-Brink et al. 2016; Kauffman & Harries 1996 Relict in refugium Hatchet: ~166 copies restricted to Crassvirales phage genomes Rfam RF02678; Weinberg et al. 2019 Latimeria (coelacanth): 2 species in deep-sea caves; presumed extinct 66 Myr Smith 1939; Amemiya et al. 2013 Relict survivor (restricted range) Hairpin (~139), Pistol (~103), Twister sister (~129): each in narrow host ranges Rfam 15.1; Weinberg et al. 2019 Nautilus: sole extant nautiloid; restricted Indo-Pacific after ammonite extinction Ward 1987 Generalist survival advantage Hammerhead NUH↓: 12/16 dinucleotide targets (NHH rule); tolerates clay, Fe²⁺, extreme conditions Kore et al. 1998; Athavale et al. 2012; Biondi et al. 2007 K-Pg bivalves: genera in ≥3 biogeographic provinces had ~20% extinction vs. 70% for restricted Jablonski 2005 Marine disaster taxa Hammerhead dominant across marine invertebrate host phyla (Trematoda, Bivalvia, Cephalopoda) De la Peña & García-Robles 2010; Perreault et al. 2011 Claraia, Unionites, Lingularia: low-diversity high-dominance Early Triassic benthos Clapham et al. 2016 Star phylogeny / post-bottleneck radiation Hammerhead: star-like tree across 97 genera; D = +3.79, Fs = −690.8 This study Neoaves: explosive diversification at K-Pg; unresolved star polytomy in molecular phylogenies Jarvis et al. 2014 16 .CC-BY 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted March 9, 2026. ; https://doi.org/10.64898/2026.03.05.709948doi: bioRxiv preprint Figure S1 Figure S1. Partitioning of amino acid roles and enrichment of stop codons in catalytic cores across small, self-cleaving ribozymes (~ 221,000). 17 .CC-BY 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted March 9, 2026. ; https://doi.org/10.64898/2026.03.05.709948doi: bioRxiv preprint

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-pdf

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00
unpaywall
last seen: 2026-05-23T02:00:01.238055+00:00
License: CC-BY-4.0