Codon degeneracy contributes to divergent fitness effects of rare tRNAs with A-starting anticodons

preprint OA: closed CC-BY-4.0
📄 Open PDF Full text JSON View at publisher
Full text 115,961 characters · extracted from oa-pdf · 9 sections · click to expand

Abstract

14 15 Transfer RNA (tRNA) repertoires vary greatly across genomes, shaped by genetic drift and 16 selection. A peculiar pattern across prokaryotes is the near-complete absence of tRNAs with 17 unmodified adenine at the 34th (wobble) position (i.e., tRNAANN). Each of these tRNAs are just 18 a single mutation away from several other tRNAs. Hence, their persistent absence suggests 19 fundamental but hitherto unclear constraints. We engineered 36 Escherichia coli strains 20 expressing tRNAs carrying each theoretically possible ANN anticodon to determine their 21 functionality and fitness effects. Notably, there was no evidence of broad toxicity due to these 22 tRNAs. All five tRNAANN tested underwent post-transcriptional maturation and all seven tested 23 compensated for the deletion of their respective native tRNABNN (carrying G, C or U at the 34th 24 position), demonstrating that tRNA ANN are translationally active. Furthermore, tRNAANN from 25 four-fold degenerate (4D) codon boxes were unmodified and were generally neutral or 26 beneficial, whereas tRNAANN from two-fold degenerate (2D) boxes underwent A34-to-I34 27 modification and were more likely to impair fitness. We suggest superwobbling by tRNAANN — 28 decoding an entire four-codon set — as one mechanism underlying these differential fitness 29 effects. Maximal degeneracy in 4D boxes buffers or exploits tRNAANN superwobbling via 30 synonymous decoding, whereas constrained degeneracy in 2D boxes renders it deleterious, 31 likely through amino acid misincorporation. Thus, these differential fitness effects, sharpens 32 the paradox of neutral or beneficial yet absent 4D tRNAANN, while beginning to empirically 33 unravel underlying causes for the absence of 2D tRNAANN. 34 35 36

Keywords

tRNA, Mistranslation, Superwobble , tRNA modifying enzymes, Adenine 34 , 37 Inosine, tadA, ADAT, Codon degeneracy 38 39 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint

Introduction

40 Transfer RNA (tRNA) pools, largely determined by tRNA gene copy numbers (GCN), play a 41 crucial role in maintaining translation rate, translation accuracy, growth rate and overall fitness 42 (Young et al. 1976; Dong et al. 1996; Scott et al. 2010; Scott and Hwa 2011; Klumpp et al. 43 2013; Wilusz 2015; Hu and Lercher 2021; Raval et al. 2023). Within species, tRNA repertoires 44 adapt in response to various environmental factors (Nilsson et al. 2006; Iben et al. 2011; 45 Bedhomme et al. 2019; Ayan et al. 2020; Khomarbaghi et al. 2024), and across species, they 46 echo the diversity of ecological niches (Goodenbour and Pan 2006; Vieira -Silva and Rocha 47 2010; Fujishima and Kanai 2014; Chan and Lowe 2016; Rak et al. 2018) . For a given amino 48 acid, the GCN of tRNAs with distinct anticodons for the same amino acid (‘tRNA isoacceptors’) 49 varies across species. The GCN is often correlated with relative codon usage (Ikemura 1985; 50 Dong et al. 1996; Berg and Kurland 1997; Elf et al. 2003; Rocha 2004; Novoa and Ribas de 51 Pouplana 2012; McDonald et al. 2015), contributing to the diversity of tRNA repertoires. The 52 effective cytosolic tRNA pools can also vary with expression levels from each tRNA gene copy 53 (Kanaya et al. 1999; Dittmar et al. 2004; Dittmar et al. 2006; Bloom -Ackermann et al. 2014; 54 Sagi et al. 2016; Raval et al. 2023) and differential charging across isoacceptors (Elf et al. 55 2003; Dittmar et al. 2005). Another layer of variability is generated by tRNA modifying enzymes 56 (MEs) that can post-transcriptionally modify tRNAs at about a dozen different sites, particularly 57 in the anticodon loop which extends the initially proposed wobble pairing (Crick 1966) to an 58 expanding list of modified wobbles (Agris et al. 2007; Grosjean et al. 2010; Iben and Maraia 59 2012; Novoa et al. 2012; Agris et al. 2017; Maraia and Arimbasseri 2017; Agris et al. 2018; 60 Suzuki 2021) . Despite such numerous mechanisms to specifically generate an immense 61 diversity of functional tRNAs within and across species , tRNAs with specific anticodons are 62 curiously either missing or rare (Maraia and Arimbasseri 2017; Diwan and Agashe 2018; Rak 63 et al. 2018; Torres 2019; Lei and Burton 2020; Ehrlich et al. 2021; Pernod et al. 2021). 64 Of these seemingly ‘prohibited’ tRNAs, perhaps most striking is the lack of those carrying an 65 unmodified adenine at the 34th (wobble) position (henceforth, tRNAANN). A total of eight ANN 66 anticodons are theoretically possible for the eight family-codon boxes with four -fold 67 degeneracy (‘4D’; whereby all four codons within a box, differing only at the 3rd base, encode 68 the same amino acid), and another eight for the split-codon with two-fold degeneracy (‘2D’; 69 where the four codons within a box encode different amino acids). tRNAANN carrying each of 70 these theoretically possible ANN are a single point mutation away from tRNAUNN, tRNAGNN, 71 and tRNACNN (collectively, ‘tRNABNN’) in the same codon box, and should occur frequently in 72 bacterial populations. For instance, with a mutation rate of 9.1×10−11 per base per generation, 73 an Escherichia coli population that grows from ca. 1,000 cells to 1 billion cells (e.g., a 1 mL 74 culture grown in LB medium overnight) will sample at least one mutation leading to a tRNAANN. 75 However, seven 2D codon tRNAANN and one 4D codon tRNAANN are universally absent (Fig. 76 1A). In addition, seven more 4D tRNAANN are absent in prokaryotes, though Leu-tRNAAAG and 77 Thr-tRNAAGT do occur in some bacteria (Andachi et al. 1987; Borén et al. 1993; Inagaki et al. 78 1995; Phillips and de Crécy -Lagard 2011; Diwan and Agashe 2018; Ehrlich et al. 2021) . 79 Curiously, these latter seven 4D codon box tRNA ANN are also absent from eukaryotic 80 organelles of bacterial origin and from bacteriophages (Hatfull 2015; Pope et al. 2015; 81 Morgado and Vicente 2019; Ehrlich et al. 2021) . Hence, comparative genomics alone 82 underscore the persistent rarity of tRNAANN, suggesting that their expression may not be well-83 tolerated. 84 There are some exceptions where tRNAANN are genomically encoded; however , their post-85 transcriptional processing again indicates poor tolerance of unmodified tRNAANN. Both bacteria 86 and eukaryotes encode Arg-tRNAACG, and it is in fact the preferred Arg-tRNA isoacceptor. But 87 in this case, the A34 is post-transcriptionally masked via deamination by the modifying enzyme 88 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint TadA (Karcher and Bock 2009; Diwan and Agashe 2018; Rafels-Ybern et al. 2019; Ehrlich et 89 al. 2021) in prokaryotes or ADAT in eukaryotes, resulting in tRNAs with inosine-34 (tRNAICG) 90 in the cytosol (Gerber and Keller 1999; Wolf et al. 2002; Torres et al. 2014) and in the 91 chloroplast stroma (Delannoy et al. 2009; Karcher and Bock 2009) . Eukaryotes also encode 92 and similarly modify six additional 4D codon box tRNAANN (Torres et al. 2014; Rafels-Ybern et 93 al. 2018) (Fig. 1A). The only remaining 4D codon box tRNA ANN (Gly-tRNAACC) is universally 94 absent, likely because it is a poor substrate for ADAT (Saint-Léger et al. 2016) and TadA 95 (Borén et al. 1993). The A-to-I modification appears to be essential for efficient translation in 96 bacteria (Wolf et al. 2002), eukaryotes (Torres et al. 2021) and chloroplasts (Delannoy et al. 97 2009). Notably, both tRNA ANN and TadA/ADAT appear to be absent in archaea and 98 mitochondria (Ehrlich et al. 2021) . Together, the strict co -occurrence of tRNA ANN and 99 TadA/ADAT and the universal lack of the poor TadA/ADAT substrates tRNAANN point to ancient 100 and strong purifying selection against unmodified tRNAANN, which has likely persisted through 101 the four billion years of life on Earth. 102 Single base mutations should frequently generate tRNAANN from tRNABNN genes, but are 103 presumably removed by selection. A first step towards understanding the intensity of and 104 mechanisms driving such selection is comprehensive in vivo tests of functionality and potential 105 fitness impacts of unmodified tRNAANN. Previous studies have largely focused on a few ANN 106 anticodons in their native backbones or expressed multiple ANN from a single backbone, 107 limiting the ability to generalize. For instance, Gly -tRNAACC is unmodified and functional in 108 Escherichia coli (Borén et al. 1993) and humans (Saint-Léger et al. 2016) and Pro-tRNAAGG 109 does not affect fitness or translational elongation rates in Salmonella (Chen et al. 2002). Both 110 these tRNAANN were generated from the respective native tRNABNN backbones. In another 111 study, all sixteen possible ANN anticodons were generated from a single Methanocaldococcus 112 jannaschii Tyr-tRNA backbone, and these introduced tRNAANN decoded 1-20% of NNU codons 113 in E. coli (Biddle et al. 2016; Schmitt et al. 2018; Schmitt et al. 2024) . These observations 114 suggest that unmodified tRNA ANN are likely to be translationally active and non -lethal. 115 However, the decoding capacities and modifications of tRNAs (including A-to-I) depend on the 116 backbone sequence (Li et al. 1997; Qian et al. 1997; Nakanishi et al. 2005; Schmeing et al. 117 2011; Saint-Léger et al. 2016; Roura Frigolé et al. 2019); and fitness impacts of tRNAs depend 118 on the ecological niche (Bloom-Ackermann et al. 2014; Li et al. 2016; Li and Zhang 2018; 119 Gabzi et al. 2022; Raval et al. 2023). Therefore, validating function in native tRNA backbones 120 and conducting systematic fitness assays across different growth media are both essential to 121 quantify and understand the source of the hypothesized selection against tRNAANN. 122 To this end, we introduced 20 tRNAANN via point mutation in the native tRNA genes of E. coli, 123 and tested their expression, maturation and functionality. Our results provide some of the first 124 empirical evidence for folding, maturation and translational activity of a broad set of tRNAANN 125 in an endogenous context. Most of the tRNAANN were accommodated in the cytosol 126 unmodified, did not alter fitness under tested conditions , and a subset of tested tRNA ANN 127 evidently effectively replaced native tRNABNN. However, of tRNAANN that affected fitness, 2D 128 and 4D tRNAANN differed from each other in that 2D tRNAANN appeared prone to modification 129 and reduced fitness when overexpressed , whereas 4D tRNA ANN appeared unmodified and 130 improved fitness, especially in nutrient poor media . We propose that within-codon box 131 superwobbling (decoding of all four codons) may underlie the observed fitness differences 132 wherein the degeneracy of the code will buffer or profit from superwobbling of 4D tRNAANN but 133 not from 2D tRNAANN due to the limited degeneracy of 2D codon boxes. Overall, our results 134 highlight the lack of any clear overarching source of purifying selection on most tRNAANN and 135 underscore benefits of a majority of 4D tRNAANN, deepening the mystery of their near-universal 136 absence. However, our work also points at mistranslation as a potential source of counter -137 selection for 2D codon amino acids, which merits further research. 138 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint

Methods

139 Generating strains 140 We conducted all tRNA gene manipulations in E. coli MG1655 (the wild type, WT). The native 141 tRNA genes were amplified from the genome and cloned into an IPTG-inducible low copy 142 number plasmid (pACDH, a derivative of pACD with an ACYC origin of replication (Mangroo 143 and RajBhandary 1995; Rao and Varchney 2001) ) or a high copy number plasmid (pUC19 144 (Norrander et al. 1983)). We introduced an B34-to-A34 point mutation (‘B34A’) in the native 145 tRNABNN genes via PCR mutagenesis (Hoa et al. 1989) , followed by Sanger sequencing to 146 confirm the mutation. To introduce tRNAANN on the genome, we utilized a scar-free, two-step 147 allelic exchange method that utilizes the pKOV vector that carries a chloramphenicol 148 resistance marker (Link et al. 1997). Briefly, we amplified the locus with the native tRNA gene 149 of interest (along with ca. 200 -300bp upstream and downstream regions) from the genome 150 while also introducing a B34A substitution in the amplicon through PCR mutagenesis . We 151 cloned the amplicons with mutated alleles into pKOV and transformed the WT with pKOV -152 tRNAANN. We selected colonies with genomic integration of pKOV-tRNAANN on 153 chloramphenicol (20 µg/ml) at 43°C (where the pKOV plasmid replicates very inefficiently) and 154 confirmed the chromosomal insertion via PCR. We selected cells that had removed pKOV, by 155 growing the population first overnight in LB (Lysogeny Broth, Difco) with 5% sucrose at 37°C 156 and then on LB agar with 5% sucrose to isolate single colonies at 37°C. The presence of 157 sucrose in both these steps selects against cells with pKOV since the expression of sacB from 158 pKOV is lethal. proL B34A was introduced on the genome via the Datsenko-Wanner method 159 (Datsenko and Wanner 2000) . Briefly, kanamycin cassette was first inserted 60 bp 160 downstream of proL. The proL::Kan region was amplified through a mutagenesis PCR that 161 introduced B34A via the forward primer. The proL B34A::Kan amplicon was electroporated 162 into the WT carrying plasmid pKD46 where the red recombinase switched the WT proL with 163 proL B34A::Kan. The Kan cassette was cured using PCP20 resulting in the proL B34A strain. 164 We screened the mutant strains for the intended mutation, and to rule out any other mutations, 165 via PCR followed by Sanger sequencing and further confirmed the m via Next Generation 166 Sequencing (Illumina HiSeq PE150, 4-5 million reads per strain, >100x depth). We stored all 167 the strains thus generated as glycerol cryostocks at –80°C. 168 169 Measuring growth parameters 170 We revived strains from cryostocks by streaking onto LB (Lysogeny Broth, Difco) agar plates, 171 with appropriate antibiotics where applicable. We inoculated individual colonies in LB and 172 incubated at 37°C (180 rpm shaking) for 14-16 hours to grow the preculture. We set up growth 173 measurement experiments by sub -culturing 1% v/v of precultures in 48 well microplates 174 (Corning) in 500 µL growth medium: LB or M9 minimal medium (M9 salts, 1mM CaCl2, 2.5 175 mM MgSO4) supplemented with specific carbon and nitrogen sources (“GA”: 0.4% w/v 176 glucose and cas amino acids; or carbon sources alone: glucose (Glu) 0.2% w/v, galactose 177 (Gal) 0.2%, pyruvate (Pyr) 50 mM, or glycerol (Gly) 0.6% w/v) (Raval et al. 2023) and with 178 appropriate antibiotics where applicable. To test for fitness costs that may accumulate over 179 longer periods of time, we passaged the strains with a B34A substitution (six independent 180 lines per strain) for eight days (about 80 generations) by diluting 1% v/v every 24h in 500 µL 181 LB in 48 well microplates incubated at 37°C (180 rpm shaking). We stored cryostocks at fourth 182 and eighth day, revived and measured their fitness as per above. 183 We estimated various growth proxies by measuring optical density at 600 nm (OD600) in a 184 Tecan microplate reader, an automated system (LiconiX incubator, robotic arm and Tecan 185 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint microplate reader) for strains where tRNAANN were introduced from the plasmid , or a 186 microplate reader from Agilent Biotek (Epoch2) for strains with a B34A substitution. We 187 included the WT in every microplate as a reference, to account for differences across plate 188 readers and over time. We inferred growth rate by fitting an exponential equation to 189 approximately the first one third of each growth curve (region where growth was exponential) 190 using Curvefitter software (Delaney et al. 2013). We inferred the lag phase length as the time 191 taken to reach early log phase , defined as approximately 1/3rd of the final OD600 in a given 192 growth medium (OD600 0.2 for LB and M9 GA; 0.15 for M9 Glu, Gal and Gly; and 0.12 for M9 193 Pyr). We normalized fitness of WT carrying tRNAANN, with that of WT carrying the respective 194 tRNABNN control for each case of tRNAANN expression from a plasmid. 195 Finally, we approximated the overall fitness effect of each 2D codon tRNAANN expressed from 196 high copy plasmid by qualitatively scoring its impact on the three growth parameters across 197 six media, assigning +1, 0, and –1 to positive, neutral, and negative effects, respectively. We 198 defined the sum of these scores, across media and across parameters, as the ‘fitness effect 199 index’. We estimated the mistranslation likelihood for each tRNAANN as the ratio of non-cognate 200 to cognate codon usage across the E. coli genome. 201 202 Estimating adenine to inosine modification and mature tRNA levels 203 We measured the relative abundance of each tRNA isotype , for strains with B34A 204 substitutions. Prior work suggests that adenine to inosine modification can be reliably 205 estimated using the frequency of cytosine in the complementary strand, i.e., unmodified A34 206 appears as T34 in reverse-complemented cDNA, whereas I34 appears as C34 (Schmitt et al. 207 2024). We therefore utilized YAMAT -Seq (Shigematsu et al. 2017; Ayan et al. 2020) as 208 optimized previously for E. coli (Raval et al. 2023). Briefly, we grew three biological replicates 209 of each strain in LB till mid-log phase (ca. OD 0.3) and isolated total RNA. After deacylation, 210 we ligated Y-shaped, DNA/RNA hybrid adapters (Shigematsu et al. 2017) (Eurofins) to the 5'-211 NCCA-3' and 3'-inorganic phosphate-5' ends of uncharged tRNAs, reverse transcribed , and 212 further amplified the cDNA products by PCR with sample -specific indices (Illumina). After a 213 quality and quantity check through bioanalyzer (Agilent DNA7500 Series II kit and Agilent 2100 214 Bioanalyzer Systems), we combined equimolar amounts of each sample and further purified 215 the fully assembled cDNA (with adapters and indices ligated at both ends of the tRNA 216 sequence) by separating them from unligated adapters based on size on a 5% poly-acrylamide 217 gel. The final products were sequenced using an Illumina NextSeq 550 High-Output 2x75 bp 218 kit (Single-end, 150 bp reads) at the Max Planck Institute for Evolutionary Biology. YAMAT-219 seq for proL-ANN (with WT control) was performed with size-based separation by magnetic 220 beads (Ampure XP beads) followed by verification of the size of purified products (200 -223 221 bp) using TapeStation DNA D1000 HS Screen Tape and sequencing on the Novaseq platform 222 (paired end, 150 bp) at the National Centre for Biological Sciences. 223 We sorted the combined raw reads into reads derived from individual strains (available from 224 NCBI GEO; accession number GSE328815) based on the unique Illumina barcodes. We 225 downloaded all native tRNA sequences predicted by GtRNAdb 2.0 (Chan and Lowe 2009) for 226 WT and manually added additional tRNAANN sequences by introducing B34A substitution in 227 the sequence of tRNABNN isoacceptors mutated in our study (e.g. to get the gly -tRNAACC 228 sequence, we replaced the C34 in gly-tRNAGCC sequence with A34). We mapped bases 80 -229 151 (expected to tRNA sequences) of the raw reads from each strain against the collection of 230 native and ANN tRNA sequences using Geneious Prime (version 11.1.4). We used the 231 following previously mapping criteria : 10 % mismatch/gap rate, max 2 -bp gap size, max. 5 232 ambiguities; no iterations, discard reads that align equally well to multiple reference sequences 233 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint (Khomarbaghi et al. 2024). On average, at least around 95% of the reads aligned with tRNAs, 234 the number of reads aligned to each tRNA isoacceptor were exported by Geneious and used 235 for calculating within-sample proportions of each tRNA isoacceptor. 236 237 Analyses of tRNA sequences from GtRNA DB and TadA homologues 238 We downloaded all predicted tRNA sequences from all bacteria and archaea available on the 239 Genomic tRNA database, GtRNAdb (Chan and Lowe 2016; Thornlow et al. 2020) including 240 the taxonomy of each species . For 14 genera where at least one tRNA ANN (other than Arg-241 tRNAACG) was present in all of its species (a total of 273 species across genera) we attempted 242 to retrieve protein coding sequences (CDS) as nucleotide sequences from NCBI Genome 243 Datasets (v18.24.0) , using the taxonomic IDs provided on GtRNAdb . We could retrieve 244 genomes of 263 species using this approach for which we retrieved genome AT content from 245 the metadata of the genome assemblies, and calculated codon usage from nucleotide CDS 246 (eight Lactobacilli and two Streptococci species, the genomes could not be fetched readily 247 using the taxonomy IDs). Lastly, out of 555 bacterial genomes from GtRNAdb with at least 248 one tRNAANN (other than Arg-tRNAACG), we could retrieve amino acid CDS for 539 genomes 249 which were used to searched for TadA homologues using the E. coli tadA amino acid 250 sequence as a query (diamond BLAST v2.1.24.178 (Buchfink et al. 2014) ; e-value30%, query coverage >50%). 252 253 Data analysis 254 We used in -house python scripts (except for YAMAT -Seq) for all statistical analysis and 255 generating plots. We used Mann–Whitney U tests to compare the fitness of strains expressing 256 tRNAANN with relevant control strains (WT or strain expressing tRNABNN). To quantify skew in 257 fitness effects relative to zero (neutral), we computed a zero -anchored quantile asymmetry 258 metric (Q-asym0) defined as Q(+) 0.9 - |Q(-)0.1| / Q(+) 0.9 + |Q(-)0.1|, where Q(+) 0.9 is the 90th 259 percentile of positive values of log2(magnitude of fitness) and Q(-)0.1 the 10th percentile of 260 negative values. This metric compares the extent of the positive and negative tails relative to 261 zero, yielding a normalized measure of tail imbalance scaled between −1 ( deleterious skew) 262 and +1 (beneficial skew), with 0 indicating symmetry. While similar in essence to Bowley’s 263 coefficient (a measure based on quantiles and median ), Q-asym0 is zero -anchored. 264 Significance was assessed by bootstrap (10,000 resamples), with percentile -based 95% 265 confidence intervals; skew was considered significant when the interval excluded zero. All raw 266 data used for analysis are provided in the source data files for each figure, along with the 267 statistics; in -house scripts to conduct the analyses and plots are available on Zenodo 268 (https://zenodo.org/records/20180916). 269 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint

Results

270 Generation of an E. coli strain library encompassing all theoretically possible A-271 starting anticodons 272 In theory, sixteen anticodons can start with an adenosine, but fifteen of these are absent from 273 bacterial tRNA repertoires, including that of E. coli MG1655 (the wild type, WT). The WT tRNA 274 repertoire consists of 37 genes for tRNAs with G-, U- or C-starting anticodons (‘tRNABNN’) that 275 decode family or four-fold degenerate codon boxes (‘4D’ codon boxes, where all four codons 276 within a codon box are assigned to the same amino acid) and 45 genes for tRNAs that decode 277 split or two-fold degenerate codon boxes (‘2D’ codon boxes, where the four codons within a 278 codon box are split across two different amino acids). Thus, a total of 82 native tRNABNN genes 279 (Fig. S1) could generate a tRNA with an A-starting anticodon (‘tRNA ANN’) through a single 280 substitution leading to adenosine at the 34th base (‘B34A substitution’) (Fig. 1A, Fig. S1). 281 The functional and fitness effects of these tRNAANN should be determined by a combination of 282 tRNA backbone and the extent of degeneracy of the codons decoded by tRNA ANN through 283 canonical, wobble and superwobble base -paring. Under canonical or wobble pairing, the 284 backbone alone determines amino acid charging, such that a G34 to (‘G34A’) will always 285 retain the original amino acid identity and should hence be tolerated due to degeneracy in 286 both 4D codon and 2D codon boxes. On the other hand, a C34A or U34A substitution will lead 287 to mistranslation in a 2D codon box due to a lack of four-fold degeneracy (Fig. 1A). Hence, 288 the probability of mistranslation is higher in tRNA ANN generated from a tRNA CNN or tRNAUNN 289 backbone of the adjacent 2D codon box, and could potentially explain selection against C34A 290 or U34A substitution s. However, an additional complication is revealed by prior work 291 suggesting that tRNA ANN can superwobble, i.e., read all four codons in a box , in bacteria, 292 mitochondria and eukaryotic cytosol (Sibler et al. 1986; Andachi et al. 1987; Borén et al. 1993; 293 Inagaki et al. 1995; Watanabe et al. 1997; Von Nickisch -Rosenegk et al. 2001; Chen et al. 294 2002; Aldinger et al. 2012; Yokobori et al. 2013; Soma et al. 2023; Kompatscher et al. 2024; 295 Schmitt et al. 2024) . Under this scenario, even within a 2D codon box, a G34 A substitution 296 could result in mistranslation because the tRNA will still be charged with the cognate amino 297 acid but it can decode the non-cognate codons via superwobble (Fig. 1B). We hypothesized 298 that such mistranslation through superwobbling may impair fitness, generating selection 299 against 2D codon tRNAANN derived from a tRNAGNN isoacceptor. In contrast, since a 4D codon 300 tRNAANN remains an isoacceptor regardless of which isoacceptor tRNABNN backbone it arose 301 from (Fig. 1C), 4D tRNAANN should have weaker fitness effects. 302 303 Figure 1: Expected range of decoding by potential tRNAANN that could arise via single 304 mutations in native tRNABNN. (A) All theoretically possible anticodons (written from 5’-to-3’) 305 organized across 4D and 2D codon boxes in the codon table. ANNs that are universally rare 306 and modified in different domains (summarized from a previous study (Ehrlich et al. 2021)) are 307 color coded as per the key on the bottom. The native BNN tRNAs mutated to ANN in this study 308 (within their respective codon boxes) are indicated in bold; ANNs expressed from a plasmid 309 are underlined, and those expressed from the genome indicated by asterisk s. (B–C) 310 Schematics illustrate decoding in examples of 2D and 4D codon boxes under Watson-Crick 311 (WC; indicated by black lines ), wobble (indicated by dotted black lines) and superwobble 312 (indicated by dotted orange lines) in the wild type (WT) and after B34A substitutions in distinct 313 tRNABNN. In each case, mRNA codons are written from 5’ -to-3’ at the bottom , and 314 complementary anticodons written from 3’-to-5’ on top, with flanking lines and attached single 315 letter codes indicating the tRNA backbone and amino acid after charging, respectively. The 316 34th base is shown in bold and anticodon mutations leading to A34 are marked in red . 317 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint Predicted mistranslation is indicated by red arrows . (B) A codon box split between 318 phenylalanine and leucine, with two codons each decoded by Phe-tRNA and Leu-tRNA in the 319 WT. A G34A substitution would generate Phe-tRNAAAG charged with phenylalanine , which 320 under WC pairing reads the phenylalanine codon UUU but can decode the leucine codons 321 UUA and UUG through superwobbling, resulting in mistranslation. Similarly, U34A or C34A 322 substitutions would result in tRNAAAG charged with leucine, resulting in leucine-to-323 phenylalanine mistranslation. Superwobbling resulting in mistranslation is indicated by red 324 arrows and is expected to be harmful. (C) The leucine 4D codon box, where all four codons 325 are decoded by tRNAGAG, tRNAUAG and tRNACAG isoacceptors. An B34A substitution in any of 326 these isoacceptors should generate Leu-tRNAAAG charged with leucine. Under canonical base 327 pairing, Leu-tRNAAAG decodes the CUU anticodon and under superwobbling it decodes CUC, 328 CUA and CUG. In either case, the lysine charged Leu -tRNAAAG decodes lysine codons, and 329 does not result in mistranslation. Hence, in tRNABNN from 4D codon boxes, B34A mutations 330 are unlikely to mistranslate. Superwobbling by tRNAANN within 4D codon boxes is indicated 331 by green arrows and is expected to be tolerated or beneficial. Such superwobbling is indicated 332 only for a tRNAANN resulting from G34A substitution, but it is equally applicable for any B34A 333 substitution within a 4D codon box. 334 335 336 337 To test these hypotheses, we constructed tRNAANN genes by introducing B34A substitutions 338 in 20 diverse WT tRNABNN genes (Fig. 1A, Fig. S1). From each of the eight 2D codon boxes, 339 we derived one tRNAANN from the native tRNAGNN (mimicking possible transition mutations) to 340 address fitness effects potentially stemming from mistranslation due to superwobbling. Of the 341 eight 4D codon boxes, one is decoded by tRNAACG whereas the remaining seven are decoded 342 by different combinations of tRNABNN isoacceptors with different backbones (Fig. S2). We 343 introduced B34A substitutions in a subset of these tRNABNN isoacceptor genes. We generated 344 one tRNA ANN each from proline, valine, alanine and glycine 4D codon boxes. For serine, 345 leucine and threonine 4D codon boxes, we generated tRNAANN from a subset of isoacceptors 346 differing from each other in their backbone sequences by more than 30%, resulting in two 347 tRNAANN from leucine codon boxes, and three from the threonine and serine codon box. 348 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint This set of 20 tRNAANN genes thus covered all fifteen ANN anticodons absent across bacteria, 349 with serine, leucine and threonine ANN generated from more than one backbone. We 350 integrated five of these tRNA ANN into the WT genome by replacing the corresponding native 351 tRNABNN with its tRNA ANN variant (Fig. 1A), generating strains with tRNA ANN expressed from 352 the genome (individually referred with the gene name followed by ‘ANN’, e.g. pheV-ANN, and 353 collectively referred to as ‘ANN strains’). We also expressed seven tRNAANN through a high-354 copy plasmid in E. coli strains lacking their respective native tRNA BNN, generated previously 355 (Raval et al. 2023) (individually referred with Δ followed by gene name , e.g. ΔpheV, and 356 collectively referred as knockouts, ‘KOs’). In the WT, we additionally expressed five of these 357 tRNAANN from a low-copy plasmid and finally, 19 of the tRNAANN from a high-copy plasmid. The 358 strains thus generated (Table S1) provided a comprehensive system to probe specific aspects 359 of tRNAANN functionality and fitness effects. 360 361 tRNAANN are translationally active 362 A parsimonious mechanism by which tRNAANN can affect cellular fitness is through translation 363 rate and/or accuracy , although evidence for their translational activity remains indirect and 364 limited (Borén et al. 1993; Biddle et al. 2016; Schmitt et al. 2018; Schmitt et al. 2024) . A key 365 prerequisite for translational activity is accurate post-transcriptional folding and maturation of 366 the tRNA transcripts. To quantify the relative abundance of mature tRNAANN within the total 367 tRNA pool, we utilized YAMAT -seq ( Y-shaped Adapter-ligated MAture TRNA sequencing) 368 (Shigematsu et al. 2017; Ayan et al. 2020; Raval et al. 2023) . In this method, two ends of Y-369 shaped adapters are ligated to the conserved 3′-CCA and to 5’ -OH, respectively, in the 370 acceptor arms of fully folded and matured tRNAs . Adapter-ligated tRNAs are then reverse -371 transcribed, PCR -amplified using adapter -specific primers, and the resulting libraries are 372 sequenced and analyzed using a standard small-RNA sequencing work-flow. We quantified 373 the relative abundance of each tRNA species in all five strains where we replaced native 374 tRNABNN genes by tRNAANN variants. For four of these ANN strains, w e also included strains 375 lacking the corresponding native tRNABNN (KOs) (Raval et al. 2023), as a control for potential 376 compensatory upregulation from the remaining gene copies. 377 The 2D codon Phe-tRNAGAA is encoded by two gene copies in E. coli: pheV and pheU. We 378 note that quantitative inferences about Phe-tRNAGAA remain challenging since this is amongst 379 the most difficult tRNA species to detect by YAMAT (Ayan et al. 2020; Khomarbaghi et al. 380 2024). Nonetheless, the ΔpheV strain had decreased Phe-tRNAGAA proportion relative to WT 381 whereas pheV-ANN had a higher Phe-tRNAGAA proportion than ΔpheV (Fig. 2A). This indicated 382 that upregulation from pheU may not be sufficient to restore Phe-tRNAGAA levels in ΔpheV, as 383 also suggested by the fitness defect observed in ΔpheV previously (Raval et al. 2023) . We 384 thus inferred that at least a fraction of Phe-tRNAGAA in pheV-ANN is due to Phe-tRNAAAA 385 undergoing A34-to-inosine 34 modification, which appears in reverse complemented cDNA as 386 C34 and therefore reads as G34 in the gene sequence (Delannoy et al. 2009; Torres et al. 387 2015; Saint-Léger et al. 2016) . In contrast, all four 4D codon tRNAANN that we tested were 388 expressed and detected unmodified among mature tRNAs (Fig. 2A). For instance, the ΔserX 389 strain showed Ser-tRNAGGA levels (encoded by serX and serW) comparable to WT, suggesting 390 potential upregulation of serW. However, serX -ANN showed a significant reduction in Ser -391 tRNAGGA (from the unaltered serW) and significant proportion of Ser-tRNAAGA (from serX-392 AGA). Proportions of Ser-tRNAGGA and Ser-tRNAAGA, were equal, and together matched Ser-393 tRNAGGA levels in the WT (Fig. 2A). Similarly, the Pro -tRNAAGG expressed on the genome 394 (carrying a G34A mutation in the single-copy tRNA gene proL) was also detected unmodified 395 in the cytosolic pool at levels comparable to the native Pro-tRNAGGG in the WT. Likewise, Thr-396 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint tRNAGGU (encoded by thrT and thrV) was lower than WT in both ΔthrT and thrT-ANN strains. 397 However, thrT-ANN expressed both Thr-tRNAGGU and Thr-tRNAAGU, whose combined 398 proportions matched those of Thr-tRNAGGU in the WT. The Gly-tRNACCC (single copy, encoded 399 by glyU) was absent in the tRNA pool of the ΔglyU, whereas glyU-ANN showed Gly-tRNAACC 400 proportions similar to Gly-tRNACCC in the WT. The observation that 4D tRNAANN proportions 401 were similar to WT (for Pro -tRNAAGG and Gly -tRNAACC) or added up to WT together with 402 tRNABNN (for Ser -tRNAAGA and Thr -tRNAAGU) suggested that tRNAANN were tolerated 403 unmodified in cells, without any inhibition of expression from G34A alleles or post-404 transcriptional degradation. Furthermore, in strains carrying tRNAANN, relative levels of 405 isoacceptors or other tRNA species were unaltered (Fig. S 3), suggesting that tRNAANN 406 expression is tolerated without major changes in the overall cytosolic tRNA pools. Taken 407 together, YAMAT-seq confirmed that both 4D and 2D codon tRNAANN are expressed, correctly 408 folded, and fully mature akin to native tRNABNN. The 4D codon tRNAANN remained unmodified, 409 whereas the 2D codon tRNAANN was modified to tRNAINN. 410 We reasoned that if mature tRNAANN are also translationally active, the fitness of strains 411 carrying the G34A substitution should be similar to WT , and tRNA ANN should rescue any 412 deleterious effect of losing the respective tRNABNN. Indeed, whereas ΔpheV and ΔthrT showed 413 lower growth rate than WT, the growth rates of pheV-ANN and thrT-ANN were comparable to 414 WT (Fig. 2B). Strains ΔpheV and ΔthrT also showed a longer lag phase in LB, which was 415 shortened in strains carrying the respective ANNs (Fig. S 4A). The f inal OD was largely 416 indistinguishable (± 5% changes) from WT for most KO and ANN strains, and was rescued in 417 thrT-ANN (Fig. S4B). Better growth parameters of ANN strains as compared to the respective 418 KO strains suggested that tRNAANN functionally replaced the native tRNABNN. Growth rates of 419 serX-ANN, pro-ANN, and glyU -ANN were indistinguishable from WT, suggesting that these 420 tRNAANN were also well tolerated. tRNAANN strains passaged in LB for eight days (about 80 421 generations) also did not show any fitness costs, suggesting a lack of long-term fitness effects 422 of B34A substitutions on the genome (Fig. 2C). These observations further suggested that 423 tRNAANN can successfully replace tRNABNN on the genome without significant fitness costs. 424 To further validate that tRNAANN can compensate for the loss of the respective tRNABNN, we 425 introduced plasmid-borne tRNAANN and tRNABNN into four of the 2D codon and three of the 4D 426 codon tRNA KOs. Across all five growth media and seven tRNAANN tested (35 combinations), 427 tRNAANN improved overall growth of the respective KOs (Fig. S5A). Across all 35 combinations 428 tested, they shortened the lag phase of the respective KOs (statistically significant for 25 429 combinations, Fig. S5B). All strains showed exponential growth in two nutrient-rich media (LB 430 and M9GA0.4) , where complementation by tRNAANN increased growth rate for all 14 431 combinations tested (Fig. 2D) and increased final OD for 10 combinations (Fig. S 5C). 432 Moreover, complementation by tRNAANN and respective tRNABNN increased the fitness of KOs 433 to a similar extent, further corroborating the functionality of tRNAANN. 434 Thus, all five tRNAANN introduced on the genome were folded, matured, and could functionally 435 replace their respective tRNABNN; and seven tRNAANN expressed from a plasmid could also 436 functionally replace their native tRNABNN genes. A priori, a G34A substitution is unlikely to 437 render a tRNA translationally inactive as described earlier, and prior studies also suggest that 438 such tRNAANN have translational activity (Borén et al. 1993; Biddle et al. 2016; Schmitt et al. 439 2018; Schmitt et al. 2024) . Taken together, we concluded that tRNAANN are generally 440 translationally active. 441 442 Figure 2: tRNAANN undergo normal maturation and can compensate for the loss of native 443 tRNAs. (A) For a subset of tRNA genes (pheV, serX, thrT, glyU), we quantified the relative 444 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint proportion of each tRNA species within the mature tRNA pool, for strains lacking the tRNABNN 445 gene (KO, in orange), strains carrying the G34A substitution in the same tRNABNN gene (tRNA-446 ANN, in green), and the wild type (WT, in grey). For proL, we quantified the relative proportion 447 of each tRNA species for proL-ANN and WT. Bar graphs show the relative abundance of 448 tRNABNN and tRNA ANN of interest (n=3, mean ± standard deviation ). ‘Total’ indicates the 449 combined abundance of tRNA BNN and tRNA ANN in ANN strains . Statistical significance is 450 indicated only for relevant comparisons: blue brackets indicate p 0.05. See the source data file for Fig. 2A for all statistical comparisons and 452 Fig. S3 for proportions of all tRNA species. (B) Growth rates relative to WT, for tRNA KO and 453 strains with tRNAANN expression from the genome (as measured in Fig. 2A ). KO strains that 454 are significantly different from WT and ANN strains significantly different from KO are indicated 455 by thick borders. (C) Growth rates of the five strains with tRNAANN expression from the genome 456 in LB after transferring 1% v/v every 24 hours for 8 days. Each cell shows the mean growth 457 rate of six such populations (relative to WT ancestor) for each strain at the fourth and eighth 458 day. None of these growth rates were significantly different from WT (paired t test s). (D) 459 Growth rates relative to WT+pACDH (empty vector control) , for KO+pACDH, KO+pACDH -460 tRNABNN and KO+pACDH-tRNAANN for a subset of genes (asnU, asnV, aspV, tyrV, proL, thrW, 461 glyU). The KO+pACDH strains that were significantly different from WT+pACDH control, and 462 the complementation strains (KO+pACDH -tRNABNN and KO+pACDH-tRNAANN) significantly 463 different from the KO+pACDH , are indicated by thick borders. Asterisks for KO+pACDH-464 tRNAANN cells indicate a significant differen ce from KO+pACDH -tRNABNN (p<0.05, Mann-465 Whitney U test). 466 467 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint 468 469 Overexpression of 2D tRNAANN is often neutral or deleterious 470 The observation that expression of translationally active tRNAANN did not impair fitness of WT 471 (Fig. 2 B-C) was puzzling since their persistent rarity suggest ed otherwise, at least for 2D 472 codon tRNAANN that should be prone to mistranslation (Fig. 1). However, we noticed that all 473 4D codon tRNAANN tested were tolerated unmodified whereas the 2D codon tRNAANN was 474 masked by A34 -to-I34 modification which can potentially restrict supperwobbling (Curran 475 1995; Gerber and Keller 1999; Wolf et al. 2002; Yokobori et al. 2013; Schmitt et al. 2024) and 476 mitigate potential toxicity. Furthermore, while tRNAANN expression from a single copy on the 477 genome closely mimics potential B34A substitutions in natural bacterial populations, the 478 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint tRNAANN thus expressed may also get outcompeted by native tRNABNN and may not contribute 479 substantially to translation . To reconcile a lack of clear fitness effect s (even after ca. 80 480 generations) and the rarity of tRNAANN in nature, we reasoned that the fitness effects of 481 tRNAANN may be attenuated when they are expressed from a single copy but compounded 482 over evolutionary time scale s. If so, tRNAANN overexpressed from a multi-copy plasmid may 483 accentuate fitness effects and allow them to be detected within the timeframe of our 484 experiments. Similarly, temperature stress may also exaggerate weak mistranslation-induced 485 fitness consequences (Rydbn and Isaksson 1984; Thorbjarnardottir et al. 1991; Dahlgren and 486 Rydén-Aulin 2000; Lyu et al. 2023; Romero Romero et al. 2024). 487 To test these hypotheses, we first overexpressed a subset of tRNAANN from a low-copy-number 488 IPTG-inducible plasmid under permissive growth conditions (37°C, LB medium). We observed 489 that both the 2D codon Asn-tRNAAUU and Asp-tRNAAUC reduced relative growth rate (R rel) 490 across all four IPTG concentrations, and Asn-tRNAAUU also lowered the final OD600 (Fig. 3A, 491 Fig. S6A-C). However, the three 4D codon tRNAANN were largely neutral both with respect to 492 growth rate (Fig. 3A, Fig. S6B) and final OD600 (Fig. 3B, Fig. S6C). We next assessed fitness 493 under temperature stress (42° C and 30°C), where we also included Asn-tRNAAUU under its 494 native promoter to allow for native gene regulation. At 42°C Asp-tRNAATC reduced growth rate 495 (Fig. 3C, Fig. S6D), whereas Leu-tRNAAAG lowered the final OD600 (Fig. 3D, Fig. S6E). At 30°C, 496 however, none of the tRNAANN showed any fitness impacts. The 2D codons Asp-tRNAAUC and 497 Asn-tRNAAUU (expressed from their native promoter) lowered the final OD 600 (Fig. 3C-D, Fig. 498 S6D-E). Thus, temperature stress and overexpression indeed revealed some negative fitness 499 impacts, and further suggested that 2D codon box tRNAANN may be more likely to impair fitness 500 than 4D codon box tRNAANN. 501 502 Figure: 3 Overexpression of tRNAANN from a low-copy number plasmid. Heatmaps show 503 growth rates (A) and final OD (B) of strains carrying tRNAANN relative to that of their respective 504 tRNABNN. Throughout the study, we compared fitness effects of overexpression of tRNA ANN 505 with that of respective tRNA BNN to account for fitness effect stemming purely from 506 overexpression. To calculate relative fitness parameters, we divided growth rate or OD of the 507 strain expressing tRNAANN by that of the strain expressing tRNA BNN. Hence values of these 508 parameters above 1 indicate a benefit (shown in green) and below 1 indicate a cost (shown in 509 red) of tRNAANN expression. tRNA genes were expressed on the plasmid pACDH and strains 510 were grown in LB medium at 37°C across a gradient of inducer concentration (IPTG). Cases 511 where tRNAANN were significantly different from tRNA BNN (Mann Whitney test, p<0.05; N=4) 512 with an effect size higher than 5% are further indicated by red or green thick borders. 513 Statistically significant differences of effect sizes smaller than 5% (potentially within the range 514 of noise resulting from detection limits and fitting of the growth equation) are indicated by thick 515 grey borders. Relative growth rates (C) and relative final OD (D) of the same strains under 516 30°C and 42°C, in LB with 0.5 mM IPTG. 517 518 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint 519 520 To test for potential differences between the fitness effects of 2D and 4D codon tRNAANN more 521 systematically, w e next investigated the fitness effects of tRNA carrying all theoretically 522 possible ANN anticodons (Fig. S1) introduced into the WT background from a high copy 523 plasmid, which we expected to further accentuate fitness consequences. In half the tRNAANN-524 media combinations tested, there was no significant fitness effect (Fig. 4A -C, Fig. S7A, Fig. 525 S8). Three out of the eight tested 2D codon tRNAANN were deleterious: P he-tRNAAAA, Cys-526 tRNAACA and His-tRNAAUG significantly prolonged the lag phase and reduced growth rate (Fig. 527 4A-B, Fig. S8A-B) in at least two of six media. Phe -tRNAAAA and Cys-tRNAACA also reduced 528 final OD in four of the six media (Fig. 4C, Fig. S8C). Tyr-tRNAAUA, Asn-tRNAAUU and Asp-529 tRNAAUC prolonged the lag phase and reduced growth rate in at least one medium (Fig. 4A-B, 530 Fig. S8 A). Note tha t overexpression of Phe -tRNAAAA from a high -copy plasmid impaired 531 fitness, in contrast to expression from a single genome -encoded copy (Fig. 2B-C). This is 532 expected because expression from a single copy should result in tRNA levels low enough for 533 the modification system (Fig. 2A) to mitigate fitness effects, whereas a substantial fraction of 534 overexpressed Phe-tRNAAAA is likely to have remained unmodified , reducing fitness. Finally, 535 Ser-tRNAACU improved all three parameters across at least two media (Fig. 4A-C, Fig. S8) and 536 Ile-tRNAAAU increased growth rate (Fig. 4A , Fig. S8B) and OD (Fig. 4C , Fig. S8C) in at least 537 two media. The magnitude of fitness effects (Fig. 4D -F) was skewed, significantly increasing 538 the lag phase length and also lowering final OD; whereas effects on growth rate were less 539 asymmetric and small (i.e., equally likely to be beneficial or deleterious). Overall, these results 540 suggested that the deleterious effects of 2D tRNAANN were larger and more consistent than 541 beneficial effects, and on average, 2D tRNAANN are likely to be neutral or deleterious. 542 543 Figure 4: Overexpression of 2D codon box tRNA ANN from a high-copy plasmid. (A–C) 544 Heat maps show growth parameters of WT strains carrying tRNAANN, relative to those carrying 545 the corresponding native tRNABNN gene on the high copy plasmid pUC19 induced with 0.5mM 546 IPTG at 37°C . Positive effects on growth (tRNA ANN/ tRNABNN > 1) are shown in green and 547 negative impacts (tRNAANN/ tRNABNN < 1) in red. Significant differences between tRNAANN and 548 the corresponding tRNA BNN (Mann-Whitney test, p 5% are 549 highlighted by a thick border. Statistically significant differences with effect size < 5% 550 (potentially reflecting noise from detection limits and fitting of the growth equation) are 551 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint indicated by thick grey borders. (A) Relative length of lag phase, (B) relative growth rate and 552 (C) relative final OD are shown across different media, with values of each parameter given 553 in each cell . The first two rows include nutrient -rich media and the next four rows include 554 nutrient-poor media. In some cases, data for one or more replicates from either the tRNA ANN 555 strain or the respective tRNA BNN variant did not fit the exponential growth equation whereas 556 the other variant grew; e.g., Cys-tRNAANN in Gly 0.6 medium did not grow, unlike Cys-tRNABNN 557 (raw OD600 vs. time curves are shown in Fig. S7). Such cases yielded infinite or infinitesimal 558 values for relative growth rate, so the corresponding cells in the heatmaps are empty; but the 559 fill colours and outlines indicate the qualitative direction of relative growth (green if only 560 tRNAANN grew and red if only tRNABNN grew). Cases where neither tRNAANN nor tRNABNN grew 561 sufficiently well to fit an exponential growth equation or to calculate lag phase (e.g. , AsnT-562 tRNAs in Gly 0.6) are indicated by cells with a cross. The absolute values of growth parameters 563 are shown in Fig. S8 and summarised in the source data file for Fig. S8. (D-F) Distribution of 564 the mean relative fitness effect shown in the heatmaps in Fig. 4A-C, after log2 transformation 565 (bin width= 0.1). Smoothed lines (using bin width 0.4) indicate the underlying probability 566 distribution estimated using the kernel density. Q-asym0 estimates magnitude of fitness effects 567 skew towards beneficial (+1) to harmful ( -1), values with asterisks indicate statistically 568 significant skew; red/green denote harmful/beneficial skew , grey denote no significant skew 569 (see methods for calculation of Q-asym0 and statistical analysis). 570 571 572 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint Overexpression of 4D tRNAANN is often neutral or beneficial 573 We next investigated fitness effects of overexpressing 4D codon tRNAANN, which should be 574 better tolerated due to complete degeneracy in their target codon boxes. Indeed, six of the 575 eleven 4D codon tRNAANN significantly improved lag phase length in at least two media, and 576 four were neutral (Fig. 5A, Fig. S7B, Fig. S8A), in contrast to the results for 2D tRNAANN (Fig. 577 4A, Fig. S7A, Fig. S8A). Exponential growth rate was also improved by five 4D tRNAANN and 578 remained unaffected by overexpression of four other 4D codon tRNAANN (Fig. 5B, Fig. S8B). 579 Three of the 4D codon tRNAANN improved final OD and the remaining eight appeared to be 580 neutral (Fig. 5C, Fig. S8C). Val-tRNAAAC reduced growth rate in three media (Fig. 5B, Fig. S8B) 581 and Gly-tRNAACC (the only universally absent 4D codon tRNAANN) impaired lag phase as well 582 as growth rate (Fig. 5A-B, Fig. S8A-B). Overall, apart from the universally absent Gly-tRNAACC, 583 4D codon tRNAANN either did not affect growth or improved it. Similar to 2D tRNAANN, in half of 584 the media -tRNAANN combinations tested, there were no significant fitness effects. In the 585 remaining half of the cases tested , 4D codon tRNA ANN also showed skewed fitness 586 magnitudes, albeit opposite to that of 2D tRNAANN. Overexpression of 4D tRNAANN skewed 587 each of the three growth parameters significantly towards more beneficial effects (Fig. 5D-F). 588 Thus, we concluded that on average, 4D tRNA ANN are likely to be neutral or substantially 589 beneficial. 590 591 Figure 5: Overexpression of 4D codon box tRNAANN from a high copy plasmid . (A–C) 592 Heat maps show growth parameters of WT strains carrying tRNAANN, relative to those carrying 593 the corresponding native tRNABNN gene on the high copy plasmid pUC19 induced with 0.5mM 594 IPTG at 37°C. Positive effects on growth (tRNA ANN/ tRNABNN > 1) are shown in green and 595 negative impacts (tRNAANN/ tRNABNN < 1) in red. Significant differences between tRNAANN and 596 the corresponding tRNA BNN (Mann-Whitney test, p 5% are 597 highlighted by a thick border. Statistically significant differences with effect size < 5% 598 (potentially reflecting noise from detection limits and fitting of the growth equation) are 599 indicated by thick grey borders. (A) Relative length of lag phase, (B) relative growth rate and 600 (C) relative final OD are shown across different media, with values of each parameter given 601 in each cell. The first two rows include nutrient -rich media and the next four rows include 602 nutrient-poor media. In some cases, data for one or more replicates from either the tRNA ANN 603 strain or the respective tRNA BNN variant did not fit the exponential growth equation whereas 604 the other variant grew; e.g., Ser-tRNAGGA in Gal 0.2 medium did not grow unlike Ser-tRNAAGA 605 (raw OD600 vs. time curves are shown in Fig. S7). Such cases yielded infinite or infinitesimal 606 values for relative growth rate, so the corresponding cells in the heatmaps are empty; but the 607 fill colours and outlines indicate the qualitative direction of relative growth (green if only 608 tRNAANN grew and red if only tRNA BNN grew). The absolute values of growth parameters are 609 shown in Fig. S8 and summarised in the source data file for Fig. S8. (D-F) Distribution of the 610 mean relative fitness effect shown in the heatmaps in Fig. 5A-C, after log2 transformation (bin 611 width= 0.1). Smoothed lines (using bin width 0.4) indicate the underlying probability distribution 612 estimated using the kernel density. Q-asym0 estimates magnitude of fitness effects skew 613 towards beneficial (+1) to harmful ( -1), values with asterisks indicate statistically significant 614 skew; red/green denote deleterious/beneficial skew, grey denote s no significant skew (see 615

Methods

for calculation of Q-asym0 and statistical analysis). 616 617 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint 618 619 On the whole, a majority of both 2D and 4D tRNAANN appeared to be neutral (Fig. 4-5, Fig. 620 S9A-B). However, when tRNAANN did affect fitness, 4D codon tRNAANN were consistently more 621 likely to improve it than to impair it (Fig. 5D-F, Fig. S9 A-B). 2D codon tRNAANN, in contrast, 622 appeared more likely to impair early growth (lag phase) and less likely to improve later stages 623 (exponential growth and final OD) , compared to 4D codon tRNAANN (Fig. 4D-F, Fig. S9A-B). 624 Recall that the tendency of 2D codon tRNAANN to impair fitness is expected from 625 superwobbling, which can increase mistranslation in 2D codon boxes (Fig. 1). Although direct 626 measurement of mistranslation by 2D codon tRNAANN will require more detailed studies, we 627 indirectly tested for this effect by estimating the genome-wide mistranslation likelihood for each 628 2D codon tRNA ANN (Fig. S9 D), expecting that higher mistranslation likelihood should 629 correspond to lower fitness. 2D codon Phe-tRNAAAA, Cys-tRNAACA, Asn-tRNAATT with high 630 mistranslation likelihood impaired fitness whereas S er-tRNAACU and Ile-tRNAAAU with low 631 mistranslation likelihood improved it. This yielded a weak negative correlation between the 632 overall fitness effect of each tRNAANN and its mistranslation likelihood (Fig. S9E), though we 633 acknowledge that this analysis is constrained because only eight tRNAANN carrying 2D 634 anticodon are theoretically possible. This is also further confounded by potential outliers such 635 as His-tRNAAUG which did not impair fitness despite a high mistranslation likelihood . One 636 explanation could be A34-to-I34 modification, which restricts superwobbling. While prior work 637 shows modification of His-tRNAAUG in an orthogonal tRNA backbone (Biddle et al. 2016; 638 Schmitt et al. 2024), it remains to be directly investigated what fraction of His-tRNAAUG or any 639 other tRNAANN introduced here from a plasmid is modified. Nevertheless, this analysis at least 640 qualitatively suggested that mistranslation resulting from superwobbling may contribute to the 641 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint negative fitness effects of 2D codon tRNAANN. Superwobbling by 4D tRNAANN on the other 642 hand, may be tolerated or even exploited due to the degeneracy of the genetic code. 643 644 B34A substitutions are generally rare, but better tolerated in 4D codon box tRNAs 645 The beneficial and neutral fitness effects of 4D codon tRNAANN were unexpected due to their 646 near-universal absence, with the exception of two 4D codon tRNAANN in Leuconostocaceae 647 (Leuconostoc and Enococcus sp.) reported in previous analyses of a relatively small number 648 of bacterial genomes(Diwan and Agashe 2018; Ehrlich et al. 2021). We re-evaluated the rarity 649 of tRNAANN using the entire prokaryotic tRNA repertoire known to date, which includes 246,393 650 predicted tRNA sequences across 4047 bacteria and 10,517 tRNAs from 220 archaea in the 651 GtRNA database (Chan and Lowe 2016; Thornlow et al. 2020) . Across 10,517 predicted 652 archaeal tRNAs, only three cases of tRNAANN were found, all from 4D codon boxes (Fig. 6A). 653 Of 8383 bacterial tRNAANN genes, 7,688 encoded Arg-tRNAACG, supporting the rarity of other 654 ANN anticodons across bacteria (Fig. 6A). However, the 695 non-ACG tRNAANN sequences 655 did include all other theoretically possible ANN. 81% of these were 4D codon tRNAANN, with 656 Leu-tRNAAAG and Thr -tRNAAGT being most frequent (Fig. 6B). Of the remaining (19%) 2D 657 tRNAANN sequences, half were Phe -tRNAAAA and His -tRNAAUG, which are potentially 658 compatible with the A-to-I modification system. 659 Firmicutes encoded the highest number of tRNAANN (Fig. 6B, Fig. S10), and within this phylum, 660 all Lactobacillus, Lactococcus and Streptococcus species encoded Leu-tRNAAAG (Fig. 6C). 661 These three genera lacked Leu-tRNAGAG, suggesting a G34A substitution dating back at least 662 until their common ancestor (Fig. 6C). Thr-tRNAAGT also showed a similar pattern, albeit for 663 fewer genera. The presence of an ANN anticodon in lieu of a GNN is similar to Arg -tRNAs 664 where ACG is preferred and GCG is absent, and suggests that these genera not only encode, 665 but potentially prefer these specific 4D tRNAANN. Interestingly, these two ANNs only replaced 666 GNNs, but not other BNN isoacceptors, suggesting that potential superwobbling by 667 unmodified ANNs may be suboptimal even in 4D codon boxes (e.g., due to slower translation 668 rate). In contrast, 2D-codon tRNAANN were present sporadically across species from different 669 genera (Fig. S10) and always in the presence of tRNAGNN (Fig. S11). We also found some 670 tRNAANN genes in Proteobacteria (genera Salmonella, Escherichia) and Tenericutes 671 (subphylum Mollicutes), although their occurrence was scattered within the subphyla or 672 genera, suggesting lineage specific acquisition or loss (Fig. 6B, S10). 673 674 Figure 6: Occurrence of tRNA ANN across prokaryotes. (A) Fractions of tRNA ANN and 675 tRNABNN within all bacterial and archaeal tRNAs reported in the GtRNA DB . The number of 676 tRNA genes i s indicated in parentheses. (B) Number of bacterial genomes carrying each 677 tRNAANN gene, grouped by phylum. 4D and 2D tRNA ANN are labelled in green and red, 678 respectively. For tRNAANN present in fewer than 50 species, refer to the right hand Y axis. (C) 679 Bacterial genera (colored by phylum, see panel B) where at least one tRNAANN is found in all 680 species (number of species is indicated in parentheses). The coloured portion of each cell 681 represents the fraction of species where the tRNAANN is present. 4D and 2D tRNA ANN are 682 labelled in green and red, respectively. For comparison, presence of the respective tRNAGNN 683 is also shown (labelled in black). Instances where a tRNANN potentially replaced a tRNAGNN 684 (indicated by a concomitant lack of the respective tRNAGNN) are highlighted by black squares. 685 (D) For the genera shown in panel C, average genome AT content (1 indicating 100% AT) , 686 fraction of U-ending codons out of all codons used in the genome , and fraction of U -ending 687 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint codons within the codon box where the tRNAANN is present (averaged across all species within 688 a genus). 689 690 691 692 Next, we analysed genomic features of species that do have tRNAANN. The 15 genera with at 693 least one tRNAANN found in all species ( i.e., candidates for acquisition by their respective 694 ancestors) all had high genomic AT content ranging from 60-70% (Fig. 6D) and 30-40% of all 695 codons in these genomes were U-ending. In genera such as Streptococcus and Lactococcus, 696 U-ending codons were also preferred over their synonyms in the same codon box where 697 tRNAANN was pres ent. However, in other genera, this preference was only moderate , 698 suggesting that factors other than codon usage contribute to the retention of tRNAANN in some 699 lineages. Lastly, 88% of the species with at least one non -ACG tRNAANN encode a TadA 700 homologue (Fig. S10) which may modify these tRNAANN (e.g., Leu-tRNAGAG is modified in 701 Oenococcus (Rafels-Ybern et al. 2019) and Streptococcus (Wulff et al. 2024) ). Such large-702 scale genome-based predictions are somewhat limited due to prediction errors and a lack of 703 direct evidence for gene expression and functionality. Nonetheless, this analysis of 704 exceptional non -ACG tRNAANN suggests that 4D codon tRNAANN (Leu-tRNAGAG and Thr-705 tRNAAGT) are retained across evolutionary timescales and potentially even preferred in some 706 genera; and that overall, other 4D codon tRNAANN are generally better tolerated than 707 unmodified 2D codon tRNAANN, as shown by our experimental results. 708 709 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint

Discussion

710 “It seems likely that inosine will be formed enzymically from an adenine in the nascent sRNA. 711 This may mean that A in this position will be rare or absent, depending upon the exact 712 specificity of the enzyme(s) involved.” (Crick 1966) 713 Anticipation of constraints on anticodon space dates back to the 1960s when only a handful 714 of tRNAs were sequenced (Ingram 1963; Holley R.W. 1965; Crick 1966) . These sequences 715 already showed adenosine 34 to inosine 34 (A-to-I) modification and Crick argued that I34 at 716 the first or wobble base of the anticodon can allow pairing with codons ending in U, C or A and 717 drive amino acid misincorporation (mistranslation) during decoding of two -fold degenerate 718 (2D) codon boxes. He speculated in a footnote (see above) that A34 may therefore be rare 719 among tRNAs decoding 2D codon boxes (Crick 1966). In essence, selection for translational 720 fidelity can constrain the anticodon space. While his was an argument against wobbling by 721 I34, subsequent research showed that unmodified A34 itself can decode all four anticodons 722 in a codon box (supperwobble) in bacteria, mitochondria and eukaryotic cytosol (Sibler et al. 723 1986; Andachi et al. 1987; Borén et al. 1993; Inagaki et al. 1995; Watanabe et al. 1997; Von 724 Nickisch-Rosenegk et al. 2001; Chen et al. 2002; Aldinger et al. 2012; Yokobori et al. 2013; 725 Soma et al. 2023; Kompatscher et al. 2024; Schmitt et al. 2024) .Therefore, unmodified A34 726 may be more harmful than I34 in 2D codon boxes, though both are expected to be suboptimal 727 and rare in 2D codon boxes. 728 Genome sequencing over the last three decades revealed that A34 in tRNA genes decoding 729 two-fold degenerate codon boxes (‘2D codon tRNAANN’) are indeed extremely rare, and that 730 when A34 is observed in tRNAs decoding four -fold degenerate codon boxes (‘4D codon 731 tRNAANN’) it typically co-occurs with A-to-I modifying enzymes (MEs) (Chan and Lowe 2016; 732 Diwan and Agashe 2018; Ehrlich et al. 2021) . The near-universal absence of unmodified 2D 733 and 4D codon tRNAANN implies strong purifying selection, arguably acting since early cellular 734 evolution (Fig. 7A). Although numerous sources of purifying selection can be speculated (Fig. 735 7B), a key first step towards explaining the absence of tRNA ANN is to directly determine the 736 functionality and fitness effects of expressing such tRNAs. Our results show that tRNAANN in 737 their native backbones are tolerated, folded, matured and translationally active in E. coli. A 738 majority of the 4D codon tRNAANN were also tolerated in an unmodified state and showed 739 neutral or positive fitness effects when overexpressed. However, 2D codon box tRNAANN were 740 more likely to be deleterious when overexpressed and their magnitude of fitness effect 741 appeared to scale with a coarsely estimated likelihood of mistranslation. 742 Orthogonal tRNAANN, introduced from a medium -copy number plasmid and designed to 743 incorporate tyrosine against each sense codon , reduced growth rate by 10-50% (Schmitt et 744 al. 2018). Similar global mistranslation, albeit less severe in magnitude, likely contributed to 745 the fitness cost s of 2D codon tRNA ANN. Furthermore, all tested 4D codon tRNAANN were 746 tolerated unmodified, whereas the only tested 2D codon Phe -tRNAAAA was modified to Phe -747 tRNAIAA. The INN anticodon can decode only three codons and thus it is likely to mis -748 incorporate amino acids at fewer codons than unmodified ANN. A prior study tested all 749 tRNAANN from a non-native backbone and found that only a 2D codon His-tRNAAUG underwent 750 such modification (Biddle et al. 2016; Schmitt et al. 2024) . The enzyme responsible for 751 modification of these “novel” bacterial tRNAANN substrates for I34 modification remains 752 unknown, though a parsimonious explanation is TadA which canonically modifies Arg -753 tRNAACG and was recently shown to modify numerous mRNAs in E. coli (Arad et al. 2026) . 754 Our study thus suggests potential substrate flexibility of this ancient and essential enzyme 755 (Wolf et al. 2002; Delannoy et al. 2009; Diwan and Agashe 2018; Torres et al. 2021) that may 756 mitigate the deleterious effects of other tRNA ANN and confer additional advantages, though 757 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint this remains to be confirmed. Intriguingly, the two tRNAANN that are compatible with 758 modification also constituted 50% of the rarely found 2D codon tRNAANN across bacteria (Fig 759 6B), suggesting that A-to-I modification might mitigate mistranslation and fitness impairment 760 by such tRNAs. Taken together, selection for translational fidelity hence emerges as one 761 plausible explanation for the absence of 2D codon tRNAANN. 762 763 Figure 7: A deep mystery of scarce tRNAANN: (A) The near-universal absence of unmodified 764 tRNAANN suggests purifying selection on tRNAANN genes and the evolution of mechanisms to 765 mitigate deleterious effects of tRNAANN that are retained and expressed. (B) Potential sources 766 of negative selection (in the red box) could involve translation and include error-prone or 767 inefficient decoding by unmodified tRNAs (A-U pairing is also the least efficient (Pernod et al. 768 2021)) ribosomal stalling or disassembly. However, they may also include non-translational 769 sources of selection such as toxic effects of fragments resulting from tRNA degradation 770 (suggested for some tRNABNN in eukaryotes (Magee and Rigoutsos 2020; Polacek and Ivanov 771 2020)), interreference with non -translational processes (suggested for some tRNA BNN in 772 eukaryotes (Seligmann 2010; Katz et al. 2016; Balasubramaniam et al. 2017; Hamdani et al. 773 2019; Su et al. 2020; Ehrlich et al. 2021)), or more speculatively, DNA structure-level effects. 774 Potential mechanisms for removing or managing tRNAANN genes (in the green box) include 775 substitutions in the anticodon loop (e.g., A34B) or elsewhere in the gene , causing 776 pseudogenization, loss or silencing of the gene. Known mitigation mechanisms (in the yellow 777 box) are likely ancient and include changes in the tRNA backbone (e.g., pairing between bases 778 32 and 38, which reduces miscoding) and post-transcriptional modifications. Substrate tRNAs 779 and respective modifying enzymes (MEs) always co -occur, and beyond mitigating the 780 translational effects of unmodified anticodons (e.g. , of tRNAANN and tRNAGNN), modification 781 may also confer additional advantages that drive selection favouring substrate tRNAs and 782 counter-selection against poor substrates. MEs (e.g., TadA) introduced into the archaeal host 783 by endosymbiosis likely allowed eukaryotic tRNA repertoires to expand (e.g., tRNAANN). 784 Overall, while translational fidelity appears to be a key factor explaining why tRNAANN are 785 absent, alternative sources of negative selection remains to be investigated. 786 787 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint 788 789 In contrast to 2D codon tRNAANN, superwobbling by a tRNA ANN within a 4D codon box is not 790 expected to cause mistranslation. Indeed, due to four-fold degeneracy, all 4D tRNAANN (except 791 Gly- tRNAACC) were neutral or substantially beneficial. 4D tRNAANN were also more likely to be 792 retained or even preferred as compared to 2D codon tRNAANN among the exceptional cases 793 of A34 occurrences across bacteria. Nevertheless, seven of eight 4D tRNAANN are missing 794 from 99.99% of predicted bacterial tRNA repertoires. Both bacteria and eukaryotes decode 795 the arginine 4D codon box with Arg -tRNAACG; whereas for the other seven 4D codon boxes, 796 bacteria prefer tRNAUNN and eukaryotes use tRNAANN (Novoa et al. 2012). Each of these 4D 797 codon tRNAs is modified via A34-to-I34 or U34-to-cmo5U34 (Novoa et al. 2012; Diwan and 798 Agashe 2018) and tRNAGNN alternatives are absent. Codons recognized by these modified 799 tRNAs are enriched in highly expressed genes, and these tRNAs (Novoa et al. 2012) and A-800 to-I modification appear to be essential (Wolf et al. 2002; Delannoy et al. 2009; Torres et al. 801 2021), further underscoring a crucial role of tRNA modification in decoding 4D codon boxes. 802 Indeed, it has been proposed that coevolution between MEs and tRNA repertoires shaped the 803 ancient preference for modifiable 4D codon tRNAUNN in bacteria and tRNAANN in eukaryotes 804 (Novoa et al. 2012) . The sources of selection against unmodified 4D codon tRNA ANN and 805 tRNAGNN remain incompletely understood, although unmodified 4D codon tRNAGNN (absent 806 across eukaryotes and present with G34 -to-queuosine modification in most bacteria (Diwan 807 and Agashe 2018) ) makes the second and third bases error -prone, causing mistranslation 808 outside the cognate codon boxes , and driving cellular toxicity (Pernod et al. 2021) . This 809 inherent propensity to error and toxicity is mitigated by a canonical base pairing between the 810 32nd and 38 th bases of the tRNA , which respectively mark the beginning and end of the 811 anticodon loop (Pernod et al. 2021). This additional base pairing changes the conformation of 812 the anticodon loop and lowers miscoding by tRNAGNN. This underscores the idea that specific 813 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint anticodons — potentially also ANN — in specific backbones are inherently error-prone, unless 814 mitigated by additional features of the backbone or anticodon modifications . Thus, selection 815 for translational fidelity may also contribute to the absence of 4D tRNAANN (Fig. 7B). 816 Our observation that both 2D and 4D tRNAANN showed lower growth rates at 42°C than at 817 30°C hints at such mistranslation, exacerbated by high temperature. While mistranslation is 818 generally thought to be deleterious (O’connor et al. 1992; Beebe et al. 2003; Bacher et al. 819 2004; Kohanski et al. 2008; Berg et al. 2019; Kelly et al. 2019), it can also be beneficial under 820 specific conditions, and especially under stress (Fan et al. 2015; Samhita et al. 2020; Samhita 821 et al. 2021; Samhita 2022) . The extent of mistranslation also varies across tRNAANN due to 822 differences in stability across superwobble base -pairs (e.g., A-A is least stable). 823 Consequences of amino acid misincorporation on protein structure and fitness also depends 824 on the physico-chemical differences between the exchanged amino acids ; thus, mis-825 incorporation by some tRNAANN (e.g., replacing Lys with Asn) may be less detrimental than 826 others (e.g., Arg to Ser) (Sengupta et al. 2007). Fitness effects of mistranslation are therefore 827 likely to vary across specific tRNAANN and environment s, and our suggestion that 828 mistranslation contributes to the absence of tRNAANN remains to be directly tested. A first step 829 would be to measure the extent of mistranslation in cells expressing tRNAANN. Strains with 830 high-fidelity ribosomes (Ruusala et al. 1984; Chumpolkulwong et al. 2004; Agarwal et al. 2015) 831 and downregulation of tadA could also be used to test mistranslation -driven fitness defects, 832 which should be alleviated and exacerbated in the respective strain backgrounds. Likewise, 833 antibiotics reducing ribosomal fidelity should also amplify such fitness defects. Lastly, we note 834 that despite our normalization of fitness effects of overexpressed tRNA ANN with that of 835 overexpressed tRNABNN, we cannot rule out that overexpression of tRNAANN caused extremely 836 high mistranslation or resulted in other effects qualitatively different from those observed in 837 nature. Thus, such overexpression may not have amplified natural fitness effects as we had 838 intended. We hope that future studies using more sensitive fitness assays and long term 839 evolution may provide further insights. 840 Overall, while mistranslation is an attractive hypothesis, given these causes of substantial 841 variation in its effects across specific tRNAANN and environments, it is unlikely to be the sole 842 explanation for the rarity of all tRNAANN across prokaryotes. Therefore, we speculate important 843 contributions from other mechanisms, likely as universal as core features of the translation 844 machinery. For instance, correct pairing at the A site in bacterial ribosomes causes 845 A1492/A1493 to flip out from 16S rRNA into the minor groove of the first two codon-anticodon 846 base pair and promotes EF-Tu GTP hydrolysis and amino acylated tRNA accommodation 847 (Schmeing and Ramakrishnan 2009) . It is possible that an adenosine at the first anticodon 848 position causes steric hindrance (due to the presence of two additional As in the vicinity) and 849 impairs this essential step inside the decoding center, which is conserved across the tree of 850 life (Rodnina and Wintermeyer 2009; Schmeing and Ramakrishnan 2009; Dever et al. 2018; 851 Tirumalai et al. 2021) . However, an adenosine is tolerated on the mRNA (e.g. , in A-ending 852 codons), suggesting that any steric hindrance is likely asymmetric and specifically unable to 853 accommodate tRNA with A34. While this argument remains to be tested via structural or 854 molecular dynamics analyses, it serves as a useful example for the nature of potential 855 fundamental explanations for the ancient and persistent rarity of tRNAANN. 856 A34 may also be rare because tRNA genes at large are highly conserved despite several 857 times higher mutations rates than the rest of the genome (Thornlow et al. 2018). For instance, 858 out of 62,941 mutations observed across 50,000 generations of adaptive evolution in twelve 859 E. coli evolution populations, only 50 were in tRNA genes and none in the anticodons (Table 860 S2). Analysis of ca. 700 mutations from E. coli mutation accumulation lines (Sane et al. 2025) 861 also captured only one insertion at the first base of the leuW tRNA gene, suggesting that even 862 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint under a regime largely dominated by drift, mutations inside tRNA genes are rare. A saturation 863 mutagenesis study of the yeast Arg-tRNACCU also found most substitutions to be detrimental, 864 with those in the anticodon loop reducing fitness by more than 25% (Li et al. 2016) . Hence, 865 overall strong purifying selection suggests that any disadvantage of B34A substitutions (e.g., 866 low levels of mistranslation) may be penalized especially severely and contribute to the 867 scarcity of A34 over evolutionary timescales. Apart from substitutions, the presence or 868 absence of entire tRNA genes can also be under selection, e.g., due to ecological factors such 869 as nutrient availability (Raval et al. 2023). Even in the current study, tRNAANN were more likely 870 to affect fitness in nutrient poor media (Fig. 4,5, Fig. S9A-C); whereas in nutrient-rich media, 871 most tRNAANN, particularly 4D codon tRNA ANN, were neutral. It is therefore expected that if 872 some tRNAANN genes enhance translation capacity , they may be retained due to drift or 873 positive selection following nutrient upshifts . Fitness assays during nutrient fluctuations, 874 including competition with WT, might reveal further fitness effects of unmodified tRNAANN. 875 In closing, t he near-complete absence of unmodified tRNAANN remains intriguing, more so 876 given the largely neutral and sometimes beneficial effects that we observed. At a population 877 size of a billion and assuming a 10% fitness advantage (i.e., selection coefficient s=0.1), a 878 mutation creating a tRNAANN from a tRNABNN has about 17% chance of reaching fixation in an 879 average of 410 generations, as inferred from 1000 discrete -time Wright–Fisher transitions 880 (Fig. S12). Even a tRNAANN with a 10% disadvantage can persist for tens of generations, and 881 a tRNAANN with a disadvantage smaller than 1% can persist for over a thousand generations. 882 Nonetheless, tRNAANN are remarkably rare. Although cases with negative fitness effects of 2D 883 tRNAANN suggest that selection for translational fidelity likely contributes to their absence, the 884 neutral and beneficial effects observed for 4D tRNAANN suggests that other fundamental 885 constraints on these molecules, operating in nature, remain hidden. 886 887 DATA AVAILABILITY 888 Supplementary figures are available with this submission. Raw reads for YAMAT -seq are 889 available on from NCBI GEO; accession number GSE328815 . Additional data are available 890 on Zenodo (10.5281/zenodo.20180916) which includes, but is not limited to, the following 891 supplementary data and in-house scripts: 892 Source data for the main and supplementary figures 893 Table S1 (Primers and strains from this study) 894 Table S2 (tRNA gene mutations in long term evolution experiment) 895 Scripts, Source files for the scripts, Output files and plots. 896 897

Acknowledgements

898 We thank Saurabh Mahajan, Laasya Samhita, S upratim Sengupta and members of the 899 Agashe lab for discussion and critical comments on the manuscript; the NCBS NGS facility for 900 help with genome and YAMAT sequencing; Gaurav Diwan and Joshua Miranda for setting up 901 and maintaining our automated growth measurement system; Gunda Dechow-Seligmann and 902 Sven Künzel for helping with data collection for YAMAT-seq; George Stoletov, Raagini Biswas, 903 Adrita Chakraborty and Pratibha Sanjenbam for helping with the experiments; and the NCBS 904 laboratory kitchen staff for their crucial support. PKR acknowledges the use of ChatGPT 905 (OpenAI) for assistance with writing Python scripts. We acknowledge funding and support 906 from the National Centre for Biological Sciences (NCBS-TIFR) and the Department of Atomic 907 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint Energy, Government of India (Project Identification No. RTI 4006) to DA, a CSIR-UGC-NET 908 June/2018/430 fellowship to PKR, the Max Planck Society (JG and SL), and the International 909 Max Planck Research School for Evolutionary Biology (SL). 910 911 AUTHOR CONTRIBUTIONS 912 PKR: Conceptualization, Experimental design, Methodology, Investigation, Data curation, 913 Validation, Formal analysis, Visualization, Writing - original draft, review and editing. SL: 914 Investigation, Data curation, Formal analysis JG: Experimental design, Funding acquisition, 915 Resources, Methodology, Writing - review and editing. DA: Conceptualization, Project 916 administration, Supervision, Experimental design, Funding acquisition, Resources, Writing - 917 review and editing. 918 919

References

920 Agarwal D, Kamath D, Gregory ST, O’Connor M. 2015. Modulation of decoding fidelity by 921 ribosomal proteins S4 and S5. J. Bacteriol. 197:1017–1025. 922 Agris PF, Eruysal ER, Narendran A, Väre VYP, Vangaveti S, Ranganathan S V. 2018. 923 Celebrating wobble decoding: Half a century and still much is new. RNA Biol. 15:537–924 553. 925 Agris PF, Narendran A, Sarachan K, Väre VYP, Eruysal E. 2017. The Importance of Being 926 Modified: The Role of RNA Modifications in Translational Fidelity. Enzymes (Essen). 927 41:1–50. 928 Agris PF, Vendeix FAP, Graham WD. 2007. tRNA’s Wobble Decoding of the Genome: 40 929 Years of Modification. J. Mol. Biol. 366:1–13. 930 Aldinger CA, Leisinger AK, Gaston KW, Limbach PA, Igloi GL. 2012. The absence of A-to-I 931 editing in the anticodon of plant cytoplasmic tRNAArgACG demands a relaxation of the 932 wobble decoding rules. RNA Biol. 9:1239–1246. 933 Andachi Y, Yamao F, Iwami M, Muto A, Osawa S. 1987. Occurrence of unmodified adenine 934 and uracil at the first position of anticodon in threonine tRNAs in Mycoplasma 935 capricolum (codon recognition/unmodified anticodon/biased mutation pressure). Proc. 936 Nati. Acad. Sci. USA 84:7398–7402. 937 Arad D, Fargeon O, Levin L, Svenningsen S Lo, Aspit L, Bar-Yaacov D. 2026. Landscape 938 and dynamics of TadA-dependent RNA editing in Escherichia coli reveal a role in 939 nutrient-rich growth.Hinton D, editor. mBio. 940 Ayan GB, Park HJ, Gallie J. 2020. The birth of a bacterial tRNA gene by large-scale, tandem 941 duplication events. Elife 9. 942 Bacher JM, Rie De Cré Cy-Lagard V, Schimmel PR. 2004. Inhibited cell growth and protein 943 functional changes from an editing-defective tRNA synthetase. Proc Natl Acad Sci U S 944 A . 102:1697–1701. 945 Balasubramaniam M, Reis RJS, Ayyadevara S, Wang X, Ganne A, Khaidakov M. 2017. 946 Involvement of tRNAs in replication of human mitochondrial DNA and modifying effects 947 of telomerase. Mech. Ageing Dev. 166:55–63. 948 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint Bedhomme S, Amorós-Moya D, Valero LM, Bonifaci N, Pujana MÀ, Bravo IG, Gonzalez J. 949 2019. Evolutionary Changes after Translational Challenges Imposed by Horizontal 950 Gene Transfer. Genome Biol. Evol. 11:814–831. 951 Beebe K, Ribas de Pouplana L, Schimmel P. 2003. Elucidation of tRNA-dependent editing 952 by a class II tRNA synthetase and signi®cance for cell viability. EMBO J. 22:668–675. 953 Berg MD, Zhu Y, Genereaux J, Ruiz BY, Rodriguez-Mias RA, Allan T, Bahcheli A, Villén J, 954 Brandl CJ. 2019. Modulating mistranslation potential of tRNASer in saccharomyces 955 cerevisiae. Genetics 213:849–863. 956 Berg OG, Kurland CG. 1997. Growth rate-optimised tRNA abundance and codon usage. J. 957 Mol. Biol. 270:544–550. 958 Biddle W, Schmitt MA, Fisk JD. 2016. Modification of orthogonal tRNAs: Unexpected 959 consequences for sense codon reassignment. Nucleic Acids Res. 44:10042–10050. 960 Bloom-Ackermann Z, Navon S, Gingold H, Towers R, Pilpel Y, Dahan O. 2014. A 961 comprehensive tRNA deletion library unravels the genetic architecture of the tRNA 962 pool. PLoS Genet. 10:e1004084. 963 Borén T, Elias P, Samuelsson T, Claesson C, Barciszewska M, Gehrke CW, Kuo KC, Lustig 964 F. 1993. Undiscriminating codon reading with adenosine in the wobble position. J. Mol. 965 Biol. 230:739–749. 966 Buchfink B, Xie C, Huson DH. 2014. Fast and sensitive protein alignment using DIAMOND. 967 Nat. Methods 12:59–60. 968 Chan PP, Lowe TM. 2009. GtRNAdb: a database of transfer RNA genes detected in 969 genomic sequence. Nucleic Acids Res. 37:D93-7. 970 Chan PP, Lowe TM. 2016. GtRNAdb 2.0: An expanded database of transfer RNA genes 971 identified in complete and draft genomes. Nucleic Acids Res. 44:D184–D189. 972 Chen P, Qian Q, Zhang S, Isaksson LA, Björk GR. 2002. A cytosolic tRNA with an 973 unmodified adenosine in the wobble position reads a codon ending with the non-974 complementary nucleoside cytidine. J. Mol. Biol. 317:481–492. 975 Chumpolkulwong N, Hori-Takemoto C, Hosaka T, Inaoka T, Kigawa T, Shirouzu M, Ochi K, 976 Yokoyama S. 2004. Effects of Escherichia coli ribosomal protein S12 mutations on cell-977 free protein synthesis. Eur. J. Biochem. 271:1127–1134. 978 Crick FHC. 1966. Codon—anticodon pairing: The wobble hypothesis. J. Mol. Biol. 19:548–979 555. 980 Curran JF. 1995. Decoding with the A:l wobble pair is inefficient. Nucleic Acids Res. 23. 981 Dahlgren A, Rydén-Aulin M. 2000. A novel mutation in ribosomal protein S4 that affects the 982 function of a mutated RF1. 82:683–691. 983 Datsenko KA, Wanner BL. 2000. One-step inactivation of chromosomal genes in Escherichia 984 coli K-12 using PCR products. Proc. Natl. Acad. Sci. U. S. A. 97:6640–6645. 985 Delaney NF, Kaczmarek ME, Ward LM, Swanson PK, Lee M-C, Marx CJ. 2013. 986 Development of an Optimized Medium, Strain and High-Throughput Culturing Methods 987 for Methylobacterium extorquens. PLoS One 8:1–10. 988 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint Delannoy E, Ret M Le, Faivre-Nitschke E, Estavillo GM, Bergdoll M, Taylor NL, Pogson BJ, 989 Small I, Lmbault P, Gualberto JM. 2009. Arabidopsis tRNA adenosine deaminase 990 arginine edits the wobble nucleotide of chloroplast tRNAArg(ACG) and is essential for 991 efficient chloroplast translation. Plant Cell 21:2058–2071. 992 Dever TE, Dinman JD, Green R. 2018. Translation elongation and recoding in eukaryotes. 993 Cold Spring Harb. Perspect. Biol. 10. 994 Dittmar KA, Goodenbour JM, Pan T. 2006. Tissue-specific differences in human transfer 995 RNA expression. PLoS Genet. 2:2107–2115. 996 Dittmar KA, Mobley EM, Radek AJ, Pan T. 2004. Exploring the regulation of tRNA 997 distribution on the genomic scale. J. Mol. Biol. 337:31–47. 998 Dittmar KA, Sørensen MA, Elf J, Ehrenberg M, Pan T. 2005. Selective charging of tRNA 999 isoacceptors induced by amino-acid starvation. EMBO Rep. 6:151–157. 1000 Diwan GD, Agashe D. 2018. Wobbling forth and drifting back: The evolutionary history and 1001 impact of bacterial tRNA modifications. Mol. Biol. Evol. 35:2046–2059. 1002 Dong H, Nilsson L, Kurland CG. 1996. Co-variation of tRNA abundance and codon usage in 1003 Escherichia coli at different growth rates. J. Mol. Biol. 260:649–663. 1004 Ehrlich R, Davyt M, López I, Chalar C, Marín M. 2021. On the Track of the Missing tRNA 1005 Genes: A Source of Non-Canonical Functions? Front. Mol. Biosci. 8:1–15. 1006 Elf J, Nilsson D, Tenson T, Ehrenberg M. 2003. Selective charging of tRNA isoacceptors 1007 explains patterns of codon usage. Science 300:1718–1722. 1008 Fan Y, Wu J, Ung MH, De Lay N, Cheng C, Ling J. 2015. Protein mistranslation protects 1009 bacteria against oxidative stress. Nucleic Acids Res. 43:1740–1748. 1010 Fujishima K, Kanai A. 2014. tRNA gene diversity in the three domains of life. Front. Genet. 1011 5:1–11. 1012 Gabzi T, Pilpel Y, Friedlander T. 2022. Fitness Landscape Analysis of a tRNA Gene Reveals 1013 that the Wild Type Allele is Sub-optimal, Yet Mutationally Robust. Mol. Biol. Evol. 39. 1014 Gerber AP, Keller W. 1999. An Adenosine Deaminase That Generates Inosine at the 1015 Wobble Position of tRNAs. Science (1979). 286:1146–1149. 1016 Goodenbour JM, Pan T. 2006. Diversity of tRNA genes in eukaryotes. Nucleic Acids Res. 1017 34:6137–6146. 1018 Grosjean H, de Crécy-Lagard V, Marck C. 2010. Deciphering synonymous codons in the 1019 three domains of life: co-evolution with specific tRNA modification enzymes. FEBS 1020 Lett. 584:252–264. 1021 Hamdani O, Dhillon N, Hsieh T-HS, Fujita T, Ocampo J, Kirkland JG, Lawrimore J, 1022 Kobayashi TJ, Friedman B, Fulton D, et al. 2019. tRNA Genes Affect Chromosome 1023 Structure and Function via Local Effects. Mol. Cell. Biol. 39. 1024 Hatfull GF. 2015. Dark Matter of the Biosphere: the Amazing World of Bacteriophage 1025 Diversity. J. Virol. 89:8107–8110. 1026 Hoa SN, Hunt HD, Horton RM, Pullen JK, Peasea LR. 1989. Site-directed mutagenesis by 1027 overlap extension using the polymerase chain reaction. Gene 77:51–59. 1028 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint Holley R.W. AJ, EGA, MJT, MM, MSH, PJR, ZA. 1965. Structure of a ribonucleic acid. 1029 Science (1979). 147:1462–1465. 1030 Hu XP, Lercher MJ. 2021. An optimal growth law for RNA composition and its partial 1031 implementation through ribosomal and tRNA gene locations in bacterial genomes. 1032 PLoS Genet. 17. 1033 Iben JR, Epstein JA, Bayfield MA, Bruinsma MW, Hasson S, Bacikova D, Ahmad D, 1034 Rockwell D, Kittler ELW, Zapp ML, et al. 2011. Comparative whole genome sequencing 1035 reveals phenotypic tRNA gene duplication in spontaneous Schizosaccharomyces 1036 pombe la mutants. Nucleic Acids Res. 39:4728–4742. 1037 Iben JR, Maraia RJ. 2012. tRNAomics: tRNA gene copy number variation and codon use 1038 provide bioinformatic evidence of a new anticodon:codon wobble pair in a eukaryote. 1039 RNA 18:1358–1372. 1040 Ikemura T. 1985. Codon usage and tRNA content in unicellular and multicellular organisms. 1041 Mol. Biol. Evol. 2:13–34. 1042 Inagaki Y, Kojima A, Bessho Y, Hori H, Ohama T, Osawa S. 1995. Translation of 1043 Synonymous Codons in Family Boxes by Mycoplasma capricolum tRNAs with 1044 Unmodified Uridine or Adenosine at the First Anticodon Position. J. Mol. Biol:486–492. 1045 Ingram VM& SJA. 1963. Studies on the structure of purified alanine and valine transfer RNA 1046 from yeast. In: synthesis and structure of macromolecules. Cold. Spr. Harb. Symp. 1047 Quant. Biol. 28:133–138. 1048 Kanaya S, Yamada Y, Kudo Y, Ikemura T. 1999. Studies of codon usage and tRNA genes of 1049 18 unicellular organisms and quantification of Bacillus subtilis tRNAs: gene expression 1050 level and species-specific diversity of codon usage based on multivariate analysis. 1051 Gene 238:143–155. 1052 Karcher D, Bock R. 2009. Identification of the chloroplast adenosine-to-inosine tRNA editing 1053 enzyme. RNA 15:1251–1257. 1054 Katz A, Elgamal S, Rajkovic A, Ibba M. 2016. Non-canonical roles of tRNAs and tRNA 1055 mimics in bacterial cell biology. Mol. Microbiol. 101:545–558. 1056 Kelly P, Backes N, Mohler K, Buser C, Kavoor A, Rinehart J, Phillips G, Ibba M. 2019. 1057 Alanyl-tRNA synthetase quality control prevents global dysregulation of the escherichia 1058 coli proteome. mBio 10. 1059 Khomarbaghi Z, Ngan WY, Ayan GB, Lim S, Dechow-Seligmann G, Nandy P, Gallie J. 2024. 1060 Large-scale duplication events underpin population-level flexibility in tRNA gene copy 1061 number in Pseudomonas fluorescens SBW25. Nucleic Acids Res. 52:2446–2462. 1062 Klumpp S, Scott M, Pedersen S, Hwa T. 2013. Molecular crowding limits translation and cell 1063 growth. Proc. Natl. Acad. Sci. U. S. A. 110:16754–16759. 1064 Kohanski MA, Dwyer DJ, Wierzbowski J, Cottarel G, Collins JJ. 2008. Mistranslation of 1065 Membrane Proteins and Two-Component System Activation Trigger Antibiotic-Mediated 1066 Cell Death. Cell 135:679–690. 1067 Kompatscher M, Bartosik K, Erharter K, Plangger R, Juen FS, Kreutz C, Micura R, Westhof 1068 E, Erlacher MD. 2024. Contribution of tRNA sequence and modifications to the 1069 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint decoding preferences of E. coli and M. mycoides tRNAGlyUCC for synonymous glycine 1070 codons. Nucleic Acids Res. 52:1374–1386. 1071 Lei L, Burton ZF. 2020. Evolution of life on earth: TRNA, aminoacyl-tRNA synthetases and 1072 the genetic code. . 10. 1073 Li C, Qian W, Maclean CJ, Zhang J. 2016. The fitness landscape of a tRNA gene. Science 1074 (1979). 352:837–840. 1075 Li C, Zhang J. 2018. Multi-environment fitness landscapes of a tRNA gene. Nat. Ecol. Evol. 1076 2:1025–1032. 1077 Li J-N, Esberg B, Curran JF, Bjo GR, Rk È. 1997. Three Modified Nucleosides Present in the 1078 Anticodon Stem and Loop Influence the in vivo aa-tRNA Selection in a tRNA-dependent 1079 Manner. J. Mol. Biol 271:209–221. 1080 Link AJ, Phillips † Dereth, Church GM. 1997. Methods for Generating Precise Deletions and 1081 Insertions in the Genome of Wild-Type Escherichia coli: Application to Open Reading 1082 Frame Characterization. J. Bacteriol. 179:6228–6237. 1083 Lyu Z, Wilson C, Ling J. 2023. Translational Fidelity during Bacterial Stresses and Host 1084 Interactions. Pathogens 12. 1085 Magee R, Rigoutsos I. 2020. On the expanding roles of tRNA fragments in modulating cell 1086 behavior. Nucleic Acids Res. 48:9433–9448. 1087 Mangroo D, RajBhandary UL. 1995. Mutants of Escherichia coli initiator tRNA defective in 1088 initiation. Effects of overproduction of methionyl-tRNA transformylase and the initiation 1089 factors IF2 and IF3. Journal of Biological Chemistry 270:12203–12209. 1090 Maraia RJ, Arimbasseri AG. 2017. Factors that shape eukaryotic tRNAomes: Processing, 1091 modification and anticodon-codon use. Biomolecules 7. 1092 McDonald MJ, Chou CH, Swamy KBS, Huang H Da, Leu JY. 2015. The evolutionary 1093 dynamics of tRNA-gene copy number and codon-use in E. coli. BMC Evol. Biol. 15:1–1094 10. 1095 Morgado S, Vicente AC. 2019. Global in-silico scenario of tRNA genes and their organization 1096 in virus genomes. Viruses 11. 1097 Nakanishi K, Fukai S, Ikeuchi Y, Soma A, Sekine Y, Suzuki T, Nureki O. 2005. Structural 1098 basis for lysidine formation by ATP pyrophosphatase accompanied by a lysine-specific 1099 loop and a tRNA-recognition domain. Proc. Natl. Acad. Sci. U. S. A. 102:7487–7492. 1100 Von Nickisch-Rosenegk M, Brown WM, Boore JL. 2001. Complete Sequence of the 1101 Mitochondrial Genome of the Tapeworm Hymenolepis diminuta: Gene Arrangements 1102 Indicate that Platyhelminths Are Eutrochozoans. Mol. Biol. Evol 18:721–730. 1103 Nilsson AI, Zorzet A, Kanth A, Dahlströ S, Berg OG, Andersson DI. 2006. Reducing the 1104 fitness cost of antibiotic resistance by amplification of initiator tRNA genes. Proc. Natl. 1105 Acad. Sci. U. S. A. 103:6976–6981. 1106 Norrander J, Kempe T, Messing J. 1983. Construction of improved Ml3 vectors using 1107 oligodeoxynucleotidedirected mutagenesis. Gene 26:935–7335. 1108 Novoa EM, Pavon-Eternod M, Pan T, Ribas De Pouplana L. 2012. A role for tRNA 1109 modifications in genome structure and codon usage. Cell 149:202–213. 1110 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint Novoa EM, Ribas de Pouplana L. 2012. Speeding with control: Codon usage, tRNAs, and 1111 ribosomes. Trends in Genetics 28:574–581. 1112 O’connor M, Ulrich Goringer H, Dahlberg AE, Planck M-. 1992. A ribosomal ambiguity 1113 mutation in the 530 loop of E.coli 16S rRNA. Nucleic Acids Res. 20:4221–4227. 1114 Pernod K, Schaeffer L, Chicher J, Hok E, Rick C, Geslain R, Eriani G, Westhof E, 1115 Ryckelynck M, Martin F. 2021. The nature of the purine at position 34 in tRNAs of 4-1116 codon boxes is correlated with nucleotides at positions 32 and 38 to maintain decoding 1117 fidelity. Nucleic Acids Res. 48:6170–6183. 1118 Phillips G, de Crécy-Lagard V. 2011. Biosynthesis and function of tRNA modifications in 1119 Archaea. Curr. Opin. Microbiol. 14:335–341. 1120 Polacek N, Ivanov P. 2020. The regulatory world of tRNA fragments beyond canonical tRNA 1121 biology. RNA Biol. 17:1057–1059. 1122 Pope WH, Bowman CA, Russell DA, Jacobs-Sera D, Asai DJ, Cresawn SG, Jacobs William 1123 R J, Hendrix RW, Lawrence JG, Hatfull GF, et al. 2015. Whole genome comparison of 1124 a large collection of mycobacteriophages reveals a continuum of phage genetic 1125 diversity.Kolter R, editor. Elife 4:e06416. 1126 Qian Q, Bjo GR, Rk È. 1997. Structural Alterations Far from the Anticodon of the tRNA Pro 1127 GGG of Salmonella typhimurium Induce +1 Frameshifting at the Peptidyl-site. J. Mol. 1128 Biol. 273:978–992. 1129 Rafels-Ybern À, Torres AG, Camacho N, Herencia-Ropero A, Frigolé HR, Wulff TF, Raboteg 1130 M, Bordons A, Grau-Bove X, Ruiz-Trillo I, et al. 2019. The expansion of inosine at the 1131 wobble position of tRNAs, and its role in the evolution of proteomes. Mol. Biol. Evol. 1132 36:650–662. 1133 Rafels-Ybern À, Torres AG, Grau-Bove X, Ruiz-Trillo I, Ribas de Pouplana L. 2018. Codon 1134 adaptation to tRNAs with Inosine modification at position 34 is widespread among 1135 Eukaryotes and present in two Bacterial phyla. RNA Biol. 15:500–507. 1136 Rak R, Dahan O, Pilpel Y. 2018. Repertoires of tRNAs: The Couplers of Genomics and 1137 Proteomics. Annu. Rev. Cell Dev. Biol. 34:239–264. 1138 Rao AR, Varchney U. 2001. Specific interaction between the ribosome recycling factor and 1139 the elongation factor G from Mycobacterium tuberculosis mediates peptidyl-tRNA 1140 release and ribosome recycling in Escherichia coli. EMBO J. 20:2977–2986. 1141 Raval PK, Ngan WY, Gallie J, Agashe D. 2023. The layered costs and benefits of 1142 translational redundancy. Elife 12:e81005. 1143 Rocha. 2004. Codon usage bias from tRNA’s point of view: Redundancy, specialization, and 1144 efficient decoding for translation optimization. Genome Res. 14:2279–2286. 1145 Rodnina M V., Wintermeyer W. 2009. Recent mechanistic insights into eukaryotic 1146 ribosomes. Curr. Opin. Cell Biol. 21:435–443. 1147 Romero Romero ML, Poehls J, Kirilenko A, Richter D, Jumel T, Shevchenko A, Toth-1148 Petroczy A. 2024. Environment modulates protein heterogeneity through transcriptional 1149 and translational stop codon readthrough. Nat. Commun. 15:4446. 1150 Roura Frigolé H, Camacho N, Coma MC, Fernández-Lozano C, García-Lema J, Rafels-1151 Ybern À, Canals A, Coll M, De Pouplana LR. 2019. tRNA deamination by ADAT 1152 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint requires substrate-specific recognition mechanisms and can be inhibited by tRFs. RNA 1153 25:607–619. 1154 Ruusala T, Andersson D, Ehrenberg M, Kurland CG. 1984. Hyper-accurate ribosomes inhibit 1155 growth. EMBO J. 3:2575–2580. 1156 Rydbn SM, Isaksson LA. 1984. A Temperature-Sensitive Mutant of Escherichia coli that 1157 Shows Enhanced Misreading of UAG[A and Increased Efficiency for Some tRNA 1158 Nonsense Suppressors. Mol Gen Genet 193:38–45. 1159 Sagi D, Rak R, Gingold H, Adir I, Maayan G, Dahan O, Broday L, Pilpel Y, Rechavi O. 2016. 1160 Tissue- and Time-Specific Expression of Otherwise Identical tRNA Genes. PLoS 1161 Genet. 12. 1162 Saint-Léger A, Bello C, Dans PD, Torres AG, Novoa EM, Camacho N, Orozco M, 1163 Kondrashov FA, De Pouplana LR. 2016. Saturation of recognition elements blocks 1164 evolution of new tRNA identities. Sci. Adv. 2:e1501860. 1165 Samhita L. 2022. Re-reading the genetic code: The evolutionary potential of frameshifting in 1166 time. J. Biosci. 47:49. 1167 Samhita L, K Raval P, Stephenson G, Thutupalli S, Agashe D. 2021. The impact of 1168 mistranslation on phenotypic variability and fitness. Evolution (N. Y). 75:1201–1217. 1169 Samhita L, Raval PK, Agashe D. 2020. Global mistranslation increases cell survival under 1170 stress in Escherichia coli. PLoS Genet. 16:1–21. 1171 Sane M, Parveen S, Agashe D. 2025. Mutation bias alters the distribution of fitness effects of 1172 mutations. PLoS Biol. 23(7):e3003282. 1173 Schmeing TM, Ramakrishnan V. 2009. What recent ribosome structures have revealed 1174 about the mechanism of translation. Nature 461:1234–1242. 1175 Schmeing TM, Voorhees RM, Kelley AC, Ramakrishnan V. 2011. How mutations in tRNA 1176 distant from the anticodon affect the fidelity of decoding. Nat. Struct. Mol. Biol. 18:432–1177 437. 1178 Schmitt MA, Biddle W, Fisk JD. 2018. Mapping the Plasticity of the Escherichia coli Genetic 1179 Code with Orthogonal Pair-Directed Sense Codon Reassignment. Biochemistry 1180 57:2762–2774. 1181 Schmitt MA, Tittle JM, Fisk JD. 2024. Codon decoding by orthogonal tRNAs interrogates the 1182 in vivo preferences of unmodified adenosine in the wobble position. Front. Genet. 15. 1183 Scott M, Gunderson CW, Mateescu EM, Zhang Z, Hwa T. 2010. Interdependence of Cell 1184 Growth and Gene Expression: Origins and Consequences. Science (1979). 330:1099–1185 1102. 1186 Scott M, Hwa T. 2011. Bacterial growth laws and their applications. Curr. Opin. Biotechnol. 1187 22:559–565. 1188 Seligmann H. 2010. Mitochondrial tRNAs as light strand replication origins: Similarity 1189 between anticodon loops and the loop of the light strand replication origin predicts 1190 initiation of DNA replication. BioSystems 99:85–93. 1191 Sengupta S, Yang X, Higgs PG. 2007. The mechanisms of codon reassignments in 1192 mitochondrial genetic codes. J. Mol. Evol. 64:662–688. 1193 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint Shigematsu M, Honda S, Loher P, Telonis AG, Rigoutsos I, Kirino Y. 2017. YAMAT-seq: An 1194 efficient method for high-throughput sequencing of mature transfer RNAs. Nucleic Acids 1195 Res. 45:e70. 1196 Sibler AP, Dirheimer G, Martin RP. 1986. Codon reading patterns in Saccharomyces 1197 cerevisiae mitochondria based on sequences of mitochondrial tRNAs. FEBS Lett. 1198 194:131–138. 1199 Soma A, Kubota A, Tomoe D, Ikeuchi Y, Kawamura F, Arimoto H, Shiwa Y, Kanesaki Y, 1200 Nanamiya H, Yoshikawa H, et al. 2023. yaaJ, the tRNA-Specific Adenosine 1201 Deaminase, Is Dispensable in Bacillus subtilis. Genes (Basel). 14:1515. 1202 Su Z, Wilson B, Kumar P, Dutta A. 2020. Noncanonical Roles of tRNAs: tRNA Fragments 1203 and Beyond. Annu. Rev. Genet. 54:47–69. 1204 Suzuki T. 2021. The expanding world of tRNA modifications and their disease relevance. 1205 Nat. Rev. Mol. Cell Biol. 22:375–392. 1206 Thorbjarnardottir S, Bjornsson A, Amundad6ttir L, Eggertsson G. 1991. Temperature 1207 Sensitivity Caused by Missense Suppressor supH and Amber Suppressor supP in 1208 Escherichia coli. J. Bacteriol. 173:412–416. 1209 Thornlow BP, Armstrong J, Holmes AD, Howard JM, Corbett-Detig RB, Lowe TM. 2020. 1210 Predicting transfer RNA gene activity from sequence and genome context. Genome 1211 Res. 30:85–94. 1212 Thornlow BP, Hough J, Roger JM, Gong H, Lowe TM, Corbett-Detig RB. 2018. Transfer 1213 RNA genes experience exceptionally elevated mutation rates. Proc. Natl. Acad. Sci. U. 1214 S. A. 115:8996–9001. 1215 Tirumalai MR, Rivas M, Tran Q, Fox GE. 2021. The Peptidyl Transferase Center: a Window 1216 to the Past. Microbiol Mol Biol Rev 85:1055–1067. 1217 Torres AG. 2019. Enjoy the Silence: Nearly Half of Human tRNA Genes Are Silent. 1218 Bioinform. Biol. Insights 13. 1219 Torres AG, Piñeyro D, Filonava L, Stracker TH, Batlle E, Ribas De Pouplana L. 2014. A-to-I 1220 editing on tRNAs: Biochemical, biological and evolutionary implications. FEBS Lett. 1221 588:4279–4286. 1222 Torres AG, Piñeyro D, Rodríguez-Escribà M, Camacho N, Reina O, Saint-Léger A, Filonava 1223 L, Batlle E, Ribas De Pouplana L. 2015. Inosine modifications in human tRNAs are 1224 incorporated at the precursor tRNA level. Nucleic Acids Res. 43:5145–5157. 1225 Torres AG, Rodríguez-Escribà M, Marcet-Houben M, Santos Vieira HG, Camacho N, Catena 1226 H, Murillo Recio M, Rafels-Ybern À, Reina O, Torres FM, et al. 2021. Human tRNAs 1227 with inosine 34 are essential to efficiently translate eukarya-specific low-complexity 1228 proteins. Nucleic Acids Res. 49:7011–7034. 1229 Vieira-Silva S, Rocha EPC. 2010. The systemic imprint of growth and its uses in ecological 1230 (meta)genomics. PLoS Genet. 6:e1000808. 1231 Watanabe Y-I, Tsurui H, Ueda T, Furusihima-Shimogawara R, Takamiya S, Kita K, 1232 Nishikawa K, Watanabe K. 1997. Primary sequence of mitochondrial tRNA Arg of a 1233 nematode Ascaris suum: occurrence of unmodified adenosine at the first position of the 1234 anticodon. Biochim. Biophys. Acta 1350:119–122. 1235 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint Wilusz JE. 2015. Controlling translation via modulation of tRNA levels. Wiley Interdiscip. 1236 Rev. RNA 6:453–470. 1237 Wolf J, Gerber AP, Keller W. 2002. tadA, an essential tRNA-specic adenosine deaminase 1238 from Escherichia coli. EMBO J. 21:3841–3851. 1239 Wulff TF, Hahnke K, Lécrivain AL, Schmidt K, Ahmed-Begrich R, Finstermeier K, 1240 Charpentier E. 2024. Dynamics of diversified A-to-I editing in Streptococcus pyogenes 1241 is governed by changes in mRNA stability. Nucleic Acids Res. 52:11234–11253. 1242 Yokobori SI, Kitamura A, Grosjean H, Bessho Y. 2013. Life without tRNAArg-adenosine 1243 deaminase TadA: Evolutionary consequences of decoding the four CGN codons as 1244 arginine in Mycoplasmas and other Mollicutes. Nucleic Acids Res. 41:6531–6543. 1245 Young RY, Bremer H, Programs B. 1976. Polypeptide-Chain-Elongation Rate in Escherichia 1246 coli B/r as a Function of Growth Rate. Biochem. J 160:185–194. 1247 1248 .CC-BY 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is The copyright holder for this preprintthis version posted May 15, 2026. ; https://doi.org/10.64898/2026.05.15.725136doi: bioRxiv preprint

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-pdf

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00
unpaywall
last seen: 2026-05-23T02:00:01.238055+00:00
License: CC-BY-4.0