Evolution At Spike Position 519 in SARS-CoV-2 Facilitated Adaptation to Humans

preprint OA: closed
Full text JSON View at publisher
Full text 117,320 characters · extracted from preprint-html · click to expand
Evolution At Spike Position 519 in SARS-CoV-2 Facilitated Adaptation to Humans | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Article Evolution At Spike Position 519 in SARS-CoV-2 Facilitated Adaptation to Humans James Weger-Lucarelli, Chelsea Cereghino, Kasia Michalak, Stephen DiGiuseppe, and 7 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-3835105/v1 This work is licensed under a CC BY 4.0 License Status: Under Review Version 1 posted 9 You are reading this latest preprint version Abstract As the COVID-19 pandemic enters its fourth year, the pursuit of identifying a progenitor virus to SARS-CoV-2 and understanding the mechanism of its emergence persists, albeit against the backdrop of intensified efforts to monitor the ongoing evolution of the virus and the influx of new mutations. Surprisingly, few residues hypothesized to be essential for SARS-CoV-2 emergence and adaptation to humans have been validated experimentally, despite the importance that these mutations could contribute to the development of effective antivirals. To remedy this, we searched for genomic regions in the SARS-CoV-2 genome that show evidence of past selection around residues unique to SARS-CoV-2 compared with closely related coronaviruses. In doing so, we identified a residue at position 519 in Spike within the receptor binding domain that holds a static histidine in human-derived SARS-CoV-2 sequences but an asparagine in SARS-related coronaviruses from bats and pangolins. In experimental validation, the SARS-CoV-2 Spike protein mutant carrying the putatively ancestral H519N substitution showed reduced replication in human lung cells, suggesting that the histidine residue contributes to viral fitness in the human host. Structural analyses revealed a potential role of Spike residue 519 in mediating conformational transitions necessary for Spike to adopt an up configuration prior to binding with ACE2. Pseudotyped viruses bearing the putatively ancestral N519 also demonstrated significantly reduced infectivity in cells expressing the human ACE2 receptor compared to H519. Biochemical assays corroborated that N519 binds human ACE2 with lower affinity than H519. Collectively, these findings suggest that the evolutionary transition at position 519 of the Spike protein played a critical role in SARS-CoV-2 emergence and adaptation to the human host. Additionally, this residue presents as a potential drug target for designing small molecule inhibitors tailored to this site. Biological sciences/Microbiology/Virology/Sars cov 2 Biological sciences/Microbiology/Virology/Viral evolution Figures Figure 1 Figure 2 Figure 3 Introduction Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the causative agent of coronavirus disease 2019 (COVID-19), which has claimed just under seven million lives and caused 770 million cases of the disease since its emergence in 2019 1 . SARS-CoV-2 is a positive sense, single-stranded RNA virus of the family Coronaviridae , genus Betacoronavirus , and subgenus Sarbecovirus 2 . The first human cases were detected in the city of Wuhan in the Hubei province of China 3 , 4 and were associated with the Huanan wholesale seafood market 5 , 6 , where the virus may have spilled over from a live animal source 7 , 8 . While not definitive, there is evidence of a bat origin of the progenitor to SARS-CoV-2 9 . Sarbecoviruses isolated from bats have high nucleotide sequence similarity to SARS-CoV-2 on a genome level and within the receptor-binding domain (RBD) of the Spike protein 4 , 10 . BANAL-20-52, isolated from a bat in Laos has the highest nucleotide similarity with SARS-CoV-2 to date at 96.8% 10 . Hypotheses posited the progenitor to SARS-CoV-2 evolved in bats and was pre-adapted to humans since positive selection was detected on deep branches of the nCoV clade and not terminal branches leading to SARS-CoV-2 on a phylogenetic tree 9 , 10 . However, the precise molecular mechanisms of adaptation of the progenitor to SARS-CoV-2 to humans have not been well characterized despite how these data would aid in the development of therapeutics and surveillance for pandemic prevention. Our previous study determined evolution towards an alanine at position 372 in the SARS-CoV-2 Spike protein was important for human adaptation by first identifying selective sweeps in the SARS-CoV-2 genome 11 . Selective sweeps occur when a favorable mutation is quickly fixed in the population 12 . Neighboring “hitch-hiker” alleles also increase in frequency as a result of the driver mutation of the selective sweep on the recombinant genomic region 13 , 14 Thus, parsing out truly adaptive mutations requires experimental validation and is critical to determining mechanisms of emergence. Many studies investigating the emergence of SARS-CoV-2 have strictly involved computational analyses or the generation of pseudotyped viruses expressing Spike. These methods are useful but do not validate predictions or provide the most accurate quantitation of fitness of SARS-CoV-2 compared to examining phenotypes of replication-competent virus. Here, we gain new insights into the evolutionary pressures and molecular mechanisms behind the emergence of SARS-CoV-2 in humans by generating both pseudotyped and replication-competent virus with a putative ancestral residue consistently mapping to a selective sweep region, as based on the analysis of approximately two million SARS-CoV-2 genomes. The mutation (Spike H519N) significantly decreased SARS-CoV-2 replication in human lung epithelial cells and reduced infectivity in pseudotyped virus assays and binding to human ACE2 in a biochemical assay, consistent with structural predictions. This study provides evidence that the evolution at Spike site 519 was important for the progenitor of SARS-CoV-2’s adaptation to the human host. The Spike site also remains highly conserved among SARS-CoV-2 sequences and offers a suitable candidate for targeting with small molecule inhibitors. Results SARS-CoV-2 sequences hold evidence of a past, strong selective event in the receptor binding domain of Spike To investigate which regions in SARS-CoV-2 have contributed to its efficient infection and circulation in humans, we analyzed sequences for signs of strong positive selection events known as selective sweeps. We acquired 1,912,191 human-isolated SARS-CoV-2 sequences from GISAID, covering the period up to August 2021. We then used a computational pipeline combining OmegaPlus 15 and RAiSD 16 to identify regions with a high probability of selective sweep events. To discern the selective sweep regions instrumental in driving mutations during the early stages of human infection, we categorized the sequences by their sample collection dates, arranging them into monthly cohorts (Fig. 1 A). Our analysis identified an average of 11 sweep regions per month, with the numbers ranging from 4 to 16 (Supplementary file 1). While many sweep regions were identified, our attention was primarily drawn to the sweep regions in the receptor-binding domain (RBD) of the Spike protein (nucleotide position 22,878 to 23,332; amino acid positions 439–590), as mutations in the RBD are known modulators of host tropism and receptor binding 17 . The selective sweep regions included the A372 residue in the RBD, a critical factor we previously identified for the emergence of SARS-CoV-2 and its sustained transmission among humans 11 . These regions were evident in the early months of the COVID-19 outbreak (Fig. 1 A). Since selective sweep regions indicate a past selective event driven by various adaptive mutations and their associated hitchhiker mutations, we sought to determine the site within the Spike selective sweep region where pivotal mutations occurred upon early infection of humans 18 . While the progenitor virus to SARS-CoV-2 is unknown, we inferred the progenitor Spike sequence could assume several amino acid identities found at the corresponding site in Spike from closely related sarbecoviruses infecting bats and pangolins. To identify the closest coronaviruses related to SARS-CoV-2, we constructed a phylogenetic tree based on the amino acid sequence of Spike with PhyML (Fig. 1 B) 19 , 20 . Using these phylogenetic relationships, we aligned the amino acid sequence of the selective sweep region in Spike from the Wuhan-Hu-1, lineage B strain of SARS-CoV-2 with the homologous region in closely related sarbecoviruses to reveal sites with differential amino acid identities (Fig. 1 C; see also Extended Data Fig. 1 for the full amino acid alignment of the sweep region in Spike). Similar to amino acid position 372 in Spike, position 519 holds a non-synonymous mutation (nucleotide substitution C23117A) that differentiates SARS-CoV-2 from the aligned bat- and pangolin-derived Sarbecovirus sequences 11 . This position was identified in the selective sweep regions during one of the early months of the outbreak (January 2021; Fig. 1 A). SARS-CoV-2 has a histidine at 519, while the bat and pangolin-derived sequences bear an asparagine or lysine. We hypothesized the progenitor to SARS-CoV-2, potentially a virus infecting bats or pangolins, acquired a histidine at some point in the evolutionary timeline, thereby gaining an adaptive advantage in humans. Since sites with low amino acid diversity may implicate a crucial role of the predominant amino acid in viral fitness, we sought to investigate the amino acid diversity at position 519 in Spike in SARS-CoV-2. Using data from 3,848 genomes sampled between December 2019 and October 2023 from the nCoV GISAID dataset displayed on NextStrain, we determined Spike position 519 in SARS-CoV-2 has a normalized Shannon entropy of 0, suggesting little to no flexibility is allowable at this position in humans ( Extended Data Fig. 2 ) 21 , 22 . With this result, we hypothesized the predominant histidine at Spike 519 in SARS-CoV-2 may hold an important function for the virus in humans, while the ancestral residue, asparagine, would result in deleterious effects on fitness in human cells. SARS-CoV-2 Spike mutant bearing a residue of bat and pangolin Sarbecovirus origin has reduced replicative fitness and infectivity in human cells Towards identifying the driving mutations of the putative selective event, we reasoned evolution from the asparagine to histidine at position 519 may have driven early adaptation of the progenitor virus to humans. Since position 519 is in the RBD of Spike, we first used a previously described pseudovirus system 23 to determine whether Spike H519N had any functional significance during infection of cells expressing human ACE2 (hACE2), the receptor for SARS-CoV-2 24 . First, we verified hACE2 protein levels by western blot expressed in human embryonic kidney cells ( Extended Data Fig. 3 ). We generated lentiviruses pseudotyped with full wild-type (WT) Spike, Spike H519N, Spike D614G, and Spike A372T, a RBD mutant bearing an ancestral threonine which we previously showed attenuates the virus in human lung epithelial cells 11 . Spike D614G is used as a known human-adaptive variant that emerged early in the pandemic and is now fixed in all SARS-CoV-2 variants 25 . The pseudoviruses express both luciferase and a green fluorescence protein (GFP), ZsGreen 23 , allowing for sensitive detection of infection efficiency. We detected Spike protein levels from prepared virus stocks and observed a greater incorporation of G614 into the pseudoviruses as previously observed (Fig. 2 A) 26 . Ectopic expression of Spike was similar between WT Spike and Spike H519N. Next, we used pseudovirus particles to infect human embryonic kidney cells expressing hACE2 to determine whether infectivity through hACE2 is altered by Spike H519N. Spike D614G enhanced the infectivity of the pseudotyped viruses compared to WT Spike, in agreement with data extensively reported in literature (Fig. 2 B) 26 . Spike A372T significantly reduced the infectivity of SARS-CoV-2, consistent with our previous study (Fig. 2 B) 11 . Importantly, Spike H519N significantly reduced the infectivity of SARS-CoV-2 compared to Spike D614G and WT Spike (Fig. 2 B). When infected cells were quantified based on luciferase expression, we also observed significant decreases in infectivity for Spike H519N compared to WT Spike (Fig. 2 C; p < 0.0001 ). Towards determining whether Spike N519 reduces the replicative fitness of SARS-CoV-2 in human cells, we generated a replication-competent, SARS-CoV-2 mutant bearing an asparagine at position 519 in Spike, the amino acid present in closely related sarbecoviruses. In human lung epithelial cells, the putatively ancestral SARS-CoV-2 Spike H519N mutant replicated to significantly lower titers than the WT virus and Spike D614G, demonstrating a nearly 2-log difference in viral titers (Fig. 2 D; p < 0.01 at all timepoints ). These data suggest that reverting the histidine at Spike 519 to the ancestral asparagine significantly reduces infection and replication in human lung epithelial cells. Spike H519N potentially impacts ability to sample up/down conformations and reduces binding affinity to human ACE2 In elucidating the mechanism of attenuation of SARS-CoV-2 bearing the putatively ancestral asparagine at Spike position 519, we analyzed the interactions between Spike chains to identify how H519N is impacting infection (Fig. 3 A). Residue 519 is in the RBD but not within the receptor binding motif (RBM) to interact with ACE2 directly. Rather, residue 519 is positioned on a cleft at the interface between Spike chains that constitute the full trimer complex. Indeed, residue 519 is within 4–5 Å of residues of adjacent chains of the Spike trimer, participates in polar interchain interactions, and conformational up/down movement can be impacted at this interface. Here, we sought to utilize this interaction interface to compute predicted free energy of binding calculations (MM/GBSA) analyzing up/down conformation favorability by probing interchain binding free energy in Spike. Results demonstrate that the H519 up conformation has similar interchain binding free energy to the N519 up conformation when unprotonated (-284.1 kcal/mol and − 264.1 kcal/mol, respectively). The down conformation exhibits larger differences between the H519 and N519 structures, with H519 interchain predicted free energy binding energy of -259.2 kcal/mol and N519 of -365.9 kcal/mol. Interestingly, when analyzing protonated H519, we observe predicted free energy of binding for the up conformation H519 of -359.4 kcal/mol and − 280.9 kcal/mol for the down confirmation of H519. This suggests that in a lower physiological pH, the H519 Spike samples the up conformation more favorably compared to the N519 which alternatively energetically favors the down conformation based on interchain interactions. Structural inspection by analyzing the neighboring residues reveals that 519 is surrounded by a pocket of polar and charged residues from a neighboring Spike chain (Fig. 3 B-C). Surface mapping of residue properties highlights that H519 exhibits a neutral surface area (Fig. 3 D) while N519 results in a more polar surface area (Fig. 3 E). These results reveal that H519 may result in a lower energy barrier to overcome when moving between down/up positioning to bind ACE2, potentially allowing for increased infection efficiency. With the insight that H519N appears to impact up/down conformational changes in protonated SARS-CoV-2 Spike and the potential ability to bind ACE2 by sampling the up position more favorably, we further sought to determine whether Spike H519 would experimentally bind with higher affinity to hACE2. Enzyme-linked immunosorbent assay (ELISA) was performed with hACE2 and several concentrations of RBD from either WT Spike, Spike N501Y, Spike A372T, or Spike H519N. Spike N501Y was used as a positive control with known increased binding to hACE2 27 . Consistent with reduced binding efficiency, the absorbance curve for Spike H519N and Spike A372T RBD are shifted right from WT RBD (Fig. 3 F). We observed significantly lower EC 50 values for Spike N501Y RBD and significantly higher EC 50 values for Spike H519N and Spike A372T compared to WT RBD (Fig. 3 G). This result suggests more molecules of Spike H519N RBD would be required to saturate hACE2 binding sites; therefore, this mutation may reduce the affinity to hACE2 leading to reductions in replication. Discussion During zoonotic spillover events, selective pressures exist for pathogens to adapt to the cellular components and immune responses of new host species in the pursuit of optimal fitness. RNA viruses like SARS-CoV-2 are skilled at navigating the fitness landscape in new hosts, taking advantage of both recombination and the error-prone RNA-dependent RNA polymerase to source new genotypes that can lead to fitness advantages 28 . Our study provides evidence of historical selection acting on SARS-CoV-2 via selective sweeps and assesses the contribution of an amino acid site within a selective sweep region in the RBD of Spike that likely facilitated adaptation to humans. We provide evidence that the putatively ancestral asparagine at Spike 519 in closely related sarbecoviruses reduces the fitness of SARS-CoV-2 in cells expressing hACE2 due to reduced affinity of Spike RBD N519 to hACE2 and higher binding free energy in the up conformation. Several studies have gathered strong evidence for a natural origin of SARS-CoV-2, but there is disagreement on how SARS-CoV-2 might have evolved to sustain productive infection of and transmission between humans. Some studies have suggested the progenitor to SARS-CoV-2 evolved in bats and required little to no adaptation to humans before spillover and emergence 10 . While we do not refute evolution of the progenitor to SARS-CoV-2 occurred in bats, our study provides evidence of a mutation that enhanced human infection but does not predict in which host this mutation arose. It is possible that genetic drift led to mutations in the reservoir host which enabled human adaptation prior to human exposure 9 . This study is the second of ours that supports the theory that the progenitor to the Wuhan-Hu-1 strain of SARS-CoV-2 was subject to a strong selective pressure resulting in adaptation to humans at a site in the RBD of Spike 11 . Zoonotic spillover events are often associated with the selection of mutations in viral receptor binding proteins that afford the use of orthologous receptors in the human host 29 . For coronaviruses in particular, mutations in the Spike protein, which binds the host ACE2 receptor, are sufficient to expand the host range of the virus even though a variety of other factors are important for tropism 17 . Another zoonotic coronavirus, severe acute respiratory syndrome coronavirus (SARS-CoV), which caused an epidemic from 2002–2003, emerged likely due to a set of mutations both in the receptor binding domain (RBD) and other parts of Spike and underwent adaptive evolution from its reconstructed ancestral sequence 17 , 30 . Our results are in agreement with other studies that mutations outside of the RBM of the coronavirus Spike protein, including both subunits of Spike, can affect receptor binding and are also associated with host-range expansion 11 , 31 , 32 . Amino acid position 519 in Spike is in the RBD but not within the receptor binding motif (RBM). The putatively ancestral asparagine has more favorable co-association of Spike in the down conformation, which impacts the potential to sample an up conformation and bind with hACE2. This can be observed with the reduced binding affinity of Spike H519N RBD to hACE2 via ELISA and interchain free energy calculations highlighting the H519 up conformation favorability in an acidic environment. While these results provide insight into potential structural mechanisms related to the fitness of H519, more robust molecular dynamics (MD) simulations could reveal more major morphological changes related to 519 fitness in up/down conformations. Our results are not surprising since mutations outside of the RBM have been shown to increase binding affinity to hACE2 during a deep mutational scan of the Spike RBD 33 . Furthermore, mutations at 372 and 614, which lie outside of the RBD, affect ACE2 binding 11 , 34 . Our results are biologically relevant given that the human nasal cavity pH is 6.6 which might provide a more optimal environment for the protonated SARS-CoV-2 Spike H519 to associate more with hACE2 than Spike 519N in this environment 35 . Taken in an evolutionary context, it is possible that Spike N519H arose in the progenitor virus and that this mutation increased association with hACE2 when the virus initially encountered humans, leading to enhanced infection and transmissibility. We used NextStrain to examine the diversity within Spike and identified low normalized Shannon entropy approaching 0 at position 519 in Spike of SARS-CoV-2 sequences from 2019–2023 from GISAID. Since we have shown the histidine at position 519 in Spike is important for replication in human lung cells and that it is highly conserved regardless of lineage and variant of concern, position 519 might constitute a suitable drug target. Indeed, structural inspection of the surface area maps and structural residue region of 519 indicates that it is within a solvent accessible pocket, surrounded by polar and charged residues from neighboring chains. Suitable pockets for small molecule inhibitors may also correlate with regions of low diversity in the genome, however further investigation is needed. Efforts to design small molecule inhibitors of SARS-CoV-2 have resulted in targeting genes such as the main protease of SARS-CoV-2 36 . Recent surveillance studies identified multiple mutations in SARS-CoV-2 that confer resistance to nirmatrelvir 37 . Another protease inhibitor ensitrelvir is being used for emergency use authorization in Japan, but mutations conferring resistance to this inhibitor have been identified as well 37 . This highlights the importance of finding new components of the virus to target, and the RBD at position 519 in Spike might be suitable since other amino acids at this position have not emerged to any appreciable extent. To conclude, a mutation altering the amino acid identity at position 519 in the Spike protein of SARS-CoV-2 is likely to have partly mediated adaptation of the progenitor virus to the human host. We demonstrated the conserved histidine at position 519 in Spike affords fitness advantages through increased replication in human cells via increased hACE2 binding affinity and favorability of Spike in an up conformation. Position 519 in Spike holds promise as a site to which small molecular inhibitors could be designed amidst the concern of the development of drug resistance. Methods Selective sweep analysis of SARS-CoV-2 genomes A total of 1,914,191 human-derived, complete SARS-CoV-2 genomes up to Aug. 2021 were downloaded from the GISAID EpiCov database ( https://www.gisaid.org/ ). Sequences that had low coverage or displayed over 5% nucleotide ambiguities (designated as 'N') were excluded from the analysis to ensure analytical rigor. For further analysis, sequences were categorized monthly based on their sample collection dates. Selective sweep identification was conducted within these monthly cohorts using previously established data curation and methodologies 11 . Construction of phylogenetic tree The following coronavirus sequences were downloaded from GenBank: Bat coronavirus isolate BANAL-20-52/Laos/2020 MZ937000.1, Bat coronavirus isolate BANAL-20-103/Laos/2020, complete genome MZ937001.1, Bat coronavirus isolate BANAL-20-116/Laos/2020, complete genome MZ937002.1, Bat coronavirus isolate BANAL-20-236/Laos/2020, complete genome MZ937003.2, Bat coronavirus isolate BANAL-20-247/Laos/2020, complete genome MZ937004.1, Bat coronavirus RacCS203, complete genome MW251308.1, Bat coronavirus RaTG13, complete genome MN996532, Bat coronavirus strain BetaCoV/Rm/Yunnan/YN02/2019 spike protein (S) gene, partial cds MW201982.1, Bat sarbecovirus sp. isolate PrC31, complete genome MW703458.1, Bat SARS-like coronavirus isolate Rs4231 KY417146.1, Bat Severe acute respiratory syndrome-related coronavirus Rc-o319 RNA, complete genome LC556375.1, Betacoronavirus sp. RpYN06 strain bat/Yunnan/RpYN06/2020, complete genome MZ081381.1, Coronavirus BtRs-BetaCoV/YN2018B MK211376.1, MAG Pangolin coronavirus isolate GD P79-9 2019, complete genomeOQ297708.1, Pangolin coronavirus isolate MP789, complete genome MT121216.1, Pangolin coronavirus isolate PCoV_GX-P4L, complete genome MT040333.1, Pangolin coronavirus isolate PCoV_GX-P5L MT040335.1, SARS coronavirus Tor2 AY274119.3, Severe_acute_respiratory_syndrome_coronavirus_2_isolate_Wuhan-Hu-1_NC_045512.2. Geneious (version 2023.1 created by Biomatters. Available from https://www.geneious.com ) was used to construct a phylogenetic tree based on the amino acid sequence of Spike using PhyML 3.3.20180214 with a LG substitution model and 1000 bootstraps 19 . An amino acid alignment of the selective sweep region in Spike identified in January 2021 was performed using Geneious with the same viruses as above to identify non-synonymous mutations compared to the Wuhan-Hu-1 strain of SARS-CoV-2. Diversity analysis in Spike NextStrain was accessed on November 1, 2023 displaying all 3,848 nCov genomes sampled between December 2019 and October 2023 from the GISAID nCoV dataset. Amino acid diversity data was obtained by downloading the full Spike diversity dataset from the site. NextStrain measures normalized Shannon entropy by determining the observed counts of a residue at a given amino acid position normalized by the number of tips in the phylogenetic tree on NextStrain and summing the individual entropies for an amino acid identity at the position 21 . Cell lines and plasmids Human embryonic kidney cells expressing human ACE2 (HEK-293T-hACE2; NR-52511) and African green monkey kidney epithelial cells expressing TMPRSS2 and human ACE2 (Vero E6-TMPRSS2-T2A-hACE2; NR-54970) were obtained from BEI Resources. Human lung epithelial cells (Calu-3 HTB-55) and human embryonic kidney cells (HEK-293T) were acquired from ATCC. Cells were grown in a humidified atmosphere with carbon dioxide supplied at 5% at 37°C. HEK-293T and HEK-293T-hACE2 cells were grown in Dulbecco’s Modified Eagle Medium (DMEM; Corning™ 10013CV) supplemented with 10% Fetal Bovine Serum, 100 U/mL penicillin, 100 U/mL streptomycin, 2mM L-glutamine. Vero E6-TMPRSS2-T2A-hACE2 and Calu-3 cells were grown in Dulbecco’s Modified Eagle Medium supplemented with gentamicin sulfate (0.1%), non-essential amino acids (1X), HEPES (25mM), and either 5% (Vero E6-TMPRSS2-T2A-hACE2) or 20% (Calu-3) fetal bovine serum. Vero E6-TMPRSS2-T2A-hACE2 also required the addition of 0.01 mg/mL puromycin. SARS-Related Coronavirus 2, Wuhan-Hu-1 Spike-Pseudotyped Lentiviral Kit V2 was obtained from BEI Resources (NR-53816). Generation of Spike pseudotyped virus mutants Spike mutations A372T and H519N were introduced to the human codon-optimized Spike gene on vector pHDM SARS-Related Coronavirus 2 Wuhan-Hu-1 Spike Glycoprotein (BEI; NR-53742). We used the BioLabs Q5 Site-Directed Mutagenesis Kit (NEB #E0554) as described from the manufacturer’s protocol with mutagenesis primers Forward (CGAATTGCTCAACGCTCCAGC) and Reverse (AAACTCAAGACCACAACTCTG) for mutant H519N, and Forward (GTATAATAGTACAAGCTTTAGCACATTC) and Reverse (AATACTGAGTAGTCCGCC) for mutant A372T. Mutations were confirmed by Sanger sequencing. Vector HDM SARS2-Spike-del121 D614G was purchased from Addgene (cat #158762). Pseudotyped lentiviral particles were generated by transfecting HEK-293T cells using PolyJet In Vitro DNA Transfection reagent (SignaGen cat #SL100688) following the manufacturer’s recommendations for the six-well plate, with the adjusted amount of 3µg for each of the following BEI plasmids: pRC-CMV-Rev1b (NR-52519), HDM-tat1b (NR-52518), HDM-Hgpm2 (NR-52518), Luciferase-IRES-ZsGreen (NR52516), and 3µg of the pHDM vector encoding the Wuhan-Hu-1 Spike Glycoprotein (WT), or D614G, A372T, or H519N spike mutations, respectively. We monitored the transfection efficiency by detecting ZsGreen signal via live cell fluorescence microscopy using a Leica DMI4000B inverted microscope. Supernatants with lentiviral particles were harvested 48- and 72-hours post-transfection. Pseudoviruses were precipitated overnight at 4 o C with 40% PEG-8000/1.2 M NaCl pH 7.2 at a 1 to 3 ratio, centrifuged at 1200rpm for 1 hour at 4 o C, and obtained pellet resuspended in 1/10 of the original collection volume. Western blot analysis of pseudotyped virus Spike incorporation We verified the presence of SARS-CoV-2 Spike protein on WT and mutated pseudotyped lentiviral particles by Western blot analysis. Viral protein lysates were denatured at 95 o C, resolved on a 10% SDS-PAGE gel, and transferred to Biorad Immun-Blot PVDF membrane (cat #1620177). After 1 hour Blocking in 5% milk in TBS-T buffer (50 mM Tris [pH 7.5], 150 mM NaCl containing 0.1% Tween 20), the membrane was incubated overnight with Invitrogen Anti-SARS-CoV2-Spike protein S1 primary antibody (cat #PA5-114528). After three washes with TBS-T buffer, blot was probed for 1 hour with Peroxidase-conjugated AffiniPure Goat Anti-Rabbit IgG (H + L) secondary antibody (Jackson ImmunoResearch Laboratories cat #115-035-003) in 5% milk/TBS-T buffer. The membrane was washed three times with TBS-T, and the signal was detected with Biorad Clarity Western ECL substrate cat (#170–5061) and imaged using Biorad ChemiDoc MP Imaging System. Quantification of SARS-CoV-2 plasmid pseudogenome via real-time PCR For the quantification of pseudovirus particles, we extracted the DNA from lentiviral stocks using Qiagen DNeasy Blood & Tissue Kit (cat #69504) according to the manufacturer's protocol. We quantified a ZsGreen amplicon by qPCR via Applied Biosystems PowerSYBR Green PCR Master Mix (cat #4376659) using Forward (GACCATGAAGTACCGCATG) and Reverse (CCGCCCTCCACCACGCAC) ZsGreen-specific primers. The DNA Standard curve of 10 ng/µL, 1 ng/µL, 0.1 ng/µL and 0.01 ng/µL serial dilutions of pHAGE-CMV-Luc2-ZsGreen plasmid served as our reference. The relative amounts of pseudoviral genomes of all mutants were adjusted to WT lentivirus before infections. Luciferase Assay The luciferase activity of HEK-293T-hACE2 cells infected with pseudovirus was measured using the Promega Luciferase Assay System according to the manufacturer's protocol (cat #E1500). Cells were washed 2 times with PBS and lysed in 1X Lysis Buffer per well in a 12-well plate. 100 µL of the 1X Luciferase Assay per 20 µL of lysate was used for detection of luciferase activity in each well of a 96-well black microplate. The efficiency of infection was determined by measuring the intensity of luminescence calculated as relative luciferase units using the Agilent BioTek Synergy HTX Multi-Mode Microplate Reader. Generation of SARS-CoV-2 mutants Live SARS-CoV-2 mutants were generated from a previously constructed full-length infectious clone of the Wuhan-Hu-1 strain of SARS-CoV-2 (GenBank accession no. NC_045512.2). Mutagenic primers were used to introduce mutations into the backbone by PCR with Invitrogen Platinum SuperFi II PCR master mix (12368010) or Quantabio repliQa HiFi ToughMix (95200-025). Fragments were purified from agarose gels using the Machery-Nagel nucleospin gel and pcr clean-up kit (740609.250). Purified fragments were mixed at an equimolar ratio and assembled and amplified by replication cycle reaction using the OriCiro Genomics Cell-Free Cloning System. Virus was recovered by transfecting a 1:1 ratio of replication cycle reaction product and pUC19 38 into BHK-21 cells using Polyplus jetOPTIMUS DNA transfection reagent (101000051). Three days post-transfection, a blind passage of transfection supernatant on Vero E6-TMPRSS2-T2A-hACE2 cells was performed. After observing cytopathic effect, the virus was harvested, titered by plaque assay on Vero E6-TMPRSS2-T2A-hACE2 cells, and 400bp libraries were prepared in-house using IDT Artic V4.1 NCOV-2019 Panel (10011442) and sequenced by Genewiz NGS Amplicon-EZ Illumina sequencing. Growth curve of Spike H519N in human lung epithelial cells Human lung epithelial cells (Calu-3) were infected at 80% confluency with a MOI of 0.1 of SARS-CoV-2 Wuhan-Hu-1 (wild-type), SARS-CoV-2 Spike D614G mutant, and SARS-CoV-2 Spike H519N mutant diluted in Roswell Park Memorial Institute 1640 (RPMI-1640) medium supplemented with fetal bovine serum (2%) and HEPES (10 mM). Supernatant was harvested every 24 hours post-infection and titered by plaque assays on Vero E6-TMPRSS2-T2A-hACE2 cells. Quantitation of infectious virus Infectious virus was quantified by plaque assay by infecting 90% confluent monolayers of Vero E6-TMPRSS2-T2A-hACE2 with serial dilutions of virus made in RPMI-1640 medium supplemented with fetal bovine serum (2%) and HEPES (10 mM). After one hour of adsorption at 37°C, a 1:1 mixture of 2X media (2X EMEM, 4X L-glutamine, 0.735% sodium bicarbonate, 0.2 mg/mL gentamicin sulfate, 4% FBS, 20 mM HEPES) and 3% methylcellulose was added to the cells. Infection proceeded for three days at 37°C before fixation in 10% buffered formalin and staining with a 0.1% crystal violet solution containing 20% ethanol to visualize plaques. Structural Analyses Molecular Mechanics/Generalized Born Surface Area (MM/GBSA) was used to predict interchain free energy of binding with the Schrodinger- Maestro software v. 2020-2 (Schrödinger, LLC, New York, NY, 2020). Spike protein (PDB ID: 7KNB) was split into individual chains and modified to reflect D614G in the Wuhan-Hu-1 strain 39 . Predicted free energy of binding between Spike chains first on one chain in an up-RBD conformation and one chain in a down-RBD conformation to approximate up conformation free energy and two down-RBD conformation chains for down conformation free energy on H519 (neutral), H519 (protonated), and N519 structures. Protonated and unprotonated states were simulated at a pH of 5 and 7, respectively. Surface mapping was performed using Schrodinger- Maestro software v. 2020-2. Biochemical characterization of Spike with human ACE2 A sandwich ELISA was performed using recombinant, purified RBD of His-tagged wild-type Wuhan-Hu-1 SARS-CoV-2, SARS-CoV-2 Spike N501Y, SARS-CoV-2 Spike H519N, or SARS-CoV-2 Spike A372T and hACE2. A plate was coated with ACE2-mFc (Cat:10108-H05H) at 2 µg/mL and incubated overnight at 4°C. The following day, RBD of each aforementioned genotype was added at the following concentrations: 0.256, 1.28, 6.4, 32, 160, 800, and 4000 ng/mL. The antibody anti-His-HRP (Sino A5327) was used to detect the His-tagged RBDs. Absorbance was measured at 450nm, and values were normalized to the blank. Statistical Analyses All statistical analyses for Figs. 2 – 3 were performed using Prism 9 (GraphPad). Growth curve data was analyzed using a Two-Way ANOVA with Dunnett’s correction for multiple comparisons. All other data were analyzed with a One-Way ANOVA with Dunnett’s correction for multiple comparisons. Declarations Acknowledgements We extend our thanks to the scientists who performed surveillance and sequencing of sarbecoviruses and SARS-CoV-2 and made the data accessible via GenBank and GISAID and to the team that built and maintains NextStrain. We would like to acknowledge Sino Biological for performing ELISAs. Competing interests declaration The authors of this manuscript do not declare any competing interests. Author contributions Conceptualization: P.M., L.K., J.W.L., A.B., S.D. Data curation: C.C., K.M., A.S., P.M., L.K., J.W.L., A.B., S.D. Formal analysis: C.C., K.M., A.S., P.M., L.K., J.W.L., A.B., S.D. Investigation: C.C., K.M., A.S., P.M., L.K., J.W.L., A.B., S.D. Methodology: P.M., L.K., J.W.L., A.B., S.D. Validation: C.C., K.M. Writing – original draft: C.C. Writing – review & editing: C.C., K.M., A.S., S.D., L.K., A.B., J.W.L., P.M. Funding acquisition: J.W.L., P.M. Project administration: J.W.L., P.M. Resources: J.W.L., P.M., S.D. Supervision: P.M., J.W.L., A.B., S.D. All authors read and approved the final manuscript. Biosecurity All research protocols were approved by the Virginia Tech Institutional Biosafety Committee prior to beginning experiments. Pseudotyped virus work was conducted at biosafety level 2, while live virus manipulation of SARS-CoV-2 was performed in a biosafety level 3 (BSL3) laboratory at Virginia Tech with appropriate fitting N95 respirators and other standard PPE. SARS-CoV-2 was appropriately inactivated at 65C for 10 minutes after the addition of a viral lysis buffer, as described previously, before removal from the BSL3 for library preparation 40 . References WHO Coronavirus (COVID-19) dashboard. https://covid19.who.int/. Coronaviridae Study Group of the International Committee on Taxonomy of Viruses. The species Severe acute respiratory syndrome-related coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2. Nat Microbiol 5 , 536–544 (2020). Huang, C. et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 395 , 497–506 (2020). Zhou, P. et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature 579 , 270–273 (2020). Wu, F. et al. A new coronavirus associated with human respiratory disease in China. Nature 579 , 265–269 (2020). Organization, W. H. & Others. WHO-convened global study of origins of SARS-CoV-2: China Part. (2021). Worobey, M. et al. The Huanan Seafood Wholesale Market in Wuhan was the early epicenter of the COVID-19 pandemic. Science 377 , 951–959 (2022). Holmes, E. C. et al. The origins of SARS-CoV-2: A critical review. Cell 184 , 4848–4856 (2021). MacLean, O. A. et al. Natural selection in the evolution of SARS-CoV-2 in bats created a generalist virus and highly capable human pathogen. PLoS Biol. 19 , e3001115 (2021). Temmam, S. et al. Bat coronaviruses related to SARS-CoV-2 and infectious for human cells. Nature 604 , 330–336 (2022). Kang, L. et al. A selective sweep in the Spike gene has driven SARS-CoV-2 human adaptation. Cell 184 , 4392-4400.e4 (2021). Durrett, R. & Schweinsberg, J. Approximating selective sweeps. Theor. Popul. Biol. 66 , 129–138 (2004). Nordborg, M., Charlesworth, B. & Charlesworth, D. The effect of recombination on background selection. Genet. Res. 67 , 159–174 (1996). Lai, M. M. & Cavanagh, D. The molecular biology of coronaviruses. Adv. Virus Res. 48 , 1–100 (1997). Alachiotis, N., Stamatakis, A. & Pavlidis, P. OmegaPlus: a scalable tool for rapid detection of selective sweeps in whole-genome datasets. Bioinformatics 28 , 2274–2275 (2012). Alachiotis, N. & Pavlidis, P. RAiSD detects positive selection based on multiple signatures of a selective sweep and SNP vectors. Commun Biol 1 , 79 (2018). Li, W. et al. Receptor and viral determinants of SARS-coronavirus adaptation to human ACE2. EMBO J. 24 , 1634–1643 (2005). Stephan, W. Selective Sweeps. Genetics 211 , 5–13 (2019). Guindon, S. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59 , 307–321 (2010). Guindon, S. & Gascuel, O. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst. Biol. 52 , 696–704 (2003). Hadfield, J. et al. Nextstrain: real-time tracking of pathogen evolution. Bioinformatics 34 , 4121–4123 (2018). Sagulenko, P., Puller, V. & Neher, R. A. TreeTime: Maximum-likelihood phylodynamic analysis. Virus Evol 4 , vex042 (2018). Dadonaite, B. et al. A pseudovirus system enables deep mutational scanning of the full SARS-CoV-2 spike. Cell 186 , 1263-1278.e20 (2023). Jackson, C. B., Farzan, M., Chen, B. & Choe, H. Mechanisms of SARS-CoV-2 entry into cells. Nat. Rev. Mol. Cell Biol. 23 , 3–20 (2022). Plante, J. A. et al. Spike mutation D614G alters SARS-CoV-2 fitness. Nature 592 , 116–121 (2021). Zhang, L. et al. SARS-CoV-2 spike-protein D614G mutation increases virion spike density and infectivity. Nat. Commun. 11 , 6013 (2020). Liu, Y. et al. The N501Y spike substitution enhances SARS-CoV-2 infection and transmission. Nature 602 , 294–299 (2022). Sun, F. et al. SARS-CoV-2 Quasispecies Provides an Advantage Mutation Pool for the Epidemic Variants. Microbiol Spectr 9 , e0026121 (2021). Warren, C. J. & Sawyer, S. L. How host genetics dictates successful viral zoonosis. PLoS Biol. 17 , e3000217 (2019). Yuan, Z., Nan, Z., Pei, H. & Yang, Z. Reconstruction of the most recent common ancestor sequences of SARS-Cov S gene and detection of adaptive evolution in the spike protein. Chin. Sci. Bull. 49 , 1311–1313 (2004). Becker, M. M. et al. Synthetic recombinant bat SARS-like coronavirus is infectious in cultured cells and in mice. Proc. Natl. Acad. Sci. U. S. A. 105 , 19944–19949 (2008). McRoy, W. C. & Baric, R. S. Amino acid substitutions in the S2 subunit of mouse hepatitis virus variant V51 encode determinants of host range expansion. J. Virol. 82 , 1414–1424 (2008). Starr, T. N. et al. Deep Mutational Scanning of SARS-CoV-2 Receptor Binding Domain Reveals Constraints on Folding and ACE2 Binding. Cell 182 , 1295-1310.e20 (2020). Ozono, S. et al. SARS-CoV-2 D614G spike mutation increases entry efficiency with enhanced ACE2-binding affinity. Nat. Commun. 12 , 848 (2021). Kreutzberger, A. J. B. et al. SARS-CoV-2 requires acidic pH to infect cells. Proc. Natl. Acad. Sci. U. S. A. 119 , e2209514119 (2022). Liu, H. et al. Development of optimized drug-like small molecule inhibitors of the SARS-CoV-2 3CL protease for treatment of COVID-19. Nat. Commun. 13 , 1891 (2022). Moghadasi, S. A. et al. Transmissible SARS-CoV-2 variants with resistance to clinical protease inhibitors. Sci Adv 9 , eade8778 (2023). Søndergaard, J. N. et al. Successful delivery of large-size CRISPR/Cas9 vectors in hard-to-transfect human cells using small plasmids. Commun Biol 3 , 319 (2020). Zhou, T. et al. Cryo-EM Structures of SARS-CoV-2 Spike without and with ACE2 Reveal a pH-Dependent Switch to Mediate Endosomal Positioning of Receptor-Binding Domains. Cell Host Microbe 28 , 867-879.e5 (2020). Castellanos-Gonzalez, A. et al. Direct RT-PCR amplification of SARS-CoV-2 from clinical samples using a concentrated viral lysis-amplification buffer prepared with IGEPAL-630. Sci. Rep. 11 , 14204 (2021). Additional Declarations No competing interests reported. Supplementary Files SupplFiledocx.docx Cite Share Download PDF Status: Under Review Version 1 posted Editorial decision: Revision requested 21 Feb, 2024 Reviews received at journal 23 Jan, 2024 Reviews received at journal 20 Jan, 2024 Reviewers agreed at journal 11 Jan, 2024 Reviewers agreed at journal 10 Jan, 2024 Reviewers invited by journal 09 Jan, 2024 Editor assigned by journal 06 Jan, 2024 Submission checks completed at journal 05 Jan, 2024 First submitted to journal 04 Jan, 2024 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-3835105","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Article","associatedPublications":[],"authors":[{"id":265611714,"identity":"d79fe17f-9308-4975-a764-ac52ed27bb67","order_by":0,"name":"James Weger-Lucarelli","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAAoklEQVRIiWNgGAWjYNCCCjBpQIqWMwwSJGphbCNFi8Hx9ouPK+cdrmNgb94mQZyWM2eKDc9uOyzBwHOsjDgtZjdy0iQbt6VJMEjkmBGp5f6b9J+Nc4Ba5N8Qq+UG+zHGxgYboC08RGqxP5PDLNlwzEayjSet2IIoLZLtxx9+bKiR4OdnP7zxBlFaGBh4INHBRqRyEGB/QILiUTAKRsEoGJEAADMtKsj4ylcyAAAAAElFTkSuQmCC","orcid":"","institution":"Virginia Tech","correspondingAuthor":true,"prefix":"","firstName":"James","middleName":"","lastName":"Weger-Lucarelli","suffix":""},{"id":265611715,"identity":"cdbfef1b-522b-46ce-8fb4-b23cbd90bd64","order_by":1,"name":"Chelsea Cereghino","email":"","orcid":"","institution":"Virginia Tech","correspondingAuthor":false,"prefix":"","firstName":"Chelsea","middleName":"","lastName":"Cereghino","suffix":""},{"id":265611716,"identity":"68579994-75ba-4a29-98fc-4beaf42be58b","order_by":2,"name":"Kasia Michalak","email":"","orcid":"","institution":"Edward Via College of Osteopathic Medicine","correspondingAuthor":false,"prefix":"","firstName":"Kasia","middleName":"","lastName":"Michalak","suffix":""},{"id":265611717,"identity":"38693202-e6bb-4b85-a52d-04d1124e4ab3","order_by":3,"name":"Stephen DiGiuseppe","email":"","orcid":"","institution":"Edward Via College of Osteopathic Medicine","correspondingAuthor":false,"prefix":"","firstName":"Stephen","middleName":"","lastName":"DiGiuseppe","suffix":""},{"id":265611718,"identity":"b6f00c9e-5e1b-4772-9540-4d0fa4ada095","order_by":4,"name":"Juan Guerra","email":"","orcid":"","institution":"Edward Via College of Osteopathic Medicine","correspondingAuthor":false,"prefix":"","firstName":"Juan","middleName":"","lastName":"Guerra","suffix":""},{"id":265611719,"identity":"a48484af-7f63-4d0f-9f85-2548495294da","order_by":5,"name":"Delaney Yu","email":"","orcid":"","institution":"Edward Via College of Osteopathic Medicine","correspondingAuthor":false,"prefix":"","firstName":"Delaney","middleName":"","lastName":"Yu","suffix":""},{"id":265611720,"identity":"e7aece52-8d4e-43ed-b6ca-a4af2f3d223a","order_by":6,"name":"Ariana Faraji","email":"","orcid":"","institution":"Edward Via College of Osteopathic Medicine","correspondingAuthor":false,"prefix":"","firstName":"Ariana","middleName":"","lastName":"Faraji","suffix":""},{"id":265611721,"identity":"8407c8e8-3172-4303-8d00-6ef0fd06b0d0","order_by":7,"name":"Amanda Sharp","email":"","orcid":"","institution":"Virginia Tech","correspondingAuthor":false,"prefix":"","firstName":"Amanda","middleName":"","lastName":"Sharp","suffix":""},{"id":265611722,"identity":"931e9c23-add1-4a5a-85a0-d05871b9dead","order_by":8,"name":"Anne Brown","email":"","orcid":"","institution":"Virginia Tech","correspondingAuthor":false,"prefix":"","firstName":"Anne","middleName":"","lastName":"Brown","suffix":""},{"id":265611723,"identity":"74695b57-dff0-436d-bcdc-b5f7caa1820a","order_by":9,"name":"Lin Kang","email":"","orcid":"","institution":"Edward Via College of Osteopathic Medicine","correspondingAuthor":false,"prefix":"","firstName":"Lin","middleName":"","lastName":"Kang","suffix":""},{"id":265611724,"identity":"2653a4f7-2025-4a46-b842-9d894de03c80","order_by":10,"name":"Pawel Michalak","email":"","orcid":"","institution":"Edward Via College of Osteopathic Medicine","correspondingAuthor":false,"prefix":"","firstName":"Pawel","middleName":"","lastName":"Michalak","suffix":""}],"badges":[],"createdAt":"2024-01-04 17:44:17","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-3835105/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-3835105/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":49322288,"identity":"5c86f74d-a9bf-4423-bc11-ef6034e8238b","added_by":"auto","created_at":"2024-01-08 16:50:53","extension":"jpg","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":1545268,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eAnalysis of millions of SARS-CoV-2 sequences identified selective sweep regions within Spike.\u003c/strong\u003e \u003cstrong\u003eA. \u003c/strong\u003eSelective sweep regions were identified per month up to August 2021. Sweep regions are denoted in purple. \u003cstrong\u003eB.\u003c/strong\u003ePhylogenetic tree of SARS-CoV-2 and SARS-related coronaviruses constructed from the Spike amino acid sequence using PhyML LG substitution model and 1000 bootstraps. The species from which the virus was isolated is depicted by an image of the animal to the right of the virus name. \u003cstrong\u003eC.\u003c/strong\u003e Amino acid alignment of sweep region in SARS-CoV-2 RBD with corresponding region in closely related sarbecoviruses from bats and pangolins and SARS-CoV. The alignment was performed using MAFFT.\u003c/p\u003e","description":"","filename":"Onlinefloatimage1.jpg","url":"https://assets-eu.researchsquare.com/files/rs-3835105/v1/ef3473702e3ff39c94c91be0.jpg"},{"id":49322290,"identity":"faa5e56e-16ab-417c-867c-34bacba9a131","added_by":"auto","created_at":"2024-01-08 16:50:53","extension":"jpg","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":1670351,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eDecreased replicative fitness and infectivity of SARS-CoV-2 Spike H519N in human lung and kidney cells. A.\u003c/strong\u003e Western blot analysis of Spike protein from prepared pseudovirus stocks. An antibody was used to detect the S1 subunit of Spike. The higher band at 180 kilodaltons (kDa) corresponds to full Spike while the band near 100 kDa corresponds to furin-cleaved Spike. Units are represented in kDa. \u003cstrong\u003eB. \u003c/strong\u003eRepresentative images of Zsgreen expression in Spike pseudotyped virus infected HEK-293T-hACE2 cells observed by live-cell fluorescence microscopy. Cells infected with pseudoviruses expressing ZsGreen and luciferase pseudotyped with full Wuhan-Hu-1 Spike or mutants Spike D614G, A372T, or H519N. Images were taken 48h post-infection and are representative of one of two technical replicates from three independent experiments \u003cstrong\u003eC\u003c/strong\u003e. Infectivity was quantified using a luciferase assay 48h post-infection. Experiments were performed in duplicate for three biological experiments. Statistical analyses were performed using a One-Way ANOVA with Dunnett’s correction for multiple comparisons. \u003cstrong\u003eD. \u003c/strong\u003eGrowth curve of Spike H519N in human lung epithelial cells. Calu-3 cells were infected at a MOI of 0.1 with the WT Wuhan-Hu-1 strain of SARS-CoV-2 and mutants bearing either Spike H519N or D614G. Supernatant was collected each day post-infection and titered on Vero E6-TMPRSS2-T2A-hACE2 cells by plaque assay. Infections were performed in triplicate in two independent biological experiments. A two-way ANOVA with Dunnett’s correction for multiple comparisons was performed on these data. Bars represent standard deviation.\u003c/p\u003e","description":"","filename":"Onlinefloatimage2.jpg","url":"https://assets-eu.researchsquare.com/files/rs-3835105/v1/e86be8dfaa056171f8078ff0.jpg"},{"id":49322674,"identity":"8e24ad9f-64bb-422c-a6fd-5dbbaa7bcdb2","added_by":"auto","created_at":"2024-01-08 16:58:53","extension":"jpg","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":1829401,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eStructural and biochemical analyses of Spike H519N reveal decreased affinity to human ACE2. A. \u003c/strong\u003eStructure of Spike bound to hACE2 (PDB: 7KNB). 519 location is represented in pink spheres\u003cstrong\u003e. B. \u003c/strong\u003eVisualization of H519 and\u003cstrong\u003e C. \u003c/strong\u003eN519 structural location.\u003cstrong\u003e \u003c/strong\u003e519 is highlighted in red, with residues within the same chain in gray and a neighboring chain in teal. \u003cstrong\u003eD. \u003c/strong\u003eSurface mapping of Spike H519 and\u003cstrong\u003e E. \u003c/strong\u003eN519 colored by residue sidechain properties. Colors represent gray for neutral, teal for polar uncharged, blue for positive charge, red for negative charge, and green for hydrophobic.\u003cstrong\u003e F. \u003c/strong\u003eELISA saturation curves. Human ACE2-mFc was coated at 2 ug/mL, and various concentrations of Spike RBD were detected with an antibody. Absorbance values were measured at OD\u003csub\u003e450\u003c/sub\u003enm, and the values normalized to the blank are depicted. Curves intersect the mean of four binding events. \u003cstrong\u003eG. \u003c/strong\u003eEC\u003csub\u003e50 \u003c/sub\u003econcentrations from ELISA curves extrapolated with a nonlinear regression with a least squares fit. A One-Way ANOVA with Dunnett’s Correction was used to test for statistical significance.\u003c/p\u003e","description":"","filename":"Onlinefloatimage3.jpg","url":"https://assets-eu.researchsquare.com/files/rs-3835105/v1/4f958b8d8e822e009f99bccc.jpg"},{"id":49323157,"identity":"43e08c46-cdce-4a1d-97fe-fc34614baa69","added_by":"auto","created_at":"2024-01-08 17:06:53","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":1058540,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-3835105/v1/df3dccec-b8ff-4159-9b89-7acd66da113b.pdf"},{"id":49322287,"identity":"88d02610-4d62-4ced-a3aa-4930cefdaf40","added_by":"auto","created_at":"2024-01-08 16:50:53","extension":"docx","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":345257,"visible":true,"origin":"","legend":"","description":"","filename":"SupplFiledocx.docx","url":"https://assets-eu.researchsquare.com/files/rs-3835105/v1/71b413a7fd9bf57f4db10555.docx"}],"financialInterests":"No competing interests reported.","formattedTitle":"Evolution At Spike Position 519 in SARS-CoV-2 Facilitated Adaptation to Humans","fulltext":[{"header":"Introduction","content":"\u003cp\u003eSevere acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the causative agent of coronavirus disease 2019 (COVID-19), which has claimed just under seven million lives and caused 770\u0026nbsp;million cases of the disease since its emergence in 2019\u003csup\u003e1\u003c/sup\u003e. SARS-CoV-2 is a positive sense, single-stranded RNA virus of the family \u003cem\u003eCoronaviridae\u003c/em\u003e, genus \u003cem\u003eBetacoronavirus\u003c/em\u003e, and subgenus \u003cem\u003eSarbecovirus\u003c/em\u003e\u003csup\u003e\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e\u003c/sup\u003e. The first human cases were detected in the city of Wuhan in the Hubei province of China\u003csup\u003e\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e,\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e\u003c/sup\u003e and were associated with the Huanan wholesale seafood market\u003csup\u003e\u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e,\u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e\u003c/sup\u003e, where the virus may have spilled over from a live animal source\u003csup\u003e\u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e,\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e\u003c/sup\u003e.\u003c/p\u003e \u003cp\u003eWhile not definitive, there is evidence of a bat origin of the progenitor to SARS-CoV-2\u003csup\u003e9\u003c/sup\u003e. Sarbecoviruses isolated from bats have high nucleotide sequence similarity to SARS-CoV-2 on a genome level and within the receptor-binding domain (RBD) of the Spike protein\u003csup\u003e\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e,\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e\u003c/sup\u003e. BANAL-20-52, isolated from a bat in Laos has the highest nucleotide similarity with SARS-CoV-2 to date at 96.8%\u003csup\u003e10\u003c/sup\u003e. Hypotheses posited the progenitor to SARS-CoV-2 evolved in bats and was pre-adapted to humans since positive selection was detected on deep branches of the nCoV clade and not terminal branches leading to SARS-CoV-2 on a phylogenetic tree\u003csup\u003e\u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e,\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e\u003c/sup\u003e. However, the precise molecular mechanisms of adaptation of the progenitor to SARS-CoV-2 to humans have not been well characterized despite how these data would aid in the development of therapeutics and surveillance for pandemic prevention. Our previous study determined evolution towards an alanine at position 372 in the SARS-CoV-2 Spike protein was important for human adaptation by first identifying selective sweeps in the SARS-CoV-2 genome \u003csup\u003e\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e\u003c/sup\u003e. Selective sweeps occur when a favorable mutation is quickly fixed in the population\u003csup\u003e\u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e12\u003c/span\u003e\u003c/sup\u003e. Neighboring \u0026ldquo;hitch-hiker\u0026rdquo; alleles also increase in frequency as a result of the driver mutation of the selective sweep on the recombinant genomic region\u003csup\u003e\u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e13\u003c/span\u003e,\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e\u003c/sup\u003e Thus, parsing out truly adaptive mutations requires experimental validation and is critical to determining mechanisms of emergence.\u003c/p\u003e \u003cp\u003eMany studies investigating the emergence of SARS-CoV-2 have strictly involved computational analyses or the generation of pseudotyped viruses expressing Spike. These methods are useful but do not validate predictions or provide the most accurate quantitation of fitness of SARS-CoV-2 compared to examining phenotypes of replication-competent virus. Here, we gain new insights into the evolutionary pressures and molecular mechanisms behind the emergence of SARS-CoV-2 in humans by generating both pseudotyped and replication-competent virus with a putative ancestral residue consistently mapping to a selective sweep region, as based on the analysis of approximately two million SARS-CoV-2 genomes. The mutation (Spike H519N) significantly decreased SARS-CoV-2 replication in human lung epithelial cells and reduced infectivity in pseudotyped virus assays and binding to human ACE2 in a biochemical assay, consistent with structural predictions. This study provides evidence that the evolution at Spike site 519 was important for the progenitor of SARS-CoV-2\u0026rsquo;s adaptation to the human host. The Spike site also remains highly conserved among SARS-CoV-2 sequences and offers a suitable candidate for targeting with small molecule inhibitors.\u003c/p\u003e"},{"header":"Results","content":"\u003cp\u003e \u003cb\u003eSARS-CoV-2 sequences hold evidence of a past, strong selective event in the receptor binding domain of Spike\u003c/b\u003e \u003c/p\u003e \u003cp\u003eTo investigate which regions in SARS-CoV-2 have contributed to its efficient infection and circulation in humans, we analyzed sequences for signs of strong positive selection events known as selective sweeps. We acquired 1,912,191 human-isolated SARS-CoV-2 sequences from GISAID, covering the period up to August 2021. We then used a computational pipeline combining OmegaPlus\u003csup\u003e\u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e\u003c/sup\u003e and RAiSD\u003csup\u003e\u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e\u003c/sup\u003e to identify regions with a high probability of selective sweep events. To discern the selective sweep regions instrumental in driving mutations during the early stages of human infection, we categorized the sequences by their sample collection dates, arranging them into monthly cohorts (Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003eA). Our analysis identified an average of 11 sweep regions per month, with the numbers ranging from 4 to 16 (Supplementary file 1). While many sweep regions were identified, our attention was primarily drawn to the sweep regions in the receptor-binding domain (RBD) of the Spike protein (nucleotide position 22,878 to 23,332; amino acid positions 439\u0026ndash;590), as mutations in the RBD are known modulators of host tropism and receptor binding\u003csup\u003e\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e\u003c/sup\u003e. The selective sweep regions included the A372 residue in the RBD, a critical factor we previously identified for the emergence of SARS-CoV-2 and its sustained transmission among humans\u003csup\u003e\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e\u003c/sup\u003e. These regions were evident in the early months of the COVID-19 outbreak (Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003eA).\u003c/p\u003e \u003cp\u003eSince selective sweep regions indicate a past selective event driven by various adaptive mutations and their associated hitchhiker mutations, we sought to determine the site within the Spike selective sweep region where pivotal mutations occurred upon early infection of humans\u003csup\u003e\u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e\u003c/sup\u003e. While the progenitor virus to SARS-CoV-2 is unknown, we inferred the progenitor Spike sequence could assume several amino acid identities found at the corresponding site in Spike from closely related sarbecoviruses infecting bats and pangolins. To identify the closest coronaviruses related to SARS-CoV-2, we constructed a phylogenetic tree based on the amino acid sequence of Spike with PhyML (Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003eB)\u003csup\u003e\u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e19\u003c/span\u003e,\u003cspan citationid=\"CR20\" class=\"CitationRef\"\u003e20\u003c/span\u003e\u003c/sup\u003e. Using these phylogenetic relationships, we aligned the amino acid sequence of the selective sweep region in Spike from the Wuhan-Hu-1, lineage B strain of SARS-CoV-2 with the homologous region in closely related sarbecoviruses to reveal sites with differential amino acid identities (Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003eC; see also \u003cb\u003eExtended Data\u003c/b\u003e Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003e for the full amino acid alignment of the sweep region in Spike). Similar to amino acid position 372 in Spike, position 519 holds a non-synonymous mutation (nucleotide substitution C23117A) that differentiates SARS-CoV-2 from the aligned bat- and pangolin-derived \u003cem\u003eSarbecovirus\u003c/em\u003e sequences\u003csup\u003e\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e\u003c/sup\u003e. This position was identified in the selective sweep regions during one of the early months of the outbreak (January 2021; Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003eA). SARS-CoV-2 has a histidine at 519, while the bat and pangolin-derived sequences bear an asparagine or lysine. We hypothesized the progenitor to SARS-CoV-2, potentially a virus infecting bats or pangolins, acquired a histidine at some point in the evolutionary timeline, thereby gaining an adaptive advantage in humans.\u003c/p\u003e \u003cp\u003eSince sites with low amino acid diversity may implicate a crucial role of the predominant amino acid in viral fitness, we sought to investigate the amino acid diversity at position 519 in Spike in SARS-CoV-2. Using data from 3,848 genomes sampled between December 2019 and October 2023 from the nCoV GISAID dataset displayed on NextStrain, we determined Spike position 519 in SARS-CoV-2 has a normalized Shannon entropy of 0, suggesting little to no flexibility is allowable at this position in humans (\u003cb\u003eExtended Data\u003c/b\u003e Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003e)\u003csup\u003e\u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e21\u003c/span\u003e,\u003cspan citationid=\"CR22\" class=\"CitationRef\"\u003e22\u003c/span\u003e\u003c/sup\u003e. With this result, we hypothesized the predominant histidine at Spike 519 in SARS-CoV-2 may hold an important function for the virus in humans, while the ancestral residue, asparagine, would result in deleterious effects on fitness in human cells.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003e \u003cb\u003eSARS-CoV-2 Spike mutant bearing a residue of bat and pangolin\u003c/b\u003e \u003cb\u003eSarbecovirus\u003c/b\u003e \u003cb\u003eorigin has reduced replicative fitness and infectivity in human cells\u003c/b\u003e\u003c/p\u003e \u003cp\u003eTowards identifying the driving mutations of the putative selective event, we reasoned evolution from the asparagine to histidine at position 519 may have driven early adaptation of the progenitor virus to humans. Since position 519 is in the RBD of Spike, we first used a previously described pseudovirus system\u003csup\u003e\u003cspan citationid=\"CR23\" class=\"CitationRef\"\u003e23\u003c/span\u003e\u003c/sup\u003e to determine whether Spike H519N had any functional significance during infection of cells expressing human ACE2 (hACE2), the receptor for SARS-CoV-2\u003csup\u003e24\u003c/sup\u003e. First, we verified hACE2 protein levels by western blot expressed in human embryonic kidney cells (\u003cb\u003eExtended Data\u003c/b\u003e Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003e). We generated lentiviruses pseudotyped with full wild-type (WT) Spike, Spike H519N, Spike D614G, and Spike A372T, a RBD mutant bearing an ancestral threonine which we previously showed attenuates the virus in human lung epithelial cells\u003csup\u003e\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e\u003c/sup\u003e. Spike D614G is used as a known human-adaptive variant that emerged early in the pandemic and is now fixed in all SARS-CoV-2 variants\u003csup\u003e\u003cspan citationid=\"CR25\" class=\"CitationRef\"\u003e25\u003c/span\u003e\u003c/sup\u003e. The pseudoviruses express both luciferase and a green fluorescence protein (GFP), ZsGreen\u003csup\u003e\u003cspan citationid=\"CR23\" class=\"CitationRef\"\u003e23\u003c/span\u003e\u003c/sup\u003e, allowing for sensitive detection of infection efficiency. We detected Spike protein levels from prepared virus stocks and observed a greater incorporation of G614 into the pseudoviruses as previously observed (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eA)\u003csup\u003e\u003cspan citationid=\"CR26\" class=\"CitationRef\"\u003e26\u003c/span\u003e\u003c/sup\u003e. Ectopic expression of Spike was similar between WT Spike and Spike H519N. Next, we used pseudovirus particles to infect human embryonic kidney cells expressing hACE2 to determine whether infectivity through hACE2 is altered by Spike H519N. Spike D614G enhanced the infectivity of the pseudotyped viruses compared to WT Spike, in agreement with data extensively reported in literature (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eB)\u003csup\u003e\u003cspan citationid=\"CR26\" class=\"CitationRef\"\u003e26\u003c/span\u003e\u003c/sup\u003e. Spike A372T significantly reduced the infectivity of SARS-CoV-2, consistent with our previous study (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eB)\u003csup\u003e\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e\u003c/sup\u003e. Importantly, Spike H519N significantly reduced the infectivity of SARS-CoV-2 compared to Spike D614G and WT Spike (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eB). When infected cells were quantified based on luciferase expression, we also observed significant decreases in infectivity for Spike H519N compared to WT Spike (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eC; \u003cb\u003ep\u0026thinsp;\u0026lt;\u0026thinsp;0.0001\u003c/b\u003e).\u003c/p\u003e \u003cp\u003eTowards determining whether Spike N519 reduces the replicative fitness of SARS-CoV-2 in human cells, we generated a replication-competent, SARS-CoV-2 mutant bearing an asparagine at position 519 in Spike, the amino acid present in closely related sarbecoviruses. In human lung epithelial cells, the putatively ancestral SARS-CoV-2 Spike H519N mutant replicated to significantly lower titers than the WT virus and Spike D614G, demonstrating a nearly 2-log difference in viral titers (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003eD; \u003cb\u003ep\u0026thinsp;\u0026lt;\u0026thinsp;0.01 at all timepoints\u003c/b\u003e). These data suggest that reverting the histidine at Spike 519 to the ancestral asparagine significantly reduces infection and replication in human lung epithelial cells.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003e \u003cb\u003eSpike H519N potentially impacts ability to sample up/down conformations and reduces binding affinity to human ACE2\u003c/b\u003e \u003c/p\u003e \u003cp\u003eIn elucidating the mechanism of attenuation of SARS-CoV-2 bearing the putatively ancestral asparagine at Spike position 519, we analyzed the interactions between Spike chains to identify how H519N is impacting infection (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eA). Residue 519 is in the RBD but not within the receptor binding motif (RBM) to interact with ACE2 directly. Rather, residue 519 is positioned on a cleft at the interface between Spike chains that constitute the full trimer complex. Indeed, residue 519 is within 4\u0026ndash;5 \u0026Aring; of residues of adjacent chains of the Spike trimer, participates in polar interchain interactions, and conformational up/down movement can be impacted at this interface. Here, we sought to utilize this interaction interface to compute predicted free energy of binding calculations (MM/GBSA) analyzing up/down conformation favorability by probing interchain binding free energy in Spike. Results demonstrate that the H519 up conformation has similar interchain binding free energy to the N519 up conformation when unprotonated (-284.1 kcal/mol and \u0026minus;\u0026thinsp;264.1 kcal/mol, respectively). The down conformation exhibits larger differences between the H519 and N519 structures, with H519 interchain predicted free energy binding energy of -259.2 kcal/mol and N519 of -365.9 kcal/mol. Interestingly, when analyzing protonated H519, we observe predicted free energy of binding for the up conformation H519 of -359.4 kcal/mol and \u0026minus;\u0026thinsp;280.9 kcal/mol for the down confirmation of H519. This suggests that in a lower physiological pH, the H519 Spike samples the up conformation more favorably compared to the N519 which alternatively energetically favors the down conformation based on interchain interactions. Structural inspection by analyzing the neighboring residues reveals that 519 is surrounded by a pocket of polar and charged residues from a neighboring Spike chain (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eB-C). Surface mapping of residue properties highlights that H519 exhibits a neutral surface area (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eD) while N519 results in a more polar surface area (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eE). These results reveal that H519 may result in a lower energy barrier to overcome when moving between down/up positioning to bind ACE2, potentially allowing for increased infection efficiency.\u003c/p\u003e \u003cp\u003eWith the insight that H519N appears to impact up/down conformational changes in protonated SARS-CoV-2 Spike and the potential ability to bind ACE2 by sampling the up position more favorably, we further sought to determine whether Spike H519 would experimentally bind with higher affinity to hACE2. Enzyme-linked immunosorbent assay (ELISA) was performed with hACE2 and several concentrations of RBD from either WT Spike, Spike N501Y, Spike A372T, or Spike H519N. Spike N501Y was used as a positive control with known increased binding to hACE2\u003csup\u003e27\u003c/sup\u003e. Consistent with reduced binding efficiency, the absorbance curve for Spike H519N and Spike A372T RBD are shifted right from WT RBD (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eF). We observed significantly lower EC\u003csub\u003e50\u003c/sub\u003e values for Spike N501Y RBD and significantly higher EC\u003csub\u003e50\u003c/sub\u003e values for Spike H519N and Spike A372T compared to WT RBD (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003eG). This result suggests more molecules of Spike H519N RBD would be required to saturate hACE2 binding sites; therefore, this mutation may reduce the affinity to hACE2 leading to reductions in replication.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e"},{"header":"Discussion","content":"\u003cp\u003eDuring zoonotic spillover events, selective pressures exist for pathogens to adapt to the cellular components and immune responses of new host species in the pursuit of optimal fitness. RNA viruses like SARS-CoV-2 are skilled at navigating the fitness landscape in new hosts, taking advantage of both recombination and the error-prone RNA-dependent RNA polymerase to source new genotypes that can lead to fitness advantages\u003csup\u003e\u003cspan citationid=\"CR28\" class=\"CitationRef\"\u003e28\u003c/span\u003e\u003c/sup\u003e. Our study provides evidence of historical selection acting on SARS-CoV-2 via selective sweeps and assesses the contribution of an amino acid site within a selective sweep region in the RBD of Spike that likely facilitated adaptation to humans. We provide evidence that the putatively ancestral asparagine at Spike 519 in closely related sarbecoviruses reduces the fitness of SARS-CoV-2 in cells expressing hACE2 due to reduced affinity of Spike RBD N519 to hACE2 and higher binding free energy in the up conformation.\u003c/p\u003e \u003cp\u003eSeveral studies have gathered strong evidence for a natural origin of SARS-CoV-2, but there is disagreement on how SARS-CoV-2 might have evolved to sustain productive infection of and transmission between humans. Some studies have suggested the progenitor to SARS-CoV-2 evolved in bats and required little to no adaptation to humans before spillover and emergence\u003csup\u003e\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e\u003c/sup\u003e. While we do not refute evolution of the progenitor to SARS-CoV-2 occurred in bats, our study provides evidence of a mutation that enhanced human infection but does not predict in which host this mutation arose. It is possible that genetic drift led to mutations in the reservoir host which enabled human adaptation prior to human exposure\u003csup\u003e\u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e\u003c/sup\u003e. This study is the second of ours that supports the theory that the progenitor to the Wuhan-Hu-1 strain of SARS-CoV-2 was subject to a strong selective pressure resulting in adaptation to humans at a site in the RBD of Spike\u003csup\u003e\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e\u003c/sup\u003e.\u003c/p\u003e \u003cp\u003eZoonotic spillover events are often associated with the selection of mutations in viral receptor binding proteins that afford the use of orthologous receptors in the human host\u003csup\u003e\u003cspan citationid=\"CR29\" class=\"CitationRef\"\u003e29\u003c/span\u003e\u003c/sup\u003e. For coronaviruses in particular, mutations in the Spike protein, which binds the host ACE2 receptor, are sufficient to expand the host range of the virus even though a variety of other factors are important for tropism\u003csup\u003e\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e\u003c/sup\u003e. Another zoonotic coronavirus, severe acute respiratory syndrome coronavirus (SARS-CoV), which caused an epidemic from 2002\u0026ndash;2003, emerged likely due to a set of mutations both in the receptor binding domain (RBD) and other parts of Spike and underwent adaptive evolution from its reconstructed ancestral sequence\u003csup\u003e\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e,\u003cspan citationid=\"CR30\" class=\"CitationRef\"\u003e30\u003c/span\u003e\u003c/sup\u003e. Our results are in agreement with other studies that mutations outside of the RBM of the coronavirus Spike protein, including both subunits of Spike, can affect receptor binding and are also associated with host-range expansion\u003csup\u003e\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e,\u003cspan citationid=\"CR31\" class=\"CitationRef\"\u003e31\u003c/span\u003e,\u003cspan citationid=\"CR32\" class=\"CitationRef\"\u003e32\u003c/span\u003e\u003c/sup\u003e.\u003c/p\u003e \u003cp\u003eAmino acid position 519 in Spike is in the RBD but not within the receptor binding motif (RBM). The putatively ancestral asparagine has more favorable co-association of Spike in the down conformation, which impacts the potential to sample an up conformation and bind with hACE2. This can be observed with the reduced binding affinity of Spike H519N RBD to hACE2 via ELISA and interchain free energy calculations highlighting the H519 up conformation favorability in an acidic environment. While these results provide insight into potential structural mechanisms related to the fitness of H519, more robust molecular dynamics (MD) simulations could reveal more major morphological changes related to 519 fitness in up/down conformations. Our results are not surprising since mutations outside of the RBM have been shown to increase binding affinity to hACE2 during a deep mutational scan of the Spike RBD\u003csup\u003e\u003cspan citationid=\"CR33\" class=\"CitationRef\"\u003e33\u003c/span\u003e\u003c/sup\u003e. Furthermore, mutations at 372 and 614, which lie outside of the RBD, affect ACE2 binding\u003csup\u003e\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e,\u003cspan citationid=\"CR34\" class=\"CitationRef\"\u003e34\u003c/span\u003e\u003c/sup\u003e. Our results are biologically relevant given that the human nasal cavity pH is 6.6 which might provide a more optimal environment for the protonated SARS-CoV-2 Spike H519 to associate more with hACE2 than Spike 519N in this environment\u003csup\u003e\u003cspan citationid=\"CR35\" class=\"CitationRef\"\u003e35\u003c/span\u003e\u003c/sup\u003e. Taken in an evolutionary context, it is possible that Spike N519H arose in the progenitor virus and that this mutation increased association with hACE2 when the virus initially encountered humans, leading to enhanced infection and transmissibility.\u003c/p\u003e \u003cp\u003eWe used NextStrain to examine the diversity within Spike and identified low normalized Shannon entropy approaching 0 at position 519 in Spike of SARS-CoV-2 sequences from 2019\u0026ndash;2023 from GISAID. Since we have shown the histidine at position 519 in Spike is important for replication in human lung cells and that it is highly conserved regardless of lineage and variant of concern, position 519 might constitute a suitable drug target. Indeed, structural inspection of the surface area maps and structural residue region of 519 indicates that it is within a solvent accessible pocket, surrounded by polar and charged residues from neighboring chains. Suitable pockets for small molecule inhibitors may also correlate with regions of low diversity in the genome, however further investigation is needed. Efforts to design small molecule inhibitors of SARS-CoV-2 have resulted in targeting genes such as the main protease of SARS-CoV-2\u003csup\u003e36\u003c/sup\u003e. Recent surveillance studies identified multiple mutations in SARS-CoV-2 that confer resistance to nirmatrelvir\u003csup\u003e\u003cspan citationid=\"CR37\" class=\"CitationRef\"\u003e37\u003c/span\u003e\u003c/sup\u003e. Another protease inhibitor ensitrelvir is being used for emergency use authorization in Japan, but mutations conferring resistance to this inhibitor have been identified as well\u003csup\u003e\u003cspan citationid=\"CR37\" class=\"CitationRef\"\u003e37\u003c/span\u003e\u003c/sup\u003e. This highlights the importance of finding new components of the virus to target, and the RBD at position 519 in Spike might be suitable since other amino acids at this position have not emerged to any appreciable extent.\u003c/p\u003e \u003cp\u003eTo conclude, a mutation altering the amino acid identity at position 519 in the Spike protein of SARS-CoV-2 is likely to have partly mediated adaptation of the progenitor virus to the human host. We demonstrated the conserved histidine at position 519 in Spike affords fitness advantages through increased replication in human cells via increased hACE2 binding affinity and favorability of Spike in an up conformation. Position 519 in Spike holds promise as a site to which small molecular inhibitors could be designed amidst the concern of the development of drug resistance.\u003c/p\u003e"},{"header":"Methods","content":"\u003cdiv id=\"Sec5\" class=\"Section2\"\u003e \u003ch2\u003eSelective sweep analysis of SARS-CoV-2 genomes\u003c/h2\u003e \u003cp\u003eA total of 1,914,191 human-derived, complete SARS-CoV-2 genomes up to Aug. 2021 were downloaded from the GISAID EpiCov database (\u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://www.gisaid.org/\u003c/span\u003e\u003cspan address=\"https://www.gisaid.org/\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e). Sequences that had low coverage or displayed over 5% nucleotide ambiguities (designated as 'N') were excluded from the analysis to ensure analytical rigor. For further analysis, sequences were categorized monthly based on their sample collection dates. Selective sweep identification was conducted within these monthly cohorts using previously established data curation and methodologies\u003csup\u003e\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e\u003c/sup\u003e.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec6\" class=\"Section2\"\u003e \u003ch2\u003eConstruction of phylogenetic tree\u003c/h2\u003e \u003cp\u003eThe following coronavirus sequences were downloaded from GenBank: Bat coronavirus isolate BANAL-20-52/Laos/2020 MZ937000.1, Bat coronavirus isolate BANAL-20-103/Laos/2020, complete genome MZ937001.1, Bat coronavirus isolate BANAL-20-116/Laos/2020, complete genome MZ937002.1, Bat coronavirus isolate BANAL-20-236/Laos/2020, complete genome MZ937003.2, Bat coronavirus isolate BANAL-20-247/Laos/2020, complete genome MZ937004.1, Bat coronavirus RacCS203, complete genome MW251308.1, Bat coronavirus RaTG13, complete genome MN996532, Bat coronavirus strain BetaCoV/Rm/Yunnan/YN02/2019 spike protein (S) gene, partial cds MW201982.1, Bat sarbecovirus sp. isolate PrC31, complete genome MW703458.1, Bat SARS-like coronavirus isolate Rs4231 KY417146.1, Bat Severe acute respiratory syndrome-related coronavirus Rc-o319 RNA, complete genome LC556375.1, Betacoronavirus sp. RpYN06 strain bat/Yunnan/RpYN06/2020, complete genome MZ081381.1, Coronavirus BtRs-BetaCoV/YN2018B MK211376.1, MAG Pangolin coronavirus isolate GD P79-9 2019, complete genomeOQ297708.1, Pangolin coronavirus isolate MP789, complete genome MT121216.1, Pangolin coronavirus isolate PCoV_GX-P4L, complete genome MT040333.1, Pangolin coronavirus isolate PCoV_GX-P5L MT040335.1, SARS coronavirus Tor2 AY274119.3, Severe_acute_respiratory_syndrome_coronavirus_2_isolate_Wuhan-Hu-1_NC_045512.2. Geneious (version 2023.1 created by Biomatters. Available from \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://www.geneious.com\u003c/span\u003e\u003cspan address=\"https://www.geneious.com\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e) was used to construct a phylogenetic tree based on the amino acid sequence of Spike using PhyML 3.3.20180214 with a LG substitution model and 1000 bootstraps\u003csup\u003e\u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e19\u003c/span\u003e\u003c/sup\u003e. An amino acid alignment of the selective sweep region in Spike identified in January 2021 was performed using Geneious with the same viruses as above to identify non-synonymous mutations compared to the Wuhan-Hu-1 strain of SARS-CoV-2.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec7\" class=\"Section2\"\u003e \u003ch2\u003eDiversity analysis in Spike\u003c/h2\u003e \u003cp\u003eNextStrain was accessed on November 1, 2023 displaying all 3,848 nCov genomes sampled between December 2019 and October 2023 from the GISAID nCoV dataset. Amino acid diversity data was obtained by downloading the full Spike diversity dataset from the site. NextStrain measures normalized Shannon entropy by determining the observed counts of a residue at a given amino acid position normalized by the number of tips in the phylogenetic tree on NextStrain and summing the individual entropies for an amino acid identity at the position\u003csup\u003e\u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e21\u003c/span\u003e\u003c/sup\u003e.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec8\" class=\"Section2\"\u003e \u003ch2\u003eCell lines and plasmids\u003c/h2\u003e \u003cp\u003eHuman embryonic kidney cells expressing human ACE2 (HEK-293T-hACE2; NR-52511) and African green monkey kidney epithelial cells expressing TMPRSS2 and human ACE2 (Vero E6-TMPRSS2-T2A-hACE2; NR-54970) were obtained from BEI Resources. Human lung epithelial cells (Calu-3 HTB-55) and human embryonic kidney cells (HEK-293T) were acquired from ATCC. Cells were grown in a humidified atmosphere with carbon dioxide supplied at 5% at 37\u0026deg;C. HEK-293T and HEK-293T-hACE2 cells were grown in Dulbecco\u0026rsquo;s Modified Eagle Medium (DMEM; Corning\u0026trade; 10013CV) supplemented with 10% Fetal Bovine Serum, 100 U/mL penicillin, 100 U/mL streptomycin, 2mM L-glutamine. Vero E6-TMPRSS2-T2A-hACE2 and Calu-3 cells were grown in Dulbecco\u0026rsquo;s Modified Eagle Medium supplemented with gentamicin sulfate (0.1%), non-essential amino acids (1X), HEPES (25mM), and either 5% (Vero E6-TMPRSS2-T2A-hACE2) or 20% (Calu-3) fetal bovine serum. Vero E6-TMPRSS2-T2A-hACE2 also required the addition of 0.01 mg/mL puromycin. SARS-Related Coronavirus 2, Wuhan-Hu-1 Spike-Pseudotyped Lentiviral Kit V2 was obtained from BEI Resources (NR-53816).\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec9\" class=\"Section2\"\u003e \u003ch2\u003eGeneration of Spike pseudotyped virus mutants\u003c/h2\u003e \u003cp\u003eSpike mutations A372T and H519N were introduced to the human codon-optimized Spike gene on vector pHDM SARS-Related Coronavirus 2 Wuhan-Hu-1 Spike Glycoprotein (BEI; NR-53742). We used the BioLabs Q5 Site-Directed Mutagenesis Kit (NEB #E0554) as described from the manufacturer\u0026rsquo;s protocol with mutagenesis primers Forward (CGAATTGCTCAACGCTCCAGC) and Reverse (AAACTCAAGACCACAACTCTG) for mutant H519N, and Forward (GTATAATAGTACAAGCTTTAGCACATTC) and Reverse (AATACTGAGTAGTCCGCC) for mutant A372T. Mutations were confirmed by Sanger sequencing. Vector HDM SARS2-Spike-del121 D614G was purchased from Addgene (cat #158762). Pseudotyped lentiviral particles were generated by transfecting HEK-293T cells using PolyJet In Vitro DNA Transfection reagent (SignaGen cat #SL100688) following the manufacturer\u0026rsquo;s recommendations for the six-well plate, with the adjusted amount of 3\u0026micro;g for each of the following BEI plasmids: pRC-CMV-Rev1b (NR-52519), HDM-tat1b (NR-52518), HDM-Hgpm2 (NR-52518), Luciferase-IRES-ZsGreen (NR52516), and 3\u0026micro;g of the pHDM vector encoding the Wuhan-Hu-1 Spike Glycoprotein (WT), or D614G, A372T, or H519N spike mutations, respectively. We monitored the transfection efficiency by detecting ZsGreen signal via live cell fluorescence microscopy using a Leica DMI4000B inverted microscope. Supernatants with lentiviral particles were harvested 48- and 72-hours post-transfection. Pseudoviruses were precipitated overnight at 4\u003csup\u003eo\u003c/sup\u003eC with 40% PEG-8000/1.2 M NaCl pH 7.2 at a 1 to 3 ratio, centrifuged at 1200rpm for 1 hour at 4\u003csup\u003eo\u003c/sup\u003eC, and obtained pellet resuspended in 1/10 of the original collection volume.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec10\" class=\"Section2\"\u003e \u003ch2\u003eWestern blot analysis of pseudotyped virus Spike incorporation\u003c/h2\u003e \u003cp\u003eWe verified the presence of SARS-CoV-2 Spike protein on WT and mutated pseudotyped lentiviral particles by Western blot analysis. Viral protein lysates were denatured at 95\u003csup\u003eo\u003c/sup\u003eC, resolved on a 10% SDS-PAGE gel, and transferred to Biorad Immun-Blot PVDF membrane (cat #1620177). After 1 hour Blocking in 5% milk in TBS-T buffer (50 mM Tris [pH 7.5], 150 mM NaCl containing 0.1% Tween 20), the membrane was incubated overnight with Invitrogen Anti-SARS-CoV2-Spike protein S1 primary antibody (cat #PA5-114528). After three washes with TBS-T buffer, blot was probed for 1 hour with Peroxidase-conjugated AffiniPure Goat Anti-Rabbit IgG (H\u0026thinsp;+\u0026thinsp;L) secondary antibody (Jackson ImmunoResearch Laboratories cat #115-035-003) in 5% milk/TBS-T buffer. The membrane was washed three times with TBS-T, and the signal was detected with Biorad Clarity Western ECL substrate cat (#170\u0026ndash;5061) and imaged using Biorad ChemiDoc MP Imaging System.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec11\" class=\"Section2\"\u003e \u003ch2\u003eQuantification of SARS-CoV-2 plasmid pseudogenome via real-time PCR\u003c/h2\u003e \u003cp\u003eFor the quantification of pseudovirus particles, we extracted the DNA from lentiviral stocks using Qiagen DNeasy Blood \u0026amp; Tissue Kit (cat #69504) according to the manufacturer's protocol. We quantified a ZsGreen amplicon by qPCR via Applied Biosystems PowerSYBR Green PCR Master Mix (cat #4376659) using Forward (GACCATGAAGTACCGCATG) and Reverse (CCGCCCTCCACCACGCAC) ZsGreen-specific primers. The DNA Standard curve of 10 ng/\u0026micro;L, 1 ng/\u0026micro;L, 0.1 ng/\u0026micro;L and 0.01 ng/\u0026micro;L serial dilutions of pHAGE-CMV-Luc2-ZsGreen plasmid served as our reference. The relative amounts of pseudoviral genomes of all mutants were adjusted to WT lentivirus before infections.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec12\" class=\"Section2\"\u003e \u003ch2\u003eLuciferase Assay\u003c/h2\u003e \u003cp\u003eThe luciferase activity of HEK-293T-hACE2 cells infected with pseudovirus was measured using the Promega Luciferase Assay System according to the manufacturer's protocol (cat #E1500). Cells were washed 2 times with PBS and lysed in 1X Lysis Buffer per well in a 12-well plate. 100 \u0026micro;L of the 1X Luciferase Assay per 20 \u0026micro;L of lysate was used for detection of luciferase activity in each well of a 96-well black microplate. The efficiency of infection was determined by measuring the intensity of luminescence calculated as relative luciferase units using the Agilent BioTek Synergy HTX Multi-Mode Microplate Reader.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec13\" class=\"Section2\"\u003e \u003ch2\u003eGeneration of SARS-CoV-2 mutants\u003c/h2\u003e \u003cp\u003eLive SARS-CoV-2 mutants were generated from a previously constructed full-length infectious clone of the Wuhan-Hu-1 strain of SARS-CoV-2 (GenBank accession no. NC_045512.2). Mutagenic primers were used to introduce mutations into the backbone by PCR with Invitrogen Platinum SuperFi II PCR master mix (12368010) or Quantabio repliQa HiFi ToughMix (95200-025). Fragments were purified from agarose gels using the Machery-Nagel nucleospin gel and pcr clean-up kit (740609.250). Purified fragments were mixed at an equimolar ratio and assembled and amplified by replication cycle reaction using the OriCiro Genomics Cell-Free Cloning System. Virus was recovered by transfecting a 1:1 ratio of replication cycle reaction product and pUC19\u003csup\u003e38\u003c/sup\u003e into BHK-21 cells using Polyplus jetOPTIMUS DNA transfection reagent (101000051). Three days post-transfection, a blind passage of transfection supernatant on Vero E6-TMPRSS2-T2A-hACE2 cells was performed. After observing cytopathic effect, the virus was harvested, titered by plaque assay on Vero E6-TMPRSS2-T2A-hACE2 cells, and 400bp libraries were prepared in-house using IDT Artic V4.1 NCOV-2019 Panel (10011442) and sequenced by Genewiz NGS Amplicon-EZ Illumina sequencing.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec14\" class=\"Section2\"\u003e \u003ch2\u003eGrowth curve of Spike H519N in human lung epithelial cells\u003c/h2\u003e \u003cp\u003eHuman lung epithelial cells (Calu-3) were infected at 80% confluency with a MOI of 0.1 of SARS-CoV-2 Wuhan-Hu-1 (wild-type), SARS-CoV-2 Spike D614G mutant, and SARS-CoV-2 Spike H519N mutant diluted in Roswell Park Memorial Institute 1640 (RPMI-1640) medium supplemented with fetal bovine serum (2%) and HEPES (10 mM). Supernatant was harvested every 24 hours post-infection and titered by plaque assays on Vero E6-TMPRSS2-T2A-hACE2 cells.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec15\" class=\"Section2\"\u003e \u003ch2\u003eQuantitation of infectious virus\u003c/h2\u003e \u003cp\u003eInfectious virus was quantified by plaque assay by infecting 90% confluent monolayers of Vero E6-TMPRSS2-T2A-hACE2 with serial dilutions of virus made in RPMI-1640 medium supplemented with fetal bovine serum (2%) and HEPES (10 mM). After one hour of adsorption at 37\u0026deg;C, a 1:1 mixture of 2X media (2X EMEM, 4X L-glutamine, 0.735% sodium bicarbonate, 0.2 mg/mL gentamicin sulfate, 4% FBS, 20 mM HEPES) and 3% methylcellulose was added to the cells. Infection proceeded for three days at 37\u0026deg;C before fixation in 10% buffered formalin and staining with a 0.1% crystal violet solution containing 20% ethanol to visualize plaques.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec16\" class=\"Section2\"\u003e \u003ch2\u003eStructural Analyses\u003c/h2\u003e \u003cp\u003eMolecular Mechanics/Generalized Born Surface Area (MM/GBSA) was used to predict interchain free energy of binding with the Schrodinger- Maestro software v. 2020-2 (Schr\u0026ouml;dinger, LLC, New York, NY, 2020). Spike protein (PDB ID: 7KNB) was split into individual chains and modified to reflect D614G in the Wuhan-Hu-1 strain\u003csup\u003e\u003cspan citationid=\"CR39\" class=\"CitationRef\"\u003e39\u003c/span\u003e\u003c/sup\u003e. Predicted free energy of binding between Spike chains first on one chain in an up-RBD conformation and one chain in a down-RBD conformation to approximate up conformation free energy and two down-RBD conformation chains for down conformation free energy on H519 (neutral), H519 (protonated), and N519 structures. Protonated and unprotonated states were simulated at a pH of 5 and 7, respectively. Surface mapping was performed using Schrodinger- Maestro software v. 2020-2.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec17\" class=\"Section2\"\u003e \u003ch2\u003eBiochemical characterization of Spike with human ACE2\u003c/h2\u003e \u003cp\u003eA sandwich ELISA was performed using recombinant, purified RBD of His-tagged wild-type Wuhan-Hu-1 SARS-CoV-2, SARS-CoV-2 Spike N501Y, SARS-CoV-2 Spike H519N, or SARS-CoV-2 Spike A372T and hACE2. A plate was coated with ACE2-mFc (Cat:10108-H05H) at 2 \u0026micro;g/mL and incubated overnight at 4\u0026deg;C. The following day, RBD of each aforementioned genotype was added at the following concentrations: 0.256, 1.28, 6.4, 32, 160, 800, and 4000 ng/mL. The antibody anti-His-HRP (Sino A5327) was used to detect the His-tagged RBDs. Absorbance was measured at 450nm, and values were normalized to the blank.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec18\" class=\"Section2\"\u003e \u003ch2\u003eStatistical Analyses\u003c/h2\u003e \u003cp\u003eAll statistical analyses for Figs.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003e\u0026ndash;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003e were performed using Prism 9 (GraphPad). Growth curve data was analyzed using a Two-Way ANOVA with Dunnett\u0026rsquo;s correction for multiple comparisons. All other data were analyzed with a One-Way ANOVA with Dunnett\u0026rsquo;s correction for multiple comparisons.\u003c/p\u003e \u003c/div\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eAcknowledgements\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eWe extend our thanks to the scientists who performed surveillance and sequencing of sarbecoviruses and SARS-CoV-2 and made the data accessible via GenBank and GISAID and to the team that built and maintains NextStrain. We would like to acknowledge Sino Biological for performing ELISAs.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eCompeting interests declaration\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe authors of this manuscript do not declare any competing interests.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAuthor contributions\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eConceptualization: P.M., L.K., J.W.L., A.B., S.D.\u003c/p\u003e\n\u003cp\u003eData curation: C.C., K.M., A.S., P.M., L.K., J.W.L., A.B., S.D.\u003c/p\u003e\n\u003cp\u003eFormal analysis: C.C., K.M., A.S., P.M., L.K., J.W.L., A.B., S.D.\u003c/p\u003e\n\u003cp\u003eInvestigation: C.C., K.M., A.S., P.M., L.K., J.W.L., A.B., S.D.\u003c/p\u003e\n\u003cp\u003eMethodology: P.M., L.K., J.W.L., A.B., S.D.\u003c/p\u003e\n\u003cp\u003eValidation: C.C., K.M.\u003c/p\u003e\n\u003cp\u003eWriting \u0026ndash; original draft: C.C.\u003c/p\u003e\n\u003cp\u003eWriting \u0026ndash; review \u0026amp; editing: C.C., K.M., A.S., S.D., L.K., A.B., J.W.L., P.M.\u003c/p\u003e\n\u003cp\u003eFunding acquisition: J.W.L., P.M.\u003c/p\u003e\n\u003cp\u003eProject administration: J.W.L., P.M.\u003c/p\u003e\n\u003cp\u003eResources: J.W.L., P.M., S.D.\u003c/p\u003e\n\u003cp\u003eSupervision: P.M., J.W.L., A.B., S.D.\u003c/p\u003e\n\u003cp\u003eAll authors read and approved the final manuscript.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eBiosecurity\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eAll research protocols were approved by the Virginia Tech Institutional Biosafety Committee prior to beginning experiments. Pseudotyped virus work was conducted at biosafety level 2, while live virus manipulation of SARS-CoV-2 was performed in a biosafety level 3 (BSL3) laboratory at Virginia Tech with appropriate fitting N95 respirators and other standard PPE. SARS-CoV-2 was appropriately inactivated at 65C for 10 minutes after the addition of a viral lysis buffer, as described previously, before removal from the BSL3 for library preparation\u003csup\u003e40\u003c/sup\u003e.\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\n\u003cli\u003eWHO Coronavirus (COVID-19) dashboard. https://covid19.who.int/.\u003c/li\u003e\n\u003cli\u003eCoronaviridae Study Group of the International Committee on Taxonomy of Viruses. The species Severe acute respiratory syndrome-related coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2. \u003cem\u003eNat Microbiol\u003c/em\u003e \u003cstrong\u003e5\u003c/strong\u003e, 536\u0026ndash;544 (2020).\u003c/li\u003e\n\u003cli\u003eHuang, C. \u003cem\u003eet al.\u003c/em\u003e Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. \u003cem\u003eLancet\u003c/em\u003e \u003cstrong\u003e395\u003c/strong\u003e, 497\u0026ndash;506 (2020).\u003c/li\u003e\n\u003cli\u003eZhou, P. \u003cem\u003eet al.\u003c/em\u003e A pneumonia outbreak associated with a new coronavirus of probable bat origin. \u003cem\u003eNature\u003c/em\u003e \u003cstrong\u003e579\u003c/strong\u003e, 270\u0026ndash;273 (2020).\u003c/li\u003e\n\u003cli\u003eWu, F. \u003cem\u003eet al.\u003c/em\u003e A new coronavirus associated with human respiratory disease in China. \u003cem\u003eNature\u003c/em\u003e \u003cstrong\u003e579\u003c/strong\u003e, 265\u0026ndash;269 (2020).\u003c/li\u003e\n\u003cli\u003eOrganization, W. H. \u0026amp; Others. WHO-convened global study of origins of SARS-CoV-2: China Part. (2021).\u003c/li\u003e\n\u003cli\u003eWorobey, M. \u003cem\u003eet al.\u003c/em\u003e The Huanan Seafood Wholesale Market in Wuhan was the early epicenter of the COVID-19 pandemic. \u003cem\u003eScience\u003c/em\u003e \u003cstrong\u003e377\u003c/strong\u003e, 951\u0026ndash;959 (2022).\u003c/li\u003e\n\u003cli\u003eHolmes, E. C. \u003cem\u003eet al.\u003c/em\u003e The origins of SARS-CoV-2: A critical review. \u003cem\u003eCell\u003c/em\u003e \u003cstrong\u003e184\u003c/strong\u003e, 4848\u0026ndash;4856 (2021).\u003c/li\u003e\n\u003cli\u003eMacLean, O. A. \u003cem\u003eet al.\u003c/em\u003e Natural selection in the evolution of SARS-CoV-2 in bats created a generalist virus and highly capable human pathogen. \u003cem\u003ePLoS Biol.\u003c/em\u003e \u003cstrong\u003e19\u003c/strong\u003e, e3001115 (2021).\u003c/li\u003e\n\u003cli\u003eTemmam, S. \u003cem\u003eet al.\u003c/em\u003e Bat coronaviruses related to SARS-CoV-2 and infectious for human cells. \u003cem\u003eNature\u003c/em\u003e \u003cstrong\u003e604\u003c/strong\u003e, 330\u0026ndash;336 (2022).\u003c/li\u003e\n\u003cli\u003eKang, L. \u003cem\u003eet al.\u003c/em\u003e A selective sweep in the Spike gene has driven SARS-CoV-2 human adaptation. \u003cem\u003eCell\u003c/em\u003e \u003cstrong\u003e184\u003c/strong\u003e, 4392-4400.e4 (2021).\u003c/li\u003e\n\u003cli\u003eDurrett, R. \u0026amp; Schweinsberg, J. Approximating selective sweeps. \u003cem\u003eTheor. Popul. Biol.\u003c/em\u003e \u003cstrong\u003e66\u003c/strong\u003e, 129\u0026ndash;138 (2004).\u003c/li\u003e\n\u003cli\u003eNordborg, M., Charlesworth, B. \u0026amp; Charlesworth, D. The effect of recombination on background selection. \u003cem\u003eGenet. Res.\u003c/em\u003e \u003cstrong\u003e67\u003c/strong\u003e, 159\u0026ndash;174 (1996).\u003c/li\u003e\n\u003cli\u003eLai, M. M. \u0026amp; Cavanagh, D. The molecular biology of coronaviruses. \u003cem\u003eAdv. Virus Res.\u003c/em\u003e \u003cstrong\u003e48\u003c/strong\u003e, 1\u0026ndash;100 (1997).\u003c/li\u003e\n\u003cli\u003eAlachiotis, N., Stamatakis, A. \u0026amp; Pavlidis, P. OmegaPlus: a scalable tool for rapid detection of selective sweeps in whole-genome datasets. \u003cem\u003eBioinformatics\u003c/em\u003e \u003cstrong\u003e28\u003c/strong\u003e, 2274\u0026ndash;2275 (2012).\u003c/li\u003e\n\u003cli\u003eAlachiotis, N. \u0026amp; Pavlidis, P. RAiSD detects positive selection based on multiple signatures of a selective sweep and SNP vectors. \u003cem\u003eCommun Biol\u003c/em\u003e \u003cstrong\u003e1\u003c/strong\u003e, 79 (2018).\u003c/li\u003e\n\u003cli\u003eLi, W. \u003cem\u003eet al.\u003c/em\u003e Receptor and viral determinants of SARS-coronavirus adaptation to human ACE2. \u003cem\u003eEMBO J.\u003c/em\u003e \u003cstrong\u003e24\u003c/strong\u003e, 1634\u0026ndash;1643 (2005).\u003c/li\u003e\n\u003cli\u003eStephan, W. Selective Sweeps. \u003cem\u003eGenetics\u003c/em\u003e \u003cstrong\u003e211\u003c/strong\u003e, 5\u0026ndash;13 (2019).\u003c/li\u003e\n\u003cli\u003eGuindon, S. \u003cem\u003eet al.\u003c/em\u003e New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. \u003cem\u003eSyst. Biol.\u003c/em\u003e \u003cstrong\u003e59\u003c/strong\u003e, 307\u0026ndash;321 (2010).\u003c/li\u003e\n\u003cli\u003eGuindon, S. \u0026amp; Gascuel, O. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. \u003cem\u003eSyst. Biol.\u003c/em\u003e \u003cstrong\u003e52\u003c/strong\u003e, 696\u0026ndash;704 (2003).\u003c/li\u003e\n\u003cli\u003eHadfield, J. \u003cem\u003eet al.\u003c/em\u003e Nextstrain: real-time tracking of pathogen evolution. \u003cem\u003eBioinformatics\u003c/em\u003e \u003cstrong\u003e34\u003c/strong\u003e, 4121\u0026ndash;4123 (2018).\u003c/li\u003e\n\u003cli\u003eSagulenko, P., Puller, V. \u0026amp; Neher, R. A. TreeTime: Maximum-likelihood phylodynamic analysis. \u003cem\u003eVirus Evol\u003c/em\u003e \u003cstrong\u003e4\u003c/strong\u003e, vex042 (2018).\u003c/li\u003e\n\u003cli\u003eDadonaite, B. \u003cem\u003eet al.\u003c/em\u003e A pseudovirus system enables deep mutational scanning of the full SARS-CoV-2 spike. \u003cem\u003eCell\u003c/em\u003e \u003cstrong\u003e186\u003c/strong\u003e, 1263-1278.e20 (2023).\u003c/li\u003e\n\u003cli\u003eJackson, C. B., Farzan, M., Chen, B. \u0026amp; Choe, H. Mechanisms of SARS-CoV-2 entry into cells. \u003cem\u003eNat. Rev. Mol. Cell Biol.\u003c/em\u003e \u003cstrong\u003e23\u003c/strong\u003e, 3\u0026ndash;20 (2022).\u003c/li\u003e\n\u003cli\u003ePlante, J. A. \u003cem\u003eet al.\u003c/em\u003e Spike mutation D614G alters SARS-CoV-2 fitness. \u003cem\u003eNature\u003c/em\u003e \u003cstrong\u003e592\u003c/strong\u003e, 116\u0026ndash;121 (2021).\u003c/li\u003e\n\u003cli\u003eZhang, L. \u003cem\u003eet al.\u003c/em\u003e SARS-CoV-2 spike-protein D614G mutation increases virion spike density and infectivity. \u003cem\u003eNat. Commun.\u003c/em\u003e \u003cstrong\u003e11\u003c/strong\u003e, 6013 (2020).\u003c/li\u003e\n\u003cli\u003eLiu, Y. \u003cem\u003eet al.\u003c/em\u003e The N501Y spike substitution enhances SARS-CoV-2 infection and transmission. \u003cem\u003eNature\u003c/em\u003e \u003cstrong\u003e602\u003c/strong\u003e, 294\u0026ndash;299 (2022).\u003c/li\u003e\n\u003cli\u003eSun, F. \u003cem\u003eet al.\u003c/em\u003e SARS-CoV-2 Quasispecies Provides an Advantage Mutation Pool for the Epidemic Variants. \u003cem\u003eMicrobiol Spectr\u003c/em\u003e \u003cstrong\u003e9\u003c/strong\u003e, e0026121 (2021).\u003c/li\u003e\n\u003cli\u003eWarren, C. J. \u0026amp; Sawyer, S. L. How host genetics dictates successful viral zoonosis. \u003cem\u003ePLoS Biol.\u003c/em\u003e \u003cstrong\u003e17\u003c/strong\u003e, e3000217 (2019).\u003c/li\u003e\n\u003cli\u003eYuan, Z., Nan, Z., Pei, H. \u0026amp; Yang, Z. Reconstruction of the most recent common ancestor sequences of SARS-Cov S gene and detection of adaptive evolution in the spike protein. \u003cem\u003eChin. Sci. Bull.\u003c/em\u003e \u003cstrong\u003e49\u003c/strong\u003e, 1311\u0026ndash;1313 (2004).\u003c/li\u003e\n\u003cli\u003eBecker, M. M. \u003cem\u003eet al.\u003c/em\u003e Synthetic recombinant bat SARS-like coronavirus is infectious in cultured cells and in mice. \u003cem\u003eProc. Natl. Acad. Sci. U. S. A.\u003c/em\u003e \u003cstrong\u003e105\u003c/strong\u003e, 19944\u0026ndash;19949 (2008).\u003c/li\u003e\n\u003cli\u003eMcRoy, W. C. \u0026amp; Baric, R. S. Amino acid substitutions in the S2 subunit of mouse hepatitis virus variant V51 encode determinants of host range expansion. \u003cem\u003eJ. Virol.\u003c/em\u003e \u003cstrong\u003e82\u003c/strong\u003e, 1414\u0026ndash;1424 (2008).\u003c/li\u003e\n\u003cli\u003eStarr, T. N. \u003cem\u003eet al.\u003c/em\u003e Deep Mutational Scanning of SARS-CoV-2 Receptor Binding Domain Reveals Constraints on Folding and ACE2 Binding. \u003cem\u003eCell\u003c/em\u003e \u003cstrong\u003e182\u003c/strong\u003e, 1295-1310.e20 (2020).\u003c/li\u003e\n\u003cli\u003eOzono, S. \u003cem\u003eet al.\u003c/em\u003e SARS-CoV-2 D614G spike mutation increases entry efficiency with enhanced ACE2-binding affinity. \u003cem\u003eNat. Commun.\u003c/em\u003e \u003cstrong\u003e12\u003c/strong\u003e, 848 (2021).\u003c/li\u003e\n\u003cli\u003eKreutzberger, A. J. B. \u003cem\u003eet al.\u003c/em\u003e SARS-CoV-2 requires acidic pH to infect cells. \u003cem\u003eProc. Natl. Acad. Sci. U. S. A.\u003c/em\u003e \u003cstrong\u003e119\u003c/strong\u003e, e2209514119 (2022).\u003c/li\u003e\n\u003cli\u003eLiu, H. \u003cem\u003eet al.\u003c/em\u003e Development of optimized drug-like small molecule inhibitors of the SARS-CoV-2 3CL protease for treatment of COVID-19. \u003cem\u003eNat. Commun.\u003c/em\u003e \u003cstrong\u003e13\u003c/strong\u003e, 1891 (2022).\u003c/li\u003e\n\u003cli\u003eMoghadasi, S. A. \u003cem\u003eet al.\u003c/em\u003e Transmissible SARS-CoV-2 variants with resistance to clinical protease inhibitors. \u003cem\u003eSci Adv\u003c/em\u003e \u003cstrong\u003e9\u003c/strong\u003e, eade8778 (2023).\u003c/li\u003e\n\u003cli\u003eS\u0026oslash;ndergaard, J. N. \u003cem\u003eet al.\u003c/em\u003e Successful delivery of large-size CRISPR/Cas9 vectors in hard-to-transfect human cells using small plasmids. \u003cem\u003eCommun Biol\u003c/em\u003e \u003cstrong\u003e3\u003c/strong\u003e, 319 (2020).\u003c/li\u003e\n\u003cli\u003eZhou, T. \u003cem\u003eet al.\u003c/em\u003e Cryo-EM Structures of SARS-CoV-2 Spike without and with ACE2 Reveal a pH-Dependent Switch to Mediate Endosomal Positioning of Receptor-Binding Domains. \u003cem\u003eCell Host Microbe\u003c/em\u003e \u003cstrong\u003e28\u003c/strong\u003e, 867-879.e5 (2020).\u003c/li\u003e\n\u003cli\u003eCastellanos-Gonzalez, A. \u003cem\u003eet al.\u003c/em\u003e Direct RT-PCR amplification of SARS-CoV-2 from clinical samples using a concentrated viral lysis-amplification buffer prepared with IGEPAL-630. \u003cem\u003eSci. Rep.\u003c/em\u003e \u003cstrong\u003e11\u003c/strong\u003e, 14204 (2021).\u003c/li\u003e\n\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":true,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"npj-viruses","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"","sideBox":"Learn more about [npj Viruses](https://www.nature.com/npjviruses)","snPcode":"44298","submissionUrl":"https://submission.springernature.com/new-submission/44298/3","title":"npj Viruses","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"NPJ","inReviewEnabled":true,"inReviewRevisionsEnabled":false},"keywords":"","lastPublishedDoi":"10.21203/rs.3.rs-3835105/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-3835105/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eAs the COVID-19 pandemic enters its fourth year, the pursuit of identifying a progenitor virus to SARS-CoV-2 and understanding the mechanism of its emergence persists, albeit against the backdrop of intensified efforts to monitor the ongoing evolution of the virus and the influx of new mutations. Surprisingly, few residues hypothesized to be essential for SARS-CoV-2 emergence and adaptation to humans have been validated experimentally, despite the importance that these mutations could contribute to the development of effective antivirals. To remedy this, we searched for genomic regions in the SARS-CoV-2 genome that show evidence of past selection around residues unique to SARS-CoV-2 compared with closely related coronaviruses. In doing so, we identified a residue at position 519 in Spike within the receptor binding domain that holds a static histidine in human-derived SARS-CoV-2 sequences but an asparagine in SARS-related coronaviruses from bats and pangolins. In experimental validation, the SARS-CoV-2 Spike protein mutant carrying the putatively ancestral H519N substitution showed reduced replication in human lung cells, suggesting that the histidine residue contributes to viral fitness in the human host. Structural analyses revealed a potential role of Spike residue 519 in mediating conformational transitions necessary for Spike to adopt an up configuration prior to binding with ACE2. Pseudotyped viruses bearing the putatively ancestral N519 also demonstrated significantly reduced infectivity in cells expressing the human ACE2 receptor compared to H519. Biochemical assays corroborated that N519 binds human ACE2 with lower affinity than H519. Collectively, these findings suggest that the evolutionary transition at position 519 of the Spike protein played a critical role in SARS-CoV-2 emergence and adaptation to the human host. Additionally, this residue presents as a potential drug target for designing small molecule inhibitors tailored to this site.\u003c/p\u003e","manuscriptTitle":"Evolution At Spike Position 519 in SARS-CoV-2 Facilitated Adaptation to Humans","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2024-01-08 16:50:48","doi":"10.21203/rs.3.rs-3835105/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"decision","content":"Revision requested","date":"2024-02-21T20:36:27+00:00","index":"","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2024-01-23T16:24:24+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2024-01-21T04:03:48+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"68e2018f-ed2f-45d0-ae1a-ce809cc6efce","date":"2024-01-11T22:28:22+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"0ae6c13a-ab18-4b51-ba3b-ea3ecdce6eb4","date":"2024-01-10T21:35:04+00:00","index":"hide","fulltext":""},{"type":"reviewersInvited","content":"","date":"2024-01-09T20:27:42+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2024-01-06T18:59:40+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2024-01-05T14:30:19+00:00","index":"","fulltext":""},{"type":"submitted","content":"npj Viruses","date":"2024-01-04T17:29:47+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"npj-viruses","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"","sideBox":"Learn more about [npj Viruses](https://www.nature.com/npjviruses)","snPcode":"44298","submissionUrl":"https://submission.springernature.com/new-submission/44298/3","title":"npj Viruses","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"NPJ","inReviewEnabled":true,"inReviewRevisionsEnabled":false}}],"origin":"","ownerIdentity":"ce3345a8-31c8-4ebc-b468-db5ab56cee4d","owner":[],"postedDate":"January 8th, 2024","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"under-review","subjectAreas":[{"id":27990562,"name":"Biological sciences/Microbiology/Virology/Sars cov 2"},{"id":27990563,"name":"Biological sciences/Microbiology/Virology/Viral evolution"}],"tags":[],"updatedAt":"2024-04-25T07:13:43+00:00","versionOfRecord":[],"versionCreatedAt":"2024-01-08 16:50:48","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-3835105","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-3835105","identity":"rs-3835105","version":["v1"]},"buildId":"qtupq5eGEP_6zYnWcrvyt","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: preprint-html

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2024) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-19T01:45:01.086888+00:00