An interaction between the transmembrane domains of Streptococcus pyogenes sortase A and its endogenous substrate M protein revealed by molecular dynamics simulations

2025 · doi:10.1101/2025.08.19.671115

preprint OA: closed CC-BY-NC-4.0

Keywords

sortases, enzymes, target specificity, computational modeling, molecular dynamics

bioRxiv preprint doi: https://doi.org/10.1101/2025.08.19.671115; this version posted August 20, 2025. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.

Abstract

Sortase enzymes are cysteine transpeptidases at the cell surface of gram-positive bacteria. Localized to distinct foci on the cell membrane, class A sortases (SrtAs) recognize a cell wall sorting signal (CWSS), and following cleavage at this specific binding motif, target proteins are ligated to precursors of the growing peptidoglycan layer. This activity of SrtA enzymes is utilized extensively in sortase-mediated ligation (SML) strategies for a variety of protein engineering applications. Typically, engineered variants of SrtA are used for SML experiments because of the relatively low catalytic efficiency of this enzyme. Understandably, most biochemical studies are conducted with the isolated catalytic domain of SrtA enzymes from various bacteria, and the stereochemistry of the endogenous interaction between SrtA and substrate is not well understood. Here, we used AlphaFold2 to create a model of the full-length SrtA enzyme from Streptococcus pyogenes (spySrtA) with or without either a peptide substrate or a portion of M protein, a cellular target. We ran triplicate 500 ns molecular dynamics simulations for each model embedded in a lipid bilayer, which revealed several stereochemical features of this system. Contact map analyses revealed specific interactions between catalytic domain positions of spySrtA and the lipid bilayer, as well as between the enzyme and M protein residues outside the canonical LPXTG pentapeptide CWSS. We also characterized a potential transmembrane domain interaction between spySrtA and M protein that we predict orients and stabilizes substrate binding. Taken together, these interactions likely increase the catalytic efficiency of the enzyme for its substrates in vivo, and may provide important stereochemical insights for SML uses.

Introduction

The surfaces of gram-positive pathogenic bacteria are extensively decorated with proteins.1 These include toxins, environmental sensors, components of pili, and proteins with myriad other functions critical to the survival and pathogenicity of these organisms.1–4 One mechanism by which covalent attachment of cell surface proteins is achieved is ligation mediated by sortase enzymes. The first sortase identified was the class A sortase (SrtA) from Staphylococcus aureus over 25 years ago.5,6 These enzymes are localized to discrete foci on the cell membrane, often the cleavage furrow of dividing bacteria, ligating substrates to precursors of the growing peptidoglycan layer.7,8 The catalytic mechanism of SrtA enzymes is well understood. SrtA recognizes a specific cell wall sorting signal (CWSS), defined by the sequence LPXTG, where X=any amino acid and positions are referred to as P4=Leu, P3=Pro, P2=X, P1=Thr, and P1’=Gly.2,9,10 Initial cleavage between the P1 and P1’ positions occurs following nucleophilic attack on the P1 Thr carbonyl carbon by the thiol side chain of a catalytic cysteine residue, forming an acyl-enzyme intermediate.9,10 A second nucleophile, often the α-amine of an N-terminal amino acid, resolves this intermediate, and the ligation product is formed.9,10 In addition to the Cys (C184), the catalytic residues were traditionally thought to include His (H120) and Arg (R197) (S. aureus SrtA numbering); however, recent work from ourselves and others suggested that, while critical, the Arg may not play a catalytic role in electrostatic stabilization of the oxyanion tetrahedral intermediate.
We and others determined that this stabilization is instead facilitated by the hydroxyl group of a highly conserved Thr immediately N-terminal to the catalytic Cys, as well as the backbone amide of the amino acid following the catalytic His.10–12

The ability to recognize the CWSS, followed by specific proteolytic cleavage and ligation of two sequences, makes SrtA enzymes versatile tools in sortase-mediated ligation (SML) protein engineering applications.10,13,14 Staphylococcus aureus SrtA, the first sortase discovered, continues to see regular use for SML experiments. However, because of the strict specificity of this enzyme for the LPXTG target motif, as well as its relatively low catalytic efficiency (a characteristic of all sortases studied to date), engineered variants are most often utilized.15,16 Specifically, directed evolution studies identified a pentamutant that increased the catalytic efficiency >100-fold, from 200 M⁻¹ s⁻¹ (kcat=1.5 s⁻¹, Km=7.6 mM, for an LPETG peptide) to 23,000 M⁻¹ s⁻¹ (kcat=5.4 s⁻¹, Km=0.23 mM).15

Despite these advances in the use of sortases in vitro, as well as early work in the field identifying and investigating the sortase reaction in vivo, the role of the sortase transmembrane domain in the catalytic mechanism is not well understood from a biochemical perspective.6,17–20 In this work, the increased capabilities of structural modeling, e.g., due to AlphaFold and RoseTTAFold, have allowed us to investigate the behavior of a full-length SrtA enzyme in atomic detail for the first time.21–23 Specifically, we used AlphaFold2 to model full-length Streptococcus pyogenes SrtA (spySrtA), followed by molecular dynamics simulations of this structure in a lipid bilayer mimicking the composition of that in gram-positive bacteria.24,25

We chose to investigate spySrtA and not Staphylococcus aureus SrtA for several reasons. Staphylococcus SrtA enzymes are the only ones identified that require allosteric activation by calcium,2,9,10 and we reasoned that using a non-Staphylococcus enzyme may both simplify the overall system and be more applicable to the superfamily at large. In addition, we recently solved structures of a catalytically inactive variant of spySrtA with peptide substrates (sequences LPATA and LPATS, PDB IDs 7S4O and 7S51) as well as a product mimic (LPAT-LII, PDB IDs 7T8Y and 7T8Z).26 To our knowledge, ours are the only experimental sortase structures that contain a non-covalently bound ligand and that show the substrate in an active site conformation consistent with known biochemical data.10 Because our structures were deposited in the Protein Data Bank in 2022, and the AlphaFold training database only includes structures deposited in 2021 and earlier,22 they also act as a structural control for the computationally generated output models. Finally, although S. aureus SrtA and its derivatives are the most widely used enzymes for SML, spySrtA is also utilized, particularly in dual-labeling strategies, due to a larger degree of substrate promiscuity (e.g., at the P1’ position).10,14,27–29

Triplicate molecular dynamics simulations were performed, and a number of analyses were employed to better understand the stereochemistry of full-length spySrtA in the membrane, as well as its interaction with an endogenous substrate, M protein. M protein is a well-studied virulence factor in S. pyogenes that binds to host proteins and interferes with the host immune response, including by inhibiting phagocytosis.30 Specifically, we were interested in characterizing how the CWSS properly binds the active site of spySrtA, considering that this sequence (LPSTG in M protein) ends very close to the predicted transmembrane domain. We predicted that the lipid bilayer would facilitate positioning of the spySrtA catalytic domain and that the transmembrane domains of both proteins may directly interact. Our MD simulation results suggested that the transmembrane domains of spySrtA and M protein likely do interact, and that specific contacts with lipids may orient spySrtA for catalysis. Furthermore, we identified specific interactions between spySrtA and residues beyond the CWSS, which may play a role in target specificity. Taken together, we predict that the membrane plays a major role in sortase biology in vivo, and that these results may provide additional insights for further development of these important enzymes for SML engineering applications.
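As a quick arithmetic check on the catalytic efficiencies quoted in the Introduction for S. aureus SrtA and its pentamutant, kcat/Km can be recomputed from the reported kinetic constants. The following is a minimal sketch; the function name `catalytic_efficiency` is an illustrative assumption and is not part of any published workflow.

```python
def catalytic_efficiency(kcat_per_s: float, km_mM: float) -> float:
    """Return kcat/Km in M^-1 s^-1, given kcat in s^-1 and Km in mM."""
    km_molar = km_mM * 1e-3  # convert mM to M
    return kcat_per_s / km_molar


# Kinetic constants as quoted in the text for an LPETG peptide.
wild_type = catalytic_efficiency(1.5, 7.6)     # roughly 200 M^-1 s^-1
pentamutant = catalytic_efficiency(5.4, 0.23)  # roughly 23,000 M^-1 s^-1
fold_change = pentamutant / wild_type          # consistent with ">100-fold"

print(f"wild type:   {wild_type:.0f} M^-1 s^-1")
print(f"pentamutant: {pentamutant:.0f} M^-1 s^-1")
print(f"fold change: {fold_change:.0f}")
```

Most of the gain comes from the lower Km (about 33-fold), with a smaller contribution from kcat (about 3.6-fold).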

Materials and methods

Structural modeling using AlphaFold2 and software for structural analysis. The sortase A and substrate sequences for structural modeling were obtained from UniProt: spySrtA (UniProt ID: Q99ZN4_STRP1) and M protein (UniProt ID: M6A_STRP6). The full-length spySrtA protein sequence was used, along with the C-terminal portion of M protein (residues 376-415). Structural models were generated with AlphaFold2 on the European Galaxy Server with default settings.22,31,32 AlphaFold2 input sequences can be found in the Supporting Information. Output structures were ranked using the predicted local distance difference test (pLDDT), and the ranked_0 structure was used for molecular dynamics simulations. Electrostatic potential was calculated using the APBS plugin and visualized in PyMOL (Schrödinger Software).33

Molecular dynamics simulations. Streptococcus pyogenes sortase A (spySrtA) bilayer positioning was calculated using the PPM 2.0 web server using data from the OPM database.34 SpySrtA alone, with the M protein substrate, or with the peptide was embedded in an 80% 1,2-dioleoyl-sn-glycero-3-phosphoglycerol (DOPG), 20% tetraoleoyl-cardiolipin (TOCL2) bilayer using CHARMM-GUI,35–38 with the CHARMM36m all-atom force field.39,40 Each system was solvated using TIP3P water, with sodium and chloride ions added to neutralize the system at an ionic strength of 0.15 M, and equilibrated using GROMACS 2022.4 (Table S1).41 A steepest descent energy minimization was performed until the maximum force on any atom was less than 1000 kJ/mol/nm.
The temperature was first equilibrated at 300 K with restraints on all heavy protein and lipid atoms using a Berendsen thermostat for 250 ps.42 The pressure of the system was equilibrated in the NPT ensemble at 1 bar with decreasing restraints on all heavy protein and lipid atoms using a Berendsen semi-isotropic barostat for 1.75 ns total.42 The temperature and pressure of the system were further equilibrated in the NPT ensemble with a Parrinello-Rahman semi-isotropic barostat43 and Nose-Hoover thermostat44,45 at 300 K without restraints for 10 ns. Bonds involving hydrogen atoms were constrained with the LINCS algorithm.46 Each equilibrated structure was used to run a single 500 ns simulation at 300 K and 1 bar using GROMACS 2022.4.41 Simulations were performed in triplicate, with separate equilibrations run for each. Atomic coordinates were saved every 100 ps. The size of each system is reported in Table S1.

Analysis of MD simulations. Contacts between atom pairs (spySrtA, DOPG, TOCL2, and/or substrate) were calculated with PLUMED 2.4.47,48 A contact was considered formed if the distance between atoms was less than 4 Å. The contacts included nitrogen, oxygen, and carbon atoms for each residue in spySrtA near the membrane surface and nitrogen, oxygen, and carbon atoms for each lipid type in the membrane (DOPG and TOCL2). Contacts were measured between hydrophilic (nitrogen and oxygen) and hydrophobic (carbon) atoms in spySrtA and each lipid type, and/or between nitrogen, oxygen, and carbon atoms for each residue in spySrtA and substrate. Contacts were monitored over the 500 ns simulations.
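The contact criterion described above (two atoms closer than 4 Å) and the resulting "percent of simulation in contact" can be illustrated with a short standard-library sketch. This is not the PLUMED workflow used in the study; the function name `contact_fraction` and the toy coordinates are hypothetical and serve only to show the calculation.

```python
import math

CONTACT_CUTOFF = 4.0  # Å, the threshold stated in the text


def contact_fraction(coords_a, coords_b, cutoff=CONTACT_CUTOFF):
    """Fraction of frames in which two atoms are closer than `cutoff`.

    coords_a and coords_b are equal-length sequences of (x, y, z)
    positions in Å, one entry per saved trajectory frame.
    """
    in_contact = [math.dist(a, b) < cutoff for a, b in zip(coords_a, coords_b)]
    return sum(in_contact) / len(in_contact)


# Hypothetical four-frame trajectory: atom A stays at the origin while
# atom B sits 3.0, 3.9, 4.0, and 6.0 Å away along x.
atom_a = [(0.0, 0.0, 0.0)] * 4
atom_b = [(3.0, 0.0, 0.0), (3.9, 0.0, 0.0), (4.0, 0.0, 0.0), (6.0, 0.0, 0.0)]

fraction = contact_fraction(atom_a, atom_b)
print(f"in contact for {100 * fraction:.0f}% of frames")
```

Because the criterion is strictly "less than 4 Å", the frame at exactly 4.0 Å does not count as a contact in this sketch.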
Root mean squared fluctuation (RMSF) per residue of backbone atoms, root mean squared deviation (RMSD) of backbone atoms, and solvent accessible surface area (SASA) per residue for side chain and backbone atoms were calculated using GROMACS analysis tools.49 Potential energy between spySrtA and substrate or peptide was calculated using GROMACS.

Bioinformatics analysis of natural spySrtA substrates and homologous sortase A enzymes. Predicted natural substrates in the Streptococcus pyogenes genome were identified using a hidden Markov model (HMM).50,51 Sequences spanning five residues before and twenty-three residues after the LPXTG recognition motif were aligned. A sequence logo of amino acid prevalence at each position of the substrate was created using the online WebLogo tool.52 An NCBI BLAST search was performed on the wild-type (WT) spySrtA sequence, filtering for the genus Streptococcus. Twenty-eight sequences from the Streptococcus genus were aligned with spySrtA using Clustal Omega.53 Conserved residues were identified using EndScript Server 3.054 and ConSurf.55,56 Residues of spySrtA conserved across 300 sortase A sequences were identified with ConSurf. Conserved M protein substrate residues were also identified with ConSurf using 300 sequences as an input in the online server. Conservation scores were mapped onto the respective protein structures.
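The substrate window described above (five residues before and twenty-three residues after the LPXTG motif) can be extracted with a regular expression before alignment. The sketch below is one possible approach under stated assumptions, not the pipeline used in the study: the helper `motif_window`, the choice to take only the first match, and the toy sequence are all hypothetical.

```python
import re

# LPXTG, where X is any residue.
LPXTG = re.compile(r"LP.TG")


def motif_window(sequence, before=5, after=23):
    """Return the residues flanking the first LPXTG motif, or None.

    The window holds `before` residues, the five-residue motif, and
    `after` residues. None is returned if the motif is absent or the
    sequence is too short to supply the full flanking regions.
    """
    match = LPXTG.search(sequence)
    if match is None:
        return None
    start = match.start() - before
    end = match.end() + after
    if start < 0 or end > len(sequence):
        return None
    return sequence[start:end]


# Synthetic sequence for illustration only (not a real substrate).
toy_sequence = "MM" + "AAAAA" + "LPSTG" + "E" + "V" * 22 + "KK"
window = motif_window(toy_sequence)
print(window, len(window))
```

Windows of equal length (33 residues here) can be stacked directly as input for a sequence logo without a separate alignment step.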

Results

The spySrtA catalytic domain interacts with the gram-positive bacterial membrane via electrostatic and hydrophobic interactions. To get a better understanding of the structure of the full-length spySrtA enzyme, we utilized AlphaFold2 modeling, as described in the Materials and Methods (Figure 1A). The structure largely agreed with our predictions of the full-length protein; however, we were intrigued by a number of intramolecular contacts in residues of the spySrtA extracellular domain, which are not typically included in the catalytic domain constructs that have been utilized for in vitro work (e.g., amino acids 81-249) (Figure 1B). Specifically, we observed multiple hydrophobic (Y49-V94-F110, V51-V94, I59-I95) and polar (N47-S243, Q50-N106, S55-E17) interactions (Figure 1B). Catalytic domain constructs of sortase A enzymes were historically determined via similarity in multiple sequence alignments, which is how the construct boundaries of spySrtA were defined.57,58 However, there is evidence that residues N-terminal to the catalytic domain may play a regulatory role in SrtA enzymes such as Bacillus anthracis SrtA.59 Moving forward, we will refer to the extracellular domain of spySrtA as amino acids 34-249 (34NKPIR…NQVST249) and the catalytic domain as amino acids 81-249 (81SVLQA…NQVST249). We next modeled spySrtA in its membrane environment, utilizing a lipid composition of 80% 1,2-dioleoyl-sn-glycero-3-phosphoglycerol (DOPG) and 20% tetraoleoyl-cardiolipin (TOCL2), based on previous studies of gram-positive bacterial membranes (Figures 1C, S1).24 Insertion of spySrtA into the lipid bilayer is described in the Materials and Methods.

Figure 1. AlphaFold models of full-length Streptococcus pyogenes SrtA (spySrtA) with and without a lipid bilayer. (A) An AlphaFold2-generated model of full-length spySrtA (residues 1-249) is shown in cartoon representation with a transparent surface. Three regions of the protein are colored and labeled. (B) Amino acids which may facilitate intraprotein interactions between the catalytic domain (as commonly used in biochemical studies, residues 81-249) and residues N-terminal to this region are highlighted with the side chains shown as spheres and colored by heteroatom (O=red, N=blue, C=marine (for catalytic domain) and C=cyan (for N-terminal to catalytic domain)). SpySrtA is shown in cartoon. (C) A full-length model of spySrtA (gray, cartoon representation) in a lipid bilayer (lines, colored by heteroatom with C=gray).

Following generation of our model of full-length spySrtA enzyme in a lipid bilayer, we ran triplicate molecular dynamics simulations for 500 ns to assess sortase-membrane interactions, as described in the Materials and Methods. Overall, the catalytic domain of spySrtA remained stable during the course of each simulation (Figure S2). Contact analyses of specific spySrtA atoms with the lipid bilayer revealed a number of interactions in the catalytic (extracellular) domain that frequently occurred during the simulations (Figure 2). As expected, transmembrane residues remained embedded in the lipid bilayer during the entire simulation (Figures 2A-B). In addition, residues close to the transmembrane domain were also frequently associated with the membrane exterior. Interestingly, there were also a number of residues not immediately adjacent to the transmembrane domain that were frequently (defined as >50% of the simulation) in contact with lipid carbon or oxygen atoms.
These included residues N-terminal to the catalytic domain (S78-E80) or immediately within it (L83, Q86, M87), as well as I147 and T148 near the C-terminus. Some of these residues appeared to preferentially bind to either DOPG or TOCL2. Residues near the peptide binding groove (R38, S78-E80, and L83) bound to the phosphate and/or fatty acid tail of DOPG, whereas residues opposite to the peptide binding groove (T40-L41, R44, N47-K48, and Q86-M87) bound to polar and/or hydrophobic groups of TOCL2 for >80% of the simulation (Figures 2A-B). Residue-specific preferences for lipid moieties could support the orientations spySrtA adopts in the membrane. Taken together, these data revealed interactions between spySrtA, including the catalytic domain, and the membrane.

Figure 2. Contact map of spySrtA and the lipid bilayer. Following triplicate 500 ns molecular dynamics simulations, a contact map was generated to assess the percentage of the simulation during which specific catalytic domain atoms in spySrtA were bound to lipid groups. (A) SpySrtA is shown in cartoon representation and colored in gray. The traditional His-Cys-Arg catalytic residues are in gold, with side chain atoms shown as sticks and colored by heteroatom (N=blue, O=red, C=gold). For amino acids that made specific contacts, the Cα atoms are shown as spheres and colored according to the key. These colors match the data in (B). (B) Specific contacts for the intracellular, transmembrane, and catalytic domain residues of spySrtA with lipid groups are shown and colored as labeled.
The transmembrane domains of spySrtA and its endogenous substrate M protein interact specifically and stably during molecular dynamics simulations. To investigate the tripartite complex of spySrtA, membrane, and substrate, we created two separate models using the pipeline described above. For both, we chose to include the sortase recognition motif initially bound in the active site, in order to investigate interactions between proteins and with the membrane in the bound complex. In future experiments, it would also be interesting to investigate initial recognition by sortase of its substrate(s). In the first model, we used a peptide substrate (LPSTG, where L=P4, P=P3, S=P2, T=P1, and G=P1’) to match the canonical pentapeptide recognition motif (LPXTG) for sortase enzymes (Figure 3A).2,5,6,10 In addition, this sequence is derived from the S. pyogenes M protein, a virulence factor that is attached to the bacterial cell surface by sortase-mediated ligation.3,60 Our second model included a region of the M protein containing both the LPSTG sequence and its C-terminal transmembrane domain. Because there is reported variability in M protein extracellular sequences, the protein is very large, and there is no evidence to our knowledge of interactions between the spySrtA catalytic domain and other regions of the substrate, we restricted our model to M protein residues 376-415, which included five residues before the start of the target LPSTG motif (Figure 3B). The prediction algorithm TMHMM-2.0 was used to predict the transmembrane domain of M protein, suggesting that this domain comprises residues 387-409.61 Our model was largely consistent with this result, although it suggested that the transmembrane domain more accurately excludes T387 (Figure 3C). For spySrtA, TMHMM-2.0 predicts the transmembrane domain to contain residues 13-32. Again, our model largely agreed, with the addition of F33 and N34 (which interacted with the polar head groups of the lipid molecules) (Figure 3C).

Figure 3. AlphaFold models of full-length spySrtA with substrate inserted into a lipid bilayer. Output models of spySrtA with an LPSTG peptide (A) or extended M protein sequence (B) are shown with the spySrtA protein in gray surface representation. For all, the lipid bilayer is shown as lines and colored by heteroatom (C=gray, O=red, N=blue). (A) The LPSTG peptide is in yellow spheres. (B) The extended M protein model is in cartoon for the intracellular and transmembrane domains, and stick representation colored by heteroatom for the extracellular domain (C=yellow), which includes the LPSTG motif. (C) The predicted transmembrane domain residues are highlighted as spheres and colored by heteroatom (spySrtA: C=gray, M protein: C=yellow). The extracellular domains are shown in cartoon representation and colored as labeled.

We ran triplicate 500 ns molecular dynamics simulations with each of these spySrtA-substrate-membrane complexes, as described in the Materials and Methods. Again, we observed that all components were stable throughout each simulation, with minimal variability, as measured using relative root-mean-square deviation over time and root-mean-square fluctuation by residue calculations (Figures S3-4). This is also apparent in a structural alignment of 21 states from an example simulation, with structures (membrane not shown) taken every 25 ns of simulation time, including t=0 (Figure 4).

Figure 4. Molecular dynamics simulations reveal stable substrate binding to spySrtA. The results of one simulation replicate are shown for spySrtA-LPSTG (A) and spySrtA-M protein (B). Output states sampled at Δt=25 ns (21 states total, including t=0) are aligned and shown. The lipid bilayer is not shown for clarity (although it was present in all simulations). SpySrtA is in gray cartoon. The intracellular and transmembrane domain of M protein is in yellow cartoon (B). All other peptide or M protein (LPSTG or KRQLPST, respectively) residues are shown as yellow sticks and colored by heteroatom (N=blue, O=red, C=yellow). The insets show a zoomed-in version of the interaction and are rendered similarly.

The largest variability is seen for the five amino acids preceding the LPSTG motif in M protein, which was not surprising as these residues are not expected to specifically interact with spySrtA (Figure 4B).
Furthermore, the relative stability of the atoms in the LPSTG sequence in both simulations was similar to what we observed in previous 1000 ns spySrtA 81-249-LPATA molecular dynamics simulations, using our experimental structures.26 The distance between the thiol of the catalytic C208 and the P1 carbonyl carbon, the site of nucleophilic attack, was also stabilized by the presence of additional M protein residues, as visualized by a shift in the distance distribution (Figure S5). For example, in our triplicate simulations of spySrtA-M protein-membrane (defined as T1, T2, and T3), the distance distribution between these atoms was centered around 3.8 Å for T1 and T2, but closer to 5 Å for T3. For reference, we previously observed a probability distribution centered at 3.8 Å in our spySrtA 81-249-LPATA simulations.26 However, in all three simulations for the spySrtA-LPSTG-membrane system, we saw a bimodal distribution (centered at the 3.8 and 5 Å distances) (Figure S5). There is no clear explanation for this discrepancy, although we predict that it may reflect the peptide and/or substrate sampling both a catalytically competent bound state and an unreactive partially bound state. With M protein, the averaged ratio favors the closer or ‘bound’ state, with relative probabilities of roughly 0.25:0.15 for the peak maxima of 3.8 Å:5 Å, or ‘bound’:‘partially bound’ distances. Conversely, for the LPSTG peptide simulations, this ratio is flipped, at roughly 0.2:0.3 for the peaks corresponding to the 3.8 Å:5 Å distances (Figure S5). Notably, the peptide does stay stably bound despite some fluctuations in this distance (Figure 4).

The spySrtA-M protein-membrane model and simulations also revealed a relatively stable transmembrane domain interaction between the two proteins. This interaction persists throughout the entirety of each replicate simulation, although the specific residues that maintain contact vary (Figure 5).
Here, we visually analyzed amino acids oriented towards each other at t=0 ns of each simulation, including Leu20, Ile21, Leu24, Gly28, and Leu31 in spySrtA and Phe392, Ala395, Ala396, Val399, and Ala403 in M protein, for the t=0, 250, and 500 ns states (Figure 5). The transmembrane domains of these proteins remain associated in all replicates, with the highest degree of dissociation at t=500 ns for simulation T1. Furthermore, in the t=500 ns state, interacting residues differ for T1 (Leu31-Ala395), T2 (Leu24-Ala403 and Leu31-Ala396), and T3 (Leu31-Phe392, Leu24-Val399, and Ile21-Ala403) (Figure 5). Overall, these simulations suggest that the transmembrane domain interaction between spySrtA and M protein may be multivalent, likely reflecting the dynamic nature of both the membrane and proteins themselves.

Specific residues in spySrtA and M protein are buried upon substrate binding. We wanted to further analyze substrate recognition by the catalytic domain of spySrtA in the context of the full-length proteins. When we analyzed the change in solvent accessible surface area (ΔSASA), defined as SASAbound - SASAunbound (in Å²), we saw that the P4 Leu and P1 Thr in the M protein fragment are substantially buried (-ΔSASA > 10 Å²) upon substrate binding in the triplicate simulations (Figures 6A-B). In addition, we saw that for one of the replicates, the largest -ΔSASA value observed was for the P2’ Glu. For the other two replicates, -ΔSASA for the P2’ Glu was second only to the P4 Leu, a position previously described as binding in a specific hydrophobic pocket (Figure 6B).26 This observation will be discussed in detail below.
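The burial criterion defined above (ΔSASA = SASAbound - SASAunbound, with a residue counted as substantially buried when -ΔSASA > 10 Å²) can be written out as a short sketch. The per-residue SASA values below are hypothetical placeholders chosen only to exercise the calculation; they are not results from the simulations, and the helper name `buried_residues` is an assumption.

```python
BURIAL_THRESHOLD = 10.0  # Å^2, the cutoff on -ΔSASA stated in the text


def buried_residues(sasa_bound, sasa_unbound, threshold=BURIAL_THRESHOLD):
    """Return {residue: ΔSASA} for residues with -ΔSASA above `threshold`.

    ΔSASA is SASA(bound) - SASA(unbound), so burial upon binding gives
    a negative ΔSASA.
    """
    buried = {}
    for residue, unbound in sasa_unbound.items():
        delta = sasa_bound[residue] - unbound
        if -delta > threshold:
            buried[residue] = delta
    return buried


# Hypothetical per-residue SASA values in Å^2 (illustration only).
sasa_unbound = {"P4 Leu": 150.0, "P3 Pro": 90.0, "P1 Thr": 80.0}
sasa_bound = {"P4 Leu": 40.0, "P3 Pro": 85.0, "P1 Thr": 55.0}

result = buried_residues(sasa_bound, sasa_unbound)
print(result)
```

With these placeholder numbers, the P3 Pro loses only 5 Å² on binding and therefore falls below the threshold, while the other two positions pass it.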
Other substrate residues with relatively large -ΔSASA values include the P6 Arg (the Arg in RQLPSTGE), P3 Pro, P2 Ser, P1’ Gly, and other positions within the transmembrane domain (Figure 6B).

Figure 5. Structural trajectories of spySrtA and M protein transmembrane domain interactions. Specific states (corresponding to t=0, 250, 500 ns) are shown for each replicate simulation, T1 (A), T2 (B), and T3 (C). The lipid bilayer is not shown for clarity, although it is present in all simulations. SpySrtA is shown as gray cartoon and M protein as yellow cartoon. Amino acid side chains that are oriented towards each other in the transmembrane helices are shown as spheres and labeled for all.

For the spySrtA enzyme, in addition to the expected positions required for enzymatic activity (H142-C208-R216), we also observed other residues within the catalytic domain with a similar -ΔSASA, including E189, V191, and I211 (Figure 6A). These positions may contact the substrate and stabilize binding outside the LPSTG recognition motif, for example at the N-terminal KRQ and C-terminal ET positions (M protein sequence = KRQLPSTGET) (Figure 6C). Taken together, these results suggested there may be specific interactions between spySrtA and M protein beyond the canonical LPXTG recognition motif for class A sortases. Two additional spySrtA residues (Q72 and P76) that are not in the transmembrane domain also exhibited relatively large -ΔSASA values in the substrate-bound models (Figure 6A).
These two residues also appear to interact directly with the M protein substrate in or near the recognition motif, at either the P5 Gln (with Q72) or the P2 Ser and P1’ Gly (with P76) positions (Figure 6C). These interactions are present in all three replicate simulations, with average distances between Cα atoms equal to: Q72-P5 Gln = 7.3 ± 0.5 Å (T1), 7.4 ± 0.6 Å (T2), and 7.3 ± 0.5 Å (T3), and P76-P1’ Gly = 6.1 ± 0.7 Å (T1), 4.8 ± 0.3 Å (T2), and 6.3 ± 0.7 Å (T3).

Figure 6. Residues outside the canonical pentapeptide recognition motif interact with the catalytic domain of spySrtA. Analysis of the change in solvent accessible surface area (SASA) between the bound and unbound AlphaFold models (ΔSASA) reveals several amino acids that become buried upon substrate binding. Residues with a relatively large -ΔSASA are highlighted and labeled for spySrtA (A) and M protein (B). (C) Predicted interactions at amino acids N-terminal (KRQ) and C-terminal (ET) to the LPSTG pentapeptide recognition motif are highlighted in spySrtA (gray cartoon) as side chain spheres and colored by heteroatom (C=gray, O=red, N=blue). The KRQLPSTGET sequence of M protein is shown as spheres and colored by heteroatom (C=yellow), with other amino acids as a yellow cartoon. M protein numbering is based on the full-length protein.

The extracellular domain (amino acids 34-249) of spySrtA interacts with its endogenous substrate M protein at positions beyond the canonical pentapeptide recognition motif. To complement the analysis of our AlphaFold models, we also used our spySrtA 1-249-M protein 376-415 molecular dynamics simulations to investigate interactions in residues adjacent to the LPXTG substrate recognition motif. In our triplicate simulations, contact map analysis revealed interactions at amino acids both N- and C-terminal to the M protein LPSTG sequence (Figure 7A). At the N-terminus, these data confirmed interactions of Q72, E189, and V191 with the P6-P5 RQ (RQLPSTG) positions, as also highlighted above in our ΔSASA analysis. A potential role for backbone atoms of F71 and P188 was also identified (Figure 7A). We observed even more persistent interactions adjacent to the C-terminus of LPSTG, with both specific backbone and side-chain contacts between the P2’ Glu (LPSTGE) and several spySrtA residues. Most notably, electrostatic interactions with the side-chain atoms of K35 and R38 in spySrtA were present throughout each simulation (Figure 7B).

Figure 7. Additional specific interactions are identified between spySrtA and the P2’ Glu in M protein. (A) Contact map of spySrtA amino acids and either the KRQ or ET residues of M protein, in the sequence KRQLPSTGET. “Hydrophilic” refers to side chain atoms. (B) Initial and final states from the T1, T2, and T3 simulations (t=0 and 500 ns) highlighting persistent interactions between K35 and R38 of spySrtA and the P2’ Glu in M protein. M protein is shown as yellow cartoon with the P2’ Glu side chain as sticks and colored by heteroatom (C=yellow, O=red, N=blue). SpySrtA is shown as gray cartoon with the K35 and R38 side chain atoms as sticks and colored by heteroatom (C=gray). Relevant distances are labeled. (C) Sequence logo (WebLogo) of 24 predicted endogenous substrates of spySrtA confirms conservation at the P2’ position for a negatively charged amino acid (either D or E). (D) ConSurf analysis with 28 Streptococcus SrtA sequences reveals that the K35 and R38 positions are generally conserved (left), although this is not true for 300 SrtA sequences from a broader range of bacterial species (middle). M protein conservation is also highlighted (right). For all, the proteins are shown in cartoon representation with relevant Cα atoms as spheres. The conservation scale is shown and labeled.

Multiple sequence alignment of spySrtA plus 27 additional Streptococcus SrtA proteins indicates that these Lys and Arg positions are relatively well conserved. Nineteen of the 28 sequences contain a Lys in the equivalent K35 position, with the other sequences containing either Ser or Thr polar residues (Figure S6). Twenty-five of the 28 sequences contain an Arg in the equivalent R38 position (Figure S6). Conservation is also reflected in spySrtA substrate sequences; a sequence logo of 24 predicted spySrtA endogenous substrates reveals that a negatively charged amino acid (either Glu or Asp) in the P2’ position is highly conserved (Figure 7C). We also used ConSurf to investigate the evolutionary conservation of these residues visually (Figure 7D).55,56 When we limited our analysis to the 28 Streptococcus SrtA proteins, the relatively high conservation of residues such as K35 and R38 in spySrtA was again apparent (black arrow in left panel of Figure 7D); however, when the analysis was applied to 300 SrtA sequences (middle panel in Figure 7D), these positions were not conserved. For example, sequence alignment of spySrtA with Staphylococcus aureus SrtA (UniProt ID SRTA_STAA8) revealed that while the Lys is conserved (K26 in S. aureus SrtA), the residue in the equivalent Arg position is D30.
To our knowledge, these types of interactions have not been explored in the literature, and it remains unclear whether they have a significant impact on spySrtA activity. In the first step of sortase-mediated catalysis, the substrate is cleaved between the P1/P1’ positions (LPST/GE), and initial binding of the substrate is potentially facilitated by interactions outside of the standard LPXTG motif .2,9,10 Additional experimentation will be necessary to probe these interactions further, and to understand the mechanistic implications of these observations. .CC-BY-NC 4.0 International licenseavailable under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made The copyright holder for this preprintthis version posted August 20, 2025. ; https://doi.org/10.1101/2025.08.19.671115doi: bioRxiv preprint
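As an illustrative sketch of the bound versus unbound SASA comparison described in this section (not the authors' actual workflow), the short Python example below takes per-residue SASA values for two models, computes the change as SASA(bound) minus SASA(unbound), and lists residues that become buried upon binding. The function names, residue labels, numerical values, and the −20 Å² cutoff are all hypothetical and chosen only for illustration; the text does not specify a cutoff.

```python
from typing import Dict, List, Tuple


def delta_sasa(bound: Dict[str, float], unbound: Dict[str, float]) -> Dict[str, float]:
    """Per-residue change in SASA (bound minus unbound), in square angstroms.

    Only residues present in both models are compared.
    """
    shared = bound.keys() & unbound.keys()
    return {res: bound[res] - unbound[res] for res in shared}


def buried_residues(dsasa: Dict[str, float], cutoff: float = -20.0) -> List[Tuple[str, float]]:
    """Residues whose SASA drops by at least |cutoff| on binding, most buried first.

    The cutoff is a hypothetical value for illustration only.
    """
    hits = [(res, change) for res, change in dsasa.items() if change <= cutoff]
    return sorted(hits, key=lambda item: item[1])


# Made-up per-residue SASA values (square angstroms); not data from this study.
unbound = {"res_A": 85.0, "res_B": 40.0, "res_C": 12.0}
bound = {"res_A": 30.0, "res_B": 38.5, "res_C": 12.0}

dsasa = delta_sasa(bound, unbound)
hits = buried_residues(dsasa)
print(hits)  # residues flagged as buried under the assumed cutoff
```

With this sign convention, residues that lose solvent exposure on substrate binding have a negative change, so the most strongly buried positions are those with the largest magnitude below zero. In practice the per-residue inputs would come from whichever SASA tool is applied to the two models.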

Discussion

One of our major questions prior to modeling full-length spySrtA with M protein in the membrane was how the LPXTG recognition motif would be oriented with respect to the enzyme active site. The transmembrane domain of M protein is C-terminal to the LPSTG by only a small number of residues, and the stereochemistry of recognition is unclear. A transmembrane domain prediction algorithm predicted that it starts at T387, just two amino acids after the P1’ Gly (residues 381-392: LPSTGETANPFF). Our model illustrated that the enzyme and substrate are indeed well positioned for catalysis in the context of a lipid bilayer, which, while not surprising from an evolutionary or biochemical standpoint, provides a new structural view of this fundamentally important enzymatic reaction. In addition, we observed that because the sortase recognition motif is proximal to the membrane, the catalytic domain of spySrtA was also positioned near the membrane surface, and our contact map revealed several residues in the catalytic domain that interacted directly with lipid molecules (Figure 1C).

Another insight from our simulations was that there were interactions between spySrtA and M protein residues outside of the LPSTG pentapeptide recognition motif. These included the flanking KRQ and ET residues of the sequence KRQLPSTGET (Figure 7A). Most notably, we observed specific interactions between K35 and R38 in spySrtA, which are well conserved in Streptococcus SrtA enzymes, and the P2’ Glu, which is also strongly conserved in endogenous substrates for spySrtA (Figures 7C, S6). The contribution of these contacts was not tested with respect to the sortase-mediated ligation reaction; however, we predict that a better understanding of specificity at these positions may be useful in the design of substrates for protein engineering applications using sortase-mediated ligation (SML) strategies.10,14 Consistent with these observations, our data also suggested that the presence of the transmembrane domain of M protein may facilitate an increased percentage of ‘bound’ substrate in the active site as compared to the isolated peptide (Figure S5).

This is a challenging system to study experimentally. In preliminary experiments not presented here, we successfully purified full-length spySrtA in lipid nanodiscs and confirmed catalytic activity with a fluorescent peptide substrate, similar to our previous work.26,62–64 Importantly, the ideal substrate would also include its transmembrane domain in order to directly probe the role of the transmembrane domain in substrate recognition, as well as to test a potential transmembrane domain-mediated interaction between spySrtA and M protein. Challenges in the system design included controlling the membrane insertion direction for both enzyme and substrate, and inhibiting enzymatic activity prior to experiment initiation. These issues are surmountable, and experimental work is ongoing.

Overall, our molecular dynamics simulations revealed structural insights into the substrate recognition of a class A sortase with an endogenous substrate within a model lipid bilayer. In particular, we identified specific contacts between the enzyme and membrane, and our results indicated that there is a transmembrane domain interaction that may facilitate sortase catalysis in vivo. We hypothesize that the proposed transmembrane interaction could enhance both substrate recognition and turnover rate in vivo, thus directly impacting relative enzyme efficiency. This has potential implications for SML protein engineering methods, where the catalytic domains of class A sortases in isolation are widely used despite the wild-type enzymes being limited by low catalytic efficiencies.14,15,65,66 While gains in enzyme efficiency and substrate scope have been achieved via directed evolution experiments and other strategies, we envision that a complementary approach may be to mimic the novel enzyme-substrate interactions described here.13,15,16,67,68 In this regard, prior work has demonstrated that preassociation of sortase and sortase substrates through either protein-protein interactions or on the surface of liposomes does indeed facilitate SML.69–71 Continued work on these strategies is thus a promising route to advancing SML methodology.

Acknowledgements

We want to thank all additional members of the Amacher, Antos, McCarty, and Spiegel labs for helpful discussion and research support. This work was supported by NIH 1R15GM154315-01 to J.F. Amacher, J.M. Antos, and J. McCarty. It was additionally supported by NSF CHE-2044958 and a Cottrell Scholar Award from the Research Corporation for Science Advancement to J.F. Amacher, NSF CHE-2102189 and MCB-2441210 to J. McCarty, and NIH 2R15HL135658-03 to P.C. Spiegel.

References

(1) Hook, M.; Foster, T. J. Editorial: Cell Surface Proteins of Gram-Positive Pathogenic Bacteria. Front. Microbiol. 2021, 12, 681880.
(2) Spirig, T.; Weiner, E. M.; Clubb, R. T. Sortase Enzymes in Gram-Positive Bacteria. Mol. Microbiol. 2011, 82, 1044–1059.
(3) Fischetti, V. A. M Protein and Other Surface Proteins on Streptococcus Pyogenes. In Streptococcus pyogenes: Basic Biology to Clinical Manifestations; Ferretti, J. J.; Stevens, D. L.; Fischetti, V. A., Eds.; University of Oklahoma Health Sciences Center: Oklahoma City (OK), 2022.
(4) Schneewind, O.; Model, P.; Fischetti, V. A. Sorting of Protein A to the Staphylococcal Cell Wall. Cell 1992, 70, 267–281.
(5) Ton-That, H.; Liu, G.; Mazmanian, S. K.; Faull, K. F.; Schneewind, O. Purification and Characterization of Sortase, the Transpeptidase That Cleaves Surface Proteins of Staphylococcus Aureus at the LPXTG Motif. Proc. Natl. Acad. Sci. USA 1999, 96, 12424–12429.
(6) Mazmanian, S. K.; Liu, G.; Ton-That, H.; Schneewind, O. Staphylococcus Aureus Sortase, an Enzyme That Anchors Surface Proteins to the Cell Wall. Science 1999, 285, 760–763.
(7) Raz, A.; Fischetti, V. A. Sortase A Localizes to Distinct Foci on the Streptococcus Pyogenes Membrane. Proc. Natl. Acad. Sci. USA 2008, 105, 18549–18554.
(8) Kline, K. A.; Kau, A. L.; Chen, S. L.; Lim, A.; Pinkner, J. S.; Rosch, J.; Nallapareddy, S. R.; Murray, B. E.; Henriques-Normark, B.; Beatty, W.; et al. Mechanism for Sortase Localization and the Role of Sortase Localization in Efficient Pilus Assembly in Enterococcus Faecalis. J. Bacteriol. 2009, 191, 3237–3247.
(9) Jacobitz, A. W.; Kattke, M. D.; Wereszczynski, J.; Clubb, R. T. Sortase Transpeptidases: Structural Biology and Catalytic Mechanism. Adv. Protein Chem. Struct. Biol. 2017, 109, 223–264.
(10) Amacher, J. F.; Antos, J. M. Sortases: Structure, Mechanism, and Implications for Protein Engineering. Trends Biochem. Sci. 2024, 49, 596–610.
(11) Chen, J.-L.; Wang, X.; Yang, F.; Li, B.; Otting, G.; Su, X.-C. 3D Structure of the Transient Intermediate of the Enzyme–substrate Complex of Sortase A Reveals How Calcium Binding and Substrate Recognition Cooperate in Substrate Activation. ACS Catal. 2023, 13, 11610–11624.
(12) Tian, B.-X.; Eriksson, L. A. Catalytic Mechanism and Roles of Arg197 and Thr183 in the Staphylococcus Aureus Sortase A Enzyme. J. Phys. Chem. B 2011, 115, 13003–13011.
(13) Antos, J. M.; Truttmann, M. C.; Ploegh, H. L. Recent Advances in Sortase-Catalyzed Ligation Methodology. Curr. Opin. Struct. Biol. 2016, 38, 111–118.
(14) Morgan, H. E.; Turnbull, W. B.; Webb, M. E. Challenges in the Use of Sortase and Other Peptide Ligases for Site-Specific Protein Modification. Chem. Soc. Rev. 2022, 51, 4121–4145.
(15) Chen, I.; Dorr, B. M.; Liu, D. R. A General Strategy for the Evolution of Bond-Forming Enzymes Using Yeast Display. Proc. Natl. Acad. Sci. USA 2011, 108, 11399–11404.
(16) Podracky, C. J.; An, C.; DeSousa, A.; Dorr, B. M.; Walsh, D. M.; Liu, D. R. Laboratory Evolution of a Sortase Enzyme That Modifies Amyloid-β Protein. Nat. Chem. Biol. 2021, 17, 317–325.
(17) Perry, A. M.; Ton-That, H.; Mazmanian, S. K.; Schneewind, O. Anchoring of Surface Proteins to the Cell Wall of Staphylococcus Aureus. III. Lipid II Is an in Vivo Peptidoglycan Substrate for Sortase-Catalyzed Surface Protein Anchoring. J. Biol. Chem. 2002, 277, 16241–16248.
(18) Mazmanian, S. K.; Ton-That, H.; Su, K.; Schneewind, O. An Iron-Regulated Sortase Anchors a Class of Surface Protein during Staphylococcus Aureus Pathogenesis. Proc. Natl. Acad. Sci. USA 2002, 99, 2293–2298.
(19) Mazmanian, S. K.; Ton-That, H.; Schneewind, O. Sortase-Catalysed Anchoring of Surface Proteins to the Cell Wall of Staphylococcus Aureus. Mol. Microbiol. 2001, 40, 1049–1057.
(20) Ton-That, H.; Mazmanian, S. K.; Alksne, L.; Schneewind, O. Anchoring of Surface Proteins to the Cell Wall of Staphylococcus Aureus. Cysteine 184 and Histidine 120 of Sortase Form a Thiolate-Imidazolium Ion Pair for Catalysis. J. Biol. Chem. 2002, 277, 7447–7452.
(21) Baek, M.; DiMaio, F.; Anishchenko, I.; Dauparas, J.; Ovchinnikov, S.; Lee, G. R.; Wang, J.; Cong, Q.; Kinch, L. N.; Schaeffer, R. D.; et al. Accurate Prediction of Protein Structures and Interactions Using a Three-Track Neural Network. Science 2021, 373, 871–876.
(22) Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A.; et al. Highly Accurate Protein Structure Prediction with AlphaFold. Nature 2021, 596, 583–589.
(23) Abramson, J.; Adler, J.; Dunger, J.; Evans, R.; Green, T.; Pritzel, A.; Ronneberger, O.; Willmore, L.; Ballard, A. J.; Bambrick, J.; et al. Accurate Structure Prediction of Biomolecular Interactions with AlphaFold 3. Nature 2024, 630, 493–500.
(24) Sohlenkamp, C.; Geiger, O. Bacterial Membrane Lipids: Diversity in Structures and Pathways. FEMS Microbiol. Rev. 2016, 40, 133–159.
(25) Hayami, M.; Okabe, A.; Kariyama, R.; Abe, M.; Kanemasa, Y. Lipid Composition of Staphylococcus Aureus and Its Derived L-Forms. Microbiol. Immunol. 1979, 23, 435–442.
(26) Johnson, D. A.; Piper, I. M.; Vogel, B. A.; Jackson, S. N.; Svendsen, J. E.; Kodama, H. M.; Lee, D. E.; Lindblom, K. M.; McCarty, J.; Antos, J. M.; et al. Structures of Streptococcus Pyogenes Class A Sortase in Complex with Substrate and Product Mimics Provide Key Details of Target Recognition. J. Biol. Chem. 2022, 298, 102446.
(27) Antos, J. M.; Chew, G.-L.; Guimaraes, C. P.; Yoder, N. C.; Grotenbreg, G. M.; Popp, M. W.-L.; Ploegh, H. L. Site-Specific N- and C-Terminal Labeling of a Single Polypeptide Using Sortases of Different Specificity. J. Am. Chem. Soc. 2009, 131, 10800–10801.
(28) Hess, G. T.; Guimaraes, C. P.; Spooner, E.; Ploegh, H. L.; Belcher, A. M. Orthogonal Labeling of M13 Minor Capsid Proteins with DNA to Self-Assemble End-to-End Multiphage Structures. ACS Synth. Biol. 2013, 2, 490–496.
(29) Hess, G. T.; Cragnolini, J. J.; Popp, M. W.; Allen, M. A.; Dougan, S. K.; Spooner, E.; Ploegh, H. L.; Belcher, A. M.; Guimaraes, C. P. M13 Bacteriophage Display Framework That Allows Sortase-Mediated Modification of Surface-Accessible Phage Proteins. Bioconjug. Chem. 2012, 23, 1478–1487.
(30) Smeesters, P. R.; McMillan, D. J.; Sriprakash, K. S. The Streptococcal M Protein: A Highly Versatile Molecule. Trends Microbiol. 2010, 18, 275–282.
(31) Galaxy Community. The Galaxy Platform for Accessible, Reproducible and Collaborative Biomedical Analyses: 2022 Update. Nucleic Acids Res. 2022, 50, W345–W351.
(32) Afgan, E.; Baker, D.; Batut, B.; van den Beek, M.; Bouvier, D.; Cech, M.; Chilton, J.; Clements, D.; Coraor, N.; Grüning, B. A.; et al. The Galaxy Platform for Accessible, Reproducible and Collaborative Biomedical Analyses: 2018 Update. Nucleic Acids Res. 2018, 46, W537–W544.
(33) Jurrus, E.; Engel, D.; Star, K.; Monson, K.; Brandi, J.; Felberg, L. E.; Brookes, D. H.; Wilson, L.; Chen, J.; Liles, K.; et al. Improvements to the APBS Biomolecular Solvation Software Suite. Protein Sci. 2018, 27, 112–128.
(34) Lomize, M. A.; Pogozheva, I. D.; Joo, H.; Mosberg, H. I.; Lomize, A. L. OPM Database and PPM Web Server: Resources for Positioning of Proteins in Membranes. Nucleic Acids Res. 2012, 40, D370-6.
(35) Jo, S.; Kim, T.; Iyer, V. G.; Im, W. CHARMM-GUI: A Web-Based Graphical User Interface for CHARMM. J. Comput. Chem. 2008, 29, 1859–1865.
(36) Jo, S.; Kim, T.; Im, W. Automated Builder and Database of Protein/Membrane Complexes for Molecular Dynamics Simulations. PLoS One 2007, 2, e880.
(37) Wu, E. L.; Cheng, X.; Jo, S.; Rui, H.; Song, K. C.; Dávila-Contreras, E. M.; Qi, Y.; Lee, J.; Monje-Galvan, V.; Venable, R. M.; et al. CHARMM-GUI Membrane Builder toward Realistic Biological Membrane Simulations. J. Comput. Chem. 2014, 35, 1997–2004.
(38) Lee, J.; Cheng, X.; Swails, J. M.; Yeom, M. S.; Eastman, P. K.; Lemkul, J. A.; Wei, S.; Buckner, J.; Jeong, J. C.; Qi, Y.; et al. CHARMM-GUI Input Generator for NAMD, GROMACS, AMBER, OpenMM, and CHARMM/OpenMM Simulations Using the CHARMM36 Additive Force Field. J. Chem. Theory Comput. 2016, 12, 405–413.
(39) Best, R. B.; Zhu, X.; Shim, J.; Lopes, P. E. M.; Mittal, J.; Feig, M.; Mackerell, A. D. Optimization of the Additive CHARMM All-Atom Protein Force Field Targeting Improved Sampling of the Backbone φ, ψ and Side-Chain χ(1) and χ(2) Dihedral Angles. J. Chem. Theory Comput. 2012, 8, 3257–3273.
(40) Klauda, J. B.; Venable, R. M.; Freites, J. A.; O’Connor, J. W.; Tobias, D. J.; Mondragon-Ramirez, C.; Vorobyov, I.; MacKerell, A. D.; Pastor, R. W. Update of the CHARMM All-Atom Additive Force Field for Lipids: Validation on Six Lipid Types. J. Phys. Chem. B 2010, 114, 7830–7843.
(41) Berendsen, H. J. C.; van der Spoel, D.; van Drunen, R. GROMACS: A Message-Passing Parallel Molecular Dynamics Implementation. Comput Phys Commun 1995, 91, 43–56.
(42) Berendsen, H. J. C.; Postma, J. P. M.; van Gunsteren, W. F.; DiNola, A.; Haak, J. R. Molecular Dynamics with Coupling to an External Bath. J. Chem. Phys. 1984, 81, 3684.
(43) Parrinello, M. Polymorphic Transitions in Single Crystals: A New Molecular Dynamics Method. J. Appl. Phys. 1981, 52, 7182.
(44) Hoover, W. G. Canonical Dynamics: Equilibrium Phase-Space Distributions. Phys. Rev. A, Gen. Phys. 1985, 31, 1695–1697.
(45) Nosé, S. A Molecular Dynamics Method for Simulations in the Canonical Ensemble. Mol. Phys. 1984, 52, 255–268.
(46) Hess, B.; Bekker, H.; Berendsen, H. J. C.; Fraaije, J. G. E. M. LINCS: A Linear Constraint Solver for Molecular Simulations. J. Comput. Chem. 1997.
(47) PLUMED consortium. Promoting Transparency and Reproducibility in Enhanced Molecular Simulations. Nat. Methods 2019, 16, 670–673.
(48) Tribello, G. A.; Bonomi, M.; Branduardi, D.; Camilloni, C.; Bussi, G. PLUMED 2: New Feathers for an Old Bird. Comput Phys Commun 2014, 185, 604–613.
(49) Eisenhaber, F.; Lijnzaad, P.; Argos, P.; Sander, C.; Scharf, M. The Double Cubic Lattice Method: Efficient Approaches to Numerical Integration of Surface Area and Volume and to Dot Surface Contouring of Molecular Assemblies. J. Comput. Chem. 1995, 16, 273–284.
(50) Litou, Z. I.; Bagos, P. G.; Tsirigos, K. D.; Liakopoulos, T. D.; Hamodrakas, S. J. Prediction of Cell Wall Sorting Signals in Gram-Positive Bacteria with a Hidden Markov Model: Application to Complete Genomes. J Bioinform Comput Biol 2008, 6, 387–401.
(51) Fimereli, D. K.; Tsirigos, K. D.; Litou, Z. I.; Liakopoulos, T. D.; Bagos, P. G.; Hamodrakas, S. J. CW-PRED: A HMM-Based Method for the Classification of Cell Wall-Anchored Proteins of Gram-Positive Bacteria. In Artificial intelligence: theories and applications; Maglogiannis, I.; Plagianakos, V.; Vlahavas, I., Eds.; Lecture notes in computer science; Springer Berlin Heidelberg: Berlin, Heidelberg, 2012; Vol. 7297, pp. 285–290.
(52) Crooks, G. E.; Hon, G.; Chandonia, J. M.; Brenner, S. E. WebLogo: A Sequence Logo Generator. Genome Res. 2004, 14, 1188–1190.
(53) Sievers, F.; Wilm, A.; Dineen, D.; Gibson, T. J.; Karplus, K.; Li, W.; Lopez, R.; McWilliam, H.; Remmert, M.; Söding, J.; et al. Fast, Scalable Generation of High-Quality Protein Multiple Sequence Alignments Using Clustal Omega. Mol. Syst. Biol. 2011, 7, 539.
(54) Madeira, F.; Madhusoodanan, N.; Lee, J.; Eusebi, A.; Niewielska, A.; Tivey, A. R. N.; Lopez, R.; Butcher, S. The EMBL-EBI Job Dispatcher Sequence Analysis Tools Framework in 2024. Nucleic Acids Res. 2024, 52, W521–W525.
(55) Yariv, B.; Yariv, E.; Kessel, A.; Masrati, G.; Chorin, A. B.; Martz, E.; Mayrose, I.; Pupko, T.; Ben-Tal, N. Using Evolutionary Data to Make Sense of Macromolecules with a “Face-Lifted” ConSurf. Protein Sci. 2023, 32, e4582.
(56) Ashkenazy, H.; Abadi, S.; Martz, E.; Chay, O.; Mayrose, I.; Pupko, T.; Ben-Tal, N. ConSurf 2016: An Improved Methodology to Estimate and Visualize Evolutionary Conservation in Macromolecules. Nucleic Acids Res. 2016, 44, W344-50.
(57) Ilangovan, U.; Ton-That, H.; Iwahara, J.; Schneewind, O.; Clubb, R. T. Structure of Sortase, the Transpeptidase That Anchors Proteins to the Cell Wall of Staphylococcus Aureus. Proc. Natl. Acad. Sci. USA 2001, 98, 6056–6061.
(58) Race, P. R.; Bentley, M. L.; Melvin, J. A.; Crow, A.; Hughes, R. K.; Smith, W. D.; Sessions, R. B.; Kehoe, M. A.; McCafferty, D. G.; Banfield, M. J. Crystal Structure of Streptococcus Pyogenes Sortase A: Implications for Sortase Mechanism. J. Biol. Chem. 2009, 284, 6924–6933.
(59) Chan, A. H.; Yi, S. W.; Terwilliger, A. L.; Maresso, A. W.; Jung, M. E.; Clubb, R. T. Structure of the Bacillus Anthracis Sortase A Enzyme Bound to Its Sorting Signal: A Flexible Amino-Terminal Appendage Modulates Substrate Access. J. Biol. Chem. 2015, 290, 25461–25474.
(60) Glinton, K.; Beck, J.; Liang, Z.; Qiu, C.; Lee, S. W.; Ploplis, V. A.; Castellino, F. J. Variable Region in Streptococcal M-Proteins Provides Stable Binding with Host Fibrinogen for Plasminogen-Mediated Bacterial Invasion. J. Biol. Chem. 2017, 292, 6775–6785.
(61) Krogh, A.; Larsson, B.; von Heijne, G.; Sonnhammer, E. L. L. Predicting Transmembrane Protein Topology with a Hidden Markov Model: Application to Complete Genomes. J. Mol. Biol. 2001, 305, 567–580.
(62) Gao, M.; Johnson, D. A.; Piper, I. M.; Kodama, H. M.; Svendsen, J. E.; Tahti, E.; Longshore-Neate, F.; Vogel, B.; Antos, J. M.; Amacher, J. F. Structural and Biochemical Analyses of Selectivity Determinants in Chimeric Streptococcus Class A Sortase Enzymes. Protein Sci. 2022, 31, 701–715.
(63) Piper, I. M.; Struyvenberg, S. A.; Valgardson, J. D.; Johnson, D. A.; Gao, M.; Johnston, K.; Svendsen, J. E.; Kodama, H. M.; Hvorecny, K. L.; Antos, J. M.; et al. Sequence Variation in the β7-β8 Loop of Bacterial Class A Sortase Enzymes Alters Substrate Selectivity. J. Biol. Chem. 2021, 297, 100981.
(64) Kodama, H. M.; Lindblom, K. M.; Walkenhauer, E. G.; Antos, J. M.; Amacher, J. F. Amino Acid Variability at W194 of Staphylococcus Aureus Sortase A Alters Nucleophile Specificity. Protein Sci. 2024, 33, e5212.
(65) Huang, X.; Aulabaugh, A.; Ding, W.; Kapoor, B.; Alksne, L.; Tabei, K.; Ellestad, G. Kinetic Mechanism of Staphylococcus Aureus Sortase SrtA. Biochemistry 2003, 42, 11307–11315.
(66) Marraffini, L. A.; DeDent, A. C.; Schneewind, O. Sortases and the Art of Anchoring Proteins to the Envelopes of Gram-Positive Bacteria. Microbiol. Mol. Biol. Rev. 2006, 70, 192–221.
(67) Freund, C.; Schwarzer, D. Engineered Sortases in Peptide and Protein Chemistry. Chembiochem 2021, 22, 1347–1356.
(68) Schmohl, L.; Bierlmeier, J.; Gerth, F.; Freund, C.; Schwarzer, D. Engineering Sortase A by Screening a Second-Generation Library Using Phage Display. J Pept Sci 2017, 23, 631–635.
(69) Yu, W.; Gillespie, K. P.; Chhay, B.; Svensson, A.-S.; Nygren, P.-Å.; Blair, I. A.; Yu, F.; Tsourkas, A. Efficient Labeling of Native Human IgG by Proximity-Based Sortase-Mediated Isopeptide Ligation. Bioconjug. Chem. 2021, 32, 1058–1066.
(70) Wang, H. H.; Altun, B.; Nwe, K.; Tsourkas, A. Proximity-Based Sortase-Mediated Ligation. Angew. Chem. Int. Ed. 2017, 56, 5349–5352.
(71) Silvius, J. R.; Leventis, R. A Novel “Prebinding” Strategy Dramatically Enhances Sortase-Mediated Coupling of Proteins to Liposomes. Bioconjug. Chem. 2017, 28, 1271–1282.



License: CC-BY-NC-4.0