Results
The spySrtA catalytic domain interacts with the gram-positive bacterial membrane via electrostatic and
hydrophobic interactions. To get a better understanding of the structure of the full-length spySrtA enzyme,
we utilized AlphaFold2 modeling, as described in the Materials and Methods ( Figure 1A). The st ructure
largely agreed with our predictions of the full -length protein; however, we were intrigued by a number of
intramolecular contacts in residues of the spySrtA extracellular domain, which are not typically included in
the catalytic domain constructs that have been utilized for in vitro work (e.g., amino acids 81-249) (Figure
1B). Specifically, we observed multiple hydrophobic (Y49-V94-F110, V51-V94, I59-I95) and polar (N47-
S243, Q50-N106, S55-E17) interactions (Figure 1B). Catalytic domain constructs of sortase A enzymes
were historically determined via similarity in multiple sequence alignment s, which is how the construct
boundaries of spySrtA were defined.57,58
However, there is evidence that
residues N -terminal to the catalytic
domain may play a regulatory role in
SrtA enzymes such as Bacillus
anthracis SrtA.59 Moving forward, we will
refer to the extracellular domain of
spySrtA as amino acids 3 4-249
(34NKPIR… NQVST249) and the catalytic
domain as amino acids 81 -249
(81SVLQA… NQVST249).
We next modeled spySrtA in its
membrane environment, utilizing a lipid
composition of 80% 1,2-dioleoyl-sn-
glycero-3-phosphoglycerol ( DOPG)
and 20% tetraoleoyl-cardiolipin
(TOCL2), based on previous studies of gram-positive bacterial membranes (Figures 1C, S1).24 Insertion of
spySrtA into the lipid bilayer is described in the Materials and Methods. Following generation of our model
Figure 1. AlphaFold models of full -length Streptococcus
pyogenes SrtA (spySrtA) with and without a lipid bilayer. (A) An
AlphaFold2-generated model of full-length spySrtA (residues 1-249)
is shown in cartoon representation and including a transparent
surface. Three regions of the proteins are colo red and labeled. ( B)
Amino acids which may facilitate intraprotein interactions between the
catalytic domain (as commonly used in biochemical studies, residues
81-249) and residues N -terminal to this region are highlighted with
the side chains shown as spheres and colored by heteroatom (O=red,
N=blue, C=marine (for catalytic domain) and C=cyan (for N-terminal
to catalytic domain)). SpySrtA is shown in cartoon. ( C) A full-length
model of spySrtA (gray, cartoon representation) in a lipid bilayer
(lines, colored by heteroatom with C=gray).
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted August 20, 2025. ; https://doi.org/10.1101/2025.08.19.671115doi: bioRxiv preprint
of full-length spySrtA enzyme in a lipid bilayer, we ran triplicate molecular dynamics simulations for 500 ns
to assess sortase-membrane interactions, as described in the Materials and Methods. Overall, the catalytic
domain of spySrtA remained stable during
the course of each simulation (Figure S2).
Contact analyses of specific
spySrtA atoms with the lipid bilayer
revealed a number of interactions in the
catalytic ( extracellular) domain that
frequently occurred duri ng the simulations
(Figure 2). As expected, transmembrane
residues remained embedded in the lipid
bilayer during the entire simulation
(Figures 2A-B). In addition, residues
close to the transmembrane domain were
also frequently associated with the
membrane exterior . Interestingly, there
were also a number of residues not
immediately adjacent to the
transmembrane domain that were
frequently (defined as >50% of the
simulation) in contact with lipid carbon or
oxygen atoms. Th ese included residues
N-terminal to the catalytic domain (S78-
E80) or immediately within it (L83, Q86,
M87), as well as I147 and T148 near the
C-terminus. Some of these residues
appeared to preferentially bind to either
DOPG or TOCL2. Residues near the
Figure 2. Contact map of spySrtA and the lipid bilayer.
Following triplicate 500 ns molecular dynamics simulations, a
contact map was generated to assess the percent (%) of
simulation bound for specific catalytic domain atoms in spySrtA
and lipid groups. ( A) SpySrtA is shown in cartoon
representation and colored in gray. The traditional His-Cys-Arg
catalytic residues are in gold, with side chain atoms shown as
sticks and colored by heteroatom (N=blue, O=red, C=gold). For
amino acids that made specific contacts, the C a atoms are
shown as spheres and colored accor ding to the key. These
colors match the data in ( B). (B) Specific contacts for the
intracellular, transmembrane, and catalytic domain residues of
spySrtA with lipid groups are shown and colored as labeled.
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted August 20, 2025. ; https://doi.org/10.1101/2025.08.19.671115doi: bioRxiv preprint
peptide binding groove (R38, S78 -E80, and L83) bound to the phosphate and/or fatty acid tail of DOPG
whereas residues opposite to the peptide binding groove (T40-L41, R44, N47-K48, and Q86-M87) bound
to polar and/or hydrophobic groups of TOLC2 for >80% of the simulation (Figures 2A-B). Residue-specific
preferences for lipid moieties could support the orientations spyS rtA adopts in the membrane. Taken
together, these data revealed interactions between spySrtA, including the catalytic domain, and the
membrane.
The transmembrane domains of SpySrtA and its endogenous substrate M protein interact specifically and
stably during molecular dynamics simulations. To investigate the tripartite complex of spySrtA, membrane,
and substrate, we created two separate models using the pipeline described above. For both, we chose to
include the sortase recognition motif
initially bound in the active site, in order to
investigate interactions between proteins
and with the membrane in the bound
complex. In future experiments, it would
also be interesting to investigate initial
recognition of sortase for its substrate(s).
In the first model, we used a peptide
substrate (LPSTG, where L=P4, P=P3,
S=P2, T=P1, and G=P1’) to match the
canonical pentapeptide recognition motif
(LPXTG) for sortase enzymes (Figure
3A).2,5,6,10 In addition, t his sequence is
derived from the S. pyogenes M protein, a
virulence factor that is attached to the
bacterial cell surface by sortase-mediated
ligation.3,60 Our second model included a
region of the M protein containing both the
Figure 3. AlphaFold models of full -length s pySrtA with
substrate inserted into a lipid bilayer. Output models of spySrtA
with an LPSTG peptide ( A) or extended M protein sequence ( B)
are shown with the spySrtA protein in gray surface representation.
For all, the lipid bilayer is shown as lines and colored by
heteroatom (C=gray, O=red, N=blue). (A) The LPSTG peptide is
in yellow spheres. (B) The extended M protein model is in cartoon
for the intracellular and tr ansmembrane domains, and stick
representation colored by heteroatom for the extracellular domain
(C=yellow), which includes the LPSTG motif . (C) The predicted
transmembrane domain residues are highlighted as spheres and
colored by heteroatom (spySrtA: C=gr ay, M protein: C=yellow).
The extracellular domains are shown in cartoon representation
and colored as labeled.
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted August 20, 2025. ; https://doi.org/10.1101/2025.08.19.671115doi: bioRxiv preprint
LPSTG sequence and its C -terminal transmembrane domain. Because there is reported variability in M
protein extracellular sequences, th e protein is very large, and there is no evidence to our knowledge of
interactions between the spySrtA catalytic domain and other regions of the substrate, we restricted our
model to M protein residues 376 -415, which included five residues before the start of the target LPSTG
motif (Figure 3B). The prediction algorithm TMHMM-2.0 was used to predict the transmembrane domain
of M protein, suggesting this domain is
residues 387 -409.61 Our model was
largely consistent with this result,
although it suggested a more accurate
transmembrane domain excludes T387
(Figure 3C ). For spySrtA, TMHMM-2.0
predicts the transmembrane domain to
contain residues 13-32. Again, our model
largely agreed, with the addition of F33
and N34 (which interacted with the polar
head groups of the lipid molecules)
(Figure 3C).
We ran triplicate 500 ns
molecular dynamics simulations with
each of these spySrtA -substrate-
membrane complexes, as described in the
Materials
and Methods. Again, we
observed that all components were stable
throughout each simulation, with minimal
variability, as measured using relative
root-mean-square deviation over time and root-mean-square fluctuation by residue calculations (Figures
S3-4). This is also apparent in structural alignment of 20 states from an example simulation, with each
structure (membrane not shown) representing a state every 25 ns of simulation time (Figure 4). The largest
Figure 4. Molecular dynamics simulations reveal stable
substrate binding to spySrtA. The results of one simulation
replicate are shown for spySrtA-LPSTG (A) and spySrtA-M protein
(B). Output states corresponding to Dt=25 ns (21 states total,
including t=0) are aligned and shown. The lipid bilayer is not
shown for clarity (although it was present in all simulations).
SpySrtA is in gray cartoon. The intracellular and transmembrane
domain of M protein is in yellow cartoon (B). All other peptide or M
protein (LPSTG or KRQLPST, respectively) residues are shown
as yellow sticks and colored by heteroatom (N=blue, O=red,
C=yellow). The insets show a zoomed-in version of the interaction
and are rendered similarly.
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted August 20, 2025. ; https://doi.org/10.1101/2025.08.19.671115doi: bioRxiv preprint
variability is seen for the five amino acids preceding the LPSTG motif in M protein, which was not surprising
as these residues are not expected to specifically interact with spySrtA (Figure 4B). Furthermore, the
relative stability of the atoms in the LPSTG sequence in both simulations was similar to what we observed
in previous 1000 ns spySrtA 81-249-LPATA molecular dynamics simulations, using our experimental
structures.26 The distance between the thiol of the catalytic C208 and P1 carbonyl carbon, the site of
nucleophilic attack, was also stabilized by the presence of additional M protein residues, as visualized by a
shift in the distance distribution (Figure S5). For example, in our triplicate simulations of spySrtA-M protein-
membrane (defined as T1, T2, and T3), the distance distribution between these atoms was centered around
3.8 Å for T1 and T2, but closer to 5 Å for T3. For reference, we observed a probability distribution centered
at 3.8 Å previously in our spySrtA 81-249-LPATA simulations.26 However, in all three simulations for the
spySrtA-LPSTG-membrane system, we saw a bimodal distribution (centered at the 3.8 and 5 Å distances)
(Figure S5). There is no clear reasoning for this discrepancy, although we predict that this may reflect the
peptide and/or substrate sampling both a catalytically competent bound state and an unreactive partially
bound state. With M protein, the averaged ratio favors the closer or ‘bound’ state, with relative probabilities
of roughly 0.25:0.15 for the peak maxima of 3.8 Å:5 Å, or ‘bound’:’partially bound’ distances. Conversely,
for the LPSTG peptide simulations, this ratio is flipped , at roughly 0.2:0.3 for the peaks corresponding to
the 3.8 Å:5 Å distances (Figure S5). Notably, the peptide does stay stably bound despite some fluctuations
in this distance (Figure 4).
The spySrtA -M protein-membrane model and simulations also revealed a relatively stable
transmembrane domain interaction between the two proteins. This interaction persists throughout the
entirety of each replic ate simulation, although the specific residues that maintain contact varies (Figure 5).
Here, we visually analyzed amino acids oriented towards each other at t=0 ns of each simulation, including
Leu20, Ile21, Leu24, Gly28, and Leu31 in spySrtA and Phe392, Ala395, Ala396, Val399, and Ala403 in M
protein, for the t=0, 250, and 500 ns states ( Figure 5). The transmembrane domains of these proteins
remain associated in all replicates, with the highest degree of dissociation at t=500 ns for simulation T1.
Furthermore, in the t=500 ns state, interacting residues differ for T1 (Leu31-Ala395), T2 (Leu24-Ala403 and
Leu31-Ala396), and T3 (Leu 31-Phe392, Leu 24-Val399, and Ile 21-Ala403) ( Figure 5 ). Overall, these
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted August 20, 2025. ; https://doi.org/10.1101/2025.08.19.671115doi: bioRxiv preprint
simulations suggest that the
transmembrane domain
interaction between spySrtA
and M protein m ay be
multivalent, likely reflecting the
dynamic nature of both the
membrane and proteins
themselves.
Specific residues in spySrtA
and M protein are buried upon
substrate binding. We wanted
to further analyze substrate
recognition by the catalytic
domain of spySrtA in the
context of the full -length
proteins. When we analyze d
the change in solvent
accessible surface area
(DSASA), defined as
SASAbound - SASAunbound (in Å 2), we see that the P4 Leu and P1 Thr in the M protein fragment are
substantially buried (-DSASA > 10 Å2) upon substrate binding in the triplicate simulations (Figures 6A-B).
In addition, we saw that for one of the replicates, the largest -DSASA value observed was for the P2’ Glu.
For the other two replicates, -DSASA for the P2’ Glu was second to only the P4 Leu, a position previously
described as binding in a specific hydrophobic pocket (Figure 6B).26 This observation will be discussed in
detail below. Other substrate residues with relatively large -DSASA values include the P6 Arg (RQLPSTGE,
Arg in bold), P3 Pro, P2 Ser, P1’ Gly, and other positions within the transmembrane domain (Figure 6B).
Figure 5. Structural trajectories of spySrtA and M protein
transmembrane domain interactions. Specific states (corresponding to
t=0, 250, 500 ns) are shown for each rep licate simulation, T1 ( A), T2 ( B),
and T3 (C). The lipid bilayer is not shown for clarity although is present in all
simulations. SpySrtA is shown as gray cartoon and M protein as yellow
cartoon. Amino acid sidechains that are oriented towards each other in the
transmembrane helices are shown as spheres and labeled for all.
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted August 20, 2025. ; https://doi.org/10.1101/2025.08.19.671115doi: bioRxiv preprint
For the spySrtA enzyme, in addition to the expected positions required for enzymatic activity (H142-
C208-R216), we also observed other residues within the
catalytic domain with a similar -DSASA, including E189, V191,
and I211 (Figure 6A). These positions may bind and stabilize
substrate binding outside the LPSTG recognition motif, for
example at the N-terminal KRQ and C-terminal ET positions (M
protein sequence = KRQ LPSTGET) (Figure 6C ). Taken
together, these results suggested there may be s pecific
interactions between spySrtA and M protein beyond the
canonical LPXTG recognition motif for class A sortases.
Two additional spySrtA residues ( Q72 and P76 ) that
are not in the transmembrane domain also exhibited relatively
large -DSASA values in the substrate bound models ( Figure
6A). These two residues also appear to interact directly with the
M protein substrate in or near the recognition motif, at either the
P5 Gln (with Q72) or the P2 Ser and P1’ Gly (with P76)
positions ( Figure 6C ). These int eractions are present in all
three replicate simulations, with average distances between Ca
atoms equal to: Q72-P5 Gln = 7.3 ± 0.5 Å (T1), 7.4 ± 0.6 Å (T2),
and 7.3 ± 0.5 Å (T3), and P76-P1’ Gly = 6.1 ± 0.7 Å (T1), 4.8 ±
0.3 Å (T2), and 6.3 ± 0.7 Å (T3).
The extracellular domain (amino acids 34 -249) of s pySrtA
interacts with its endogenous substrate M protein at positions
beyond the canonical pentapeptide recognition motif. To
complement the analysis of our AlphaFold models, we also
used our spySrtA 1-249-M protein376-415 molecular dynamics
simulations to investigate interactions in residues adjacent to
Figure 6. Residues outside the
canonical pentapeptide recognition
motif interact with the catalytic domain
of spySrtA. Analysis of the change in
solvent accessible surface area (SASA)
between the bound and unbound
AlphaFold models ( DSASA) reveal
several amino acids that become buried
upon substrate binding, defined as a
relatively large -DSASA are highlighted
and labeled, for spySrtA (A) and M protein
(B). ( C) Predicted interactions at amino
acids N -terminal (KRQ) and C -terminal
(ET) to the LPSTG pentapeptide
recognition motif are highlighted in
spySrtA (gray cartoon) as side chain
spheres and colored by heteroatom
(C=gray, O=red, N=blue). The
KRQLPSTGET sequence of M protein is
shown as spheres and colored by
heteroatom (C=yellow), with other amino
acids as a yellow cartoon. M protein
numbering is based on the full -length
protein.
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted August 20, 2025. ; https://doi.org/10.1101/2025.08.19.671115doi: bioRxiv preprint
the LPXTG substrate recognition motif. In our triplicate simulations, contact map analysis revealed
interactions at amino acids both N- and C-terminal to the M protein LPSTG sequence (Figure 7A). At the
N-terminus, these data confirmed interactions between Q72, E189 and V191 with the P6 -P5 RQ
(RQLPSTG) positions, as also highlighted above in our DSASA analysis. A potential role for backbone
atoms of F71 and P188 was also identified ( Figure 7A). We observed even more persistent interactions
adjacent to the C -terminus of LPSTG, with both specific backbone and side -chain contacts between the
P2’ Glu (LPSTGE) and several spySrtA residues. Most notably, electrostatic interactions with the side-chain
atoms of K35 and
R38 in spySrtA were
present throughout
each simulation
(Figure 7B).
Multiple
sequence alignment
of spySrtA plus 27
additional
Streptococcus SrtA
proteins indicates
that these Lys and
Arg positions are
relatively well
conserved.
Nineteen of the 28
sequences contain
a Lys in the
equivalent K35
position, with the
other sequences
Figure 7. Additional specific interactions are identified between spySrtA and the
P2’ Glu in M protein. (A) Contact map of spySrtA amino acids and either the KRQ or ET
residues of M protein, in the sequence KRQLPSTGET. “Hydrophilic” refers to side chain
atoms. (B) Initial and final states from the T1, T2, and T3 simulations (t=0 and 500 ns)
highlighting persistent interactions between K35 and R38 spySrtA with the P2’ Glu in M
protein. M protein is shown as yellow cartoon with the P2’ Glu side chain as sticks and
colored by heteroatom (C=yellow, O=red, N=blue). SpySrtA is shown as gray cartoon
with the K35 and R38 side chain atoms as sticks and colored by heteroatom (C=gray).
Relevant distances are labeled. ( C) Sequence logo ( WebLogo) of 24 predicted
endogenous substr ates of spySrtA confirms conservation at the P2’ position for a
negatively-charged amino acid (either D or E). ( D) ConSurf analysis with 28
Streptococcus SrtA sequnces reveals that the K35 and R38 positions are generally
conserved (left), although this is not true for 300 SrtA sequences from a broader range of
bacterial species (middle). M protein conservation is also highlighted (right). For all, the
proteins are shown in cartoon representation with relevant C a atoms as spheres. The
conservation scale is shown and labeled.
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted August 20, 2025. ; https://doi.org/10.1101/2025.08.19.671115doi: bioRxiv preprint
containing either Ser or Thr polar residues (Figure S6). Twenty-five of the 28 sequences contain an Arg in
the equivalent R38 position (Figure S6). Conservation is also reflected in spySrtA substrate sequences; a
sequence logo of 24 predicted spySrtA endogenous substrates reveals that a negative amino acid (either
Glu or Asp) in the P2’ position is highly conserved (Figure 7C).
We also used Consurf to investigate the evolutionary conservation of these residues visually
(Figure 7D).55,56 When we limited our analysis to the 28 Streptococcus SrtA proteins, again, the relatively
high conservation of residues, e.g., K35 and R38 in spySrtA, were apparent (black arrow in left panel of
Figure 7D); however, when applied to 300 SrtA sequences (middle panel in Figure 7D), this was not
conserved. For examp le, sequence alignment of spySrtA with Staphylococcus aureus SrtA (UniProt ID
SRTA_STAA8) revealed that while the Lys is conserved (K26 in S. aureus SrtA), the residue in the
equivalent Arg position is D30. To our knowledge, these types of interactions have not been explored in the
literature, and it remains unclear whether they have a significant impact on spySrtA activity. In the first step
of sortase-mediated catalysis, the substrate is cleaved between the P1/P1’ positions (LPST/GE), and initial
binding of the substrate is potentially facilitated by interactions outside of the standard LPXTG motif .2,9,10
Additional experimentation will be necessary to probe these interactions further, and to understand the
mechanistic implications of these observations.
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted August 20, 2025. ; https://doi.org/10.1101/2025.08.19.671115doi: bioRxiv preprint
References
(1) Hook, M.; Foster, T. J. Editorial: Cell Surface Proteins of Gram-Positive Pathogenic Bacteria. Front.
Microbiol. 2021, 12, 681880.
(2) Spirig, T.; Weiner, E. M.; Clubb, R. T. Sortase Enzymes in Gram-Positive Bacteria. Mol. Microbiol.
2011, 82, 1044–1059.
(3) Fischetti, V. A. M Protein and Other Surface Proteins on Streptococcus Pyogenes. In Streptococcus
pyogenes: Basic Biology to Clinical Manifestations; Ferretti, J. J.; Stevens, D. L.; Fischetti, V. A.,
Eds.; University of Oklahoma Health Sciences Center: Oklahoma City (OK), 2022.
(4) Schneewind, O.; Model, P.; Fischetti, V. A. Sorting of Protein A to the Staphylococcal Cell Wall. Cell
1992, 70, 267–281.
(5) Ton-That, H.; Liu, G.; Mazmanian, S. K.; Faull, K. F.; Schneewind, O. Purification and
Characterization of Sortase, the Transpeptidase That Cleaves Surface Proteins of Staphylococcus
Aureus at the LPXTG Motif. Proc. Natl. Acad. Sci. USA 1999, 96, 12424–12429.
(6) Mazmanian, S. K.; Liu, G.; Ton -That, H.; Schneewind, O. Staphylococcus Aureus Sortase, an
Enzyme That Anchors Surface Proteins to the Cell Wall. Science 1999, 285, 760–763.
(7) Raz, A.; Fischetti, V . A. Sortase A Localizes to Distinct Foci on the Streptococcus Pyogenes
Membrane. Proc. Natl. Acad. Sci. USA 2008, 105, 18549–18554.
(8) Kline, K. A.; Kau, A. L.; Chen, S. L.; Lim, A.; Pinkner, J. S.; Rosch, J.; Nallapareddy, S. R.; Murray,
B. E.; Henriques-Normark, B.; Beatty, W.; et al. Mechanism for Sortase Localization and the Role
of Sortase Localization in Efficient Pilus Assembly in Enterococcus Faecalis. J. Bacteriol. 2009,
191, 3237–3247.
(9) Jacobitz, A. W.; Kattke, M. D.; Wereszczynski, J.; Clubb, R. T. Sortase Transpeptidases: Structural
Biology and Catalytic Mechanism. Adv. Protein Chem. Struct. Biol. 2017, 109, 223–264.
(10) Amacher, J. F.; Antos, J. M. Sortases: Structure, Mechanism, and Implications for Protein
Engineering. Trends Biochem. Sci. 2024, 49, 596–610.
(11) Chen, J. -L.; Wang, X.; Yang, F.; Li, B.; Otting, G.; Su, X. -C. 3D Structure of the Transient
Intermediate of the Enzyme–substrate Complex of Sortase A Reveals How Calcium Binding and
Substrate Recognition Cooperate in Substrate Activation. ACS Catal. 2023, 13, 11610–11624.
(12) Tian, B. -X.; Eriksson, L. A. Catalytic Mechanism and Roles of Arg197 and Thr183 in the
Staphylococcus Aureus Sortase A Enzyme. J. Phys. Chem. B 2011, 115, 13003–13011.
(13) Antos, J. M.; Truttmann, M. C.; Ploegh, H. L. Recent Advances in Sortase -Catalyzed Ligation
Methodology. Curr. Opin. Struct. Biol. 2016, 38, 111–118.
(14) Morgan, H. E.; Turnbull, W. B.; Webb, M. E. Challenges in the Use of Sortase and Other Peptide
Ligases for Site-Specific Protein Modification. Chem. Soc. Rev. 2022, 51, 4121–4145.
(15) Chen, I.; Dorr, B. M.; Liu, D. R. A General Strategy for the Evolution of Bond -Forming Enzymes
Using Yeast Display. Proc. Natl. Acad. Sci. USA 2011, 108, 11399–11404.
(16) Podracky, C. J.; An, C.; DeSousa, A.; Dorr, B. M.; Walsh, D. M.; Liu, D. R. Laboratory Evolution of
a Sortase Enzyme That Modifies Amyloid-β Protein. Nat. Chem. Biol. 2021, 17, 317–325.
(17) Perry, A. M.; Ton-That, H.; Mazmanian, S. K.; Schneewind, O. Anchoring of Surface Proteins to the
Cell Wall of Staphylococcus Aureus. III. Lipid II Is an in Vivo Peptidoglycan Substrate for Sortase-
Catalyzed Surface Protein Anchoring. J. Biol. Chem. 2002, 277, 16241–16248.
(18) Mazmanian, S. K.; Ton -That, H.; Su, K.; Schneewind, O. An Iron -Regulated Sortase Anchors a
Class of Surface Protein during Staphylococcus Aureus Pathogenesis. Proc. Natl. Acad. Sci. USA
2002, 99, 2293–2298.
(19) Mazmanian, S. K.; Ton-That, H.; Schneewind, O. Sortase-Catalysed Anchoring of Surface Proteins
to the Cell Wall of Staphylococcus Aureus. Mol. Microbiol. 2001, 40, 1049–1057.
(20) Ton-That, H.; Mazmanian, S. K.; Alksne, L.; Schneewind, O. Anchoring of Surface Proteins to the
Cell Wall of Staphylococcus Aureus. Cysteine 184 and Histidine 120 of Sortase Form a Thiolate -
Imidazolium Ion Pair for Catalysis. J. Biol. Chem. 2002, 277, 7447–7452.
(21) Baek, M.; DiMaio, F.; Anishchenko, I.; Dauparas, J.; Ovchinnikov, S.; Lee, G. R.; Wang, J.; Cong,
Q.; Kinch, L. N.; Schaeffer, R. D.; et al. Accurate Prediction of Protein Structures and Interactions
Using a Three-Track Neural Network. Science 2021, 373, 871–876.
(22) Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.;
Bates, R.; Žídek, A.; Potapenko, A.; et al. Highly Accurate Protein Structure Prediction with
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted August 20, 2025. ; https://doi.org/10.1101/2025.08.19.671115doi: bioRxiv preprint
AlphaFold. Nature 2021, 596, 583–589.
(23) Abramson, J.; Adler, J.; Dunger, J.; Evans, R.; Green, T.; Pritzel, A.; Ronneberger, O.; Willmore,
L.; Ballard, A. J.; Bambrick, J.; et al. Accurate Structure Prediction of Biomolecular Interactions with
AlphaFold 3. Nature 2024, 630, 493–500.
(24) Sohlenkamp, C.; Geiger, O. Bacterial Membrane Lipids: Diversity in Structures and Pathways.
FEMS Microbiol. Rev. 2016, 40, 133–159.
(25) Hayami, M.; Okabe, A.; Kariyama, R.; Abe, M.; Kanemasa, Y. Lipid Composition of Staphylococcus
Aureus and Its Derived L-Forms. Microbiol. Immunol. 1979, 23, 435–442.
(26) Johnson, D. A.; Piper, I. M.; Vogel, B. A.; Jackson, S. N.; Svendsen, J. E.; Kodama, H. M.; Lee, D.
E.; Lindblom, K. M.; McCarty, J.; Antos, J. M.; et al. Structures of Streptococcus Pyogenes Class A
Sortase in Complex with Substrate and Product Mimics Provide Key Details of Target Recognition.
J. Biol. Chem. 2022, 298, 102446.
(27) Antos, J. M.; Chew, G. -L.; Guimaraes, C. P.; Yoder, N. C.; Grotenbreg, G. M.; Popp, M. W. -L.;
Ploegh, H. L. Site-Specific N- and C-Terminal Labeling of a Single Polypeptide Using Sortases of
Different Specificity. J. Am. Chem. Soc. 2009, 131, 10800–10801.
(28) Hess, G. T.; Guimaraes, C. P.; Spooner, E.; Ploegh, H. L.; Belcher, A. M. Orthogonal Labeling of
M13 Minor Capsid Proteins with DNA to Self -Assemble End-to-End Multiphage Structures. ACS
Synth. Biol. 2013, 2, 490–496.
(29) Hess, G. T.; Cragnolini, J. J.; Popp, M. W.; Allen, M. A.; Dougan, S. K.; Spooner, E.; Ploegh, H. L.;
Belcher, A. M.; Guimaraes, C. P. M13 Bacteriophage Display Framework That Allows Sortase -
Mediated Modification of Surface-Accessible Phage Proteins. Bioconjug. Chem. 2012, 23, 1478–
1487.
(30) Smeesters, P. R.; McMillan, D. J.; Sriprakash, K. S. The Streptococcal M Protein: A Highly Versatile
Molecule. Trends Microbiol. 2010, 18, 275–282.
(31) Galaxy Community. The Galaxy Platform for Accessible, Reproduci ble and Collaborative
Biomedical Analyses: 2022 Update. Nucleic Acids Res. 2022, 50, W345–W351.
(32) Afgan, E.; Baker, D.; Batut, B.; van den Beek, M.; Bouvier, D.; Cech, M.; Chilton, J.; Clements, D.;
Coraor, N.; Grüning, B. A.; et al. The Galaxy Platfor m for Accessible, Reproducible and
Collaborative Biomedical Analyses: 2018 Update. Nucleic Acids Res. 2018, 46, W537–W544.
(33) Jurrus, E.; Engel, D.; Star, K.; Monson, K.; Brandi, J.; Felberg, L. E.; Brookes, D. H.; Wilson, L.;
Chen, J.; Liles, K.; et al. Improvements to the APBS Biomolecular Solvation Software Suite. Protein
Sci. 2018, 27, 112–128.
(34) Lomize, M. A.; Pogozheva, I. D.; Joo, H.; Mosberg, H. I.; Lomize, A. L. OPM Database and PPM
Web Server: Resources for Positioning of Proteins in Membra nes. Nucleic Acids Res. 2012, 40,
D370-6.
(35) Jo, S.; Kim, T.; Iyer, V. G.; Im, W. CHARMM -GUI: A Web -Based Graphical User Interface for
CHARMM. J. Comput. Chem. 2008, 29, 1859–1865.
(36) Jo, S.; Kim, T.; Im, W. Automated Builder and Database of Protein/Membrane Complexes for
Molecular Dynamics Simulations. PLoS One 2007, 2, e880.
(37) Wu, E. L.; Cheng, X.; Jo, S.; Rui, H.; Song, K. C.; Dávila -Contreras, E. M.; Qi, Y.; Lee, J.; Monje-
Galvan, V.; Venable, R. M.; et al. CHARMM -GUI Membrane Builder toward Realistic Biological
Membrane Simulations. J. Comput. Chem. 2014, 35, 1997–2004.
(38) Lee, J.; Cheng, X.; Swails, J. M.; Yeom, M. S.; Eastman, P. K.; Lemkul, J. A.; Wei, S.; Buckner, J.;
Jeong, J. C.; Qi, Y.; et al. CHARMM -GUI Input Generator for NAMD, GROMACS, AMBER,
OpenMM, and CHARMM/OpenMM Simulations Using the CHARMM36 Additive Force Field. J.
Chem. Theory Comput. 2016, 12, 405–413.
(39) Best, R. B.; Zhu, X.; Shim, J.; Lopes, P. E. M.; Mittal, J.; Feig, M.; Mackerell, A. D. Optimization of
the Additive CHARMM All-Atom Protein Force Field Targeting Improved Sampling of the Backbone
φ, ψ and Side-Chain χ(1) and χ(2) Dihedral Angles. J. Chem. Theory Comput. 2012, 8, 3257–3273.
(40) Klauda, J. B.; Venable, R. M.; Freites, J. A.; O’Connor, J. W.; Tobias, D. J.; Mondragon -Ramirez,
C.; Vorobyov, I.; MacKerell, A. D.; Pastor, R. W. Update of the CHARMM All-Atom Additive Force
Field for Lipids: Validation on Six Lipid Types. J. Phys. Chem. B 2010, 114, 7830–7843.
(41) Berendsen, H. J. C.; van der Spoel, D.; van Drunen, R. GROMACS: A Message -Passing Parallel
Molecular Dynamics Implementation. Comput Phys Commun 1995, 91, 43–56.
(42) Berendsen, H. J. C.; Postma, J. P. M.; van Gunsteren, W. F.; DiN ola, A.; Haak, J. R. Molecular
Dynamics with Coupling to an External Bath. J. Chem. Phys. 1984, 81, 3684.
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted August 20, 2025. ; https://doi.org/10.1101/2025.08.19.671115doi: bioRxiv preprint
(43) Parrinello, M. Polymorphic Transitions in Single Crystals: A New Molecular Dynamics Method. J.
Appl. Phys. 1981, 52, 7182.
(44) Hoover, W. G. Canonical Dynamics: Equilibrium Phase -Space Distributions. Phys. Rev. A, Gen.
Phys. 1985, 31, 1695–1697.
(45) Nosé, S. A Molecular Dynamics Method for Simulations in the Canonical Ensemble. Mol. Phys.
1984, 52, 255–268.
(46) Hess, B.; Bekker, H.; Berendsen, H. J. C.; Fraaije, J. G. E. M. LINCS: A Linear Constraint Solver
for Molecular Simulations. J. Comput. Chem. 1997.
(47) PLUMED consortium. Promoting Transparency and Reproducibility in Enhanced Molecular
Simulations. Nat. Methods 2019, 16, 670–673.
(48) Tribello, G. A.; Bonomi, M.; Branduardi, D.; Camilloni, C.; Bussi, G. PLUMED 2: New Feathers for
an Old Bird. Comput Phys Commun 2014, 185, 604–613.
(49) Eisenhaber, F.; Lijnzaad, P.; Argos, P.; Sander, C.; Scharf, M. The Double Cubic La ttice Method:
Efficient Approaches to Numerical Integration of Surface Area and Volume and to Dot Surface
Contouring of Molecular Assemblies. J. Comput. Chem. 1995, 16, 273–284.
(50) Litou, Z. I.; Bagos, P. G.; Tsirigos, K. D.; Liakopoulos, T. D.; Hamodra kas, S. J. Prediction of Cell
Wall Sorting Signals in Gram -Positive Bacteria with a Hidden Markov Model: Application to
Complete Genomes. J Bioinform Comput Biol 2008, 6, 387–401.
(51) Fimereli, D. K.; Tsirigos, K. D.; Litou, Z. I.; Liakopoulos, T. D.; Bagos, P. G.; Hamodrakas, S. J. CW-
PRED: A HMM -Based Method for the Classification of Cell Wall -Anchored Proteins of Gram -
Positive Bacteria. In Artificial intelligence: theories and applications; Maglogiannis, I.; Plagianakos,
V.; Vlahavas, I., Eds.; Lecture notes in computer science; Springer Berlin Heidelberg: Berlin,
Heidelberg, 2012; Vol. 7297, pp. 285–290.
(52) Crooks, G. E.; Hon, G.; Chandonia, J. M.; Brenner, S. E. WebLogo: A Sequence Logo Generator.
Genome Res. 2004, 14, 1188–1190.
(53) Sievers, F.; Wilm, A.; Dineen, D.; Gibson, T. J.; Karplus, K.; Li, W.; Lopez, R.; McWilliam, H.;
Remmert, M.; Söding, J.; et al. Fast, Scalable Generation of High-Quality Protein Multiple Sequence
Alignments Using Clustal Omega. Mol. Syst. Biol. 2011, 7, 539.
(54) Madeira, F.; Madhusoodanan, N.; Lee, J.; Eusebi, A.; Niewielska, A.; Tivey, A. R. N.; Lopez, R.;
Butcher, S. The EMBL-EBI Job Dispatcher Sequence Analysis Tools Framework in 2024. Nucleic
Acids Res. 2024, 52, W521–W525.
(55) Yariv, B.; Yariv, E.; Kessel, A.; Masrati, G.; Chorin, A. B.; Martz, E.; Mayrose, I.; Pupko, T.; Ben -
Tal, N. Using Evolutionary Data to Make Sense of Macromolecules with a “Face -Lifted” ConSurf.
Protein Sci. 2023, 32, e4582.
(56) Ashkenazy, H.; Abadi, S.; Martz, E.; Chay, O.; Mayrose, I.; Pupko, T.; Ben -Tal, N. ConSurf 2016:
An Improved Methodology to Estimate and Visualize Evolutionary Conservation in Macromolecules.
Nucleic Acids Res. 2016, 44, W344-50.
(57) Ilangovan, U.; Ton-That, H.; Iwahara, J.; Sch neewind, O.; Clubb, R. T. Structure of Sortase, the
Transpeptidase That Anchors Proteins to the Cell Wall of Staphylococcus Aureus. Proc. Natl. Acad.
Sci. USA 2001, 98, 6056–6061.
(58) Race, P. R.; Bentley, M. L.; Melvin, J. A.; Crow, A.; Hughes, R. K.; S mith, W. D.; Sessions, R. B.;
Kehoe, M. A.; McCafferty, D. G.; Banfield, M. J. Crystal Structure of Streptococcus Pyogenes
Sortase A: Implications for Sortase Mechanism. J. Biol. Chem. 2009, 284, 6924–6933.
(59) Chan, A. H.; Yi, S. W.; Terwilliger, A. L.; Maresso, A. W.; Jung, M. E.; Clubb, R. T. Structure of the
Bacillus Anthracis Sortase A Enzyme Bound to Its Sorting Signal: A FLEXIBLE AMINO-TERMINAL
APPENDAGE MODULATES SUBSTRATE ACCESS. J. Biol. Chem. 2015, 290, 25461–25474.
(60) Glinton, K.; Beck, J.; Liang, Z.; Qiu, C.; Lee, S. W.; Ploplis, V. A.; Castellino, F. J. Variable Region
in Streptococcal M -Proteins Provides Stable Binding with Host Fibrinogen for Plasminogen -
Mediated Bacterial Invasion. J. Biol. Chem. 2017, 292, 6775–6785.
(61) Krogh, A.; Larsson, B.; von Heijne, G.; Sonnhammer, E. L. L. Predicting Transmembrane Protein
Topology with a Hidden Markov Model: Application to Complete Genomes. J. Mol. Biol. 2001, 305,
567–580.
(62) Gao, M.; Johnson, D. A.; Piper, I. M.; Kodama, H. M.; Svendsen, J. E.; Tahti, E.; Longshore-Neate,
F.; Vogel, B.; Antos, J. M.; Amacher, J. F. Structural and Biochemical Analyses of Selectivity
Determinants in Chimeric Streptococcus Class A Sortase Enzymes. Protein Sci. 2022, 31, 701–
715.
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted August 20, 2025. ; https://doi.org/10.1101/2025.08.19.671115doi: bioRxiv preprint
(63) Piper, I. M. ; Struyvenberg, S. A.; Valgardson, J. D.; Johnson, D. A.; Gao, M.; Johnston, K.;
Svendsen, J. E.; Kodama, H. M.; Hvorecny, K. L.; Antos, J. M.; et al. Sequence Variation in the Β7-
Β8 Loop of Bacterial Class A Sortase Enzymes Alters Substrate Selectivity. J. Biol. Chem. 2021,
297, 100981.
(64) Kodama, H. M.; Lindblom, K. M.; Walkenhauer, E. G.; Antos, J. M.; Amacher, J. F. Amino Acid
Variability at W194 of Staphylococcus Aureus Sortase A Alters Nucleophile Specificity. Protein Sci.
2024, 33, e5212.
(65) Huang, X.; Aulabaugh, A.; Ding, W.; Kapoor, B.; Alksne, L.; Tabei, K.; Ellestad, G. Kinetic
Mechanism of Staphylococcus Aureus Sortase SrtA. Biochemistry 2003, 42, 11307–11315.
(66) Marraffini, L. A.; DeDent, A. C.; Schneewind, O. Sortases and the Art of Anchoring Proteins to the
Envelopes of Gram-Positive Bacteria. Microbiol. Mol. Biol. Rev. 2006, 70, 192–221.
(67) Freund, C.; Schwarzer, D. Engineered Sortases in Peptide and Protein Chemistry. Chembiochem
2021, 22, 1347–1356.
(68) Schmohl, L.; Bierlmeier, J.; Gerth, F.; Freund, C.; Schwarzer, D. Engineering Sortase A by
Screening a Second-Generation Library Using Phage Display. J Pept Sci 2017, 23, 631–635.
(69) Yu, W.; Gillespie, K. P.; Chhay, B.; Svensson, A.-S.; Nygren, P.-Å.; Blair, I. A.; Yu, F.; Tsourkas, A.
Efficient Labeling of Native Human IgG by Proximity-Based Sortase-Mediated Isopeptide Ligation.
Bioconjug. Chem. 2021, 32, 1058–1066.
(70) Wang, H. H.; Altun, B.; Nwe, K.; Tsourkas, A. Proximity-Based Sortase-Mediated Ligation. Angew.
Chem. Int. Ed. 2017, 56, 5349–5352.
(71) Silvius, J. R.; Leventis, R. A Novel “Prebinding” Strategy Dramatically Enhances Sortase-Mediated
Coupling of Proteins to Liposomes. Bioconjug. Chem. 2017, 28, 1271–1282.
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted August 20, 2025. ; https://doi.org/10.1101/2025.08.19.671115doi: bioRxiv preprint