Results
Identification of the pes BGC
During our genome mining efforts for novel RiPP BGCs encoding MNIOs, a BGC
(pes cluster) from Peribacillus simplex VanAntwerpen02 was identified. The BGC
encodes the αKG-HExxH protein PesO, two precursor peptides PesA1/A2, a n MNIO
enzyme PesH, a putative MNIO partner protein PesI, a n AS-like enzyme PesC, a
hypothetical protein PesX, two M16 peptidases PesP1/P2, and a transporter protein PesT
(Figure 1a). Unlike previously reported MNIO enzymes which require a partner protein
that is encoded adjacently within the BGC ,25,27,30,33-36 the putative MNIO partner protein
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 14, 2026. ; https://doi.org/10.64898/2026.03.11.711119doi: bioRxiv preprint
4
gene pesI is positioned next to pesX. Initial bioinformatic analysis of PesX suggested that
it is a hypothetical protein from an unknown protein family, but structural analysis with
AlphaFold 337 as well as a Foldseek38 search revealed that PesX likely possesses a
triose-phosphate isomerase (TIM)-barrel fold like MNIOs and shares structural similarity
to another MNIO protein MbnB (Figure S2). However, a sequence alignment of PesX with
characterized MNIO proteins demonstrated that it lacks the iron-binding residues that are
conserved in MNIOs (Figure S3).39 Therefore, the role and function of PesX was unclear.
A sequence h omology search of the enzymes in the pes cluster using BLASTp,
followed by genome neighborhood analysis using RODEO40 lead to the discovery of 26
gene clusters that resemble the pes cluster composition. These orthologous clusters all
contain homologs of PesX, PesI, PesH and PesC. αKG-HExxH proteins are only present
in ~ 75% of the clusters and when they are absent, they are replaced by a predicted
arginase (e.g. Figure S4). Alignment of the precursor peptides from these homologous
clusters reveals several conserved residues including a conserved C-terminal R(V/T)DN
motif (Figure 1b). Notably, a proline residue before the R(V/T)DN motif only co-occurs
when an αKG-HExxH protein is encoded in the BGC, hinting that the latter enzyme may
modify this Pro residue.
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 14, 2026. ; https://doi.org/10.64898/2026.03.11.711119doi: bioRxiv preprint
5
Figure 1. a) Composition of the pes BGC from Peribacillus simplex VanAntwerpen02 and
the amino acid sequence of precursor peptides PesA1/2. The numbering is based on the
C-terminal fragment obtained upon GluC-digestion; b) Sequence logo of precursor
peptides from BGCs homologous to the pes BGC (4 2 total sequences, 21 unique
sequences (Figure S5), identical sequences were removed).
Characterization of the product of the pes BGC
Because the native pes BGC encoding strains were not available to us , we
investigated the functions of the encoded enzymes by heterologous expression in E. coli.
A His 6-tag was appended to the N -terminus of the precursor peptides, and they were
expressed separately or co -expressed with a subset of putative modifying enzymes.
Following immobilized metal affinity chromatography (IMAC) purification, the purified
peptides were digested with endoproteinase GluC and characterized by mass
spectrometry.
We co-expressed both precursor peptides PesA1 and PesA2 individually with the
various enzymes encoded in the pes BGC with essentially the same results. We will
describe the data with PesA2 here and refer to the Supporting Information for the
analogous data for PesA1. Co-expression of PesA2 with the MNIO protein PesH and its
potential partner PesI and analysis by liquid chromatography-high resolution mass
spectrometry (LC-HRMS) revealed the emergence of a new species with a +16 Da mass
shift with respect to the unmodified PesA2 (Figure 2). The site of PesHI modification was
located to Asp21 by high-resolution tandem mass spectrometry (HR-MS/MS) (Figure S6).
Marfey’s analysis was used to determine the position and stereochemistry of the
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 14, 2026. ; https://doi.org/10.64898/2026.03.11.711119doi: bioRxiv preprint
6
oxidation.41,42 After hydrolysis of the PesHI-modified peptide, the hydroxylated aspartate
was reacted with 1-fluoro-2-4-dinitrophenyl-5-L-alanine amide (L-FDAA) and analyzed by
liquid chromatography coupled to mass spectrometry detection ( LC-MS). The product
coeluted with an L-threo-3-hydroxyaspartate standard that was derivatized in the same
manner (Figure 3a and S7). Collectively, these observations demonstrate that PesHI
facilitate stereoselective hydroxylation of Asp21.
Next, we included the αKG -HExxH protein PesO in the co -expression system in
addition to PesHI, for which we will use the designation PesOHI . LC-HRMS analysis of
the endoproteinase GluC-digested product peptide PesA2-PesOHI revealed the
dominant production of a M+32 Da species (Figure 2). This additional +16 Da mass shift
compared to the PesHI modification was assigned to the Pro1 8 residue by HR-MS/MS
analysis (Figure S6). We again performed Marfey’s analysis on the PesOHI -modified
peptide to elucidate the identity of the hydroxyproline residue (Figure 3b and S7).
Coelution with L-3S-hydroxyproline derivatized with the M arfey’s reagent indicated that
PesO stereoselectively hydroxylates carbon 3 of Pro18.
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 14, 2026. ; https://doi.org/10.64898/2026.03.11.711119doi: bioRxiv preprint
7
Figure 2. HR-MS/MS analysis of unmodified precursor peptide PesA2 or PesA2 co-
expressed with the indicated modifying enzymes. The MS/MS fragmentation pattern of
each modified peptide is shown (for tandem MS spectra, see Figure S6). Prior to analysis,
the peptides were digested with endoproteinase GluC. Fragment ion annotation w as
performed using the interactive peptide spectral annotator 43 with residues indicated in
lower case p and d entered as hydroxylated residue s (M+16), and in lower case d n as
two residues that were hydroxylated and nitrile containing (+16 and −19 to give a net
change of −3 Da).
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 14, 2026. ; https://doi.org/10.64898/2026.03.11.711119doi: bioRxiv preprint
8
Figure 3. Structural determination of PesA2 co-expressed with PesOHIC. a) LC-MS
analysis of hydroxyaspartate authentic standards or hydrolyzed PesA2 -PesHI and
PesA2-PesOHI peptides derivatized with L-FDAA; for coinjections, see Fig. S7. b) LC-MS
analysis of L-FDAA derivatized hydroxyproline from authentic standards or hydrolyzed
PesA2-PesHI and PesA2-PesOHI peptide s; c) 1H−13C HMBC spectrum of digested
PesA2-PesHIC highlighting the C-terminal residue, and d) proposed structure of digested
PesA2-PesHIC peptide 1 and PesA2-PesOHIC peptide 2 based on NM R, MS/MS and
Marfey’s analysis. Key HMBC correlations from c) are labeled.
Incorporation of the AS-like enzyme PesC into the co-expression system resulted in
a new product with a −19 Da mass shift relative to the PesOHI modified peptide. HR -
MS/MS analysis demonstrated that a −3 Da mass shift had occurred to the C -terminal
Asp21-Asn22 sequence (Figure 2 and S6). Asparagine synthetases usually catalyze the
amidation of aspartate to yield asparagine, 44 while in natural product biosynthesis,
members of this enzyme family also catalyze lactam formation. 45,46 Therefore, we
envisioned that an amidation reaction of a carboxyl group (−1 Da) and a dehydration
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 14, 2026. ; https://doi.org/10.64898/2026.03.11.711119doi: bioRxiv preprint
9
event ( −18 Da) would account for the observed overall mass shift of −19 Da . When
coupled to the PesHI catalyzed hydroxylation of Asp21 (+16 Da) , amidation and
dehydration would explain the net −3 Da shift of the Asp21 -Asn22 motif. Within this
sequence, the dehydration could originate in a dehydroAsp, or in the formation of ester,
imide, or nitrile group s ( Figure S 8). The t andem MS/MS data were inconclusive in
distinguishing these possibilities, and we therefore turned to analysis by nuclear magnetic
resonance (NMR) spectroscopy described in the next section.
We also integrated the hypothetical protein PesX into the co-expression system. Co-
expression of PesX in any combination with the other enzymes did not lead to the
emergence of any new species derived from PesA2. These results imply that PesX was
either non-functional in the E. coli heterologous host or the protein serves a non-catalytic
role in the pes BGC. As described below, a potential function of PesX is suggested from
in vitro experiments.
As noted above, c o-expression experiments of PesA1 with Pes enzymes yielded
similar modification patterns as PesA2 (Figure S9), indicating that the less conserved
precursor N-termini as well as the residue at position 20 (Thr/Val) have minimal impact
on the function of the modifying enzymes within the pes BGC.
Structural Elucidation of Modified PesA2
The data described thus far suggest that PesC catalyze s consecutive
amidation/dehydration reactions on the precursor peptide that is hydroxylated by PesHI
on Asp21 , yet the precise form of dehydration remain ed unclear. For NMR structural
determination, we first attempted to process the modified PesA2 peptides with the native
protease PesP1/P2. While the expression of this heterodimer in E. coli yielded soluble
protein (Figure S10), prolonged incubation of the peptides and PesP2/P1 led to a range
of proteolytic products, with a 25-mer the major product (Figure S11). The poor in vitro
activities of PesP2/P1 limited their application in this study, and therefore we mutated
Gln14 to Lys in PesA2 with the aim of generating a short peptide upon digestion with
endoproteinase LysC (for residue numbering, see Figure 1a). We first explored the impact
of this mutation on the PTM process, as well as the minimal enzyme requirement for
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 14, 2026. ; https://doi.org/10.64898/2026.03.11.711119doi: bioRxiv preprint
10
complete C -terminal modification. HR-MS/MS analysis of this Q14K mutant after co -
expression with PesHIC verified that the product still contained the −3 Da change in the
Asp21-Asn22 sequence (Figure S12), indicating that Gln14 is not required for PesHIC
modification. Therefore, we conducted large scale preparation of PesHI C-modified
PesA2-Q14K in E. coli. After IMAC purification and LysC digestion , the resulting
octapeptide was purified by high-performance liquid chromatography (HPLC) and used
for NMR spectroscopic analysis.
A combination of 1D and 2D NMR experiments was conducted to elucidate the
structure. Based on interactions observed in 1H-1H TOCSY, 1H-1H NOESY, 1H-13C HSQC
and 1H-13C HMBC spectra (Figures S13-S15), all proton and carbon signals from each of
the eight residues were assigned ( Table S1). Importantly, all side chain protons and
carbons of the Asp21-Asn22 sequence displayed chemical shift deviation compared to
unmodified Asp/Asn residues. For Asp21, the β-carbon shifted downfield to 71.2 ppm and
has only one proton attached to it, affirming that PesHI catalyze d β-hydroxylation o f
Asp21. Thus, the NMR data ruled out dehydration involving the hydroxyl group introduced
by PesHI, and in turn implicated that the −19 Da change must occur entirely on the C -
terminal Asn. T he α-proton of Asn22 shifted downfield to 5.13 ppm, indicating a more
electron-withdrawing environment. Together with the overall mass change, these findings
are consistent with co nversion of the C -terminal carboxylate into a nitrile group.
Consistent with this hypothesis , the 13C NMR spectrum showed only 10 rather than the
expected 11 signals in the range of 150-220 ppm (10 amide/carboxylate carbonyl carbons
and one guanidine carbon), and instead a characteristic nitrile carbon signal at 118.0 ppm
was observed. In the 1H-13C HMBC spectrum, this carbon signal showed correlations with
the α and β protons and the amide proton of former Asn22 (Figure 3c). Collectively, these
observations from 1D and 2D NMR experiments unambiguously show that in the PesA2-
PesHIC peptide, the C-terminal carboxylate is converted to a nitrile group.
We also isolated and characterized the precursor peptide PesA2-Q14K after
PesOHIC modification and LysC digestion by NMR spectroscopy (Figure S1 6-S18).
Compared with the PesA2 -PesHIC peptide, the signals characteristic of a β-
hydroxyaspartate as well as the C-terminal nitrile group were retained, and additional
chemical shift deviations from unmodified residues were mainly observed for protons and
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 14, 2026. ; https://doi.org/10.64898/2026.03.11.711119doi: bioRxiv preprint
11
carbons associated with the Pro residue (Table S2). In the 1H-1H COSY and 1H-13C HSQC
spectra, the observed correlations clearly assigned the hydroxy group to the β position
(C3), consistent with the conclusions from Marfey’s analysis . This modification resulted
in the conversion of a CH2 group to a CH group, accompanied by a downfield shift of the
β-proton to 4.43 ppm and the corresponding β-carbon to 73.2 ppm.
Characterization of the Nitrile Synthetase PesC
With the C-terminal nitrilation activity of PesC established, we examined the reaction
in vitro . For typical AS-like enzymes , the N -terminus is buried inside the enzyme .44
Therefore, we constructed a PesC-encoding plasmid with a C -terminal His 6-tag for
expression in E. coli. In the canonical AS catalytic cycle, L-aspartate is converted to L-
asparagine via ATP-dependent adenylation of the side chain carboxylate , followed by
amidation using L-glutamine as the ammonia donor.44 Because PesC shows sequence
homology to asparagine synthetases, IMAC-purified PesC was incubated with Mg2+, ATP,
L-glutamine and the PesA2-PesOHI peptide for 6 hours. LC-MS analysis of the GluC -
digested product clearly demonstrated the formation of the C-terminal nitrile (Figure 4a).
When ATP was omitted from the reaction, nitrilation activity was abolished. When the
assay was performed for 1 hour, a product with a mass shift of −1 Da was observed by
high resolution electron spray ionization mass spectrometry (HR-ESI MS) (Figure 4a)
consistent with the formation of a n intermediate C-terminal amide. To confirm the origin
of the nitrogen atom in both products, the assay was also performed with L-glutamine-
(amide-15N). The resulting C-terminal amide and nitrile containing peptides were both
detected by LC-HRMS with a +1 Da mass shift (Figure 4b), indicating that in the PesC
reaction, the inserted nitrogen is indeed derived from the side chain amide nitrogen of
glutamine.
Next, we explore d the substrate requirements of PesC. The importance of prior
PesHI/PesO modification for PesC activity was first examined. When unmodified
precursor PesA2 , PesA2-PesHI, and PesA2-PesOHI were incubated individually with
PesC, the production of the corresponding nitrile products was only observed with PesHI-
and PesOHI-modified PesA2 (Figure 4a, S19). For unmodified PesA2, only trace amounts
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 14, 2026. ; https://doi.org/10.64898/2026.03.11.711119doi: bioRxiv preprint
12
of the amide species were formed. Thus, the hydroxylation of the adjacent Asp residue
on the substrate is essential for efficient amidation and for dehydration of the intermediate
amide to occur, while hydroxylation by PesO is not required. This conclusion is consistent
with the gene composition of orthologous BGCs, because pesHI homologs always co -
occur with pesC homologs while pesO is only partially conserved (e.g. Figure S4).
The leader peptide dependence of PesC was examined using PesA1 as substrate,
which provided more PesOHI -modified product from the co -expression experiment. We
purified PesOHI-modified, endoproteinase LysC digested PesA1 -Q14K as well as
PesOHI-modified, endoproteinase AspN digested PesA1. Reaction of the C-terminal 14-
mer (AspN product) with PesC under the standard reaction conditions resulted in nitrile
production (Figure S20). The C-terminal 8-mer (LysC product) was converted by PesC to
the corresponding amide with a small amount of nitrile formed (Figure S20). These results
suggest that PesC can still function in the absence of a leader peptide, but the attenuation
in PesC efficiency suggests that the enzyme likely acts prior to proteolysis in the
biosynthetic pathway.
Figure 4. a) LC-MS analysis of the in vitro PesC reaction with PesA2 co-expressed with
PesOHI after GluC digestion (assay conditions: 100 μM PesA2-PesOHI peptide, 100 μM
PesC, 5 mM ATP, 10 mM L-glutamine, 20 mM Mg2+); b) HR-ESI MS analysis of the
resulting amide or nitrile species when L-glutamine or L-glutamine-(amide-15N) was used
in the assay (calculated mass for amide product [ M+2H]2+: 1111.5798, nitrile product
[M+2H]2+: 1102.5745).
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 14, 2026. ; https://doi.org/10.64898/2026.03.11.711119doi: bioRxiv preprint
13
Based on these observations, a mechanism for the PesC catalyzed nitrilation can be
proposed (Figure S21). The reaction is initiated by ATP-dependent adenylation of the C-
terminal carboxylate, followed by the nucleophilic attack of glutamine-derived ammonia
to form a C-terminal amide intermediate. The amide is then activated by a second
equivalent of ATP to generate an AMP-imidate, from which AMP is eliminated to yield the
C-terminal nitrile. This mechanism is similar to that previously proposed for AS-like nitrile
synthetases in other biosynthetic contexts such as the NRPS product auranthine and 7-
cyano-7-deazaguanine (preQ0).13,14,47
With the aim of potentially uncovering more AS-like enzymes that are capable of
installing nitrile groups on ribosomal peptides, we performed a bioinformatic analysis on
PesC. A BLASTp search with the PesC sequence as query only recovered high-identity
homologs of PesC located in orthologous clusters of the pes BGC. Other less highly
conserved hits were not related to RiPP biosynthesis, e.g. enzymes linked to the
biosynthesis of N-acetylglutaminylglutamine amide.48 We then constructed a sequence
similarity network (SSN) of the asparagine synthase protein family (PF00733) , and
examined their co -occurrence with RiPP biosynthesis elements ( Figure S22).49,50 This
analysis identified ~3,800 candidates, of which ~3,500 were related to lasso peptide
biosynthesis. Among the remaining enzymes, TsrC51 and ScdTA52 are known to catalyze
amidation of the C-terminal carboxylate of their substrate peptides. The remaining ~300
uncharacterized candidates suggest a potential underexplored functional space of AS-
like enzymes in RiPP biosynthesis. These enzymes may catalyze amidation, nitrilation,
lactam formation, or currently uncharacterized reactions.
In vitro Characterization of MNIO Proteins
The co-expression experiments were unable to assign a role for the MNIO-like
protein PesX. We therefore conducted in vitro characterization. Considering the prevalent
partner dependence of MNIO proteins in the literature,39 we constructed PesHI and PesXI
expression plasmid s with a His6-tag on the N -terminus of either PesH or PesX. As
anticipated, PesI was co-purified with both PesH and PesX (Figure 5b). Reaction of
PesA2 with as -purified PesHI resulted in hydroxylation similar to the co -expression
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 14, 2026. ; https://doi.org/10.64898/2026.03.11.711119doi: bioRxiv preprint
14
studies. When the assay was performed with PesH (no PesI) or with PesXI, no activity
was observed by ESI-MS (Figure 5a). These findings are consistent with the absence of
the iron-binding amino acids that are characteristic for MNIOs in the sequence of PesX
(Figure S3), but do not provide any insights regarding the potential function of the protein.
The ability of PesX to co-purify with partner protein PesI led us to speculate that PesX
might serve to regulate PesH activity, and thereby nitrile formation . Structural models of
the PesXI and PesHI heterodimer s using AlphaFold 3, and alignment of these two
structures revealed strong similarity (Figure 5c). Similar to reported structures of the TglHI
heterodimer,53 only the helical N-terminus of PesI interacts with PesH or PesX . The
overlapping binding interface suggests that PesX can compete with PesH in binding to
the partner protein PesI, which is critical for PesH activity as shown above. This proposal
is supported by the observation that introduction of PesX in the PesHI activity assay
resulted in diminished hydroxylation activit y (Figure 5a) . Moreover, addition of Strep-
tagged PesX to a solution of the His6-PesH/PesI heterodimer, and subsequent addition
to Ni-NTA resin resulted in the complex of PesX and PesI eluting from the column (Figure
5b). These observations indicate that PesX compete s with PesH in binding with partner
protein PesI, thus inhibiting its aspartate hydroxylation activity. Considering Asp2 1
hydroxylation is essential for PesC to perform nitrile formation, expression of PesX can
slow down the production of the nitrile product. If correct, this regulatory mechanism
suggests that the nitrile product may be toxic to t he producing organism and that its
production must be delicately regulated.
We performed bioinformatic analysis on PesX to gauge its distribution . A BLASTp
search with an E value cutoff of 0.05 retrieved only PesX homologs from orthologous pes-
like BGCs, suggesting that the sequence of PesX is highly specific to the nitrile formation
pathway.
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 14, 2026. ; https://doi.org/10.64898/2026.03.11.711119doi: bioRxiv preprint
15
Figure 5. PesX inhibits the activity of PesH. a) ESI-MS analysis of the in vitro PesH/X/I
reaction with PesA2 peptide (calculated mass for unmodified product [ M+7H]7+:
1002.6476, hydroxylated product [ M+7H]7+: 110 4.9326); b) SDS-PAGE analysis of
purified His 6-PesH/PesI, His 6-PesX/PesI, and Strep-PesX. Also shown are the
flowthrough (FT) and elution (E) fractions of His6-PesH-PesI and Strep-PesX loaded on
an IMAC column; c) Overlayed AlphaFold 3 models of the PesHI and PesXI complexes.
Nitrilobacillin Inhibits Cysteine Proteases
While nitrilobacillin represent s the first peptide natural product with a C -terminal nitrile
group, such scaffolds have been extensively explored in medicinal chemistry. As seen in
the therapeutic peptides vildagliptin and nirmatrelvir (Figure 6),6,7 nitrile groups are
common warheads in peptide -derived compounds that function by covalently inhibiting
cysteine/serine proteases. To assess such activity of pes BGC products, peptide 2
(Figure 3d), was evaluated with a panel of proteases. Compound 2 exhibited little to no
inhibitory activity against trypsin, chymotrypsin or angiotensin-converting enzyme (ACE)
at 25 μM concentration. Instead, it inhibited the cysteine proteases cathepsin L, cathepsin
B and papain with IC 50 values of 1.1 μM, 9.0 μM and 9.7 μM, respectively (Figure 7) .
These observed inhibitory activities align well with the reported activities of synthe tic
peptidyl nitriles ,7 and highlight the poten tial of engineering nitrilobacillins as cysteine
protease inhibitors.
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 14, 2026. ; https://doi.org/10.64898/2026.03.11.711119doi: bioRxiv preprint
16
Figure 6. Proposed biosynthetic pathway of nitrilobacillin . Representative examples of
therapeutic peptides with C-terminal nitrile groups are shown in the box.
Figure 7. Protease inhibition assay of compound 2 with human cathepsin L (top), human
cathepsin B (middle) and papain (bottom).
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 14, 2026. ; https://doi.org/10.64898/2026.03.11.711119doi: bioRxiv preprint
17
References
(1) Pomeranz, C. Ueber allylcyanid und allylsenföl. Liebigs Ann. Chem. 1907, 351, 354-
62.
(2) Scotti, C.; Barlow, J. W. Natural products containing the nitrile functional group and
their biological activities. Nat. Prod. Commun. 2022, 17, 1934578X221099973.
(3) Gleadow, R. M.; Møller, B. L. Cyanogenic glycosides: synthesis, physiology, and
phenotypic plasticity. Annu. Rev. Plant Biol. 2014, 65, 155-85.
(4) Yulvianti, M.; Zidorn, C. Chemical diversity of plant cyanogenic glycosides: an
overview of reported natural products. Molecules 2021, 26, 719.
(5) Liu, M.; Li, S. Nitrile biosynthesis in nature: how and why? Nat. Prod. Rep. 2024, 41,
649-71.
(6) Wang, X.; Wang, Y.; Li, X.; Yu, Z.; Song, C.; Du, Y. Nitrile-containing
pharmaceuticals: target, mechanism of action, and their SAR studies. RSC Med. Chem.
2021, 12, 1650-71.
(7) Bonatto, V.; Lameiro, R. F.; Rocho, F. R.; Lameira, J.; Leitão, A.; Montanari, C. A.
Nitriles: an attractive approach to the development of covalent inhibitors. RSC Med.
Chem. 2023, 14, 201-17.
(8) Rakshit, A.; Dhara, H. N.; Sahoo, A. K.; Patel, B. K. The renaissance of organo
nitriles in organic synthesis. Chem.: Asian J. 2022, 17, e202200792.
(9) Kahn, R. A.; Bak, S.; Svendsen, I.; Halkier, B. A.; Moller, B. L. Isolation and
reconstitution of cytochrome P450ox and in vitro reconstitution of the entire biosynthetic
pathway of the cyanogenic glucoside dhurrin from sorghum. Plant Physiol. 1997, 115,
1661-70.
(10) Oinuma, K.; Hashimoto, Y.; Konishi, K.; Goda, M.; Noguchi, T.; Higashibata, H.;
Kobayashi, M. Novel aldoxime dehydratase involved in carbon-nitrogen triple bond
synthesis of Pseudomonas chlororaphis B23. Sequencing, gene expression,
purification, and characterization. J. Biol. Chem. 2003, 278, 29600-8.
(11) Yamaguchi, T.; Asano, Y. Nitrile-synthesizing enzymes and biocatalytic synthesis
of volatile nitrile compounds: a review. J. Biotechnol. 2024, 384, 20-8.
(12) Peng, M.; Wu, Q.; Ma, L.; Teng, Z.-J.; Hou, X.; Zhu, H.; Ju, J. Bacterial
biosynthesis of nitrile-containing natural products: basis for recognition of diversified
substrates. ACS Catal. 2024, 14, 17780-93.
(13) Zeng, Y.; Zhang, K.; Lu, T.; Yin, X.; Wang, Q.; Xiong, L.; Guo, H.; Li, J.; Lu, X.; Liu,
L.; Ma, H.; Gao, Z. Structural and mechanistic basis for nitrile synthetase by an
argininosuccinate synthetase-like enzyme. ACS Catal. 2025, 15, 16254-67.
(14) Kishimoto, S.; Tamura, T.; Okamoto, T.; Watanabe, K. Enantioselective
biosynthesis of (+)- and (−)-auranthines. J. Am. Chem. Soc. 2025, 147, 10612-7.
(15) Cha, L.; Tang, Y.; Guo, Y.; Chang, W.-C. In Comprehensive Natural Products III;
Liu, H.-W., Begley, T. P., Eds.; Elsevier: Oxford, 2020, p 191-214.
(16) Owen, D. R.; Allerton, C. M. N.; Anderson, A. S.; Aschenbrenner, L.; Avery, M.;
Berritt, S.; Boras, B.; Cardin, R. D.; Carlo, A.; Coffman, K. J.; Dantonio, A.; Di, L.; Eng,
H.; Ferre, R.; Gajiwala, K. S.; Gibson, S. A.; Greasley, S. E.; Hurst, B. L.; Kadar, E. P.;
Kalgutkar, A. S.; Lee, J. C.; Lee, J.; Liu, W.; Mason, S. W.; Noell, S.; Novak, J. J.;
Obach, R. S.; Ogilvie, K.; Patel, N. C.; Pettersson, M.; Rai, D. K.; Reese, M. R.;
Sammons, M. F.; Sathish, J. G.; Singh, R. S. P.; Steppan, C. M.; Stewart, A. E.; Tuttle,
J. B.; Updyke, L.; Verhoest, P. R.; Wei, L.; Yang, Q.; Zhu, Y. An oral SARS-CoV-2
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 14, 2026. ; https://doi.org/10.64898/2026.03.11.711119doi: bioRxiv preprint
23
M(pro) inhibitor clinical candidate for the treatment of COVID-19. Science 2021, 374,
1586-93.
(17) Gavriilidou, A.; Kautsar, S. A.; Zaburannyi, N.; Krug, D.; Müller, R.; Medema, M.
H.; Ziemert, N. Compendium of specialized metabolite biosynthetic diversity encoded in
bacterial genomes. Nat. Microbiol. 2022, 7, 726-35.
(18) Dang, T.; Süssmuth, R. D. Bioactive peptide natural products as lead structures for
medicinal use. Acc. Chem. Res. 2017, 50, 1566-76.
(19) Zimmer, L.; Crüsemann, M. Amino acid tailoring strategies in peptide natural
product biosynthesis. Trends Chem. 2025, 7, 705-18.
(20) Padhi, C.; Field, C. M.; Forneris, C. C.; Olszewski, D.; Fraley, A. E.; Sandu, I.;
Scott, T. A.; Farnung, J.; Ruscheweyh, H.-J.; Narayan Panda, A.; Oxenius, A.; Greber,
U. F.; Bode, J. W.; Sunagawa, S.; Raina, V.; Suar, M.; Piel, J. Metagenomic study of
lake microbial mats reveals protease-inhibiting antiviral peptides from a core
microbiome member. Proc. Natl. Acad. Sci. USA 2024, 121, e2409026121.
(21) Zhang, J.; Zhang, D.; Xu, Y.; Zhang, J.; Liu, R.; Gao, Y.; Shi, Y.; Cai, P.; Zhong,
Z.; He, B.; Li, X.; Zhou, H.; Chen, M.; Li, Y.-X. Large-scale biosynthetic analysis of
human microbiomes reveals diverse protective ribosomal peptides. Nature Commun.
2025, 16, 3054.
(22) Montalbán-López, M.; Scott, T. A.; Ramesh, S.; Rahman, I. R.; van Heel, A. J.;
Viel, J. H.; Bandarian, V.; Dittmann, E.; Genilloud, O.; Goto, Y.; Grande Burgos, M. J.;
Hill, C.; Kim, S.; Koehnke, J.; Latham, J. A.; Link, A. J.; Martínez, B.; Nair, S. K.; Nicolet,
Y.; Rebuffat, S.; Sahl, H.-G.; Sareen, D.; Schmidt, E. W.; Schmitt, L.; Severinov, K.;
Süssmuth, R. D.; Truman, A. W.; Wang, H.; Weng, J.-K.; van Wezel, G. P.; Zhang, Q.;
Zhong, J.; Piel, J.; Mitchell, D. A.; Kuipers, O. P.; van der Donk, W. A. New
developments in RiPP discovery, enzymology and engineering. Nat. Prod. Rep. 2021,
38, 130-239.
(23) Nguyen, D. T.; Mitchell, D. A.; van der Donk, W. A. Genome mining for new
enzyme chemistry. ACS Catal. 2024, 14, 4536-53.
(24) Richter, D.; Piel, J. Novel types of RiPP-modifying enzymes. Curr. Opin. Chem.
Biol. 2024, 80, 102463.
(25) Ting, C. P.; Funk, M. A.; Halaby, S. L.; Zhang, Z.; Gonen, T.; van der Donk, W. A.
Use of a scaffold peptide in the biosynthesis of amino acid-derived natural products.
Science 2019, 365, 280-4.
(26) Ayikpoe, R. S.; Zhu, L.; Chen, J. Y.; Ting, C. P.; van der Donk, W. A.
Macrocyclization and backbone rearrangement during RiPP biosynthesis by a SAM-
dependent domain-of-unknown-function 692. ACS Cent. Sci. 2023, 9, 1008-18.
(27) Nguyen, D. T.; Zhu, L.; Gray, D. L.; Woods, T. J.; Padhi, C.; Flatt, K. M.; Mitchell,
D. A.; van der Donk, W. A. Biosynthesis of macrocyclic peptides with C-terminal β-
amino-α-keto acid groups by three different metalloenzymes. ACS Cent. Sci. 2024, 10,
1022-32.
(28) Morishita, Y.; Ma, S.; De La Mora, E.; Li, H.; Chen, H.; Ji, X.; Usclat, A.; Amara, P.;
Sugiyama, R.; Tooh, Y. W.; Gunawan, G.; Pérard, J.; Nicolet, Y.; Zhang, Q.; Morinaka,
B. I. Fused radical SAM and αKG-HExxH domain proteins contain a distinct structural
fold and catalyse cyclophane formation and β-hydroxylation. Nat. Chem. 2024, 16,
1882-93.
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 14, 2026. ; https://doi.org/10.64898/2026.03.11.711119doi: bioRxiv preprint
24
(29) Kim, H. W.; Kang, S.; Kim, S.; Lee, H.; Hur, Y.; Song, W. J.; Oh, D. C.; Kim, S.
Discovery of a two-step enzyme cascade converting aspartate to aminomalonate in
peptide natural product biosynthesis. J. Am. Chem. Soc. 2025, 147, 20909-18.
(30) Padhi, C.; Zhu, L.; Chen, J. Y.; Huang, C.; Moreira, R.; Challis, G. L.; Cryle, M. J.;
van der Donk, W. A. Biosynthesis of biphenomycin-like macrocyclic peptides by
formation and cross-linking of ortho-tyrosines. J. Am. Chem. Soc. 2025, 147, 23781-96.
(31) Xia, Y.; Shi, Y.; Xiao, Z.; Xie, X.; Wang, B.; Yao, H.; Wang, H. α-Ketoglutarate-
dependent dioxygenase-catalyzed macrocyclization in RiPPs. J. Am. Chem. Soc. 2025,
147, 26693-703.
(32) Ouyang, Y.; Yu, Y.; Zhu, L.; Nguyen, D. T.; van der Donk, W. A. Oxidative peptide
backbone cleavage by a HEXXH enzyme during RiPP biosynthesis. J. Am. Chem. Soc.
2026, 148, 3551-61.
(33) Kenney, G. E.; Dassama, L. M. K.; Pandelia, M. E.; Gizzi, A. S.; Martinie, R. J.;
Gao, P.; DeHart, C. J.; Schachner, L. F.; Skinner, O. S.; Ro, S. Y.; Zhu, X.; Sadek, M.;
Thomas, P. M.; Almo, S. C.; Bollinger, J. M., Jr.; Krebs, C.; Kelleher, N. L.; Rosenzweig,
A. C. The biosynthesis of methanobactin. Science 2018, 359, 1411-6.
(34) Yu, Y.; van der Donk, W. A. Biosynthesis of 3-thia-α-amino acids on a carrier
peptide. Proc. Natl. Acad. Sci. U. S. A. 2022, 119, e2205285119.
(35) Leprevost, L.; Jünger, S.; Lippens, G.; Guillaume, C.; Sicoli, G.; Oliveira, L.;
Falcone, E.; de Santis, E.; Rivera-Millot, A.; Billon, G.; Stellato, F.; Henry, C.; Antoine,
R.; Zirah, S.; Dubiley, S.; Li, Y.; Jacob-Dubuisson, F. A widespread family of ribosomal
peptide metallophores involved in bacterial adaptation to metal stress. Proc. Natl. Acad.
Sci. U. S. A. 2024, 121, e2408304121.
(36) Tang, Y.; Zhong, W.; Fu, L.; Asante, E.; Kostenko, A.; Vidya, F. N. U.; Mandelare-
Ruiz, P.; Adeogun, T. T.; Anderson, G. P.; Edmonds, B. E.; Fang, O.; Han, M.;
Hollingsworth, A. S.; Ingham, A. R.; Kirby, C. R.; Landrum, A.; Mack, C. R.; Nobari, N.
S.; Oswald, E. J.; Polevoy, C. L.; Sharifian, Y.; So, T. J.; Stokes, J. R.; Thompson, R.
S.; Vuthamaraju, R.; Wang, E. C.; Yang, W. H.; Onstine, A. E.; Paul, V. J.; Wu, R.;
Aron, A. T.; Agarwal, V. Phylogenomic identification of a highly conserved copper-
binding RiPP biosynthetic gene cluster in marine Microbulbifer bacteria. ACS Chem.
Biol. 2025, 20, 2462-74.
(37) Abramson, J.; Adler, J.; Dunger, J.; Evans, R.; Green, T.; Pritzel, A.; Ronneberger,
O.; Willmore, L.; Ballard, A. J.; Bambrick, J.; Bodenstein, S. W.; Evans, D. A.; Hung, C.-
C.; O’Neill, M.; Reiman, D.; Tunyasuvunakool, K.; Wu, Z.; Žemgulytė, A.; Arvaniti, E.;
Beattie, C.; Bertolli, O.; Bridgland, A.; Cherepanov, A.; Congreve, M.; Cowen-Rivers, A.
I.; Cowie, A.; Figurnov, M.; Fuchs, F. B.; Gladman, H.; Jain, R.; Khan, Y. A.; Low, C. M.
R.; Perlin, K.; Potapenko, A.; Savy, P.; Singh, S.; Stecula, A.; Thillaisundaram, A.; Tong,
C.; Yakneen, S.; Zhong, E. D.; Zielinski, M.; Žídek, A.; Bapst, V.; Kohli, P.; Jaderberg,
M.; Hassabis, D.; Jumper, J. M. Accurate structure prediction of biomolecular
interactions with AlphaFold 3. Nature 2024, 630, 493-500.
(38) van Kempen, M.; Kim, S. S.; Tumescheit, C.; Mirdita, M.; Lee, J.; Gilchrist, C. L.
M.; Söding, J.; Steinegger, M. Fast and accurate protein structure search with Foldseek.
Nat. Biotechnol. 2024, 42, 243-6.
(39) Chen, J. Y.; van der Donk, W. A. Multinuclear non-heme iron dependent oxidative
enzymes (MNIOs) involved in unusual peptide modifications. Curr. Opin. Chem. Biol.
2024, 80, 102467.
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 14, 2026. ; https://doi.org/10.64898/2026.03.11.711119doi: bioRxiv preprint
25
(40) Tietz, J. I.; Schwalen, C. J.; Patel, P. S.; Maxson, T.; Blair, P. M.; Tai, H. C.; Zakai,
U. I.; Mitchell, D. A. A new genome-mining tool redefines the lasso peptide biosynthetic
landscape. Nat. Chem. Biol. 2017, 13, 470-8.
(41) Marfey, P. Determination of D-amino acids. II. Use of a bifunctional reagent, 1,5-
difluoro-2,4-dinitrobenzene. Carlsberg Res. Commun. 1984, 49, 591-6.
(42) Ayon, N. J.; Sharma, A. D.; Gutheil, W. G. LC-MS/MS-based separation and
quantification of Marfey's reagent derivatized proteinogenic amino acid DL-
stereoisomers. J. Am. Soc. Mass Spectrom. 2019, 30, 448-58.
(43) Brademan, D. R.; Riley, N. M.; Kwiecien, N. W.; Coon, J. J. Interactive peptide
spectral annotator: A versatile web-based tool for proteomic applications. Mol. Cell.
Proteom. 2019, 18, S193-S201.
(44) Lomelino, C. L.; Andring, J. T.; McKenna, R.; Kilberg, M. S. Asparagine
synthetase: function, structure, and role in disease. J. Biol. Chem. 2017, 292, 19952-8.
(45) Bachmann, B. O.; Li, R.; Townsend, C. A. β-Lactam synthetase: A new
biosynthetic enzyme. Proc. Natl. Acad. Sci. U. S. A. 1998, 95, 9082-6.
(46) Maksimov, M. O.; Pan, S. J.; Link, A. J. Lasso Peptides : Structure, Function,
Biosynthesis, and Engineering. Nat. Prod. Rep. 2012, 29, 996-1006.
(47) Nelp, M. T.; Bandarian, V. A single enzyme transforms a carboxylic acid into a
nitrile through an amide intermediate. Angew. Chem. Int. Ed. 2015, 54, 10627-9.
(48) Sagot, B.; Gaysinski, M.; Mehiri, M.; Guigonis, J.-M.; Le Rudulier, D.; Alloing, G.
Osmotically induced synthesis of the dipeptide N-acetylglutaminylglutamine amide is
mediated by a new pathway conserved among bacteria. Proc. Natl. Acad. Sci. U. S. A.
2010, 107, 12652-7.
(49) Oberg, N.; Zallot, R.; Gerlt, J. A. EFI-EST, EFI-GNT, and EFI-CGFP: Enzyme
Function Initiative (EFI) web resource for genomic enzymology tools. J. Mol. Biol. 2023,
435, 168018.
(50) Zallot, R.; Oberg, N.; Gerlt, J. A. Discovery of new enzymatic functions and
metabolic pathways using genomic enzymology web tools. Curr. Opin. Biotechnol.
2021, 69, 77-90.
(51) Liao, R.; Liu, W. Thiostrepton maturation involving a deesterification-amidation
way to process the C-terminally methylated peptide backbone. J. Am. Chem. Soc. 2011,
133, 2852-5.
(52) Crnkovic, C. M.; Braesel, J.; Krunic, A.; Eustáquio, A. S.; Orjala, J. Scytodecamide
from the cultured Scytonema sp. UIC 10036 expands the chemical and genetic diversity
of cyanobactins. Chem. Bio. Chem. 2020, 21, 845-52.
(53) Zheng, Y.; Xu, X.; Fu, X.; Zhou, X.; Dou, C.; Yu, Y.; Yan, W.; Yang, J.; Xiao, M.;
van der Donk, W. A.; Zhu, X.; Cheng, W. Structures of the holoenzyme TglHI required
for 3-thiaglutamate biosynthesis. Structure 2023, 31, 1220-32.
(54) Hausinger, R. P. In 2-Oxoglutarate-dependent oxygenases; Schofield, C.,
Hausinger, R., Eds.; The Royal Society of Chemistry: 2015, p 1-58.
(55) Foulston, L. C.; Bibb, M. J. Microbisporicin gene cluster reveals unusual features
of lantibiotic biosynthesis in actinomycetes. Proc. Natl. Acad. Sci. U. S. A. 2010, 107,
13461-6.
(56) Adak, S.; Ye, N.; Calderone, L. A.; Duan, M.; Lubeck, W.; Schäfer, R. J. B.;
Lukowski, A. L.; Houk, K. N.; Pandelia, M.-E.; Drennan, C. L.; Moore, B. S. A single
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 14, 2026. ; https://doi.org/10.64898/2026.03.11.711119doi: bioRxiv preprint
26
diiron enzyme catalyses the oxidative rearrangement of tryptophan to indole nitrile. Nat.
Chem. 2024, 16, 1989-98.
(57) Cicmil, N.; Huang, R. H. Crystal structure of QueC from Bacillus subtilis: an
enzyme involved in preQ1 biosynthesis. Proteins 2008, 72, 1084-8.
(58) Miller, M. T.; Bachmann, B. O.; Townsend, C. A.; Rosenzweig, A. C. Structure of
β-lactam synthetase reveals how to synthesize antibiotics instead of asparagine. Nat.
Struct. Mol. Biol. 2001, 8, 684-9.
(59) Miller, M. T.; Bachmann, B. O.; Townsend, C. A.; Rosenzweig, A. C. The catalytic
cycle of β-lactam synthetase observed by x-ray crystallographic snapshots. Proc. Natl.
Acad. Sci. U. S. A. 2002, 99, 14752-7.
(60) Barrett, S. E.; Mitchell, D. A. Advances in lasso peptide discovery, biosynthesis,
and function. Trends Genet. 2024, 40, 950-68.
(61) Gadgil, M. G.; Dommaraju, S. R.; Liu, X.; Battiste, A. J.; Bregman, M. H.; Mitchell,
D. A. Biosynthesis of peptidic thiooxazole metallophores installed by multinuclear
nonheme iron enzymes. bioRxiv 2025, 10.1101/2025.11.18.689057.
(62) Park, Y. J.; Jodts, R. J.; Slater, J. W.; Reyes, R. M.; Winton, V. J.; Montaser, R. A.;
Thomas, P. M.; Dowdle, W. B.; Ruiz, A.; Kelleher, N. L.; Bollinger, J. M., Jr.; Krebs, C.;
Hoffman, B. M.; Rosenzweig, A. C. A mixed-valent Fe(II)Fe(III) species converts
cysteine to an oxazolone/thioamide pair in methanobactin biosynthesis. Proc. Natl.
Acad. Sci. U. S. A. 2022, 119, e2123566119.
(63) Allali, N.; Afif, H.; Couturier, M.; Van Melderen, L. The highly conserved TldD and
TldE proteins of Escherichia coli are involved in microcin B17 processing and in CcdA
degradation. J. Bacteriol. 2002, 184, 3224-31.
(64) Tsibulskaya, D.; Mokina, O.; Kulikovsky, A.; Piskunova, J.; Severinov, K.;
Serebryakova, M.; Dubiley, S. The product of Yersinia pseudotuberculosis mcc operon
is a peptide-cytidine antibiotic activated inside producing cells by the TldD/E protease.
J. Am. Chem. Soc. 2017, 139, 16178-87.
(65) Xue, D.; Older, E. A.; Zhong, Z.; Shang, Z.; Chen, N.; Dittenhauser, N.; Hou, L.;
Cai, P.; Walla, M. D.; Dong, S. H.; Tang, X.; Chen, H.; Nagarkatti, P.; Nagarkatti, M.; Li,
Y. X.; Li, J. Correlational networking guides the discovery of unclustered lanthipeptide
protease-encoding genes. Nat. Commun. 2022, 13, 1647.
(66) Ramos-Figueroa, J. S.; Zhu, L.; Halliman, M.; van der Donk, W. A. The
biosynthetic pathway to the pyrroloiminoquinone marine natural product ammosamide
C. J. Am. Chem. Soc. 2025, 147, 46315-23.
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 14, 2026. ; https://doi.org/10.64898/2026.03.11.711119doi: bioRxiv preprint