FrustraMPNN: An ultra-fast deep learning tool for proteome-scale analysis of deep mutational single-residue local energetic frustration in proteins

preprint OA: closed
Full text JSON View at publisher
Full text 2,155 characters · extracted from oa-doi-fallback · click to expand
Abstract Energetic frustration, characterized by conflicting local interactions, is a key determinant of protein dynamics, allostery, protein-protein interactions, enzyme catalysis, and overall protein function. Although the principle of minimal frustration describes an evolutionary bias towards a reduction of energetic conflicts for efficient protein folding, local violations are selectively embedded to encode the complex energy landscapes necessary for protein functions. However, the computational cost of traditional single-residue frustration analysis has prevented the calculation of these energetic conflicts at the proteome scale, creating a significant bottleneck in structural biology. Here, we introduce FrustraMPNN, a message-passing neural network retrained via transfer learning that predicts a complete per-residue frustration-index mutation profile (full mutational scanning per position for a whole protein) orders-of-magnitude faster than existing methods while maintaining high accuracy. This is demonstrated by calculating the single-residue frustration of the E. coli proteome, reducing the calculation time from years to 12 hours. We validate FrustraMPNN on a diverse external set of over 3,400 human protein structures, achieving a Spearman correlation of up to 0.80 across all frustration categories, demonstrating robust generalizability. By conducting a thorough dataset ablation study, we find that the model’s performance improves when trained on a broader range of protein sizes, highlighting an inherent limitation of datasets like Megascale. We provide FrustraMPNN as an open-source tool, expecting it will enable exploration of new hypotheses in fields such as personalized structural biology and allow analysis of local energetic frustration patterns involved in protein function at an unprecedented proteomic scale. Competing Interest Statement C.T.S. has received unrelated research funds from Navigo Proteins GmbH (Halle (Saale), Germany). F.E. is a Co-Founder of AI Driven Therapeutics GmbH. M.B. has been employed by AI Driven Therapeutics GmbH since July 2025. All other authors declare no conflicts of interest.

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-doi-fallback

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00