Decoding Allosteric Grammar with Explainable AI Integrating Protein Language Models and Energy Landscape Analysis: Neutral Frustration at Allosteric Binding Sites Encodes Regulatory Versatility in Protein Kinases

preprint OA: closed
Full text JSON View at publisher
Full text 2,305 characters · extracted from oa-doi-fallback · click to expand
Abstract Allosteric regulation enables protein kinases to integrate diverse cellular signals, yet the energetic organization principles that encode spatial and evolutionary diversity of regulatory binding sites lack a complete understanding. We introduce an explainable artificial intelligence (AI) framework that uses protein language models (PLMs) not as predictive endpoints, but as diagnostic probes of biophysical organization of regulatory regions. By integrating PLM-based binding site predictions with the energy landscape-based frustration analysis, we determine that the detectability of orthosteric and allosteric binding sites reflects their energetic embedding within the protein energy landscape. Using PLM predictions as unbiased probes across a structurally and functionally diverse dataset of 453 human kinases, we observe a striking and reproducible dichotomy in predictive behavior: orthosteric ATP-binding pockets are detected with high confidence, whereas allosteric sites consistently evade robust detection. Orthosteric catalytic sites reside within minimally frustrated, optimized energetic regions that are consistently detected with high confidence. In contrast, allosteric sites are enriched in neutrally frustrated zones, producing diffuse and context-dependent predictions, revealing that the “allosteric blind spot” arises from intrinsic biophysical design rather than algorithmic limitations. Atomic-resolution analysis of ABL kinase spanning multiple conformational states and complexes bound to diverse ligands provides mechanistic validation of this principle. The myristoyl allosteric pocket in ABL remains neutrally frustrated across complexes with physiological ligands, chemically diverse modulators, from allosteric inhibitors to activators, and conformations engaged with SH2–SH3 regulatory domains. We propose that allosteric sites are encoded in persistent neutrally frustrated regions optimized for context-dependent regulatory modulation. By using explainable AI to interrogate the energetic architecture of protein kinases, this work reveals how the organization of the protein energy landscape shapes functional plasticity and algorithmic detectability of regulatory binding sites. Competing Interest Statement The authors have declared no competing interest.

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-doi-fallback

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00