Structural semantic evolutionary distance (SSED) unifies the selection of cancer driver genes across macroevolution and tumorigenesis

preprint OA: closed
Full text JSON View at publisher
Full text 1,815 characters · extracted from oa-doi-fallback · click to expand
Abstract The non-random, site-specific enrichment of somatic mutations in cancer driver genes (CDGs) suggests their emergence is governed by underlying evolutionary constraints. However, quantitative methods to define these constraints and their underlying principles remain underexplored. To address this, we introduce the Structural Semantic Evolutionary Distance (SSED), a metric leveraging the pretrained ESM-3 protein language model to quantify evolutionary divergence within a unified structural semantic space. Our analysis demonstrates that CDGs are subject to persistent structural semantic constraints across species, tolerating a significantly narrower range of structural semantic changes during evolution compared to non-CDGs. Crucially, clinically observed oncogenic mutations follow this same principle, favoring minimal structural perturbation as shaped by long-term gene evolution. Such mutations maintain core protein function while conferring a capacity for immune evasion, thereby driving clonal expansion. Guided by this “evolutionary constraint” framework, we successfully predicted and experimentally validated a previously uncharacterized oncogenic mutation, KRAS R135L, in bronchial epithelial cells. Furthermore, clinical cohort analysis demonstrated that SSED acts as an independent predictor of response to immune checkpoint blockade, offering information orthogonal to tumor mutational burden (TMB). This study unifies the evolutionary principles governing CDGs across macroevolutionary and microevolutionary (tumorigenesis) timescales, elucidates the balance between structural adaptability, functional conservation, and immune pressure, and identifies a novel predictive biomarker for cancer immunotherapy. Competing Interest Statement The authors have declared no competing interest.

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-doi-fallback

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00