ProtRAP-LM: Fast and accurate protein relative accessibility prediction and membrane protein screening through protein language model embeddings

preprint OA: closed
📄 Open PDF Full text JSON View at publisher
Full text 1,604 characters · extracted from oa-doi-fallback · click to expand
Abstract Membrane proteins play pivotal roles in cellular signaling and transport, making them prime targets for therapeutic intervention. Therefore, expeditious screening and accurate property prediction of these proteins is crucial. Recently, we proposed a new metric, the membrane contact probability (MCP), to characterize the membrane-contacting features of membrane proteins and further refine the prediction of the relative accessibility of proteins (ProtRAP). However, these models relied on evolutionary information in the form of multiple sequence alignments (MSAs), which hindered rapid predictions. In this study, we present a novel transformer-based model, ProtRAP-LM, utilizing language model (LM) embeddings as input features, to quickly and accurately predict MCP and relative accessibility for each residue of a given protein sequence. ProtRAP-LM can achieve accurate predictions for entire proteomes within hours, demonstrating superior performance compared to previous MSA-based models, with a speedup of over 300 times. This empowers us to furnish more thorough annotations of membrane protein sequences on a proteome-wide scale, particularly for single-pass transmembrane proteins, membrane-anchored proteins, and β-sheet-containing membrane proteins, which have long been a challenge in the field. In the end, we provide a comprehensive list of membrane proteins for 48 living organisms, offering a rich resource for investigating the structure and function of these essential biomolecules in the future. Competing Interest Statement The authors have declared no competing interest.

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-doi-fallback

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00