ESM-LoRA-Gly: Improved prediction of N- and O-linked glycosylation sites by tuning protein language models with low-rank adaptation (LoRA)

preprint OA: closed
📄 Open PDF Full text JSON View at publisher
Full text 1,174 characters · extracted from oa-doi-fallback · click to expand
ABSTRACT Glycosylation associates with many diseases ranging from cancer to neurodegeneration and understanding these disease mechanisms requires the precise identification of glycosylation sites. Computational prediction of glycosylation sites has been useful to complement laborious experimental methods, while existing tools lack sufficient accuracy and scalability. Here, we introduce ESM-LoRA-Gly, a method that employs Low-Rank Adaptation (LoRA) to fine-tune the ESM2-3B protein language model for predicting both N- and O-linked glycosylation sites. According to the evaluation on the benchmark datasets, ESM-LoRA-Gly outperforms existing state-of-the-art techniques. The improvement is particularly significant (>100% in Matthews correlation coefficient) for the O-linked dataset. By substantially reducing trainable parameters while maintaining predictive power, ESM-LoRA-Gly enables computationally efficient proteome-scale predictions. This approach should be instrumental for advancing glycoproteomic research and accelerating therapeutic discovery for glycosylation-related diseases. Competing Interest Statement The authors have declared no competing interest.

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-doi-fallback

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00