PepGo: a deep learning and tree search-based model for de novo peptide sequencing

preprint OA: closed CC-BY-4.0
📄 Open PDF Full text JSON View at publisher
Full text 1,143 characters · extracted from oa-doi-fallback · click to expand
Abstract Identifying peptide sequences from tandem mass spectra is a fundamental problem in proteomics. Unlike search-based methods that rely on matching spectra to databases, de novo peptide sequencing determines peptides directly from mass spectra without any prior information. However, the design of models and algorithms for de novo peptide sequencing remains a challenge. Many de novo approaches leverage deep learning but primarily focus on the architecture of neural networks, paying less attention to search algorithms. We introduce PepGo, a de novo peptide sequencing model that integrates Transformer neural networks with Monte Carlo Tree Search (MCTS). PepGo predicts peptide sequences directly from mass spectra without databases, even without prior training. We show that PepGo surpasses existing methods, achieving state-of-the-art performance. To our knowledge, this is the first approach to combine deep learning with MCTS for de novo peptide sequencing, offering a powerful and adaptable solution for peptide identification in proteomics research. Competing Interest Statement The authors have declared no competing interest.

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-doi-fallback

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00
unpaywall
last seen: 2026-05-23T02:00:01.238055+00:00
License: CC-BY-4.0