Learning a PRECISE language for small-molecule binding

preprint OA: closed
Full text JSON View at publisher
Full text 1,641 characters · extracted from oa-doi-fallback · click to expand
Abstract Virtual screening of billion-scale compound libraries has become feasible through machine learning approaches. In particular, CoNCISE (RECOMB 2025) introduced drug quantization via code-books, achieving highly scalable and accurate binary predictions. However, drug discovery requires understanding not just whether molecules bind, but where they bind and how to target specific sites. Here, we present PRECISE which leverages CoNCISE’s quantized small-molecule representations while operating on the target’s 3D structure as its input. The key innovation of PRECISE is reimagining drug-target interaction as compatibility between quantized drug embeddings and a latent representation of the target’s surface mesh, enriched with electrostatic and geometric features. PRECISE designs a novel surface representation, interpreted through a geometric deep learning architecture, enabling it to identify binding sites more accurately than state-of-the-art methods (DiffDock-L, Chai, and Boltz-2) while the codebook ensures billion-scale screening capability. Our formulation unlocks zero-shot generalization to complex targets such as metalloproteins and multi-chain complexes. To enable efficient integration with downstream docking workflows, we introduce Precise-MCTS, which combines fast Precise-based screening with selective Vina docking through an iterative Monte Carlo Tree Search approach. By providing both mechanistic understanding and massive scalability, PRECISE delivers capabilities that were previously mutually exclusive in virtual screening. Competing Interest Statement The authors have declared no competing interest.

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-doi-fallback

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00