MetaMAG Explorer: A Database-Augmenting Pipeline for Genome-Resolved Metagenomics and Enhanced Microbial Classification

preprint OA: closed
Abstract

Accurate taxonomic classification in metagenomic studies remains challenging because reference databases are often static and incomplete, limiting our understanding of microbial diversity, especially in habitats that are not well represented. We introduce MetaMAG Explorer, a complete and modular pipeline designed to fill this gap with its unique database augmentation framework. Together with end-to-end features like read preprocessing, assembly, binning, and annotation, MetaMAG also presents an automated method for finding new metagenome-assembled genomes (MAGs), confirming their uniqueness by dereplication against curated repositories, and dynamically adding them to classification databases that are compatible with Kraken2. Additionally, MetaMAG makes it easier to understand data by automatically creating high-quality figures that are ready for publication, allowing results to be quickly included in scientific papers.

Evaluated across human, plant, and rumen datasets, MetaMAG recovered 233 MAGs, including 121 high-quality genomes, of which 48 (20%) were novel. Database augmentation increased Kraken2 classification rates and reassigned millions of previously misclassified reads. Beyond the gain in read classification, the database augmentation revealed ecologically important taxa that are consistently present in all samples but previously undetected. By enabling iterative database growth driven by the novel MAGs, MetaMAG offers a scalable, highly reproducible, and extensible solution for truly genome-resolved metagenomics, advancing both microbial discovery and taxonomic classification accuracy.

Competing Interest Statement

The authors have declared no competing interest.
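The abstract describes selecting novel MAGs after dereplication against reference genomes and adding them to a Kraken2-compatible database. The selection step might be sketched as below. This is a minimal illustration, not the authors' implementation: the `Mag` record, the function names, and all three thresholds are assumptions (90% completeness and 5% contamination are widely used high-quality cutoffs, and 95% average nucleotide identity is a common species-level boundary), and the abstract does not say whether only high-quality MAGs are added.

```python
from dataclasses import dataclass
from typing import List

# Assumed thresholds; none of these values are stated in the abstract.
MIN_COMPLETENESS = 90.0    # percent, commonly used high-quality cutoff
MAX_CONTAMINATION = 5.0    # percent, commonly used high-quality cutoff
NOVELTY_ANI_CUTOFF = 95.0  # percent ANI, common species-level boundary


@dataclass
class Mag:
    """Hypothetical record for one metagenome-assembled genome."""
    name: str
    completeness: float        # estimated completeness, percent
    contamination: float       # estimated contamination, percent
    best_reference_ani: float  # highest ANI to any reference genome; 0.0 if no hit


def is_high_quality(mag: Mag) -> bool:
    """True if the MAG passes the assumed quality cutoffs."""
    return (mag.completeness >= MIN_COMPLETENESS
            and mag.contamination < MAX_CONTAMINATION)


def is_novel(mag: Mag) -> bool:
    """True if no reference genome reaches the assumed ANI cutoff."""
    return mag.best_reference_ani < NOVELTY_ANI_CUTOFF


def select_for_augmentation(mags: List[Mag]) -> List[Mag]:
    """Keep MAGs that are both high quality and novel."""
    return [m for m in mags if is_high_quality(m) and is_novel(m)]


# Made-up example values, for illustration only.
example = [
    Mag("bin_1", completeness=97.2, contamination=1.1, best_reference_ani=98.6),
    Mag("bin_2", completeness=93.5, contamination=2.4, best_reference_ani=87.0),
    Mag("bin_3", completeness=71.0, contamination=3.0, best_reference_ani=0.0),
]
selected = select_for_augmentation(example)
print([m.name for m in selected])
```

In this made-up example only `bin_2` is selected: `bin_1` matches a reference genome above the ANI cutoff, and `bin_3` falls below the completeness cutoff. The selected genomes would then be passed to the database build step, which is not shown here.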

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00