Negligible effect of host DNA on metagenomics analysis enables microbial ecology investigation in historical samples

preprint OA: closed
Full text JSON View at publisher
Full text 2,784 characters · extracted from oa-doi-fallback · 2 sections · click to expand

Abstract

Microbiome composition and function are strongly influenced by its environmental factors, with major shifts driven by intensified anthropogenic pressures over the past centuries. This timeframe extends beyond the scope of traditional experimental and longitudinal studies commonly used to investigate microbiome dynamics. The vast collection of historical samples available in museums and herbaria around the world represent a largely untapped resource for exploring host-microbiome interactions across broader temporal and spatial scales. However, their potential remains underutilized due to incompatibilities with standard analytical pipelines and limited knowledge of optimal classification parameters. While host DNA removal has traditionally considered essential for accurate taxonomic assignment of metagenomic reads, this step is often impractical for many historical samples due to the lack of reference genomes for their host species. Here, we demonstrated that host DNA content does not significantly affect key microbial ecological metrics such as alpha- and beta-diversity. Additionally, metagenomic reads from historical samples are often highly fragmented due to post-mortem degradation. Using k-mer analysis of genomic sequences from hosts and their associated microbiomes, we show that reads as short as 21 bp can still produce reliable results, enabling the recovery of microbial signals that would otherwise be discarded. Overall, this study provides a solid foundation for incorporating natural history collections into host-associated microbiome research, offering valuable insights into the long-term effects of anthropogenic change on microbial communities. Supplementary Material File (benchmarking paper version 10 (clean).docx) - Download - 7.78 MB Information & Authors Information Version history Copyright This work is licensed under a Non Exclusive No Reuse License.

Keywords

Authors Metrics & Citations Metrics Article Usage 238views 208downloads Citations Download citation Siu-Kin Ng, Rafal Gutaker. Negligible effect of host DNA on metagenomics analysis enables microbial ecology investigation in historical samples. Authorea. 21 April 2025. DOI: https://doi.org/10.22541/au.174521358.82122146/v1 DOI: https://doi.org/10.22541/au.174521358.82122146/v1 If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download. For more information or tips please see 'Downloading to a citation manager' in the Help menu.

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-doi-fallback

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00