A reusable model of pangenome selection informs optimal surveillance strategies over vaccine introductions

preprint OA: closed CC-BY-4.0
📄 Open PDF Full text JSON View at publisher
Full text 2,758 characters · extracted from oa-doi-fallback · 4 sections · click to expand

Abstract

Background The human pathogen Streptococcus pneumoniae is a major cause of disease, including pneumonia and meningitis. The introduction of Pneumococcal Conjugate Vaccines (PCVs) initially reduced the burden of disease through a reduction of colonisation by vaccine-targeted serotypes. However, since PCVs only target a proportion of pneumococcal serotypes, they shift intraspecific competition, eventually allowing non-targeted types to ‘replace’ vaccine types. Understanding the host and pathogen factors causing replacement is important for future vaccine development. Mechanistic understanding of vaccine replacement dynamics is crucial for forecasting and optimisation of genomic surveillance strategies to evaluate realised vaccine effectiveness.

Methods

We developed a mathematical model of the genomic and demographic factors which explain vaccine replacement, used this model to replicate serotype-frequency changes, and investigated cost-effective genomic surveillance strategies. We extended a forward-time model based on the Wright-Fisher model, developing a user-friendly model framework that describes the post-vaccine dynamics of S. pneumoniae populations. Our model describes vaccine replacement as a function of vaccine impact, immigration of new strains, and negative frequency-dependent selection (NFDS) on the accessory genome content.

Results

We used our model to study vaccine replacement in newly sequenced genomic surveillance data from Nepal, and existing data from the US, and the UK, with distinct surveillance strategies. We showed that the model with NFDS better replicates replacement dynamics than a null model without NFDS, and that NFDS likely only acts on part of the S. pneumoniae accessory genome. We found consistent estimates for vaccination effectiveness across the different study locations and country-specific genes under NFDS, highlighting the importance of conducting genomic surveillance in each country of interest. By simulating data from the model, we showed that an optimal surveillance strategy prioritises per-sampling sample size over sampling frequency for small sampling budgets.

Conclusions

Our model can be used to predict vaccine replacement dynamics after PCV introduction, and can be easily reapplied to analyse new data from vaccine introductions or new regions. Our model is available in the R package STUBENTIGER (Studying Balancing Evolution (NFDS) To Investigate Genome Replacement) on GitHub https://github.com/bacpop/Stubentiger. Competing Interest Statement N.J.C. has consulted for Antigen Discovery Inc. and Pfizer and has received an investigator-initiated award from GlaxoSmithKline, for work not directly related to this study. The other authors declare no competing interests.

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-doi-fallback

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00
unpaywall
last seen: 2026-05-23T02:00:01.238055+00:00
License: CC-BY-4.0