The Portable Microhaplotype Object and Tools

preprint OA: closed

Abstract

Motivation

The rapid increase in the generation of targeted sequencing data offers immense potential for research, medicine, and public health; however, the lack of an established standard for these data has led to disparate solutions for data storage. A widely accepted standard is essential for data sharing, reuse, and the coordinated development of interoperable analysis tools.

Results

We propose the Portable Microhaplotype Object (PMO), a standardized format for efficiently and losslessly storing phased targeted sequencing data (microhaplotypes). The PMO format is JSON-based, allowing efficient, relational storage of genetic data together with relevant metadata to minimize orphaned data. The format includes required fields and a curated set of optional fields leveraging established ontologies. To facilitate ease of use, we developed pmotools-python, an open-source package for creating, manipulating, and exporting PMO data into common formats. Additionally, we provide a simple web-based app to quickly create PMO files from tabular inputs, making the format accessible to a wide variety of users. Example datasets from Plasmodium, Anopheles, Escherichia coli, and Staphylococcus aureus demonstrate the broad applicability of the approach. PMO will streamline data sharing, foster interoperability, and accelerate the development of harmonized analysis tools.

Availability and implementation

The Portable Microhaplotype Object (PMO) project, including the ontology specification, software tools, example datasets, and tutorials, is freely available at https://plasmogenepi.github.io/PMO_Docs/. Key software components and datasets have archived releases with DOIs to ensure permanence, detailed in the Supplementary Text 1-5.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

Contact: nickjhathaway{at}gmail.com
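As a rough illustration of the JSON-based, relational storage described in the Results, the sketch below keeps samples, targets, and haplotype sequences in separate tables and links them by index, so that sequences and metadata travel in one file. All field names here (`targets`, `samples`, `haplotypes`, `detections`, `target_id`, and so on) are hypothetical and chosen only for this example; they are not taken from the PMO specification, and the code uses only the Python standard library rather than pmotools-python.

```python
import json

# Hypothetical, simplified layout for illustration only.
# These field names are assumptions, not the actual PMO schema.
pmo_like = {
    "targets": [{"target_name": "t1"}, {"target_name": "t2"}],
    "samples": [{"sample_name": "s1", "collection_country": "KE"}],
    # Each unique haplotype sequence is stored once and points to its target.
    "haplotypes": [
        {"target_id": 0, "seq": "ACGTAC"},
        {"target_id": 0, "seq": "ACGTTC"},
        {"target_id": 1, "seq": "GGCATA"},
    ],
    # Each detection links a sample to a haplotype by index.
    "detections": [
        {"sample_id": 0, "haplotype_id": 0, "reads": 120},
        {"sample_id": 0, "haplotype_id": 1, "reads": 30},
        {"sample_id": 0, "haplotype_id": 2, "reads": 95},
    ],
}


def to_rows(obj):
    """Resolve the index links into a flat, tabular list of records."""
    rows = []
    for det in obj["detections"]:
        hap = obj["haplotypes"][det["haplotype_id"]]
        rows.append(
            {
                "sample": obj["samples"][det["sample_id"]]["sample_name"],
                "target": obj["targets"][hap["target_id"]]["target_name"],
                "seq": hap["seq"],
                "reads": det["reads"],
            }
        )
    return rows


# Serializing to JSON and parsing back returns an equal object.
restored = json.loads(json.dumps(pmo_like))
rows = to_rows(restored)
```

Storing each sequence once and referring to it by index is one common way to keep such a file compact while still allowing export to a flat table; whether PMO links its tables in exactly this way is not stated in the abstract.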

