Mapping and Harmonization of CVX vaccine terms to the Vaccine Ontology

preprint OA: closed
📄 Open PDF Full text JSON View at publisher
Full text 3,437 characters · extracted from oa-doi-fallback · 3 sections · click to expand

Abstract

Background With many vaccines developed and used, it is critical to standardize vaccine information. The OHDSI OMOP Common Data Model (CDM), widely used to support EHR data integration and analysis, leverages CVX, RxNorm, and RxNorm Extension codes to standardize vaccine-related records. However, these terminologies lack robust semantic relations, making the vaccine classification ineffective in OMOP CDM. To address this issue, our OHDSI Vaccine Vocabulary Working Group proposes to use the Vaccine Ontology (VO) to map these standards and build up its own semantic relations. As a first study of the work, we performed the mapping and alignment of the Vaccine Administered (CVX) codes with the VO using a combination of semi-automatic and manual mapping methods.

Results

A total of 273 CVX terms were first collected and classified. A high-level VO design pattern and an exact one-to-one mapping strategy were developed to guide the CVX-to-VO term mapping. To facilitate the manual mapping and harmonization process, we also developed and evaluated three semi-automated mapping approaches utilizing lexical and semantic information of vaccine concepts to map CVX to VO. These approaches suggested candidate VO mappings for CVX terms and also indicated CVX terms that were unmappable to VO and required new term additions to VO. The application of the best approach to the 2022-10-05 release of VO achieved an accuracy of 85.55% for its suggestions. The suggestions made by the semi-automated approaches were taken into account to further enhance the mappings, which led to our eventual mapping of all CVX terms to the latest version of VO. We innovatively proposed the inclusion of the ‘passive vaccine’ branch in VO, which includes 24 immunoglobulins and antitoxins from CVX as passive vaccines. A specific CVX-VO OWL file was developed and added to the VO GitHub. Use case queries were developed to demonstrate its support for computer-assisted queries of vaccine groups based on CVX-VO hierarchies.

Conclusion

All CVX terms were mapped to the VO using our combined semi-automatic and manual mapping methods. The mapped results enhanced semantic vaccine classification, providing a basis for further OMOP vaccine classification and EHR data analysis. Competing Interest Statement The authors have declared no competing interest. Footnotes ↵* These authors share first authorship; ↵# Co-corresponding authors Contact information: Yuanyi Pan: yuanyipan19{at}gmail.com; Warren Manuel: Warren.C.Manuel{at}uth.tmc.edu; Rashmie Abeysinghe: Rashmie.Abeysinghe{at}uth.tmc.edu; Zheng Jie: jiezhen{at}umich.edu; Alexander Davydov: davydov{at}ohdsi.org; Qi Yang: qi.yang1{at}iqvia.com; Asiyah Yu Lin: linikujp{at}gmail.com https://github.com/vaccineontology/VO/blob/master/mappings/vo-cvx.sssom.tsv Abbreviation - EHR - electronic health records - OMOP CDM - Observational Medical Outcomes Partnership Common Data Model - CVX - CDC Vaccine Administered code - HL7 - Health Level Seven International - VO - Vaccine Ontology - OHDSI - Observational Health Data Sciences and Informatics - OBO - Open Biological and Biomedical Ontology - NCIRD - National Center of Immunization and Respiratory Diseases - CDC - Centers for Disease Control and Prevention - VAERS - Vaccine Adverse Event Analysis System - HIPC - Human Immunology Project Consortium - VAC - Vaccine Adjuvant Compendium - Vaccine Vocab WG - Vaccine Vocabulary Working Group

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-doi-fallback

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00