# A governance framework for medical code standardization to enhance multi-institutional data quality

**Authors:** Takanori Yamashita, Shin-Ichi Shibata, Atsushi Takada, Taeko Hotta, Rieko Izukura, Dongchon Kang, Naoki Nakashima

PMC · DOI: 10.1186/s12911-026-03397-1 · BMC Medical Informatics and Decision Making · 2026-02-25

## TL;DR

This paper introduces a governance framework to standardize medical codes across institutions, improving data quality for collaborative research.

## Contribution

The novel contribution is a practical governance model for medical code standardization that enhances multi-institutional data integration.

## Key findings

- After 1.5 years, correct code assignment rates reached 36% for drugs, 29% for lab tests, and 67% for diseases.
- Differential data accumulation enabled continuous monitoring and problem identification at each institution.
- Manual and complex code systems like JLAC-10 limited registration rates, especially for lab tests.

## Abstract

Data quality management is crucial for performing integrated analyses of medical data across multiple institutions, and mapping facility-specific local codes to standardized codes is a critical component of this process. This study aimed to improve the medical data quality of Medical Information Database Network (MID-NET®)-cooperating institutions by developing and implementing a governance framework for medical code standardization.

A governance center was established at Kyushu University Hospital, which developed a differential output tool for detecting change logs in local and standardized codes. This tool was introduced to 18 MID-NET institutions to extract differences between updates and securely transfer them to the governance center. The governance procedures involved collecting and verifying mapping tables, assigning standard codes (HOT, JLAC-10, or ICD-10), and distributing updates to cooperating institutions. The full-scale operation of the governance process began in July 2020, facilitating continuous improvement in mapping accuracy and efficiency. The most optimal standardized code was proposed by medical professionals, and feedback was provided monthly to each institution.

After approximately 1.5 years of governance, the correct standardized code assignment rates across all cooperating institutions were 36% for drugs, 29% for laboratory tests, and 67% for diseases. These values reflected the real-world baseline of standard code utilization in MID-NET institutions, where standardized codes had not been systematically assigned prior to governance implementation. Despite the monthly proposals provided by the governance center, the increase in registrations remained modest, particularly for laboratory tests, where the JLAC-10 codes were complex, highlighting the difficulty of achieving high coverage. However, the accumulation of differential data allowed for continuous monitoring of registration status and provided insights into problems and solutions at each institution. Mechanisms for semi-automatic registration and expansion of the governance system across multiple institutions and vendors were considered to further improve registration rates.

Maintaining high-quality data is crucial for ensuring reliable clinical collaboration and establishing a foundation for the secondary use of real-world data. This governance model provides a practical framework for data-driven projects that integrate centralized repositories with local electronic medical records, not only within MID-NET but also for other clinical research database initiatives.

The online version contains supplementary material available at 10.1186/s12911-026-03397-1.

## Full-text entities

- **Genes:** ALB (albumin) [NCBI Gene 213] {aka FDAHT, HSA, PRO0883, PRO0903, PRO1341}
- **Diseases:** Diseases (MESH:D004194), MID-NET (MESH:D000069279), Related Health Problems (MESH:D000076082), HOT (OMIM:613339), RWD (MESH:D016773), PMDA (MESH:D009471)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13041221/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13041221/full.md

## References

10 references — full list in the complete paper: https://tomesphere.com/paper/PMC13041221/full.md

---
Source: https://tomesphere.com/paper/PMC13041221