# Variations in Using Diagnosis Codes for Defining Age-Related Macular Degeneration Cohorts

**Authors:** Fritz Gerald Paguiligan Kalaw, Jimmy S. Chen, Sally L. Baxter

PMC · DOI: 10.3390/informatics11020028 · Informatics (MDPI) · 2025-02-26

## TL;DR

Studies vary substantially in which ICD codes they use to identify patients with age-related macular degeneration (AMD), so AMD cohorts can be under- or overrepresented.

## Contribution

The study reveals a lack of standardization in using ICD codes for AMD, which affects cohort accuracy and reproducibility.

## Key findings

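- Only 2 of 30 studies (7%) using ICD-9 or ICD-9-CM defined the AMD codes correctly, compared with 7 of 9 (78%) using ICD-10 terminologies (p = 0.0001).
- Of 43 cohort definitions drawn from 37 articles, 31 (72%) had missing or incomplete AMD codes, and only 9 (21%) used the exact codes.
- 13 of 37 articles (35%) included ICD codes outside the scope of AMD diagnosis.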
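The categories in these findings (exact, missing or incomplete, out of scope) can be illustrated as a set comparison between the codes a study used and a reference code set. The following is a minimal sketch, not the authors' procedure: the function name `audit_cohort_definition` is hypothetical, and the reference set of ICD-9-CM codes (362.50, 362.51, 362.52) is an assumption chosen for illustration, since this summary does not list the codes the paper treated as correct.

```python
# Assumed reference set for illustration only; the paper's own reference
# codes are not given in this summary.
REFERENCE_AMD_ICD9CM = frozenset({"362.50", "362.51", "362.52"})


def audit_cohort_definition(used_codes, reference_codes=REFERENCE_AMD_ICD9CM):
    """Compare a study's code list against a reference code set.

    Returns which reference codes are missing and which used codes
    fall outside the reference set.
    """
    used = {code.strip() for code in used_codes}
    missing = sorted(reference_codes - used)   # reference codes the study omitted
    extra = sorted(used - reference_codes)     # codes outside the reference set
    return {
        "exact": not missing and not extra,
        "missing": missing,
        "out_of_scope": extra,
    }


# Hypothetical cohort definition that omits one code and adds another.
print(audit_cohort_definition(["362.51", "362.52", "362.53"]))
```

A single definition can be both incomplete and out of scope, so the sketch reports the two lists separately instead of forcing one label.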

## Abstract

Data harmonization is vital for secondary electronic health record data analysis, especially when combining data from multiple sources. Currently, there is a gap in knowledge as to how studies identify cohorts of patients with age-related macular degeneration (AMD), a leading cause of blindness. We hypothesize that there is variation in using medical condition codes to define cohorts of AMD patients that can lead to either the under- or overrepresentation of such cohorts. This study identified articles studying AMD using the International Classification of Diseases (ICD-9, ICD-9-CM, ICD-10, and ICD-10-CM). The data elements reviewed included the year of publication; dataset origin (Veterans Affairs, registry, national or commercial claims database, and institutional EHR); total number of subjects; and ICD codes used. A total of thirty-seven articles were reviewed. Six (16%) articles used cohort definitions from two ICD terminologies. The Medicare database was the most used dataset (14, 38%), and there was a noted increase in the use of other datasets in the last few years. We identified substantial variation in the use of ICD codes for AMD. For the studies that used ICD-10 terminologies, 7 (out of 9, 78%) defined the AMD codes correctly, whereas, for the studies that used ICD-9 and ICD-9-CM terminologies, only 2 (out of 30, 7%) defined and utilized the appropriate AMD codes (p = 0.0001). Of the 43 cohort definitions used from 37 articles, 31 (72%) had missing or incomplete AMD codes, and only 9 (21%) used the exact codes. Additionally, 13 articles (35%) captured ICD codes that were not within the scope of AMD diagnosis. Efforts to standardize data are needed to provide a reproducible research output.
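The comparison of 7 of 9 against 2 of 30 can be checked with a two-sided Fisher's exact test on the corresponding 2×2 table. This summary does not state which test the authors used, so the choice of Fisher's exact test here is an assumption, and the helper name `fisher_exact_two_sided` is hypothetical. The sketch uses only the Python standard library.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of every table with the same
    margins that is no more likely than the observed one.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Probability that the top-left cell equals x given fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    p_observed = prob(a)
    # Small relative tolerance guards against floating-point ties.
    return sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-9)
    )


# Rows: ICD-10 studies, ICD-9/ICD-9-CM studies.
# Columns: codes defined correctly, codes not defined correctly.
p_value = fisher_exact_two_sided(7, 2, 2, 28)
print(f"two-sided p = {p_value:.6f}")
```

Under these assumptions the p-value comes out below 0.001, which agrees in order of magnitude with the reported p = 0.0001.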

## Linked entities

- **Diseases:** age-related macular degeneration (MONDO:0005150), AMD (MONDO:0005150)

## Full-text entities

- **Diseases:** blindness (MESH:D001766), Diseases (MESH:D004194), AMD (MESH:D008268)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11864795/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11864795/full.md

## References

66 references — full list in the complete paper: https://tomesphere.com/paper/PMC11864795/full.md

---
Source: https://tomesphere.com/paper/PMC11864795