# Unveiling the untreated: development of a database algorithm to identify potential Fabry disease patients in Germany

**Authors:** Max J. Hilz, Nicole Lyn, Felix Marczykowski, Barbara Werner, Marc Pignot, Elvira Ponce, Joseph Bender, Michael Edigkaufer, Pronabesh DasMahapatra

PMC · DOI: 10.1186/s13023-024-03258-y · Orphanet Journal of Rare Diseases · 2024-07-09

## TL;DR

A new algorithm was developed to identify potential Fabry disease patients in Germany using insurance claims data, aiming to improve early detection.

## Contribution

The study introduces a logistic regression model using ICD-10-GM codes to identify untreated Fabry disease patients in a German database.

## Key findings

- The model identified 284 potential Fabry disease patients using insurance claims data.
- The model achieved 80.4% sensitivity and 79.8% specificity at a cut-point of 0.08.
- Potential FD patients were predominantly under 30 and female, with higher incidence rates than treated FD patients.

## Abstract

Fabry disease (FD), an X-linked lysosomal storage disorder, is caused by mutations in the gene encoding α-galactosidase A, resulting in lysosomal accumulation of globotriaosylceramide and other glycosphingolipids. Early detection of FD is challenging, accounting for delayed diagnosis and treatment initiation. This study aimed to develop an algorithm using a logistic regression model to facilitate early identification of patients based on ICD-10-GM coding using a German Sickness Fund Database.

The logistic regression model was fitted on a binary outcome variable based on either a treated FD cohort or a control cohort (without FD). Comorbidities specific to the involved organs were used as covariates to identify potential FD patients with ICD-10-GM E75.2 diagnosis but without any FD-specific medication. Specificity and sensitivity of the model were optimized to determine a likely threshold. The cut-point with the largest values for the Youden index and concordance probability method and the lowest value for closest to (0,1) was identified as 0.08 for each respective value. The sensitivity and specificity for this cut-point were 80.4% and 79.8%, respectively. Additionally, a sensitivity analysis of the potential FD patients with at least two codes of E75.2 diagnoses was performed.

A total of 284 patients were identified in the potential FD cohort using the logistic regression model. Most potential FD patients were < 30 years old and female. The identification and incidence rates of FD in the potential FD cohort were markedly higher than those of the treated FD cohort.

This model serves as a tool to identify potential FD patients using German insurance claims data.

The online version contains supplementary material available at 10.1186/s13023-024-03258-y.

## Linked entities

- **Diseases:** Fabry disease (MONDO:0010526)

## Full-text entities

- **Genes:** GLA (galactosidase alpha) [NCBI Gene 2717] {aka GALA}
- **Diseases:** X-linked lysosomal storage disorder (MESH:D016464), FD (MESH:D000795)
- **Chemicals:** glycosphingolipids (MESH:D006028), globotriaosylceramide (MESH:C018549)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11234697/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC11234697/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/PMC11234697/full.md

---
Source: https://tomesphere.com/paper/PMC11234697