# Capturing biomarkers associated with Alzheimer disease subtypes using data distribution characteristics

**Authors:** Kenneth Smith, Sharlee Climer

PMC · DOI: 10.3389/fncom.2024.1388504 · Frontiers in Computational Neuroscience · 2024-09-03

## TL;DR

This paper introduces a new method to identify biomarkers for specific Alzheimer's disease subtypes by analyzing data distribution patterns.

## Contribution

The novel Bimodality Coefficient Difference (BCD) metric captures subtype-specific biomarkers in heterogeneous diseases.

## Key findings

- BCD outperforms traditional methods in identifying biomarkers for AD subtypes in synthetic data trials.
- Application of BCD to gene expression data revealed potential biomarkers for heterogeneous Alzheimer's subtypes.
- The method effectively identifies analytes associated with subsets of diseased cases.

## Abstract

Late-onset Alzheimer disease (AD) is a highly complex disease with multiple subtypes, as demonstrated by its disparate risk factors, pathological manifestations, and clinical traits. Discovery of biomarkers to diagnose specific AD subtypes is a key step towards understanding biological mechanisms underlying this enigmatic disease, generating candidate drug targets, and selecting participants for drug trials. Popular statistical methods for evaluating candidate biomarkers, fold change (FC) and area under the receiver operating characteristic curve (AUC), were designed for homogeneous data and we demonstrate the inherent weaknesses of these approaches when used to evaluate subtypes representing less than half of the diseased cases. We introduce a unique evaluation metric that is based on the distribution of the values, rather than the magnitude of the values, to identify analytes that are associated with a subset of the diseased cases, thereby revealing potential biomarkers for subtypes. Our approach, Bimodality Coefficient Difference (BCD), computes the difference between the degrees of bimodality for the cases and controls. We demonstrate the effectiveness of our approach with large-scale synthetic data trials containing nearly perfect subtypes. In order to reveal novel AD biomarkers for heterogeneous subtypes, we applied BCD to gene expression data for 8,650 genes for 176 AD cases and 187 controls. Our results confirm the utility of BCD for identifying subtypes of heterogeneous diseases.

## Linked entities

- **Diseases:** Alzheimer disease (MONDO:0004975), Alzheimer's disease (MONDO:0004975)

## Full-text entities

- **Diseases:** AD (MESH:D000544)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11413970/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11413970/full.md

## References

53 references — full list in the complete paper: https://tomesphere.com/paper/PMC11413970/full.md

---
Source: https://tomesphere.com/paper/PMC11413970