# MD3F: Multivariate Distance Drift Diffusion Framework for High-Dimensional Datasets

**Authors:** Jessica Zielinski, Patricia Corby, Alexander V. Alekseyenko

PMC · DOI: 10.3390/genes15050582 · Genes · 2024-05-03

## TL;DR

The paper introduces MD3F, a new method for analyzing high-dimensional biomedical data over time, which improves statistical power and handles complex dynamics between health and disease.

## Contribution

The novel contribution is a multivariate distance-based drift-diffusion framework (MD3F) for longitudinal omics data analysis.

## Key findings

- MD3F provides valid and powerful hypothesis testing for longitudinal high-throughput datasets.
- The framework is robust and broadly applicable for assessing multivariate dynamics in omics data.
- Simulation and application studies confirm the effectiveness of MD3F.

## Abstract

High-dimensional biomedical datasets have become easier to collect in the last two decades with the advent of multi-omic and single-cell experiments. These can generate over 1000 measurements per sample or per cell. More recently, focus has been drawn toward the need for longitudinal datasets, with the appreciation that important dynamic changes occur along transitions between health and disease. Analysis of longitudinal omics data comes with many challenges, including type I error inflation and corresponding loss in power when thousands of hypothesis tests are needed. Multivariate analysis can yield approaches with higher statistical power; however, multivariate methods for longitudinal data are currently limited. We propose a multivariate distance-based drift-diffusion framework (MD3F) to tackle the need for a multivariate approach to longitudinal, high-throughput datasets. We show that MD3F can result in surprisingly simple yet valid and powerful hypothesis testing and estimation approaches using generalized linear models. Through simulation and application studies, we show that MD3F is robust and can offer a broadly applicable method for assessing multivariate dynamics in omics data.

## Full-text entities

- **Genes:** CD34 (CD34 molecule) [NCBI Gene 947]
- **Diseases:** head and neck cancer (MESH:D006258), cancer (MESH:D009369), injury to people or property (MESH:C000719191), amyotrophic lateral sclerosis (MESH:D000690), Diabetes (MESH:D003920), Mucositis (MESH:D052016), multiple myelomas (MESH:D009101), Type 1 diabetes (MESH:D003922), oral mucositis (MESH:D013280), gut dysbiosis (MESH:D064806)
- **Chemicals:** MD3F (-)
- **Species:** Bacteria Latreille et al. 1825 (Bacteria stick insect, genus) [taxon 629395], Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11120789/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11120789/full.md

## References

34 references — full list in the complete paper: https://tomesphere.com/paper/PMC11120789/full.md

---
Source: https://tomesphere.com/paper/PMC11120789