# Joint modeling of longitudinal biomarker and survival outcomes with the presence of competing risk in the nested case–control studies with application to the TEDDY microbiome dataset

**Authors:** Yanan Zhao, Ting-Fang Lee, Boyan Zhou, Chan Wang, Ann Marie Schmidt, Mengling Liu, Huilin Li, Jiyuan Hu

PMC · DOI: 10.1093/bioinformatics/btag038 · Bioinformatics · 2026-01-22

## TL;DR

This paper introduces a new statistical framework for analyzing longitudinal biomarker data and survival outcomes in nested case-control studies, particularly when competing risks are present.

## Contribution

The novel contribution is a joint modeling framework (JM-NCC) that handles non-normal biomarkers and competing risks in nested case-control designs.

## Key findings

- The proposed JM-NCC framework demonstrates robustness and efficiency in simulation studies.
- Application to the TEDDY microbiome dataset validates the framework's practical utility.

## Abstract

Large-scale prospective cohort studies collect longitudinal biospecimens alongside time-to-event outcomes to investigate biomarker dynamics in relation to disease risk. The nested case–control (NCC) design provides a cost-effective alternative to full cohort biomarker studies while preserving statistical efficiency. Despite advances in joint modeling for longitudinal and time-to-event outcomes, few approaches address the unique challenges posed by NCC sampling, non-normally distributed biomarkers, and competing survival outcomes.

Motivated by the TEDDY study, we propose “JM-NCC”, a joint modeling framework designed for NCC studies with competing events. It integrates a generalized linear mixed-effects model for potentially non-normally distributed biomarkers with a cause-specific hazard model for competing risks. Two estimation methods are developed. fJM-NCC leverages NCC sub-cohort longitudinal biomarker data and full cohort survival and clinical metadata, while wJM-NCC uses only NCC sub-cohort data. Both simulation studies and an application to TEDDY microbiome dataset demonstrate the robustness and efficiency of the proposed methods.

Software is available at https://github.com/Zhaoyn-oss/JMNCC and archived on Zenodo at https://zenodo.org/records/18199759 (DOI: 10.5281/zenodo.18199759).

## Full-text entities

- **Genes:** DCLK3 (doublecortin like kinase 3) [NCBI Gene 85443] {aka CLR, DCAMKL3, DCDC3C, DCK3}
- **Diseases:** T1D (MESH:D003922), Diabetes (MESH:D003920), JM-NCC (MESH:C536209), inflammatory bowel disease (MESH:D015212), IA (MESH:D007516), cancer (MESH:D009369), Diabetes and Digestive and Kidney Diseases (MESH:D003928)
- **Chemicals:** GADA (-)
- **Species:** Homo sapiens (human, species) [taxon 9606], Bacillus cereus (species) [taxon 1396], gut metagenome (species) [taxon 749906], Bifidobacterium breve (species) [taxon 1685]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13005730/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13005730/full.md

## References

55 references — full list in the complete paper: https://tomesphere.com/paper/PMC13005730/full.md

---
Source: https://tomesphere.com/paper/PMC13005730