A Machine Learning Approach to Predict Biological Age and its Longitudinal Drivers
Nazira Dunbayeva, Yulong Li, Yutong Xie, Imran Razzak

TL;DR
This study presents a machine learning pipeline that predicts biological age by incorporating longitudinal biomarker changes, outperforming traditional models and emphasizing the importance of health trajectories for personalized aging assessments.
Contribution
The paper introduces a novel feature engineering approach capturing biomarker change rates, significantly enhancing biological age prediction accuracy over static models.
Findings
Engineered slope features improve age prediction accuracy.
Longitudinal health trajectories are key determinants of biological age.
Model outperforms traditional linear and ensemble models.
Abstract
Predicting an individual's aging trajectory is a central challenge in preventative medicine and bioinformatics. While machine learning models can predict chronological age from biomarkers, they often fail to capture the dynamic, longitudinal nature of the aging process. In this work, we developed and validated a machine learning pipeline to predict age using a longitudinal cohort with data from two distinct time periods (2019-2020 and 2021-2022). We demonstrate that a model using only static, cross-sectional biomarkers has limited predictive power when generalizing to future time points. However, by engineering novel features that explicitly capture the rate of change (slope) of key biomarkers over time, we significantly improved model performance. Our final LightGBM model, trained on the initial wave of data, successfully predicted age in the subsequent wave with high accuracy ($R^2 =…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
