CoDaS: AI Co-Data-Scientist for Biomarker Discovery via Wearable Sensors
Yubin Kim, Salman Rahman, Samuel Schmidgall, Chunjong Park, A. Ali Heydari, Ahmed A. Metwally, Hong Yu, Xin Liu, Xuhai Xu, Yuzhe Yang, Maxwell A. Xu, Zhihan Zhang, Cynthia Breazeal, Tim Althoff, Petar Sirkovic, Ivor Rendulic, Annalisa Pawlosky, Nicolas Stroppa, Juraj Gottweis

TL;DR
CoDaS is an AI multi-agent system that systematically discovers and validates digital biomarkers from wearable sensor data for mental health and metabolic conditions, improving predictive models.
Contribution
Introduces CoDaS, a novel multi-agent framework that automates hypothesis generation, validation, and biomarker discovery from large-scale wearable datasets with human oversight.
Findings
Identified 41 digital biomarkers for mental health and 25 for metabolic outcomes.
Validated circadian instability features in depression cohorts with significant correlations.
Improved predictive performance for depression and insulin resistance using CoDaS-derived features.
Abstract
Scientific discovery in digital health requires converting continuous physiological signals from wearable devices into clinically actionable biomarkers. We introduce CoDaS (AI Co-Data-Scientist), a multi-agent system that structures biomarker discovery as an iterative process combining hypothesis generation, statistical analysis, adversarial validation, and literature-grounded reasoning with human oversight using large-scale wearable datasets. Across three cohorts totaling 9,279 participant-observations, CoDaS identified 41 candidate digital biomarkers for mental health and 25 for metabolic outcomes, each subjected to an internal validation battery spanning replication, stability, robustness, and discriminative power. Across two independent depression cohorts, CoDaS surfaced circadian instability-related features in both datasets, reflected in sleep duration variability (DWB, \rho =…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
