Large-scale Training of Foundation Models for Wearable Biosignals
Salar Abbaspourazad, Oussama Elachqar, Andrew C. Miller, Saba Emrani,, Udhyakumar Nallasamy, Ian Shapiro

TL;DR
This paper presents the development of large-scale foundation models for wearable biosignals, specifically PPG and ECG, using self-supervised learning on extensive consumer device data to improve health monitoring without needing labeled datasets.
Contribution
It introduces the first large-scale foundation models for PPG and ECG biosignals trained on data from over 140,000 participants, leveraging self-supervised learning to overcome data annotation limitations.
Findings
Models encode demographic and health information.
Pre-trained models generalize well across modalities.
Potential to enhance wearable health monitoring.
Abstract
Tracking biosignals is crucial for monitoring wellness and preempting the development of severe medical conditions. Today, wearable devices can conveniently record various biosignals, creating the opportunity to monitor health status without disruption to one's daily routine. Despite widespread use of wearable devices and existing digital biomarkers, the absence of curated data with annotated medical labels hinders the development of new biomarkers to measure common health conditions. In fact, medical datasets are usually small in comparison to other domains, which is an obstacle for developing neural network models for biosignals. To address this challenge, we have employed self-supervised learning using the unlabeled sensor data collected under informed consent from the large longitudinal Apple Heart and Movement Study (AHMS) to train foundation models for two common biosignals:…
Peer Reviews
Decision·ICLR 2024 poster
- Important domain with very limited available models - Solid results across an impressive array of downstream tasks - Careful tuning and experimentation considering the idiosyncrasies of the data
- Some missing references and links to previous works - Lack of discussion around scaling up the proposed models - No discussion around model/data release
- The paper is well-articulated. The authors clearly presented the proposed method, experimental setup, and analysis. - To the best of my knowledge, this represents the first attempt at training a self-supervised learning (SSL) foundational model for PPG and ECG data on such a grand scale. The results from this process offer valuable scientific insights. - The authors conducted an exhaustive evaluation of the pretrained models. These pretrained embeddings were assessed against more than 50 dis
- There are potential concerns on the technical methodology front. While this study is the product of extensive training and evaluation, much of the methodology draws from pre-existing studies. Although there are several SSL studies tailored for time-series data, particularly in the realm of biosignals in healthcare, the authors did not extensively compare different model architectures. Readers might be keen to discern whether biosignal SSL performance is more contingent upon scale or the model
Strengths: 1. **First Work on Foundation Models for Wearable Biosignals**: The research stands out as the pioneering effort to develop foundation models specifically for biosignals—photoplethysmography (PPG) and electrocardiogram (ECG)—collected via wearable devices. Such biosignals offer a treasure trove of biological and cardiac information, which can be instrumental in monitoring users' overall health. The convenience of wearable devices combined with the potential of these foundation models
Areas for improvement: 1. **Exploration of KoLeo Regularization**: An area of potential exploration is the specific impact of the KoLeo regularization on the model's performance. Ablation studies that incrementally remove or vary the strength of KoLeo regularization could provide clarity on its role and efficacy. Such an analysis would help in understanding whether the regularization is crucial for the model's success, and to what extent it contributes to the overall performance. This is partic
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Health and mHealth Applications
