Facial Video-based Remote Physiological Measurement via Self-supervised Learning
Zijie Yue, Miaojing Shi, Shuai Ding

TL;DR
This paper introduces a self-supervised learning framework for remote physiological measurement from facial videos, eliminating the need for ground truth PPG signals and achieving state-of-the-art results.
Contribution
It proposes a novel frequency-inspired self-supervised approach with frequency augmentation and specialized loss functions for rPPG estimation without annotated data.
Findings
Outperforms existing methods on four benchmarks.
Accurately estimates heart rate and respiration frequency.
Effective in learning from unlabeled facial videos.
Abstract
Facial video-based remote physiological measurement aims to estimate remote photoplethysmography (rPPG) signals from human face videos and then measure multiple vital signs (e.g. heart rate, respiration frequency) from rPPG signals. Recent approaches achieve it by training deep neural networks, which normally require abundant facial videos and synchronously recorded photoplethysmography (PPG) signals for supervision. However, the collection of these annotated corpora is not easy in practice. In this paper, we introduce a novel frequency-inspired self-supervised framework that learns to estimate rPPG signals from facial videos without the need of ground truth PPG signals. Given a video sample, we first augment it into multiple positive/negative samples which contain similar/dissimilar signal frequencies to the original one. Specifically, positive samples are generated using spatial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNon-Invasive Vital Sign Monitoring · ECG Monitoring and Analysis · Optical Imaging and Spectroscopy Techniques
