PhysFormer++: Facial Video-based Physiological Measurement with SlowFast Temporal Difference Transformer
Zitong Yu, Yuming Shen, Jingang Shi, Hengshuang Zhao, Yawen Cui,, Jiehua Zhang, Philip Torr, Guoying Zhao

TL;DR
This paper introduces PhysFormer++: a novel video transformer architecture for contactless physiological measurement from facial videos, leveraging long-range spatio-temporal features and advanced training strategies to outperform existing methods.
Contribution
The paper proposes PhysFormer and PhysFormer++, innovative transformer-based models that effectively capture long-range spatio-temporal features for rPPG, with training techniques that reduce overfitting and eliminate the need for large-scale pretraining.
Findings
Superior performance on four benchmark datasets
Effective long-range spatio-temporal feature modeling
Can be trained from scratch without large pretraining datasets
Abstract
Remote photoplethysmography (rPPG), which aims at measuring heart activities and physiological signals from facial video without any contact, has great potential in many applications (e.g., remote healthcare and affective computing). Recent deep learning approaches focus on mining subtle rPPG clues using convolutional neural networks with limited spatio-temporal receptive fields, which neglect the long-range spatio-temporal perception and interaction for rPPG modeling. In this paper, we propose two end-to-end video transformer based architectures, namely PhysFormer and PhysFormer++, to adaptively aggregate both local and global spatio-temporal features for rPPG representation enhancement. As key modules in PhysFormer, the temporal difference transformers first enhance the quasi-periodic rPPG features with temporal difference guided global attention, and then refine the local…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNon-Invasive Vital Sign Monitoring · ECG Monitoring and Analysis · EEG and Brain-Computer Interfaces
