PhySU-Net: Long Temporal Context Transformer for rPPG with   Self-Supervised Pre-training

Marko Savic; Guoying Zhao

arXiv:2402.11913·cs.CV·February 20, 2024·1 cites

PhySU-Net: Long Temporal Context Transformer for rPPG with Self-Supervised Pre-training

Marko Savic, Guoying Zhao

PDF

Open Access

TL;DR

PhySU-Net is a novel transformer-based model for remote photoplethysmography that effectively utilizes long-term temporal context and self-supervised pre-training to enhance performance with limited labeled data.

Contribution

It introduces the first long spatial-temporal map rPPG transformer and a self-supervised pre-training strategy leveraging unlabeled data.

Findings

01

Superior performance on public datasets (OBF and VIPL-HR).

02

Self-supervised pre-training improves model accuracy.

03

Effective long-term temporal modeling in rPPG.

Abstract

Remote photoplethysmography (rPPG) is a promising technology that consists of contactless measuring of cardiac activity from facial videos. Most recent approaches utilize convolutional networks with limited temporal modeling capability or ignore long temporal context. Supervised rPPG methods are also severely limited by scarce data availability. In this work, we propose PhySU-Net, the first long spatial-temporal map rPPG transformer network and a self-supervised pre-training strategy that exploits unlabeled data to improve our model. Our strategy leverages traditional methods and image masking to provide pseudo-labels for self-supervised pre-training. Our model is tested on two public datasets (OBF and VIPL-HR) and shows superior performance in supervised training. Furthermore, we demonstrate that our self-supervised pre-training strategy further improves our model's performance by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Video Analysis and Summarization · Handwritten Text Recognition Techniques