TL;DR
This paper introduces rPPG-VQA, a framework for assessing video quality specifically for unsupervised rPPG training, improving model performance by filtering suitable videos using signal and scene analyses.
Contribution
It presents a novel dual-branch assessment architecture combining signal-to-noise ratio estimation and multimodal large language model analysis for video suitability evaluation.
Findings
Filtering videos with rPPG-VQA improves unsupervised rPPG model accuracy.
The framework effectively identifies low-quality videos that degrade model training.
Using the proposed assessment, models trained on curated data outperform those trained on unfiltered data.
Abstract
Unsupervised remote photoplethysmography (rPPG) promises to leverage unlabeled video data, but its potential is hindered by a critical challenge: training on low-quality "in-the-wild" videos severely degrades model performance. An essential step missing here is to assess the suitability of the videos for rPPG model learning before using them for the task. Existing video quality assessment (VQA) methods are mainly designed for human perception and not directly applicable to the above purpose. In this work, we propose rPPG-VQA, a novel framework for assessing video suitability for rPPG. We integrate signal-level and scene-level analyses and design a dual-branch assessment architecture. The signal-level branch evaluates the physiological signal quality of the videos via robust signal-to-noise ratio (SNR) estimation with a multi-method consensus mechanism, and the scene-level branch uses a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
