Day2Dark: Pseudo-Supervised Activity Recognition beyond Silent Daylight

Yunhua Zhang; Hazel Doughty; Cees G. M. Snoek

arXiv:2212.02053 · cs.CV · August 29, 2023 · 1 citation

TL;DR

This paper introduces a pseudo-supervised learning approach combined with adaptive audio-visual fusion to improve activity recognition in dark environments, addressing data scarcity and illumination challenges.

Contribution

It proposes a novel darkness-adaptive audio-visual recognizer and a pseudo-supervised learning scheme to enhance activity recognition in low-light conditions.

Findings

01 Outperforms image enhancement and domain adaptation methods

02 Improves robustness to occlusions and darkness

03 Effective across multiple datasets

Abstract

This paper strives to recognize activities in the dark, as well as in the day. We first establish that state-of-the-art activity recognizers are effective during the day, but not trustworthy in the dark. The main causes are the limited availability of labeled dark videos to learn from, as well as the distribution shift towards lower color contrast at test time. To compensate for the lack of labeled dark videos, we introduce a pseudo-supervised learning scheme, which utilizes easy-to-obtain unlabeled and task-irrelevant dark videos to improve an activity recognizer in low light. As the lower color contrast results in visual information loss, we further propose to incorporate the complementary activity information within audio, which is invariant to illumination. Since the usefulness of audio and visual features differs depending on the amount of illumination, we introduce our…
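The abstract is cut off before it describes the fusion mechanism, so the following is only an illustrative sketch of the general idea that audio should count for more as illumination drops. It is not the paper's darkness-adaptive recognizer. The function names, the mean-luminance brightness estimate, the sigmoid gate, and the `midpoint` and `sharpness` parameters are all assumptions made for this example.

```python
import numpy as np


def estimate_brightness(frames: np.ndarray) -> float:
    """Mean luminance in [0, 1] for RGB frames shaped (T, H, W, 3) with values in [0, 255].

    Assumption: average luma is used as a crude stand-in for "amount of illumination".
    """
    luma_weights = np.array([0.299, 0.587, 0.114])  # Rec. 601 luma coefficients
    luma = frames.astype(np.float64) @ luma_weights  # shape (T, H, W)
    return float(luma.mean() / 255.0)


def fuse_scores(
    visual_logits: np.ndarray,
    audio_logits: np.ndarray,
    brightness: float,
    midpoint: float = 0.25,   # hypothetical brightness at which both modalities weigh equally
    sharpness: float = 12.0,  # hypothetical steepness of the gate
) -> np.ndarray:
    """Blend per-class scores, trusting vision more in bright clips and audio more in dark ones."""
    visual_weight = 1.0 / (1.0 + np.exp(-sharpness * (brightness - midpoint)))
    return visual_weight * visual_logits + (1.0 - visual_weight) * audio_logits


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    dark_clip = rng.integers(0, 20, size=(8, 32, 32, 3), dtype=np.uint8)
    visual = np.array([2.0, 0.0])  # the visual branch prefers class 0
    audio = np.array([0.0, 2.0])   # the audio branch prefers class 1
    fused = fuse_scores(visual, audio, estimate_brightness(dark_clip))
    print("predicted class for dark clip:", int(np.argmax(fused)))
```

A fixed gate like this only conveys the intuition. A learned model would typically predict the weighting from the features themselves rather than from a hand-set threshold.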

Taxonomy

Topics: Video Surveillance and Tracking Methods · Image Enhancement Techniques · Music and Audio Processing