Learning from Weakly-labeled Web Videos via Exploring Sub-Concepts
Kunpeng Li, Zizhao Zhang, Guanhang Wu, Xuehan Xiong, Chen-Yu Lee,, Zhichao Lu, Yun Fu, Tomas Pfister

TL;DR
This paper introduces a novel approach called Sub-Pseudo Label (SPL) for pre-training video action recognition models using weakly-labeled web videos, converting noisy labels into useful supervision signals to improve representation learning.
Contribution
The paper proposes SPL, a new label space that extrapolates weak labels and incorporates prior knowledge, enhancing supervision without extra training cost.
Findings
SPL outperforms existing pseudo-label pre-training strategies.
Models pre-trained with SPL achieve competitive results on HMDB-51 and UCF-101.
The method generalizes well to weakly-labeled image datasets.
Abstract
Learning visual knowledge from massive weakly-labeled web videos has attracted growing research interests thanks to the large corpus of easily accessible video data on the Internet. However, for video action recognition, the action of interest might only exist in arbitrary clips of untrimmed web videos, resulting in high label noises in the temporal space. To address this issue, we introduce a new method for pre-training video action recognition models using queried web videos. Instead of trying to filter out, we propose to convert the potential noises in these queried videos to useful supervision signals by defining the concept of Sub-Pseudo Label (SPL). Specifically, SPL spans out a new set of meaningful "middle ground" label space constructed by extrapolating the original weak labels during video querying and the prior knowledge distilled from a teacher model. Consequently, SPL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsHuman Pose and Action Recognition · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
