VideoSSL: Semi-Supervised Learning for Video Classification
Longlong Jing, Toufiq Parag, Zhe Wu, Yingli Tian, Hongcheng Wang

TL;DR
VideoSSL introduces a semi-supervised learning method for video classification that effectively leverages unlabeled data with minimal labeled examples, achieving high performance on standard datasets.
Contribution
The paper presents a novel semi-supervised approach for video classification that uses pseudo-labels and normalized probabilities from unlabeled data to reduce annotation dependency.
Findings
Achieves high accuracy with limited labeled data
Effective on UCF101, HMDB51, and Kinetics datasets
Reduces annotation costs significantly
Abstract
We propose a semi-supervised learning approach for video classification, VideoSSL, using convolutional neural networks (CNN). Like other computer vision tasks, existing supervised video classification methods demand a large amount of labeled data to attain good performance. However, annotation of a large dataset is expensive and time consuming. To minimize the dependence on a large annotated dataset, our proposed semi-supervised method trains from a small number of labeled examples and exploits two regulatory signals from unlabeled data. The first signal is the pseudo-labels of unlabeled examples computed from the confidences of the CNN being trained. The other is the normalized probabilities, as predicted by an image classifier CNN, that captures the information about appearances of the interesting objects in the video. We show that, under the supervision of these guiding signals from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Anomaly Detection Techniques and Applications
