VideoSSL: Semi-Supervised Learning for Video Classification

Longlong Jing; Toufiq Parag; Zhe Wu; Yingli Tian; Hongcheng Wang

arXiv:2003.00197·cs.CV·March 3, 2020·6 cites

VideoSSL: Semi-Supervised Learning for Video Classification

Longlong Jing, Toufiq Parag, Zhe Wu, Yingli Tian, Hongcheng Wang

PDF

Open Access

TL;DR

VideoSSL introduces a semi-supervised learning method for video classification that effectively leverages unlabeled data with minimal labeled examples, achieving high performance on standard datasets.

Contribution

The paper presents a novel semi-supervised approach for video classification that uses pseudo-labels and normalized probabilities from unlabeled data to reduce annotation dependency.

Findings

01

Achieves high accuracy with limited labeled data

02

Effective on UCF101, HMDB51, and Kinetics datasets

03

Reduces annotation costs significantly

Abstract

We propose a semi-supervised learning approach for video classification, VideoSSL, using convolutional neural networks (CNN). Like other computer vision tasks, existing supervised video classification methods demand a large amount of labeled data to attain good performance. However, annotation of a large dataset is expensive and time consuming. To minimize the dependence on a large annotated dataset, our proposed semi-supervised method trains from a small number of labeled examples and exploits two regulatory signals from unlabeled data. The first signal is the pseudo-labels of unlabeled examples computed from the confidences of the CNN being trained. The other is the normalized probabilities, as predicted by an image classifier CNN, that captures the information about appearances of the interesting objects in the video. We show that, under the supervision of these guiding signals from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Anomaly Detection Techniques and Applications