Active Learning for Video Classification with Frame Level Queries
Debanjan Goswami, Shayok Chakraborty

TL;DR
This paper introduces a novel active learning framework for video classification that significantly reduces human annotation effort by selecting informative videos and frames for labeling, instead of requiring full video review.
Contribution
It is the first to develop an active learning approach where annotators label only key frames within videos, improving efficiency over traditional methods.
Findings
Reduces manual annotation effort in video classification.
Uses uncertainty and diversity criteria for selecting informative videos.
Employs representative sampling to extract key frames.
Abstract
Deep learning algorithms have pushed the boundaries of computer vision research and have depicted commendable performance in a variety of applications. However, training a robust deep neural network necessitates a large amount of labeled training data, acquiring which involves significant time and human effort. This problem is even more serious for an application like video classification, where a human annotator has to watch an entire video end-to-end to furnish a label. Active learning algorithms automatically identify the most informative samples from large amounts of unlabeled data; this tremendously reduces the human annotation effort in inducing a machine learning model, as only the few samples that are identified by the algorithm, need to be labeled manually. In this paper, we propose a novel active learning framework for video classification, with the goal of further reducing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Advanced Bandit Algorithms Research · Domain Adaptation and Few-Shot Learning
