When Few-Shot Learning Meets Video Object Detection
Zhongjie Yu, Gaoang Wang, Lin Chen, Sebastian Raschka, and Jiebo Luo

TL;DR
This paper introduces the problem of few-shot learning for video object detection, creates a new benchmark dataset, and proposes a simple method called Thaw to improve detection performance with limited labeled video clips.
Contribution
It defines the few-shot video object detection problem, develops a benchmark dataset, and proposes the Thaw method to address overfitting and insufficiency issues in this setting.
Findings
Thaw improves detection accuracy in few-shot scenarios.
Transfer learning effectively adapts models to novel classes.
Benchmark datasets facilitate future research in this area.
Abstract
Different from static images, videos contain additional temporal and spatial information for better object detection. However, it is costly to obtain a large number of videos with bounding box annotations that are required for supervised deep learning. Although humans can easily learn to recognize new objects by watching only a few video clips, deep learning usually suffers from overfitting. This leads to an important question: how to effectively learn a video object detector from only a few labeled video clips? In this paper, we study the new problem of few-shot learning for video object detection. We first define the few-shot setting and create a new benchmark dataset for few-shot video object detection derived from the widely used ImageNet VID dataset. We employ a transfer-learning framework to effectively train the video object detector on a large number of base-class objects and a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications
