Label-Efficient Online Continual Object Detection in Streaming Video

Jay Zhangjie Wu; David Junhao Zhang; Wynne Hsu; Mengmi Zhang; Mike; Zheng Shou

arXiv:2206.00309·cs.CV·August 24, 2023·1 cites

Label-Efficient Online Continual Object Detection in Streaming Video

Jay Zhangjie Wu, David Junhao Zhang, Wynne Hsu, Mengmi Zhang, Mike, Zheng Shou

PDF

Open Access 1 Repo

TL;DR

This paper introduces LEOCOD, a new approach for online continual object detection in streaming videos that reduces annotation costs while maintaining high performance, outperforming fully supervised methods with significantly fewer labels.

Contribution

The authors propose Efficient-CLS, a plug-and-play module that enhances existing continual learners for video object detection with minimal supervision, reducing annotation effort and retraining time.

Findings

01

Achieves significant performance improvements with minimal forgetting across supervision levels.

02

With only 25% annotated frames, outperforms fully supervised models trained on all frames.

03

Demonstrates effectiveness on challenging real-world video benchmarks.

Abstract

Humans can watch a continuous video stream and effortlessly perform continual acquisition and transfer of new knowledge with minimal supervision yet retaining previously learnt experiences. In contrast, existing continual learning (CL) methods require fully annotated labels to effectively learn from individual frames in a video stream. Here, we examine a more realistic and challenging problem $\unicode x 2014$ Label-Efficient Online Continual Object Detection (LEOCOD) in streaming video. We propose a plug-and-play module, Efficient-CLS, that can be easily inserted into and improve existing continual learners for object detection in video streams with reduced data annotation costs and model retraining time. We show that our method has achieved significant improvement with minimal forgetting across all supervision levels on two challenging CL benchmarks for streaming real-world videos.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

showlab/efficient-cls
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Image Enhancement Techniques · Multimodal Machine Learning Applications