RILOD: Near Real-Time Incremental Learning for Object Detection at the Edge
Dawei Li, Serafettin Tasci, Shalini Ghosh, Jingwen Zhu, Junting Zhang,, Larry Heck

TL;DR
RILOD is a practical system enabling near real-time incremental learning for object detection on edge devices, allowing models to quickly adapt to new classes without forgetting old ones.
Contribution
The paper introduces a novel incremental learning algorithm for one-stage object detection models and a real-time dataset construction pipeline for edge applications.
Findings
Detects new object classes in minutes, including data collection and training.
Outperforms traditional fine-tuning in speed and reduces manual labeling.
Maintains detection accuracy for old classes while learning new ones.
Abstract
Object detection models shipped with camera-equipped edge devices cannot cover the objects of interest for every user. Therefore, the incremental learning capability is a critical feature for a robust and personalized object detection system that many applications would rely on. In this paper, we present an efficient yet practical system, RILOD, to incrementally train an existing object detection model such that it can detect new object classes without losing its capability to detect old classes. The key component of RILOD is a novel incremental learning algorithm that trains end-to-end for one-stage deep object detection models only using training data of new object classes. Specifically to avoid catastrophic forgetting, the algorithm distills three types of knowledge from the old model to mimic the old model's behavior on object classification, bounding box regression and feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
