Continual Detection Transformer for Incremental Object Detection
Yaoyao Liu, Bernt Schiele, Andrea Vedaldi, Christian Rupprecht

TL;DR
This paper introduces CL-DETR, a novel transformer-based incremental object detection method that effectively employs knowledge distillation and exemplar replay, achieving state-of-the-art results on COCO 2017.
Contribution
The paper proposes a new transformer-based IOD method with specialized KD and ER strategies, addressing challenges in incremental learning for transformers.
Findings
CL-DETR outperforms existing methods on COCO 2017.
The DKD loss improves knowledge retention in incremental detection.
Calibration strategy enhances training and testing consistency.
Abstract
Incremental object detection (IOD) aims to train an object detector in phases, each with annotations for new object categories. As other incremental settings, IOD is subject to catastrophic forgetting, which is often addressed by techniques such as knowledge distillation (KD) and exemplar replay (ER). However, KD and ER do not work well if applied directly to state-of-the-art transformer-based object detectors such as Deformable DETR and UP-DETR. In this paper, we solve these issues by proposing a ContinuaL DEtection TRansformer (CL-DETR), a new method for transformer-based IOD which enables effective usage of KD and ER in this context. First, we introduce a Detector Knowledge Distillation (DKD) loss, focusing on the most informative and reliable predictions from old versions of the model, ignoring redundant background predictions, and ensuring compatibility with the available…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
MethodsAttention Is All You Need · Linear Layer · Dense Connections · Label Smoothing · Multi-Head Attention · Dropout · Position-Wise Feed-Forward Layer · Adam · Softmax · Absolute Position Encodings
