Continual Detection Transformer for Incremental Object Detection

Yaoyao Liu; Bernt Schiele; Andrea Vedaldi; Christian Rupprecht

arXiv:2304.03110·cs.CV·April 7, 2023·5 cites

Continual Detection Transformer for Incremental Object Detection

Yaoyao Liu, Bernt Schiele, Andrea Vedaldi, Christian Rupprecht

PDF

Open Access

TL;DR

This paper introduces CL-DETR, a novel transformer-based incremental object detection method that effectively employs knowledge distillation and exemplar replay, achieving state-of-the-art results on COCO 2017.

Contribution

The paper proposes a new transformer-based IOD method with specialized KD and ER strategies, addressing challenges in incremental learning for transformers.

Findings

01

CL-DETR outperforms existing methods on COCO 2017.

02

The DKD loss improves knowledge retention in incremental detection.

03

Calibration strategy enhances training and testing consistency.

Abstract

Incremental object detection (IOD) aims to train an object detector in phases, each with annotations for new object categories. As other incremental settings, IOD is subject to catastrophic forgetting, which is often addressed by techniques such as knowledge distillation (KD) and exemplar replay (ER). However, KD and ER do not work well if applied directly to state-of-the-art transformer-based object detectors such as Deformable DETR and UP-DETR. In this paper, we solve these issues by proposing a ContinuaL DEtection TRansformer (CL-DETR), a new method for transformer-based IOD which enables effective usage of KD and ER in this context. First, we introduce a Detector Knowledge Distillation (DKD) loss, focusing on the most informative and reliable predictions from old versions of the model, ignoring redundant background predictions, and ensuring compatibility with the available…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning

MethodsAttention Is All You Need · Linear Layer · Dense Connections · Label Smoothing · Multi-Head Attention · Dropout · Position-Wise Feed-Forward Layer · Adam · Softmax · Absolute Position Encodings