Dynamic Object Queries for Transformer-based Incremental Object Detection
Jichuan Zhang, Wei Li, Shuang Cheng, Ya-Li Li, Shengjin Wang

TL;DR
This paper introduces DyQ-DETR, a Transformer-based incremental object detection method using dynamic object queries to balance stability and plasticity, effectively reducing forgetting and improving performance over prior approaches.
Contribution
The paper proposes a novel incremental detection framework with dynamic object queries and isolated bipartite matching, enhancing knowledge integration and reducing inter-class confusion.
Findings
Outperforms state-of-the-art methods on incremental detection benchmarks.
Effectively balances stability and plasticity in incremental learning.
Reduces inter-class confusion through disentangled self-attention.
Abstract
Incremental object detection (IOD) aims to sequentially learn new classes, while maintaining the capability to locate and identify old ones. As the training data arrives with annotations only with new classes, IOD suffers from catastrophic forgetting. Prior methodologies mainly tackle the forgetting issue through knowledge distillation and exemplar replay, ignoring the conflict between limited model capacity and increasing knowledge. In this paper, we explore \textit{dynamic object queries} for incremental object detection built on Transformer architecture. We propose the \textbf{Dy}namic object \textbf{Q}uery-based \textbf{DE}tection \textbf{TR}ansformer (DyQ-DETR), which incrementally expands the model representation ability to achieve stability-plasticity tradeoff. First, a new set of learnable object queries are fed into the decoder to represent new classes. These new object queries…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques
MethodsAttention Is All You Need · Sparse Evolutionary Training · Label Smoothing · Adam · Linear Layer · Byte Pair Encoding · Layer Normalization · Softmax · Position-Wise Feed-Forward Layer · Dense Connections
