PM-DETR: Domain Adaptive Prompt Memory for Object Detection with Transformers
Peidong Jia, Jiaming Liu, Senqiao Yang, Jiarui Wu, Xiaodong Xie,, Shanghang Zhang

TL;DR
This paper introduces PM-DETR, a novel domain adaptive method for object detection with transformers that uses prompt memory to better handle distribution shifts across different domains, outperforming existing techniques.
Contribution
The paper proposes a hierarchical Prompt Domain Memory and Prompt Memory Alignment to improve domain adaptation in detection transformers, leveraging domain-specific knowledge and long-term memory.
Findings
Outperforms state-of-the-art methods on three benchmarks
Effectively reduces domain discrepancy with prompt memory techniques
Enhances detection performance across scene, synthetic, and weather variations
Abstract
The Transformer-based detectors (i.e., DETR) have demonstrated impressive performance on end-to-end object detection. However, transferring DETR to different data distributions may lead to a significant performance degradation. Existing adaptation techniques focus on model-based approaches, which aim to leverage feature alignment to narrow the distribution shift between different domains. In this study, we propose a hierarchical Prompt Domain Memory (PDM) for adapting detection transformers to different distributions. PDM comprehensively leverages the prompt memory to extract domain-specific knowledge and explicitly constructs a long-term memory space for the data distribution, which represents better domain diversity compared to existing methods. Specifically, each prompt and its corresponding distribution value are paired in the memory space, and we inject top M distribution-similar…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Remote-Sensing Image Classification · Advanced Neural Network Applications
MethodsMulti-Head Attention · Attention Is All You Need · Layer Normalization · Absolute Position Encodings · Byte Pair Encoding · Convolution · Linear Layer · Label Smoothing · Feedforward Network · Adam
