Single-Domain Generalized Object Detection by Balancing Domain Diversity and Invariance

Zhenwei He; Hongsu Ni

arXiv:2502.03835·cs.CV·August 27, 2025

Single-Domain Generalized Object Detection by Balancing Domain Diversity and Invariance

Zhenwei He, Hongsu Ni

PDF

Open Access

TL;DR

This paper introduces DIDM, a novel object detection model that balances domain-specific diversity and invariance, improving generalization to unseen domains by explicitly preserving domain-specific features during training.

Contribution

The paper proposes the Diversity Invariant Detection Model (DIDM) with modules that simultaneously promote domain invariance and preserve domain-specific information, addressing limitations of prior invariance-focused methods.

Findings

01

Outperforms existing methods on multiple datasets

02

Effectively balances domain invariance and diversity

03

Enhances feature representation for unseen domains

Abstract

Single-domain generalization for object detection (S-DGOD) seeks to transfer learned representations from a single source domain to unseen target domains. While recent approaches have primarily focused on achieving feature invariance, they ignore that domain diversity also presents significant challenges for the task. First, such invariance-driven strategies often lead to the loss of domain-specific information, resulting in incomplete feature representations. Second, cross-domain feature alignment forces the model to overlook domain-specific discrepancies, thereby increasing the complexity of the training process. To address these limitations, this paper proposes the Diversity Invariant Detection Model (DIDM), which achieves a harmonious integration of domain-specific diversity and domain invariance. Our key idea is to learn the invariant representations by keeping the inherent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Face and Expression Recognition