Foundation Model Priors Enhance Object Focus in Feature Space for Source-Free Object Detection

Sairam VCR; Rishabh Lalla; Aveen Dayal; Tejal Kulkarni; Anuj Lalla; Vineeth N Balasubramanian; Muhammad Haris Khan

arXiv:2512.17514·cs.CV·February 24, 2026

Foundation Model Priors Enhance Object Focus in Feature Space for Source-Free Object Detection

Sairam VCR, Rishabh Lalla, Aveen Dayal, Tejal Kulkarni, Anuj Lalla, Vineeth N Balasubramanian, Muhammad Haris Khan

PDF

Open Access

TL;DR

This paper introduces FALCON-SFOD, a novel framework that enhances object focus in feature space for source-free object detection under domain shift by leveraging foundation model priors and noise-robust pseudo-labeling.

Contribution

It proposes a new method combining foundation model-based regularization and imbalance-aware pseudo-labeling to improve object detection without source data.

Findings

01

Achieves competitive results on SFOD benchmarks.

02

Effectively improves object-focused feature representations.

03

Enhances pseudo-label quality under domain shift.

Abstract

Current state-of-the-art approaches in Source-Free Object Detection (SFOD) typically rely on Mean-Teacher self-labeling. However, domain shift often reduces the detector's ability to maintain strong object-focused representations, causing high-confidence activations over background clutter. This weak object focus results in unreliable pseudo-labels from the detection head. While prior works mainly refine these pseudo-labels, they overlook the underlying need to strengthen the feature space itself. We propose FALCON-SFOD (Foundation-Aligned Learning with Clutter suppression and Noise robustness), a framework designed to enhance object-focused adaptation under domain shift. It consists of two complementary components. SPAR (Spatial Prior-Aware Regularization) leverages the generalization strength of vision foundation models to regularize the detector's feature space. Using class-agnostic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Visual Attention and Saliency Detection