Learning Where to Embed: Noise-Aware Positional Embedding for Query Retrieval in Small-Object Detection

Yangchen Zeng; Zhenyu Yu; Dongming Jiang; Wenbo Zhang; Yifan Hong; Zhanhua Hu; Jiao Luo; Kangning Cui

arXiv:2604.15065 · cs.CV · April 17, 2026

TL;DR

This paper introduces HELP, a noise-aware positional embedding framework for small-object detection that improves efficiency and accuracy by selectively embedding positional information and filtering background noise.

Contribution

The paper proposes a novel heatmap-guided embedding method that enhances query retrieval and reduces model complexity in small-object detection tasks.

Findings

01. Achieves a 59.4% parameter reduction while maintaining accuracy.

02. Reduces decoder layers from eight to three.

03. Improves small-object detection performance across benchmarks.

Abstract

Transformer-based detectors have advanced small-object detection, but they often remain inefficient and vulnerable to background-induced query noise, which motivates deep decoders to refine low-quality queries. We present HELP (Heatmap-guided Embedding Learning Paradigm), a noise-aware positional-semantic fusion framework that studies where to embed positional information by selectively preserving positional encodings in foreground-salient regions while suppressing background clutter. Within HELP, we introduce Heatmap-guided Positional Embedding (HPE) as the core embedding mechanism and visualize it with a heatbar for interpretable diagnosis and fine-tuning. HPE is integrated into both the encoder and decoder: it guides noise-suppressed feature encoding by injecting heatmap-aware positional encoding, and it enables high-quality query retrieval by filtering background-dominant embeddings…
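
The abstract's central idea, keeping positional encodings where a heatmap marks foreground and suppressing them elsewhere, can be illustrated with a small sketch. This is not the authors' implementation: the function names (`sinusoidal_encoding`, `heatmap_gated_embedding`, `retrieve_queries`), the hard threshold of 0.5, the plain sinusoidal encoding, and the top-k selection rule are all assumptions chosen for illustration. The actual HPE may well use a learned heatmap and soft weighting instead.

```python
import math


def sinusoidal_encoding(pos, dim):
    """Standard sinusoidal encoding of one integer coordinate (assumed form)."""
    enc = []
    for i in range(0, dim, 2):
        freq = 1.0 / (10000 ** (i / dim))
        enc.append(math.sin(pos * freq))
        enc.append(math.cos(pos * freq))
    return enc[:dim]


def heatmap_gated_embedding(heatmap, dim=4, threshold=0.5):
    """Keep the positional encoding only where the heatmap marks foreground.

    heatmap: 2-D list of saliency scores in [0, 1] (hypothetical input).
    Positions scoring below `threshold` receive an all-zero embedding,
    which is the hard-gating assumption of this sketch.
    """
    half = dim // 2
    gated = []
    for y, row in enumerate(heatmap):
        out_row = []
        for x, score in enumerate(row):
            if score >= threshold:
                # Concatenate the row encoding and the column encoding.
                out_row.append(
                    sinusoidal_encoding(y, half) + sinusoidal_encoding(x, half)
                )
            else:
                out_row.append([0.0] * dim)
        gated.append(out_row)
    return gated


def retrieve_queries(heatmap, k, threshold=0.5):
    """Return up to k (y, x) locations with the highest foreground scores.

    Background-dominant locations (score < threshold) are never returned,
    so fewer than k queries may come back.
    """
    candidates = [
        (score, y, x)
        for y, row in enumerate(heatmap)
        for x, score in enumerate(row)
        if score >= threshold
    ]
    candidates.sort(key=lambda item: -item[0])
    return [(y, x) for _, y, x in candidates[:k]]


if __name__ == "__main__":
    toy_heatmap = [[0.9, 0.1], [0.2, 0.7]]
    embedding = heatmap_gated_embedding(toy_heatmap)
    print(embedding[0][1])  # background cell: zeroed encoding
    print(retrieve_queries(toy_heatmap, k=3))
```

In this toy setup, only the two cells scoring above the threshold keep a positional encoding and become query candidates, so a request for three queries yields two. Filtering candidates before the decoder is one plausible reading of how fewer low-quality queries could reduce the need for deep refinement, though the paper's actual mechanism may differ.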

Code & Models

Repositories

yidimopozhibai/Noise-Suppressed-Query-Retrieval (GitHub)
