BoQ: A Place is Worth a Bag of Learnable Queries
Amar Ali-Bey, Brahim Chaib-draa, Philippe Gigu\`ere

TL;DR
BoQ introduces learnable global queries for visual place recognition, improving accuracy and efficiency across multiple benchmarks by capturing universal place attributes with interpretable attention.
Contribution
The paper proposes a novel Bag-of-Queries method using distinct learnable global queries with cross-attention, enhancing place recognition performance and interpretability.
Findings
Outperforms state-of-the-art methods on 14 benchmarks.
Surpasses two-stage retrieval techniques in speed and efficiency.
Provides an interpretable attention mechanism compatible with CNN and Transformer backbones.
Abstract
In visual place recognition, accurately identifying and matching images of locations under varying environmental conditions and viewpoints remains a significant challenge. In this paper, we introduce a new technique, called Bag-of-Queries (BoQ), which learns a set of global queries designed to capture universal place-specific attributes. Unlike existing methods that employ self-attention and generate the queries directly from the input features, BoQ employs distinct learnable global queries, which probe the input features via cross-attention, ensuring consistent information aggregation. In addition, our technique provides an interpretable attention mechanism and integrates with both CNN and Vision Transformer backbones. The performance of BoQ is demonstrated through extensive experiments on 14 large-scale benchmarks. It consistently outperforms current state-of-the-art techniques…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Machine Learning and Data Classification · Advanced Database Systems and Queries
MethodsAttention Is All You Need · Sparse Evolutionary Training · Position-Wise Feed-Forward Layer · Dropout · Softmax · Layer Normalization · Linear Layer · Transformer · Multi-Head Attention · Dense Connections
