HypeVPR: Exploring Hyperbolic Space for Perspective to Equirectangular Visual Place Recognition

Suhan Woo; Seongwon Lee; Jinwoo Jang; Euntai Kim

arXiv:2506.04764·cs.CV·March 6, 2026

HypeVPR: Exploring Hyperbolic Space for Perspective to Equirectangular Visual Place Recognition

Suhan Woo, Seongwon Lee, Jinwoo Jang, Euntai Kim

PDF

Open Access

TL;DR

HypeVPR introduces a hyperbolic space-based hierarchical embedding framework for panoramic visual place recognition, effectively capturing hierarchical scene information to improve accuracy, efficiency, and robustness in matching equirectangular images.

Contribution

It proposes a novel hyperbolic embedding method with hierarchical feature aggregation for P2E visual place recognition, enabling flexible accuracy-efficiency trade-offs without extra training.

Findings

01

Achieves competitive recognition performance.

02

Reduces database storage requirements.

03

Speeds up retrieval process.

Abstract

Visual environments are inherently hierarchical, as a panoramic view naturally encompasses and organizes multiple perspective views within its field. Capturing this hierarchy is crucial for effective perspective-to-equirectangular (P2E) visual place recognition. In this work, we introduce HypeVPR, a hierarchical embedding framework in hyperbolic space specifically designed to address the challenges of P2E matching. HypeVPR leverages the intrinsic ability of hyperbolic space to represent hierarchical structures, allowing panoramic descriptors to encode both broad contextual information and fine-grained local details. To this end, we propose a hierarchical feature aggregation mechanism that organizes local-to-global feature representations within hyperbolic space. Furthermore, HypeVPR's hierarchical organization naturally enables flexible control over the accuracy-efficiency trade-off…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization · Multimodal Machine Learning Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings