TL;DR
This paper introduces 3D-LENS, a novel framework for single-view aerial-ground re-identification that synthesizes consistent novel views using 3D mesh reconstruction, improving cross-view retrieval without needing paired annotations.
Contribution
The paper presents the first formalization of single-view AG-ReID and a unified 3D-LENS framework that combines geometry-based view synthesis with robust feature learning.
Findings
Achieves state-of-the-art results on SV AG-ReID scenarios.
Effectively synthesizes view-consistent images across diverse categories.
Mitigates synthetic-to-real bias in novel view synthesis.
Abstract
Aerial-Ground Re-Identification (AG-ReID) is constrained by the viewpoint-domain gap, as drastic viewpoint disparities occlude or distort discriminative features, making cross-viewpoint image retrieval challenging. While existing methods rely on paired cross-view annotations, real-world deployments, such as wilderness search-and-rescue (SAR), often lack target-domain data, requiring retrieval from ground-level references alone. To our knowledge, we are the first to address this challenge by formalizing the Single-View AG-ReID (SV AG-ReID) setting, where models trained on a single real viewpoint must generalize to an unseen viewpoint. We propose 3D Lifting-based Elevated Novel-view Synthesis (3D-LENS), a unified framework combining geometrically-consistent novel view synthesis that leverages large-scale 3D mesh reconstruction, with a robust representation learning scheme to mitigate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
