SupScene: Scene-Structured Overlap Supervision for Image Retrieval in Unconstrained SfM
Xulei Shi, Maoyu Wang, Yuning Peng, Guanbo Wang, Xin Wang, Yifan Liao, Qi Chen, Pengjie Tao

TL;DR
SupScene introduces a scene-structured training framework for image retrieval in unconstrained SfM, leveraging overlap graphs and geometric supervision to improve matching accuracy and reconstruction completeness.
Contribution
The paper proposes a novel scene-structured training method with an overlap-ordered objective and a lightweight descriptor head, enhancing geometric matchability in SfM image retrieval.
Findings
Significantly improves retrieval performance on multiple benchmarks.
Enhances the completeness of SfM reconstructions.
Outperforms previous methods in geometric matchability.
Abstract
Image retrieval is a critical step for reducing the quadratic cost of image matching in unconstrained Structure-from-Motion (SfM). Unlike generic image retrieval, however, the relevant goal of SfM is to identify geometrically matchable image pairs rather than merely semantically similar images. Prevailing methods are largely trained under anchor-centric tuple guidance, which organizes the training around isolated tuples and under-utilizes the dense, graded overlap structure naturally established within a SfM scene. In this work, we present SupScene, a scene-structured training framework that samples connected local subgraphs from SfM overlap graphs and jointly supervises all valid within-subgraph pairwise relations. To explicitly align the trained descriptor with geometric co-visibility, we further introduce an overlap-ordered objective that combines multi-similarity optimization with a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Generative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications
