U(PM)$^2$:Unsupervised polygon matching with pre-trained models for challenging stereo images
Chang Li, Xingtao Peng

TL;DR
This paper introduces U(PM)$^2$, an unsupervised, low-cost polygon matching method using pre-trained models that effectively handles challenging stereo images with disparity discontinuities and scale variations.
Contribution
It presents a novel pipeline combining pre-trained segmentation and feature extraction with a multi-stage matching strategy for polygon correspondence.
Findings
Achieved state-of-the-art accuracy on ScanNet and SceneFlow datasets.
Demonstrated strong generalization with low computational cost.
Operates without any training requirement.
Abstract
Stereo image matching is a fundamental task in computer vision, photogrammetry and remote sensing, but there is an almost unexplored field, i.e., polygon matching, which faces the following challenges: disparity discontinuity, scale variation, training requirement, and generalization. To address the above-mentioned issues, this paper proposes a novel U(PM): low-cost unsupervised polygon matching with pre-trained models by uniting automatically learned and handcrafted features, of which pipeline is as follows: firstly, the detector leverages the pre-trained segment anything model to obtain masks; then, the vectorizer converts the masks to polygons and graphic structure; secondly, the global matcher addresses challenges from global viewpoint changes and scale variation based on bidirectional-pyramid strategy with pre-trained LoFTR; finally, the local matcher further overcomes local…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization
