UnsuperPoint: End-to-end Unsupervised Interest Point Detector and Descriptor
Peter Hviid Christiansen, Mikkel Fly Kragh, Yury Brodskiy, Henrik, Karstoft

TL;DR
UnsuperPoint is an unsupervised, real-time interest point detector and descriptor trained with a novel self-supervised approach, achieving high performance without requiring ground truth annotations or multiple training rounds.
Contribution
It introduces a fully end-to-end trainable unsupervised interest point detector and descriptor using a novel loss function and regression approach, eliminating the need for pseudo ground truth.
Findings
Runs at 323 fps at 224x320 resolution
Achieves comparable or better performance than state-of-the-art methods
Effective in repeatability, localization, matching, and homography estimation
Abstract
It is hard to create consistent ground truth data for interest points in natural images, since interest points are hard to define clearly and consistently for a human annotator. This makes interest point detectors non-trivial to build. In this work, we introduce an unsupervised deep learning-based interest point detector and descriptor. Using a self-supervised approach, we utilize a siamese network and a novel loss function that enables interest point scores and positions to be learned automatically. The resulting interest point detector and descriptor is UnsuperPoint. We use regression of point positions to 1) make UnsuperPoint end-to-end trainable and 2) to incorporate non-maximum suppression in the model. Unlike most trainable detectors, it requires no generation of pseudo ground truth points, no structure-from-motion-generated representations and the model is learned from only one…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization · Multimodal Machine Learning Applications
MethodsSiamese Network
