Looking Beyond Corners: Contrastive Learning of Visual Representations for Keypoint Detection and Description Extraction
Henrique Siqueira, Patrick Ruhkamp, Ibrahim Halfaoui, Markus Karmann,, Onay Urfalioglu

TL;DR
This paper introduces CorrNet, an unsupervised contrastive learning framework that detects both low-level and high-level keypoints, improving robustness and discriminability for visual representations under challenging conditions.
Contribution
The paper presents CorrNet, a novel unsupervised contrastive learning method that detects diverse keypoints and learns discriminative descriptions, surpassing classical methods and previous deep learning approaches.
Findings
CorrNet detects both low-level and high-level features.
Achieves state-of-the-art results under illumination changes.
Shows competitive performance under viewpoint variations.
Abstract
Learnable keypoint detectors and descriptors are beginning to outperform classical hand-crafted feature extraction methods. Recent studies on self-supervised learning of visual representations have driven the increasing performance of learnable models based on deep networks. By leveraging traditional data augmentations and homography transformations, these networks learn to detect corners under adverse conditions such as extreme illumination changes. However, their generalization capabilities are limited to corner-like features detected a priori by classical methods or synthetically generated data. In this paper, we propose the Correspondence Network (CorrNet) that learns to detect repeatable keypoints and to extract discriminative descriptions via unsupervised contrastive learning under spatial constraints. Our experiments show that CorrNet is not only able to detect low-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Human Pose and Action Recognition
MethodsContrastive Learning
