Universal Correspondence Network
Christopher B. Choy, JunYoung Gwak, Silvio Savarese, Manmohan, Chandraker

TL;DR
This paper introduces a deep learning framework that learns a feature space for accurate geometric and semantic visual correspondences, outperforming prior methods in speed and accuracy across various datasets.
Contribution
It proposes a fully convolutional architecture with a novel correspondence contrastive loss and a spatial transformer, enabling faster training, improved accuracy, and applicability to diverse matching tasks.
Findings
Outperforms prior methods on KITTI, PASCAL, and CUB-2011 datasets.
Achieves faster training and testing with $O(n)$ complexity.
Enhances semantic correspondence accuracy using a convolutional spatial transformer.
Abstract
We present a deep learning framework for accurate visual correspondences and demonstrate its effectiveness for both geometric and semantic matching, spanning across rigid motions to intra-class shape or appearance variations. In contrast to previous CNN-based approaches that optimize a surrogate patch similarity objective, we use deep metric learning to directly learn a feature space that preserves either geometric or semantic similarity. Our fully convolutional architecture, along with a novel correspondence contrastive loss allows faster training by effective reuse of computations, accurate gradient computation through the use of thousands of examples per image pair and faster testing with feed forward passes for keypoints, instead of for typical patch similarity methods. We propose a convolutional spatial transformer to mimic patch normalization in traditional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Human Pose and Action Recognition · Face recognition and analysis
