WarpNet: Weakly Supervised Matching for Single-view Reconstruction
Angjoo Kanazawa, David W. Jacobs, Manmohan Chandraker

TL;DR
WarpNet introduces a novel deep learning method for matching objects across images in fine-grained datasets without part annotations, enabling weakly supervised single-view reconstruction with improved accuracy.
Contribution
The paper presents WarpNet, a new architecture that aligns objects across images without part annotations, leveraging dataset structure for training in an unsupervised manner.
Findings
Achieves 13.6% improvement in AP on CUB-200-2011 dataset.
Enables single-view reconstruction comparable to methods using annotated points.
Demonstrates effective matching across appearance, viewpoint, and articulation variations.
Abstract
We present an approach to matching images of objects in fine-grained datasets without using part annotations, with an application to the challenging problem of weakly supervised single-view reconstruction. This is in contrast to prior works that require part annotations, since matching objects across class and pose variations is challenging with appearance features alone. We overcome this challenge through a novel deep learning architecture, WarpNet, that aligns an object in one image with a different object in another. We exploit the structure of the fine-grained dataset to create artificial data for training this network in an unsupervised-discriminative learning approach. The output of the network acts as a spatial prior that allows generalization at test time to match real images across variations in appearance, viewpoint and articulation. On the CUB-200-2011 dataset of bird…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Human Pose and Action Recognition · Robotics and Sensor-Based Localization
