WarpNet: Weakly Supervised Matching for Single-view Reconstruction

Angjoo Kanazawa; David W. Jacobs; Manmohan Chandraker

arXiv:1604.05592·cs.CV·June 21, 2016·34 cites

WarpNet: Weakly Supervised Matching for Single-view Reconstruction

Angjoo Kanazawa, David W. Jacobs, Manmohan Chandraker

PDF

Open Access

TL;DR

WarpNet introduces a novel deep learning method for matching objects across images in fine-grained datasets without part annotations, enabling weakly supervised single-view reconstruction with improved accuracy.

Contribution

The paper presents WarpNet, a new architecture that aligns objects across images without part annotations, leveraging dataset structure for training in an unsupervised manner.

Findings

01

Achieves 13.6% improvement in AP on CUB-200-2011 dataset.

02

Enables single-view reconstruction comparable to methods using annotated points.

03

Demonstrates effective matching across appearance, viewpoint, and articulation variations.

Abstract

We present an approach to matching images of objects in fine-grained datasets without using part annotations, with an application to the challenging problem of weakly supervised single-view reconstruction. This is in contrast to prior works that require part annotations, since matching objects across class and pose variations is challenging with appearance features alone. We overcome this challenge through a novel deep learning architecture, WarpNet, that aligns an object in one image with a different object in another. We exploit the structure of the fine-grained dataset to create artificial data for training this network in an unsupervised-discriminative learning approach. The output of the network acts as a spatial prior that allows generalization at test time to match real images across variations in appearance, viewpoint and articulation. On the CUB-200-2011 dataset of bird…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Human Pose and Action Recognition · Robotics and Sensor-Based Localization