Learning Image Matching by Simply Watching Video

Gucan Long; Laurent Kneip; Jose M. Alvarez; Hongdong Li

arXiv:1603.06041·cs.CV·March 30, 2016·1 cites

Learning Image Matching by Simply Watching Video

Gucan Long, Laurent Kneip, Jose M. Alvarez, Hongdong Li

PDF

Open Access

TL;DR

This paper introduces an unsupervised method for image matching by training a CNN for frame-interpolation on videos and then deriving correspondences through inversion, achieving performance comparable to traditional methods.

Contribution

It presents a novel unsupervised approach to image matching that leverages video data and analysis-by-synthesis, avoiding manual annotations.

Findings

01

Achieves competitive accuracy with traditional methods

02

Uses only video data for training, no annotations needed

03

Demonstrates effectiveness on standard benchmarks

Abstract

This work presents an unsupervised learning based approach to the ubiquitous computer vision problem of image matching. We start from the insight that the problem of frame-interpolation implicitly solves for inter-frame correspondences. This permits the application of analysis-by-synthesis: we firstly train and apply a Convolutional Neural Network for frame-interpolation, then obtain correspondences by inverting the learned CNN. The key benefit behind this strategy is that the CNN for frame-interpolation can be trained in an unsupervised manner by exploiting the temporal coherency that is naturally contained in real-world video sequences. The present model therefore learns image matching by simply watching videos. Besides a promise to be more generally applicable, the presented approach achieves surprising performance comparable to traditional empirically designed methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Image Enhancement Techniques