ViewSynth: Learning Local Features from Depth using View Synthesis

Jisan Mahmud; Rajat Vikram Singh; Peri Akiva; Spondon Kundu,; Kuan-Chuan Peng; Jan-Michael Frahm

arXiv:1911.10248·cs.CV·September 3, 2020

ViewSynth: Learning Local Features from Depth using View Synthesis

Jisan Mahmud, Rajat Vikram Singh, Peri Akiva, Spondon Kundu,, Kuan-Chuan Peng, Jan-Michael Frahm

PDF

Open Access 1 Repo

TL;DR

ViewSynth introduces a novel framework for learning depth image features by jointly training view synthesis and keypoint descriptors, significantly improving 3D matching and localization performance.

Contribution

It proposes a new method that explicitly reasons beyond visible pixels by synthesizing views, enhancing depth feature learning for keypoint detection and matching.

Findings

01

Outperforms state-of-the-art in 3D keypoint matching and camera localization.

02

Demonstrates strong generalization across different RGB-D datasets.

03

Effectively encodes occluded scene information through view synthesis.

Abstract

The rapid development of inexpensive commodity depth sensors has made keypoint detection and matching in the depth image modality an important problem in computer vision. Despite great improvements in recent RGB local feature learning methods, adapting them directly in the depth modality leads to unsatisfactory performance. Most of these methods do not explicitly reason beyond the visible pixels in the images. To address the limitations of these methods, we propose a framework ViewSynth, to jointly learn: (1) viewpoint invariant keypoint-descriptor from depth images using a proposed Contrastive Matching Loss, and (2) view synthesis of depth images from different viewpoints using the proposed View Synthesis Module and View Synthesis Loss. By learning view synthesis, we explicitly encourage the feature extractor to encode information about not only the visible, but also the occluded parts…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

periakiva/periakiva
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Advanced Image and Video Retrieval Techniques