Three for one and one for three: Flow, Segmentation, and Surface Normals

Hoang-An Le; Anil S. Baslamisli; Thomas Mensink; Theo Gevers

arXiv:1807.07473 · cs.CV · July 20, 2018 · 5 cites

TL;DR

This paper investigates how optical flow, semantic segmentation, and surface normals interact and improve scene understanding when combined, using a modular network and a synthetic dataset.

Contribution

It introduces a modular convolutional network trained on a synthetic dataset to study the influence and synergy of three scene understanding modalities.

Findings

01 Positive influence among modalities on object boundaries
02 Enhanced region consistency and scene structure understanding
03 Modular approach improves joint feature learning

Abstract

Optical flow, semantic segmentation, and surface normals represent different information modalities, yet together they provide better cues for scene understanding problems. In this paper, we study the influence among the three modalities: how each affects the others and how efficient they are in combination. We employ a modular approach using a convolutional refinement network that is trained with supervision but isolated from RGB images to enforce joint modality features. To assist the training process, we create a large-scale synthetic outdoor dataset that supports dense annotation of semantic segmentation, optical flow, and surface normals. The experimental results show positive influence among the three modalities, especially for object boundaries, region consistency, and scene structures.

Code & Models

Repositories

lhoangan/341n143 (Caffe2, official)

Taxonomy

Topics: Advanced Vision and Imaging · Image Enhancement Techniques · Remote Sensing and LiDAR Applications