Motion-based Object Segmentation based on Dense RGB-D Scene Flow

Lin Shao; Parth Shah; Vikranth Dwaracherla; Jeannette Bohg

arXiv:1804.05195 · cs.RO · July 25, 2018

TL;DR

This paper introduces a deep neural network model that jointly estimates scene segmentation, object motion trajectories, and dense 3D scene flow from RGB-D images, specifically designed for robotic manipulation scenarios.

Contribution

It presents a novel hourglass neural network architecture that jointly predicts object segmentation and motion, trained on a new large-scale synthetic dataset for robotic manipulation.

Findings

1. Outperforms state-of-the-art methods on synthetic data
2. Generates more accurate object segmentation and motion trajectories
3. Transfers well to real-world scenes with improved results

Abstract

Given two consecutive RGB-D images, we propose a model that estimates a dense 3D motion field, also known as scene flow. We take advantage of the fact that in robot manipulation scenarios, scenes often consist of a set of rigidly moving objects. Our model jointly estimates (i) the segmentation of the scene into an unknown but finite number of objects, (ii) the motion trajectories of these objects and (iii) the object scene flow. We employ an hourglass, deep neural network architecture. In the encoding stage, the RGB and depth images undergo spatial compression and correlation. In the decoding stage, the model outputs three images containing a per-pixel estimate of the corresponding object center as well as object translation and rotation. This forms the basis for inferring the object segmentation and final object scene flow. To evaluate our model, we generated a new and challenging,…
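The decoding step described in the abstract (per-pixel object center, translation, and rotation, from which segmentation and object scene flow are inferred) can be illustrated with a small NumPy sketch. This is not the authors' implementation. It assumes, purely for illustration, that rotation is given as an axis-angle vector applied about the predicted object center, that pixels are grouped by a greedy distance threshold on their predicted centers, and that per-object motion is obtained by averaging the per-pixel predictions (a crude approximation for rotations). The function names `axis_angle_to_matrix`, `group_by_center`, and `rigid_scene_flow`, and the `radius` parameter, are hypothetical.

```python
import numpy as np


def axis_angle_to_matrix(rotvec):
    """Rodrigues' formula: axis-angle vector (3,) -> rotation matrix (3, 3)."""
    theta = np.linalg.norm(rotvec)
    if theta < 1e-12:
        return np.eye(3)
    k = rotvec / theta
    K = np.array([[0.0, -k[2], k[1]],
                  [k[2], 0.0, -k[0]],
                  [-k[1], k[0], 0.0]])
    return np.eye(3) + np.sin(theta) * K + (1.0 - np.cos(theta)) * (K @ K)


def group_by_center(pred_centers, radius):
    """Assumed grouping rule: pixels whose predicted object centers lie
    within `radius` of a seed pixel's predicted center share one label."""
    n = pred_centers.shape[0]
    labels = -np.ones(n, dtype=int)
    next_label = 0
    for i in range(n):
        if labels[i] != -1:
            continue  # already assigned to an earlier object
        dist = np.linalg.norm(pred_centers - pred_centers[i], axis=1)
        labels[(dist < radius) & (labels == -1)] = next_label
        next_label += 1
    return labels


def rigid_scene_flow(points, labels, centers, rotvecs, translations):
    """Per-point 3D flow, assuming each object rotates about its center
    and then translates. Inputs are (N, 3) arrays; labels is (N,)."""
    flow = np.zeros_like(points, dtype=float)
    for obj in np.unique(labels):
        mask = labels == obj
        # Average per-pixel predictions into one rigid motion per object.
        c = centers[mask].mean(axis=0)
        R = axis_angle_to_matrix(rotvecs[mask].mean(axis=0))
        t = translations[mask].mean(axis=0)
        moved = (points[mask] - c) @ R.T + c + t
        flow[mask] = moved - points[mask]
    return flow


# Toy example: object 0 translates, object 1 rotates 90 degrees about z.
points = np.array([[0.0, 0.0, 1.0], [1.0, 0.0, 1.0],
                   [5.0, 0.0, 2.0], [5.0, 1.0, 2.0]])
centers = np.array([[0.5, 0.0, 1.0], [0.5, 0.0, 1.0],
                    [5.0, 0.5, 2.0], [5.0, 0.5, 2.0]])
rotvecs = np.array([[0.0, 0.0, 0.0], [0.0, 0.0, 0.0],
                    [0.0, 0.0, np.pi / 2], [0.0, 0.0, np.pi / 2]])
translations = np.array([[0.1, 0.0, 0.0], [0.1, 0.0, 0.0],
                         [0.0, 0.0, 0.0], [0.0, 0.0, 0.0]])

labels = group_by_center(centers, radius=0.25)
flow = rigid_scene_flow(points, labels, centers, rotvecs, translations)
```

Because every point on a rigid object shares one rotation and translation, the dense flow field follows directly from the segmentation plus one motion estimate per segment, which is the structural assumption the abstract exploits.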

Code & Models

Repositories

stanford-iprl-lab/sceneflownet (TensorFlow)
