Learning End-To-End Scene Flow by Distilling Single Tasks Knowledge

Filippo Aleotti; Matteo Poggi; Fabio Tosi; Stefano Mattoccia

arXiv:1911.10090·cs.CV·November 25, 2019

Learning End-To-End Scene Flow by Distilling Single Tasks Knowledge

Filippo Aleotti, Matteo Poggi, Fabio Tosi, Stefano Mattoccia

PDF

1 Repo

TL;DR

This paper introduces DWARF, a lightweight end-to-end neural network for scene flow estimation that leverages knowledge distillation from specialized stereo and flow networks, achieving real-time performance with moderate accuracy trade-offs.

Contribution

The paper presents DWARF, a novel architecture for joint scene flow estimation that is efficient, trainable end-to-end from scratch, and benefits from knowledge distillation to improve accuracy.

Findings

01

DWARF runs at 10 FPS on high-end GPU and 1 FPS on embedded devices.

02

Distilling knowledge from specialized networks improves training effectiveness.

03

DWARF achieves competitive accuracy with significantly reduced computational complexity.

Abstract

Scene flow is a challenging task aimed at jointly estimating the 3D structure and motion of the sensed environment. Although deep learning solutions achieve outstanding performance in terms of accuracy, these approaches divide the whole problem into standalone tasks (stereo and optical flow) addressing them with independent networks. Such a strategy dramatically increases the complexity of the training procedure and requires power-hungry GPUs to infer scene flow barely at 1 FPS. Conversely, we propose DWARF, a novel and lightweight architecture able to infer full scene flow jointly reasoning about depth and optical flow easily and elegantly trainable end-to-end from scratch. Moreover, since ground truth images for full scene flow are scarce, we propose to leverage on the knowledge learned by networks specialized in stereo or flow, for which much more data are available, to distill proxy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

FilippoAleotti/Dwarf-Tensorflow
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.