CReaM: Condensed Real-time Models for Depth Prediction using Convolutional Neural Networks
Andrew Spek, Thanuja Dharmasiri, Tom Drummond

TL;DR
This paper introduces CReaM, a novel neural network framework that enables real-time depth prediction at 30fps on mobile platforms like NVIDIA-TX2, using knowledge transfer to create compact yet accurate models.
Contribution
The work presents the first real-time depth prediction framework on a mobile platform and demonstrates effective model condensation through knowledge transfer.
Findings
Achieves 30fps depth prediction on NVIDIA-TX2
Uses knowledge transfer to create compact models
Demonstrates high accuracy with condensed architectures
Abstract
Since the resurgence of CNNs the robotic vision community has developed a range of algorithms that perform classification, semantic segmentation and structure prediction (depths, normals, surface curvature) using neural networks. While some of these models achieve state-of-the art results and super human level performance, deploying these models in a time critical robotic environment remains an ongoing challenge. Real-time frameworks are of paramount importance to build a robotic society where humans and robots integrate seamlessly. To this end, we present a novel real-time structure prediction framework that predicts depth at 30fps on an NVIDIA-TX2. At the time of writing, this is the first piece of work to showcase such a capability on a mobile platform. We also demonstrate with extensive experiments that neural networks with very large model capacities can be leveraged in order to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
