Multi-Modal Depth Estimation Using Convolutional Neural Networks
Sadique Adnan Siddiqui, Axel Vierling, Karsten Berns

TL;DR
This paper proposes a deep learning method for dense depth estimation by fusing Radar and camera data, demonstrating its effectiveness on multiple datasets including real-world and synthetic scenarios.
Contribution
It introduces a novel CNN-based depth estimation approach that leverages Radar and camera fusion with transfer learning, addressing challenges in adverse weather conditions.
Findings
Effective depth estimation on Nuscenes, KITTI, and synthetic datasets.
Successful application to safety-critical crane operation scenarios.
Outperforms existing methods in sensor fusion for depth prediction.
Abstract
This paper addresses the problem of dense depth predictions from sparse distance sensor data and a single camera image on challenging weather conditions. This work explores the significance of different sensor modalities such as camera, Radar, and Lidar for estimating depth by applying Deep Learning approaches. Although Lidar has higher depth-sensing abilities than Radar and has been integrated with camera images in lots of previous works, depth estimation using CNN's on the fusion of robust Radar distance data and camera images has not been explored much. In this work, a deep regression network is proposed utilizing a transfer learning approach consisting of an encoder where a high performing pre-trained model has been used to initialize it for extracting dense features and a decoder for upsampling and predicting desired depth. The results are demonstrated on Nuscenes, KITTI, and a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator
