R4Dyn: Exploring Radar for Self-Supervised Monocular Depth Estimation of Dynamic Scenes
Stefano Gasperini, Patrick Koch, Vinzenz Dallabetta, Nassir Navab,, Benjamin Busam, Federico Tombari

TL;DR
R4Dyn introduces a radar-enhanced self-supervised monocular depth estimation framework that improves dynamic object depth predictions by leveraging radar data during training and inference, addressing static world assumption violations.
Contribution
The paper presents a novel method to incorporate radar data into self-supervised depth estimation, improving dynamic scene understanding in autonomous driving.
Findings
37% improvement on dynamic object depth estimation
Radar data enhances robustness at inference time
Addresses radar noise and sparsity issues
Abstract
While self-supervised monocular depth estimation in driving scenarios has achieved comparable performance to supervised approaches, violations of the static world assumption can still lead to erroneous depth predictions of traffic participants, posing a potential safety issue. In this paper, we present R4Dyn, a novel set of techniques to use cost-efficient radar data on top of a self-supervised depth estimation framework. In particular, we show how radar can be used during training as weak supervision signal, as well as an extra input to enhance the estimation robustness at inference time. Since automotive radars are readily available, this allows to collect training data from a variety of existing vehicles. Moreover, by filtering and expanding the signal to make it compatible with learning-based approaches, we address radar inherent issues, such as noise and sparsity. With R4Dyn we are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
