TRIDE: A Text-assisted Radar-Image weather-aware fusion network for Depth Estimation
Huawei Sun, Zixu Wang, Hao Feng, Julius Ott, Lorenzo Servadei, Robert Wille

TL;DR
TRIDE introduces a weather-aware radar-camera fusion network that leverages text descriptions and radar data to improve depth estimation accuracy in autonomous driving, especially under adverse weather conditions.
Contribution
The paper proposes a novel weather-aware fusion block and text-assisted feature extraction method for improved depth estimation in radar-camera systems.
Findings
Achieves 12.87% improvement in MAE on nuScenes dataset.
Achieves 9.08% improvement in RMSE on nuScenes dataset.
Demonstrates robustness of the approach under different weather conditions.
Abstract
Depth estimation, essential for autonomous driving, seeks to interpret the 3D environment surrounding vehicles. The development of radar sensors, known for their cost-efficiency and robustness, has spurred interest in radar-camera fusion-based solutions. However, existing algorithms fuse features from these modalities without accounting for weather conditions, despite radars being known to be more robust than cameras under adverse weather. Additionally, while Vision-Language models have seen rapid advancement, utilizing language descriptions alongside other modalities for depth estimation remains an open challenge. This paper first introduces a text-generation strategy along with feature extraction and fusion techniques that can assist monocular depth estimation pipelines, leading to improved accuracy across different algorithms on the KITTI dataset. Building on this, we propose TRIDE,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Signal Denoising Methods · Generative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection
