TransRAD: Retentive Vision Transformer for Enhanced Radar Object Detection
Lei Cheng, Siyang Cao

TL;DR
TransRAD introduces a novel radar object detection model using a Retentive Vision Transformer with spatial priors, significantly improving accuracy and efficiency in 3D radar detection for autonomous systems.
Contribution
The paper proposes TransRAD, a new radar detection framework leveraging Retentive Vision Transformer and Location-Aware NMS to address radar data challenges and improve detection performance.
Findings
Outperforms state-of-the-art in 2D and 3D radar detection
Achieves higher accuracy and faster inference
Reduces computational complexity
Abstract
Despite significant advancements in environment perception capabilities for autonomous driving and intelligent robotics, cameras and LiDARs remain notoriously unreliable in low-light conditions and adverse weather, which limits their effectiveness. Radar serves as a reliable and low-cost sensor that can effectively complement these limitations. However, radar-based object detection has been underexplored due to the inherent weaknesses of radar data, such as low resolution, high noise, and lack of visual information. In this paper, we present TransRAD, a novel 3D radar object detection model designed to address these challenges by leveraging the Retentive Vision Transformer (RMT) to more effectively learn features from information-dense radar Range-Azimuth-Doppler (RAD) data. Our approach leverages the Retentive Manhattan Self-Attention (MaSA) mechanism provided by RMT to incorporate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrared Target Detection Methodologies
MethodsAttention Is All You Need · Linear Layer · Dense Connections · Vision Transformer · Multi-Head Attention · Position-Wise Feed-Forward Layer · Label Smoothing · Layer Normalization · Softmax · Adam
