TransRAD: Retentive Vision Transformer for Enhanced Radar Object   Detection

Lei Cheng; Siyang Cao

arXiv:2501.17977·cs.CV·January 31, 2025

TransRAD: Retentive Vision Transformer for Enhanced Radar Object Detection

Lei Cheng, Siyang Cao

PDF

Open Access 1 Repo

TL;DR

TransRAD introduces a novel radar object detection model using a Retentive Vision Transformer with spatial priors, significantly improving accuracy and efficiency in 3D radar detection for autonomous systems.

Contribution

The paper proposes TransRAD, a new radar detection framework leveraging Retentive Vision Transformer and Location-Aware NMS to address radar data challenges and improve detection performance.

Findings

01

Outperforms state-of-the-art in 2D and 3D radar detection

02

Achieves higher accuracy and faster inference

03

Reduces computational complexity

Abstract

Despite significant advancements in environment perception capabilities for autonomous driving and intelligent robotics, cameras and LiDARs remain notoriously unreliable in low-light conditions and adverse weather, which limits their effectiveness. Radar serves as a reliable and low-cost sensor that can effectively complement these limitations. However, radar-based object detection has been underexplored due to the inherent weaknesses of radar data, such as low resolution, high noise, and lack of visual information. In this paper, we present TransRAD, a novel 3D radar object detection model designed to address these challenges by leveraging the Retentive Vision Transformer (RMT) to more effectively learn features from information-dense radar Range-Azimuth-Doppler (RAD) data. Our approach leverages the Retentive Manhattan Self-Attention (MaSA) mechanism provided by RMT to incorporate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

radar-lab/transrad
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInfrared Target Detection Methodologies

MethodsAttention Is All You Need · Linear Layer · Dense Connections · Vision Transformer · Multi-Head Attention · Position-Wise Feed-Forward Layer · Label Smoothing · Layer Normalization · Softmax · Adam