Mask-RadarNet: Enhancing Transformer With Spatial-Temporal Semantic Context for Radar Object Detection in Autonomous Driving
Yuzhi Wu, Jun Liu, Guangfeng Jiang, Weijian Liu, Danilo Orlando

TL;DR
Mask-RadarNet is a novel transformer-based model that enhances radar object detection by effectively capturing spatial-temporal semantic context while reducing computational complexity.
Contribution
The paper introduces Mask-RadarNet, which combines convolution and attention, uses patch shift for efficiency, and incorporates a class masking attention module for improved radar object detection.
Findings
Outperforms state-of-the-art radar detection algorithms on CRUW dataset.
Achieves higher accuracy with fewer parameters and lower computational cost.
Demonstrates effective spatial-temporal semantic feature learning.
Abstract
As a cost-effective and robust technology, automotive radar has seen steady improvement during the last years, making it an appealing complement to commonly used sensors like camera and LiDAR in autonomous driving. Radio frequency data with rich semantic information are attracting more and more attention. Most current radar-based models take radio frequency image sequences as the input. However, these models heavily rely on convolutional neural networks and leave out the spatial-temporal semantic context during the encoding stage. To solve these problems, we propose a model called Mask-RadarNet to fully utilize the hierarchical semantic features from the input radar data. Mask-RadarNet exploits the combination of interleaved convolution and attention operations to replace the traditional architecture in transformer-based models. In addition, patch shift is introduced to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced SAR Imaging Techniques · Geophysical Methods and Applications · Underwater Acoustics Research
MethodsSoftmax · Attention Is All You Need · Convolution
