4D Radar Semantic Segmentation of People in Field Conditions Using Temporal Multi-View Networks
Mikael Skog, Oleksandr Kotlyar, Vladim\'ir Kubelka, Martin Magnusson

TL;DR
This paper introduces TMVA4D, a neural network architecture leveraging 4D radar data for robust semantic segmentation of people in adverse conditions, outperforming traditional sensors.
Contribution
The authors develop a novel CNN and ConvLSTM-based architecture that utilizes 4D radar point clouds for accurate person segmentation in challenging environments.
Findings
Achieved Dice score of 75.9% and IoU of 61.2% for person class.
Demonstrated robustness of 4D radar in low-visibility conditions.
Models trained on multi-view projections outperform traditional sensor-based methods.
Abstract
Reliable people detection is crucial for the safe autonomy of mobile robots and heavy vehicles, both on roads and in industrial settings like mining and construction. However, common sensors like cameras or lidars are prone to failure in adverse conditions such as dust, fog, or smoke, which limits their use in real-world robotic systems. Radar, on the other hand, delivers robust measurements in a wide range of environmental conditions. In particular, modern high-resolution 4D imaging radars provide 4D point clouds across range, azimuth, and elevation, as well as per-point Doppler velocity data, well suited for robot perception. We propose TMVA4D, a family of artificial neural network architectures based on CNN and ConvLSTM encoders that leverage the 4D radar modality for semantic segmentation. The architectures are trained to distinguish between background and person classes using a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced SAR Imaging Techniques · Radar Systems and Signal Processing
