mmAnomaly: Leveraging Visual Context for Robust Anomaly Detection in the Non-Visual World with mmWave Radar
Tarik Reza Toha, Shao-Jung (Louie) Lu, Mahathir Monjur, and Shahriar Nirjon

TL;DR
mmAnomaly introduces a multi-modal framework combining mmWave radar and visual data to improve anomaly detection accuracy in challenging non-visual sensing scenarios.
Contribution
It presents a novel system that integrates visual context with mmWave radar signals using semantic classification and generative modeling for robust anomaly detection.
Findings
Achieves up to 94% F1 score in anomaly detection.
Demonstrates sub-meter localization accuracy.
Generalizes well across different environments and obstructions.
Abstract
mmWave radar enables human sensing in non-visual scenarios-e.g., through clothing or certain types of walls-where traditional cameras fail due to occlusion or privacy limitations. However, robust anomaly detection with mmWave remains challenging, as signal reflections are influenced by material properties, clutter, and multipath interference, producing complex, non-Gaussian distortions. Existing methods lack contextual awareness and misclassify benign signal variations as anomalies. We present mmAnomaly, a multi-modal anomaly detection framework that combines mmWave radar with RGBD input to incorporate visual context. Our system extracts semantic cues-such as scene geometry and material properties-using a fast ResNet-based classifier, and uses a conditional latent diffusion model to synthesize the expected mmWave spectrum for the given visual context. A dual-input comparison module then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
