LeAD-M3D: Leveraging Asymmetric Distillation for Real-Time Monocular 3D Detection
Johannes Meier, Jonathan Michel, Oussema Dhaouadi, Yung-Hsu Yang, Christoph Reich, Zuria Bauer, Stefan Roth, Marc Pollefeys, Jacques Kaiser, Daniel Cremers

TL;DR
LeAD-M3D is a novel monocular 3D detection method that achieves state-of-the-art accuracy and real-time speed by leveraging asymmetric distillation, 3D-aware matching, and confidence gating, without relying on extra modalities.
Contribution
The paper introduces LeAD-M3D, a new approach combining three key components to improve monocular 3D detection accuracy and efficiency without additional sensors.
Findings
Achieves state-of-the-art accuracy on KITTI and Waymo datasets.
Runs up to 3.6 times faster than previous high-accuracy models.
Sets a new Pareto frontier for accuracy and speed in monocular 3D detection.
Abstract
Real-time monocular 3D object detection remains challenging due to severe depth ambiguity, viewpoint shifts, and the high computational cost of 3D reasoning. Existing approaches either rely on LiDAR or geometric priors to compensate for missing depth or sacrifice efficiency to achieve competitive accuracy. We introduce LeAD-M3D, a monocular 3D detector that achieves state-of-the-art accuracy and real-time inference without extra modalities. Our method is enabled by three key components. Asymmetric Augmentation Denoising Distillation (A2D2) transfers geometric knowledge from a clean-image teacher to a MixUp-noised student via a quality- and importance-weighted depth-feature loss, enabling stronger depth reasoning without LiDAR. 3D-aware Consistent Matching (CM) improves prediction-to-ground truth assignment by integrating 3D MGIoU into the matching score, yielding stable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Advanced Vision and Imaging
