AsyncMDE: Real-Time Monocular Depth Estimation via Asynchronous Spatial Memory
Lianjie Ma, Yuquan Li, Bingzheng Jiang, Ziming Zhong, Han Ding, and Lijun Zhu

TL;DR
AsyncMDE introduces an asynchronous monocular depth estimation system that efficiently reuses features over time, enabling real-time, edge-deployable depth perception with high accuracy and low computational cost.
Contribution
It proposes a novel asynchronous framework combining a foundation model and a lightweight model for efficient, real-time depth estimation on edge devices.
Findings
Operates at 237 FPS on RTX 4090 with 3.83M parameters.
Recovers 77% of foundation model accuracy with 25X fewer parameters.
Achieves 161 FPS on Jetson AGX Orin, demonstrating real-time edge deployment.
Abstract
Foundation-model-based monocular depth estimation offers a viable alternative to active sensors for robot perception, yet its computational cost often prohibits deployment on edge platforms. Existing methods perform independent per-frame inference, wasting the substantial computational redundancy between adjacent viewpoints in continuous robot operation. This paper presents AsyncMDE, an asynchronous depth perception system consisting of a foundation model and a lightweight model that amortizes the foundation model's computational cost over time. The foundation model produces high-quality spatial features in the background, while the lightweight model runs asynchronously in the foreground, fusing cached memory with current observations through complementary fusion, outputting depth estimates, and autoregressively updating the memory. This enables cross-frame feature reuse with bounded…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Advanced Optical Sensing Technologies
