RTS-Mono: A Real-Time Self-Supervised Monocular Depth Estimation Method for Real-World Deployment
Zeyu Cheng, Tongfei Liu, Tao Lei, Xiang Hua, Yi Zhang, Chengkai Tang

TL;DR
RTS-Mono is a lightweight, real-time self-supervised monocular depth estimation method that achieves state-of-the-art accuracy and high inference speed, suitable for deployment in autonomous driving and robotics.
Contribution
The paper introduces RTS-Mono, a novel efficient encoder-decoder architecture for monocular depth estimation that balances performance and real-time inference capability.
Findings
Achieved state-of-the-art performance on KITTI dataset with only 3 million parameters.
Improved accuracy metrics over existing lightweight methods by significant margins.
Capable of real-time inference at 49 FPS on Nvidia Jetson Orin.
Abstract
Depth information is crucial for autonomous driving and intelligent robot navigation. The simplicity and flexibility of self-supervised monocular depth estimation are conducive to its role in these fields. However, most existing monocular depth estimation models consume many computing resources. Although some methods have reduced the model's size and improved computing efficiency, the performance deteriorates, seriously hindering the real-world deployment of self-supervised monocular depth estimation models in the real world. To address this problem, we proposed a real-time self-supervised monocular depth estimation method and implemented it in the real world. It is called RTS-Mono, which is a lightweight and efficient encoder-decoder architecture. The encoder is based on Lite-Encoder, and the decoder is designed with a multi-scale sparse fusion framework to minimize redundancy, ensure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Advanced Neural Network Applications
