KineDepth: Utilizing Robot Kinematics for Online Metric Depth Estimation
Soofiyan Atar, Yuheng Zhi, Florian Richter, and Michael Yip

TL;DR
This paper introduces KineDepth, a real-time method that uses robot kinematics and an LSTM-based regressor to convert relative monocular depth estimates into accurate metric depths, enhancing robotic perception without additional hardware.
Contribution
We propose a novel online approach leveraging robot kinematics and probabilistic filtering to achieve metric depth estimation from monocular cameras in real-time.
Findings
22.1% reduction in depth error
52% increase in task success rate
Outperforms state-of-the-art monocular depth methods
Abstract
Depth perception is essential for a robot's spatial and geometric understanding of its environment, with many tasks traditionally relying on hardware-based depth sensors like RGB-D or stereo cameras. However, these sensors face practical limitations, including issues with transparent and reflective objects, high costs, calibration complexity, spatial and energy constraints, and increased failure rates in compound systems. While monocular depth estimation methods offer a cost-effective and simpler alternative, their adoption in robotics is limited due to their output of relative rather than metric depth, which is crucial for robotics applications. In this paper, we propose a method that utilizes a single calibrated camera, enabling the robot to act as a "measuring stick" to convert relative depth estimates into metric depth in real-time as tasks are performed. Our approach employs an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization
