Monocular Depth Estimation for Soft Visuotactile Sensors
Rares Ambrus, Vitor Guizilini, Naveen Kuppuswamy, Andrew Beaulieu,, Adrien Gaidon, Alex Alspach

TL;DR
This paper demonstrates that deep learning can be used to accurately estimate dense internal depth maps in soft visuotactile sensors from a single IR image, enabling improved tactile sensing for robotic manipulation.
Contribution
It introduces a supervised learning method for monocular depth estimation inside soft tactile sensors, requiring minimal training data and generalizing across objects and sensor setups.
Findings
Deep networks trained for long-range depth can be adapted for short-range tactile sensing.
The approach requires less than 10 contact poses for training.
The method achieves accurate, sample-efficient depth predictions that generalize well.
Abstract
Fluid-filled soft visuotactile sensors such as the Soft-bubbles alleviate key challenges for robust manipulation, as they enable reliable grasps along with the ability to obtain high-resolution sensory feedback on contact geometry and forces. Although they are simple in construction, their utility has been limited due to size constraints introduced by enclosed custom IR/depth imaging sensors to directly measure surface deformations. Towards mitigating this limitation, we investigate the application of state-of-the-art monocular depth estimation to infer dense internal (tactile) depth maps directly from the internal single small IR imaging sensor. Through real-world experiments, we show that deep networks typically used for long-range depth estimation (1-100m) can be effectively trained for precise predictions at a much shorter range (1-100mm) inside a mostly textureless deformable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
