Deep Gated Multi-modal Learning: In-hand Object Pose Changes Estimation using Tactile and Image Data
Tomoki Anzai, Kuniyuki Takahashi

TL;DR
This paper introduces a deep gated multi-modal learning approach that automatically determines the reliability of tactile and visual data to accurately estimate object pose changes during in-hand manipulation, even with occlusions and noise.
Contribution
It presents a novel end-to-end deep learning model that self-adjusts modality reliability, improving in-hand object pose estimation using combined tactile and image data.
Findings
Effective estimation of pose changes for 15 objects during grasping.
Reliability values adapt to noise and sensor failure, enhancing robustness.
Accurate pose change estimation for unknown objects.
Abstract
For in-hand manipulation, estimation of the object pose inside the hand is one of the important functions to manipulate objects to the target pose. Since in-hand manipulation tends to cause occlusions by the hand or the object itself, image information only is not sufficient for in-hand object pose estimation. Multiple modalities can be used in this case, the advantage is that other modalities can compensate for occlusion, noise, and sensor malfunctions. Even though deciding the utilization rate of a modality (referred to as reliability value) corresponding to the situations is important, the manual design of such models is difficult, especially for various situations. In this paper, we propose deep gated multi-modal learning, which self-determines the reliability value of each modality through end-to-end deep learning. For the experiments, an RGB camera and a GelSight tactile sensor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Tactile and Sensory Interactions · Soft Robotics and Applications
