Active Visuo-Tactile Interactive Robotic Perception for Accurate Object Pose Estimation in Dense Clutter
Prajval Kumar Murali, Anirvan Dutta, Michael Gentner, Etienne Burdet,, Ravinder Dahiya, Mohsen Kaboli

TL;DR
This paper introduces an active visuo-tactile robotic perception framework that uses a novel declutter graph and quaternion filter to improve object pose estimation accuracy in dense cluttered environments.
Contribution
It presents a new declutter graph for scene understanding and a quaternion filter for active pose estimation, enhancing accuracy over static vision methods.
Findings
Achieved up to 36% improvement in pose accuracy.
Demonstrated effectiveness in dense cluttered scenes with two robots.
Validated the approach through ablation studies and comparisons.
Abstract
This work presents a novel active visuo-tactile based framework for robotic systems to accurately estimate pose of objects in dense cluttered environments. The scene representation is derived using a novel declutter graph (DG) which describes the relationship among objects in the scene for decluttering by leveraging semantic segmentation and grasp affordances networks. The graph formulation allows robots to efficiently declutter the workspace by autonomously selecting the next best object to remove and the optimal action (prehensile or non-prehensile) to perform. Furthermore, we propose a novel translation-invariant Quaternion filter (TIQF) for active vision and active tactile based pose estimation. Both active visual and active tactile points are selected by maximizing the expected information gain. We evaluate our proposed framework on a system with two robots coordinating on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
