Gaussian Process-Based Active Exploration Strategies in Vision and Touch
Ho Jin Choi, Nadia Figueroa

TL;DR
This paper introduces a Gaussian Process-based framework for active multi-sensor exploration in robotics, fusing vision and touch to improve understanding of object geometry and properties without extensive pretraining.
Contribution
The work presents a novel GPDF representation that integrates visual and tactile data for active perception, enabling real-time geometry refinement and uncertainty-driven exploration.
Findings
Effective multi-sensor fusion for shape understanding
Active exploration improves geometric accuracy
Scalable with inducing point approximation methods
Abstract
Robots struggle to understand object properties like shape, material, and semantics due to limited prior knowledge, hindering manipulation in unstructured environments. In contrast, humans learn these properties through interactive multi-sensor exploration. This work proposes fusing visual and tactile observations into a unified Gaussian Process Distance Field (GPDF) representation for active perception of object properties. While primarily focusing on geometry, this approach also demonstrates potential for modeling surface properties beyond geometry. The GPDF encodes signed distance using point cloud, analytic gradient and Hessian, and surface uncertainty estimates, which are attributes that common neural network shape representation lack. By utilizing a point cloud to construct a distance function, GPDF does not need extensive pretraining on large datasets and can incorporate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Gaussian Processes and Bayesian Inference · Advanced Sensor and Energy Harvesting Materials
