SplatSearch: Instance Image Goal Navigation for Mobile Robots using 3D Gaussian Splatting and Diffusion Models
Siddarth Narasimhan, Matthew Lisondra, Haitong Wang, Goldie Nejat

TL;DR
SplatSearch enables mobile robots to locate specific objects in unknown environments using sparse-view 3D reconstructions and diffusion models, improving success rates in goal-directed navigation tasks.
Contribution
The paper introduces SplatSearch, a novel approach combining 3D Gaussian Splatting and diffusion models for robust instance image goal navigation in unknown environments.
Findings
Outperforms state-of-the-art in success rate and path efficiency
Effective in photorealistic and real-world environments
Ablation confirms importance of design components
Abstract
The Instance Image Goal Navigation (IIN) problem requires mobile robots deployed in unknown environments to search for specific objects or people of interest using only a single reference goal image of the target. This problem can be especially challenging when: 1) the reference image is captured from an arbitrary viewpoint, and 2) the robot must operate with sparse-view scene reconstructions. In this paper, we address the IIN problem, by introducing SplatSearch, a novel architecture that leverages sparse-view 3D Gaussian Splatting (3DGS) reconstructions. SplatSearch renders multiple viewpoints around candidate objects using a sparse online 3DGS map, and uses a multi-view diffusion model to complete missing regions of the rendered images, enabling robust feature matching against the goal image. A novel frontier exploration policy is introduced which uses visual context from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging
