FusionSense: Bridging Common Sense, Vision, and Touch for Robust Sparse-View Reconstruction
Irving Fang, Kairui Shi, Xujin He, Siqi Tan, Yifan Wang, Hanwen Zhao,, Hung-Jui Huang, Wenzhen Yuan, Chen Feng, Jing Zhang

TL;DR
FusionSense is a novel 3D reconstruction framework that combines common-sense priors with sparse vision and tactile data, enabling robots to perceive complex objects more robustly and efficiently.
Contribution
It introduces a hierarchical optimization approach using 3D Gaussian Splatting to fuse priors with sparse sensory observations for improved 3D reconstruction.
Findings
Outperforms state-of-the-art sparse-view reconstruction methods
Enables robust perception of transparent, reflective, and dark objects
Facilitates downstream manipulation and navigation tasks
Abstract
Humans effortlessly integrate common-sense knowledge with sensory input from vision and touch to understand their surroundings. Emulating this capability, we introduce FusionSense, a novel 3D reconstruction framework that enables robots to fuse priors from foundation models with highly sparse observations from vision and tactile sensors. FusionSense addresses three key challenges: (i) How can robots efficiently acquire robust global shape information about the surrounding scene and objects? (ii) How can robots strategically select touch points on the object using geometric and common-sense priors? (iii) How can partial observations such as tactile signals improve the overall representation of the object? Our framework employs 3D Gaussian Splatting as a core representation and incorporates a hierarchical optimization strategy involving global structure construction, object visual hull…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Industrial Vision Systems and Defect Detection · Robotics and Sensor-Based Localization
MethodsPruning
