TriHelper: Zero-Shot Object Navigation with Dynamic Assistance
Lingfeng Zhang, Qiang Zhang, Hao Wang, Erjia Xiao, Zixuan Jiang,, Honglei Chen, Renjing Xu

TL;DR
TriHelper is a new framework that improves zero-shot object navigation by dynamically assisting agents with collision avoidance, exploration, and detection, leading to better success rates and efficiency in unknown environments.
Contribution
The paper introduces TriHelper, a novel multi-component framework that addresses specific navigation challenges dynamically, improving zero-shot object navigation performance.
Findings
Outperforms existing methods in success rate and exploration efficiency
Each helper component significantly enhances navigation capabilities
Effective in diverse datasets like HM3D and Gibson
Abstract
Navigating toward specific objects in unknown environments without additional training, known as Zero-Shot object navigation, poses a significant challenge in the field of robotics, which demands high levels of auxiliary information and strategic planning. Traditional works have focused on holistic solutions, overlooking the specific challenges agents encounter during navigation such as collision, low exploration efficiency, and misidentification of targets. To address these challenges, our work proposes TriHelper, a novel framework designed to assist agents dynamically through three primary navigation challenges: collision, exploration, and detection. Specifically, our framework consists of three innovative components: (i) Collision Helper, (ii) Exploration Helper, and (iii) Detection Helper. These components work collaboratively to solve these challenges throughout the navigation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Robotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques
