CogDDN: A Cognitive Demand-Driven Navigation with Decision Optimization and Dual-Process Thinking
Yuehao Huang, Liang Liu, Shuangming Lei, Yukai Ma, Hao Su, Jianbiao Mei, Pengxiang Zhao, Yaqing Gu, Yong Liu, Jiajun Lv

TL;DR
CogDDN is a novel framework for demand-driven robot navigation that mimics human cognition by integrating dual-process decision-making and semantic understanding, leading to improved accuracy and adaptability in unstructured environments.
Contribution
It introduces CogDDN, a VLM-based system combining fast and slow thinking, semantic alignment, and chain of thought reasoning for enhanced demand-driven navigation.
Findings
Outperforms single-view camera methods by 15% in accuracy.
Effectively integrates dual-process decision-making for better adaptability.
Demonstrates robustness in unstructured environments through extensive simulation tests.
Abstract
Mobile robots are increasingly required to navigate and interact within unknown and unstructured environments to meet human demands. Demand-driven navigation (DDN) enables robots to identify and locate objects based on implicit human intent, even when object locations are unknown. However, traditional data-driven DDN methods rely on pre-collected data for model training and decision-making, limiting their generalization capability in unseen scenarios. In this paper, we propose CogDDN, a VLM-based framework that emulates the human cognitive and learning mechanisms by integrating fast and slow thinking systems and selectively identifying key objects essential to fulfilling user demands. CogDDN identifies appropriate target objects by semantically aligning detected objects with the given instructions. Furthermore, it incorporates a dual-process decision-making module, comprising a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
