DANLI: Deliberative Agent for Following Natural Language Instructions
Yichi Zhang, Jianing Yang, Jiayi Pan, Shane Storks, Nikhil Devraj,, Ziqiao Ma, Keunwoo Peter Yu, Yuwei Bao, Joyce Chai

TL;DR
DANLI introduces a neuro-symbolic deliberative agent that enhances task performance and transparency in following natural language instructions for embodied AI, outperforming reactive models on complex benchmarks.
Contribution
The paper presents a novel neuro-symbolic deliberative framework that combines reasoning and planning with neural and symbolic representations for embodied AI.
Findings
Achieves over 70% improvement on TEACh benchmark
Provides transparency and explainability in agent behaviors
Demonstrates effectiveness in complex, long-horizon tasks
Abstract
Recent years have seen an increasing amount of work on embodied AI agents that can perform tasks by following human language instructions. However, most of these agents are reactive, meaning that they simply learn and imitate behaviors encountered in the training data. These reactive agents are insufficient for long-horizon complex tasks. To address this limitation, we propose a neuro-symbolic deliberative agent that, while following language instructions, proactively applies reasoning and planning based on its neural and symbolic representations acquired from past experience (e.g., natural language and egocentric vision). We show that our deliberative agent achieves greater than 70% improvement over reactive baselines on the challenging TEACh benchmark. Moreover, the underlying reasoning and planning processes, together with our modular framework, offer impressive transparency and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques
