Self-driven Grounding: Large Language Model Agents with Automatical Language-aligned Skill Learning
Shaohui Peng, Xing Hu, Qi Yi, Rui Zhang, Jiaming Guo, Di Huang, Zikang, Tian, Ruizhi Chen, Zidong Du, Qi Guo, Yunji Chen, Ling Li

TL;DR
This paper introduces Self-Driven Grounding (SDG), a framework enabling large language models to autonomously learn and verify skills through environment interaction, reducing human effort and enhancing task generality.
Contribution
The paper presents a novel SDG framework that allows LLMs to automatically ground skills via self-driven hypothesis testing and environment interaction, improving adaptability and reducing manual tuning.
Findings
SDG achieves comparable performance to imitation learning in BabyAI tasks.
SDG reduces human effort by automating skill grounding.
Learned skills enable tackling more complex tasks.
Abstract
Large language models (LLMs) show their powerful automatic reasoning and planning capability with a wealth of semantic knowledge about the human world. However, the grounding problem still hinders the applications of LLMs in the real-world environment. Existing studies try to fine-tune the LLM or utilize pre-defined behavior APIs to bridge the LLMs and the environment, which not only costs huge human efforts to customize for every single task but also weakens the generality strengths of LLMs. To autonomously ground the LLM onto the environment, we proposed the Self-Driven Grounding (SDG) framework to automatically and progressively ground the LLM with self-driven skill learning. SDG first employs the LLM to propose the hypothesis of sub-goals to achieve tasks and then verify the feasibility of the hypothesis via interacting with the underlying environment. Once verified, SDG can then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
Methodsfail
