Self-driven Grounding: Large Language Model Agents with Automatical   Language-aligned Skill Learning

Shaohui Peng; Xing Hu; Qi Yi; Rui Zhang; Jiaming Guo; Di Huang; Zikang; Tian; Ruizhi Chen; Zidong Du; Qi Guo; Yunji Chen; Ling Li

arXiv:2309.01352·cs.CL·September 6, 2023·1 cites

Self-driven Grounding: Large Language Model Agents with Automatical Language-aligned Skill Learning

Shaohui Peng, Xing Hu, Qi Yi, Rui Zhang, Jiaming Guo, Di Huang, Zikang, Tian, Ruizhi Chen, Zidong Du, Qi Guo, Yunji Chen, Ling Li

PDF

Open Access

TL;DR

This paper introduces Self-Driven Grounding (SDG), a framework enabling large language models to autonomously learn and verify skills through environment interaction, reducing human effort and enhancing task generality.

Contribution

The paper presents a novel SDG framework that allows LLMs to automatically ground skills via self-driven hypothesis testing and environment interaction, improving adaptability and reducing manual tuning.

Findings

01

SDG achieves comparable performance to imitation learning in BabyAI tasks.

02

SDG reduces human effort by automating skill grounding.

03

Learned skills enable tackling more complex tasks.

Abstract

Large language models (LLMs) show their powerful automatic reasoning and planning capability with a wealth of semantic knowledge about the human world. However, the grounding problem still hinders the applications of LLMs in the real-world environment. Existing studies try to fine-tune the LLM or utilize pre-defined behavior APIs to bridge the LLMs and the environment, which not only costs huge human efforts to customize for every single task but also weakens the generality strengths of LLMs. To autonomously ground the LLM onto the environment, we proposed the Self-Driven Grounding (SDG) framework to automatically and progressively ground the LLM with self-driven skill learning. SDG first employs the LLM to propose the hypothesis of sub-goals to achieve tasks and then verify the feasibility of the hypothesis via interacting with the underlying environment. Once verified, SDG can then…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

Methodsfail