Automated Skill Discovery for Language Agents through Exploration and Iterative Feedback
Yongjin Yang, Sinjae Kang, Juyong Lee, Dongjun Lee, Se-Young Yun, Kimin Lee

TL;DR
This paper introduces EXIF, an automatic framework for discovering and training skills in language agents through exploration and iterative feedback, reducing manual effort and improving agent capabilities.
Contribution
The paper presents a novel exploration-based, iterative feedback method for automatic skill discovery in language agents, enhancing feasibility and agent performance without human intervention.
Findings
EXIF effectively discovers meaningful skills for language agents.
Iterative feedback improves agent performance over multiple cycles.
Using the same model for Alice and Bob enhances the system's self-evolving capability.
Abstract
Training large language model (LLM) agents to acquire necessary skills and perform diverse tasks within an environment is gaining interest as a means to enable open-endedness. However, creating the training dataset for their skill acquisition faces several challenges. Manual trajectory collection requires significant human effort. Another approach, where LLMs directly propose tasks to learn, is often invalid, as the LLMs lack knowledge of which tasks are actually feasible. Moreover, the generated data may not provide a meaningful learning signal, as agents often already perform well on the proposed tasks. To address this, we propose a novel automatic skill discovery framework EXIF for LLM-powered agents, designed to improve the feasibility of generated target behaviors while accounting for the agents' capabilities. Our method adopts an exploration-first strategy by employing an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Intelligent Tutoring Systems and Adaptive Learning
