TL;DR
Skills-Coach is an innovative framework that self-evolves and optimizes skills in LLM-based agents through training-free modules, improving performance across diverse skill categories.
Contribution
It introduces a comprehensive, training-free skill optimization framework with modular components for skill generation, optimization, evaluation, and benchmarking.
Findings
Achieves significant performance improvements across various skills.
Introduces Skill-X, a benchmark dataset with 48 diverse skills.
Demonstrates flexible execution modes for practical deployment.
Abstract
We introduce Skills-Coach, a novel automated framework designed to significantly enhance the self-evolution of skills within Large Language Model (LLM)-based agents. Addressing the current fragmentation of the skill ecosystem, Skills-Coach explores the boundaries of skill capabilities, thereby facilitating the comprehensive competency coverage essential for intelligent applications. The framework comprises four core modules: a Diverse Task Generation Module that systematically creates a comprehensive test suite for various skills; a Lightweight Optimization Module dedicated to optimizing skill prompts and their corresponding code; a Comparative Execution Module facilitating the execution and evaluation of both original and optimized skills; and a Traceable Evaluation Module, which rigorously evaluates performance against specified criteria. Skills-Coach offers flexible execution options…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
