SI-Agent: An Agentic Framework for Feedback-Driven Generation and Tuning of Human-Readable System Instructions for Large Language Models
Jeshwanth Challagundla

TL;DR
SI-Agent is a novel framework that automatically generates and refines human-readable system instructions for LLMs using an agentic, feedback-driven process, improving interpretability and performance.
Contribution
The paper introduces SI-Agent, an innovative agentic framework that automates the creation and iterative refinement of human-readable system instructions for LLMs, combining multiple agents and feedback loops.
Findings
SI-Agent produces effective, readable system instructions.
It achieves a better balance between task performance and interpretability.
The framework demonstrates efficiency in instruction tuning.
Abstract
System Instructions (SIs), or system prompts, are pivotal for guiding Large Language Models (LLMs) but manual crafting is resource-intensive and often suboptimal. Existing automated methods frequently generate non-human-readable "soft prompts," sacrificing interpretability. This paper introduces SI-Agent, a novel agentic framework designed to automatically generate and iteratively refine human-readable SIs through a feedback-driven loop. SI-Agent employs three collaborating agents: an Instructor Agent, an Instruction Follower Agent (target LLM), and a Feedback/Reward Agent evaluating task performance and optionally SI readability. The framework utilizes iterative cycles where feedback guides the Instructor's refinement strategy (e.g., LLM-based editing, evolutionary algorithms). We detail the framework's architecture, agent roles, the iterative refinement process, and contrast it with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
