Input Conditioned Graph Generation for Language Agents
Lukas Vierling, Jie Fu, Kai Chen

TL;DR
This paper introduces a learnable, dynamic graph-based framework for language agents, enabling adaptive internal communication modeled as graph edges, fine-tuned with reinforcement learning, outperforming static approaches across multiple datasets.
Contribution
Develops a novel method to generate adaptive communication graphs for language agents using a pretrained LLM fine-tuned with reinforcement learning, enhancing flexibility and performance.
Findings
Achieves nearly 6% higher accuracy on MMLU and CMMLU datasets.
Surpasses static approaches by over 10% with sparsity-inducing loss.
Demonstrates superior performance on additional datasets like Mini Crossword Puzzles.
Abstract
Recent progress in Large Language Models (LLMs) and language agents has demonstrated significant promise for various future applications across multiple disciplines. While traditional approaches to language agents often rely on fixed, handcrafted designs, our research aims to develop both learnable and dynamic agents. Our method uses an existing framework that abstracts language agents as graphs. Within this graph framework, we aim to learn a model that can generate edges for every given input to the language agent. This allows us to generate edges that represent the flow of communication within the graph based on the given input, thereby adjusting the internal communication of a language agent. We learn to generate these edges using a pretrained LLM that is fine-tuned with reinforcement learning. This LLM can be fine-tuned on several datasets simultaneously, and we hypothesize that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies
