Can Graph Learning Improve Planning in LLM-based Agents?
Xixi Wu, Yifei Shen, Caihua Shan, Kaitao Song, Siwei Wang, Bohang, Zhang, Jiarui Feng, Hong Cheng, Wei Chen, Yun Xiong, Dongsheng Li

TL;DR
This paper investigates how integrating graph neural networks with large language models can improve task planning in language agents, especially for complex, graph-structured sub-tasks, showing that GNNs outperform existing methods.
Contribution
It introduces a novel approach combining GNNs with LLMs for task planning, addressing attention biases and demonstrating improved performance on graph-structured tasks.
Findings
GNN-based methods outperform existing solutions without training.
Minimal training further improves GNN performance.
Performance gains increase with larger task graphs.
Abstract
Task planning in language agents is emerging as an important research topic alongside the development of large language models (LLMs). It aims to break down complex user requests in natural language into solvable sub-tasks, thereby fulfilling the original requests. In this context, the sub-tasks can be naturally viewed as a graph, where the nodes represent the sub-tasks, and the edges denote the dependencies among them. Consequently, task planning is a decision-making problem that involves selecting a connected path or subgraph within the corresponding graph and invoking it. In this paper, we explore graph learning-based methods for task planning, a direction that is orthogonal to the prevalent focus on prompt design. Our interest in graph learning stems from a theoretical discovery: the biases of attention and auto-regressive loss impede LLMs' ability to effectively navigate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Online Learning and Analytics
MethodsFocus
