A Plan Reuse Mechanism for LLM-Driven Agent
Guopeng Li, Ruiqi Wu, Haisheng Tan

TL;DR
This paper introduces AgentReuse, a plan reuse mechanism for LLM-driven agents that significantly reduces response latency by reusing plans for similar user requests, achieving high accuracy in request similarity evaluation.
Contribution
The paper proposes a novel plan reuse mechanism leveraging intent classification to improve efficiency in LLM-driven agents, with high reuse rate and accuracy.
Findings
Achieves 93% plan reuse rate
Reduces latency by 93.12%
High request similarity evaluation accuracy (F1=0.9718)
Abstract
Integrating large language models (LLMs) into personal assistants, like Xiao Ai and Blue Heart V, effectively enhances their ability to interact with humans, solve complex tasks, and manage IoT devices. Such assistants are also termed LLM-driven agents. Upon receiving user requests, the LLM-driven agent generates plans using an LLM, executes these plans through various tools, and then returns the response to the user. During this process, the latency for generating a plan with an LLM can reach tens of seconds, significantly degrading user experience. Real-world dataset analysis shows that about 30% of the requests received by LLM-driven agents are identical or similar, which allows the reuse of previously generated plans to reduce latency. However, it is difficult to accurately define the similarity between the request texts received by the LLM-driven agent through directly evaluating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · AI in Service Interactions · Multimodal Machine Learning Applications
