Reason for Future, Act for Now: A Principled Framework for Autonomous   LLM Agents with Provable Sample Efficiency

Zhihan Liu; Hao Hu; Shenao Zhang; Hongyi Guo; Shuqi Ke; Boyi Liu,; Zhaoran Wang

arXiv:2309.17382·cs.AI·July 2, 2024·2 cites

Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency

Zhihan Liu, Hao Hu, Shenao Zhang, Hongyi Guo, Shuqi Ke, Boyi Liu,, Zhaoran Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces RAFA, a framework combining reasoning and acting in LLMs with provable regret guarantees, enabling efficient task completion through long-term planning and short-term actions.

Contribution

It presents a novel Bayesian MDP-based approach for LLM reasoning and planning with theoretical regret bounds and empirical performance improvements.

Findings

01

Achieves $\sqrt{T}$ regret bound in online interactions.

02

Outperforms existing frameworks on benchmarks.

03

Nearly perfect scores on selected tasks.

Abstract

Large language models (LLMs) demonstrate impressive reasoning abilities, but translating reasoning into actions in the real world remains challenging. In particular, it remains unclear how to complete a given task provably within a minimum number of interactions with the external environment, e.g., through an internal mechanism of reasoning. To this end, we propose a principled framework with provable regret guarantees to orchestrate reasoning and acting, which we call "reason for future, act for now" (\texttt{RAFA}). Specifically, we design a prompt template for reasoning that learns from the memory buffer and plans a future trajectory over a long horizon ("reason for future"). At each step, the LLM agent takes the initial action of the planned trajectory ("act for now"), stores the collected feedback in the memory buffer, and reinvokes the reasoning routine to replan the future…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

agentification/RAFA_code
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)