AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback

Wanpeng Zhang; Zongqing Lu

arXiv:2309.17176·cs.AI·May 6, 2024

AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback

Wanpeng Zhang, Zongqing Lu

PDF

Open Access 1 Repo

TL;DR

AdaRefiner is a framework that improves large language models' decision-making by automatically refining their understanding through reinforcement learning feedback, reducing the need for manual prompt engineering and fine-tuning.

Contribution

It introduces a lightweight adapter that enables automatic self-refinement of LLMs with RL feedback, enhancing decision-making in complex tasks without extensive prompt engineering.

Findings

01

Outperforms baselines on 22 open-world game tasks

02

Enhances high-level and common-sense skills in agents

03

Reduces need for prompt engineering and fine-tuning

Abstract

Large Language Models (LLMs) have demonstrated significant success across various domains. However, their application in complex decision-making tasks frequently necessitates intricate prompt engineering or fine-tuning, leading to challenges in unseen downstream tasks and heavy demands on computational resources. Meanwhile, Reinforcement Learning (RL) has been recognized as effective in decision-making problems but struggles in environments with sparse rewards, such as open-world games. To overcome these challenges, we introduce AdaRefiner, a novel framework designed to enhance the synergy between LLMs and RL feedback. The key component of AdaRefiner is a lightweight Adapter Language Model (LM), which automatically refines task comprehension based on feedback from RL agents. This method mitigates the need for intricate prompt engineering and intensive LLM fine-tuning while maintaining…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pku-rl/adarefiner
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Topic Modeling · Explainable Artificial Intelligence (XAI)

MethodsAdapter