AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback
Wanpeng Zhang, Zongqing Lu

TL;DR
AdaRefiner is a framework that improves large language models' decision-making by automatically refining their understanding through reinforcement learning feedback, reducing the need for manual prompt engineering and fine-tuning.
Contribution
It introduces a lightweight adapter that enables automatic self-refinement of LLMs with RL feedback, enhancing decision-making in complex tasks without extensive prompt engineering.
Findings
Outperforms baselines on 22 open-world game tasks
Enhances high-level and common-sense skills in agents
Reduces need for prompt engineering and fine-tuning
Abstract
Large Language Models (LLMs) have demonstrated significant success across various domains. However, their application in complex decision-making tasks frequently necessitates intricate prompt engineering or fine-tuning, leading to challenges in unseen downstream tasks and heavy demands on computational resources. Meanwhile, Reinforcement Learning (RL) has been recognized as effective in decision-making problems but struggles in environments with sparse rewards, such as open-world games. To overcome these challenges, we introduce AdaRefiner, a novel framework designed to enhance the synergy between LLMs and RL feedback. The key component of AdaRefiner is a lightweight Adapter Language Model (LM), which automatically refines task comprehension based on feedback from RL agents. This method mitigates the need for intricate prompt engineering and intensive LLM fine-tuning while maintaining…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Topic Modeling · Explainable Artificial Intelligence (XAI)
MethodsAdapter
