Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft
Hao Li, Xue Yang, Zhaokai Wang, Xizhou Zhu, Jie Zhou, Yu Qiao,, Xiaogang Wang, Hongsheng Li, Lewei Lu, Jifeng Dai

TL;DR
This paper presents Auto MC-Reward, a system that uses Large Language Models to automatically generate dense reward functions, significantly improving reinforcement learning efficiency in complex Minecraft tasks.
Contribution
It introduces a novel LLM-based framework for automatic dense reward design, including components for coding, verification, and refinement, enhancing RL in sparse-reward environments.
Findings
Improved success rates in complex Minecraft tasks
Enhanced learning efficiency of RL agents
Effective automatic reward function refinement
Abstract
Many reinforcement learning environments (e.g., Minecraft) provide only sparse rewards that indicate task completion or failure with binary values. The challenge in exploration efficiency in such environments makes it difficult for reinforcement-learning-based agents to learn complex tasks. To address this, this paper introduces an advanced learning system, named Auto MC-Reward, that leverages Large Language Models (LLMs) to automatically design dense reward functions, thereby enhancing the learning efficiency. Auto MC-Reward consists of three important components: Reward Designer, Reward Critic, and Trajectory Analyzer. Given the environment information and task descriptions, the Reward Designer first design the reward function by coding an executable Python function with predefined observation inputs. Then, our Reward Critic will be responsible for verifying the code, checking whether…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Anomaly Detection Techniques and Applications · Software Testing and Debugging Techniques
