Automated Hybrid Reward Scheduling via Large Language Models for Robotic   Skill Learning

Changxin Huang; Junyang Liang; Yanbin Chang; Jingzhao Xu; Jianqiang Li

arXiv:2505.02483·cs.RO·May 6, 2025

Automated Hybrid Reward Scheduling via Large Language Models for Robotic Skill Learning

Changxin Huang, Junyang Liang, Yanbin Chang, Jingzhao Xu, Jianqiang Li

PDF

Open Access

TL;DR

This paper introduces an Automated Hybrid Reward Scheduling framework using Large Language Models to dynamically adjust reward component importance during robotic skill learning, improving reinforcement learning efficiency.

Contribution

It presents a novel LLM-based method for automatically scheduling reward components in RL for complex robotic tasks, enhancing learning performance.

Findings

01

Achieved an average 6.48% performance improvement across multiple tasks.

02

Demonstrated the effectiveness of LLM-guided reward scheduling in high-DOF robots.

03

Showed that dynamic reward adjustment outperforms uniform reward summation.

Abstract

Enabling a high-degree-of-freedom robot to learn specific skills is a challenging task due to the complexity of robotic dynamics. Reinforcement learning (RL) has emerged as a promising solution; however, addressing such problems requires the design of multiple reward functions to account for various constraints in robotic motion. Existing approaches typically sum all reward components indiscriminately to optimize the RL value function and policy. We argue that this uniform inclusion of all reward components in policy optimization is inefficient and limits the robot's learning performance. To address this, we propose an Automated Hybrid Reward Scheduling (AHRS) framework based on Large Language Models (LLMs). This paradigm dynamically adjusts the learning intensity of each reward component throughout the policy optimization process, enabling robots to acquire skills in a gradual and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsLib · Sparse Evolutionary Training