Constraints as Rewards: Reinforcement Learning for Robots without Reward   Functions

Yu Ishihara; Noriaki Takasugi; Kotaro Kawakami; Masaya Kinoshita,; Kazumi Aoyama

arXiv:2501.04228·cs.RO·January 10, 2025

Constraints as Rewards: Reinforcement Learning for Robots without Reward Functions

Yu Ishihara, Noriaki Takasugi, Kotaro Kawakami, Masaya Kinoshita,, Kazumi Aoyama

PDF

Open Access

TL;DR

This paper introduces Constraints as Rewards (CaR), a novel reinforcement learning approach that uses constraints instead of reward functions to automatically balance multiple objectives in robotic tasks, reducing the need for reward engineering.

Contribution

The paper proposes CaR, a method that formulates task objectives as constraints and employs Lagrangian optimization, enabling automatic balancing of objectives without reward tuning.

Findings

01

Successfully applied to robot standing-up motion generation

02

Achieved target behaviors without manual reward design

03

Demonstrated effectiveness on complex robotic tasks

Abstract

Reinforcement learning has become an essential algorithm for generating complex robotic behaviors. However, to learn such behaviors, it is necessary to design a reward function that describes the task, which often consists of multiple objectives that needs to be balanced. This tuning process is known as reward engineering and typically involves extensive trial-and-error. In this paper, to avoid this trial-and-error process, we propose the concept of Constraints as Rewards (CaR). CaR formulates the task objective using multiple constraint functions instead of a reward function and solves a reinforcement learning problem with constraints using the Lagrangian-method. By adopting this approach, different objectives are automatically balanced, because Lagrange multipliers serves as the weights among the objectives. In addition, we will demonstrate that constraints, expressed as inequalities,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics