Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement   Learning

Hongyu Ding; Yuanze Tang; Qing Wu; Bo Wang; Chunlin Chen; Zhi Wang

arXiv:2307.08033·cs.LG·July 18, 2023

Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning

Hongyu Ding, Yuanze Tang, Qing Wu, Bo Wang, Chunlin Chen, Zhi Wang

PDF

TL;DR

This paper introduces a magnetic field-inspired reward shaping method for goal-conditioned reinforcement learning, leveraging nonlinear magnetic field properties to improve learning efficiency in complex, dynamic environments.

Contribution

The paper proposes a novel magnetic field-based reward shaping technique that captures complex environmental information, enhancing sample efficiency in goal-conditioned RL tasks with dynamic targets and obstacles.

Findings

01

Outperforms existing reward shaping methods in simulated tasks.

02

Improves sample efficiency in real-world robotic manipulation.

03

Effectively handles dynamic environments with changing targets and obstacles.

Abstract

Goal-conditioned reinforcement learning (RL) is an interesting extension of the traditional RL framework, where the dynamic environment and reward sparsity can cause conventional learning algorithms to fail. Reward shaping is a practical approach to improving sample efficiency by embedding human domain knowledge into the learning process. Existing reward shaping methods for goal-conditioned RL are typically built on distance metrics with a linear and isotropic distribution, which may fail to provide sufficient information about the ever-changing environment with high complexity. This paper proposes a novel magnetic field-based reward shaping (MFRS) method for goal-conditioned RL tasks with dynamic target and obstacles. Inspired by the physical properties of magnets, we consider the target and obstacles as permanent magnets and establish the reward function according to the intensity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Methodsfail