Enhancing data efficiency in reinforcement learning: a novel imagination   mechanism based on mesh information propagation

Zihang Wang; Maowei Jiang

arXiv:2309.14243·cs.LG·September 28, 2023·1 cites

Enhancing data efficiency in reinforcement learning: a novel imagination mechanism based on mesh information propagation

Zihang Wang, Maowei Jiang

PDF

Open Access 2 Repos

TL;DR

This paper introduces a novel imagination mechanism inspired by human reasoning that propagates information across episodes in reinforcement learning, significantly improving data efficiency and performance of existing algorithms.

Contribution

The paper proposes a plug-and-play imagination mechanism that enhances data efficiency in RL by propagating information across episodes, outperforming state-of-the-art algorithms.

Findings

01

IM boosts performance of SAC, PPO, DDPG, DQN

02

Significant improvement in data efficiency across tasks

03

Enhanced understanding of state interdependencies

Abstract

Reinforcement learning(RL) algorithms face the challenge of limited data efficiency, particularly when dealing with high-dimensional state spaces and large-scale problems. Most of RL methods often rely solely on state transition information within the same episode when updating the agent's Critic, which can lead to low data efficiency and sub-optimal training time consumption. Inspired by human-like analogical reasoning abilities, we introduce a novel mesh information propagation mechanism, termed the 'Imagination Mechanism (IM)', designed to significantly enhance the data efficiency of RL algorithms. Specifically, IM enables information generated by a single sample to be effectively broadcasted to different states across episodes, instead of simply transmitting in the same episode. This capability enhances the model's comprehension of state interdependencies and facilitates more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Dilated Convolution · Dense Connections · Experience Replay · Weight Decay · Q-Learning · Adam · Global Average Pooling · Average Pooling · 1x1 Convolution