Parallelized Reverse Curriculum Generation

Zih-Yun Chiu; Yi-Lin Tuan; Hung-yi Lee; Li-Chen Fu

arXiv:2108.02128·cs.LG·August 5, 2021

Parallelized Reverse Curriculum Generation

Zih-Yun Chiu, Yi-Lin Tuan, Hung-yi Lee, Li-Chen Fu

PDF

Open Access

TL;DR

This paper introduces a parallelized reverse curriculum generation method for reinforcement learning that trains multiple actor-critic pairs simultaneously, improving performance and convergence in sparse reward tasks.

Contribution

It proposes a novel parallelized approach to reverse curriculum generation that enhances generalization and speeds up training in actor-critic reinforcement learning algorithms.

Findings

01

Improved performance in RL tasks with sparse rewards.

02

Faster convergence compared to traditional RCG methods.

03

Applicable to various actor-critic algorithms.

Abstract

For reinforcement learning (RL), it is challenging for an agent to master a task that requires a specific series of actions due to sparse rewards. To solve this problem, reverse curriculum generation (RCG) provides a reverse expansion approach that automatically generates a curriculum for the agent to learn. More specifically, RCG adapts the initial state distribution from the neighborhood of a goal to a distance as training proceeds. However, the initial state distribution generated for each iteration might be biased, thus making the policy overfit or slowing down the reverse expansion rate. While training RCG for actor-critic (AC) based RL algorithms, this poor generalization and slow convergence might be induced by the tight coupling between an AC pair. Therefore, we propose a parallelized approach that simultaneously trains multiple AC pairs and periodically exchanges their critics.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning