Knowledge Transfer in Deep Reinforcement Learning via an RL-Specific   GAN-Based Correspondence Function

Marko Ruman; Tatiana V. Guy

arXiv:2209.06604·cs.LG·November 12, 2024·1 cites

Knowledge Transfer in Deep Reinforcement Learning via an RL-Specific GAN-Based Correspondence Function

Marko Ruman, Tatiana V. Guy

PDF

Open Access 1 Repo

TL;DR

This paper presents a novel RL-specific GAN-based method for effective one-to-one knowledge transfer in deep reinforcement learning, improving generalization and reducing training time in complex tasks.

Contribution

It introduces a modified Cycle GAN with new loss components tailored for RL, enabling superior knowledge transfer compared to standard GANs.

Findings

01

Achieved 100% knowledge transfer in identical tasks.

02

Reduced training time by 30% in rotated tasks.

03

Outperformed standard GANs in transfer performance.

Abstract

Deep reinforcement learning has demonstrated superhuman performance in complex decision-making tasks, but it struggles with generalization and knowledge reuse - key aspects of true intelligence. This article introduces a novel approach that modifies Cycle Generative Adversarial Networks specifically for reinforcement learning, enabling effective one-to-one knowledge transfer between two tasks. Our method enhances the loss function with two new components: model loss, which captures dynamic relationships between source and target tasks, and Q-loss, which identifies states significantly influencing the target decision policy. Tested on the 2-D Atari game Pong, our method achieved 100% knowledge transfer in identical tasks and either 100% knowledge transfer or a 30% reduction in training time for a rotated task, depending on the network architecture. In contrast, using standard Generative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

marko-ruman/RL-Correspondence-Learner
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Reservoir Computing · Reinforcement Learning in Robotics · Neural dynamics and brain function