Time-Variant Variational Transfer for Value Functions

Giuseppe Canonaco; Andrea Soprani; Manuel Roveri; Marcello Restelli

arXiv:2005.12864·cs.LG·June 19, 2020

Time-Variant Variational Transfer for Value Functions

Giuseppe Canonaco, Andrea Soprani, Manuel Roveri, Marcello Restelli

PDF

Open Access

TL;DR

This paper introduces a variational transfer method for reinforcement learning that accounts for non-stationary, time-variant task distributions, with theoretical analysis and experimental validation across multiple environments.

Contribution

It proposes a novel transfer learning approach that leverages temporal structure in task distributions and provides finite-sample theoretical analysis.

Findings

01

The method effectively handles time-variant task distributions.

02

Theoretical comparison shows advantages over time-invariant approaches.

03

Experimental results demonstrate improved transfer performance across diverse environments.

Abstract

In most of the transfer learning approaches to reinforcement learning (RL) the distribution over the tasks is assumed to be stationary. Therefore, the target and source tasks are i.i.d. samples of the same distribution. In the context of this work, we consider the problem of transferring value functions through a variational method when the distribution that generates the tasks is time-variant, proposing a solution that leverages this temporal structure inherent in the task generating process. Furthermore, by means of a finite-sample analysis, the previously mentioned solution is theoretically compared to its time-invariant version. Finally, we will provide an experimental evaluation of the proposed technique with three distinct temporal dynamics in three different RL environments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Multi-Objective Optimization Algorithms · Evolutionary Algorithms and Applications