Online Transfer Learning in Reinforcement Learning Domains

Yusen Zhan; Matthew E. Taylor

arXiv:1507.00436 · cs.AI · July 16, 2015 · 25 cites

TL;DR

This paper introduces an online transfer learning framework for reinforcement learning, unifying existing methods and providing theoretical convergence guarantees, with empirical validation of the proposed approach.

Contribution

It presents an online transfer framework that generalizes existing transfer methods in reinforcement learning and proves convergence of Q-learning and Sarsa under an agents-teaching-agents method.

Findings

01

Convergence of Q-learning and Sarsa with tabular representation is proven under a finite budget.

02

Convergence of Q-learning and Sarsa with linear function approximation established.

03

Teaching does not harm asymptotic performance.

Abstract

This paper proposes an online transfer framework to capture the interaction among agents and shows that current transfer learning in reinforcement learning is a special case of online transfer. Furthermore, this paper re-characterizes existing agents-teaching-agents methods as online transfer and analyzes one such teaching method in three ways. First, the convergence of Q-learning and Sarsa with tabular representation with a finite budget is proven. Second, the convergence of Q-learning and Sarsa with linear function approximation is established. Third, we show that asymptotic performance cannot be hurt through teaching. Additionally, all theoretical results are empirically validated.
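
To make the teaching setting concrete, the following is a minimal sketch of tabular Q-learning in which a teacher overrides the student's action until a finite budget runs out. It is not the paper's algorithm or experimental setup: the five-state chain environment, the always-move-right teacher, the rule of spending advice on every early step, and all names and hyperparameters (run_q_learning_with_advice, budget, alpha, gamma, epsilon) are assumptions chosen only for illustration.

```python
import random


def run_q_learning_with_advice(n_states=5, episodes=200, budget=20,
                               alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning on a toy chain with a budget-limited teacher.

    Hypothetical setup: states 0..n_states-1, the last state is the goal,
    and reaching it yields reward 1. Nothing here comes from the paper.
    """
    rng = random.Random(seed)
    actions = (0, 1)  # 0 = move left, 1 = move right
    q = [[0.0, 0.0] for _ in range(n_states)]
    advice_used = 0

    def teacher_policy(state):
        # Assumed fixed teacher: always advise moving right.
        return 1

    def step(state, action):
        # Deterministic chain dynamics, clipped at both ends.
        if action == 1:
            nxt = min(state + 1, n_states - 1)
        else:
            nxt = max(state - 1, 0)
        done = nxt == n_states - 1
        return nxt, (1.0 if done else 0.0), done

    for _ in range(episodes):
        state = 0
        for _ in range(100):  # step cap per episode
            if advice_used < budget:
                # The teacher picks the action while budget remains.
                action = teacher_policy(state)
                advice_used += 1
            elif rng.random() < epsilon:
                action = rng.choice(actions)
            else:
                # Greedy action with random tie-breaking.
                best = max(q[state])
                action = rng.choice(
                    [a for a in actions if q[state][a] == best])
            nxt, reward, done = step(state, action)
            # Standard Q-learning update; advice only changes which
            # action is taken, not the update rule itself.
            target = reward if done else reward + gamma * max(q[nxt])
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
            if done:
                break
    return q, advice_used


if __name__ == "__main__":
    q_table, used = run_q_learning_with_advice()
    print("advice used:", used)
    for s, row in enumerate(q_table):
        print(s, [round(v, 3) for v in row])
```

Because advice only replaces action selection and the budget is finite, the student falls back to ordinary epsilon-greedy Q-learning after the budget is spent, which is the intuition behind the claim that teaching leaves asymptotic behavior intact. A Sarsa variant would differ only in the update target, using the value of the action actually taken in the next state instead of the maximum.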

Taxonomy

Topics: Reinforcement Learning in Robotics · Robot Manipulation and Learning · Machine Learning and Algorithms

Methods: Sarsa · Q-Learning