Plug and Play, Model-Based Reinforcement Learning

Majid Abdolshah; Hung Le; Thommen Karimpanal George; Sunil Gupta,; Santu Rana; Svetha Venkatesh

arXiv:2108.08960·cs.LG·August 23, 2021

Plug and Play, Model-Based Reinforcement Learning

Majid Abdolshah, Hung Le, Thommen Karimpanal George, Sunil Gupta,, Santu Rana, Svetha Venkatesh

PDF

Open Access

TL;DR

This paper introduces Plug and Play Markov Decision Processes, an object-based reinforcement learning framework that enables zero-shot object integration and efficient transfer learning for complex scenes.

Contribution

It presents a novel object-based representation allowing zero-shot object integration and efficient reward transfer in reinforcement learning environments.

Findings

01

Achieves sample-efficiency in various setups.

02

Enables zero-shot integration of new objects.

03

Handles addition/removal of objects efficiently.

Abstract

Sample-efficient generalisation of reinforcement learning approaches have always been a challenge, especially, for complex scenes with many components. In this work, we introduce Plug and Play Markov Decision Processes, an object-based representation that allows zero-shot integration of new objects from known object classes. This is achieved by representing the global transition dynamics as a union of local transition functions, each with respect to one active object in the scene. Transition dynamics from an object class can be pre-learnt and thus would be ready to use in a new environment. Each active object is also endowed with its reward function. Since there is no central reward function, addition or removal of objects can be handled efficiently by only updating the reward functions of objects involved. A new transfer learning mechanism is also proposed to adapt reward function in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics