Wield: Systematic Reinforcement Learning With Progressive Randomization

Michael Schaarschmidt; Kai Fricke; Eiko Yoneki

arXiv:1909.06844·cs.LG·September 17, 2019

TL;DR

Wield is a novel system that streamlines task design in reinforcement learning by decoupling interfaces from task representations and introducing a staged randomization protocol for systematic evaluation.

Contribution

The paper introduces Wield, a system that facilitates task design and evaluation in reinforcement learning through modular software primitives and a new staged randomization protocol.

Findings

01

Enables decoupling of system interfaces from task representations.

02

Provides a structured protocol for incremental model evaluation.

03

Supports practical reinforcement learning with flexible task design.

Abstract

Reinforcement learning frameworks have introduced abstractions to implement and execute algorithms at scale. They assume standardized simulator interfaces but are not concerned with identifying suitable task representations. We present Wield, a first-of-its-kind system to facilitate task design for practical reinforcement learning. Through software primitives, Wield enables practitioners to decouple system-interface and deployment-specific configuration from state and action design. To guide experimentation, Wield further introduces a novel task design protocol and classification scheme centred around staged randomization to incrementally evaluate model capabilities.
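
The abstract does not spell out the protocol, so the following is only a minimal sketch of what evaluating a model under staged randomization could look like. It assumes that each stage randomizes more of the task than the previous one and that a model advances only after clearing a score threshold. Every name here (Stage, STAGES, evaluate, run_protocol, the task fields "size" and "load", and the toy policy) is hypothetical and is not taken from Wield's actual API.

```python
import random
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Task = Dict[str, int]
Policy = Callable[[Task], float]  # toy stand-in: maps a task to an episode score


@dataclass
class Stage:
    """One randomization stage (hypothetical structure, not Wield's API)."""
    name: str
    sample_task: Callable[[random.Random], Task]


def fixed_task(rng: random.Random) -> Task:
    # Stage 1: a single fixed task instance, nothing is randomized.
    return {"size": 4, "load": 10}


def randomized_load(rng: random.Random) -> Task:
    # Stage 2: one task parameter is randomized, the other stays fixed.
    return {"size": 4, "load": rng.randint(1, 20)}


def fully_randomized(rng: random.Random) -> Task:
    # Stage 3: all task parameters are randomized.
    return {"size": rng.randint(2, 8), "load": rng.randint(1, 20)}


STAGES: List[Stage] = [
    Stage("fixed", fixed_task),
    Stage("partially randomized", randomized_load),
    Stage("fully randomized", fully_randomized),
]


def evaluate(policy: Policy, stage: Stage, episodes: int, seed: int) -> float:
    """Average the policy's score over tasks sampled from one stage."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(episodes):
        total += policy(stage.sample_task(rng))
    return total / episodes


def run_protocol(policy: Policy, stages: List[Stage], threshold: float,
                 episodes: int = 20, seed: int = 0) -> List[Tuple[str, float]]:
    """Evaluate stage by stage; stop at the first stage scored below threshold."""
    results = []
    for stage in stages:
        score = evaluate(policy, stage, episodes, seed)
        results.append((stage.name, score))
        if score < threshold:
            break  # the model has not yet mastered this level of randomization
    return results


def load_sensitive_policy(task: Task) -> float:
    # Toy policy that only succeeds on low-load tasks (assumption for the demo).
    return 1.0 if task["load"] <= 10 else 0.0


if __name__ == "__main__":
    for name, score in run_protocol(load_sensitive_policy, STAGES, threshold=0.9):
        print(f"{name}: {score:.2f}")
```

In this sketch the task samplers are the only part that knows about task parameters, while run_protocol depends only on the Stage interface. That loosely mirrors the separation of task design from system and deployment configuration that the abstract describes, although the real system may draw the boundary differently.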

Taxonomy

Topics: Reinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Simulation Techniques and Applications