A Research Agenda for Usability and Generalisation in Reinforcement Learning
Dennis J.N.J. Soemers, Spyridon Samothrakis, Kurt Driessens, Mark H.M. Winands

TL;DR
This paper advocates for developing user-friendly problem description languages in reinforcement learning to improve usability for non-experts and enhance the generalisation capabilities of RL algorithms across diverse problems.
Contribution
It proposes a research agenda focused on creating formal, accessible problem description languages to make RL more usable and promote better generalisation.
Findings
Highlights the usability challenges in current RL practices.
Identifies the lack of standard problem representations as a barrier to generalisation.
Suggests that formal description languages can bridge usability and generalisation issues.
Abstract
It is common practice in reinforcement learning (RL) research to train and deploy agents in bespoke simulators, typically implemented by engineers directly in general-purpose programming languages or hardware acceleration frameworks such as CUDA or JAX. This means that programming and engineering expertise is not only required to develop RL algorithms, but is also required to use already developed algorithms for novel problems. The latter poses a problem in terms of the usability of RL, in particular for private individuals and small organisations without substantial engineering expertise. We also perceive this as a challenge for effective generalisation in RL, in the sense that is no standard, shared formalism in which different problems are represented. As we typically have no consistent representation through which to provide information about any novel problem to an agent, our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDesign Education and Practice
MethodsSparse Evolutionary Training · Focus
