AIXIjs: A Software Demo for General Reinforcement Learning
John Aslanides

TL;DR
AIXIjs is a JavaScript-based platform that demonstrates various general reinforcement learning agents, illustrating their properties and differences through experiments, and providing a framework similar to OpenAI Gym for testing in diverse environments.
Contribution
The paper introduces AIXIjs, a versatile software demo for general reinforcement learning agents, with experimental tools and interactive demos to explore their behaviors.
Findings
Different agents exhibit distinct exploration strategies.
Agents vary in asymptotic optimality and robustness.
AIXIjs facilitates understanding of complex RL agent properties.
Abstract
Reinforcement learning is a general and powerful framework with which to study and implement artificial intelligence. Recent advances in deep learning have enabled RL algorithms to achieve impressive performance in restricted domains such as playing Atari video games (Mnih et al., 2015) and, recently, the board game Go (Silver et al., 2016). However, we are still far from constructing a generally intelligent agent. Many of the obstacles and open questions are conceptual: What does it mean to be intelligent? How does one explore and learn optimally in general, unknown environments? What, in fact, does it mean to be optimal in the general sense? The universal Bayesian agent AIXI (Hutter, 2005) is a model of a maximally intelligent agent, and plays a central role in the sub-field of general reinforcement learning (GRL). Recently, AIXI has been shown to be flawed in important ways; it…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · Reinforcement Learning in Robotics
MethodsMinimum Description Length
