Neural Distillation as a State Representation Bottleneck in   Reinforcement Learning

Valentin Guillet; Dennis G. Wilson; Carlos Aguilar-Melchor; Emmanuel; Rachelson

arXiv:2210.02224·cs.LG·October 6, 2022

Neural Distillation as a State Representation Bottleneck in Reinforcement Learning

Valentin Guillet, Dennis G. Wilson, Carlos Aguilar-Melchor, Emmanuel, Rachelson

PDF

Open Access

TL;DR

This paper explores using neural distillation to learn effective state representations in reinforcement learning, demonstrating its benefits across simple and complex environments for improved transfer and generalization.

Contribution

It introduces a novel perspective of applying distillation as a state representation bottleneck and defines criteria to evaluate its effectiveness in RL tasks.

Findings

01

Distillation improves state variable selection.

02

Enhanced separation of states by optimal actions.

03

Robustness of representations on new tasks demonstrated.

Abstract

Learning a good state representation is a critical skill when dealing with multiple tasks in Reinforcement Learning as it allows for transfer and better generalization between tasks. However, defining what constitute a useful representation is far from simple and there is so far no standard method to find such an encoding. In this paper, we argue that distillation -- a process that aims at imitating a set of given policies with a single neural network -- can be used to learn a state representation displaying favorable characteristics. In this regard, we define three criteria that measure desirable features of a state encoding: the ability to select important variables in the input space, the ability to efficiently separate states according to their corresponding optimal action, and the robustness of the state encoding on new tasks. We first evaluate these criteria and verify the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Neural dynamics and brain function · Neural Networks and Reservoir Computing