Estimating Disentangled Belief about Hidden State and Hidden Task for   Meta-RL

Kei Akuzawa; Yusuke Iwasawa; Yutaka Matsuo

arXiv:2105.06660·cs.AI·May 17, 2021

Estimating Disentangled Belief about Hidden State and Hidden Task for Meta-RL

Kei Akuzawa, Yusuke Iwasawa, Yutaka Matsuo

PDF

Open Access

TL;DR

This paper introduces a hierarchical state-space model for meta-reinforcement learning that disentangles hidden task and state information, improving data efficiency and performance in complex environments.

Contribution

The paper proposes a novel hierarchical model with disentangled latent variables for task and state estimation in meta-RL, enabling better interpretability and efficiency.

Findings

01

HSSM effectively separates task and state information in experiments.

02

Meta-RL with HSSM requires less training data.

03

Achieves higher final performance in MuJoCo environments.

Abstract

There is considerable interest in designing meta-reinforcement learning (meta-RL) algorithms, which enable autonomous agents to adapt new tasks from small amount of experience. In meta-RL, the specification (such as reward function) of current task is hidden from the agent. In addition, states are hidden within each task owing to sensor noise or limitations in realistic environments. Therefore, the meta-RL agent faces the challenge of specifying both the hidden task and states based on small amount of experience. To address this, we propose estimating disentangled belief about task and states, leveraging an inductive bias that the task and states can be regarded as global and local features of each task. Specifically, we train a hierarchical state-space model (HSSM) parameterized by deep neural networks as an environment model, whose global and local latent variables correspond to task…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Data Stream Mining Techniques