Learning Approximate Stochastic Transition Models

Yuhang Song; Christopher Grimm; Xianming Wang; Michael L. Littman

arXiv:1710.09718 · cs.LG · October 27, 2017 · 2 cites

TL;DR

This paper introduces a modified GAN-based approach for learning stochastic transition models in reinforcement learning, capable of generalizing to new states and capturing randomness in state transitions.

Contribution

It proposes a modification to GAN loss functions that enables effective learning of stochastic transition models, a class of problems on which currently popular GANs struggle.

Findings

01 Modified GAN loss functions improve stochastic transition modeling.

02 The approach generalizes well to unseen states.

03 It outperforms traditional GANs in capturing stochastic dynamics.

Abstract

We examine the problem of learning mappings from state to state, suitable for use in a model-based reinforcement-learning setting, that simultaneously generalize to novel states and can capture stochastic transitions. We show that currently popular generative adversarial networks struggle to learn these stochastic transition models but a modification to their loss functions results in a powerful learning algorithm for this class of problems.
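
To make the notion of a stochastic transition concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration: the toy environment, the function names, and the hand-written noise-conditioned model are hypothetical, and the paper's modified GAN loss is not reproduced. It only shows why a single point prediction cannot represent a transition with two equally likely outcomes, while a model that takes a noise input can.

```python
import random


def true_transition(state, rng):
    """Hypothetical toy environment: the state moves +1 or -1 with equal probability."""
    return state + rng.choice((-1, 1))


def mean_prediction(next_states):
    """Best single prediction under squared error: the average observed next state."""
    return sum(next_states) / len(next_states)


def stochastic_model(state, z):
    """Hand-written stand-in for a generator: noise z in [0, 1) selects the outcome."""
    return state + (1 if z < 0.5 else -1)


rng = random.Random(0)
state = 3

# Sample the environment many times from the same state.
observed = [true_transition(state, rng) for _ in range(1000)]

# A deterministic model collapses both outcomes into their average.
point_prediction = mean_prediction(observed)

# A noise-conditioned model can reproduce both outcomes.
generated = [stochastic_model(state, rng.random()) for _ in range(1000)]

print("observed next states:", sorted(set(observed)))
print("point prediction:", point_prediction)
print("generated next states:", sorted(set(generated)))
```

The point prediction lands near the current state, which the toy environment never actually produces as a next state; the noise-conditioned model emits only the two outcomes that do occur. A learned generator would have to discover that mapping from data rather than have it written by hand.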

Code & Models

Repositories

YuhangSong/SGAN (tf, official)

Taxonomy

Topics: Reinforcement Learning in Robotics · Model Reduction and Neural Networks · Neural Networks and Applications