Data Generation as Sequential Decision Making

Philip Bachman; Doina Precup

arXiv:1506.03504·cs.LG·November 4, 2015·42 cites

Data Generation as Sequential Decision Making

Philip Bachman, Doina Precup

PDF

Open Access 1 Repo

TL;DR

This paper presents a unified view of generative models as sequential decision processes, introducing neural network-based policies for data imputation via an MDP framework and guided policy search.

Contribution

It extends existing models by framing data imputation as an MDP and develops neural policies trained with guided policy search for effective data completion.

Findings

01

Models successfully perform data imputation across various datasets.

02

The approach effectively handles varying difficulty levels in imputation tasks.

03

Iterative feedback improves prediction quality.

Abstract

We connect a broad class of generative models through their shared reliance on sequential decision making. Motivated by this view, we develop extensions to an existing model, and then explore the idea further in the context of data imputation -- perhaps the simplest setting in which to investigate the relation between unconditional and conditional generative modelling. We formulate data imputation as an MDP and develop models capable of representing effective policies for it. We construct the models using neural networks and train them using a form of guided policy search. Our models generate predictions through an iterative process of feedback and refinement. We show that this approach can learn effective policies for imputation problems of varying difficulty and across multiple datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Philip-Bachman/Sequential-Generation
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Topic Modeling · Model Reduction and Neural Networks