Recurrent Sum-Product-Max Networks for Decision Making in   Perfectly-Observed Environments

Hari Teja Tatavarti; Prashant Doshi; Layton Hayes

arXiv:2006.07300·cs.AI·June 15, 2020·1 cites

Recurrent Sum-Product-Max Networks for Decision Making in Perfectly-Observed Environments

Hari Teja Tatavarti, Prashant Doshi, Layton Hayes

PDF

Open Access 1 Repo

TL;DR

This paper introduces recurrent sum-product-max networks (RSPMNs), a novel model that extends SPMNs for sequential decision-making, enabling data-driven, scalable, and effective policy learning in perfectly-observed environments.

Contribution

The paper proposes RSPMNs, a new recurrent architecture for SPMNs, including a structure learning algorithm and conditions for validity, tailored for sequential decision-making tasks.

Findings

01

RSPMNs generate near-optimal MEUs and policies in test domains.

02

They outperform recent batch-constrained reinforcement learning methods.

03

RSPMNs are scalable and suitable for offline decision-making in perfectly-observed environments.

Abstract

Recent investigations into sum-product-max networks (SPMN) that generalize sum-product networks (SPN) offer a data-driven alternative for decision making, which has predominantly relied on handcrafted models. SPMNs computationally represent a probabilistic decision-making problem whose solution scales linearly in the size of the network. However, SPMNs are not well suited for sequential decision making over multiple time steps. In this paper, we present recurrent SPMNs (RSPMN) that learn from and model decision-making data over time. RSPMNs utilize a template network that is unfolded as needed depending on the length of the data sequence. This is significant as RSPMNs not only inherit the benefits of SPMNs in being data driven and mostly tractable, they are also well suited for sequential problems. We establish conditions on the template network, which guarantee that the resulting SPMN…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

c0derzer0/RSPMN
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Advanced Software Engineering Methodologies · Reinforcement Learning in Robotics