Loss convergence in a causal Bayesian neural network of retail firm   performance

F. Trevor Rogers

arXiv:2008.13038·stat.ML·September 1, 2020

Loss convergence in a causal Bayesian neural network of retail firm performance

F. Trevor Rogers

PDF

Open Access 1 Repo

TL;DR

This paper investigates the convergence behavior of causal Bayesian neural networks in retail firm performance modeling, demonstrating that removing weak nodes and using specific inference methods can enhance neural network convergence.

Contribution

It introduces a causal Bayesian neural network approach for retail performance analysis and explores how different inference techniques affect convergence, extending prior SEM results.

Findings

01

Removing weak SEM nodes improves convergence.

02

Flipout layers enhance neural network stability.

03

Vadam optimizer results are inconclusive.

Abstract

We extend the empirical results from the structural equation model (SEM) published in the paper Assortment Planning for Retail Buying, Retail Store Operations, and Firm Performance [1] by implementing the directed acyclic graph as a causal Bayesian neural network. Neural network convergence is shown to improve with the removal of the node with the weakest SEM path when variational inference is provided by perturbing weights with Flipout layers, while results from perturbing weights at the output with the Vadam optimizer are inconclusive.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tr7200/CBNN_SEM_loss_convergence
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Consumer Market Behavior and Pricing · Reinforcement Learning in Robotics