From Bayesian Sparsity to Gated Recurrent Nets

Hao He; Bo Xin; David Wipf

arXiv:1706.02815·cs.LG·August 3, 2017·22 cites

From Bayesian Sparsity to Gated Recurrent Nets

Hao He, Bo Xin, David Wipf

PDF

Open Access

TL;DR

This paper reveals that complex Bayesian algorithms for sparse estimation resemble LSTM networks, leading to a new data-driven approach that efficiently estimates solutions in challenging regimes like DOA and 3D geometry recovery.

Contribution

It demonstrates the structural parallels between multi-loop Bayesian algorithms and LSTM networks, and introduces a novel sparse estimation system leveraging training data for improved performance.

Findings

01

The proposed method outperforms traditional algorithms in difficult regimes.

02

Bayesian algorithms and LSTM structures share underlying principles.

03

The approach is effective in practical applications like DOA and 3D geometry recovery.

Abstract

The iterations of many first-order algorithms, when applied to minimizing common regularized regression functions, often resemble neural network layers with pre-specified weights. This observation has prompted the development of learning-based approaches that purport to replace these iterations with enhanced surrogates forged as DNN models from available training data. For example, important NP-hard sparse estimation problems have recently benefitted from this genre of upgrade, with simple feedforward or recurrent networks ousting proximal gradient-based iterations. Analogously, this paper demonstrates that more powerful Bayesian algorithms for promoting sparsity, which rely on complex multi-loop majorization-minimization techniques, mirror the structure of more sophisticated long short-term memory (LSTM) networks, or alternative gated feedback networks previously designed for sequence…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBlind Source Separation Techniques · Gaussian Processes and Bayesian Inference · Domain Adaptation and Few-Shot Learning