From Bayesian Sparsity to Gated Recurrent Nets
Hao He, Bo Xin, David Wipf

TL;DR
This paper reveals that complex Bayesian algorithms for sparse estimation resemble LSTM networks, leading to a new data-driven approach that efficiently estimates solutions in challenging regimes like DOA and 3D geometry recovery.
Contribution
It demonstrates the structural parallels between multi-loop Bayesian algorithms and LSTM networks, and introduces a novel sparse estimation system leveraging training data for improved performance.
Findings
The proposed method outperforms traditional algorithms in difficult regimes.
Bayesian algorithms and LSTM structures share underlying principles.
The approach is effective in practical applications like DOA and 3D geometry recovery.
Abstract
The iterations of many first-order algorithms, when applied to minimizing common regularized regression functions, often resemble neural network layers with pre-specified weights. This observation has prompted the development of learning-based approaches that purport to replace these iterations with enhanced surrogates forged as DNN models from available training data. For example, important NP-hard sparse estimation problems have recently benefitted from this genre of upgrade, with simple feedforward or recurrent networks ousting proximal gradient-based iterations. Analogously, this paper demonstrates that more powerful Bayesian algorithms for promoting sparsity, which rely on complex multi-loop majorization-minimization techniques, mirror the structure of more sophisticated long short-term memory (LSTM) networks, or alternative gated feedback networks previously designed for sequence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBlind Source Separation Techniques · Gaussian Processes and Bayesian Inference · Domain Adaptation and Few-Shot Learning
