# Deep Residual Autoencoders for Expectation Maximization-inspired   Dictionary Learning

**Authors:** Bahareh Tolooshams, Sourav Dey, Demba Ba

arXiv: 1904.08827 · 2020-10-20

## TL;DR

This paper presents CRsAE, a neural network architecture inspired by Expectation-Maximization for convolutional dictionary learning, demonstrating superior performance in image denoising and neural spike detection tasks.

## Contribution

The introduction of CRsAE, a novel autoencoder architecture that unifies dictionary learning with neural networks and incorporates EM-inspired training for improved results.

## Key findings

- Learns Gabor-like filters in image denoising.
- Speeds up spike detection by 900x in neural data.
- Outperforms conventional methods in accuracy and efficiency.

## Abstract

We introduce a neural-network architecture, termed the constrained recurrent sparse autoencoder (CRsAE), that solves convolutional dictionary learning problems, thus establishing a link between dictionary learning and neural networks. Specifically, we leverage the interpretation of the alternating-minimization algorithm for dictionary learning as an approximate Expectation-Maximization algorithm to develop autoencoders that enable the simultaneous training of the dictionary and regularization parameter (ReLU bias). The forward pass of the encoder approximates the sufficient statistics of the E-step as the solution to a sparse coding problem, using an iterative proximal gradient algorithm called FISTA. The encoder can be interpreted either as a recurrent neural network or as a deep residual network, with two-sided ReLU non-linearities in both cases. The M-step is implemented via a two-stage back-propagation. The first stage relies on a linear decoder applied to the encoder and a norm-squared loss. It parallels the dictionary update step in dictionary learning. The second stage updates the regularization parameter by applying a loss function to the encoder that includes a prior on the parameter motivated by Bayesian statistics. We demonstrate in an image-denoising task that CRsAE learns Gabor-like filters, and that the EM-inspired approach for learning biases is superior to the conventional approach. In an application to recordings of electrical activity from the brain, we demonstrate that CRsAE learns realistic spike templates and speeds up the process of identifying spike times by 900x compared to algorithms based on convex optimization.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.08827/full.md

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/1904.08827/full.md

## References

44 references — full list in the complete paper: https://tomesphere.com/paper/1904.08827/full.md

---
Source: https://tomesphere.com/paper/1904.08827