Faster Convergence in Deep-Predictive-Coding Networks to Learn Deeper   Representations

Isaac J. Sledge; Jose C. Principe

arXiv:2101.06848·cs.AI·September 27, 2021

Faster Convergence in Deep-Predictive-Coding Networks to Learn Deeper Representations

Isaac J. Sledge, Jose C. Principe

PDF

TL;DR

This paper introduces an accelerated inference method for deep-predictive-coding networks, enabling deeper architectures with improved feature representations and unsupervised object recognition performance.

Contribution

It provides a theoretical analysis of the inference bottleneck in DPCNs and proposes an accelerated inference strategy that enhances convergence and allows deeper, more effective networks.

Findings

01

Faster convergence guarantees for the new inference method.

02

Deep convolutional DPCNs outperform previous models in unsupervised object recognition.

03

Enhanced feature representations comparable to supervised convolutional networks.

Abstract

Deep-predictive-coding networks (DPCNs) are hierarchical, generative models. They rely on feed-forward and feed-back connections to modulate latent feature representations of stimuli in a dynamic and context-sensitive manner. A crucial element of DPCNs is a forward-backward inference procedure to uncover sparse, invariant features. However, this inference is a major computational bottleneck. It severely limits the network depth due to learning stagnation. Here, we prove why this bottleneck occurs. We then propose a new forward-inference strategy based on accelerated proximal gradients. This strategy has faster theoretical convergence guarantees than the one used for DPCNs. It overcomes learning stagnation. We also demonstrate that it permits constructing deep and wide predictive-coding networks. Such convolutional networks implement receptive fields that capture well the entire classes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.