PAC-Bayesian Learning of Aggregated Binary Activated Neural Networks   with Probabilities over Representations

Louis Fortier-Dubois; Ga\"el Letarte; Benjamin Leblanc; Fran\c{c}ois; Laviolette; Pascal Germain

arXiv:2110.15137·cs.LG·April 17, 2023

PAC-Bayesian Learning of Aggregated Binary Activated Neural Networks with Probabilities over Representations

Louis Fortier-Dubois, Ga\"el Letarte, Benjamin Leblanc, Fran\c{c}ois, Laviolette, Pascal Germain

PDF

Open Access

TL;DR

This paper introduces a PAC-Bayesian approach for aggregating binary activated neural networks with probabilistic weights, providing exact computation methods for deep but narrow networks and scalable stochastic algorithms for wider architectures.

Contribution

It develops a novel PAC-Bayesian framework for exact and scalable aggregation of binary neural networks using probabilities over representations.

Findings

01

Exact computation of network expectations is tractable for deep but narrow networks.

02

A dynamic programming approach enables efficient bound minimization.

03

A stochastic method scales to wide neural network architectures.

Abstract

Considering a probability distribution over parameters is known as an efficient strategy to learn a neural network with non-differentiable activation functions. We study the expectation of a probabilistic neural network as a predictor by itself, focusing on the aggregation of binary activated neural networks with normal distributions over real-valued weights. Our work leverages a recent analysis derived from the PAC-Bayesian framework that derives tight generalization bounds and learning procedures for the expected output value of such an aggregation, which is given by an analytical expression. While the combinatorial nature of the latter has been circumvented by approximations in previous works, we show that the exact computation remains tractable for deep but narrow neural networks, thanks to a dynamic programming approach. This leads us to a peculiar bound minimization learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and ELM · Neural Networks and Applications · Stochastic Gradient Optimization Techniques