PAC-Bayesian Learning of Aggregated Binary Activated Neural Networks with Probabilities over Representations
Louis Fortier-Dubois, Ga\"el Letarte, Benjamin Leblanc, Fran\c{c}ois, Laviolette, Pascal Germain

TL;DR
This paper introduces a PAC-Bayesian approach for aggregating binary activated neural networks with probabilistic weights, providing exact computation methods for deep but narrow networks and scalable stochastic algorithms for wider architectures.
Contribution
It develops a novel PAC-Bayesian framework for exact and scalable aggregation of binary neural networks using probabilities over representations.
Findings
Exact computation of network expectations is tractable for deep but narrow networks.
A dynamic programming approach enables efficient bound minimization.
A stochastic method scales to wide neural network architectures.
Abstract
Considering a probability distribution over parameters is known as an efficient strategy to learn a neural network with non-differentiable activation functions. We study the expectation of a probabilistic neural network as a predictor by itself, focusing on the aggregation of binary activated neural networks with normal distributions over real-valued weights. Our work leverages a recent analysis derived from the PAC-Bayesian framework that derives tight generalization bounds and learning procedures for the expected output value of such an aggregation, which is given by an analytical expression. While the combinatorial nature of the latter has been circumvented by approximations in previous works, we show that the exact computation remains tractable for deep but narrow neural networks, thanks to a dynamic programming approach. This leads us to a peculiar bound minimization learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and ELM · Neural Networks and Applications · Stochastic Gradient Optimization Techniques
