Deep learning with t-exponential Bayesian kitchen sinks
Harris Partaourides, Sotirios Chatzis

TL;DR
This paper introduces deep neural network architectures that incorporate Bayesian principles and t-exponential family distributions, enabling better uncertainty modeling and robustness to outliers, especially with small or sparse datasets.
Contribution
It proposes novel deep network layers with randomized nonlinearities combined via multiple weights, trained using t-divergence-based approximate inference for improved uncertainty handling.
Findings
Outperforms state-of-the-art methods on challenging benchmarks
Effectively models data with outliers and fat tails
Enhances uncertainty quantification in deep learning
Abstract
Bayesian learning has been recently considered as an effective means of accounting for uncertainty in trained deep network parameters. This is of crucial importance when dealing with small or sparse training datasets. On the other hand, shallow models that compute weighted sums of their inputs, after passing them through a bank of arbitrary randomized nonlinearities, have been recently shown to enjoy good test error bounds that depend on the number of nonlinearities. Inspired from these advances, in this paper we examine novel deep network architectures, where each layer comprises a bank of arbitrary nonlinearities, linearly combined using multiple alternative sets of weights. We effect model training by means of approximate inference based on a t-divergence measure; this generalizes the Kullback-Leibler divergence in the context of the t-exponential family of distributions. We adopt…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
