Deep learning with t-exponential Bayesian kitchen sinks

Harris Partaourides; Sotirios Chatzis

arXiv:1802.03651·cs.LG·February 13, 2018

Deep learning with t-exponential Bayesian kitchen sinks

Harris Partaourides, Sotirios Chatzis

PDF

TL;DR

This paper introduces deep neural network architectures that incorporate Bayesian principles and t-exponential family distributions, enabling better uncertainty modeling and robustness to outliers, especially with small or sparse datasets.

Contribution

It proposes novel deep network layers with randomized nonlinearities combined via multiple weights, trained using t-divergence-based approximate inference for improved uncertainty handling.

Findings

01

Outperforms state-of-the-art methods on challenging benchmarks

02

Effectively models data with outliers and fat tails

03

Enhances uncertainty quantification in deep learning

Abstract

Bayesian learning has been recently considered as an effective means of accounting for uncertainty in trained deep network parameters. This is of crucial importance when dealing with small or sparse training datasets. On the other hand, shallow models that compute weighted sums of their inputs, after passing them through a bank of arbitrary randomized nonlinearities, have been recently shown to enjoy good test error bounds that depend on the number of nonlinearities. Inspired from these advances, in this paper we examine novel deep network architectures, where each layer comprises a bank of arbitrary nonlinearities, linearly combined using multiple alternative sets of weights. We effect model training by means of approximate inference based on a t-divergence measure; this generalizes the Kullback-Leibler divergence in the context of the t-exponential family of distributions. We adopt…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.