Robust model training and generalisation with Studentising flows

Simon Alexanderson; Gustav Eje Henter

arXiv:2006.06599·cs.LG·July 14, 2020·1 cites

Robust model training and generalisation with Studentising flows

Simon Alexanderson, Gustav Eje Henter

PDF

Open Access 1 Repo

TL;DR

This paper introduces Studentising flows, a robust normalising flow approach using fat-tailed Student's t-distributions to improve model robustness, generalisation, and likelihood without sacrificing consistency.

Contribution

It proposes replacing Gaussian base distributions in normalising flows with Student's t-distributions, enhancing robustness and generalisation capabilities.

Findings

01

Fat-tailed distributions improve robustness similar to gradient clipping.

02

Models with Student's t-distributions show reduced generalisation gap.

03

Experimental results confirm improved likelihood and robustness.

Abstract

Normalising flows are tractable probabilistic models that leverage the power of deep learning to describe a wide parametric family of distributions, all while remaining trainable using maximum likelihood. We discuss how these methods can be further improved based on insights from robust (in particular, resistant) statistics. Specifically, we propose to endow flow-based models with fat-tailed latent distributions such as multivariate Student's $t$ , as a simple drop-in replacement for the Gaussian distribution used by conventional normalising flows. While robustness brings many advantages, this paper explores two of them: 1) We describe how using fatter-tailed base distributions can give benefits similar to gradient clipping, but without compromising the asymptotic consistency of the method. 2) We also discuss how robust ideas lead to models with reduced generalisation gap and improved…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

simonalexanderson/StyleGestures
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning