Exploiting the Statistics of Learning and Inference

Max Welling

arXiv:1402.7025·cs.LG·March 6, 2014·5 cites

Exploiting the Statistics of Learning and Inference

Max Welling

PDF

Open Access

TL;DR

This paper introduces algorithms that leverage the statistical properties of large datasets and simulations to enhance the efficiency of learning and inference, particularly through subsampling and uncertainty testing.

Contribution

It proposes novel methods for subsampling and uncertainty testing in stochastic gradients and MCMC, addressing computational challenges in big data and simulations.

Findings

01

Improved efficiency in stochastic gradient estimation.

02

Enhanced MCMC sampling using stochastic gradients.

03

Emphasizes the importance of storing all information in likelihood-free MCMC.

Abstract

When dealing with datasets containing a billion instances or with simulations that require a supercomputer to execute, computational resources become part of the equation. We can improve the efficiency of learning and inference by exploiting their inherent statistical nature. We propose algorithms that exploit the redundancy of data relative to a model by subsampling data-cases for every update and reasoning about the uncertainty created in this process. In the context of learning we propose to test for the probability that a stochastically estimated gradient points more than 180 degrees in the wrong direction. In the context of MCMC sampling we use stochastic gradients to improve the efficiency of MCMC updates, and hypothesis tests based on adaptive mini-batches to decide whether to accept or reject a proposed parameter update. Finally, we argue that in the context of likelihood free…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Control Systems and Identification · Machine Learning and Algorithms