Optimizing Variational Representations of Divergences and Accelerating their Statistical Estimation

Jeremiah Birrell; Markos A. Katsoulakis; Yannis Pantazis

arXiv:2006.08781·cs.LG·March 25, 2022

Open Access

TL;DR

This paper introduces a systematic method to create tighter variational representations of divergences, improving the efficiency and accuracy of statistical estimation in high-dimensional data using neural networks.

Contribution

It develops a new approach for constructing tighter variational divergence representations using auxiliary optimization and curvature analysis, enhancing learning speed and estimation accuracy.

Findings

01. Tighter variational representations lead to significantly faster divergence estimation.

02. The methodology improves estimation accuracy on high-dimensional datasets.

03. Neural-network-based estimation is accelerated by nearly an order of magnitude.

Abstract

Variational representations of divergences and distances between high-dimensional probability distributions offer significant theoretical insights and practical advantages in numerous research areas. Recently, they have gained popularity in machine learning as a tractable and scalable approach for training probabilistic models and for statistically differentiating between data distributions. Their advantages include: 1) they can be estimated from data as statistical averages; 2) such representations can leverage the ability of neural networks to efficiently approximate optimal solutions in function spaces. However, a systematic and practical approach to improving the tightness of such variational formulas, and accordingly accelerating statistical learning and estimation from data, is currently lacking. Here we develop such a methodology for building new, tighter variational…
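To make "tighter variational representation" concrete, here is a minimal sketch of one well-known instance; it is an illustration under stated assumptions and is not taken from this page or claimed to be the authors' construction. For the KL divergence, the Legendre-transform bound E_Q[g] - E_P[exp(g - 1)] can be tightened by optimizing over a constant shift g + c, which yields the Donsker-Varadhan bound E_Q[g] - log E_P[exp(g)]. Both are sample averages, so they can be estimated from data. The Gaussian pair, the sample size, and the linear test function `g` below are assumptions chosen so that the true value, KL(Q||P) = 0.5, is known.

```python
import numpy as np

def legendre_bound(g_q, g_p):
    """Legendre-transform lower bound on KL(Q||P): E_Q[g] - E_P[exp(g - 1)]."""
    return np.mean(g_q) - np.mean(np.exp(g_p - 1.0))

def dv_bound(g_q, g_p):
    """Donsker-Varadhan lower bound on KL(Q||P): E_Q[g] - log E_P[exp(g)]."""
    return np.mean(g_q) - np.log(np.mean(np.exp(g_p)))

def legendre_bound_shift_optimized(g_q, g_p):
    """Legendre bound after maximizing over a constant shift c added to g.

    Setting the derivative in c to zero gives c* = 1 - log E_P[exp(g)].
    """
    c_star = 1.0 - np.log(np.mean(np.exp(g_p)))
    return legendre_bound(g_q + c_star, g_p + c_star)

# Assumed toy setup: Q = N(1, 1), P = N(0, 1), so KL(Q||P) = 0.5.
rng = np.random.default_rng(0)
mu = 1.0
x_q = rng.normal(mu, 1.0, size=200_000)
x_p = rng.normal(0.0, 1.0, size=200_000)

# Hypothetical test function g(x) = mu * x; in practice g would be a neural network.
g_q, g_p = mu * x_q, mu * x_p

plain = legendre_bound(g_q, g_p)
shifted = legendre_bound_shift_optimized(g_q, g_p)
dv = dv_bound(g_q, g_p)
print(f"Legendre bound:           {plain:.4f}")
print(f"Shift-optimized Legendre: {shifted:.4f}")
print(f"Donsker-Varadhan bound:   {dv:.4f}")
```

For the same test function, the Donsker-Varadhan value is never below the plain Legendre value, because log z <= z / e for every z > 0. In this toy setup the chosen `g` is optimal for Donsker-Varadhan up to a constant, so that bound lands near 0.5 while the unshifted Legendre bound stays noticeably lower.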

Taxonomy

Topics: Model Reduction and Neural Networks · Gaussian Processes and Bayesian Inference · Machine Learning and Data Classification