Provable benefits of annealing for estimating normalizing constants: Importance Sampling, Noise-Contrastive Estimation, and beyond
Omar Chehab, Aapo Hyvarinen, Andrej Risteski

TL;DR
This paper analyzes annealing-based Monte Carlo methods for estimating normalization constants, showing how different choices in estimator type and path of distributions affect efficiency, and proposing a new two-step estimator for optimal path approximation.
Contribution
It provides a theoretical comparison of importance sampling and noise-contrastive estimation, and introduces a two-step estimator for better path selection.
Findings
NCE is more efficient than importance sampling, but the difference diminishes with infinitesimal steps.
Geometric paths reduce estimation error from exponential to polynomial.
Arithmetic paths can be optimal, especially in certain limits.
Abstract
Recent research has developed several Monte Carlo methods for estimating the normalization constant (partition function) based on the idea of annealing. This means sampling successively from a path of distributions that interpolate between a tractable "proposal" distribution and the unnormalized "target" distribution. Prominent estimators in this family include annealed importance sampling and annealed noise-contrastive estimation (NCE). Such methods hinge on a number of design choices: which estimator to use, which path of distributions to use and whether to use a path at all; so far, there is no definitive theory on which choices are efficient. Here, we evaluate each design choice by the asymptotic estimation error it produces. First, we show that using NCE is more efficient than the importance sampling estimator, but in the limit of infinitesimal path steps, the difference vanishes.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStatistical Methods and Inference · Financial Risk and Volatility Modeling · Probabilistic and Robust Engineering Design
