The sample size required in importance sampling

Sourav Chatterjee; Persi Diaconis

arXiv:1511.01437 · math.PR · June 22, 2017

TL;DR

This paper shows that, in a fairly general setting, the sample size needed for accurate importance sampling is approximately exp(D(ν||μ)), the exponential of the Kullback-Leibler divergence between the target measure ν and the proposal measure μ, and that this requirement exhibits a cut-off on the logarithmic scale.

Contribution

It proves a general result linking the required sample size to the KL divergence and applies it to one-parameter exponential families, giving a general formula for the sample size that importance sampling requires there.

Findings

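01 A sample of size roughly exp(D(ν||μ)) is necessary and sufficient for accurate estimation.

02 The required sample size exhibits a cut-off on the logarithmic scale.

03 Applying the theory to one-parameter exponential families yields a general formula for the required sample size.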
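
For reference, the quantities behind these findings can be written out as follows. This is a sketch under the standard assumption that ν is absolutely continuous with respect to μ, with density ρ = dν/dμ. The symbols ρ, X_i, I_n(f), and θ are notation introduced here for illustration, and the Gaussian shift is a worked example chosen for simplicity, not one taken from this page.

```latex
% Importance sampling estimate of I(f) = \int f \, d\nu
% from X_1, ..., X_n drawn i.i.d. from \mu
I_n(f) = \frac{1}{n} \sum_{i=1}^{n} f(X_i)\, \rho(X_i),
\qquad \rho = \frac{d\nu}{d\mu}

% Kullback-Leibler divergence appearing in the sample-size estimate
D(\nu \,\|\, \mu) = \int \log \rho \, d\nu

% Illustrative case: \nu = N(\theta, 1), \mu = N(0, 1)
\rho(x) = \exp\!\left(\theta x - \tfrac{\theta^2}{2}\right),
\qquad
D(\nu \,\|\, \mu)
  = \mathbb{E}_{\nu}\!\left[\theta X - \tfrac{\theta^2}{2}\right]
  = \frac{\theta^2}{2}
```

In this illustrative case, θ = 4 gives D(ν||μ) = 8, so the suggested sample size is on the order of e^8, roughly 3,000 draws from μ.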

Abstract

The goal of importance sampling is to estimate the expected value of a given function with respect to a probability measure $\nu$ using a random sample of size $n$ drawn from a different probability measure $\mu$. If the two measures $\mu$ and $\nu$ are nearly singular with respect to each other, which is often the case in practice, the sample size required for accurate estimation is large. In this article it is shown that in a fairly general setting, a sample of size approximately $\exp(D(\nu \,\|\, \mu))$ is necessary and sufficient for accurate estimation by importance sampling, where $D(\nu \,\|\, \mu)$ is the Kullback-Leibler divergence of $\mu$ from $\nu$. In particular, the required sample size exhibits a kind of cut-off in the logarithmic scale. The theory is applied to obtain a general formula for the sample size required in importance sampling for one-parameter exponential families (Gibbs…
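
The estimator described in the abstract can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: it assumes a Gaussian shift pair, ν = N(θ, 1) as the target and μ = N(0, 1) as the proposal, for which the density ratio is exp(θx − θ²/2) and D(ν||μ) = θ²/2. The function names `kl_gaussian_shift`, `density_ratio`, and `importance_estimate` are hypothetical helpers introduced here.

```python
import math
import random


def kl_gaussian_shift(theta):
    """D(nu || mu) for nu = N(theta, 1) and mu = N(0, 1) (assumed example pair)."""
    return theta * theta / 2.0


def density_ratio(x, theta):
    """rho(x) = dnu/dmu at x for nu = N(theta, 1), mu = N(0, 1)."""
    return math.exp(theta * x - theta * theta / 2.0)


def importance_estimate(f, theta, n, rng):
    """Average of f(X_i) * rho(X_i) over n draws X_i from mu = N(0, 1)."""
    total = 0.0
    for _ in range(n):
        x = rng.gauss(0.0, 1.0)
        total += f(x) * density_ratio(x, theta)
    return total / n


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed, only for repeatability
    theta = 3.0
    n_star = math.exp(kl_gaussian_shift(theta))  # exp(D(nu || mu)), about 90
    # Estimate I(f) = 1 for f = 1 with sample sizes well below, at,
    # and well above n_star.
    for scale in (0.01, 1.0, 100.0):
        n = max(1, round(scale * n_star))
        estimate = importance_estimate(lambda x: 1.0, theta, n, rng)
        print(f"n = {n:6d}  estimate of 1: {estimate:.3f}")
```

Running the script compares sample sizes far below and far above exp(D(ν||μ)) when estimating the constant function 1, whose true expectation is 1. Individual runs vary with the seed, so the printed values are only indicative of the behavior the abstract describes.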
