The sample size required in importance sampling
Sourav Chatterjee, Persi Diaconis

TL;DR
This paper establishes that the sample size needed for effective importance sampling is approximately exponential in the Kullback-Leibler divergence between the target and proposal measures, highlighting a sharp cutoff phenomenon.
Contribution
It provides a general theoretical result linking sample size to KL divergence and applies this to exponential family distributions, clarifying importance sampling efficiency.
Findings
Sample size is roughly exponential in D(ν||μ).
A cut-off phenomenon in sample size requirements is identified.
Application to exponential families yields explicit formulas.
Abstract
The goal of importance sampling is to estimate the expected value of a given function with respect to a probability measure using a random sample of size drawn from a different probability measure . If the two measures and are nearly singular with respect to each other, which is often the case in practice, the sample size required for accurate estimation is large. In this article it is shown that in a fairly general setting, a sample of size approximately is necessary and sufficient for accurate estimation by importance sampling, where is the Kullback-Leibler divergence of from . In particular, the required sample size exhibits a kind of cut-off in the logarithmic scale. The theory is applied to obtain a general formula for the sample size required in importance sampling for one-parameter exponential families (Gibbs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
