Analysis of Noise Contrastive Estimation from the Perspective of Asymptotic Variance
Masatoshi Uehara, Takeru Matsuda, Fumiyasu Komaki

TL;DR
This paper analyzes Noise Contrastive Estimation (NCE), focusing on reducing its asymptotic variance through auxiliary distribution parameter estimation and objective function optimization, while also examining its robustness.
Contribution
It introduces a method to reduce NCE's asymptotic variance by estimating auxiliary distribution parameters and optimizing objective functions, enhancing estimator efficiency.
Findings
Proposed a variance reduction method for NCE.
Identified optimal objective function forms for minimal asymptotic variance.
Analyzed the robustness of the NCE estimator.
Abstract
There are many models, often called unnormalized models, whose normalizing constants are not calculated in closed form. Maximum likelihood estimation is not directly applicable to unnormalized models. Score matching, contrastive divergence method, pseudo-likelihood, Monte Carlo maximum likelihood, and noise contrastive estimation (NCE) are popular methods for estimating parameters of such models. In this paper, we focus on NCE. The estimator derived from NCE is consistent and asymptotically normal because it is an M-estimator. NCE characteristically uses an auxiliary distribution to calculate the normalizing constant in the same spirit of the importance sampling. In addition, there are several candidates as objective functions of NCE. We focus on how to reduce asymptotic variance. First, we propose a method for reducing asymptotic variance by estimating the parameters of the auxiliary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Statistical Methods and Inference · Financial Risk and Volatility Modeling
