Adaptive generative moment matching networks for improved learning of dependence structures
Marius Hofert, Gan Yao

TL;DR
This paper introduces an adaptive bandwidth selection method for GMMNs that enhances learning of dependence structures, especially in high-dimensional copula modeling, with improved training performance and predictive accuracy.
Contribution
The paper proposes an adaptive kernel bandwidth selection procedure for GMMNs, significantly improving training efficiency and dependence modeling over existing methods.
Findings
AGMMNs outperform GMMNs in training and prediction.
Convergence rates for high-dimensional copulas are demonstrated up to 100 dimensions.
Improved modeling of dependence structures in financial data.
Abstract
An adaptive bandwidth selection procedure for the mixture kernel in the maximum mean discrepancy (MMD) for fitting generative moment matching networks (GMMNs) is introduced, and its ability to improve the learning of copula random number generators is demonstrated. Based on the relative error of the training loss, the number of kernels is increased during training; additionally, the relative error of the validation loss is used as an early stopping criterion. While training time of such adaptively trained GMMNs (AGMMNs) is similar to that of GMMNs, training performance is increased significantly in comparison to GMMNs, which is assessed and shown based on validation MMD trajectories, samples and validation MMD values. Superiority of AGMMNs over GMMNs, as well as typical parametric copula models, is demonstrated in terms of three applications. First, convergence rates of quasi-random…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Tensor decomposition and applications · Generative Adversarial Networks and Image Synthesis
