Prediction of rare feature combinations in population synthesis:   Application of deep generative modelling

Sergio Garrido; Stanislav S. Borysov; Francisco C. Pereira; Jeppe Rich

arXiv:1909.07689·stat.ML·September 18, 2019

Prediction of rare feature combinations in population synthesis: Application of deep generative modelling

Sergio Garrido, Stanislav S. Borysov, Francisco C. Pereira, Jeppe Rich

PDF

TL;DR

This paper introduces deep generative models, WGAN and VAE, to improve population synthesis by effectively estimating rare feature combinations and addressing sampling zeros in large, multivariate datasets.

Contribution

It adapts and applies WGAN and VAE models to population synthesis, specifically tackling the sampling-zero problem in high-dimensional data.

Findings

01

Models recover sampling zeros effectively.

02

VAE outperforms other methods in low-dimensional cases.

03

High-dimensional results show significant improvement in sampling zero estimation.

Abstract

In population synthesis applications, when considering populations with many attributes, a fundamental problem is the estimation of rare combinations of feature attributes. Unsurprisingly, it is notably more difficult to reliably representthe sparser regions of such multivariate distributions and in particular combinations of attributes which are absent from the original sample. In the literature this is commonly known as sampling zeros for which no systematic solution has been proposed so far. In this paper, two machine learning algorithms, from the family of deep generative models,are proposed for the problem of population synthesis and with particular attention to the problem of sampling zeros. Specifically, we introduce the Wasserstein Generative Adversarial Network (WGAN) and the Variational Autoencoder(VAE), and adapt these algorithms for a large-scale population synthesis…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsConvolution · Wasserstein GAN · USD Coin Customer Service Number +1-833-534-1729