Generating Data using Monte Carlo Dropout

Kristian Miok; Dong Nguyen-Doan; Daniela Zaharie; Marko; Robnik-\v{S}ikonja

arXiv:1909.05755·stat.ML·September 17, 2019

Generating Data using Monte Carlo Dropout

Kristian Miok, Dong Nguyen-Doan, Daniela Zaharie, Marko, Robnik-\v{S}ikonja

PDF

1 Repo

TL;DR

This paper introduces Monte Carlo Dropout-enhanced autoencoders as efficient methods for generating synthetic data that closely resembles real data in statistical and predictive properties, especially useful when data collection is costly.

Contribution

It proposes novel MCD-AE and MCD-VAE models that improve synthetic data generation speed and specificity compared to traditional VAEs.

Findings

01

MCD-AE and MCD-VAE produce data similar to original datasets

02

The proposed methods are faster than standard VAEs

03

Generated data maintains statistical and predictive properties

Abstract

For many analytical problems the challenge is to handle huge amounts of available data. However, there are data science application areas where collecting information is difficult and costly, e.g., in the study of geological phenomena, rare diseases, faults in complex systems, insurance frauds, etc. In many such cases, generators of synthetic data with the same statistical and predictive properties as the actual data allow efficient simulations and development of tools and applications. In this work, we propose the incorporation of Monte Carlo Dropout method within Autoencoder (MCD-AE) and Variational Autoencoder (MCD-VAE) as efficient generators of synthetic data sets. As the Variational Autoencoder (VAE) is one of the most popular generator techniques, we explore its similarities and differences to the proposed methods. We compare the generated data sets with the original data based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

KristianMiok/MCD-VAE
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsMonte Carlo Dropout · Solana Customer Service Number +1-833-534-1729 · USD Coin Customer Service Number +1-833-534-1729 · Dropout