Distribution Estimation of Contaminated Data via DNN-based MoM-GANs

Fang Xie; Lihu Xu; Qiuran Yao; Huiming Zhang

arXiv:2212.13741·stat.ML·December 29, 2022

TL;DR

This paper introduces a DNN-based MoM-GAN approach for robust distribution estimation of contaminated data, providing theoretical error bounds and demonstrating superior performance in real applications.

Contribution

The paper develops MoM-GAN, a method that combines GANs with median-of-mean (MoM) estimation, and derives non-asymptotic error bounds for the resulting estimator on contaminated data.

Findings

01

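The error bound decreases essentially as $n^{-b/p} \vee n^{-1/2}$, where $n$ is the sample size, $p$ is the dimension of the input data, and $b$ is the smoothness of the Hölder class that defines the integral probability metric.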

02

In two real-data applications, MoM-GAN outperforms other competitive methods on contaminated data.

03

The paper provides an implementable algorithm for the proposed method.
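
As a reading aid for the rate in finding 01: $\vee$ denotes the maximum, so the slower of the two terms governs the bound. The case split below is elementary algebra for $n \ge 1$, not a statement taken from the paper, and it ignores constants and any logarithmic factors, which this summary does not specify.

```latex
n^{-b/p} \vee n^{-1/2}
  = \max\left\{ n^{-b/p},\; n^{-1/2} \right\}
  = \begin{cases}
      n^{-b/p}, & p \ge 2b, \\
      n^{-1/2}, & p < 2b.
    \end{cases}
```

For instance, with the illustrative values $b = 1$, $p = 4$, and $n = 10^4$, the two terms are $n^{-1/4} = 0.1$ and $n^{-1/2} = 0.01$, so the dimension-dependent term dominates.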

Abstract

This paper studies the distribution estimation of contaminated data by the MoM-GAN method, which combines generative adversarial net (GAN) and median-of-mean (MoM) estimation. We use a deep neural network (DNN) with a ReLU activation function to model the generator and discriminator of the GAN. Theoretically, we derive a non-asymptotic error bound for the DNN-based MoM-GAN estimator measured by integral probability metrics with the $b$-smoothness Hölder class. The error bound decreases essentially as $n^{-b/p} \vee n^{-1/2}$, where $n$ and $p$ are the sample size and the dimension of input data. We give an algorithm for the MoM-GAN method and implement it through two real applications. The numerical results show that the MoM-GAN outperforms other competitive methods when dealing with contaminated data.
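
The MoM idea mentioned in the abstract can be illustrated on the simplest case, estimating a scalar mean. The sketch below is the classical median-of-means estimator, not the paper's MoM-GAN algorithm, which is not reproduced on this page. The function name `median_of_means`, the block count, and the synthetic contaminated sample are all assumptions chosen for illustration.

```python
import numpy as np


def median_of_means(x, num_blocks, rng=None):
    """Classical median-of-means estimate of the mean of a 1-D sample.

    Illustrative helper (hypothetical name): shuffle the data, split it
    into `num_blocks` roughly equal blocks, average each block, and
    return the median of the block averages.
    """
    x = np.asarray(x, dtype=float)
    rng = np.random.default_rng(rng)
    shuffled = rng.permutation(x)                  # random block assignment
    blocks = np.array_split(shuffled, num_blocks)  # block sizes differ by at most 1
    block_means = np.array([b.mean() for b in blocks])
    return float(np.median(block_means))


# Synthetic contaminated sample (assumed for illustration):
# 1000 draws from N(0, 1) plus 20 gross outliers at 1000.
data_rng = np.random.default_rng(0)
clean = data_rng.normal(loc=0.0, scale=1.0, size=1000)
outliers = np.full(20, 1000.0)
sample = np.concatenate([clean, outliers])

plain_mean = float(sample.mean())
# 101 blocks: at most 20 of them can contain an outlier, so the
# median of the block means is taken over mostly clean blocks.
mom_mean = median_of_means(sample, num_blocks=101, rng=1)

print(f"plain mean:      {plain_mean:.3f}")
print(f"median of means: {mom_mean:.3f}")
```

The design point is that each outlier can corrupt only the block it falls into, so as long as fewer than half the blocks are contaminated, the median of the block means stays near the clean mean while the plain average is pulled toward the outliers.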

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Generative Adversarial Networks and Image Synthesis