SEGAN: semi-supervised learning approach for missing data imputation

Xiaohua Pan; Weifeng Wu; Peiran Liu; Zhen Li; Peng Lu; Peijian Cao,; Jianfeng Zhang; Xianfei Qiu; YangYang Wu

arXiv:2405.13089·cs.LG·June 13, 2024

SEGAN: semi-supervised learning approach for missing data imputation

Xiaohua Pan, Weifeng Wu, Peiran Liu, Zhen Li, Peng Lu, Peijian Cao,, Jianfeng Zhang, Xianfei Qiu, YangYang Wu

PDF

Open Access 1 Repo

TL;DR

This paper introduces SEGAN, a semi-supervised learning model for missing data imputation that leverages label information and a missing hint matrix to improve data completion accuracy.

Contribution

The paper presents a novel SEGAN model incorporating a classifier and missing hint matrix, enhancing data imputation by utilizing label information and theoretical learning guarantees.

Findings

01

SEGAN outperforms state-of-the-art methods by over 3% in data completion accuracy.

02

Theoretical proof shows SEGAN can learn true data distribution characteristics.

03

Experimental results confirm improved performance across various datasets.

Abstract

In many practical real-world applications, data missing is a very common phenomenon, making the development of data-driven artificial intelligence theory and technology increasingly difficult. Data completion is an important method for missing data preprocessing. Most existing miss-ing data completion models directly use the known information in the missing data set but ignore the impact of the data label information contained in the data set on the missing data completion model. To this end, this paper proposes a missing data completion model SEGAN based on semi-supervised learning, which mainly includes three important modules: generator, discriminator and classifier. In the SEGAN model, the classifier enables the generator to make more full use of known data and its label information when predicting missing data values. In addition, the SE-GAN model introduces a missing hint matrix…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

niunaicoke/SEGAN-A-Semi-Supervised-Learning-Method-for-Missing-Data-Imputation
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Face and Expression Recognition · Domain Adaptation and Few-Shot Learning

MethodsSparse Evolutionary Training · Hierarchical Information Threading