SEGAN: semi-supervised learning approach for missing data imputation
Xiaohua Pan, Weifeng Wu, Peiran Liu, Zhen Li, Peng Lu, Peijian Cao,, Jianfeng Zhang, Xianfei Qiu, YangYang Wu

TL;DR
This paper introduces SEGAN, a semi-supervised learning model for missing data imputation that leverages label information and a missing hint matrix to improve data completion accuracy.
Contribution
The paper presents a novel SEGAN model incorporating a classifier and missing hint matrix, enhancing data imputation by utilizing label information and theoretical learning guarantees.
Findings
SEGAN outperforms state-of-the-art methods by over 3% in data completion accuracy.
Theoretical proof shows SEGAN can learn true data distribution characteristics.
Experimental results confirm improved performance across various datasets.
Abstract
In many practical real-world applications, data missing is a very common phenomenon, making the development of data-driven artificial intelligence theory and technology increasingly difficult. Data completion is an important method for missing data preprocessing. Most existing miss-ing data completion models directly use the known information in the missing data set but ignore the impact of the data label information contained in the data set on the missing data completion model. To this end, this paper proposes a missing data completion model SEGAN based on semi-supervised learning, which mainly includes three important modules: generator, discriminator and classifier. In the SEGAN model, the classifier enables the generator to make more full use of known data and its label information when predicting missing data values. In addition, the SE-GAN model introduces a missing hint matrix…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Face and Expression Recognition · Domain Adaptation and Few-Shot Learning
MethodsSparse Evolutionary Training · Hierarchical Information Threading
