Convex space learning for tabular synthetic data generation
Manjunath Mahendra, Chaithra Umesh, Saptarshi Bej, Kristian Schultz,, Olaf Wolkenhauer

TL;DR
This paper introduces NextConvGeN, a deep learning model that generates synthetic tabular data by learning the convex space of neighborhoods, improving data utility for classification, clustering, and privacy preservation.
Contribution
It presents a novel deep learning architecture for modeling the convex space of tabular data neighborhoods to generate high-quality synthetic datasets.
Findings
NextConvGeN outperforms five state-of-the-art models in preserving classification and clustering performance.
Synthetic data from NextConvGeN scores higher on utility measures.
The model offers promising applications in clinical research and data sharing.
Abstract
Generating synthetic samples from the convex space of the minority class is a popular oversampling approach for imbalanced classification problems. Recently, deep-learning approaches have been successfully applied to modeling the convex space of minority samples. Beyond oversampling, learning the convex space of neighborhoods in training data has not been used to generate entire tabular datasets. In this paper, we introduce a deep learning architecture (NextConvGeN) with a generator and discriminator component that can generate synthetic samples by learning to model the convex space of tabular data. The generator takes data neighborhoods as input and creates synthetic samples within the convex space of that neighborhood. Thereafter, the discriminator tries to classify these synthetic samples against a randomly sampled batch of data from the rest of the data space. We compared our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Anomaly Detection Techniques and Applications · Neural Networks and Applications
