Tabular Data Contrastive Learning via Class-Conditioned and Feature-Correlation Based Augmentation
Wei Cui, Rasa Hosseinzadeh, Junwei Ma, Tongzi Wu, Yi Sui, and Keyvan Golestan

TL;DR
This paper introduces a class-conditioned and feature-correlation-based augmentation method for contrastive learning on tabular data, significantly improving classification performance by more effective data corruption strategies.
Contribution
It proposes a novel augmentation technique for tabular contrastive learning that conditions data corruption on class labels and feature correlations, enhancing model robustness.
Findings
Outperforms traditional augmentation methods in tabular data classification
Improves contrastive learning effectiveness through class-aware data corruption
Demonstrates consistent gains across multiple datasets
Abstract
Contrastive learning is a model pre-training technique by first creating similar views of the original data, and then encouraging the data and its corresponding views to be close in the embedding space. Contrastive learning has witnessed success in image and natural language data, thanks to the domain-specific augmentation techniques that are both intuitive and effective. Nonetheless, in tabular domain, the predominant augmentation technique for creating views is through corrupting tabular entries via swapping values, which is not as sound or effective. We propose a simple yet powerful improvement to this augmentation technique: corrupting tabular data conditioned on class identity. Specifically, when corrupting a specific tabular entry from an anchor row, instead of randomly sampling a value in the same feature column from the entire table uniformly, we only sample from rows that are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Face and Expression Recognition · Text and Document Classification Technologies
MethodsContrastive Learning
