Supervised Contrastive Learning with Tree-Structured Parzen Estimator Bayesian Optimization for Imbalanced Tabular Data
Shuting Tao, Peng Peng, Qi Li, Hongwei Wang

TL;DR
This paper introduces a supervised contrastive learning approach combined with Bayesian hyper-parameter optimization to improve classification performance on imbalanced tabular datasets, addressing label bias and tuning challenges.
Contribution
It proposes a novel SCL method tailored for imbalanced tabular data and integrates TPE for automatic hyper-parameter tuning, outperforming existing methods.
Findings
TPE outperforms grid, random, and genetic search in hyper-parameter tuning.
SCL-TPE significantly improves classification accuracy on imbalanced datasets.
The method effectively learns discriminative representations despite data imbalance.
Abstract
Class imbalance has a detrimental effect on the predictive performance of most supervised learning algorithms as the imbalanced distribution can lead to a bias preferring the majority class. To solve this problem, we propose a Supervised Contrastive Learning (SCL) method with Tree-structured Parzen Estimator (TPE) technique for imbalanced tabular datasets. Contrastive learning (CL) can extract the information hidden in data even without labels and has shown some potential for imbalanced learning tasks. SCL further considers the label information based on CL, which also addresses the insufficient data augmentation techniques of tabular data. Therefore, in this work, we propose to use SCL to learn a discriminative representation of imbalanced tabular data. Additionally, the hyper-parameter temperature of SCL has a decisive influence on the performance and is difficult to tune. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Text and Document Classification Technologies · Vehicle License Plate Recognition
MethodsContrastive Learning
