Binning as a Pretext Task: Improving Self-Supervised Learning in Tabular Domains
Kyungeun Lee, Ye Seul Sim, Hye-Seung Cho, Moonjung Eo, Suhee Yoon,, Sanghyu Yoon, Woohyung Lim

TL;DR
This paper introduces a novel self-supervised pretext task for tabular data that uses binning to improve representation learning by capturing irregular dependencies and standardizing features, leading to better downstream task performance.
Contribution
The paper proposes a binning-based pretext task for self-supervised learning in tabular domains, effectively handling heterogeneous features and irregular functions, which is a novel approach in this context.
Findings
Consistently improves downstream task performance across datasets
Captures irregular functions and feature dependencies effectively
Standardizes features into categories, aiding learning
Abstract
The ability of deep networks to learn superior representations hinges on leveraging the proper inductive biases, considering the inherent properties of datasets. In tabular domains, it is critical to effectively handle heterogeneous features (both categorical and numerical) in a unified manner and to grasp irregular functions like piecewise constant functions. To address the challenges in the self-supervised learning framework, we propose a novel pretext task based on the classical binning method. The idea is straightforward: reconstructing the bin indices (either orders or classes) rather than the original values. This pretext task provides the encoder with an inductive bias to capture the irregular dependencies, mapping from continuous inputs to discretized bins, and mitigates the feature heterogeneity by setting all features to have category-type targets. Our empirical investigations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInnovative Teaching and Learning Methods
