Mixture Experts with Test-Time Self-Supervised Aggregation for Tabular Imbalanced Regression
Yung-Chien Wang, Kuang-Da Wang, Wei-Yao Wang, and Wen-Chih Peng

TL;DR
This paper introduces MATI, a novel approach for imbalanced regression on tabular data that uses mixture experts and test-time self-supervised aggregation to adapt to varying test distributions, improving prediction accuracy.
Contribution
The paper proposes a new method combining Gaussian mixture models and self-supervised expert aggregation to address tabular imbalanced regression, a less-explored area.
Findings
Achieved 7.1% average MAE improvement over existing methods.
Effectively adapts to different test distribution types, including balanced, normal, and inverse.
Demonstrated effectiveness on four real-world datasets.
Abstract
Tabular data serve as a fundamental and ubiquitous representation of structured information in numerous real-world applications, e.g., finance and urban planning. In the realm of tabular imbalanced applications, data imbalance has been investigated in classification tasks with insufficient instances in certain labels, causing the model's ineffective generalizability. However, the imbalance issue of tabular regression tasks is underexplored, and yet is critical due to unclear boundaries for continuous labels and simplifying assumptions in existing imbalance regression work, which often rely on known and balanced test distributions. Such assumptions may not hold in practice and can lead to performance degradation. To address these issues, we propose MATI: Mixture Experts with Test-Time Self-Supervised Aggregation for Tabular Imbalance Regression, featuring two key innovations: (i) the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Data Stream Mining Techniques · Human Mobility and Location-Based Analysis
