Efficient Classification with Counterfactual Reasoning and Active Learning
Azhar Mohammed, Dang Nguyen, Bao Duong, Thin Nguyen

TL;DR
This paper introduces CCRAL, a novel data augmentation method for tabular data that uses causal reasoning and active learning to generate and select effective counterfactual samples, improving classification accuracy.
Contribution
The paper presents CCRAL, a new approach combining causal reasoning and active learning for generating and selecting counterfactual samples to enhance tabular data classification.
Findings
CCRAL outperforms standard baselines in accuracy and AUC.
The method effectively generates useful counterfactual samples.
Experimental validation on real-world datasets demonstrates significant improvements.
Abstract
Data augmentation is one of the most successful techniques to improve the classification accuracy of machine learning models in computer vision. However, applying data augmentation to tabular data is a challenging problem since it is hard to generate synthetic samples with labels. In this paper, we propose an efficient classifier with a novel data augmentation technique for tabular data. Our method called CCRAL combines causal reasoning to learn counterfactual samples for the original training samples and active learning to select useful counterfactual samples based on a region of uncertainty. By doing this, our method can maximize our model's generalization on the unseen testing data. We validate our method analytically, and compare with the standard baselines. Our experimental results highlight that CCRAL achieves significantly better performance than those of the baselines across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Domain Adaptation and Few-Shot Learning
