Efficient Classification with Counterfactual Reasoning and Active   Learning

Azhar Mohammed; Dang Nguyen; Bao Duong; Thin Nguyen

arXiv:2207.12086·cs.LG·July 26, 2022

Efficient Classification with Counterfactual Reasoning and Active Learning

Azhar Mohammed, Dang Nguyen, Bao Duong, Thin Nguyen

PDF

Open Access 1 Repo

TL;DR

This paper introduces CCRAL, a novel data augmentation method for tabular data that uses causal reasoning and active learning to generate and select effective counterfactual samples, improving classification accuracy.

Contribution

The paper presents CCRAL, a new approach combining causal reasoning and active learning for generating and selecting counterfactual samples to enhance tabular data classification.

Findings

01

CCRAL outperforms standard baselines in accuracy and AUC.

02

The method effectively generates useful counterfactual samples.

03

Experimental validation on real-world datasets demonstrates significant improvements.

Abstract

Data augmentation is one of the most successful techniques to improve the classification accuracy of machine learning models in computer vision. However, applying data augmentation to tabular data is a challenging problem since it is hard to generate synthetic samples with labels. In this paper, we propose an efficient classifier with a novel data augmentation technique for tabular data. Our method called CCRAL combines causal reasoning to learn counterfactual samples for the original training samples and active learning to select useful counterfactual samples based on a region of uncertainty. By doing this, our method can maximize our model's generalization on the unseen testing data. We validate our method analytically, and compare with the standard baselines. Our experimental results highlight that CCRAL achieves significantly better performance than those of the baselines across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nphdang/ccral
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Domain Adaptation and Few-Shot Learning