Enhancing Model Robustness and Fairness with Causality: A Regularization   Approach

Zhao Wang; Kai Shu; Aron Culotta

arXiv:2110.00911·cs.LG·October 5, 2021·1 cites

Enhancing Model Robustness and Fairness with Causality: A Regularization Approach

Zhao Wang, Kai Shu, Aron Culotta

PDF

Open Access 1 Repo

TL;DR

This paper introduces a causal regularization method to improve machine learning model robustness and fairness by emphasizing causal features and reducing reliance on spurious correlations, validated through experiments.

Contribution

It presents a novel regularization approach that incorporates causal knowledge into model training to enhance robustness and fairness, based on manually identified causal and spurious features.

Findings

01

Significant improvement in model robustness against counterfactual texts.

02

Enhanced fairness with respect to sensitive attributes.

03

Effective separation of causal and spurious features during training.

Abstract

Recent work has raised concerns on the risk of spurious correlations and unintended biases in statistical machine learning models that threaten model robustness and fairness. In this paper, we propose a simple and intuitive regularization approach to integrate causal knowledge during model training and build a robust and fair model by emphasizing causal features and de-emphasizing spurious features. Specifically, we first manually identify causal and spurious features with principles inspired from the counterfactual framework of causal inference. Then, we propose a regularization approach to penalize causal and spurious features separately. By adjusting the strength of the penalty for each type of feature, we build a predictive model that relies more on causal features and less on non-causal features. We conduct experiments to evaluate model robustness and fairness on three datasets…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tapilab/emnlp-2021-regularization
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI · Adversarial Robustness in Machine Learning