Fairness-aware Summarization for Justified Decision-Making
Moniba Keymanesh, Tanya Berger-Wolf, Micha Elsner, Srinivasan, Parthasarathy

TL;DR
This paper introduces a fairness-aware summarization method for natural language explanations in decision models, aiming to improve fairness in both outcomes and justifications, especially in sensitive domains.
Contribution
It proposes a novel multi-task neural approach that extracts fair, high-utility summaries from biased explanations to enhance decision fairness and robustness against data poisoning.
Findings
Reduces demographic leakage in explanations
Moderately improves fairness in decision outcomes
Effectively detects and counters data poisoning attacks
Abstract
In consequential domains such as recidivism prediction, facility inspection, and benefit assignment, it's important for individuals to know the decision-relevant information for the model's prediction. In addition, predictions should be fair both in terms of the outcome and the justification of the outcome. In other words, decision-relevant features should provide sufficient information for the predicted outcome and should be independent of the membership of individuals in protected groups such as race and gender. In this work, we focus on the problem of (un)fairness in the justification of the text-based neural models. We tie the explanatory power of the model to fairness in the outcome and propose a fairness-aware summarization mechanism to detect and counteract the bias in such models. Given a potentially biased natural language explanation for a decision, we use a multi-task neural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI · Topic Modeling
