A Game Theoretic Approach to Class-wise Selective Rationalization
Shiyu Chang, Yang Zhang, Mo Yu, Tommi S. Jaakkola

TL;DR
This paper introduces a game theoretic method for class-dependent rationalization in neural models, enabling the highlighting of evidence supporting both factual and counterfactual scenarios, improving interpretability.
Contribution
It presents a novel game theoretic framework for class-wise rationalization, capturing multi-faceted rationales and supporting both factual and counterfactual explanations.
Findings
Effectively identifies factual and counterfactual rationales
Aligns with human rationalization in sentiment tasks
Demonstrates improved interpretability
Abstract
Selection of input features such as relevant pieces of text has become a common technique of highlighting how complex neural predictors operate. The selection can be optimized post-hoc for trained models or incorporated directly into the method itself (self-explaining). However, an overall selection does not properly capture the multi-faceted nature of useful rationales such as pros and cons for decisions. To this end, we propose a new game theoretic approach to class-dependent rationalization, where the method is specifically trained to highlight evidence supporting alternative conclusions. Each class involves three players set up competitively to find evidence for factual and counterfactual scenarios. We show theoretically in a simplified scenario how the game drives the solution towards meaningful class-dependent rationales. We evaluate the method in single- and multi-aspect…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Sentiment Analysis and Opinion Mining
