Is the MMI Criterion Necessary for Interpretability? Degenerating   Non-causal Features to Plain Noise for Self-Rationalization

Wei Liu; Zhiying Deng; Zhongyu Niu; Jun Wang; Haozhao Wang; YuanKai; Zhang; Ruixuan Li

arXiv:2410.06003·cs.LG·October 23, 2024

Is the MMI Criterion Necessary for Interpretability? Degenerating Non-causal Features to Plain Noise for Self-Rationalization

Wei Liu, Zhiying Deng, Zhongyu Niu, Jun Wang, Haozhao Wang, YuanKai, Zhang, Ruixuan Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces the MRD criterion, which treats spurious features as noise to improve rationale extraction in datasets with many non-causal features, outperforming existing MMI-based methods.

Contribution

The paper proposes a novel MRD criterion that enhances rationale extraction by considering spurious features as noise, simplifying the identification of causal features.

Findings

01

MRD improves rationale quality by up to 10.4% over recent methods.

02

Theoretical analysis shows removing causal features significantly alters conditional distributions.

03

Experiments on six datasets validate the effectiveness of MRD.

Abstract

An important line of research in the field of explainability is to extract a small subset of crucial rationales from the full input. The most widely used criterion for rationale extraction is the maximum mutual information (MMI) criterion. However, in certain datasets, there are spurious features non-causally correlated with the label and also get high mutual information, complicating the loss landscape of MMI. Although some penalty-based methods have been developed to penalize the spurious features (e.g., invariance penalty, intervention penalty, etc) to help MMI work better, these are merely remedial measures. In the optimization objectives of these methods, spurious features are still distinguished from plain noise, which hinders the discovery of causal rationales. This paper aims to develop a new criterion that treats spurious features as plain noise, allowing the model to work on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jugechengzi/rationalization-mrd
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques