LS-Tree: Model Interpretation When the Data Are Linguistic

Jianbo Chen; Michael I. Jordan

arXiv:1902.04187 · cs.LG · February 13, 2019 · 5 cites

Open Access

TL;DR

This paper introduces LS-Tree, a method for interpreting trained classification models on linguistic data. It assigns least-squares importance scores to the words of an instance using the constituency structure of a parse tree, then uses those scores to detect and quantify interactions between words.

Contribution

It proposes least-squares importance scores computed over a parse tree and characterizes them axiomatically by relating them to the Banzhaf value in coalitional game theory.

Findings

01

Effectively interprets language models using parse-tree importance scores

02

Detects and quantifies word interactions in sentences

03

Enhances interpretability and diagnostics for NLP models

Abstract

We study the problem of interpreting trained classification models in the setting of linguistic data sets. Leveraging a parse tree, we propose to assign least-squares based importance scores to each word of an instance by exploiting syntactic constituency structure. We establish an axiomatic characterization of these importance scores by relating them to the Banzhaf value in coalitional game theory. Based on these importance scores, we develop a principled method for detecting and quantifying interactions between words in a sentence. We demonstrate that the proposed method can aid in interpretability and diagnostics for several widely-used language models.
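The least-squares idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation. The value function `toy_value`, the hand-written constituents in `tree_nodes`, the intercept term, and all function names are assumptions made for illustration. The sketch fits one score per word so that the summed scores over each word subset approximate the model's value on that subset. It also checks a standard fact: when the fit runs over all subsets with an intercept, the scores equal the Banzhaf values.

```python
import itertools
import numpy as np

def least_squares_scores(value_fn, subsets, n_words):
    """Fit per-word scores so that their sum over each subset approximates
    value_fn(subset) in the least-squares sense (intercept is an assumption)."""
    X = np.zeros((len(subsets), n_words + 1))
    X[:, 0] = 1.0  # intercept column
    for row, subset in enumerate(subsets):
        for i in subset:
            X[row, i + 1] = 1.0  # indicator: word i is in this subset
    y = np.array([value_fn(frozenset(s)) for s in subsets], dtype=float)
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef[1:]  # drop the intercept, keep one score per word

def banzhaf_values(value_fn, n_words):
    """Banzhaf value: average marginal contribution of each word over all
    subsets of the remaining words."""
    values = np.zeros(n_words)
    for i in range(n_words):
        others = [j for j in range(n_words) if j != i]
        for r in range(len(others) + 1):
            for combo in itertools.combinations(others, r):
                rest = frozenset(combo)
                values[i] += value_fn(rest | {i}) - value_fn(rest)
    return values / 2 ** (n_words - 1)

def toy_value(subset):
    """Hypothetical model output for "not very good" (word indices 0, 1, 2)
    when only the words in `subset` are kept."""
    score = 0.0
    if 2 in subset:
        score += 1.0   # "good" alone is positive
    if 1 in subset and 2 in subset:
        score += 0.5   # "very" intensifies "good"
    if 0 in subset and 2 in subset:
        score -= 2.0   # "not" flips "good"
    return score

n = 3
all_subsets = [frozenset(c) for r in range(n + 1)
               for c in itertools.combinations(range(n), r)]
# Hand-written constituents of a hypothetical parse: (not (very good)).
tree_nodes = [frozenset({0}), frozenset({1}), frozenset({2}),
              frozenset({1, 2}), frozenset({0, 1, 2})]

full_scores = least_squares_scores(toy_value, all_subsets, n)
banzhaf = banzhaf_values(toy_value, n)
tree_scores = least_squares_scores(toy_value, tree_nodes, n)

print("least squares over all subsets:", full_scores)
print("Banzhaf values:                ", banzhaf)
print("least squares over tree nodes: ", tree_scores)
```

Restricting the fit to tree nodes needs one model evaluation per constituent rather than one per subset, which is presumably what makes a parse-tree-based score practical for longer sentences.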

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

Methods: Interpretability