Pruning Literals for Highly Efficient Explainability at Word Level
Rohan Kumar Yadav, Bimal Bhattarai, Abhik Jana, Lei Jiao, Seid Muhie, Yimam

TL;DR
This paper introduces a post-hoc pruning method for Tsetlin Machine models to improve word-level explainability in NLP, making explanations more human-understandable without sacrificing accuracy.
Contribution
It proposes a novel clause pruning technique that enhances interpretability of Tsetlin Machines while maintaining or improving predictive performance.
Findings
Pruned TM's attention map aligns better with human attention.
Pruning improves interpretability without degrading accuracy.
Achieves up to 9% accuracy improvement on YELP-HAT dataset.
Abstract
Designing an explainable model becomes crucial now for Natural Language Processing(NLP) since most of the state-of-the-art machine learning models provide a limited explanation for the prediction. In the spectrum of an explainable model, Tsetlin Machine(TM) is promising because of its capability of providing word-level explanation using proposition logic. However, concern rises over the elaborated combination of literals (propositional logic) in the clause that makes the model difficult for humans to comprehend, despite having a transparent learning process. In this paper, we design a post-hoc pruning of clauses that eliminate the randomly placed literals in the clause thereby making the model more efficiently interpretable than the vanilla TM. Experiments on the publicly available YELP-HAT Dataset demonstrate that the proposed pruned TM's attention map aligns more with the human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSoftmax · Attention Is All You Need · Pruning
