Towards Explainable NLP: A Generative Explanation Framework for Text Classification
Hui Liu, Qingyu Yin, William Yang Wang

TL;DR
This paper introduces a generative framework for NLP that simultaneously performs text classification and produces human-readable, fine-grained explanations, improving interpretability and accuracy.
Contribution
It proposes a novel explainable factor and minimum risk training approach for generating explanations alongside classification, along with new datasets for evaluation.
Findings
Outperforms baseline models on two datasets
Generates concise, human-readable explanations
Achieves better interpretability without sacrificing accuracy
Abstract
Building explainable systems is a critical problem in the field of Natural Language Processing (NLP), since most machine learning models provide no explanations for the predictions. Existing approaches for explainable machine learning systems tend to focus on interpreting the outputs or the connections between inputs and outputs. However, the fine-grained information is often ignored, and the systems do not explicitly generate the human-readable explanations. To better alleviate this problem, we propose a novel generative explanation framework that learns to make classification decisions and generate fine-grained explanations at the same time. More specifically, we introduce the explainable factor and the minimum risk training approach that learn to generate more reasonable explanations. We construct two new datasets that contain summaries, rating scores, and fine-grained reasons. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Stock Market Forecasting Methods
