Towards Explainability in NLP: Analyzing and Calculating Word Saliency through Word Properties
Jialiang Dong, Zhitao Guan, Longfei Wu, Zijian Zhang, Xiaojiang Du

TL;DR
This paper investigates how word properties relate to saliency in NLP models, proposing a new mapping model and dataset to enhance explainability and interpretability of model predictions.
Contribution
It introduces Seq2Saliency, a sequence tagging model that predicts word saliency from properties, and creates PrSalM, a dataset linking word features to saliency values.
Findings
Seq2Saliency effectively predicts word saliency based on properties.
Word properties like part of speech influence saliency levels.
The dataset PrSalM enables further research on explainability in NLP.
Abstract
The wide use of black-box models in natural language processing brings great challenges to the understanding of the decision basis, the trustworthiness of the prediction results, and the improvement of the model performance. The words in text samples have properties that reflect their semantics and contextual information, such as the part of speech, the position, etc. These properties may have certain relationships with the word saliency, which is of great help for studying the explainability of the model predictions. In this paper, we explore the relationships between the word saliency and the word properties. According to the analysis results, we further establish a mapping model, Seq2Saliency, from the words in a text sample and their properties to the saliency values based on the idea of sequence tagging. In addition, we establish a new dataset called PrSalM, which contains each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Natural Language Processing Techniques
