Visualizing and Explaining Language Models
Adrian M.P. Bra\c{s}oveanu, R\u{a}zvan Andonie

TL;DR
This paper reviews visualization techniques for deep learning language models, emphasizing their role in enhancing interpretability and explainability of complex NLP systems.
Contribution
It provides an overview of popular visualization methods used in NLP deep learning models, highlighting their importance for interpretability.
Findings
Visualization aids understanding of model decisions
Coloring and clustering reveal model focus areas
Enhances trust and transparency in NLP models
Abstract
During the last decade, Natural Language Processing has become, after Computer Vision, the second field of Artificial Intelligence that was massively changed by the advent of Deep Learning. Regardless of the architecture, the language models of the day need to be able to process or generate text, as well as predict missing words, sentences or relations depending on the task. Due to their black-box nature, such models are difficult to interpret and explain to third parties. Visualization is often the bridge that language model designers use to explain their work, as the coloring of the salient words and phrases, clustering or neuron activations can be used to quickly understand the underlying models. This paper showcases the techniques used in some of the most popular Deep Learning for NLP visualizations, with a special focus on interpretability and explainability.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)
