Visualizing and Explaining Language Models

Adrian M.P. Bra\c{s}oveanu; R\u{a}zvan Andonie

arXiv:2205.10238·cs.CL·May 23, 2022

Visualizing and Explaining Language Models

Adrian M.P. Bra\c{s}oveanu, R\u{a}zvan Andonie

PDF

Open Access

TL;DR

This paper reviews visualization techniques for deep learning language models, emphasizing their role in enhancing interpretability and explainability of complex NLP systems.

Contribution

It provides an overview of popular visualization methods used in NLP deep learning models, highlighting their importance for interpretability.

Findings

01

Visualization aids understanding of model decisions

02

Coloring and clustering reveal model focus areas

03

Enhances trust and transparency in NLP models

Abstract

During the last decade, Natural Language Processing has become, after Computer Vision, the second field of Artificial Intelligence that was massively changed by the advent of Deep Learning. Regardless of the architecture, the language models of the day need to be able to process or generate text, as well as predict missing words, sentences or relations depending on the task. Due to their black-box nature, such models are difficult to interpret and explain to third parties. Visualization is often the bridge that language model designers use to explain their work, as the coloring of the salient words and phrases, clustering or neuron activations can be used to quickly understand the underlying models. This paper showcases the techniques used in some of the most popular Deep Learning for NLP visualizations, with a special focus on interpretability and explainability.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)