exBERT: A Visual Analysis Tool to Explore Learned Representations in   Transformers Models

Benjamin Hoover; Hendrik Strobelt; Sebastian Gehrmann

arXiv:1910.05276·cs.CL·October 14, 2019·47 cites

exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformers Models

Benjamin Hoover, Hendrik Strobelt, Sebastian Gehrmann

PDF

Open Access 1 Repo

TL;DR

exBERT is an interactive visualization tool designed to help researchers and practitioners explore and understand the learned attention representations within transformer-based language models like BERT, enhancing interpretability.

Contribution

The paper introduces exBERT, a novel interactive tool that visualizes and explains attention mechanisms in transformers by matching inputs to similar contexts in annotated datasets.

Findings

01

Provides intuitive explanations of attention-head functions

02

Enhances understanding of model-internal reasoning processes

03

Facilitates targeted analysis of learned representations

Abstract

Large language models can produce powerful contextual representations that lead to improvements across many NLP tasks. Since these models are typically guided by a sequence of learned self attention mechanisms and may comprise undesired inductive biases, it is paramount to be able to explore what the attention has learned. While static analyses of these models lead to targeted insights, interactive tools are more dynamic and can help humans better gain an intuition for the model-internal reasoning process. We present exBERT, an interactive tool named after the popular BERT language model, that provides insights into the meaning of the contextual representations by matching a human-specified input to similar contexts in a large annotated dataset. By aggregating the annotations of the matching similar contexts, exBERT helps intuitively explain what each attention-head has learned.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

common-english/bert-all
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Data Visualization and Analytics · Natural Language Processing Techniques

MethodsLinear Layer · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Adam · WordPiece · Softmax