Influence Patterns for Explaining Information Flow in BERT

Kaiji Lu; Zifan Wang; Piotr Mardziel; Anupam Datta

arXiv:2011.00740·cs.CL·December 2, 2021

Influence Patterns for Explaining Information Flow in BERT

Kaiji Lu, Zifan Wang, Piotr Mardziel, Anupam Datta

PDF

Open Access 1 Video

TL;DR

This paper introduces influence patterns to explain how information flows in BERT, revealing that skip connections play a major role and that pattern consistency correlates with model performance.

Contribution

The paper proposes influence patterns as a novel method to analyze information flow in BERT, highlighting the importance of skip connections and pattern consistency.

Findings

01

Significant information flow occurs through skip connections.

02

Pattern consistency correlates with BERT's performance.

03

Patterns explain more of the model's behavior than previous methods.

Abstract

While attention is all you need may be proving true, we do not know why: attention-based transformer models such as BERT are superior but how information flows from input tokens to output predictions are unclear. We introduce influence patterns, abstractions of sets of paths through a transformer model. Patterns quantify and localize the flow of information to paths passing through a sequence of model nodes. Experimentally, we find that significant portion of information flow in BERT goes through skip connections instead of attention heads. We further show that consistency of patterns across instances is an indicator of BERT's performance. Finally, We demonstrate that patterns account for far more model performance than previous attention-based and layer-based methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Influence Patterns for Explaining Information Flow in BERT· slideslive

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Data Stream Mining Techniques

MethodsLinear Layer · Dense Connections · Refunds@Expedia|||How do I get a full refund from Expedia? · Layer Normalization · WordPiece · Attention Is All You Need · Linear Warmup With Linear Decay · Attention Dropout · Weight Decay · Adam