Interpreting BERT-based Text Similarity via Activation and Saliency Maps
Itzik Malkiel, Dvir Ginzburg, Oren Barkan, Avi Caciularu, Jonathan, Weill, Noam Koenigstein

TL;DR
This paper introduces an unsupervised method to interpret BERT-based text similarity by identifying key words and matching pairs that explain the semantic similarity between paragraph pairs, validated through human evaluations.
Contribution
It presents a novel unsupervised approach for explaining BERT-based paragraph similarity using activation and saliency maps, addressing interpretability in complex, long texts.
Findings
High correlation with human judgments
Effective on long and complex paragraphs
Provides meaningful word-level explanations
Abstract
Recently, there has been growing interest in the ability of Transformer-based models to produce meaningful embeddings of text with several applications, such as text similarity. Despite significant progress in the field, the explanations for similarity predictions remain challenging, especially in unsupervised settings. In this work, we present an unsupervised technique for explaining paragraph similarities inferred by pre-trained BERT models. By looking at a pair of paragraphs, our technique identifies important words that dictate each paragraph's semantics, matches between the words in both paragraphs, and retrieves the most important pairs that explain the similarity between the two. The method, which has been assessed by extensive human evaluations and demonstrated on datasets comprising long and complex paragraphs, has shown great promise, providing accurate interpretations that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
MethodsMulti-Head Attention · Linear Layer · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Linear Warmup With Linear Decay · Softmax · Residual Connection · Attention Is All You Need · Dense Connections · Weight Decay
