A Study of the Plausibility of Attention between RNN Encoders in Natural   Language Inference

Duc Hau Nguyen; Duc Hau Nguyen; Pascale S\'ebillot

arXiv:2501.13735·cs.CL·January 24, 2025

A Study of the Plausibility of Attention between RNN Encoders in Natural Language Inference

Duc Hau Nguyen, Duc Hau Nguyen, Pascale S\'ebillot

PDF

Open Access

TL;DR

This paper evaluates the plausibility of attention maps in RNN-based models for natural language inference, finding that heuristic-based annotations correlate better with human judgments than raw attention weights, raising questions about their interpretability.

Contribution

It provides a preliminary assessment of attention map plausibility in sentence comparison tasks, specifically in natural language inference, and compares heuristic and human annotations.

Findings

01

Heuristic annotations reasonably correlate with human judgments.

02

Raw attention weights are only loosely related to plausible explanations.

03

Attention maps' interpretability in NLI remains uncertain.

Abstract

Attention maps in neural models for NLP are appealing to explain the decision made by a model, hopefully emphasizing words that justify the decision. While many empirical studies hint that attention maps can provide such justification from the analysis of sound examples, only a few assess the plausibility of explanations based on attention maps, i.e., the usefulness of attention maps for humans to understand the decision. These studies furthermore focus on text classification. In this paper, we report on a preliminary assessment of attention maps in a sentence comparison task, namely natural language inference. We compare the cross-attention weights between two RNN encoders with human-based and heuristic-based annotations on the eSNLI corpus. We show that the heuristic reasonably correlates with human annotations and can thus facilitate evaluation of plausible explanations in sentence…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education · Multimodal Machine Learning Applications

MethodsSoftmax · Attention Is All You Need · Hierarchical Information Threading · Focus