Interpretable Semantic Textual Similarity: Finding and explaining differences between sentences
I. Lopez-Gazpio, M. Maritxalar, A. Gonzalez-Agirre, G. Rigau, and L. Uria, E. Agirre

TL;DR
This paper introduces an interpretable layer for Semantic Textual Similarity that aligns sentence segments with labels and scores, enabling explanations of sentence differences and similarities, validated through a new dataset and user studies.
Contribution
It formalizes an interpretability framework for STS via segment alignments, provides a new annotated dataset, and develops a system that explains sentence similarities and differences.
Findings
System outperforms baseline on the dataset
User studies show explanations improve understanding
The dataset enables training interpretable STS models
Abstract
User acceptance of artificial intelligence agents might depend on their ability to explain their reasoning, which requires adding an interpretability layer that fa- cilitates users to understand their behavior. This paper focuses on adding an in- terpretable layer on top of Semantic Textual Similarity (STS), which measures the degree of semantic equivalence between two sentences. The interpretability layer is formalized as the alignment between pairs of segments across the two sentences, where the relation between the segments is labeled with a relation type and a similarity score. We present a publicly available dataset of sentence pairs annotated following the formalization. We then develop a system trained on this dataset which, given a sentence pair, explains what is similar and different, in the form of graded and typed segment alignments. When evaluated on the dataset, the system…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
