TL;DR
This paper critically evaluates transformer-based models for granular textual similarity tasks, revealing their limitations compared to simple baselines and proposing a hybrid approach that improves performance.
Contribution
It identifies the limitations of transformer embeddings for granular tasks and introduces a simple method combining TF-IDF with contextual models to enhance accuracy.
Findings
Transformers excel at abstract semantic matching but underperform on granular tasks.
Simple TF-IDF baselines often outperform transformer embeddings in granular similarity.
Combining TF-IDF with contextual embeddings improves granular task performance by up to 36%.
Abstract
Contextual embeddings derived from transformer-based neural language models have shown state-of-the-art performance for various tasks such as question answering, sentiment analysis, and textual similarity in recent years. Extensive work shows how accurately such models can represent abstract, semantic information present in text. In this expository work, we explore a tangent direction and analyze such models' performance on tasks that require a more granular level of representation. We focus on the problem of textual similarity from two perspectives: matching documents on a granular level (requiring embeddings to capture fine-grained attributes in the text), and an abstract level (requiring embeddings to capture overall textual semantics). We empirically demonstrate, across two datasets from different domains, that despite high performance in abstract document matching as expected,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
