The Devil is in the Details: Evaluating Limitations of Transformer-based   Methods for Granular Tasks

Brihi Joshi; Neil Shah; Francesco Barbieri; Leonardo Neves

arXiv:2011.01196·cs.CL·November 3, 2020

The Devil is in the Details: Evaluating Limitations of Transformer-based Methods for Granular Tasks

Brihi Joshi, Neil Shah, Francesco Barbieri, Leonardo Neves

PDF

1 Repo

TL;DR

This paper critically evaluates transformer-based models for granular textual similarity tasks, revealing their limitations compared to simple baselines and proposing a hybrid approach that improves performance.

Contribution

It identifies the limitations of transformer embeddings for granular tasks and introduces a simple method combining TF-IDF with contextual models to enhance accuracy.

Findings

01

Transformers excel at abstract semantic matching but underperform on granular tasks.

02

Simple TF-IDF baselines often outperform transformer embeddings in granular similarity.

03

Combining TF-IDF with contextual embeddings improves granular task performance by up to 36%.

Abstract

Contextual embeddings derived from transformer-based neural language models have shown state-of-the-art performance for various tasks such as question answering, sentiment analysis, and textual similarity in recent years. Extensive work shows how accurately such models can represent abstract, semantic information present in text. In this expository work, we explore a tangent direction and analyze such models' performance on tasks that require a more granular level of representation. We focus on the problem of textual similarity from two perspectives: matching documents on a granular level (requiring embeddings to capture fine-grained attributes in the text), and an abstract level (requiring embeddings to capture overall textual semantics). We empirically demonstrate, across two datasets from different domains, that despite high performance in abstract document matching as expected,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

brihijoshi/granular-similarity-COLING-2020
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.