# Word Usage Similarity Estimation with Sentence Representations and   Automatic Substitutes

**Authors:** Aina Gar\'i Soler, Marianna Apidianaki, Alexandre Allauzen

arXiv: 1905.08377 · 2019-05-22

## TL;DR

This paper introduces supervised models utilizing contextualized embeddings and lexical substitutes to estimate word usage similarity across contexts, outperforming previous methods on benchmark datasets.

## Contribution

It presents a novel approach combining contextualized embeddings with lexical substitute annotations for improved usage similarity estimation.

## Key findings

- Supervised models outperform previous methods in similarity tasks.
- Contextualized embeddings like BERT and ELMo enhance prediction accuracy.
- Lexical substitute annotations improve model performance.

## Abstract

Usage similarity estimation addresses the semantic proximity of word instances in different contexts. We apply contextualized (ELMo and BERT) word and sentence embeddings to this task, and propose supervised models that leverage these representations for prediction. Our models are further assisted by lexical substitute annotations automatically assigned to word instances by context2vec, a neural model that relies on a bidirectional LSTM. We perform an extensive comparison of existing word and sentence representations on benchmark datasets addressing both graded and binary similarity. The best performing models outperform previous methods in both settings.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.08377/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/1905.08377/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/1905.08377/full.md

---
Source: https://tomesphere.com/paper/1905.08377