# SenseBERT: Driving Some Sense into BERT

**Authors:** Yoav Levine, Barak Lenz, Or Dagan, Ori Ram, Dan Padnos, Or Sharir,, Shai Shalev-Shwartz, Amnon Shashua, Yoav Shoham

arXiv: 1908.05646 · 2020-05-19

## TL;DR

SenseBERT enhances BERT by incorporating word sense information through weak supervision at the lexical-semantic level, leading to improved lexical understanding without human annotation.

## Contribution

It introduces SenseBERT, a model that predicts masked words and their WordNet supersenses, integrating semantic sense information into pre-training.

## Key findings

- Improved performance on Word Sense Disambiguation
- State-of-the-art results on Word in Context task
- Enhanced lexical-semantic understanding

## Abstract

The ability to learn from large unlabeled corpora has allowed neural language models to advance the frontier in natural language understanding. However, existing self-supervision techniques operate at the word form level, which serves as a surrogate for the underlying semantic content. This paper proposes a method to employ weak-supervision directly at the word sense level. Our model, named SenseBERT, is pre-trained to predict not only the masked words but also their WordNet supersenses. Accordingly, we attain a lexical-semantic level language model, without the use of human annotation. SenseBERT achieves significantly improved lexical understanding, as we demonstrate by experimenting on SemEval Word Sense Disambiguation, and by attaining a state of the art result on the Word in Context task.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1908.05646/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1908.05646/full.md

## References

39 references — full list in the complete paper: https://tomesphere.com/paper/1908.05646/full.md

---
Source: https://tomesphere.com/paper/1908.05646