ArabGlossBERT: Fine-Tuning BERT on Context-Gloss Pairs for WSD

Moustafa Al-Hajj; Mustafa Jarrar

arXiv:2205.09685·cs.CL·May 20, 2022

TL;DR

This paper fine-tunes Arabic BERT models for Word Sense Disambiguation (WSD) by framing it as sentence-pair binary classification over a dataset of about 167k labeled context-gloss pairs, reaching 84% accuracy.

Contribution

The paper introduces a new dataset of labeled Arabic context-gloss pairs and shows that fine-tuning BERT on it is effective for Arabic WSD.

Findings

01

Achieved 84% accuracy on Arabic WSD task

02

Constructed a dataset of about 167,000 labeled context-gloss pairs

03

Explored different supervised signals for target word emphasis

Abstract

Using pre-trained transformer models such as BERT has proven to be effective in many NLP tasks. This paper presents our work to fine-tune BERT models for Arabic Word Sense Disambiguation (WSD). We treated the WSD task as a sentence-pair binary classification task. First, we constructed a dataset of labeled Arabic context-gloss pairs (~167k pairs) that we extracted from the Arabic Ontology and the large lexicographic database available at Birzeit University. Each pair was labeled as True or False, and the target word in each context was identified and annotated. Second, we used this dataset to fine-tune three pre-trained Arabic BERT models. Third, we experimented with different supervised signals to emphasize target words in context. Our experiments achieved promising results (accuracy of 84%), although we used a large set of senses in the experiment.
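The context-gloss formulation in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the function names, the `[TGT]` / `[/TGT]` marker strings, and the English example sentence are all assumptions made for illustration, and the abstract does not say which signals the paper actually used. The sketch only shows how one context and several candidate glosses become True/False sentence pairs, and how a sense could be chosen from per-pair scores.

```python
from typing import List, Tuple


def mark_target(tokens: List[str], target_index: int,
                open_mark: str = "[TGT]", close_mark: str = "[/TGT]") -> str:
    """Wrap the target word in marker strings.

    The markers are hypothetical stand-ins for a "supervised signal"
    that emphasizes the target word in its context.
    """
    marked = (tokens[:target_index]
              + [open_mark, tokens[target_index], close_mark]
              + tokens[target_index + 1:])
    return " ".join(marked)


def build_pairs(tokens: List[str], target_index: int, glosses: List[str],
                correct_gloss_index: int) -> List[Tuple[str, str, bool]]:
    """Pair one marked context with every candidate gloss.

    The pair with the correct gloss is labeled True, all others False,
    which turns WSD into sentence-pair binary classification.
    """
    context = mark_target(tokens, target_index)
    return [(context, gloss, i == correct_gloss_index)
            for i, gloss in enumerate(glosses)]


def pick_sense(true_scores: List[float]) -> int:
    """Return the index of the gloss whose pair scored highest for True."""
    return max(range(len(true_scores)), key=true_scores.__getitem__)


if __name__ == "__main__":
    tokens = "He sat on the bank of the river".split()
    glosses = ["a financial institution",
               "sloping land beside a body of water"]
    for context, gloss, label in build_pairs(tokens, 4, glosses, 1):
        print(label, "|", context, "|", gloss)
```

At inference time, a fine-tuned classifier would score each (context, gloss) pair, and `pick_sense` would select the gloss with the highest True score; the scoring model itself is outside this sketch.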

Code & Models

SinaLab/ArabGlossBERT (Hugging Face model)

Taxonomy

Methods: Linear Layer · Weight Decay · Attention Is All You Need · Multi-Head Attention · Attention Dropout · Dropout · Softmax · Layer Normalization · WordPiece