A Simple and Effective Approach for Fine Tuning Pre-trained Word   Embeddings for Improved Text Classification

Amr Al-Khatib; Samhaa R. El-Beltagy

arXiv:1908.02579·cs.CL·December 17, 2019

A Simple and Effective Approach for Fine Tuning Pre-trained Word Embeddings for Improved Text Classification

Amr Al-Khatib, Samhaa R. El-Beltagy

PDF

Open Access 1 Repo

TL;DR

This paper introduces a straightforward method for fine-tuning pre-trained word embeddings by incorporating class information, enhancing their discriminative power for text classification tasks across multiple datasets.

Contribution

The proposed approach uniquely integrates class context into word embeddings during fine-tuning, improving their effectiveness for text classification.

Findings

01

Significant improvement in classification accuracy across datasets

02

Effective for both Arabic and English text classification

03

Enhances word vector discriminability within classes

Abstract

This work presents a new and simple approach for fine-tuning pretrained word embeddings for text classification tasks. In this approach, the class in which a term appears, acts as an additional contextual variable during the fine tuning process, and contributes to the final word vector for that term. As a result, words that are used distinctively within a particular class, will bear vectors that are closer to each other in the embedding space and will be more discriminative towards that class. To validate this novel approach, it was applied to three Arabic and two English datasets that have been previously used for text classification tasks such as sentiment analysis and emotion detection. In the vast majority of cases, the results obtained using the proposed approach, improved considerably.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

AmrMehasseb/Embeddings-finetuning
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Sentiment Analysis and Opinion Mining · Text and Document Classification Technologies