Word-Class Embeddings for Multiclass Text Classification

Alejandro Moreo; Andrea Esuli; Fabrizio Sebastiani

arXiv:1911.11506·cs.LG·September 22, 2021

Word-Class Embeddings for Multiclass Text Classification

Alejandro Moreo, Andrea Esuli, Fabrizio Sebastiani

PDF

2 Repos

TL;DR

This paper introduces supervised word-class embeddings that, when combined with pre-trained embeddings, improve deep learning models' accuracy in multiclass text classification across various datasets and architectures.

Contribution

The paper proposes supervised word-class embeddings (WCEs) that enhance pre-trained embeddings, significantly improving multiclass text classification performance.

Findings

01

WCEs improve classification accuracy across four neural architectures.

02

WCEs show consistent gains on six public datasets.

03

Code implementation is publicly available.

Abstract

Pre-trained word embeddings encode general word semantics and lexical regularities of natural language, and have proven useful across many NLP tasks, including word sense disambiguation, machine translation, and sentiment analysis, to name a few. In supervised tasks such as multiclass text classification (the focus of this article) it seems appealing to enhance word representations with ad-hoc embeddings that encode task-specific information. We propose (supervised) word-class embeddings (WCEs), and show that, when concatenated to (unsupervised) pre-trained word embeddings, they substantially facilitate the training of deep-learning models in multiclass classification by topic. We show empirical evidence that WCEs yield a consistent improvement in multiclass classification accuracy, using four popular neural architectures and six widely used and publicly available datasets for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.