MGNC-CNN: A Simple Approach to Exploiting Multiple Word Embeddings for   Sentence Classification

Ye Zhang; Stephen Roller; Byron Wallace

arXiv:1603.00968·cs.CL·March 29, 2016·24 cites

MGNC-CNN: A Simple Approach to Exploiting Multiple Word Embeddings for Sentence Classification

Ye Zhang, Stephen Roller, Byron Wallace

PDF

Open Access

TL;DR

This paper presents MGNC-CNN, a simple and efficient CNN architecture that leverages multiple word embeddings for improved sentence classification, outperforming existing models with less training time.

Contribution

The paper introduces MGNC-CNN, a novel CNN model that independently processes multiple word embeddings and applies group regularization, offering a simpler and faster alternative to existing methods.

Findings

01

MGNC-CNN outperforms baseline models in sentence classification tasks.

02

The model requires less training time than comparable architectures.

03

It is flexible with input embeddings of different dimensions.

Abstract

We introduce a novel, simple convolution neural network (CNN) architecture - multi-group norm constraint CNN (MGNC-CNN) that capitalizes on multiple sets of word embeddings for sentence classification. MGNC-CNN extracts features from input embedding sets independently and then joins these at the penultimate layer in the network to form a final feature vector. We then adopt a group regularization strategy that differentially penalizes weights associated with the subcomponents generated from the respective embedding sets. This model is much simpler than comparable alternative architectures and requires substantially less training time. Furthermore, it is flexible in that it does not require input word embeddings to be of the same dimensionality. We show that MGNC-CNN consistently outperforms baseline models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research

MethodsConvolution