Rethinking Attribute Representation and Injection for Sentiment   Classification

Reinald Kim Amplayo

arXiv:1908.09590·cs.CL·August 27, 2019

Rethinking Attribute Representation and Injection for Sentiment Classification

Reinald Kim Amplayo

PDF

1 Repo

TL;DR

This paper challenges standard attribute injection methods in sentiment classification, proposing a new representation and injection strategy that significantly improves performance with a simple BiLSTM model.

Contribution

It introduces a novel attribute representation as chunk-wise importance weights and identifies optimal injection locations, outperforming prior complex architectures.

Findings

01

Attributes are best injected at the embedding or encoding stage.

02

Attention mechanism is the worst location for attribute injection.

03

Proposed method outperforms state-of-the-art models.

Abstract

Text attributes, such as user and product information in product reviews, have been used to improve the performance of sentiment classification models. The de facto standard method is to incorporate them as additional biases in the attention mechanism, and more performance gains are achieved by extending the model architecture. In this paper, we show that the above method is the least effective way to represent and inject attributes. To demonstrate this hypothesis, unlike previous models with complicated architectures, we limit our base model to a simple BiLSTM with attention classifier, and instead focus on how and where the attributes should be incorporated in the model. We propose to represent attributes as chunk-wise importance weight matrices and consider four locations in the model (i.e., embedding, encoding, attention, classifier) to inject attributes. Experiments show that our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rktamplayo/CHIM
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Bidirectional LSTM