Improving Bi-LSTM Performance for Indonesian Sentiment Analysis Using   Paragraph Vector

Ayu Purwarianti (1); Ida Ayu Putu Ari Crisdayanti (1) ((1) Institut; Teknologi Bandung)

arXiv:2009.05720·cs.CL·September 15, 2020

Improving Bi-LSTM Performance for Indonesian Sentiment Analysis Using Paragraph Vector

Ayu Purwarianti (1), Ida Ayu Putu Ari Crisdayanti (1) ((1) Institut, Teknologi Bandung)

PDF

TL;DR

This paper enhances Indonesian sentiment analysis by combining Bi-LSTM with paragraph vectors, addressing sequence bias and improving classification accuracy for sentiment expressions not located at document edges.

Contribution

It introduces a novel combination of Bi-LSTM with paragraph vectors for Indonesian sentiment analysis, improving performance and handling phrase position issues.

Findings

01

Significant performance improvement over baseline models

02

Effective handling of sentiment phrases outside document edges

03

Better differentiation of ambiguous Indonesian words

Abstract

Bidirectional Long Short-Term Memory Network (Bi-LSTM) has shown promising performance in sentiment classification task. It processes inputs as sequence of information. Due to this behavior, sentiment predictions by Bi-LSTM were influenced by words sequence and the first or last phrases of the texts tend to have stronger features than other phrases. Meanwhile, in the problem scope of Indonesian sentiment analysis, phrases that express the sentiment of a document might not appear in the first or last part of the document that can lead to incorrect sentiment classification. To this end, we propose the using of an existing document representation method called paragraph vector as additional input features for Bi-LSTM. This vector provides information context of the document for each sequence processing. The paragraph vector is simply concatenated to each word vector of the document. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsMemory Network