Evaluating the Effectiveness of Pre-trained Language Models in   Predicting the Helpfulness of Online Product Reviews

Ali Boluki; Javad Pourmostafa Roshan Sharami; Dimitar Shterionov

arXiv:2302.10199·cs.CL·February 27, 2024·1 cites

Evaluating the Effectiveness of Pre-trained Language Models in Predicting the Helpfulness of Online Product Reviews

Ali Boluki, Javad Pourmostafa Roshan Sharami, Dimitar Shterionov

PDF

Open Access 1 Repo

TL;DR

This study evaluates the effectiveness of pre-trained language models, specifically RoBERTa and XLM-R, in predicting the helpfulness of online product reviews, demonstrating their superiority over traditional methods and exploring multilingual capabilities.

Contribution

It provides a comprehensive comparison of monolingual and multilingual pre-trained language models for review helpfulness prediction, including the first assessment of multilingual models in this context.

Findings

01

Pre-trained language models outperform baseline by up to 23% in RMSE.

02

Multilingual models may not be effective when fine-tuned on a single language.

03

Adding features like reviewer ratings improves prediction accuracy.

Abstract

Businesses and customers can gain valuable information from product reviews. The sheer number of reviews often necessitates ranking them based on their potential helpfulness. However, only a few reviews ever receive any helpfulness votes on online marketplaces. Sorting all reviews based on the few existing votes can cause helpful reviews to go unnoticed because of the limited attention span of readers. The problem of review helpfulness prediction is even more important for higher review volumes, and newly written reviews or launched products. In this work we compare the use of RoBERTa and XLM-R language models to predict the helpfulness of online product reviews. The contributions of our work in relation to literature include extensively investigating the efficacy of state-of-the-art language models -- both monolingual and multilingual -- against a robust baseline, taking ranking…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alibol/dss-project
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Digital Marketing and Social Media · Spam and Phishing Detection

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Weight Decay · Dropout · Linear Warmup With Linear Decay · Attention Dropout · Residual Connection · Layer Normalization · WordPiece · Softmax