Evaluating the Effectiveness of Pre-trained Language Models in Predicting the Helpfulness of Online Product Reviews
Ali Boluki, Javad Pourmostafa Roshan Sharami, Dimitar Shterionov

TL;DR
This study evaluates the effectiveness of pre-trained language models, specifically RoBERTa and XLM-R, in predicting the helpfulness of online product reviews, demonstrating their superiority over traditional methods and exploring multilingual capabilities.
Contribution
It provides a comprehensive comparison of monolingual and multilingual pre-trained language models for review helpfulness prediction, including the first assessment of multilingual models in this context.
Findings
Pre-trained language models outperform baseline by up to 23% in RMSE.
Multilingual models may not be effective when fine-tuned on a single language.
Adding features like reviewer ratings improves prediction accuracy.
Abstract
Businesses and customers can gain valuable information from product reviews. The sheer number of reviews often necessitates ranking them based on their potential helpfulness. However, only a few reviews ever receive any helpfulness votes on online marketplaces. Sorting all reviews based on the few existing votes can cause helpful reviews to go unnoticed because of the limited attention span of readers. The problem of review helpfulness prediction is even more important for higher review volumes, and newly written reviews or launched products. In this work we compare the use of RoBERTa and XLM-R language models to predict the helpfulness of online product reviews. The contributions of our work in relation to literature include extensively investigating the efficacy of state-of-the-art language models -- both monolingual and multilingual -- against a robust baseline, taking ranking…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Digital Marketing and Social Media · Spam and Phishing Detection
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Weight Decay · Dropout · Linear Warmup With Linear Decay · Attention Dropout · Residual Connection · Layer Normalization · WordPiece · Softmax
