BanglaEcomReviewCorpus: A dataset for e-commerce product review sentiment analysis
Umme Ayman, Md. Tanvir Ahmed Akash, Taslima Akhter, Saiham Zaman Mridul, Yadab Sutradhar

TL;DR
This paper introduces a new dataset of e-commerce product reviews in Bangla for analyzing customer sentiment and behavior.
Contribution
The paper presents a labeled Bangla e-commerce review dataset with balanced sentiment categories for NLP research.
Findings
The dataset contains 8,685 labeled reviews with balanced positive, negative, and neutral sentiments.
Statistical and linguistic analyses, including word clouds and n-grams, reveal the dataset's diversity and structure.
The dataset supports interdisciplinary research and AI training in consumer behavior analysis.
Abstract
Online shopping has become an integral part of modern life, connecting consumers to a wide array of products and services. Customer feedback plays a crucial role in shaping business strategies, enhancing service quality, and driving product innovation, making it essential for understanding consumer behaviour and preferences. For analysing those feedbacks, a dataset is Collected from popular websites such as Daraz, Bikroy.com, Picabbo, Shajgoj, and others, this dataset comprises 8685 labeled items reflecting diverse customer feedback. Sentiment categories include 3012 positive, 2881 negative, and 2792 neutral sentences, offering balanced representation for fair sentiment analysis. This dataset is ideal for natural language processing (NLP) tasks, enabling advanced sentiment analysis and exploration of consumer behaviour. It incorporates a range of statistical studies, including summary…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Spam and Phishing Detection · Text and Document Classification Technologies
