Text Classification for Predicting Multi-level Product Categories
Hadi Jahanshahi, Ozan Ozyegen, Mucahit Cevik, Beste Bulut, Deniz, Yigit, Fahrettin F. Gonen, Ay\c{s}e Ba\c{s}ar

TL;DR
This paper compares various text classification models for multi-level grocery product categorization, demonstrating that neural networks with dynamic masking and bilingual data improve accuracy and generalizability.
Contribution
It provides a comprehensive comparison of traditional and recent models, introduces dynamic masking for pretrained models, and evaluates multilingual data for product classification.
Findings
Neural network models outperform SVM and XGBoost.
Dynamic masking improves prediction accuracy.
Bilingual titles enhance model performance.
Abstract
In an online shopping platform, a detailed classification of the products facilitates user navigation. It also helps online retailers keep track of the price fluctuations in a certain industry or special discounts on a specific product category. Moreover, an automated classification system may help to pinpoint incorrect or subjective categories suggested by an operator. In this study, we focus on product title classification of the grocery products. We perform a comprehensive comparison of six different text classification models to establish a strong baseline for this task, which involves testing both traditional and recent machine learning methods. In our experiments, we investigate the generalizability of the trained models to the products of other online retailers, the dynamic masking of infeasible subcategories for pretrained language models, and the benefits of incorporating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Text and Document Classification Technologies · Sentiment Analysis and Opinion Mining
MethodsSupport Vector Machine
