Multi-output Headed Ensembles for Product Item Classification
Hotaka Shiokawa, Pradipto Das, Arthur Toth, Justin Chiu

TL;DR
This paper introduces a deep learning ensemble framework for large-scale e-commerce product classification, leveraging metadata and fusion techniques, and proposes a novel user session-based evaluation method to improve model assessment.
Contribution
The paper presents an extensible ensemble classification framework that combines multiple models and metadata, along with a new user session-based evaluation approach for large-scale e-commerce catalogs.
Findings
Ensemble models outperform single classifiers in accuracy.
Metadata features significantly boost classification performance.
User session-based evaluation provides better insights than traditional metrics.
Abstract
In this paper, we revisit the problem of product item classification for large-scale e-commerce catalogs. The taxonomy of e-commerce catalogs consists of thousands of genres to which are assigned items that are uploaded by merchants on a continuous basis. The genre assignments by merchants are often wrong but treated as ground truth labels in automatically generated training sets, thus creating a feedback loop that leads to poorer model quality over time. This problem of taxonomy classification becomes highly pronounced due to the unavailability of sizable curated training sets. Under such a scenario it is common to combine multiple classifiers to combat poor generalization performance from a single classifier. We propose an extensible deep learning based classification model framework that benefits from the simplicity and robustness of averaging ensembles and fusion based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Data Mining and Analysis · Text and Document Classification Technologies
