DeepCAT: Deep Category Representation for Query Understanding in E-commerce Search
Ali Ahmadvand, Surya Kallumadi, Faizan Javed, and Eugene Agichtein

TL;DR
DeepCAT is a deep learning model that improves query understanding in e-commerce search by learning joint word-category representations, especially enhancing performance on minority and tail queries.
Contribution
It introduces a novel word-category representation model and a new loss function to better handle class imbalance and tail queries in e-commerce search.
Findings
10% improvement on minority classes
7.1% improvement on tail queries
Effective semantic modeling of taxonomy hierarchies
Abstract
Mapping a search query to a set of relevant categories in the product taxonomy is a significant challenge in e-commerce search for two reasons: 1) Training data exhibits severe class imbalance problem due to biased click behavior, and 2) queries with little customer feedback (e.g., tail queries) are not well-represented in the training set, and cause difficulties for query understanding. To address these problems, we propose a deep learning model, DeepCAT, which learns joint word-category representations to enhance the query understanding process. We believe learning category interactions helps to improve the performance of category mapping on minority classes, tail and torso queries. DeepCAT contains a novel word-category representation model that trains the category representations based on word-category co-occurrences in the training set. The category representation is then leveraged…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Topic Modeling · Machine Learning in Bioinformatics
