An End-to-End Solution for Named Entity Recognition in eCommerce Search
Xiang Cheng, Mitchell Bowden, Bhushan Ramesh Bhange, Priyanka Goyal,, Thomas Packer, Faizan Javed

TL;DR
This paper presents an end-to-end NER solution for eCommerce search using a novel TripleLearn training framework that significantly improves model performance and user engagement.
Contribution
The paper introduces TripleLearn, a new training framework that learns from three datasets simultaneously, enhancing NER accuracy in eCommerce search.
Findings
F1 score improved from 69.5 to 93.3 with TripleLearn.
Online A/B tests show increased user engagement and revenue.
Model has been operational on homedepot.com for over 9 months.
Abstract
Named entity recognition (NER) is a critical step in modern search query understanding. In the domain of eCommerce, identifying the key entities, such as brand and product type, can help a search engine retrieve relevant products and therefore offer an engaging shopping experience. Recent research shows promising results on shared benchmark NER tasks using deep learning methods, but there are still unique challenges in the industry regarding domain knowledge, training data, and model production. This paper demonstrates an end-to-end solution to address these challenges. The core of our solution is a novel model training framework "TripleLearn" which iteratively learns from three separate training datasets, instead of one training set as is traditionally done. Using this approach, the best model lifts the F1 score from 69.5 to 93.3 on the holdout test data. In our offline experiments,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Text and Document Classification Technologies · Natural Language Processing Techniques
