Adversarial Mixture Of Experts with Category Hierarchy Soft Constraint
Zhuojian Xiao, Yunjiang jiang, Guoyu Tang, Lin Liu, Sulong Xu, Yun, Xiao, Weipeng Yan

TL;DR
This paper introduces an adversarial mixture of experts model with hierarchical category constraints for e-commerce product ranking, improving specialization and data sharing across categories.
Contribution
It proposes a novel MoE framework with explicit category connections, adversarial regularization among experts, and hierarchy-based soft gating to enhance model specialization and data efficiency.
Findings
Experts tend to specialize in different domain aspects.
Hierarchy-based gating improves data sharing among categories.
Model achieves better ranking performance across categories.
Abstract
Product search is the most common way for people to satisfy their shopping needs on e-commerce websites. Products are typically annotated with one of several broad categorical tags, such as "Clothing" or "Electronics", as well as finer-grained categories like "Refrigerator" or "TV", both under "Electronics". These tags are used to construct a hierarchy of query categories. Distributions of features such as price and brand popularity vary wildly across query categories. In addition, feature importance for the purpose of CTR/CVR predictions differs from one category to another. In this work, we leverage the Mixture of Expert (MoE) framework to learn a ranking model that specializes for each query category. In particular, our gate network relies solely on the category ids extracted from the user query. While classical MoE's pick expert towers spontaneously for each input example, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
