Distillation-based Scenario-Adaptive Mixture-of-Experts for the Matching Stage of Multi-scenario Recommendation

Ruibing Wang; Shuhan Guo; Haotong Du; Quanming Yao

arXiv:2601.02368·cs.IR·January 7, 2026

Distillation-based Scenario-Adaptive Mixture-of-Experts for the Matching Stage of Multi-scenario Recommendation

Ruibing Wang, Shuhan Guo, Haotong Du, Quanming Yao

PDF

Open Access

TL;DR

This paper introduces DSMOE, a novel distillation-based approach with a scenario-adaptive module for multi-scenario recommendation, enhancing matching accuracy especially in data-sparse scenarios.

Contribution

It proposes DSMOE, combining a lightweight scenario-adaptive projection with cross-architecture knowledge distillation to improve multi-scenario matching performance.

Findings

01

Significantly improves retrieval quality in data-sparse scenarios

02

Outperforms existing methods on real-world datasets

03

Effectively prevents expert collapse in long-tail scenarios

Abstract

Multi-scenario recommendation is pivotal for optimizing user experience across diverse contexts. While Multi-gate Mixture-of-Experts (MMOE) thrives in ranking, its transfer to the matching stage is hindered by the blind optimization inherent to independent two-tower architectures and the parameter dominance of head scenarios. To address these structural and distributional bottlenecks, we propose Distillation-based Scenario-Adaptive Mixture-of-Experts (DSMOE). Specially, we devise a Scenario-Adaptive Projection (SAP) module to generate lightweight, context-specific parameters, effectively preventing expert collapse in long-tail scenarios. Concurrently, we introduce a cross-architecture knowledge distillation framework, where an interaction-aware teacher guides the two-tower student to capture complex matching patterns. Extensive experiments on real-world datasets demonstrate DSMOE's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Mobile Crowdsensing and Crowdsourcing