Orchestrating Heterogeneous Experts: A Scalable MoE Framework with Anisotropy-Preserving Fusion
Ye Liu, Xu Chen, Wuji Chen, Mang Li

TL;DR
This paper introduces a scalable Mixture-of-Experts framework that dynamically orchestrates heterogeneous language models for cross-border e-commerce search relevance, significantly improving accuracy and efficiency.
Contribution
It proposes a novel MoE framework with an anisotropy-preserving fusion strategy that leverages diverse open-source LLMs without extensive pre-training.
Findings
Achieves 0.72% AUC improvement over dense baselines.
Increases query processing speed by 9%.
Effectively handles linguistic diversity in Southeast Asian markets.
Abstract
In cross-border e-commerce, search relevance modeling faces the dual challenge of extreme linguistic diversity and fine-grained semantic nuances. Existing approaches typically rely on scaling up a single monolithic Large Language Model (LLM). However, our empirical analysis reveals that single models suffer from uneven capability distributions across regions. For example, excelling in English while underperforming in specific Southeast Asian languages. In this work, we shift the paradigm from scaling a single model to orchestrating heterogeneous experts. We propose a scalable Coarse-grained Mixture-of-Experts (MoE) framework that leverages the inherent complementarity of distinct open-source LLMs (e.g., Qwen, Gemma) without expensive pre-training. Unlike standard token-level MoE, our framework dynamically routes entire queries to specialized experts and, crucially, employs an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Web Data Mining and Analysis · Big Data and Digital Economy
