BLR-MoE: Boosted Language-Routing Mixture of Experts for Domain-Robust   Multilingual E2E ASR

Guodong Ma; Wenxuan Wang; Lifeng Zhou; Yuting Yang; Yuke Li; Binbin Du

arXiv:2501.12602·cs.CL·January 23, 2025

BLR-MoE: Boosted Language-Routing Mixture of Experts for Domain-Robust Multilingual E2E ASR

Guodong Ma, Wenxuan Wang, Lifeng Zhou, Yuting Yang, Yuke Li, Binbin Du

PDF

Open Access

TL;DR

This paper introduces BLR-MoE, an advanced architecture for multilingual end-to-end ASR that reduces language confusion through attention-MoE and improved routing, enhancing robustness in domain-mismatched scenarios.

Contribution

It proposes a novel BLR-MoE architecture with attention-MoE and improved routing techniques to better handle language confusion in multilingual ASR.

Findings

01

BLR-MoE outperforms previous models on a 10,000-hour dataset.

02

Attention-MoE reduces language confusion in self-attention.

03

Expert pruning and router augmentation improve routing robustness.

Abstract

Recently, the Mixture of Expert (MoE) architecture, such as LR-MoE, is often used to alleviate the impact of language confusion on the multilingual ASR (MASR) task. However, it still faces language confusion issues, especially in mismatched domain scenarios. In this paper, we decouple language confusion in LR-MoE into confusion in self-attention and router. To alleviate the language confusion in self-attention, based on LR-MoE, we propose to apply attention-MoE architecture for MASR. In our new architecture, MoE is utilized not only on feed-forward network (FFN) but also on self-attention. In addition, to improve the robustness of the LID-based router on language confusion, we propose expert pruning and router augmentation methods. Combining the above, we get the boosted language-routing MoE (BLR-MoE) architecture. We verify the effectiveness of the proposed BLR-MoE in a 10,000-hour…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExpert finding and Q&A systems · Topic Modeling · Domain Adaptation and Few-Shot Learning

MethodsMixture of Experts · Pruning