R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts

Zhongyang Li; Ziyue Li; Tianyi Zhou

arXiv:2502.20395 · cs.LG · March 4, 2025

TL;DR

This paper introduces R2-T2, a test-time re-routing method for multimodal mixture-of-experts models that locally optimizes routing weights to enhance performance on diverse tasks without retraining the base model.

Contribution

The paper presents R2-T2, a test-time re-routing approach that improves multimodal MoE performance by locally optimizing the vector of routing weights for each test sample, without retraining the base model.

Findings

01. R2-T2 significantly boosts model performance on various benchmarks.

02. The method improves routing efficiency and task adaptability.

03. It achieves state-of-the-art results without retraining base models.

Abstract

In large multimodal models (LMMs), the perception of non-language modalities (e.g., visual representations) is usually not on par with the large language models (LLMs)' powerful reasoning capabilities, deterring LMMs' performance on challenging downstream tasks. This weakness has been recently mitigated by replacing the vision encoder with a mixture-of-experts (MoE), which provides rich, multi-granularity, and diverse representations required by diverse downstream tasks. The performance of multimodal MoE largely depends on its router, which reweights and mixes the representations of different experts for each input. However, we find that the end-to-end trained router does not always produce the optimal routing weights for every test sample. To bridge the gap, we propose a novel and efficient method "Re-Routing in Test-Time (R2-T2)" that locally optimizes the vector of routing weights in…
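The router's role described in the abstract can be made concrete with a small sketch. It shows only the general idea that a vector of routing weights mixes the representations of several experts, and that changing this vector at test time changes the mixed representation while the experts stay frozen. It is not the authors' implementation: the expert outputs, logits, and function names below are hypothetical toy values, and the way R2-T2 actually chooses the new weights is not shown because the abstract above is truncated before that point.

```python
import math

def softmax(logits):
    """Turn raw router scores into routing weights that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def mix_experts(weights, expert_outputs):
    """Weighted sum of expert representations (one vector per expert)."""
    dim = len(expert_outputs[0])
    return [
        sum(w * out[d] for w, out in zip(weights, expert_outputs))
        for d in range(dim)
    ]

# Hypothetical toy setup: three frozen experts, each emitting a 2-d representation.
expert_outputs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# Routing weights as a trained router might produce them (values are made up).
weights = softmax([2.0, 0.0, 0.0])
mixed = mix_experts(weights, expert_outputs)

# Re-routing at test time: only the weight vector changes, the experts do not.
# The new logits are arbitrary here; they stand in for whatever a test-time
# procedure would select.
rerouted_weights = softmax([0.0, 0.0, 2.0])
rerouted = mix_experts(rerouted_weights, expert_outputs)

print("original mix:", mixed)
print("re-routed mix:", rerouted)
```

Because the experts are untouched, the only quantity a test-time method of this kind needs to adjust is the short weight vector, which is why such re-routing can be done without retraining the base model.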

Code & Models

Repositories

tianyi-lab/R2-T2 (PyTorch, official)

Taxonomy

Topics: Speech and dialogue systems · Advanced Text Analysis Techniques

Methods: Mixture of Experts