Expert Routing with Synthetic Data for Continual Learning

Yewon Byun; Sanket Vaibhav Mehta; Saurabh Garg; Emma Strubell; Michael Oberst; Bryan Wilder; Zachary C. Lipton

arXiv:2412.17009·cs.LG·August 29, 2025

Expert Routing with Synthetic Data for Continual Learning

Yewon Byun, Sanket Vaibhav Mehta, Saurabh Garg, Emma Strubell, Michael Oberst, Bryan Wilder, Zachary C. Lipton

PDF

Open Access

TL;DR

This paper introduces G2D, a continual learning method that uses synthetic data to train a domain discriminator, enabling effective expert routing across domains without data sharing, outperforming existing methods in vision and language tasks.

Contribution

Proposes G2D, a novel synthetic data-based domain-discriminator for expert routing in continual learning, demonstrating superior performance over existing methods.

Findings

01

G2D outperforms competitive domain-incremental methods.

02

Synthetic data is more effective for training domain discriminators than direct classifier training.

03

Applicable to both vision and language modalities.

Abstract

In many real-world settings, regulations and economic incentives permit the sharing of models but not data across institutional boundaries. In such scenarios, practitioners might hope to adapt models to new domains, without losing performance on previous domains (so-called catastrophic forgetting). While any single model may struggle to achieve this goal, learning an ensemble of domain-specific experts offers the potential to adapt more closely to each individual institution. However, a core challenge in this context is determining which expert to deploy at test time. In this paper, we propose Generate to Discriminate (G2D), a domain-incremental continual learning method that leverages synthetic data to train a domain-discriminator that routes samples at inference time to the appropriate expert. Surprisingly, we find that leveraging synthetic data in this capacity is more effective than…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExpert finding and Q&A systems