MoDEM: Mixture of Domain Expert Models

Toby Simonds; Kemal Kurniawan; Jey Han Lau

arXiv:2410.07490·cs.CL·October 11, 2024

MoDEM: Mixture of Domain Expert Models

Toby Simonds, Kemal Kurniawan, Jey Han Lau

PDF

Open Access

TL;DR

This paper introduces MoDEM, a system that uses a BERT-based router to direct prompts to domain-specific models, significantly improving performance and efficiency over general-purpose large language models.

Contribution

It presents a novel mixture of domain expert models with prompt routing, demonstrating improved performance and cost-efficiency over traditional large models.

Findings

01

Outperforms general-purpose models of similar size on benchmarks.

02

Reduces computational costs while maintaining high accuracy.

03

Supports a shift towards specialized, modular AI systems.

Abstract

We propose a novel approach to enhancing the performance and efficiency of large language models (LLMs) by combining domain prompt routing with domain-specialized models. We introduce a system that utilizes a BERT-based router to direct incoming prompts to the most appropriate domain expert model. These expert models are specifically tuned for domains such as health, mathematics and science. Our research demonstrates that this approach can significantly outperform general-purpose models of comparable size, leading to a superior performance-to-cost ratio across various benchmarks. The implications of this study suggest a potential paradigm shift in LLM development and deployment. Rather than focusing solely on creating increasingly large, general-purpose models, the future of AI may lie in developing ecosystems of smaller, highly specialized models coupled with sophisticated routing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSimulation Techniques and Applications · Scientific Computing and Data Management