Harnessing the Power of Multiple Minds: Lessons Learned from LLM Routing

KV Aditya Srivatsa; Kaushal Kumar Maurya; Ekaterina Kochmar

arXiv:2405.00467·cs.CL·May 2, 2024

Harnessing the Power of Multiple Minds: Lessons Learned from LLM Routing

KV Aditya Srivatsa, Kaushal Kumar Maurya, Ekaterina Kochmar

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper investigates the feasibility of directing each query to the most suitable large language model (LLM) for complex reasoning tasks, highlighting potential and limitations of LLM routing.

Contribution

It introduces the concept of LLM routing for challenging reasoning tasks and provides experimental insights into its effectiveness and challenges.

Findings

01

LLM routing shows promise for complex reasoning tasks.

02

Routing is not always feasible across all scenarios.

03

Further research needed for more robust approaches.

Abstract

With the rapid development of LLMs, it is natural to ask how to harness their capabilities efficiently. In this paper, we explore whether it is feasible to direct each input query to a single most suitable LLM. To this end, we propose LLM routing for challenging reasoning tasks. Our extensive experiments suggest that such routing shows promise but is not feasible in all scenarios, so more robust approaches should be investigated to fill this gap.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kvadityasrivatsa/llm-routing
pytorchOfficial

Videos

Harnessing the Power of Multiple Minds: Lessons Learned from LLM Routing· underline

Taxonomy

TopicsDigital Rights Management and Security