Learning to Decode Collaboratively with Multiple Language Models

Shannon Zejiang Shen; Hunter Lang; Bailin Wang; Yoon Kim; David Sontag

arXiv:2403.03870 · cs.CL · August 28, 2024 · 2 cites

TL;DR

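This paper introduces a method that teaches multiple large language models to collaborate during decoding by interleaving their generations at the token level. The base model learns when to generate a token itself and when to call an assistant model, without direct supervision of that choice, improving performance on instruction-following, domain-specific QA, and reasoning tasks.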

Contribution

The paper presents a latent variable model in which the choice of which LLM generates the next token is treated as latent, so the base LLM learns when to generate a token itself and when to call an assistant model.
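As a rough illustration of token-level collaboration, the sketch below interleaves two toy "models" under a gate that decides, per step, who emits the next token. Everything here is hypothetical: the function names, the callable interfaces, the fixed `threshold` decision rule, and the toy stand-ins are assumptions for illustration, not the paper's implementation or the API of the clinicalml/co-llm repository.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical interfaces: a "model" maps a token prefix to the next token;
# the gate maps a prefix to the probability of deferring to the assistant.
NextToken = Callable[[Sequence[str]], str]
DeferProb = Callable[[Sequence[str]], float]


def collaborative_decode(
    prompt: Sequence[str],
    base: NextToken,
    assistant: NextToken,
    defer_prob: DeferProb,
    threshold: float = 0.5,  # assumed decision rule, not taken from the source
    max_new_tokens: int = 16,
    eos: str = "<eos>",
) -> List[Tuple[str, str]]:
    """Interleave two models token by token; returns (token, source) pairs."""
    prefix = list(prompt)
    trace: List[Tuple[str, str]] = []
    for _ in range(max_new_tokens):
        # The gate is consulted before every token, so control can switch
        # back and forth between the two models within one generation.
        use_assistant = defer_prob(prefix) > threshold
        source = "assistant" if use_assistant else "base"
        token = assistant(prefix) if use_assistant else base(prefix)
        trace.append((token, source))
        prefix.append(token)
        if token == eos:
            break
    return trace


# Toy stand-ins for real LLMs (illustrative only).
def toy_base(prefix: Sequence[str]) -> str:
    return "<eos>" if prefix[-1] == "Paris" else "is"


def toy_assistant(prefix: Sequence[str]) -> str:
    return "Paris"


def toy_gate(prefix: Sequence[str]) -> float:
    # Defer right after "is", where a factual token is expected.
    return 0.9 if prefix[-1] == "is" else 0.1


trace = collaborative_decode(
    ["The", "capital", "of", "France"], toy_base, toy_assistant, toy_gate
)
print(trace)
```

In this toy run the base model writes "is", the gate hands the next token to the assistant, and control then returns to the base model, which ends the sequence.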

Findings

01 Improved performance on instruction-following tasks

02 Effective domain-specific question answering

03 Enhanced reasoning through collaborative decoding

Abstract

We propose a method to teach multiple large language models (LLMs) to collaborate by interleaving their generations at the token level. We model the decision of which LLM generates the next token as a latent variable. By optimizing the marginal likelihood of a training set under our latent variable model, the base LLM automatically learns when to generate itself and when to call on one of the "assistant" language models to generate, all without direct supervision. Token-level collaboration during decoding allows for a fusion of each model's expertise in a manner tailored to the specific task at hand. Our collaborative decoding is especially useful in cross-domain settings where a generalist base LLM learns to invoke domain expert models. On instruction-following, domain-specific QA, and reasoning tasks, we show that the performance of the joint system exceeds that of the individual…
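One way to read the marginal-likelihood objective described in the abstract is sketched below: for each token, the probability each model assigns to the observed token is weighted by the probability that the latent variable selects that model, and the weighted terms are summed before taking the log. This is a minimal sketch under assumptions: the per-token factorization, the function names, and the toy numbers are illustrative and are not taken from the paper's code.

```python
import math
from typing import Sequence


def token_marginal_log_likelihood(
    gate_probs: Sequence[float],   # assumed P(Z_t = i | prefix); should sum to 1
    token_probs: Sequence[float],  # assumed P_i(x_t | prefix) under model i
) -> float:
    """Log of the token probability with the model choice marginalized out."""
    return math.log(sum(g * p for g, p in zip(gate_probs, token_probs)))


def sequence_nll(
    gates: Sequence[Sequence[float]],
    probs: Sequence[Sequence[float]],
) -> float:
    """Negative marginal log-likelihood of a sequence (the quantity to minimize)."""
    return -sum(
        token_marginal_log_likelihood(g, p) for g, p in zip(gates, probs)
    )


# Toy two-token example with a base model (index 0) and one assistant (index 1).
gates = [[0.8, 0.2], [0.3, 0.7]]   # gate distribution at each step
probs = [[0.5, 0.1], [0.05, 0.6]]  # each model's probability of the observed token
nll = sequence_nll(gates, probs)
print(round(nll, 4))
```

Because the model choice is summed out rather than labeled, no annotation of which model "should" produce each token is needed; in a setup like this, gradients through the gate probabilities would push it toward whichever model explains the observed token better.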


Code & Models

Repositories

clinicalml/co-llm (PyTorch, official)

Videos

Learning to Decode Collaboratively with Multiple Language Models · Underline

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling

Methods: Sparse Evolutionary Training · Balanced Selection