Cross-model Control: Improving Multiple Large Language Models in One-time Training

Jiayi Wu; Hao Sun; Hengyi Cai; Lixin Su; Shuaiqiang Wang; Dawei Yin; Xiang Li; Ming Gao

arXiv:2410.17599·cs.CL·October 24, 2024

TL;DR

This paper introduces Cross-model Control (CMC), a method that trains a portable tiny language model once, alongside a frozen template LLM, and then reuses it to control multiple large language models, reducing training costs while still enabling model-specific adjustments.

Contribution

The paper presents a novel approach that leverages a tiny language model to control multiple LLMs in one training process, with a new token mapping strategy for different vocabularies.

Findings

01. Effective control of multiple LLMs demonstrated in experiments.

02. Significant reduction in training costs for fine-tuning models.

03. Versatile application to instruction tuning and unlearning tasks.

Abstract

The number of large language models (LLMs) with varying parameter scales and vocabularies is increasing. While they deliver powerful performance, they also face a set of common optimization needs to meet specific requirements or standards, such as instruction following or avoiding the output of sensitive information from the real world. However, how to reuse the fine-tuning outcomes of one model to other models to reduce training costs remains a challenge. To bridge this gap, we introduce Cross-model Control (CMC), a method that improves multiple LLMs in one-time training with a portable tiny language model. Specifically, we have observed that the logit shift before and after fine-tuning is remarkably similar across different models. Based on this insight, we incorporate a tiny language model with a minimal number of parameters. By training alongside a frozen template LLM, the tiny…
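The logit-shift idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: in the paper the shift is produced by a trained tiny language model, whereas here it is computed by direct subtraction on made-up numbers, purely to show what "reusing a logit shift across models" means. The function names, the `alpha` scaling factor, the three-token vocabulary, and all logit values are hypothetical, and the sketch assumes both models share one vocabulary (the paper's token mapping strategy for differing vocabularies is not covered).

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def logit_shift(base_logits, tuned_logits):
    """Per-token difference between logits after and before fine-tuning."""
    return [t - b for b, t in zip(base_logits, tuned_logits)]


def apply_shift(llm_logits, delta_logits, alpha=1.0):
    """Add a (scaled) shift to another model's logits.

    `alpha` is a hypothetical strength knob, not a parameter from the paper.
    Assumes both lists are indexed by the same vocabulary.
    """
    if len(llm_logits) != len(delta_logits):
        raise ValueError("logit vectors must cover the same vocabulary")
    return [l + alpha * d for l, d in zip(llm_logits, delta_logits)]


# Toy data (all values invented for illustration).
vocab = ["yes", "no", "maybe"]
template_base = [2.0, 1.0, 0.5]    # template LLM before fine-tuning
template_tuned = [1.0, 2.5, 0.5]   # same model after fine-tuning
other_llm = [1.8, 1.2, 0.3]        # a different, untouched LLM

delta = logit_shift(template_base, template_tuned)
controlled = apply_shift(other_llm, delta)
probs = softmax(controlled)
best = vocab[probs.index(max(probs))]
```

In this toy case the untouched model originally prefers "yes", and adding the shift learned on the template model moves its preference to "no", which mirrors the abstract's observation that the shift is similar across models.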

Code & Models

Repositories

wujwyi/cmc (PyTorch, official)

Models

Videos

Cross-model Control: Improving Multiple Large Language Models in One-time Training (SlidesLive)

Taxonomy

Topics: Topic Modeling

Methods: Sparse Evolutionary Training