Improving Large Models with Small models: Lower Costs and Better   Performance

Dong Chen; Shuo Zhang; Yueting Zhuang; Siliang Tang; Qidong Liu; Hua; Wang; Mingliang Xu

arXiv:2406.15471·cs.CL·June 25, 2024·3 cites

Improving Large Models with Small models: Lower Costs and Better Performance

Dong Chen, Shuo Zhang, Yueting Zhuang, Siliang Tang, Qidong Liu, Hua, Wang, Mingliang Xu

PDF

Open Access 1 Repo

TL;DR

The paper introduces Data Shunt$^+$, a collaborative paradigm that leverages small and large models together to reduce costs and enhance performance on tasks like sentiment analysis.

Contribution

It proposes a novel collaborative framework, Data Shunt$^+$, that improves large model efficiency and effectiveness by utilizing small models for simpler subtasks.

Findings

01

Achieves higher accuracy with lower cost on sentiment analysis.

02

Reduces large model query costs to approximately 31% of original.

03

Better injects task-specific knowledge than fine-tuning.

Abstract

Pretrained large models (PLMs), such as ChatGPT, have demonstrated remarkable performance across diverse tasks. However, the significant computational requirements of PLMs have discouraged most product teams from running or fine-tuning them. In such cases, to harness the exceptional performance of PLMs, one must rely on expensive APIs, thereby exacerbating the economic burden. Despite the overall inferior performance of small models, in specific distributions, they can achieve comparable or even superior results. Consequently, some input can be processed exclusively by small models. On the other hand, certain tasks can be broken down into multiple subtasks, some of which can be completed without powerful capabilities. Under these circumstances, small models can handle the simple subtasks, allowing large models to focus on challenging subtasks, thus improving the performance. We propose…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Anfeather/Data-Shunt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScientific Computing and Data Management

MethodsFocus