Collaborative Performance Prediction for Large Language Models

Qiyuan Zhang; Fuyuan Lyu; Xue Liu; Chen Ma

arXiv:2407.01300·cs.CL·October 4, 2024

Collaborative Performance Prediction for Large Language Models

Qiyuan Zhang, Fuyuan Lyu, Xue Liu, Chen Ma

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a novel Collaborative Performance Prediction framework that leverages historical performance data and design factors to improve accuracy in predicting large language model performance across tasks.

Contribution

The paper proposes a new CPP framework that incorporates collaborative data and considers both model and task design factors for enhanced performance prediction.

Findings

01

CPP surpasses traditional scaling laws in accuracy

02

Collaborative data improves prediction robustness

03

Analysis of factor importance reveals new insights

Abstract

Comprehensively understanding and accurately predicting the performance of large language models across diverse downstream tasks has emerged as a pivotal challenge in NLP research. The pioneering scaling law on downstream works demonstrated intrinsic similarities within model families and utilized such similarities for performance prediction. However, they tend to overlook the similarities between model families and only consider design factors listed in the original scaling law. To overcome these limitations, we introduce a novel framework, Collaborative Performance Prediction (CPP), which significantly enhances prediction accuracy by leveraging the historical performance of various models on downstream tasks and other design factors for both model and task. We also collect a collaborative data sourced from online platforms containing both historical performance and additional design…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Don-Joey/CPP_LLM
pytorchOfficial

Videos

Collaborative Performance Prediction for Large Language Models· underline

Taxonomy

TopicsTopic Modeling