Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild
Xinyu Zhao, Guoheng Sun, Ruisi Cai, Yukun Zhou, Pingzhi Li, Peihao, Wang, Bowen Tan, Yexiao He, Li Chen, Yi Liang, Beidi Chen, Binhang Yuan,, Hongyi Wang, Ang Li, Zhangyang Wang, Tianlong Chen

TL;DR
Model-GLUE provides a comprehensive framework for scaling and aggregating diverse large language models, improving performance without extra training by benchmarking techniques and optimizing model combination strategies.
Contribution
This work introduces a holistic guideline for LLM scaling, including benchmarking, clustering, and optimal merging strategies for heterogeneous model zoos.
Findings
Average performance improvement of 5.61% on Llama-2-based models
Benchmarking reveals effective model merging and mixture techniques
Optimal aggregation strategies enhance model zoo performance
Abstract
As Large Language Models (LLMs) excel across tasks and specialized domains, scaling LLMs based on existing models has garnered significant attention, which faces the challenge of decreasing performance when combining disparate models. Various techniques have been proposed for the aggregation of pre-trained LLMs, including model merging, Mixture-of-Experts, and stacking. Despite their merits, a comprehensive comparison and synergistic application of them to a diverse model zoo is yet to be adequately addressed. In light of this research gap, this paper introduces Model-GLUE, a holistic LLM scaling guideline. First, our work starts with a benchmarking of existing LLM scaling techniques, especially selective merging, and variants of mixture. Utilizing the insights from the benchmark results, we formulate an optimal strategy for the selection and aggregation of a heterogeneous model zoo…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHydrology and Watershed Management Studies · Simulation Techniques and Applications · Modeling and Simulation Systems
