The Larger the Better? Improved LLM Code-Generation via Budget   Reallocation

Michael Hassid; Tal Remez; Jonas Gehring; Roy Schwartz; Yossi Adi

arXiv:2404.00725·cs.SE·July 26, 2024·5 cites

The Larger the Better? Improved LLM Code-Generation via Budget Reallocation

Michael Hassid, Tal Remez, Jonas Gehring, Roy Schwartz, Yossi Adi

PDF

Open Access 1 Repo

TL;DR

This paper investigates whether smaller language models, when used repeatedly within the same computational budget, can outperform larger models in code generation tasks, highlighting the importance of output ranking strategies.

Contribution

It demonstrates that multiple outputs from smaller models can improve performance and emphasizes the significance of output ranking over model size alone.

Findings

01

Repeated use of smaller models improves code generation accuracy by up to 15%.

02

Ranking-based selection from smaller models underperforms compared to larger models.

03

Using smaller models efficiently can challenge the assumption that bigger models are always better.

Abstract

It is a common belief that large language models (LLMs) are better than smaller-sized ones. However, larger models also require significantly more time and compute during inference. This begs the question: what happens when both models operate under the same budget? (e.g., compute, run-time). To address this question, we analyze code generation LLMs of various sizes and make comparisons such as running a 70B model once vs. generating five outputs from a 13B model. We consider a standard unit-test setup, which can be used to select the correct output from the smaller model. Our findings reveal that the repeated use of smaller models can yield consistent improvements, with gains of up to 15% across five tasks. On the other hand, in scenarios where unit-tests are unavailable, a ranking-based selection of candidates from the smaller model falls short of the performance of a single output…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

slp-rl/budget-realloc
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Rights Management and Security