Small Language Models Improve Giants by Rewriting Their Outputs
Giorgos Vernikos, Arthur Bra\v{z}inskas, Jakub Adamek, Jonathan, Mallinson, Aliaksei Severyn, Eric Malmi

TL;DR
This paper introduces LMCor, a compact model that enhances large language models' outputs by merging candidate predictions, achieving performance comparable to fine-tuning without altering the original models.
Contribution
The authors propose a novel plug-and-play correction method using a small model trained to improve LLM outputs without fine-tuning or access to weights.
Findings
LMCor significantly improves LLM performance across multiple tasks.
A small 250M LMCor matches or surpasses fine-tuning results.
LMCor is robust across different prompts and models.
Abstract
Despite the impressive performance of large language models (LLMs), they often lag behind specialized models in various tasks. LLMs only use a fraction of the existing training data for in-context learning, while task-specific models harness the full dataset for fine-tuning. In this work, we tackle the problem of leveraging training data to improve the performance of LLMs without fine-tuning. Our approach directly targets LLM predictions without requiring access to their weights. We create a pool of candidates from the LLM through few-shot prompting and we employ a compact model, the LM-corrector (LMCor), specifically trained to merge these candidates to produce an enhanced output. Our experiments on four natural language generation tasks demonstrate that even a small LMCor model (250M) substantially improves the few-shot performance of LLMs (62B), matching and even outperforming…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Physics and Python Applications · Topic Modeling
