From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning
Yafu Li, Zhilin Wang, Tingchen Fu, Ganqu Cui, Sen Yang, Yu Cheng

TL;DR
This paper introduces Aggregation Fine-Tuning (AFT), a method that enhances large language models by training them to synthesize multiple draft responses into refined answers, significantly improving performance with minimal data.
Contribution
The paper proposes AFT, a novel finetuning paradigm that enables models to generate and aggregate multiple proposals, boosting inference performance without increasing data or model size.
Findings
AFT-trained models outperform standard fine-tuning on benchmark datasets.
A model fine-tuned with only 64k data surpasses larger LLMs in specific tasks.
Propose-and-aggregate strategy scales inference-time computation effectively.
Abstract
Scaling data and model size has been proven effective for boosting the performance of large language models. In addition to training-time scaling, recent studies have revealed that increasing test-time computational resources can further improve performance. In this work, we introduce Aggregation Fine-Tuning (AFT), a supervised finetuning paradigm where the model learns to synthesize multiple draft responses, referred to as proposals, into a single, refined answer, termed aggregation. At inference time, a propose-and-aggregate strategy further boosts performance by iteratively generating proposals and aggregating them. Empirical evaluations on benchmark datasets show that AFT-trained models substantially outperform standard SFT. Notably, an AFT model, fine-tuned from Llama3.1-8B-Base with only 64k data, achieves a 41.3% LC win rate on AlpacaEval 2, surpassing significantly larger LLMs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law · Law, AI, and Intellectual Property · Dispute Resolution and Class Actions
MethodsShrink and Fine-Tune
