From Drafts to Answers: Unlocking LLM Potential via Aggregation   Fine-Tuning

Yafu Li; Zhilin Wang; Tingchen Fu; Ganqu Cui; Sen Yang; Yu Cheng

arXiv:2501.11877·cs.CL·January 22, 2025

From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning

Yafu Li, Zhilin Wang, Tingchen Fu, Ganqu Cui, Sen Yang, Yu Cheng

PDF

Open Access 1 Repo

TL;DR

This paper introduces Aggregation Fine-Tuning (AFT), a method that enhances large language models by training them to synthesize multiple draft responses into refined answers, significantly improving performance with minimal data.

Contribution

The paper proposes AFT, a novel finetuning paradigm that enables models to generate and aggregate multiple proposals, boosting inference performance without increasing data or model size.

Findings

01

AFT-trained models outperform standard fine-tuning on benchmark datasets.

02

A model fine-tuned with only 64k data surpasses larger LLMs in specific tasks.

03

Propose-and-aggregate strategy scales inference-time computation effectively.

Abstract

Scaling data and model size has been proven effective for boosting the performance of large language models. In addition to training-time scaling, recent studies have revealed that increasing test-time computational resources can further improve performance. In this work, we introduce Aggregation Fine-Tuning (AFT), a supervised finetuning paradigm where the model learns to synthesize multiple draft responses, referred to as proposals, into a single, refined answer, termed aggregation. At inference time, a propose-and-aggregate strategy further boosts performance by iteratively generating proposals and aggregating them. Empirical evaluations on benchmark datasets show that AFT-trained models substantially outperform standard SFT. Notably, an AFT model, fine-tuned from Llama3.1-8B-Base with only 64k data, achieves a 41.3% LC win rate on AlpacaEval 2, surpassing significantly larger LLMs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

linzwcs/aft
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law · Law, AI, and Intellectual Property · Dispute Resolution and Class Actions

MethodsShrink and Fine-Tune