Efficient Inference Using Large Language Models with Limited Human Data: Fine-Tuning then Rectification
Lei Wang, Zikun Ye, Jinglong Zhao

TL;DR
This paper introduces a two-stage framework combining fine-tuning and rectification for large language models, optimizing limited human data use to improve inference efficiency and reduce costs in business applications.
Contribution
It proposes a novel variance-based fine-tuning objective and an optimal data allocation strategy leveraging the scaling law, enhancing LLM performance with limited labeled data.
Findings
Variance minimization improves downstream rectification
Optimal data allocation enhances efficiency and accuracy
Empirical validation confirms the proposed framework's effectiveness
Abstract
Driven by recent advances in artificial intelligence (AI), a growing literature has demonstrated the potential for using large language models (LLMs) as scalable surrogates to generate human-like responses in many business applications. Two common approaches to improve the performance of LLMs include: fine-tuning, which aligns LLMs more closely with human responses, and rectification, which corrects biases in LLM outputs. In this paper, we develop a two-stage framework that combines fine-tuning and rectification, and optimally allocates limited labeled samples across the two stages. Unlike the conventional objective that minimizes the mean squared prediction errors, we propose to minimize the variance of the prediction errors as the fine-tuning objective, which is optimal for the downstream rectification stage. Building on this insight, we leverage the scaling law of fine-tuning to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Big Data and Digital Economy · Sentiment Analysis and Opinion Mining
