Efficient Inference Using Large Language Models with Limited Human Data: Fine-Tuning then Rectification

Lei Wang; Zikun Ye; Jinglong Zhao

arXiv:2511.19486·cs.LG·December 30, 2025

Efficient Inference Using Large Language Models with Limited Human Data: Fine-Tuning then Rectification

Lei Wang, Zikun Ye, Jinglong Zhao

PDF

Open Access

TL;DR

This paper introduces a two-stage framework combining fine-tuning and rectification for large language models, optimizing limited human data use to improve inference efficiency and reduce costs in business applications.

Contribution

It proposes a novel variance-based fine-tuning objective and an optimal data allocation strategy leveraging the scaling law, enhancing LLM performance with limited labeled data.

Findings

01

Variance minimization improves downstream rectification

02

Optimal data allocation enhances efficiency and accuracy

03

Empirical validation confirms the proposed framework's effectiveness

Abstract

Driven by recent advances in artificial intelligence (AI), a growing literature has demonstrated the potential for using large language models (LLMs) as scalable surrogates to generate human-like responses in many business applications. Two common approaches to improve the performance of LLMs include: fine-tuning, which aligns LLMs more closely with human responses, and rectification, which corrects biases in LLM outputs. In this paper, we develop a two-stage framework that combines fine-tuning and rectification, and optimally allocates limited labeled samples across the two stages. Unlike the conventional objective that minimizes the mean squared prediction errors, we propose to minimize the variance of the prediction errors as the fine-tuning objective, which is optimal for the downstream rectification stage. Building on this insight, we leverage the scaling law of fine-tuning to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational and Text Analysis Methods · Big Data and Digital Economy · Sentiment Analysis and Opinion Mining