Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated   Tokens

Zhepeng Cen; Yao Liu; Siliang Zeng; Pratik Chaudhari and; Huzefa Rangwala; George Karypis; Rasool Fakoor

arXiv:2410.14655·cs.LG·January 22, 2025

Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens

Zhepeng Cen, Yao Liu, Siliang Zeng, Pratik Chaudhari and, Huzefa Rangwala, George Karypis, Rasool Fakoor

PDF

Open Access

TL;DR

This paper introduces two training strategies for language models that leverage self-generated tokens to better align training and inference behaviors, improving performance in various NLP tasks.

Contribution

It proposes Batch-Scheduled Sampling and Reference-Answer-based Correction methods that incorporate self-generated tokens during training to reduce the training-inference discrepancy in LLMs.

Findings

01

Improved performance on summarization tasks

02

Enhanced accuracy in question-answering tasks

03

Effective self-correction capabilities demonstrated

Abstract

Language models are often trained to maximize the likelihood of the next token given past tokens in the training dataset. However, during inference time, they are utilized differently, generating text sequentially and auto-regressively by using previously generated tokens as input to predict the next one. Marginal differences in predictions at each step can cascade over successive steps, resulting in different distributions from what the models were trained for and potentially leading to unpredictable behavior. This paper proposes two simple approaches based on model own generation to address this discrepancy between the training and inference time. Our first approach is Batch-Scheduled Sampling, where, during training, we stochastically choose between the ground-truth token from the dataset and the model's own generated token as input to predict the next token. This is done in an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques