Eliciting Fine-Tuned Transformer Capabilities via Inference-Time Techniques

Asankhaya Sharma

arXiv:2506.08060·cs.LG·June 11, 2025

Eliciting Fine-Tuned Transformer Capabilities via Inference-Time Techniques

Asankhaya Sharma

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that the capabilities of fine-tuned transformers can be approximated by base models using inference-time techniques like in-context learning, reducing the need for costly fine-tuning.

Contribution

It provides a theoretical framework showing how in-context learning can replicate fine-tuned model capabilities under idealized and practical conditions.

Findings

01

Capabilities of fine-tuned models can be approximated with finite datasets.

02

Theoretical bounds on dataset sizes needed for approximation.

03

Practical techniques like retrieval-augmented generation can bridge theory and application.

Abstract

Large language models have transformed natural language processing, yet supervised fine-tuning (SFT) remains computationally intensive. This paper formally proves that capabilities acquired through SFT can be approximated by a base transformer model using inference-time techniques, specifically in-context learning (ICL), without altering model parameters, under idealized assumptions including unbounded computational resources and access to the fine-tuning dataset. We extend these results to practical scenarios with finite context lengths and partial dataset access. For text generation tasks with fixed output length $l$ , datasets of size $O (\frac{mV}{ε ^{2}} lo g \frac{m}{δ})$ or, with bounded context, $O (\frac{l l o g V}{ε ^{2}} lo g \frac{1}{δ})$ suffice to approximate fine-tuned behavior across $m$ contexts within…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

codelion/optillm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Algorithms

MethodsShrink and Fine-Tune · Balanced Selection