Eliciting Fine-Tuned Transformer Capabilities via Inference-Time Techniques
Asankhaya Sharma

TL;DR
This paper demonstrates that the capabilities of fine-tuned transformers can be approximated by base models using inference-time techniques like in-context learning, reducing the need for costly fine-tuning.
Contribution
It provides a theoretical framework showing how in-context learning can replicate fine-tuned model capabilities under idealized and practical conditions.
Findings
Capabilities of fine-tuned models can be approximated with finite datasets.
Theoretical bounds on dataset sizes needed for approximation.
Practical techniques like retrieval-augmented generation can bridge theory and application.
Abstract
Large language models have transformed natural language processing, yet supervised fine-tuning (SFT) remains computationally intensive. This paper formally proves that capabilities acquired through SFT can be approximated by a base transformer model using inference-time techniques, specifically in-context learning (ICL), without altering model parameters, under idealized assumptions including unbounded computational resources and access to the fine-tuning dataset. We extend these results to practical scenarios with finite context lengths and partial dataset access. For text generation tasks with fixed output length , datasets of size or, with bounded context, suffice to approximate fine-tuned behavior across contexts within…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Algorithms
MethodsShrink and Fine-Tune · Balanced Selection
