Optimization-Inspired Few-Shot Adaptation for Large Language Models

Boyan Gao; Xin Wang; Yibo Yang; David Clifton

arXiv:2505.19107 · cs.LG · May 27, 2025

Open Access

TL;DR

This paper introduces OFA, a novel optimization-inspired method for few-shot adaptation of large language models that improves efficiency and performance without extra trainable parameters.

Contribution

It reinterprets LLM forward passes as optimization steps and proposes a parameterization that learns preconditioners to enhance few-shot adaptation.

Findings

01. OFA outperforms existing few-shot adaptation methods.

02. The method improves optimization efficiency and convergence.

03. OFA achieves superior results across various tasks.

Abstract

Large Language Models (LLMs) have demonstrated remarkable performance in real-world applications. However, adapting LLMs to novel tasks via fine-tuning often requires substantial training data and computational resources, which is impractical in few-shot scenarios. Existing approaches, such as in-context learning and Parameter-Efficient Fine-Tuning (PEFT), face key limitations: in-context learning adds computational overhead at inference time while yielding limited performance gains, and PEFT models are prone to overfitting on the few demonstration examples. In this work, we reinterpret the forward pass of LLMs as an optimization process: a sequence of preconditioned gradient descent steps that refine internal representations. Based on this connection, we propose Optimization-Inspired Few-Shot Adaptation (OFA), integrating a parameterization that learns preconditioners without introducing…
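To make the phrase "preconditioned gradient descent steps" concrete, here is a minimal toy sketch in plain Python. It is not the OFA method and does not involve an LLM: it only shows how a preconditioner rescales each gradient step on a badly conditioned quadratic. The function names, the quadratic objective, and the hand-picked inverse-curvature preconditioner are all assumptions made for illustration; the abstract states that OFA learns its preconditioners rather than fixing them by hand.

```python
# Toy illustration only: preconditioned gradient descent on
# f(x) = 0.5 * sum(a_i * x_i^2). All names here are hypothetical
# and are not taken from the paper.

def loss(x, curvature):
    """Quadratic objective with per-coordinate curvature a_i."""
    return 0.5 * sum(a * xi * xi for a, xi in zip(curvature, x))


def grad(x, curvature):
    """Gradient of the quadratic objective: a_i * x_i per coordinate."""
    return [a * xi for a, xi in zip(curvature, x)]


def preconditioned_gd(x0, curvature, precond, lr, steps):
    """Run x <- x - lr * P * grad(x) with a diagonal preconditioner P."""
    x = list(x0)
    for _ in range(steps):
        g = grad(x, curvature)
        x = [xi - lr * p * gi for xi, p, gi in zip(x, precond, g)]
    return x


curvature = [1.0, 100.0]   # badly conditioned: one flat and one steep direction
x0 = [1.0, 1.0]

# Plain gradient descent is the special case P = identity. The step size
# must stay small to remain stable along the steep direction.
x_plain = preconditioned_gd(x0, curvature, [1.0, 1.0], lr=0.01, steps=10)

# Assumed preconditioner: inverse curvature, which equalizes the progress
# made along both directions and permits a much larger step size.
inv_curv = [1.0 / a for a in curvature]
x_pre = preconditioned_gd(x0, curvature, inv_curv, lr=0.5, steps=10)

print("plain GD loss:         ", loss(x_plain, curvature))
print("preconditioned GD loss:", loss(x_pre, curvature))
```

In this toy setting, the preconditioned run reaches a much lower loss in the same number of steps because the flat direction is no longer held back by the step size the steep direction requires. Under the abstract's reading, each step would loosely correspond to one stage of the forward pass refining an internal representation.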

Taxonomy

Topics: Speech Recognition and Synthesis · Topic Modeling