Structured Gradient Guidance for Few-Shot Adaptation in Large Language Models

Hongye Zheng; Yichen Wang; Ray Pan; Guiran Liu; Binrong Zhu; Hanlu Zhang

arXiv:2506.00726·cs.CL·June 3, 2025

Structured Gradient Guidance for Few-Shot Adaptation in Large Language Models

Hongye Zheng, Yichen Wang, Ray Pan, Guiran Liu, Binrong Zhu, Hanlu Zhang

PDF

Open Access

TL;DR

This paper introduces a gradient-guided fine-tuning approach for large language models that improves task adaptability, stability, and cross-task generalization in few-shot settings by using gradient regularization and alignment mechanisms.

Contribution

The paper proposes a novel gradient-informed fine-tuning method with regularization and alignment techniques to enhance stability and generalization in low-resource scenarios.

Findings

01

Outperforms existing fine-tuning strategies in accuracy and stability

02

Enhances cross-task generalization through gradient alignment

03

Demonstrates robustness across various NLP tasks and data sizes

Abstract

This paper presents a gradient-informed fine-tuning method for large language models under few-shot conditions. The goal is to enhance task adaptability and training stability when data is limited. The method builds on a base loss function and introduces two gradient-related regularization terms. The first enforces gradient direction consistency to guide parameter updates along task-relevant directions and prevent drift. The second controls gradient magnitude to avoid abnormal updates. Together, these components support a more efficient and stable optimization path. To further improve cross-task generalization, the method incorporates a gradient alignment mechanism. This mechanism measures the consistency between optimization directions of the source and target tasks. It enhances fine-tuning performance in multi-task and cross-domain scenarios. Across various natural language…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis