Fine-tuning Large Language Models with Limited Data: A Survey and Practical Guide

Marton Szep; Daniel Rueckert; R\"udiger von Eisenhart-Rothe; Florian Hinterwimmer

arXiv:2411.09539·cs.CL·October 29, 2025·2 cites

Fine-tuning Large Language Models with Limited Data: A Survey and Practical Guide

Marton Szep, Daniel Rueckert, R\"udiger von Eisenhart-Rothe, Florian Hinterwimmer

PDF

Open Access

TL;DR

This paper provides a comprehensive survey of recent methods for effectively fine-tuning large language models in low-data scenarios, emphasizing practical techniques, trade-offs, and best practices for resource-constrained settings.

Contribution

It systematically reviews parameter-efficient, domain adaptation, and preference alignment methods, offering actionable insights for low-resource fine-tuning of LLMs.

Findings

01

Parameter-efficient fine-tuning reduces training costs

02

Domain adaptation improves model performance in specialized areas

03

Preference alignment enhances model behavior with limited feedback

Abstract

Fine-tuning large language models (LLMs) with limited data poses a practical challenge in low-resource languages, specialized domains, and constrained deployment settings. While pre-trained LLMs provide strong foundations, effective adaptation under data scarcity requires focused and efficient fine-tuning techniques. This paper presents a structured and practical survey of recent methods for fine-tuning LLMs in data-scarce scenarios. We systematically review parameter-efficient fine-tuning techniques that lower training and deployment costs, domain and cross-lingual adaptation methods for both encoder and decoder models, and model specialization strategies. We further examine preference alignment approaches that guide model behavior using limited human or synthetic feedback, emphasizing sample and compute efficiency. Throughout, we highlight empirical trade-offs, selection criteria, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques