Fine-tuning Large Language Models with Limited Data: A Survey and Practical Guide
Marton Szep, Daniel Rueckert, R\"udiger von Eisenhart-Rothe, Florian Hinterwimmer

TL;DR
This paper provides a comprehensive survey of recent methods for effectively fine-tuning large language models in low-data scenarios, emphasizing practical techniques, trade-offs, and best practices for resource-constrained settings.
Contribution
It systematically reviews parameter-efficient, domain adaptation, and preference alignment methods, offering actionable insights for low-resource fine-tuning of LLMs.
Findings
Parameter-efficient fine-tuning reduces training costs
Domain adaptation improves model performance in specialized areas
Preference alignment enhances model behavior with limited feedback
Abstract
Fine-tuning large language models (LLMs) with limited data poses a practical challenge in low-resource languages, specialized domains, and constrained deployment settings. While pre-trained LLMs provide strong foundations, effective adaptation under data scarcity requires focused and efficient fine-tuning techniques. This paper presents a structured and practical survey of recent methods for fine-tuning LLMs in data-scarce scenarios. We systematically review parameter-efficient fine-tuning techniques that lower training and deployment costs, domain and cross-lingual adaptation methods for both encoder and decoder models, and model specialization strategies. We further examine preference alignment approaches that guide model behavior using limited human or synthetic feedback, emphasizing sample and compute efficiency. Throughout, we highlight empirical trade-offs, selection criteria, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
