A Closer Look at the Limitations of Instruction Tuning

Sreyan Ghosh; Chandra Kiran Reddy Evuru; Sonal Kumar and; Ramaneswaran S; Deepali Aneja; Zeyu Jin; Ramani Duraiswami and; Dinesh Manocha

arXiv:2402.05119·cs.CL·July 16, 2024·2 cites

A Closer Look at the Limitations of Instruction Tuning

Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar and, Ramaneswaran S, Deepali Aneja, Zeyu Jin, Ramani Duraiswami and, Dinesh Manocha

PDF

Open Access

TL;DR

This paper critically examines instruction tuning (IT) for large language models, revealing its limitations in knowledge enhancement, response quality, and hallucination, and showing that simpler fine-tuning methods often outperform IT.

Contribution

The paper provides a comprehensive analysis of IT's shortcomings, demonstrating that current methods do not effectively improve LLM knowledge or response quality, and highlighting the superiority of pre-trained knowledge responses.

Findings

01

IT does not improve LLM knowledge or skills.

02

Full-parameter fine-tuning can degrade knowledge.

03

IT responses often decline in quality and increase hallucinations.

Abstract

Instruction Tuning (IT), the process of training large language models (LLMs) using instruction-response pairs, has emerged as the predominant method for transforming base pre-trained LLMs into open-domain conversational agents. While IT has achieved notable success and widespread adoption, its limitations and shortcomings remain underexplored. In this paper, through rigorous experiments and an in-depth analysis of the changes LLMs undergo through IT, we reveal various limitations of IT. In particular, we show that (1) IT fails to enhance knowledge or skills in LLMs. LoRA fine-tuning is limited to learning response initiation and style tokens, and full-parameter fine-tuning leads to knowledge degradation. (2) Copying response patterns from IT datasets derived from knowledgeable sources leads to a decline in response quality. (3) Full-parameter fine-tuning increases hallucination by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEducation and Technology Integration

MethodsBalanced Selection