"Why" Has the Least Side Effect on Model Editing

Tsung-Hsuan Pan; Chung-Chi Chen; Hen-Hsen Huang; Hsin-Hsi Chen

arXiv:2409.18679·cs.CL·September 30, 2024

"Why" Has the Least Side Effect on Model Editing

Tsung-Hsuan Pan, Chung-Chi Chen, Hen-Hsen Huang, Hsin-Hsi Chen

PDF

Open Access

TL;DR

This paper investigates the factors influencing side effects in model editing of large language models, revealing that question type, model size, and batch size significantly affect performance and side effects.

Contribution

It categorizes model editing questions by type, compares effects across model sizes, and explores batch size impact, providing new insights into minimizing side effects.

Findings

01

Performance degradation varies by question type.

02

Insights from small models may not apply to large models.

03

Increasing batch size reduces side effects.

Abstract

Training large language models (LLMs) from scratch is an expensive endeavor, particularly as world knowledge continually evolves. To maintain relevance and accuracy of LLMs, model editing has emerged as a pivotal research area. While these methods hold promise, they can also produce unintended side effects. Their underlying factors and causes remain largely unexplored. This paper delves into a critical factor-question type-by categorizing model editing questions. Our findings reveal that the extent of performance degradation varies significantly across different question types, providing new insights for experimental design in knowledge editing. Furthermore, we investigate whether insights from smaller models can be extrapolated to larger models. Our results indicate discrepancies in findings between models of different sizes, suggesting that insights from smaller models may not…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel-Driven Software Engineering Techniques · Reinforcement Learning in Robotics