Induction Head Toxicity Mechanistically Explains Repetition Curse in Large Language Models
Shuxun Wang, Qingyu Yin, Chak Tou Leong, Qiang Zhang, Linyi Yang

TL;DR
This paper investigates how induction heads in large language models contribute to the repetition curse, revealing their dominant role in causing repetitive outputs and proposing regularization methods to mitigate this issue.
Contribution
It provides a mechanistic explanation for the repetition curse by identifying induction heads as a key driver and introduces a regularization technique to reduce their dominance.
Findings
Induction heads significantly contribute to the repetition curse in LLMs.
Regularization of attention heads can reduce repetition and improve output diversity.
Mechanistic understanding aids in designing better training strategies for LLMs.
Abstract
Repetition curse is a phenomenon where Large Language Models (LLMs) generate repetitive sequences of tokens or cyclic sequences. While the repetition curse has been widely observed, its underlying mechanisms remain poorly understood. In this work, we investigate the role of induction heads--a specific type of attention head known for their ability to perform in-context learning--in driving this repetitive behavior. Specifically, we focus on the "toxicity" of induction heads, which we define as their tendency to dominate the model's output logits during repetition, effectively excluding other attention heads from contributing to the generation process. Our findings have important implications for the design and training of LLMs. By identifying induction heads as a key driver of the repetition curse, we provide a mechanistic explanation for this phenomenon and suggest potential avenues…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Natural Language Processing Techniques
MethodsSoftmax · Attention Is All You Need · Focus
