Mechanistic Fine-tuning for In-context Learning
Hakaze Cho, Peng Luo, Mariko Kato, Rin Kaenbyou, Naoya Inoue

TL;DR
This paper introduces Attention Behavior Fine-Tuning (ABFT), a cost-effective method to improve in-context learning in language models by adjusting attention scores, leading to better performance and interpretability.
Contribution
The paper proposes ABFT, a novel fine-tuning approach that modifies attention scores to enhance ICL, reducing computational costs and revealing insights into model mechanisms.
Findings
ABFT outperforms previous methods in accuracy, robustness, and efficiency.
ABFT requires only 0.01% of the data cost of prior approaches.
Analysis suggests ICL biases relate to the emergence of induction heads.
Abstract
In-context Learning (ICL) utilizes structured demonstration-query inputs to induce few-shot learning on Language Models (LMs), which are not originally pre-trained on ICL-style data. To bridge the gap between ICL and pre-training, some approaches fine-tune LMs on large ICL-style datasets by an end-to-end paradigm with massive computational costs. To reduce such costs, in this paper, we propose Attention Behavior Fine-Tuning (ABFT), utilizing the previous findings on the inner mechanism of ICL, building training objectives on the attention scores instead of the final outputs, to force the attention scores to focus on the correct label tokens presented in the context and mitigate attention scores from the wrong label tokens. Our experiments on 9 modern LMs and 8 datasets empirically find that ABFT outperforms in performance, robustness, unbiasedness, and efficiency, with only around 0.01%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInnovative Teaching and Learning Methods · Neural Networks and Applications
MethodsSoftmax · Attention Is All You Need · Focus
