DETAIL: Task DEmonsTration Attribution for Interpretable In-context   Learning

Zijian Zhou; Xiaoqiang Lin; Xinyi Xu; Alok Prakash; Daniela Rus; Bryan; Kian Hsiang Low

arXiv:2405.14899·cs.CL·December 17, 2024

DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning

Zijian Zhou, Xiaoqiang Lin, Xinyi Xu, Alok Prakash, Daniela Rus, Bryan, Kian Hsiang Low

PDF

Open Access 1 Repo

TL;DR

This paper introduces DETAIL, an influence function-based attribution method for interpreting in-context learning in transformer models, enabling demonstration attribution, reordering, and transferability to improve model performance.

Contribution

The paper presents a novel influence function-based attribution technique tailored for ICL, addressing its unique characteristics and enabling effective demonstration attribution and transferability.

Findings

01

DETAIL is computationally efficient for demonstration attribution.

02

Reordering demonstrations using DETAIL improves model performance.

03

Attribution scores transfer effectively from white-box to black-box models.

Abstract

In-context learning (ICL) allows transformer-based language models that are pre-trained on general text to quickly learn a specific task with a few "task demonstrations" without updating their parameters, significantly boosting their flexibility and generality. ICL possesses many distinct characteristics from conventional machine learning, thereby requiring new approaches to interpret this learning paradigm. Taking the viewpoint of recent works showing that transformers learn in context by formulating an internal optimizer, we propose an influence function-based attribution technique, DETAIL, that addresses the specific characteristics of ICL. We empirically verify the effectiveness of our approach for demonstration attribution while being computationally efficient. Leveraging the results, we then show how DETAIL can help improve model performance in real-world scenarios through…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

BobbyZhouZijian/detail_release
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling