Progressive Inference: Explaining Decoder-Only Sequence Classification   Models Using Intermediate Predictions

Sanjay Kariyappa; Freddy L\'ecu\'e; Saumitra Mishra; Christopher Pond,; Daniele Magazzeni; Manuela Veloso

arXiv:2406.02625·cs.LG·June 6, 2024

Progressive Inference: Explaining Decoder-Only Sequence Classification Models Using Intermediate Predictions

Sanjay Kariyappa, Freddy L\'ecu\'e, Saumitra Mishra, Christopher Pond,, Daniele Magazzeni, Manuela Veloso

PDF

Open Access

TL;DR

This paper introduces Progressive Inference, a framework that explains decoder-only sequence classification models by leveraging intermediate predictions at different input points, enabling more accurate attributions with minimal extra computation.

Contribution

It presents two novel methods, SP-PI and MP-PI, for computing input attributions using intermediate model predictions, improving interpretability of decoder-only models.

Findings

01

SP-PI and MP-PI outperform prior attribution methods.

02

Intermediate predictions enable efficient, fine-grained explanations.

03

Methods are validated on diverse text classification models.

Abstract

This paper proposes Progressive Inference - a framework to compute input attributions to explain the predictions of decoder-only sequence classification models. Our work is based on the insight that the classification head of a decoder-only Transformer model can be used to make intermediate predictions by evaluating them at different points in the input sequence. Due to the causal attention mechanism, these intermediate predictions only depend on the tokens seen before the inference point, allowing us to obtain the model's prediction on a masked input sub-sequence, with negligible computational overheads. We develop two methods to provide sub-sequence level attributions using this insight. First, we propose Single Pass-Progressive Inference (SP-PI), which computes attributions by taking the difference between consecutive intermediate predictions. Second, we exploit a connection with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Computational Physics and Python Applications · Machine Learning in Healthcare

MethodsAttention Is All You Need · Sparse Evolutionary Training · Softmax · Layer Normalization · Shapley Additive Explanations · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection