MambaLRP: Explaining Selective State Space Sequence Models
Farnoush Rezaei Jafari, Gr\'egoire Montavon, Klaus-Robert M\"uller,, Oliver Eberle

TL;DR
MambaLRP introduces a new explainability method for Mamba sequence models, ensuring more faithful relevance propagation and enabling better understanding of model biases and long-range capabilities.
Contribution
We propose MambaLRP, a novel LRP-based algorithm that improves explanation faithfulness and interpretability of Mamba models, addressing unfaithfulness issues in existing relevance propagation methods.
Findings
MambaLRP achieves state-of-the-art explanation performance.
It reveals biases and long-range capabilities of Mamba models.
The method is theoretically sound and broadly applicable.
Abstract
Recent sequence modeling approaches using selective state space sequence models, referred to as Mamba models, have seen a surge of interest. These models allow efficient processing of long sequences in linear time and are rapidly being adopted in a wide range of applications such as language modeling, demonstrating promising performance. To foster their reliable use in real-world scenarios, it is crucial to augment their transparency. Our work bridges this critical gap by bringing explainability, particularly Layer-wise Relevance Propagation (LRP), to the Mamba architecture. Guided by the axiom of relevance conservation, we identify specific components in the Mamba architecture, which cause unfaithful explanations. To remedy this issue, we propose MambaLRP, a novel algorithm within the LRP framework, which ensures a more stable and reliable relevance propagation through these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsSimulation Techniques and Applications
