Evaluating Post-hoc Explanations of the Transformer-based Genome Language Model DNABERT-2
Isabel Kurth, Paulo Yanez Sarmiento, Bernhard Y. Renard

TL;DR
This paper evaluates the explainability of Transformer-based genome language models, specifically DNABERT-2, using adapted relevance propagation methods to produce biologically meaningful explanations.
Contribution
It introduces an adaptation of layer-wise relevance propagation for Transformer models in genomics and compares explanations between DNABERT-2 and CNN baselines.
Findings
AttnLRP provides reliable explanations aligned with biological patterns
gLMs like DNABERT-2 can be used to derive biological insights
The method enables comparison of relevance attributions across architectures
Abstract
Explaining deep neural network predictions on genome sequences enables biological insight and hypothesis generation-often of greater interest than predictive performance alone. While explanations of convolutional neural networks (CNNs) have been shown to capture relevant patterns in genome sequences, it is unclear whether this transfers to more expressive Transformer-based genome language models (gLMs). To answer this question, we adapt AttnLRP, an extension of layer-wise relevance propagation to the attention mechanism, and apply it to the state-of-the-art gLM DNABERT-2. Thereby, we propose strategies to transfer explanations from token and nucleotide level. We evaluate the adaption of AttnLRP on genomic datasets using multiple metrics. Further, we provide an extensive comparison between the explanations of DNABERT-2 and a baseline CNN. Our results demonstrate that AttnLRP yields…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
