Evaluating Post-hoc Explanations of the Transformer-based Genome Language Model DNABERT-2

Isabel Kurth; Paulo Yanez Sarmiento; Bernhard Y. Renard

arXiv:2604.21690·cs.LG·April 24, 2026

Evaluating Post-hoc Explanations of the Transformer-based Genome Language Model DNABERT-2

Isabel Kurth, Paulo Yanez Sarmiento, Bernhard Y. Renard

PDF

TL;DR

This paper evaluates the explainability of Transformer-based genome language models, specifically DNABERT-2, using adapted relevance propagation methods to produce biologically meaningful explanations.

Contribution

It introduces an adaptation of layer-wise relevance propagation for Transformer models in genomics and compares explanations between DNABERT-2 and CNN baselines.

Findings

01

AttnLRP provides reliable explanations aligned with biological patterns

02

gLMs like DNABERT-2 can be used to derive biological insights

03

The method enables comparison of relevance attributions across architectures

Abstract

Explaining deep neural network predictions on genome sequences enables biological insight and hypothesis generation-often of greater interest than predictive performance alone. While explanations of convolutional neural networks (CNNs) have been shown to capture relevant patterns in genome sequences, it is unclear whether this transfers to more expressive Transformer-based genome language models (gLMs). To answer this question, we adapt AttnLRP, an extension of layer-wise relevance propagation to the attention mechanism, and apply it to the state-of-the-art gLM DNABERT-2. Thereby, we propose strategies to transfer explanations from token and nucleotide level. We evaluate the adaption of AttnLRP on genomic datasets using multiple metrics. Further, we provide an extensive comparison between the explanations of DNABERT-2 and a baseline CNN. Our results demonstrate that AttnLRP yields…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.