Explainable AI in Genomics: Transcription Factor Binding Site Prediction with Mixture of Experts

Aakash Tripathi; Ian E. Nielsen; Muhammad Umer; Ravi P. Ramachandran; Ghulam Rasool

arXiv:2507.09754·cs.LG·July 21, 2025

Explainable AI in Genomics: Transcription Factor Binding Site Prediction with Mixture of Experts

Aakash Tripathi, Ian E. Nielsen, Muhammad Umer, Ravi P. Ramachandran, Ghulam Rasool

PDF

TL;DR

This paper presents a novel Mixture of Experts model for transcription factor binding site prediction that outperforms individual models, especially in out-of-distribution scenarios, and introduces a new interpretability technique called ShiftSmooth.

Contribution

The study introduces a Mixture of Experts approach combining multiple CNN models and a new attribution method, ShiftSmooth, for improved accuracy and interpretability in TFBS prediction.

Findings

01

MoE model outperforms individual experts on diverse datasets

02

ShiftSmooth provides more robust attribution than Vanilla Gradient

03

ANOVA confirms significant performance improvements

Abstract

Transcription Factor Binding Site (TFBS) prediction is crucial for understanding gene regulation and various biological processes. This study introduces a novel Mixture of Experts (MoE) approach for TFBS prediction, integrating multiple pre-trained Convolutional Neural Network (CNN) models, each specializing in different TFBS patterns. We evaluate the performance of our MoE model against individual expert models on both in-distribution and out-of-distribution (OOD) datasets, using six randomly selected transcription factors (TFs) for OOD testing. Our results demonstrate that the MoE model achieves competitive or superior performance across diverse TF binding sites, particularly excelling in OOD scenarios. The Analysis of Variance (ANOVA) statistical test confirms the significance of these performance differences. Additionally, we introduce ShiftSmooth, a novel attribution mapping…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.