XAI-Driven Deep Learning for Protein Sequence Functional Group Classification

Pratik Chakraborty; Aryan Bhargava

arXiv:2511.13791·q-bio.QM·November 19, 2025

XAI-Driven Deep Learning for Protein Sequence Functional Group Classification

Pratik Chakraborty, Aryan Bhargava

PDF

Open Access

TL;DR

This paper introduces a deep learning framework for classifying protein sequences into functional groups, achieving high accuracy and interpretability through explainable AI techniques that reveal biologically meaningful motifs.

Contribution

It presents a novel combination of deep learning architectures and explainable AI methods for protein function classification, highlighting biologically relevant motifs.

Findings

01

CNN achieved 91.8% validation accuracy

02

Explainable AI identified motifs in histidine, aspartate, glutamate, lysine

03

Deep learning uncovered biochemical signatures linked to enzyme functions

Abstract

Proteins perform essential biological functions, and accurate classification of their sequences is critical for understanding structure-function relationships, enzyme mechanisms, and molecular interactions. This study presents a deep learning-based framework for functional group classification of protein sequences derived from the Protein Data Bank (PDB). Four architectures were implemented: Convolutional Neural Network (CNN), Bidirectional Long Short-Term Memory (BiLSTM), CNN-BiLSTM hybrid, and CNN with Attention. Each model was trained using k-mer integer encoding to capture both local and long-range dependencies. Among these, the CNN achieved the highest validation accuracy of 91.8%, demonstrating the effectiveness of localized motif detection. Explainable AI techniques, including Grad-CAM and Integrated Gradients, were applied to interpret model predictions and identify biologically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Bioinformatics · Protein Structure and Dynamics · Computational Drug Discovery Methods