XAI-Driven Deep Learning for Protein Sequence Functional Group Classification
Pratik Chakraborty, Aryan Bhargava

TL;DR
This paper introduces a deep learning framework for classifying protein sequences into functional groups, achieving high accuracy and interpretability through explainable AI techniques that reveal biologically meaningful motifs.
Contribution
It presents a novel combination of deep learning architectures and explainable AI methods for protein function classification, highlighting biologically relevant motifs.
Findings
CNN achieved 91.8% validation accuracy
Explainable AI identified motifs in histidine, aspartate, glutamate, lysine
Deep learning uncovered biochemical signatures linked to enzyme functions
Abstract
Proteins perform essential biological functions, and accurate classification of their sequences is critical for understanding structure-function relationships, enzyme mechanisms, and molecular interactions. This study presents a deep learning-based framework for functional group classification of protein sequences derived from the Protein Data Bank (PDB). Four architectures were implemented: Convolutional Neural Network (CNN), Bidirectional Long Short-Term Memory (BiLSTM), CNN-BiLSTM hybrid, and CNN with Attention. Each model was trained using k-mer integer encoding to capture both local and long-range dependencies. Among these, the CNN achieved the highest validation accuracy of 91.8%, demonstrating the effectiveness of localized motif detection. Explainable AI techniques, including Grad-CAM and Integrated Gradients, were applied to interpret model predictions and identify biologically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · Protein Structure and Dynamics · Computational Drug Discovery Methods
