An explainable-AI framework reveals novel lncRNAs specific for breast cancer subtypes
Jai Chand Patel, Avinash Veerappa, Chittibabu Guda

TL;DR
This study uses an explainable AI framework to identify lncRNAs specific to breast cancer subtypes, showing their potential for cancer subtyping and biomarker discovery.
Contribution
The novel contribution is the systematic evaluation of lncRNA-only and integrative models for multi-class breast cancer subtyping using an explainable AI framework.
Findings
XGBoost using lncRNAs alone achieved 89.2% accuracy in breast cancer subtyping.
Explainable AI identified subtype-specific biomarker panels with unique lncRNA features for each subtype.
Novel subtype-specific lncRNAs like CUFF.25255 and CUFF.26607 were found to correlate with survival outcomes.
Abstract
Long non-coding RNAs (lncRNAs) have emerged as important regulators in cancer biology; yet their potential for cancer subtyping remains underexplored particularly in the context of large-scale, multi-class supervised classification frameworks, due to limited publicly available data or their use only as auxiliary features in classification tasks. In this study, we utilized an expansive set of 7,177 lncRNAs obtained from 1,021 breast cancer (BRCA) transcriptomics datasets for subtyping using an explainable artificial intelligence (AI) framework. lncRNA, mRNA, and miRNA features were used to build machine learning (ML) models individually and in combination. Four ML classifiers: Naïve Bayes, Random Forest, Artificial Neural Network, and XGBoost were employed to evaluate subtype classification performance. Using lncRNAs alone, XGBoost demonstrated strong performance with an accuracy of…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCancer-related molecular mechanisms research · Ferroptosis and cancer prognosis · Machine Learning in Bioinformatics
