# Decoding Multi-Omics Signatures in Lower-Grade Glioma Using Protein–Protein Interaction-Informed Graph Attention Networks and Ensemble Learning

**Authors:** Murtada K. Elbashir, Afrah Alanazi, Mahmood A. Mahmood

PMC · DOI: 10.3390/diagnostics15222894 · 2025-11-14

## TL;DR

This study uses a combination of multi-omics data and machine learning to improve classification and identify biomarkers for lower-grade gliomas.

## Contribution

A novel PPI-informed hybrid model combining multi-omics data with GAT and ensemble learning for glioma subtype classification.

## Key findings

- Multi-omics data combination improves subtype classification rates up to 0.984 ± 0.012.
- DNA methylation is the most discriminative modality for classification.
- Key subtype-specific biomarkers like UBA2, LRRC41, ANKRD53, and WDR77 were identified.

## Abstract

Background/Objectives: Lower-grade gliomas (LGGs) are a biologically and clinically heterogeneous group of brain tumors, for which molecular stratification plays essential role in diagnosis, prognosis, and therapeutic decision-making. Conventional unimodal classifiers do not necessarily describe cross-layer regulatory dynamics which entail the heterogeneity of glioma. Methods: This paper presents a protein–protein interaction (PPI)-informed hybrid model that combines multi-omics profiles, including RNA expression, DNA methylation, and microRNA expression, with a Graph Attention Network (GAT), Random Forest (RF), and logistic stacking ensemble learning. The proposed model utilizes ElasticNet-based feature selection to obtain the most informative biomarkers across omics layers, and the GAT module learns the biologically significant topological representations in the PPI network. The Synthetic Minority Over-Sampling Technique (SMOTE) was used to mitigate the class imbalance, and the model performance was assessed using a repeated five-fold stratified cross-validation approach using the following performance metrics: accuracy, precision, recall, F1-score, ROC-AUC, and AUPRC. Results: The findings illustrate that a combination of multi-omics data increases subtype classification rates (up to 0.984 ± 0.012) more than single-omics methods, and DNA methylation proves to be the most discriminative modality. In addition, analysis of interpretability using attention revealed the major subtype-specific biomarkers, including UBA2, LRRC41, ANKRD53, and WDR77, that show great biological relevance and could be used as diagnostic and therapeutic tools. Conclusions: The proposed multi-omics based on a biological and explainable framework provides a solid computational approach to molecular stratification and biomarker identification in lower-grade glioma, bridging between predictive power, biological clarification, and clinical benefits.

## Linked entities

- **Genes:** UBA2 (ubiquitin like modifier activating enzyme 2) [NCBI Gene 10054], LRRC41 (leucine rich repeat containing 41) [NCBI Gene 10489], ANKRD53 (ankyrin repeat domain 53) [NCBI Gene 79998], WDR77 (WD repeat domain 77) [NCBI Gene 79084]

## Full-text entities

- **Genes:** WDR77 (WD repeat domain 77) [NCBI Gene 79084] {aka HKMT1069, MEP-50, MEP50, Nbla10071, p44, p44/Mep50}, UBA2 (ubiquitin like modifier activating enzyme 2) [NCBI Gene 10054] {aka ACCES, ARX, HRIHFB2115, SAE2}, ANKRD53 (ankyrin repeat domain 53) [NCBI Gene 79998], LRRC41 (leucine rich repeat containing 41) [NCBI Gene 10489] {aka MUF1, PP7759}
- **Diseases:** Glioma (MESH:D005910), brain tumors (MESH:D001932), Lower- (MESH:D017116)

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12651222/full.md

---
Source: https://tomesphere.com/paper/PMC12651222