# A Hierarchical Multi-View Deep Learning Framework for Autism Classification Using Structural and Functional MRI

**Authors:** Nayif Mohammed Hammash, Mohammed Chachan Younis

PMC · DOI: 10.3390/jimaging12030109 · 2026-03-04

## TL;DR

This paper introduces a new deep learning framework that combines structural and functional MRI data to improve autism classification accuracy.

## Contribution

A novel hierarchical multi-view deep learning framework that integrates structural and functional MRI data for autism classification.

## Key findings

- The proposed framework outperforms existing baselines in autism classification using structural MRI data with 90.19% accuracy.
- For functional MRI data, the framework achieves 88.93% accuracy in autism classification.
- The framework demonstrates robustness and generalization in integrating structural and functional brain representations.

## Abstract

Autism classification is challenging due to the subtle, heterogeneous, and overlapping neural activation profiles that occur in individuals with autism. Novel deep learning approaches, such as Convolutional Neural Networks (CNNs) and their variants, as well as Transformers, have shown moderate performance in discriminating between autism and normal cohorts; yet, they often struggle to jointly capture the spatial–structural and temporal–functional variations present in autistic brains. To overcome these shortcomings, we propose a novel hierarchical deep learning framework that extracts the inherent spatial dependencies from the dual-modal MRI scans. For sMRI, we develop a 3D Hierarchical Convolutional Neural Network to capture both fine and coarse anatomical structures via multi-view projections along the axial, sagittal, and coronal planes. For the fMRI case, we introduced a bidirectional LSTM-based temporal encoder to examine regional brain dynamics and functional connectivity. The sequential embeddings and correlations are combined into a unified spatiotemporal representation of functional imaging, which is then classified using a multilayer perceptron to ensure continuity in diagnostic predictions across the examined modalities. Finally, a cross-modality fusion scheme was employed to integrate feature representations of both modalities. Extensive evaluations on the ABIDE I dataset (NYU repository) demonstrate that our proposed framework outperforms existing baselines, including Vision/Swin Transformers and various newly developed CNN variants. For the sMRI branch, we achieved 90.19 ± 0.12% accuracy (precision: 90.85 ± 0.16%, recall: 89.27 ± 0.19%, F1-score: 90.05 ± 0.14%, and focal loss: 0.3982). For the fMRI branch, we achieved an accuracy of 88.93 ± 0.15% (precision: 89.78 ± 0.18%, recall: 88.29 ± 0.20%, F1-score: 89.03 ± 0.17%, and focal loss of 0.4437). These outcomes affirm the superior generalization and robustness of the proposed framework for integrating structural and functional brain representations to achieve accurate autism classification.

## Linked entities

- **Diseases:** autism (MONDO:0005260)

## Full-text entities

- **Genes:** CSRP3 (cysteine and glycine rich protein 3) [NCBI Gene 8048] {aka CLP, CMD1M, CMH12, CRP3, MLP}
- **Diseases:** NYU (MESH:C563594), ASD (MESH:D000067877), restricted or repetitive behaviors and interests (MESH:D002313), Alzheimer's (MESH:D000544), deficits (MESH:D009461), morphological abnormalities (MESH:D000013), injury to (MESH:D014947), neurodevelopmental disorder (MESH:D002658), schizophrenia (MESH:D012559), Autism (MESH:D001321), Parkinson's (MESH:D010300), psychiatric disorders (MESH:D001523), TC (OMIM:275350), ASD-related abnormalities (MESH:D002659)
- **Chemicals:** ViT (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13027856/full.md

---
Source: https://tomesphere.com/paper/PMC13027856