# MCI detection from handwritten drawing test using residual vision transformer

**Authors:** Mehreen Sirshar, Irum Matloob, Ayesha Tayyabah, Faiza Syed, Aliya Ashraf, Hessa Alfraihi

PMC · DOI: 10.1038/s41598-026-40716-y · 2026-02-24

## TL;DR

This paper introduces a new AI method to detect early signs of cognitive decline using drawing tests, aiming to improve accuracy and reduce subjectivity in diagnosis.

## Contribution

A novel hybrid deep learning model, ResViT, combining ResNet50 and Vision Transformer for automated MCI detection from drawing tests.

## Key findings

- The ResViT model achieved 74.09% classification accuracy in detecting MCI.
- The hybrid model improved generalization and robustness compared to traditional methods.
- Combining ResNet and Vision Transformer enhances performance in cognitive disorder classification.

## Abstract

Mild Cognitive Impairment (MCI) is a clinical condition characterized by noticeable cognitive decline that is greater than expected for an individual’s age, yet not severe enough to interfere significantly with daily life. Early detection of MCI is critical, as it offers the opportunity to intervene before progression to more severe neurodegenerative diseases such as Alzheimer’s. While traditional diagnostic methods such as the Clock Drawing Test, Trail Making Test, and Cube Copying Test are widely used by clinicians, their manual assessment process can be subjective and time-consuming. This research addresses the automation of MCI detection using deep learning techniques applied to these neuro-psychological drawing tasks. A hybrid deep learning architecture—ResViT, which integrates ResNet50 for local feature extraction and a Vision Transformer (ViT) for capturing global context within the drawings, is being proposed. The ResViT architecture showed improved generalization and robustness across test cases, achieving a classification accuracy of 74.09% and an F1 score of 0.6716. Our results demonstrate that integrating Vision Transformer and ResNet architectures into a unified hybrid model enhances performance in cognitive disorder classification tasks, offering more accurate and measurable outcomes for early dementia detection through neuropsychological screening tools.

## Full-text entities

- **Genes:** VIT (vitrin) [NCBI Gene 5212] {aka VIT1}
- **Diseases:** ND (MESH:C537849), AD (MESH:D000544), motor impairment (MESH:D000068079), dementia (MESH:D003704), CDT (MESH:D013736), PD (MESH:D010300), Cognitive Impairment (MESH:D003072), neurodegenerative (MESH:D019636), MCI (MESH:D060825), executive dysfunction (MESH:D006331)
- **Chemicals:** ResViT (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** S105466182404022X

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13031545/full.md

---
Source: https://tomesphere.com/paper/PMC13031545