# CRISP: a correlation-filtered recursive feature elimination and integration of SMOTE pipeline for gait-based Parkinson’s disease screening

**Authors:** Namra Afzal, Javaid Iqbal, Asim Waris, Muhammad Jawad Khan, Fawwaz Hazzazi, Hasnain Ali, Muhammad Adeel Ijaz, Syed Omer Gilani

PMC · DOI: 10.3389/fncom.2025.1660963 · Frontiers in Computational Neuroscience · 2025-10-10

## TL;DR

CRISP is a new pipeline that improves early detection of Parkinson's disease by analyzing gait data with wearable sensors, using advanced feature selection and balancing techniques.

## Contribution

CRISP introduces a novel VGRF-based pipeline combining correlation filtering, RFE, and SMOTE for PD detection, along with a subject-wise evaluation protocol.

## Key findings

- CRISP improved subject-wise PD detection accuracy to 98.3% with XGBoost.
- Severity grading accuracy increased to 99.3% using CRISP with XGBoost.
- The pipeline enhances generalizability and aligns with clinical diagnostic workflows.

## Abstract

Parkinson’s disease (PD) is the fastest-growing neurodegenerative disorder, with subtle gait changes such as reduced vertical ground-reaction forces (VGRF) often preceding motor symptoms. These gait abnormalities, measurable via wearable VGRF sensors, offer a non-invasive means for early PD detection. However, current computational approaches often suffer from redundant features and class imbalance, limiting both accuracy and generalizability.

We propose CRISP (Correlation-filtered Recursive Feature Elimination and Integration of SMOTE Pipeline for Gait-Based Parkinson’s Disease Screening), a lightweight multistage framework that sequentially applies correlation-based feature pruning, recursive feature elimination (RFE), and Synthetic Minority Oversampling Technique (SMOTE) based class balancing. To ensure clinically meaningful evaluation, a novel subject-wise protocol was also introduced that assigns one prediction per individual enhancing patient-level variability capture and better aligning with diagnostic workflows. Using 306 VGRF recordings (93 PD, 76 controls), five classifiers, i.e., ﻿k-Nearest Neighbours (KNN), Decision Tree (DT), Random Forest (RF), Gradient boosting (GB), and Extreme Gradient Boosting (XGBoost) were evaluated for both binary PD detection and multiclass severity grading.

CRISP consistently improved performance across all models under 5-fold cross-validation. XGBoost achieved the highest performance, increasing subject-wise PD detection accuracy from 96.1 ± 0.8% to 98.3 ± 0.8%, and severity grading accuracy from 96.2 ± 0.7% to 99.3 ± 0.5%.

CRISP is the first VGRF-based pipeline to combine correlation-filtered feature pruning, recursive feature elimination, and SMOTE to enhance PD detection performance, while also introducing a subject-wise evaluation protocol that captures patient-level variability for truly personalized diagnostics. These twin novelties deliver clinically significant gains and lay the foundation for real-time, on-device PD detection and severity monitoring.

## Linked entities

- **Diseases:** Parkinson’s disease (MONDO:0005180)

## Full-text entities

- **Diseases:** PD (MESH:D010300), gait abnormalities (MESH:D020233), neurodegenerative disorder (MESH:D019636)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12549659/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12549659/full.md

## References

39 references — full list in the complete paper: https://tomesphere.com/paper/PMC12549659/full.md

---
Source: https://tomesphere.com/paper/PMC12549659