# A Novel Framework for Cardiovascular Disease Detection Using a Hybrid CWT-SIFT Image Representation and a Lightweight Residual Attention Network

**Authors:** Imane El Boujnouni

PMC · DOI: 10.3390/diagnostics16010005 · Diagnostics · 2025-12-19

## TL;DR

This paper introduces a new framework for detecting cardiovascular disease using a combination of wavelet and SIFT features with a lightweight neural network, achieving high accuracy.

## Contribution

The novel framework combines CWT and SIFT features with a lightweight residual attention network for efficient and accurate CVD detection.

## Key findings

- The framework achieved 99.60% accuracy using five-fold cross-validation.
- It outperforms existing state-of-the-art methods in classification performance.

## Abstract

Background: The mortality and morbidity rates of cardiovascular disease (CVD) are rising sharply in many developed and developing countries. CVD is a fatal disease that requires early and timely diagnosis to prevent further damage and ultimately save patients’ lives. In recent years, numerous studies have explored the automated identification of different categories of CVDs using various deep learning classifiers. However, they often rely on a substantial amount of data. The lack of representative training samples in real-world scenarios, especially in developing countries, poses a significant challenge that hinders the successful training of accurate predictive models. In this study, we introduce a framework to address this gap. Methods: The core novelty of our framework is the combination of Multi-Resolution Wavelet Features with Scale-Invariant Feature Transform (SIFT) keypoint density maps and a lightweight residual attention neural network (ResAttNet). Our hybrid approach transforms one-dimensional ECG signals into a three-channel image representation. Specifically, the CWT is used to extract hidden features in the time-frequency domain to create the first two image channels. Subsequently, the SIFT algorithm is implemented to capture additional significant features to generate the third channel. These three-channel images are then fed to our custom residual attention neural network to enhance classification performance. To tackle the challenge of class imbalance present in our dataset, we employed a hybrid strategy combining the Synthetic Minority Over-sampling Technique (SMOTE) with Edited Nearest Neighbors (ENN) to balance class samples and integrated Focal Loss into the training process to help the model focus on hard-to-classify instances. Results: The performance metrics achieved using five-fold cross-validation are 99.60% accuracy, 97.38% precision, 98.53% recall, and 97.37% F1-score. Conclusions: The experimental results showed that our proposed method outperforms current state-of-the-art methods. The primary practical implication of this work is that by combining a novel, information-rich feature representation with a lightweight classifier, our framework offers a highly accurate and computationally efficient solution, making it a significant step towards developing accessible and scalable computer-aided screening tools.

## Linked entities

- **Diseases:** cardiovascular disease (MONDO:0004995)

## Full-text entities

- **Diseases:** CVD (MESH:D002318)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12785799/full.md

## Figures

18 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12785799/full.md

## References

44 references — full list in the complete paper: https://tomesphere.com/paper/PMC12785799/full.md

---
Source: https://tomesphere.com/paper/PMC12785799