# Toward AI-Assisted Sickle Cell Screening: A Controlled Comparison of CNN, Transformer, and Hybrid Architectures Using Public Blood-Smear Images

**Authors:** Linah Tasji, Hanan S. Alghamdi, Abdullah S Almalaise Al-Ghamdi

PMC · DOI: 10.3390/diagnostics16030414 · Diagnostics · 2026-01-29

## TL;DR

This paper compares AI models for detecting sickle cell disease from blood smear images, finding that CNNs and hybrid models perform best under limited data.

## Contribution

The study introduces a controlled benchmark of CNNs, Transformers, and hybrid models for SCD screening using public blood-smear images.

## Key findings

- CNN-based and hybrid models showed more stable and higher performance than transformer-only models.
- MaxViT-Tiny and DenseNet121 achieved the highest overall performance in SCD classification.
- XAI visualizations revealed that CNNs focus on localized cell morphology while hybrids use both local and contextual cues.

## Abstract

Background: Sickle cell disease (SCD) is a prevalent hereditary hemoglobinopathy associated with substantial morbidity, particularly in regions with limited access to advanced laboratory diagnostics. Conventional diagnostic workflows, including manual peripheral blood smear examination and biochemical or molecular assays, are resource-intensive, time-consuming, and subject to observer variability. Recent advances in artificial intelligence (AI) enable automated analysis of blood smear images and offer a scalable alternative for SCD screening. Methods: This study presents a controlled benchmark of CNNs, Vision Transformers, hierarchical Transformers, and hybrid CNN–Transformer architectures for image-level SCD classification using a publicly available peripheral blood smear dataset. Eleven ImageNet-pretrained models were fine-tuned under identical conditions using an explicit leakage-safe evaluation protocol, incorporating duplicate-aware, group-based data splitting and repeated splits to assess robustness. Performance was evaluated using accuracy and macro-averaged precision, recall, and F1-score, complemented by bootstrap confidence intervals, paired statistical testing, error-type analysis, and explainable AI (XAI). Results: Across repeated group-aware splits, CNN-based and hybrid architectures demonstrated more stable and consistently higher performance than transformer-only models. MaxViT-Tiny and DenseNet121 ranked highest overall, while pure ViTs showed reduced effectiveness under data-constrained conditions. Error analysis revealed a dominance of false-positive predictions, reflecting intrinsic morphological ambiguity in challenging samples. XAI visualizations suggest that CNNs focus on localized red blood cell morphology, whereas hybrid models integrate both local and contextual cues. Conclusions: Under limited-data conditions, convolutional inductive bias remains critical for robust blood-smear-based SCD classification. CNN and hybrid CNN–Transformer models offer interpretable and reliable performance, supporting their potential role as decision-support tools in screening-oriented research settings.

## Linked entities

- **Diseases:** Sickle cell disease (MONDO:0011382), SCD (MONDO:0000359)

## Full-text entities

- **Diseases:** hereditary hemoglobinopathy (MESH:D009386), SCD (MESH:D000755)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12897022/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12897022/full.md

## References

30 references — full list in the complete paper: https://tomesphere.com/paper/PMC12897022/full.md

---
Source: https://tomesphere.com/paper/PMC12897022