Multimodal Deep Learning for Diabetic Foot Ulcer Staging Using Integrated RGB and Thermal Imaging
Gulengul Mermer, Mustafa Furkan Aksu, Gozde Ozsezer, Sevki Cetinkalp, Orhan Er, Mehmet Kemal Gullu

TL;DR
This study demonstrates that combining RGB and thermal imaging in deep learning models significantly improves the accuracy of diabetic foot ulcer staging, using a portable Raspberry Pi-based system and a diverse dataset.
Contribution
It introduces a multimodal imaging system and evaluates its effectiveness across multiple deep learning architectures for DFU classification.
Findings
Multimodal RGB+Thermal data outperforms single-modal approaches.
VGG16 trained on RGB+Thermal achieved 93.25% accuracy.
Grad-CAM shows thermal data highlights temperature anomalies in ulcers.
Abstract
Diabetic foot ulcers (DFU) are one of the serious complications of diabetes that can lead to amputations and high healthcare costs. Regular monitoring and early diagnosis are critical for reducing the clinical burden and the risk of amputation. The aim of this study is to investigate the impact of using multimodal images on deep learning models for the classification of DFU stages. To this end, we developed a Raspberry Pi-based portable imaging system capable of simultaneously capturing RGB and thermal images. Using this prototype, a dataset consisting of 1,205 samples was collected in a hospital setting. The dataset was labeled by experts into six distinct stages. To evaluate the models performance, we prepared three different training sets: RGB-only, thermal-only, and RGB+Thermal (with the thermal image added as a fourth channel). We trained these training sets on the DenseNet121,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
