Data-Augmented Multimodal Feature Fusion for Multiclass Visual Recognition of Oral Cancer Lesions
Joy Naoum, Revana Salama, Ali Hamdi

TL;DR
This paper introduces a data-augmented multimodal feature fusion framework within a VR-assisted system to improve early oral cancer lesion recognition, addressing dataset limitations and modality dependence in existing deep learning approaches.
Contribution
It proposes a novel data-augmentation driven multimodal fusion method integrated with VR for enhanced oral cancer diagnosis, improving robustness and accuracy over traditional models.
Findings
Achieved 82.57% accuracy on 2-class classification
Outperformed traditional single-stream CNN models
Enhanced feature diversity and model robustness
Abstract
Oral cancer is frequently diagnosed at later stages due to its similarity to other lesions. Existing research on computer aided diagnosis has made progress using deep learning; however, most approaches remain limited by small, imbalanced datasets and a dependence on single-modality features, which restricts model generalization in real-world clinical settings. To address these limitations, this study proposes a novel data-augmentation driven multimodal feature-fusion framework integrated within a (Vision Recognition)VR assisted oral cancer recognition system. Our method combines extensive data centric augmentation with fused clinical and image-based representations to enhance model robustness and reduce diagnostic ambiguity. Using a stratified training pipeline and an EfficientNetV2 B1 backbone, the system improves feature diversity, mitigates imbalance, and strengthens the learned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Cutaneous Melanoma Detection and Management · Brain Tumor Detection and Classification
