MARL: Multimodal Attentional Representation Learning for Disease Prediction
Ali Hamdi, Amr Aboeleneen, Khaled Shaban

TL;DR
MARL is a multimodal attentional learning model that integrates CT images and patient records to improve lung disease prediction, outperforming existing models in accuracy and regression metrics.
Contribution
The paper introduces MARL, a novel architecture combining fuzzy segmentation, CNN, LSTM, and attention mechanisms for multimodal disease prediction under uncertainty.
Findings
MARL achieves 91% R^2 in disease progression regression.
MARL attains 97% accuracy in binary classification.
MARL outperforms state-of-the-art CNN models by 19-57% in accuracy.
Abstract
Existing learning models often utilise CT-scan images to predict lung diseases. These models are posed by high uncertainties that affect lung segmentation and visual feature learning. We introduce MARL, a novel Multimodal Attentional Representation Learning model architecture that learns useful features from multimodal data under uncertainty. We feed the proposed model with both the lung CT-scan images and their perspective historical patients' biological records collected over times. Such rich data offers to analyse both spatial and temporal aspects of the disease. MARL employs Fuzzy-based image spatial segmentation to overcome uncertainties in CT-scan images. We then utilise a pre-trained Convolutional Neural Network (CNN) to learn visual representation vectors from images. We augment patients' data with statistical features from the segmented images. We develop a Long Short-Term…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDepthwise Convolution · Pointwise Convolution · RMSProp · Depthwise Separable Convolution · Squeeze-and-Excitation Block · Inverted Residual Block · Average Pooling · Dense Connections · Batch Normalization · *Communicated@Fast*How Do I Communicate to Expedia?
