Multimodal Learning and Cognitive Processes in Radiology: MedGaze for Chest X-ray Scanpath Prediction
Akash Awasthi, Ngan Le, Zhigang Deng, Rishi Agrawal, Carol C. Wu, Hien, Van Nguyen

TL;DR
This paper introduces MedGaze, a novel system for predicting radiologists' eye gaze sequences on chest X-ray images, leveraging reports and images to improve AI understanding of medical scanpaths and potentially streamline data collection.
Contribution
The study presents a new model that predicts human scanpaths on medical images, outperforming existing computer vision models and incorporating a two-stage training process with large datasets.
Findings
MedGaze generates human-like gaze sequences with high focus on relevant regions.
The model outperforms existing computer vision models in scanpath prediction.
It sometimes surpasses human performance in scanpath redundancy and randomness.
Abstract
Predicting human gaze behavior within computer vision is integral for developing interactive systems that can anticipate user attention, address fundamental questions in cognitive science, and hold implications for fields like human-computer interaction (HCI) and augmented/virtual reality (AR/VR) systems. Despite methodologies introduced for modeling human eye gaze behavior, applying these models to medical imaging for scanpath prediction remains unexplored. Our proposed system aims to predict eye gaze sequences from radiology reports and CXR images, potentially streamlining data collection and enhancing AI systems using larger datasets. However, predicting human scanpaths on medical images presents unique challenges due to the diverse nature of abnormal regions. Our model predicts fixation coordinates and durations critical for medical scanpath prediction, outperforming existing models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Lung Cancer Diagnosis and Treatment · Topic Modeling
MethodsFocus
