GEM: Context-Aware Gaze EstiMation with Visual Search Behavior Matching for Chest Radiograph
Shaonan Liu, Wenting Chen, Jie Liu, Xiaoling Luo, Linlin Shen

TL;DR
This paper introduces GEM, a novel context-aware gaze estimation model that simulates radiologists' visual search behavior in chest radiographs by integrating multimodal data and graph-based behavior matching, improving interpretability and accuracy.
Contribution
GEM is the first to incorporate context-awareness and visual behavior graph matching for more accurate and realistic gaze estimation in medical radiology interpretation.
Findings
GEM outperforms existing gaze estimation methods on four datasets.
The model demonstrates strong generalizability across different datasets.
GEM enhances interpretability of medical image analysis models.
Abstract
Gaze estimation is pivotal in human scene comprehension tasks, particularly in medical diagnostic analysis. Eye-tracking technology facilitates the recording of physicians' ocular movements during image interpretation, thereby elucidating their visual attention patterns and information-processing strategies. In this paper, we initially define the context-aware gaze estimation problem in medical radiology report settings. To understand the attention allocation and cognitive behavior of radiologists during the medical image interpretation process, we propose a context-aware Gaze EstiMation (GEM) network that utilizes eye gaze data collected from radiologists to simulate their visual search behavior patterns throughout the image interpretation process. It consists of a context-awareness module, visual behavior graph construction, and visual behavior matching. Within the context-awareness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Medical Imaging and Analysis · Gaze Tracking and Assistive Technology
MethodsSoftmax · Attention Is All You Need
