GEM: Context-Aware Gaze EstiMation with Visual Search Behavior Matching   for Chest Radiograph

Shaonan Liu; Wenting Chen; Jie Liu; Xiaoling Luo; Linlin Shen

arXiv:2408.05502·cs.CV·August 13, 2024

GEM: Context-Aware Gaze EstiMation with Visual Search Behavior Matching for Chest Radiograph

Shaonan Liu, Wenting Chen, Jie Liu, Xiaoling Luo, Linlin Shen

PDF

Open Access 1 Repo

TL;DR

This paper introduces GEM, a novel context-aware gaze estimation model that simulates radiologists' visual search behavior in chest radiographs by integrating multimodal data and graph-based behavior matching, improving interpretability and accuracy.

Contribution

GEM is the first to incorporate context-awareness and visual behavior graph matching for more accurate and realistic gaze estimation in medical radiology interpretation.

Findings

01

GEM outperforms existing gaze estimation methods on four datasets.

02

The model demonstrates strong generalizability across different datasets.

03

GEM enhances interpretability of medical image analysis models.

Abstract

Gaze estimation is pivotal in human scene comprehension tasks, particularly in medical diagnostic analysis. Eye-tracking technology facilitates the recording of physicians' ocular movements during image interpretation, thereby elucidating their visual attention patterns and information-processing strategies. In this paper, we initially define the context-aware gaze estimation problem in medical radiology report settings. To understand the attention allocation and cognitive behavior of radiologists during the medical image interpretation process, we propose a context-aware Gaze EstiMation (GEM) network that utilizes eye gaze data collected from radiologists to simulate their visual search behavior patterns throughout the image interpretation process. It consists of a context-awareness module, visual behavior graph construction, and visual behavior matching. Within the context-awareness…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tiger-sn/gem
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCOVID-19 diagnosis using AI · Medical Imaging and Analysis · Gaze Tracking and Assistive Technology

MethodsSoftmax · Attention Is All You Need