Enjoying Information Dividend: Gaze Track-based Medical Weakly Supervised Segmentation

Zhisong Wang; Yiwen Ye; Ziyang Chen; and Yong Xia

arXiv:2505.22230·cs.CV·May 29, 2025

Enjoying Information Dividend: Gaze Track-based Medical Weakly Supervised Segmentation

Zhisong Wang, Yiwen Ye, Ziyang Chen, and Yong Xia

PDF

Open Access

TL;DR

GradTrack introduces a novel gaze-aware framework that leverages detailed physician gaze data to improve weakly supervised medical image segmentation, significantly enhancing accuracy and narrowing the gap with fully supervised methods.

Contribution

The paper presents GradTrack, a new framework that fully exploits gaze data, including fixation points, durations, and order, for improved weakly supervised segmentation in medical imaging.

Findings

01

Outperforms existing gaze-based methods with Dice improvements of 3.21% and 2.61%.

02

Narrowing the performance gap with fully supervised models.

03

Effective multi-level gaze supervision during decoding process.

Abstract

Weakly supervised semantic segmentation (WSSS) in medical imaging struggles with effectively using sparse annotations. One promising direction for WSSS leverages gaze annotations, captured via eye trackers that record regions of interest during diagnostic procedures. However, existing gaze-based methods, such as GazeMedSeg, do not fully exploit the rich information embedded in gaze data. In this paper, we propose GradTrack, a framework that utilizes physicians' gaze track, including fixation points, durations, and temporal order, to enhance WSSS performance. GradTrack comprises two key components: Gaze Track Map Generation and Track Attention, which collaboratively enable progressive feature refinement through multi-level gaze supervision during the decoding process. Experiments on the Kvasir-SEG and NCI-ISBI datasets demonstrate that GradTrack consistently outperforms existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaze Tracking and Assistive Technology · Visual Attention and Saliency Detection · Domain Adaptation and Few-Shot Learning