Gaze-Assisted Medical Image Segmentation
Leila Khaertdinova, Ilya Pershin, Tatiana Shmykova, Bulat Ibragimov

TL;DR
This paper introduces a semi-supervised medical image segmentation method that uses human gaze data to improve segmentation accuracy, fine-tuning MedSAM for better clinical applicability.
Contribution
It demonstrates that incorporating human gaze as a prompt in MedSAM significantly enhances segmentation performance on abdominal CT scans.
Findings
Gaze-assisted MedSAM achieved an average Dice coefficient of 90.5%.
Outperformed state-of-the-art segmentation models like nnUNetV2 and ResUNet.
Validated on a public dataset of 120 CT scans with 16 organs.
Abstract
The annotation of patient organs is a crucial part of various diagnostic and treatment procedures, such as radiotherapy planning. Manual annotation is extremely time-consuming, while its automation using modern image analysis techniques has not yet reached levels sufficient for clinical adoption. This paper investigates the idea of semi-supervised medical image segmentation using human gaze as interactive input for segmentation correction. In particular, we fine-tuned the Segment Anything Model in Medical Images (MedSAM), a public solution that uses various prompt types as additional input for semi-automated segmentation correction. We used human gaze data from reading abdominal images as a prompt for fine-tuning MedSAM. The model was validated on a public WORD database, which consists of 120 CT scans of 16 abdominal organs. The results of the gaze-assisted MedSAM were shown to be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaze Tracking and Assistive Technology · Brain Tumor Detection and Classification · Advanced Computing and Algorithms
