Revisiting the Evaluation Bias Introduced by Frame Sampling Strategies in Surgical Video Segmentation Using SAM2
Utku Ozbulak, Seyed Amir Mousavi, Francesca Tozzi, Niki Rashidian, Wouter Willaert, Wesley De Neve, Joris Vankerschaver

TL;DR
This paper examines how different frame sampling strategies affect the evaluation of surgical video segmentation models, revealing biases that favor sparse sampling under certain conditions and emphasizing the importance of real-time, high-FPS evaluation for accurate assessment.
Contribution
It uncovers evaluation biases caused by inconsistent annotation protocols and demonstrates the superiority of high-FPS segmentation in real-time surgical video analysis.
Findings
Lower frame rates can falsely appear better due to smoothing effects.
High-FPS segmentation provides more stable and accurate results for dynamic objects.
Survey results favor high-FPS overlays for better human perception.
Abstract
Real-time video segmentation is a promising opportunity for AI-assisted surgery, offering intraoperative guidance by identifying tools and anatomical structures. Despite growing interest in surgical video segmentation, annotation protocols vary widely across datasets -- some provide dense, frame-by-frame labels, while others rely on sparse annotations sampled at low frame rates such as 1 FPS. In this study, we investigate how such inconsistencies in annotation density and frame rate sampling influence the evaluation of zero-shot segmentation models, using SAM2 as a case study for cholecystectomy procedures. Surprisingly, we find that under conventional sparse evaluation settings, lower frame rates can appear to outperform higher ones due to a smoothing effect that conceals temporal inconsistencies. However, when assessed under real-time streaming conditions, higher frame rates yield…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Medical Imaging and Analysis · Medical Image Segmentation Techniques
