Less is More: Surgical Phase Recognition from Timestamp Supervision
Xinpeng Ding, Xinjian Yan, Zixun Wang, Wei Zhao, Jian Zhuang, Xiaowei, Xu, Xiaomeng Li

TL;DR
This paper introduces timestamp supervision for surgical phase recognition, significantly reducing annotation effort while maintaining competitive accuracy through a novel uncertainty-aware temporal diffusion method.
Contribution
It proposes a new timestamp supervision approach and the UATD method to generate reliable pseudo labels, improving surgical phase recognition with less manual annotation.
Findings
Timestamp annotation reduces 74% annotation time.
UATD effectively diffuses labels to high-confidence frames.
Method achieves competitive results with full supervision.
Abstract
Surgical phase recognition is a fundamental task in computer-assisted surgery systems. Most existing works are under the supervision of expensive and time-consuming full annotations, which require the surgeons to repeat watching videos to find the precise start and end time for a surgical phase. In this paper, we introduce timestamp supervision for surgical phase recognition to train the models with timestamp annotations, where the surgeons are asked to identify only a single timestamp within the temporal boundary of a phase. This annotation can significantly reduce the manual annotation cost compared to the full annotations. To make full use of such timestamp supervisions, we propose a novel method called uncertainty-aware temporal diffusion (UATD) to generate trustworthy pseudo labels for training. Our proposed UATD is motivated by the property of surgical videos, i.e., the phases are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training
MethodsDiffusion
