LRTD: Long-Range Temporal Dependency based Active Learning for Surgical Workflow Recognition
Xueying Shi, Yueming Jin, Qi Dou, Pheng-Ann Heng

TL;DR
This paper introduces an active learning approach for surgical workflow recognition that leverages long-range temporal dependencies in videos, reducing annotation effort while maintaining high accuracy.
Contribution
It proposes a novel active learning method using a non-local recurrent convolutional network to select the most informative video clips for annotation, improving efficiency in surgical video analysis.
Findings
Outperforms state-of-the-art active learning methods on Cholec80 dataset.
Achieves comparable performance to full-data training with only 50% of labeled samples.
Effectively captures long-range dependencies to enhance surgical workflow recognition.
Abstract
Automatic surgical workflow recognition in video is an essentially fundamental yet challenging problem for developing computer-assisted and robotic-assisted surgery. Existing approaches with deep learning have achieved remarkable performance on analysis of surgical videos, however, heavily relying on large-scale labelled datasets. Unfortunately, the annotation is not often available in abundance, because it requires the domain knowledge of surgeons. In this paper, we propose a novel active learning method for cost-effective surgical video analysis. Specifically, we propose a non-local recurrent convolutional network (NL-RCNet), which introduces non-local block to capture the long-range temporal dependency (LRTD) among continuous frames. We then formulate an intra-clip dependency score to represent the overall dependency within this clip. By ranking scores among clips in unlabelled data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · Reservoir Engineering and Simulation Methods · Medical Image Segmentation Techniques
MethodsResidual Connection · Non-Local Operation · 1x1 Convolution · Non-Local Block
