SurgPETL: Parameter-Efficient Image-to-Surgical-Video Transfer Learning for Surgical Phase Recognition
Shu Yang, Zhiyuan Cai, Luyang Luo, Ning Ma, Shuchang Xu, Hao Chen

TL;DR
This paper introduces SurgPETL, a parameter-efficient transfer learning framework that adapts image pre-trained models for surgical video phase recognition, addressing data scarcity and complex spatial-temporal modeling challenges.
Contribution
It proposes a novel SurgPETL benchmark and a Spatial-Temporal Adaptation module to enhance surgical phase recognition using minimal fine-tuning of pre-trained models.
Findings
SurgPETL outperforms existing methods on multiple datasets.
The STA module improves spatial-temporal feature extraction.
Pre-trained ViTs effectively transfer to surgical video tasks.
Abstract
Capitalizing on image-level pre-trained models for various downstream tasks has recently emerged with promising performance. However, the paradigm of "image pre-training followed by video fine-tuning" for high-dimensional video data inevitably poses significant performance bottlenecks. Furthermore, in the medical domain, many surgical video tasks encounter additional challenges posed by the limited availability of video data and the necessity for comprehensive spatial-temporal modeling. Recently, Parameter-Efficient Image-to-Video Transfer Learning has emerged as an efficient and effective paradigm for video action recognition tasks, which employs image-level pre-trained models with promising feature transferability and involves cross-modality temporal modeling with minimal fine-tuning. Nevertheless, the effectiveness and generalizability of this paradigm within intricate surgical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsColorectal Cancer Surgical Treatments · Medical Image Segmentation Techniques · Advanced X-ray Imaging Techniques
MethodsAdapter
