Recognizing Surgical Phases Anywhere: Few-Shot Test-time Adaptation and Task-graph Guided Refinement
Kun Yuan, Tingxuan Chen, Shi Li, Joel L. Lavanchy, Christian Heiliger, Ege \"Ozsoy, Yiming Huang, Long Bai, Nassir Navab, Vinkle Srivastav, Hongliang Ren, Nicolas Padoy

TL;DR
The paper introduces SPA, a lightweight framework that adapts foundation models for surgical phase recognition using few-shot learning, diffusion modeling, and task-graph priors, achieving state-of-the-art results across institutions.
Contribution
SPA is a novel framework combining few-shot spatial adaptation, diffusion-based temporal consistency, and self-supervised test-time adaptation for surgical workflow understanding.
Findings
Achieves state-of-the-art few-shot surgical phase recognition performance.
Outperforms full-shot models with only 32-shot labeled data.
Enables rapid customization of models with minimal annotations.
Abstract
The complexity and diversity of surgical workflows, driven by heterogeneous operating room settings, institutional protocols, and anatomical variability, present a significant challenge in developing generalizable models for cross-institutional and cross-procedural surgical understanding. While recent surgical foundation models pretrained on large-scale vision-language data offer promising transferability, their zero-shot performance remains constrained by domain shifts, limiting their utility in unseen surgical environments. To address this, we introduce Surgical Phase Anywhere (SPA), a lightweight framework for versatile surgical workflow understanding that adapts foundation models to institutional settings with minimal annotation. SPA leverages few-shot spatial adaptation to align multi-modal embeddings with institution-specific surgical scenes and phases. It also ensures temporal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHealth and Medical Research Impacts · Cardiac, Anesthesia and Surgical Outcomes · Artificial Intelligence in Healthcare and Education
MethodsALIGN · Diffusion
