SurgFed: Language-guided Multi-Task Federated Learning for Surgical Video Understanding
Zheng Fang, Ziwei Niu, Ziyue Wang, Zhu Zhuo, Haofeng Liu, Shuyang Qian, Jun Xia, Yueming Jin

TL;DR
SurgFed introduces a novel language-guided multi-task federated learning framework to improve surgical video understanding across diverse tissues and tasks, addressing key challenges in heterogeneity and task interaction.
Contribution
The paper presents SurgFed, a new federated learning approach with language-guided modules for better adaptation and task modeling in surgical video analysis.
Findings
Outperforms state-of-the-art in five datasets across four surgical types.
Enhances site-specific adaptation with language-guided channel selection.
Improves task interaction modeling with language-guided hyper aggregation.
Abstract
Surgical scene Multi-Task Federated Learning (MTFL) is essential for robot-assisted minimally invasive surgery (RAS) but remains underexplored in surgical video understanding due to two key challenges: (1) Tissue Diversity: Local models struggle to adapt to site-specific tissue features, limiting their effectiveness in heterogeneous clinical environments and leading to poor local predictions. (2) Task Diversity: Server-side aggregation, relying solely on gradient-based clustering, often produces suboptimal or incorrect parameter updates due to inter-site task heterogeneity, resulting in inaccurate localization. In light of these two issues, we propose SurgFed, a multi-task federated learning framework, enabling federated learning for surgical scene segmentation and depth estimation across diverse surgical types. SurgFed is powered by two appealing designs, i.e., Language-guided Channel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · Domain Adaptation and Few-Shot Learning · Machine Learning in Healthcare
