Multitask Learning in Minimally Invasive Surgical Vision: A Review
Oluwatosin Alabi, Tom Vercauteren, Miaojing Shi

TL;DR
This review discusses how multitask learning enhances surgical video analysis in minimally invasive surgery, improving understanding and efficiency by leveraging related tasks and recent machine learning advancements.
Contribution
It provides a comprehensive overview of current multitask learning systems in MIS, analyzing their benefits, limitations, and future research directions.
Findings
MTL improves surgical scene understanding accuracy
Recent trends show increased use of large models in MIS
Identifies challenges and future directions in MTL for MIS
Abstract
Minimally invasive surgery (MIS) has revolutionized many procedures and led to reduced recovery time and risk of patient injury. However, MIS poses additional complexity and burden on surgical teams. Data-driven surgical vision algorithms are thought to be key building blocks in the development of future MIS systems with improved autonomy. Recent advancements in machine learning and computer vision have led to successful applications in analyzing videos obtained from MIS with the promise of alleviating challenges in MIS videos. Surgical scene and action understanding encompasses multiple related tasks that, when solved individually, can be memory-intensive, inefficient, and fail to capture task relationships. Multitask learning (MTL), a learning paradigm that leverages information from multiple related tasks to improve performance and aid generalization, is well suited for fine-grained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · Augmented Reality Applications · Anatomy and Medical Technology
