LLM-Assisted Multi-Teacher Continual Learning for Visual Question Answering in Robotic Surgery
Yuyang Du, Kexin Chen, Yue Zhan, Chang Han Low, Tao You, Mobarakol, Islam, Ziyu Guo, Yueming Jin, Guangyong Chen, Pheng-Ann Heng

TL;DR
This paper introduces a multi-teacher continual learning framework using a multimodal LLM to improve visual question answering in robotic surgery, effectively handling domain shifts and data imbalance without relying on old data.
Contribution
It proposes a novel multi-teacher CL framework with an adaptive weight strategy and new surgical VQA datasets, addressing domain shifts and data imbalance in exemplar-free continual learning.
Findings
Outperforms existing CL methods on new surgical datasets
Effectively handles domain shifts and data imbalance
Demonstrates robustness with multimodal LLM integration
Abstract
Visual question answering (VQA) is crucial for promoting surgical education. In practice, the needs of trainees are constantly evolving, such as learning more surgical types, adapting to different robots, and learning new surgical instruments and techniques for various surgeries. However, patient data privacy often restricts the availability of old data when updating the model, necessitating an exemplar-free continual learning (CL) setup. Prior CL studies overlooked two vital problems in the surgical domain: 1) large domain shifts from diverse surgical operations collected from multiple sources, and 2) severe data imbalance arising from the uneven presence of surgical instruments or activities. This paper proposes addressing these problems with a multimodal large language model (LLM) and an adaptive weight assignment methodology. We first develop a new multi-teacher CL framework that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Robotics and Automated Systems
