Class-Incremental Domain Adaptation with Smoothing and Calibration for Surgical Report Generation
Mengya Xu, Mobarakol Islam, Chwee Ming Lim, Hongliang Ren

TL;DR
This paper introduces a class-incremental domain adaptation framework using smoothing and calibration techniques to improve surgical report generation across different robotic surgery domains, handling new classes and domain shifts effectively.
Contribution
It proposes a novel multi-layer transformer-based model with class-incremental learning, contrastive loss, Gaussian smoothing, and label smoothing for robust surgical report generation.
Findings
Improved performance in surgical report generation across domains.
Effective handling of unseen classes with few-shot learning.
Enhanced domain invariance and calibration in predictions.
Abstract
Generating surgical reports aimed at surgical scene understanding in robot-assisted surgery can contribute to documenting entry tasks and post-operative analysis. Despite the impressive outcome, the deep learning model degrades the performance when applied to different domains encountering domain shifts. In addition, there are new instruments and variations in surgical tissues appeared in robotic surgery. In this work, we propose class-incremental domain adaptation (CIDA) with a multi-layer transformer-based model to tackle the new classes and domain shift in the target domain to generate surgical reports during robotic surgery. To adapt incremental classes and extract domain invariant features, a class-incremental (CI) learning method with supervised contrastive (SupCon) loss is incorporated with a feature extractor. To generate caption from the extracted feature, curriculum by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Domain Adaptation and Few-Shot Learning
MethodsLabel Smoothing
