$\mathsf{CSMAE~}$:~Cataract Surgical Masked Autoencoder (MAE) based Pre-training
Nisarg A. Shah, Wele Gedara Chaminda Bandara, Shameema Skider, S., Swaroop Vedula, Vishal M. Patel

TL;DR
This paper introduces CSMAE, a Masked Autoencoder pretraining method tailored for cataract surgery videos, which improves analysis accuracy and robustness, especially in low-data scenarios, by selecting tokens based on spatiotemporal importance.
Contribution
The paper presents a novel MAE-based pretraining approach for surgical videos that incorporates spatiotemporal token importance, creating a large dataset and achieving state-of-the-art results.
Findings
Outperforms existing self-supervised pretraining methods.
Enhances model robustness in low-data regimes.
Sets new benchmarks on D99 and Cataract-101 datasets.
Abstract
Automated analysis of surgical videos is crucial for improving surgical training, workflow optimization, and postoperative assessment. We introduce a CSMAE, Masked Autoencoder (MAE)-based pretraining approach, specifically developed for Cataract Surgery video analysis, where instead of randomly selecting tokens for masking, they are selected based on the spatiotemporal importance of the token. We created a large dataset of cataract surgery videos to improve the model's learning efficiency and expand its robustness in low-data regimes. Our pre-trained model can be easily adapted for specific downstream tasks via fine-tuning, serving as a robust backbone for further analysis. Through rigorous testing on a downstream step-recognition task on two Cataract Surgery video datasets, D99 and Cataract-101, our approach surpasses current state-of-the-art self-supervised pretraining and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRetinal Imaging and Analysis · Medical Imaging and Analysis
