ReCAP: Recursive Cross Attention Network for Pseudo-Label Generation in Robotic Surgical Skill Assessment
Julien Quarez, Marc Modat, Sebastien Ourselin, Jonathan Shapey, Alejandro Granados

TL;DR
This paper introduces ReCAP, a recursive cross attention network that generates pseudo-labels for surgical skill assessment by tracking OSATS scores throughout a session, improving the interpretability and accuracy of evaluations.
Contribution
We propose a weakly-supervised recurrent transformer model that predicts OSATS scores from kinematic data, enabling detailed performance tracking and improved GRS prediction in surgical skill assessment.
Findings
Outperforms state-of-the-art in GRS prediction using kinematic data (SCC 0.83-0.88)
Matches state-of-the-art performance with video data
Achieves high accuracy in predicting specific OSATS scores (SCC 0.56-0.95)
Abstract
In surgical skill assessment, the Objective Structured Assessments of Technical Skills (OSATS) and Global Rating Scale (GRS) are well-established tools for evaluating surgeons during training. These metrics, along with performance feedback, help surgeons improve and reach practice standards. Recent research on the open-source JIGSAWS dataset, which includes both GRS and OSATS labels, has focused on regressing GRS scores from kinematic data, video, or their combination. However, we argue that regressing GRS alone is limiting, as it aggregates OSATS scores and overlooks clinically meaningful variations during a surgical trial. To address this, we developed a weakly-supervised recurrent transformer model that tracks a surgeon's performance throughout a session by mapping hidden states to six OSATS, derived from kinematic data. These OSATS scores are averaged to predict GRS, allowing us to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCardiac, Anesthesia and Surgical Outcomes
