The SARAS Endoscopic Surgeon Action Detection (ESAD) dataset: Challenges and methods
Vivek Singh Bawa, Gurkirt Singh, Francis KapingA, Inna, Skarga-Bandurova, Elettra Oleari, Alice Leporini, Carmela Landolfo, Pengfei, Zhao, Xi Xiang, Gongning Luo, Kuanquan Wang, Liangzhi Li, Bowen Wang, Shang, Zhao, Li Li, Armando Stabile, Francesco Setti, Riccardo Muradore

TL;DR
This paper introduces ESAD, a large-scale dataset for surgeon action detection in endoscopic surgery, addressing challenges unique to surgical environments and aiding the development of assistive robotic systems.
Contribution
The paper presents the first comprehensive dataset for surgeon action detection in minimally invasive surgery and analyzes baseline and top models for this task.
Findings
Baseline model performance analysis provides insights into effective approaches.
Top models demonstrate significant improvements over baseline.
ESAD serves as a benchmark for future research in surgical action detection.
Abstract
For an autonomous robotic system, monitoring surgeon actions and assisting the main surgeon during a procedure can be very challenging. The challenges come from the peculiar structure of the surgical scene, the greater similarity in appearance of actions performed via tools in a cavity compared to, say, human actions in unconstrained environments, as well as from the motion of the endoscopic camera. This paper presents ESAD, the first large-scale dataset designed to tackle the problem of surgeon action detection in endoscopic minimally invasive surgery. ESAD aims at contributing to increase the effectiveness and reliability of surgical assistant robots by realistically testing their awareness of the actions performed by a surgeon. The dataset provides bounding box annotation for 21 action classes on real endoscopic video frames captured during prostatectomy, and was used as the basis of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · Advanced Neural Network Applications · Soft Robotics and Applications
