PitVis-2023 Challenge: Workflow Recognition in videos of Endoscopic Pituitary Surgery
Adrito Das, Danyal Z. Khan, Dimitrios Psychogyios, Yitong Zhang, John, G. Hanrahan, Francisco Vasconcelos, You Pang, Zhen Chen, Jinlin Wu, Xiaoyang, Zou, Guoyan Zheng, Abdul Qayyum, Moona Mazher, Imran Razzak, Tianbin Li, Jin, Ye, Junjun He, Szymon P{\l}otka, Joanna Kaleta

TL;DR
The PitVis-2023 Challenge advances automated recognition of surgical steps and instruments in endoscopic pituitary surgery videos, demonstrating the effectiveness of multi-task spatio-temporal models and providing a new benchmark dataset for the field.
Contribution
This work introduces a new dataset and challenge for workflow recognition in pituitary surgery videos, highlighting the benefits of multi-task and spatio-temporal deep learning models.
Findings
Top models improved macro-F1 scores by over 50% in step recognition.
Multi-task and spatio-temporal models outperform single-task spatial models.
The dataset and benchmark facilitate further research in minimally invasive surgery recognition.
Abstract
The field of computer vision applied to videos of minimally invasive surgery is ever-growing. Workflow recognition pertains to the automated recognition of various aspects of a surgery: including which surgical steps are performed; and which surgical instruments are used. This information can later be used to assist clinicians when learning the surgery; during live surgery; and when writing operation notes. The Pituitary Vision (PitVis) 2023 Challenge tasks the community to step and instrument recognition in videos of endoscopic pituitary surgery. This is a unique task when compared to other minimally invasive surgeries due to the smaller working space, which limits and distorts vision; and higher frequency of instrument and step switching, which requires more precise model predictions. Participants were provided with 25-videos, with results presented at the MICCAI-2023 conference as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training
