PitVis-2023 Challenge: Workflow Recognition in videos of Endoscopic Pituitary Surgery

Adrito Das; Danyal Z. Khan; Dimitrios Psychogyios; Yitong Zhang; John G. Hanrahan; Francisco Vasconcelos; You Pang; Zhen Chen; Jinlin Wu; Xiaoyang Zou; Guoyan Zheng; Abdul Qayyum; Moona Mazher; Imran Razzak; Tianbin Li; Jin Ye; Junjun He; Szymon Płotka; Joanna Kaleta; Amine Yamlahi; Antoine Jund; Patrick Godau; Satoshi Kondo; Satoshi Kasai; Kousuke Hirasawa; Dominik Rivoir; Alejandra Pérez; Santiago Rodriguez; Pablo Arbeláez; Danail Stoyanov; Hani J. Marcus; Sophia Bano

arXiv:2409.01184·cs.CV·September 4, 2024·2 cites

Open Access · 1 dataset

TL;DR

The PitVis-2023 Challenge advances automated recognition of surgical steps and instruments in endoscopic pituitary surgery videos, demonstrating the effectiveness of multi-task spatio-temporal models and providing a new benchmark dataset for the field.

Contribution

This work introduces a new dataset and challenge for workflow recognition in pituitary surgery videos, highlighting the benefits of multi-task and spatio-temporal deep learning models.

Findings

01. Top models improved macro-F1 scores by over 50% in step recognition relative to purely spatial single-task models.

02. Multi-task and spatio-temporal models outperform single-task spatial models.

03. The dataset and benchmark facilitate further research in minimally invasive surgery recognition.
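
For readers unfamiliar with the metric named in the findings, the sketch below shows how a macro-F1 score can be computed for per-frame step predictions: the F1 score is calculated separately for each class and then averaged without weighting, so rare steps count as much as frequent ones. This is a minimal illustration only. The function name `macro_f1`, the toy labels, and the convention of scoring an absent class as 0 are assumptions, not the challenge's official evaluation code.

```python
from collections import Counter


def macro_f1(y_true, y_pred, labels=None):
    """Unweighted mean of per-class F1 scores (illustrative sketch)."""
    if labels is None:
        labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1          # correct prediction for class t
        else:
            fp[p] += 1          # class p was predicted but wrong
            fn[t] += 1          # class t was missed
    scores = []
    for c in labels:
        denom = 2 * tp[c] + fp[c] + fn[c]
        # Assumed convention: a class with no true or predicted frames scores 0.
        scores.append(2 * tp[c] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical per-frame step labels for a short clip (not challenge data).
truth = [0, 0, 0, 0, 1, 1, 2, 2]
pred = [0, 0, 0, 1, 1, 1, 2, 0]
print(f"macro-F1: {macro_f1(truth, pred):.3f}")
```

Because every class contributes equally to the average, a model that ignores short or infrequent steps is penalised more heavily than it would be under plain frame accuracy.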

Abstract

The field of computer vision applied to videos of minimally invasive surgery is ever-growing. Workflow recognition pertains to the automated recognition of various aspects of a surgery, including which surgical steps are performed and which surgical instruments are used. This information can later be used to assist clinicians when learning the surgery, during live surgery, and when writing operation notes. The Pituitary Vision (PitVis) 2023 Challenge tasks the community with step and instrument recognition in videos of endoscopic pituitary surgery. This is a unique task when compared to other minimally invasive surgeries due to the smaller working space, which limits and distorts vision, and the higher frequency of instrument and step switching, which requires more precise model predictions. Participants were provided with 25 videos, with results presented at the MICCAI-2023 conference as…

Code & Models

Datasets

UCL-WEISS/PitVis-2023
dataset · 49 dl

Taxonomy

Topics: Surgical Simulation and Training