Real-time analysis of cataract surgery videos using statistical models
Katia Charri\`ere, Gwenol\'e Quellec, Mathieu Lamard, David Martiano,, Guy Cazuguel, Gouenou Coatrieux, B\'eatrice Cochener

TL;DR
This paper presents a real-time system for analyzing cataract surgery videos using statistical models to recognize surgical steps and phases, aiding surgeon training and support during operations.
Contribution
It introduces a multilevel statistical model that combines tool presence and motion analysis for real-time surgical step recognition, adaptable to various surgeries.
Findings
High accuracy with tool presence ($A_z$ = 0.983)
Moderate accuracy with motion analysis ($A_z$ = 0.759)
System applicable to different surgical procedures
Abstract
The automatic analysis of the surgical process, from videos recorded during surgeries, could be very useful to surgeons, both for training and for acquiring new techniques. The training process could be optimized by automatically providing some targeted recommendations or warnings, similar to the expert surgeon's guidance. In this paper, we propose to reuse videos recorded and stored during cataract surgeries to perform the analysis. The proposed system allows to automatically recognize, in real time, what the surgeon is doing: what surgical phase or, more precisely, what surgical step he or she is performing. This recognition relies on the inference of a multilevel statistical model which uses 1) the conditional relations between levels of description (steps and phases) and 2) the temporal relations among steps and among phases. The model accepts two types of inputs: 1) the presence of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Video Analysis and Summarization · Human Pose and Action Recognition
