OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding
Ming Hu, Peng Xia, Lin Wang, Siyuan Yan, Feilong Tang and, Zhongxing Xu, Yimin Luo, Kaimin Song, Jurgen Leitner, Xuelian, Cheng, Jun Cheng, Chi Liu, Kaijing Zhou, Zongyuan Ge

TL;DR
OphNet is a comprehensive, large-scale ophthalmic surgical video dataset with detailed annotations, designed to advance AI-driven surgical workflow understanding and facilitate temporal analysis in diverse ophthalmic procedures.
Contribution
It introduces OphNet, the largest annotated ophthalmic surgical video benchmark, with diverse procedures, hierarchical annotations, and time-localized labels, addressing previous dataset limitations.
Findings
Largest ophthalmic surgical video dataset to date
Enables comprehensive temporal and hierarchical workflow analysis
Supports development of AI systems for surgical understanding
Abstract
Surgical scene perception via videos is critical for advancing robotic surgery, telesurgery, and AI-assisted surgery, particularly in ophthalmology. However, the scarcity of diverse and richly annotated video datasets has hindered the development of intelligent systems for surgical workflow analysis. Existing datasets face challenges such as small scale, lack of diversity in surgery and phase categories, and absence of time-localized annotations. These limitations impede action understanding and model generalization validation in complex and diverse real-world surgical scenarios. To address this gap, we introduce OphNet, a large-scale, expert-annotated video benchmark for ophthalmic surgical workflow understanding. OphNet features: 1) A diverse collection of 2,278 surgical videos spanning 66 types of cataract, glaucoma, and corneal surgeries, with detailed annotations for 102 unique…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · Retinal Imaging and Analysis · Medical Imaging and Analysis
