AI-Assisted surgical vision: evaluating YOLOv8 and YOLOv12 for real-time detection in colon cancer surgery
Li Li, Bin Xuan, Xin Song, Yu Tian, Xiangcai Meng, Jiexia Wen, Tao Zheng, Chenglin Liu, Yimin Wang

TL;DR
This study compares YOLOv8 and YOLOv12 for real-time detection in colon cancer surgery, finding YOLOv12 more effective in dynamic surgical scenarios.
Contribution
The study introduces YOLOv12 as a superior model for real-time detection in colon cancer surgery, particularly in handling tissue deformation.
Findings
YOLOv12 achieved significantly higher recall rates than YOLOv8 in object detection and instance segmentation.
YOLOv12 outperformed instance segmentation in [email protected] and recall for object detection.
AI-assisted technology may reduce surgical time and lower missed lymph node detection risks for junior surgeons.
Abstract
Current intraoperative navigation systems have shown significant effectiveness for organs with fixed shapes, but they struggle to adapt to the challenges of tissue deformation and displacement in gastrointestinal surgeries. This study evaluates the established YOLOv8 and the emerging YOLOv12 with enhanced feature extraction capabilities, aiming to identify an optimal real-time model for dynamic surgical scenarios to improve procedural efficiency and safety. In this multi-center retrospective study, object detection and instance segmentation was achieved by training YOLOv8 and YOLOv12 models on 1,847 images extracted from 22 surgical videos collected across four hospitals nationwide. The models were subsequently validated and tested and performance was rigorously compared using standard metrics, such as precision, recall, [email protected], [email protected]–0.95, and the size of the weight file.…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Surgical Simulation and Training · Advanced Image and Video Retrieval Techniques
