A spatio-temporal network for video semantic segmentation in surgical videos
Maria Grammatikopoulou, Ricardo Sanchez-Matilla, Felix Bragman, David, Owen, Lucy Culshaw, Karen Kerr, Danail Stoyanov, Imanol Luengo

TL;DR
This paper introduces a novel spatio-temporal network architecture that enhances video semantic segmentation in surgical videos by improving temporal consistency and accuracy, which is crucial for clinical applications.
Contribution
The paper presents a new spatio-temporal decoder that can be added to existing segmentation models to improve temporal consistency in surgical video analysis.
Findings
Improved segmentation accuracy on CholecSeg8k and private datasets.
Enhanced temporal consistency across video frames.
Model adaptable to various segmentation encoders.
Abstract
Semantic segmentation in surgical videos has applications in intra-operative guidance, post-operative analytics and surgical education. Segmentation models need to provide accurate and consistent predictions since temporally inconsistent identification of anatomical structures can impair usability and hinder patient safety. Video information can alleviate these challenges leading to reliable models suitable for clinical use. We propose a novel architecture for modelling temporal relationships in videos. The proposed model includes a spatio-temporal decoder to enable video semantic segmentation by improving temporal consistency across frames. The encoder processes individual frames whilst the decoder processes a temporal batch of adjacent frames. The proposed decoder can be used on top of any segmentation encoder to improve temporal consistency. Model performance was evaluated on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · Radiomics and Machine Learning in Medical Imaging · Colorectal Cancer Screening and Detection
