SSTFB: Leveraging self-supervised pretext learning and temporal self-attention with feature branching for real-time video polyp segmentation
Ziang Xu, Jens Rittscher, Sharib Ali

TL;DR
This paper introduces SSTFB, a real-time video polyp segmentation method that combines self-supervised pretext learning and temporal self-attention with feature branching, significantly improving accuracy and generalization over existing methods.
Contribution
The paper presents a novel end-to-end framework integrating self-supervised learning and spatial-temporal self-attention for enhanced real-time video polyp segmentation.
Findings
Improves Dice similarity coefficient by over 3%
Enhances intersection-over-union by nearly 10%
Generalizes well to unseen video data
Abstract
Polyps are early cancer indicators, so assessing occurrences of polyps and their removal is critical. They are observed through a colonoscopy screening procedure that generates a stream of video frames. Segmenting polyps in their natural video screening procedure has several challenges, such as the co-existence of imaging artefacts, motion blur, and floating debris. Most existing polyp segmentation algorithms are developed on curated still image datasets that do not represent real-world colonoscopy. Their performance often degrades on video data. We propose a video polyp segmentation method that performs self-supervised learning as an auxiliary task and a spatial-temporal self-attention mechanism for improved representation learning. Our end-to-end configuration and joint optimisation of losses enable the network to learn more discriminative contextual features in videos. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Mathematics, Computing, and Information Processing · Handwritten Text Recognition Techniques
