Surgical Scene Segmentation by Transformer With Asymmetric Feature Enhancement
Cheng Yuan, Yutong Ban

TL;DR
This paper introduces a Transformer-based framework with asymmetric feature enhancement for surgical scene segmentation, effectively improving local information fusion and fine-grained structure recognition in complex surgical images.
Contribution
It proposes a novel Transformer framework with an Asymmetric Feature Enhancement module that addresses key challenges in surgical scene segmentation.
Findings
Outperforms state-of-the-art methods in multiple surgical segmentation tasks
Enhances local information and multi-scale feature fusion
Proves effectiveness in fine-grained structure recognition
Abstract
Surgical scene segmentation is a fundamental task for robotic-assisted laparoscopic surgery understanding. It often contains various anatomical structures and surgical instruments, where similar local textures and fine-grained structures make the segmentation a difficult task. Vision-specific transformer method is a promising way for surgical scene understanding. However, there are still two main challenges. Firstly, the absence of inner-patch information fusion leads to poor segmentation performance. Secondly, the specific characteristics of anatomy and instruments are not specifically modeled. To tackle the above challenges, we propose a novel Transformer-based framework with an Asymmetric Feature Enhancement module (TAFE), which enhances local information and then actively fuses the improved feature pyramid into the embeddings from transformer encoders by a multi-scale interaction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Medical Imaging and Analysis · Radiomics and Machine Learning in Medical Imaging
MethodsSoftmax · Attention Is All You Need
