Perspective+ Unet: Enhancing Segmentation with Bi-Path Fusion and Efficient Non-Local Attention for Superior Receptive Fields
Jintong Hu, Siyan Chen, Zhiyi Pan, Sen Zeng, Wenming Yang

TL;DR
Perspective+ Unet is a novel medical image segmentation architecture that combines dual-path encoding, efficient non-local transformers, and cross-scale feature integration to improve global context understanding while maintaining computational efficiency.
Contribution
It introduces a dual-path encoder, an efficient non-local transformer block, and a cross-scale feature integration strategy, advancing segmentation performance with better global and local feature fusion.
Findings
Improved segmentation accuracy on ACDC and Synapse datasets
Effective long-range dependency modeling with linear complexity
Enhanced global-local feature integration
Abstract
Precise segmentation of medical images is fundamental for extracting critical clinical information, which plays a pivotal role in enhancing the accuracy of diagnoses, formulating effective treatment plans, and improving patient outcomes. Although Convolutional Neural Networks (CNNs) and non-local attention methods have achieved notable success in medical image segmentation, they either struggle to capture long-range spatial dependencies due to their reliance on local features, or face significant computational and feature integration challenges when attempting to address this issue with global attention mechanisms. To overcome existing limitations in medical image segmentation, we propose a novel architecture, Perspective+ Unet. This framework is characterized by three major innovations: (i) It introduces a dual-pathway strategy at the encoder stage that combines the outcomes of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications
MethodsSoftmax · Attention Is All You Need
