Boosting Convolution with Efficient MLP-Permutation for Volumetric Medical Image Segmentation
Yi Lin, Xiao Fang, Dong Zhang, Kwang-Ting Cheng, Hao Chen

TL;DR
This paper introduces PHNet, a hybrid CNN-MLP network that enhances 3D medical image segmentation by efficiently capturing local and long-range features, outperforming state-of-the-art methods with lower computational costs.
Contribution
The paper proposes a novel hybrid network with an efficient permutable MLP module that addresses isotropy and resolution issues in volumetric segmentation.
Findings
PHNet outperforms state-of-the-art methods on COVID-19-20 and Synapse benchmarks.
The MLPP module effectively captures long-range dependencies and preserves positional information.
Ablation studies confirm the benefits of combining CNNs with the proposed MLP permutation.
Abstract
Recently, the advent of vision Transformer (ViT) has brought substantial advancements in 3D dataset benchmarks, particularly in 3D volumetric medical image segmentation (Vol-MedSeg). Concurrently, multi-layer perceptron (MLP) network has regained popularity among researchers due to their comparable results to ViT, albeit with the exclusion of the resource-intensive self-attention module. In this work, we propose a novel permutable hybrid network for Vol-MedSeg, named PHNet, which capitalizes on the strengths of both convolution neural networks (CNNs) and MLP. PHNet addresses the intrinsic isotropy problem of 3D volumetric data by employing a combination of 2D and 3D CNNs to extract local features. Besides, we propose an efficient multi-layer permute perceptron (MLPP) module that captures long-range dependence while preserving positional information. This is achieved through an axis…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Imaging and Analysis · Medical Image Segmentation Techniques
MethodsAttention Is All You Need · Linear Layer · Label Smoothing · Absolute Position Encodings · Residual Connection · Byte Pair Encoding · Convolution · Dropout · Layer Normalization · Multi-Head Attention
