Evaluation of Multi-Slice Inputs to Convolutional Neural Networks for Medical Image Segmentation
Minh H. Vu, Guus Grimbergen, Tufve Nyholm, Tommy L\"ofstedt

TL;DR
This study systematically evaluates the use of multi-slice inputs in CNNs for medical image segmentation, comparing pseudo-3D approaches to traditional 2D and 3D methods across various datasets and architectures.
Contribution
It provides a comprehensive analysis of the performance and computational efficiency of pseudo-3D CNNs, revealing limited general benefits over standard 2D or 3D CNNs.
Findings
Pseudo-3D methods are more efficient than full 3D CNNs.
Limited segmentation performance improvement over 2D CNNs.
No clear relation between mask structure and segmentation accuracy.
Abstract
When using Convolutional Neural Networks (CNNs) for segmentation of organs and lesions in medical images, the conventional approach is to work with inputs and outputs either as single slice (2D) or whole volumes (3D). One common alternative, in this study denoted as pseudo-3D, is to use a stack of adjacent slices as input and produce a prediction for at least the central slice. This approach gives the network the possibility to capture 3D spatial information, with only a minor additional computational cost. In this study, we systematically evaluate the segmentation performance and computational costs of this pseudo-3D approach as a function of the number of input slices, and compare the results to conventional end-to-end 2D and 3D CNNs. The standard pseudo-3D method regards the neighboring slices as multiple input image channels. We additionally evaluate a simple approach where the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
