ZNNi - Maximizing the Inference Throughput of 3D Convolutional Networks on Multi-Core CPUs and GPUs
Aleksandar Zlateski, Kisuk Lee, H. Sebastian Seung

TL;DR
This paper introduces novel CPU and GPU primitives, including FFT-based convolution, to maximize inference throughput of 3D ConvNets on multi-core CPUs and GPUs, outperforming existing methods.
Contribution
The paper presents new memory-efficient primitives for convolutional and pooling layers, achieving significantly higher throughput for 3D ConvNet inference on CPUs and GPUs.
Findings
GPU FFT primitives outperform cuDNN in some architectures
CPU primitives achieve higher throughput due to larger RAM access
CPU-GPU hybrid approach yields over 10x throughput increase
Abstract
Sliding window convolutional networks (ConvNets) have become a popular approach to computer vision problems such as image segmentation, and object detection and localization. Here we consider the problem of inference, the application of a previously trained ConvNet, with emphasis on 3D images. Our goal is to maximize throughput, defined as average number of output voxels computed per unit time. Other things being equal, processing a larger image tends to increase throughput, because fractionally less computation is wasted on the borders of the image. It follows that an apparently slower algorithm may end up having higher throughput if it can process a larger image within the constraint of the available RAM. We introduce novel CPU and GPU primitives for convolutional and pooling layers, which are designed to minimize memory overhead. The primitives include convolution based on highly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Medical Image Segmentation Techniques
MethodsConvolution
