TL;DR
This paper introduces TernaryNet, a method that approximates neural network weights and activations with ternary values, enabling fast, GPU-free inference for 3D medical image segmentation with high accuracy and reduced memory usage.
Contribution
The paper presents a novel ternary approximation scheme for neural networks that allows efficient, GPU-free inference in medical imaging applications, maintaining high accuracy.
Findings
Achieves over 10-fold memory reduction.
Enables sub-second inference without GPUs.
Maintains high segmentation accuracy with Dice score of 71.0%.
Abstract
Deep convolutional neural networks (DCNN) are currently ubiquitous in medical imaging. While their versatility and high quality results for common image analysis tasks including segmentation, localisation and prediction is astonishing, the large representational power comes at the cost of highly demanding computational effort. This limits their practical applications for image guided interventions and diagnostic (point-of-care) support using mobile devices without graphics processing units (GPU). We propose a new scheme that approximates both trainable weights and neural activations in deep networks by ternary values and tackles the open question of backpropagation when dealing with non-differentiable functions. Our solution enables the removal of the expensive floating-point matrix multiplications throughout any convolutional neural network and replaces them by energy and time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
