Robust error bounds for quantised and pruned neural networks
Jiaqi Li, Ross Drummond, Stephen R. Duncan

TL;DR
This paper introduces a semi-definite programming approach to establish robust worst-case error bounds for neural networks that have been pruned or quantised, enhancing reliability for safety-critical edge applications.
Contribution
It develops a general method to compute robust error bounds for pruned and quantised neural networks applicable to various structures and activation functions.
Findings
Provides a semi-definite program for error bounding
Bounds hold for all inputs in specified sets
Applicable to many neural network architectures
Abstract
With the rise of smartphones and the internet-of-things, data is increasingly getting generated at the edge on local, personal devices. For privacy, latency and energy saving reasons, this shift is causing machine learning algorithms to move towards decentralisation with the data and algorithms stored, and even trained, locally on devices. The device hardware becomes the main bottleneck for model capability in this set-up, creating a need for slimmed down, more efficient neural networks. Neural network pruning and quantisation are two methods that have been developed for this, with both approaches demonstrating impressive results in reducing the computational cost without sacrificing significantly on model performance. However, the understanding behind these reduction methods remains underdeveloped. To address this issue, a semi-definite program is introduced to bound the worst-case…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques
MethodsPruning
