Optimizing Neural Network for Computer Vision task in Edge Device
Ranjith M S, S Parameshwara, Pavan Yadav A, Shriganesh Hegde

TL;DR
This paper presents a method to optimize neural networks for computer vision on edge devices by reducing parameter precision, decreasing memory usage and computational load while maintaining performance, enabling real-time local predictions.
Contribution
It introduces a precision reduction technique to adapt neural networks for edge deployment, balancing efficiency and accuracy.
Findings
Memory usage decreased significantly
Inference speed increased on edge hardware
Model performance remained largely unaffected
Abstract
The field of computer vision has grown very rapidly in the past few years due to networks like convolution neural networks and their variants. The memory required to store the model and computational expense are very high for such a network limiting it to deploy on the edge device. Many times, applications rely on the cloud but that makes it hard for working in real-time due to round-trip delays. We overcome these problems by deploying the neural network on the edge device itself. The computational expense for edge devices is reduced by reducing the floating-point precision of the parameters in the model. After this the memory required for the model decreases and the speed of the computation increases where the performance of the model is least affected. This makes an edge device to predict from the neural network all by itself.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrared Target Detection Methodologies · Advanced Neural Network Applications · Industrial Vision Systems and Defect Detection
MethodsConvolution
