Computer Vision Model Compression Techniques for Embedded Systems: A   Survey

Alexandre Lopes; Fernando Pereira dos Santos; Diulhio de Oliveira,; Mauricio Schiezaro; Helio Pedrini

arXiv:2408.08250·cs.CV·August 16, 2024

Computer Vision Model Compression Techniques for Embedded Systems: A Survey

Alexandre Lopes, Fernando Pereira dos Santos, Diulhio de Oliveira,, Mauricio Schiezaro, Helio Pedrini

PDF

1 Repo

TL;DR

This survey reviews various model compression techniques for computer vision, focusing on enabling deployment of large neural networks on resource-constrained embedded systems, and provides practical guidance and case studies.

Contribution

It systematically compares compression approaches, discusses selection criteria, and shares implementation resources for deploying vision models on embedded devices.

Findings

01

Different compression techniques have unique trade-offs and suitability.

02

Guidelines help choose optimal compression methods for specific embedded hardware.

03

Case studies demonstrate practical application and effectiveness.

Abstract

Deep neural networks have consistently represented the state of the art in most computer vision problems. In these scenarios, larger and more complex models have demonstrated superior performance to smaller architectures, especially when trained with plenty of representative data. With the recent adoption of Vision Transformer (ViT) based architectures and advanced Convolutional Neural Networks (CNNs), the total number of parameters of leading backbone architectures increased from 62M parameters in 2012 with AlexNet to 7B parameters in 2024 with AIM-7B. Consequently, deploying such deep architectures faces challenges in environments with processing and runtime constraints, particularly in embedded systems. This paper covers the main model compression techniques applied for computer vision tasks, enabling modern models to be used in embedded systems. We present the characteristics of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

venturusbr/cv-model-compression
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Layer · Layer Normalization · Multi-Head Attention · Attention Is All You Need · Position-Wise Feed-Forward Layer · Adam · Byte Pair Encoding · Softmax · Absolute Position Encodings · Vision Transformer