GnetSeg: Semantic Segmentation Model Optimized on a 224mW CNN   Accelerator Chip at the Speed of 318FPS

Baohua Sun; Weixiong Lin; Hao Sha; Jiapeng Su

arXiv:2101.10444·cs.CV·January 27, 2021

GnetSeg: Semantic Segmentation Model Optimized on a 224mW CNN Accelerator Chip at the Speed of 318FPS

Baohua Sun, Weixiong Lin, Hao Sha, Jiapeng Su

PDF

Open Access

TL;DR

GnetSeg is a highly efficient semantic segmentation model optimized for low-power CNN accelerator chips, achieving 318FPS at 224mW with minimal CPU load and novel integer encoding for faster data transfer.

Contribution

The paper introduces GnetSeg, a semantic segmentation model optimized for CNN accelerators, reducing CPU load and employing integer encoding to enhance inference speed and efficiency.

Findings

01

Achieves 318FPS on a 224mW CNN chip

02

Maintains high accuracy in person segmentation

03

Reduces CPU load to zero during inference

Abstract

Semantic segmentation is the task to cluster pixels on an image belonging to the same class. It is widely used in the real-world applications including autonomous driving, medical imaging analysis, industrial inspection, smartphone camera for person segmentation and so on. Accelerating the semantic segmentation models on the mobile and edge devices are practical needs for the industry. Recent years have witnessed the wide availability of CNN (Convolutional Neural Networks) accelerators. They have the advantages on power efficiency, inference speed, which are ideal for accelerating the semantic segmentation models on the edge devices. However, the CNN accelerator chips also have the limitations on flexibility and memory. In addition, the CPU load is very critical because the CNN accelerator chip works as a co-processor with a host CPU. In this paper, we optimize the semantic segmentation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Brain Tumor Detection and Classification