GnetSeg: Semantic Segmentation Model Optimized on a 224mW CNN Accelerator Chip at the Speed of 318FPS
Baohua Sun, Weixiong Lin, Hao Sha, Jiapeng Su

TL;DR
GnetSeg is a highly efficient semantic segmentation model optimized for low-power CNN accelerator chips, achieving 318FPS at 224mW with minimal CPU load and novel integer encoding for faster data transfer.
Contribution
The paper introduces GnetSeg, a semantic segmentation model optimized for CNN accelerators, reducing CPU load and employing integer encoding to enhance inference speed and efficiency.
Findings
Achieves 318FPS on a 224mW CNN chip
Maintains high accuracy in person segmentation
Reduces CPU load to zero during inference
Abstract
Semantic segmentation is the task to cluster pixels on an image belonging to the same class. It is widely used in the real-world applications including autonomous driving, medical imaging analysis, industrial inspection, smartphone camera for person segmentation and so on. Accelerating the semantic segmentation models on the mobile and edge devices are practical needs for the industry. Recent years have witnessed the wide availability of CNN (Convolutional Neural Networks) accelerators. They have the advantages on power efficiency, inference speed, which are ideal for accelerating the semantic segmentation models on the edge devices. However, the CNN accelerator chips also have the limitations on flexibility and memory. In addition, the CPU load is very critical because the CNN accelerator chip works as a co-processor with a host CPU. In this paper, we optimize the semantic segmentation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Brain Tumor Detection and Classification
