Task-Oriented Real-time Visual Inference for IoVT Systems: A Co-design   Framework of Neural Networks and Edge Deployment

Jiaqi Wu; Simin Chen; Zehua Wang; Wei Chen; Zijian Tian; F. Richard; Yu; Victor C. M. Leung

arXiv:2411.00838·cs.CV·November 5, 2024

Task-Oriented Real-time Visual Inference for IoVT Systems: A Co-design Framework of Neural Networks and Edge Deployment

Jiaqi Wu, Simin Chen, Zehua Wang, Wei Chen, Zijian Tian, F. Richard, Yu, Victor C. M. Leung

PDF

Open Access

TL;DR

This paper introduces a co-design framework for neural networks and deployment strategies that enhances real-time visual inference in IoVT systems by balancing throughput and accuracy on heterogeneous edge devices.

Contribution

It proposes a novel dynamic model structure and partitioning strategy, along with a multi-objective co-optimization approach, to improve inference performance on resource-constrained edge devices.

Findings

01

Achieved 12.05% and 18.83% throughput improvements on MNIST and ImageNet.

02

Demonstrated superior classification accuracy over baseline algorithms.

03

Ensured stable performance across different edge devices.

Abstract

As the volume of image data grows, data-oriented cloud computing in Internet of Video Things (IoVT) systems encounters latency issues. Task-oriented edge computing addresses this by shifting data analysis to the edge. However, limited computational power of edge devices poses challenges for executing visual tasks. Existing methods struggle to balance high model performance with low resource consumption; lightweight neural networks often underperform, while device-specific models designed by Neural Architecture Search (NAS) fail to adapt to heterogeneous devices. For these issues, we propose a novel co-design framework to optimize neural network architecture and deployment strategies during inference for high-throughput. Specifically, it implements a dynamic model structure based on re-parameterization, coupled with a Roofline-based model partitioning strategy to enhance the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection