Data-Model-Circuit Tri-Design for Ultra-Light Video Intelligence on Edge   Devices

Yimeng Zhang; Akshay Karkal Kamath; Qiucheng Wu; Zhiwen Fan; Wuyang; Chen; Zhangyang Wang; Shiyu Chang; Sijia Liu; Cong Hao

arXiv:2210.08578·cs.CV·October 19, 2022

Data-Model-Circuit Tri-Design for Ultra-Light Video Intelligence on Edge Devices

Yimeng Zhang, Akshay Karkal Kamath, Qiucheng Wu, Zhiwen Fan, Wuyang, Chen, Zhangyang Wang, Shiyu Chang, Sijia Liu, Cong Hao

PDF

TL;DR

This paper introduces a comprehensive tri-design framework combining data reduction, model compression, and hardware acceleration to enable ultra-light, real-time multi-object tracking on edge devices with significant improvements in latency, power, and efficiency.

Contribution

It presents a novel integrated approach for software and hardware co-optimization tailored for high-performance, low-power multi-object tracking on edge devices.

Findings

01

12.5x latency reduction compared to baseline

02

20.9x effective frame rate improvement

03

5.83x lower power consumption

Abstract

In this paper, we propose a data-model-hardware tri-design framework for high-throughput, low-cost, and high-accuracy multi-object tracking (MOT) on High-Definition (HD) video stream. First, to enable ultra-light video intelligence, we propose temporal frame-filtering and spatial saliency-focusing approaches to reduce the complexity of massive video data. Second, we exploit structure-aware weight sparsity to design a hardware-friendly model compression method. Third, assisted with data and model complexity reduction, we propose a sparsity-aware, scalable, and low-power accelerator design, aiming to deliver real-time performance with high energy efficiency. Different from existing works, we make a solid step towards the synergized software/hardware co-optimization for realistic MOT model implementation. Compared to the state-of-the-art MOT baseline, our tri-design approach can achieve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.