Data-Model-Circuit Tri-Design for Ultra-Light Video Intelligence on Edge Devices
Yimeng Zhang, Akshay Karkal Kamath, Qiucheng Wu, Zhiwen Fan, Wuyang, Chen, Zhangyang Wang, Shiyu Chang, Sijia Liu, Cong Hao

TL;DR
This paper introduces a comprehensive tri-design framework combining data reduction, model compression, and hardware acceleration to enable ultra-light, real-time multi-object tracking on edge devices with significant improvements in latency, power, and efficiency.
Contribution
It presents a novel integrated approach for software and hardware co-optimization tailored for high-performance, low-power multi-object tracking on edge devices.
Findings
12.5x latency reduction compared to baseline
20.9x effective frame rate improvement
5.83x lower power consumption
Abstract
In this paper, we propose a data-model-hardware tri-design framework for high-throughput, low-cost, and high-accuracy multi-object tracking (MOT) on High-Definition (HD) video stream. First, to enable ultra-light video intelligence, we propose temporal frame-filtering and spatial saliency-focusing approaches to reduce the complexity of massive video data. Second, we exploit structure-aware weight sparsity to design a hardware-friendly model compression method. Third, assisted with data and model complexity reduction, we propose a sparsity-aware, scalable, and low-power accelerator design, aiming to deliver real-time performance with high energy efficiency. Different from existing works, we make a solid step towards the synergized software/hardware co-optimization for realistic MOT model implementation. Compared to the state-of-the-art MOT baseline, our tri-design approach can achieve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
