A Reinforcement-Learning-Based Energy-Efficient Framework for Multi-Task Video Analytics Pipeline
Yingying Zhao, Mingzhi Dong, Yujiang Wang, Da Feng, Qin Lv, Robert P., Dick, Dongsheng Li, Tun Lu, Ning Gu, Li Shang

TL;DR
This paper presents a reinforcement learning-based framework that adaptively adjusts video resolution to optimize energy efficiency in multi-task video analytics pipelines without sacrificing accuracy.
Contribution
It introduces a novel deep reinforcement learning approach to dynamically control input resolution, reducing energy consumption in video analytics pipelines.
Findings
Achieves better energy efficiency than baseline methods at similar accuracy.
Incorporates optical flow to reduce unnecessary re-computation.
Effectively applies to complex tasks like video instance segmentation.
Abstract
Deep-learning-based video processing has yielded transformative results in recent years. However, the video analytics pipeline is energy-intensive due to high data rates and reliance on complex inference algorithms, which limits its adoption in energy-constrained applications. Motivated by the observation of high and variable spatial redundancy and temporal dynamics in video data streams, we design and evaluate an adaptive-resolution optimization framework to minimize the energy use of multi-task video analytics pipelines. Instead of heuristically tuning the input data resolution of individual tasks, our framework utilizes deep reinforcement learning to dynamically govern the input resolution and computation of the entire video analytics pipeline. By monitoring the impact of varying resolution on the quality of high-dimensional video analytics features, hence the accuracy of video…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
