Virtuoso: Video-based Intelligence for real-time tuning on SOCs
Jayoung Lee, PengCheng Wang, Ran Xu, Venkat Dasari, Noah Weston, Yin, Li, Saurabh Bagchi, and Somali Chaterji

TL;DR
Virtuoso is a video-based, real-time tuning system for SOCs that optimizes accuracy, energy, and latency in object detection by dynamically selecting execution modes, outperforming existing protocols on mobile GPUs.
Contribution
It introduces a multi-branch execution kernel and a lightweight runtime scheduler to adaptively optimize object detection for accuracy, energy, and latency on embedded devices.
Findings
Virtuoso outperforms 15 state-of-the-art protocols in accuracy and efficiency.
Achieves 63.9% accuracy, surpassing FRCNN and YOLO by over 10%.
Demonstrates superior performance on NVIDIA Jetson mobile GPUs.
Abstract
Efficient and adaptive computer vision systems have been proposed to make computer vision tasks, such as image classification and object detection, optimized for embedded or mobile devices. These solutions, quite recent in their origin, focus on optimizing the model (a deep neural network, DNN) or the system by designing an adaptive system with approximation knobs. In spite of several recent efforts, we show that existing solutions suffer from two major drawbacks. First, the system does not consider energy consumption of the models while making a decision on which model to run. Second, the evaluation does not consider the practical scenario of contention on the device, due to other co-resident workloads. In this work, we propose an efficient and adaptive video object detection system, Virtuoso, which is jointly optimized for accuracy, energy efficiency, and latency. Underlying Virtuoso…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · CCD and CMOS Imaging Sensors · Advanced Image and Video Retrieval Techniques
MethodsYou Only Look Once · Depthwise Convolution · Batch Normalization · Pointwise Convolution · Depthwise Separable Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · BiFPN · Non Maximum Suppression · Convolution · 1x1 Convolution
