Predicting total time to compress a video corpus using online inference systems
Xin Shu, Vibhoothi Vibhoothi, Anil Kokaram

TL;DR
This paper introduces machine learning systems that accurately predict the total time to compress an entire video corpus, improving resource management for cloud services and VOD providers by providing real-time cost estimates.
Contribution
The work presents novel ML models for aggregate corpus compression time prediction and an online inference framework that updates predictions as files are processed.
Findings
Aggregate time prediction is over twice as accurate as per-clip methods.
Online inference reduces prediction error to less than 5%.
The approach outperforms previous generalized predictors.
Abstract
Predicting the computational cost of compressing/transcoding clips in a video corpus is important for resource management of cloud services and VOD (Video On Demand) providers. Currently, customers of cloud video services are unaware of the cost of transcoding their files until the task is completed. Previous work concentrated on predicting perclip compression time, and thus estimating the cost of video compression. In this work, we propose new Machine Learning (ML) systems which predict cost for the entire corpus instead. This is a more appropriate goal since users are not interested in per-clip cost but instead the cost for the whole corpus. In this work, we evaluate our systems with respect to two video codecs (x264, x265) and a novel high-quality video corpus. We find that the accuracy of aggregate time prediction for a video corpus more than two times better than using per-clip…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization
