Imitation Learning for Adaptive Video Streaming with Future Adversarial Information Bottleneck Principle
Shuoyao Wang, Jiawei Lin, Fangwei Ye

TL;DR
This paper introduces a novel imitation learning approach with an adversarial information bottleneck for adaptive video streaming, significantly improving QoE by leveraging offline optimal scenarios and mitigating future information leakage.
Contribution
It combines imitation learning with an information bottleneck and adversarial framework to enhance RL-based ABR algorithms, addressing overfitting and performance fluctuations.
Findings
Achieved 7.30% average QoE improvement.
Reduced average ranking by 30.01%.
Effectively mitigated future information leakage.
Abstract
Adaptive video streaming plays a crucial role in ensuring high-quality video streaming services. Despite extensive research efforts devoted to Adaptive BitRate (ABR) techniques, the current reinforcement learning (RL)-based ABR algorithms may benefit the average Quality of Experience (QoE) but suffers from fluctuating performance in individual video sessions. In this paper, we present a novel approach that combines imitation learning with the information bottleneck technique, to learn from the complex offline optimal scenario rather than inefficient exploration. In particular, we leverage the deterministic offline bitrate optimization problem with the future throughput realization as the expert and formulate it as a mixed-integer non-linear programming (MINLP) problem. To enable large-scale training for improved performance, we propose an alternative optimization algorithm that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Advanced Image Processing Techniques · Video Coding and Compression Technologies
