MANSY: Generalizing Neural Adaptive Immersive Video Streaming With Ensemble and Representation Learning
Duo Wu, Panlong Wu, Miao Zhang, Fangxin Wang

TL;DR
MANSY is a novel neural streaming system that uses ensemble and representation learning to improve viewport prediction and QoE across diverse user preferences in immersive video streaming.
Contribution
It introduces a Transformer-based viewport prediction model with implicit ensemble learning and combines representation learning with deep reinforcement learning for adaptive bitrate selection.
Findings
Outperforms state-of-the-art approaches in viewport prediction accuracy
Achieves better QoE across trained and unseen user preferences
Demonstrates improved generalization in diverse viewing scenarios
Abstract
The popularity of immersive videos has prompted extensive research into neural adaptive tile-based streaming to optimize video transmission over networks with limited bandwidth. However, the diversity of users' viewing patterns and Quality of Experience (QoE) preferences has not yet been fully addressed by existing neural adaptive approaches for viewport prediction and bitrate selection. Their performance can deteriorate significantly when users' actual viewing patterns and QoE preferences differ considerably from those observed during the training phase, resulting in poor generalization. In this paper, we propose MANSY, a novel streaming system that embraces user diversity to improve generalization. Specifically, to accommodate users' diverse viewing patterns, we design a Transformer-based viewport prediction model with an efficient multi-viewport trajectory input-output architecture…
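To make the notion of differing QoE preferences concrete, here is a minimal sketch that assumes QoE is a weighted sum of viewport quality, rebuffering time, and quality variation. That form is common in adaptive streaming work, but the summary above does not state MANSY's QoE definition, so the `ChunkStats` fields, the `qoe` function, and all weights below are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ChunkStats:
    """Hypothetical per-chunk measurements (not taken from the paper)."""
    viewport_quality: float   # average quality of tiles inside the actual viewport
    rebuffer_seconds: float   # stall time incurred while fetching this chunk
    quality_change: float     # absolute quality change from the previous chunk


def qoe(chunk: ChunkStats, weights: tuple[float, float, float]) -> float:
    """Assumed linear QoE: reward quality, penalize stalls and quality swings."""
    w_quality, w_rebuffer, w_smooth = weights
    return (w_quality * chunk.viewport_quality
            - w_rebuffer * chunk.rebuffer_seconds
            - w_smooth * chunk.quality_change)


# Two candidate outcomes for the same chunk.
aggressive = ChunkStats(viewport_quality=4.0, rebuffer_seconds=0.5, quality_change=1.0)
conservative = ChunkStats(viewport_quality=2.5, rebuffer_seconds=0.0, quality_change=0.0)

# Two illustrative user preferences (weights are made up for this example).
quality_focused = (1.0, 0.5, 0.5)
stall_averse = (1.0, 4.0, 1.0)

for name, prefs in [("quality_focused", quality_focused), ("stall_averse", stall_averse)]:
    best = max([aggressive, conservative], key=lambda c: qoe(c, prefs))
    print(name, "prefers", "aggressive" if best is aggressive else "conservative")
```

Under these assumed weights the two users rank the same pair of outcomes in opposite order, which illustrates why a bitrate policy trained for one preference may serve another poorly.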
Taxonomy
Topics: Image and Video Quality Assessment · Video Coding and Compression Technologies · Advanced Image Processing Techniques
