qAttCNN - Self Attention Mechanism for Video QoE Prediction in Encrypted Traffic
Michael Sidorov, Ofer Hadar

TL;DR
This paper introduces qAttCNN, a neural network model that predicts video quality of experience in encrypted traffic using packet size data, enabling QoE assessment without access to media content.
Contribution
The paper presents a novel self-attention based CNN model for non-intrusive QoE prediction from encrypted video traffic, addressing limitations of existing methods.
Findings
Achieves 2.14% MAEP for BRISQUE prediction
Achieves 7.39% MAEP for FPS prediction
Outperforms existing QoE models on WhatsApp video call data
Abstract
The rapid growth of multimedia consumption, driven by major advances in mobile devices since the mid-2000s, has led to widespread use of video conferencing applications (VCAs) such as Zoom and Google Meet, as well as instant messaging applications (IMAs) like WhatsApp and Telegram, which increasingly support video conferencing as a core feature. Many of these systems rely on the Web Real-Time Communication (WebRTC) protocol, enabling direct peer-to-peer media streaming without requiring a third-party server to relay data, reducing the latency and facilitating a real-time communication. Despite WebRTC's potential, adverse network conditions can degrade streaming quality and consequently reduce users' Quality of Experience (QoE). Maintaining high QoE therefore requires continuous monitoring and timely intervention when QoE begins to deteriorate. While content providers can often estimate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Video Coding and Compression Technologies · Internet Traffic Analysis and Secure E-voting
