MVAD: A Multiple Visual Artifact Detector for Video Streaming
Chen Feng, Duolikun Danier, Fan Zhang, Alex Mackin, Andrew Collins, David Bull

TL;DR
MVAD is a novel framework capable of detecting multiple visual artifacts in streamed videos simultaneously, improving accuracy and practicality over existing single-artifact detection methods.
Contribution
Introduces a unified, artifact-aware detection framework that does not rely on quality indices, utilizing a new feature extractor and transformer-based architecture trained on a large, simulated dataset.
Findings
Achieves improved detection accuracy on Maxwell and BVI-Artifact datasets.
Effectively detects ten different visual artifacts in streaming videos.
Outperforms seven existing artifact detection methods.
Abstract
Visual artifacts are often introduced into streamed video content, due to prevailing conditions during content production and delivery. Since these can degrade the quality of the user's experience, it is important to automatically and accurately detect them in order to enable effective quality measurement and enhancement. Existing detection methods often focus on a single type of artifact and/or determine the presence of an artifact through thresholding objective quality indices. Such approaches have been reported to offer inconsistent prediction performance and are also impractical for real-world applications where multiple artifacts co-exist and interact. In this paper, we propose a Multiple Visual Artifact Detector, MVAD, for video streaming which, for the first time, is able to detect multiple artifacts using a single framework that is not reliant on video quality assessment models.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Advanced Steganography and Watermarking Techniques · Video Coding and Compression Technologies
MethodsAttention Is All You Need · Softmax · Focus · Layer Normalization · Linear Layer · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Label Smoothing · Adam · Vision Transformer
