MVAD: A Multiple Visual Artifact Detector for Video Streaming

Chen Feng; Duolikun Danier; Fan Zhang; Alex Mackin; Andrew Collins; David Bull

arXiv:2406.00212·eess.IV·June 3, 2025

MVAD: A Multiple Visual Artifact Detector for Video Streaming

Chen Feng, Duolikun Danier, Fan Zhang, Alex Mackin, Andrew Collins, David Bull

PDF

Open Access

TL;DR

MVAD is a novel framework capable of detecting multiple visual artifacts in streamed videos simultaneously, improving accuracy and practicality over existing single-artifact detection methods.

Contribution

Introduces a unified, artifact-aware detection framework that does not rely on quality indices, utilizing a new feature extractor and transformer-based architecture trained on a large, simulated dataset.

Findings

01

Achieves improved detection accuracy on Maxwell and BVI-Artifact datasets.

02

Effectively detects ten different visual artifacts in streaming videos.

03

Outperforms seven existing artifact detection methods.

Abstract

Visual artifacts are often introduced into streamed video content, due to prevailing conditions during content production and delivery. Since these can degrade the quality of the user's experience, it is important to automatically and accurately detect them in order to enable effective quality measurement and enhancement. Existing detection methods often focus on a single type of artifact and/or determine the presence of an artifact through thresholding objective quality indices. Such approaches have been reported to offer inconsistent prediction performance and are also impractical for real-world applications where multiple artifacts co-exist and interact. In this paper, we propose a Multiple Visual Artifact Detector, MVAD, for video streaming which, for the first time, is able to detect multiple artifacts using a single framework that is not reliant on video quality assessment models.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Video Quality Assessment · Advanced Steganography and Watermarking Techniques · Video Coding and Compression Technologies

MethodsAttention Is All You Need · Softmax · Focus · Layer Normalization · Linear Layer · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Label Smoothing · Adam · Vision Transformer