MDS-VQA: Model-Informed Data Selection for Video Quality Assessment

Jian Zou; Xiaoyu Xu; Zhihua Wang; Yilin Wang; Balu Adsumilli; Kede Ma

arXiv:2603.11525·cs.CV·March 13, 2026

MDS-VQA: Model-Informed Data Selection for Video Quality Assessment

Jian Zou, Xiaoyu Xu, Zhihua Wang, Yilin Wang, Balu Adsumilli, Kede Ma

PDF

Open Access

TL;DR

This paper introduces MDS-VQA, a data selection method that identifies challenging and diverse videos to improve video quality assessment models through targeted fine-tuning, enhancing their accuracy and generalization.

Contribution

MDS-VQA is a novel model-informed data selection approach that combines difficulty estimation and diversity measurement to optimize unlabeled data curation for VQA model improvement.

Findings

01

Selected 5% of data improves SRCC from 0.651 to 0.722

02

Method outperforms baseline data selection strategies

03

Achieves top gMAD rank indicating strong generalization

Abstract

Learning-based video quality assessment (VQA) has advanced rapidly, yet progress is increasingly constrained by a disconnect between model design and dataset curation. Model-centric approaches often iterate on fixed benchmarks, while data-centric efforts collect new human labels without systematically targeting the weaknesses of existing VQA models. Here, we describe MDS-VQA, a model-informed data selection mechanism for curating unlabeled videos that are both difficult for the base VQA model and diverse in content. Difficulty is estimated by a failure predictor trained with a ranking objective, and diversity is measured using deep semantic video features, with a greedy procedure balancing the two under a constrained labeling budget. Experiments across multiple VQA datasets and models demonstrate that MDS-VQA identifies diverse, challenging samples that are particularly informative for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Video Quality Assessment · Visual Attention and Saliency Detection · Video Analysis and Summarization