Modeling Beyond MOS: Quality Assessment Models Must Integrate Context, Reasoning, and Multimodality
Mohamed Amine Kerkouri, Marouane Tliba, Aladine Chetouani, Nour Aburaed, Alessandro Bruno

TL;DR
This paper advocates for a paradigm shift in multimedia quality assessment from traditional MOS scores to models that incorporate context, reasoning, and multimodal understanding to better reflect human judgments and improve trustworthiness.
Contribution
It introduces a framework emphasizing context-awareness, interpretability, and multimodality in quality assessment models, and proposes a roadmap for developing richer datasets and evaluation metrics.
Findings
Current MOS-based benchmarks are limited in capturing semantic and contextual factors.
Proposes integrating vision-language models for multimodal quality assessment.
Recommends datasets with contextual metadata and expert rationales for better evaluation.
Abstract
This position paper argues that Mean Opinion Score (MOS), while historically foundational, is no longer sufficient as the sole supervisory signal for multimedia quality assessment models. MOS reduces rich, context-sensitive human judgments to a single scalar, obscuring semantic failures, user intent, and the rationale behind quality decisions. We contend that modern quality assessment models must integrate three interdependent capabilities: (1) context-awareness, to adapt evaluations to task-specific goals and viewing conditions; (2) reasoning, to produce interpretable, evidence-grounded justifications for quality judgments; and (3) multimodality, to align perceptual and semantic cues using vision-language models. We critique the limitations of current MOS-centric benchmarks and propose a roadmap for reform: richer datasets with contextual metadata and expert rationales, and new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Software Engineering Techniques and Practices
MethodsALIGN
