TL;DR
StableVQA is a new deep learning model designed to accurately assess video stability by analyzing optical flow, semantic, and blur features, supported by a large-scale database of shaky videos with subjective scores.
Contribution
The paper introduces StableVQA, a novel no-reference video stability assessment model and a comprehensive database, addressing the lack of specialized metrics and data for video shakiness evaluation.
Findings
StableVQA outperforms existing models in correlation with subjective scores.
The StableDB database contains 1,952 shaky videos with MOS ratings.
The model effectively captures different aspects of video stability.
Abstract
Video shakiness is an unpleasant distortion of User Generated Content (UGC) videos, which is usually caused by the unstable hold of cameras. In recent years, many video stabilization algorithms have been proposed, yet no specific and accurate metric enables comprehensively evaluating the stability of videos. Indeed, most existing quality assessment models evaluate video quality as a whole without specifically taking the subjective experience of video stability into consideration. Therefore, these models cannot measure the video stability explicitly and precisely when severe shakes are present. In addition, there is no large-scale video database in public that includes various degrees of shaky videos with the corresponding subjective scores available, which hinders the development of Video Quality Assessment for Stability (VQA-S). To this end, we build a new database named StableDB that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
