Subjective Portrait Region Cropping in Landscape Videos with Temporal Annotation Smoothing
Cheng-Han Lee, Maniratnam Mandal, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

TL;DR
This paper introduces a large-scale annotated database for subjective portrait region cropping in videos, along with smoothing techniques and baseline models, to improve mobile video aspect ratio transformations.
Contribution
It provides the largest publicly available dataset with temporal annotation smoothing and demonstrates its utility with baseline algorithms for video cropping tasks.
Findings
The dataset contains 1800 videos with human annotations.
A novel intra-frame temporal filter effectively smooths annotations.
Fine-tuned models on this dataset improve aspect ratio transformation quality.
Abstract
With the rise of mobile video consumption on diverse handheld display resolutions and orientation modes, altering videos to aspect ratios poses challenges. Static cropping and border padding often compromises visual quality, while warping may distort a video's intended meaning. Here we advocate for a more effective approach: cropping significant regions within video frames in a temporal manner, while minimizing distortion and preserving essential content. One barrier to solving this problem is the lack of sufficiently large-scale database devoted to informing these tasks. Towards filling this gap, we introduce the LIVE-YouTube Video Cropping (LIVE-YT VC) database, featuring 1800 videos, annotated by 90 human subjects. Using videos sourced from the YouTube-UGC and LSVQ Databases, this new resource is the largest publicly-available subjective video portrait region cropping database. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
