User-based key frame detection in social web video
Konstantinos Chorianopoulos

TL;DR
This paper introduces a user-interaction-based method for automatically detecting key frames in web videos by analyzing aggregate replay data, enabling dynamic, user-specific video thumbnails for search results.
Contribution
It presents a novel approach that leverages aggregated user replay interactions to identify interesting key frames, differing from traditional content-based methods.
Findings
Local maxima of replay activity indicate semantic-rich video segments
User interest functions can effectively identify key frames
Method enables dynamic, user-based video thumbnails
Abstract
Video search results and suggested videos on web sites are represented with a video thumbnail, which is manually selected by the video up-loader among three randomly generated ones (e.g., YouTube). In contrast, we present a grounded user-based approach for automatically detecting interesting key-frames within a video through aggregated users' replay interactions with the video player. Previous research has focused on content-based systems that have the benefit of analyzing a video without user interactions, but they are monolithic, because the resulting video thumbnails are the same regardless of the user preferences. We constructed a user interest function, which is based on aggregate video replays, and analyzed hundreds of user interactions. We found that the local maximum of the replaying activity stands for the semantics of information rich videos, such as lecture, and how-to. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Advanced Image and Video Retrieval Techniques · Multimedia Communication and Technology
