Show Me What I Like: Detecting User-Specific Video Highlights Using Content-Based Multi-Head Attention
Uttaran Bhattacharya, Gang Wu, Stefano Petrangeli and, Viswanathan Swaminathan, Dinesh Manocha

TL;DR
This paper introduces a content-based multi-head attention approach to detect personalized video highlights by leveraging user preferences and pre-trained object and activity features, improving highlight detection accuracy.
Contribution
The paper presents a novel multi-head attention mechanism that adaptively weights user-preferred clips based on content, enabling personalized highlight detection from videos.
Findings
Achieved a 2-4% improvement in mean average precision over baselines.
Validated the effectiveness of content-based, user-specific highlight detection.
Performed extensive ablation studies confirming the importance of content features.
Abstract
We propose a method to detect individualized highlights for users on given target videos based on their preferred highlight clips marked on previous videos they have watched. Our method explicitly leverages the contents of both the preferred clips and the target videos using pre-trained features for the objects and the human activities. We design a multi-head attention mechanism to adaptively weigh the preferred clips based on their object- and human-activity-based contents, and fuse them using these weights into a single feature representation for each user. We compute similarities between these per-user feature representations and the per-frame features computed from the desired target videos to estimate the user-specific highlight clips from the target videos. We test our method on a large-scale highlight detection dataset containing the annotated highlights of individual users.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTest · Softmax · Linear Layer
