Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection
Sung Jin Um, Dongjin Kim, Sangmin Lee, Jung Uk Kim

TL;DR
This paper introduces a novel context-aware attention mechanism for video moment retrieval and highlight detection, effectively capturing overall video context and keyword relevance to improve alignment between visual and textual data.
Contribution
The paper proposes a Video Context-aware Keyword Attention module with context clustering and keyword contrastive learning, advancing the understanding of keyword dynamics in video analysis.
Findings
Significant performance improvements on QVHighlights, TVSum, and Charades-STA benchmarks.
Enhanced understanding of keyword relevance within overall video context.
Effective fine-grained alignment between visual and textual features.
Abstract
The goal of video moment retrieval and highlight detection is to identify specific segments and highlights based on a given text query. With the rapid growth of video content and the overlap between these tasks, recent works have addressed both simultaneously. However, they still struggle to fully capture the overall video context, making it challenging to determine which words are most relevant. In this paper, we present a novel Video Context-aware Keyword Attention module that overcomes this limitation by capturing keyword variation within the context of the entire video. To achieve this, we introduce a video context clustering module that provides concise representations of the overall video context, thereby enhancing the understanding of keyword dynamics. Furthermore, we propose a keyword weight detection module with keyword-aware contrastive learning that incorporates keyword…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Text and Document Classification Technologies · Web Data Mining and Analysis
MethodsSoftmax · Attention Is All You Need · Contrastive Learning
