Prompting Large Language Models to Reformulate Queries for Moment Localization
Wenfeng Yan, Shaoxiang Chen, Zuxuan Wu, Yu-Gang Jiang

TL;DR
This paper explores using large language models to reformulate natural language queries into clearer instructions that localization models can understand more easily, thereby improving the accuracy of moment localization in videos.
Contribution
The paper introduces an approach that uses large language models to reformulate complex queries into more effective instructions for video moment localization.
Findings
Reformulated queries improve localization accuracy.
Large language models effectively generate clearer instructions.
Enhanced query understanding aids in complex video scenarios.
Abstract
The task of moment localization is to localize a temporal moment in an untrimmed video for a given natural language query. Since untrimmed videos contain highly redundant content, the quality of the query is crucial for accurately localizing moments, i.e., the query should provide precise information about the target moment so that the localization model can understand what to look for in the videos. However, the natural language queries in current datasets may not be easy for existing models to understand. For example, the Ego4D dataset uses question sentences as the query to describe relatively complex moments. While natural and straightforward for humans, such question sentences are challenging for mainstream moment localization models like 2D-TAN to understand. Inspired by the recent success of large language models, especially their ability to understand and generate…
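To make the idea of query reformulation concrete, the following is a minimal sketch of how a few-shot prompt might be assembled to ask a large language model to turn a question-style query into a declarative description. It is an illustration only, not the authors' prompt: the function name `build_reformulation_prompt`, the instruction wording, and the example pairs are all assumptions, and no specific LLM API is called.

```python
from typing import List, Tuple

# Hypothetical few-shot pairs (question -> declarative description).
# These are illustrative and are not taken from the paper or from Ego4D.
FEW_SHOT_EXAMPLES: List[Tuple[str, str]] = [
    ("Where did I put the scissors?", "The person puts the scissors down."),
    ("What did I pour into the bowl?", "The person pours something into the bowl."),
]


def build_reformulation_prompt(
    query: str,
    examples: List[Tuple[str, str]] = FEW_SHOT_EXAMPLES,
) -> str:
    """Assemble a few-shot prompt that asks an LLM to rewrite a question
    as a declarative sentence describing the moment to localize."""
    lines = [
        "Rewrite each question as a declarative sentence describing "
        "the video moment to find.",
        "",
    ]
    for question, description in examples:
        lines.append(f"Question: {question}")
        lines.append(f"Description: {description}")
        lines.append("")
    # The final block is left open so the model completes the description.
    lines.append(f"Question: {query.strip()}")
    lines.append("Description:")
    return "\n".join(lines)


prompt = build_reformulation_prompt("Where was the phone before I picked it up?")
print(prompt)
```

The returned string would be sent to whichever language model is available, and the model's completion would then replace the original question as the input to the localization model.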
Taxonomy
Topics: Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition
