MVMR: A New Framework for Evaluating Faithfulness of Video Moment   Retrieval against Multiple Distractors

Nakyeong Yang; Minsung Kim; Seunghyun Yoon; Joongbo Shin; Kyomin Jung

arXiv:2309.16701·cs.CV·August 12, 2024

MVMR: A New Framework for Evaluating Faithfulness of Video Moment Retrieval against Multiple Distractors

Nakyeong Yang, Minsung Kim, Seunghyun Yoon, Joongbo Shin, Kyomin Jung

PDF

Open Access 1 Repo

TL;DR

This paper introduces MVMR, a new task for evaluating video moment retrieval models against distractors, highlighting the importance of faithfulness and robustness in retrieval performance.

Contribution

The paper proposes the MVMR task, constructs new datasets, and introduces the CroCs learning method to improve model robustness against distractors in video retrieval.

Findings

01

Existing VMR models are easily distracted by misinformation

02

CroCs significantly improves robustness against distractors

03

New datasets enable more realistic evaluation of VMR models

Abstract

With the explosion of multimedia content, video moment retrieval (VMR), which aims to detect a video moment that matches a given text query from a video, has been studied intensively as a critical problem. However, the existing VMR framework evaluates video moment retrieval performance, assuming that a video is given, which may not reveal whether the models exhibit overconfidence in the falsely given video. In this paper, we propose the MVMR (Massive Videos Moment Retrieval for Faithfulness Evaluation) task that aims to retrieve video moments within a massive video set, including multiple distractors, to evaluate the faithfulness of VMR models. For this task, we suggest an automated massive video pool construction framework to categorize negative (distractors) and positive (false-negative) video sets using textual and visual semantic distance verification methods. We extend existing VMR…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yny0506/massive-videos-moment-retrieval
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning

MethodsNone · Contrastive Learning