RANKVIDEO: Reasoning Reranking for Text-to-Video Retrieval
Tyler Skow, Alexander Martin, Benjamin Van Durme, Rama Chellappa, Reno Kriz

TL;DR
RANKVIDEO introduces a reasoning-based reranker for text-to-video retrieval that explicitly reasons over query-video pairs, significantly improving retrieval accuracy on large-scale benchmarks through a novel training pipeline.
Contribution
It is the first to apply reasoning-based reranking to video retrieval, combining perception-grounded fine-tuning with a multi-objective reranking training pipeline.
Findings
Achieves 31% improvement on nDCG@10 over baseline methods.
Outperforms text-only and vision-language rerankers in accuracy.
Demonstrates efficiency and effectiveness on MultiVENT 2.0 benchmark.
Abstract
Reranking is a critical component of modern retrieval systems, which typically pair an efficient first-stage retriever with a more expressive model to refine results. While large reasoning models have driven rapid progress in text-centric reranking, reasoning-based reranking for video retrieval remains underexplored. To address this gap, we introduce RANKVIDEO, a reasoning-based reranker for video retrieval that explicitly reasons over query-video pairs using video content to assess relevance. RANKVIDEO is trained using a two-stage curriculum consisting of perception-grounded supervised fine-tuning followed by reranking training that combines pointwise, pairwise, and teacher confidence distillation objectives, and is supported by a data synthesis pipeline for constructing reasoning-intensive query-video pairs. Experiments on the large-scale MultiVENT 2.0 benchmark demonstrate that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Video Analysis and Summarization
