Efficient Self-Supervised Video Hashing with Selective State Spaces
Jinpeng Wang, Niu Lian, Jun Li, Yuting Wang, Yan Feng, Bin Chen,, Yongbing Zhang, Shu-Tao Xia

TL;DR
This paper introduces S5VH, a novel self-supervised video hashing model based on Mamba, which balances efficiency and effectiveness through selective state-space modeling and a new learning paradigm, outperforming existing methods.
Contribution
The paper proposes S5VH, a Mamba-based video hashing model with a new self-supervised learning strategy and bidirectional layers, improving efficiency and performance in video retrieval tasks.
Findings
S5VH outperforms state-of-the-art methods in accuracy.
The model achieves faster convergence and better transferability.
Inference efficiency is significantly improved.
Abstract
Self-supervised video hashing (SSVH) is a practical task in video indexing and retrieval. Although Transformers are predominant in SSVH for their impressive temporal modeling capabilities, they often suffer from computational and memory inefficiencies. Drawing inspiration from Mamba, an advanced state-space model, we explore its potential in SSVH to achieve a better balance between efficacy and efficiency. We introduce S5VH, a Mamba-based video hashing model with an improved self-supervised learning paradigm. Specifically, we design bidirectional Mamba layers for both the encoder and decoder, which are effective and efficient in capturing temporal relationships thanks to the data-dependent selective scanning mechanism with linear complexity. In our learning strategy, we transform global semantics in the feature space into semantically consistent and discriminative hash centers, followed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods · Video Analysis and Summarization
MethodsMamba: Linear-Time Sequence Modeling with Selective State Spaces
