A Similarity Alignment Model for Video Copy Segment Matching

Zhenhua Liu; Feipeng Ma; Tianyi Wang; Fengyun Rao

arXiv:2305.15679 · cs.CV · May 26, 2023 · 2 cites

TL;DR

This report presents the Similarity Alignment Model (SAM) for video copy segment matching, the winning solution on the Matching Track of the Video Similarity Challenge held at CVPR 2023. SAM beats the second-place competitor by 0.108 / 0.144 absolute in Phase 1 / Phase 2.

Contribution

The report introduces SAM, a model designed for video copy segment matching, and shows that it outperforms the other entries on the Matching Track of the Video Similarity Challenge.

Findings

01

SAM improves on the second-place competitor by 0.108 / 0.144 absolute in Phase 1 / Phase 2.

02

SAM is the winning solution on the Matching Track of the Video Similarity Challenge held at CVPR 2023.

03

The code is publicly available on GitHub.

Abstract

With the development of multimedia technology, Video Copy Detection has become a crucial problem for social media platforms. Meta AI held the Video Similarity Challenge at CVPR 2023 to push the technology forward. In this report, we share our winning solution for the Matching Track. We propose a Similarity Alignment Model (SAM) for video copy segment matching. SAM outperforms the other competitors, with a 0.108 / 0.144 absolute improvement over the second-place competitor in Phase 1 / Phase 2. Code is available at https://github.com/FeipengMa6/VSC22-Submission/tree/main/VSC22-Matching-Track-1st.

Code & Models

Repositories

feipengma6/vsc22-submission
PyTorch · Official

Taxonomy

Topics: Video Analysis and Summarization · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning

Methods: Similarity Alignment Model (SAM)