Video Infringement Detection via Feature Disentanglement and Mutual Information Maximization
Zhenguang Liu, Xinyang Yu, Ruili Wang, Shuai Ye, Zhe Ma, Jianfeng, Dong, Sifeng He, Feng Qian, Xiaobo Zhang, Roger Zimmermann, Lei Yang

TL;DR
This paper introduces a novel video infringement detection method that disentangles high-dimensional features into meaningful sub-components and maximizes task-relevant information, significantly improving detection accuracy on large-scale benchmarks.
Contribution
It proposes a feature disentanglement approach combined with mutual information maximization to enhance video infringement detection performance.
Findings
Achieved 90.1% TOP-100 mAP on SVD dataset.
Set new state-of-the-art results on VCSL benchmark.
Demonstrated effectiveness of disentangled features in large-scale datasets.
Abstract
The self-media era provides us tremendous high quality videos. Unfortunately, frequent video copyright infringements are now seriously damaging the interests and enthusiasm of video creators. Identifying infringing videos is therefore a compelling task. Current state-of-the-art methods tend to simply feed high-dimensional mixed video features into deep neural networks and count on the networks to extract useful representations. Despite its simplicity, this paradigm heavily relies on the original entangled features and lacks constraints guaranteeing that useful task-relevant semantics are extracted from the features. In this paper, we seek to tackle the above challenges from two aspects: (1) We propose to disentangle an original high-dimensional feature into multiple sub-features, explicitly disentangling the feature into exclusive lower-dimensional components. We expect the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning
