Learning Fine-Grained Features for Pixel-wise Video Correspondences

Rui Li; Shenglong Zhou; Dong Liu

arXiv:2308.03040·cs.CV·August 8, 2023·1 cites

Learning Fine-Grained Features for Pixel-wise Video Correspondences

Rui Li, Shenglong Zhou, Dong Liu

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a holistic, adversarial, coarse-to-fine framework for learning fine-grained pixel-wise video correspondences, improving robustness and efficiency over existing methods.

Contribution

It proposes a novel approach combining synthetic and real-world videos with adversarial learning and a coarse-to-fine scheme for better pixel-wise correspondence.

Findings

01

Outperforms state-of-the-art in accuracy

02

Achieves higher computational efficiency

03

Demonstrates robustness on real-world videos

Abstract

Video analysis tasks rely heavily on identifying the pixels from different frames that correspond to the same visual target. To tackle this problem, recent studies have advocated feature learning methods that aim to learn distinctive representations to match the pixels, especially in a self-supervised fashion. Unfortunately, these methods have difficulties for tiny or even single-pixel visual targets. Pixel-wise video correspondences were traditionally related to optical flows, which however lead to deterministic correspondences and lack robustness on real-world videos. We address the problem of learning features for establishing pixel-wise correspondences. Motivated by optical flows as well as the self-supervised feature learning, we propose to use not only labeled synthetic videos but also unlabeled real-world videos for learning fine-grained representations in a holistic framework.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

qianduoduolr/fgvc
pytorchOfficial

Videos

Learning Fine-Grained Features for Pixel-Wise Video Correspondences· youtube

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · Advanced Image Processing Techniques