Pseudo Dataset Generation for Out-of-Domain Multi-Camera View   Recommendation

Kuan-Ying Lee; Qian Zhou; Klara Nahrstedt

arXiv:2410.13585·cs.CV·October 18, 2024

Pseudo Dataset Generation for Out-of-Domain Multi-Camera View Recommendation

Kuan-Ying Lee, Qian Zhou, Klara Nahrstedt

PDF

Open Access

TL;DR

This paper introduces a method to generate pseudo-labeled datasets from regular videos to improve multi-camera view recommendation models, significantly enhancing their accuracy in unseen domains.

Contribution

It proposes transforming ordinary videos into pseudo-labeled datasets, addressing data scarcity and domain generalization issues in multi-camera view recommendation.

Findings

01

68% relative accuracy improvement in target domain

02

Bridges the gap between in-domain and unseen domains

03

Effective pseudo-labeling from edited videos

Abstract

Multi-camera systems are indispensable in movies, TV shows, and other media. Selecting the appropriate camera at every timestamp has a decisive impact on production quality and audience preferences. Learning-based view recommendation frameworks can assist professionals in decision-making. However, they often struggle outside of their training domains. The scarcity of labeled multi-camera view recommendation datasets exacerbates the issue. Based on the insight that many videos are edited from the original multi-camera videos, we propose transforming regular videos into pseudo-labeled multi-camera view recommendation datasets. Promisingly, by training the model on pseudo-labeled datasets stemming from videos in the target domain, we achieve a 68% relative improvement in the model's accuracy in the target domain and bridge the accuracy gap between in-domain and never-before-seen domains.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods