Learning to Select Camera Views: Efficient Multiview Understanding at   Few Glances

Yunzhong Hou; Stephen Gould; Liang Zheng

arXiv:2303.06145·cs.CV·March 13, 2023·1 cites

Learning to Select Camera Views: Efficient Multiview Understanding at Few Glances

Yunzhong Hou, Stephen Gould, Liang Zheng

PDF

Open Access 1 Repo

TL;DR

This paper introduces MVSelect, a reinforcement learning-based method for selecting optimal camera views in multiview systems, reducing computational costs while maintaining high performance in classification and detection tasks.

Contribution

The paper presents a novel view selection approach that jointly trains with task networks, enabling efficient multiview understanding with fewer views and insights into camera layout optimization.

Findings

01

Achieves high accuracy with only 2-3 views out of N

02

Reduces computational costs significantly

03

Identifies cameras that can be turned off with minimal performance loss

Abstract

Multiview camera setups have proven useful in many computer vision applications for reducing ambiguities, mitigating occlusions, and increasing field-of-view coverage. However, the high computational cost associated with multiple views poses a significant challenge for end devices with limited computational resources. To address this issue, we propose a view selection approach that analyzes the target object or scenario from given views and selects the next best view for processing. Our approach features a reinforcement learning based camera selection module, MVSelect, that not only selects views but also facilitates joint training with the task network. Experimental results on multiview classification and detection tasks show that our approach achieves promising performance while using only 2 or 3 out of N available views, significantly reducing computational costs. Furthermore,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hou-yz/mvselect
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Image Enhancement Techniques · Advanced Vision and Imaging