ViewActive: Active viewpoint optimization from a single image

Jiayi Wu; Xiaomin Lin; Botao He; Cornelia Fermuller; Yiannis Aloimonos

arXiv:2409.09997·cs.RO·July 29, 2025

ViewActive: Active viewpoint optimization from a single image

Jiayi Wu, Xiaomin Lin, Botao He, Cornelia Fermuller, Yiannis Aloimonos

PDF

Open Access 1 Repo

TL;DR

ViewActive is a machine learning framework that predicts optimal viewpoints for scene perception from a single image, improving robotic scene understanding and real-time motion planning.

Contribution

It introduces the 3D Viewpoint Quality Field (VQF) for viewpoint optimization guidance based on a single image, enabling effective generalization across objects and categories.

Findings

01

Achieves 72 FPS on a single GPU.

02

Enhances state-of-the-art object recognition performance.

03

Supports real-time robotic motion planning.

Abstract

When observing objects, humans benefit from their spatial visualization and mental rotation ability to envision potential optimal viewpoints based on the current observation. This capability is crucial for enabling robots to achieve efficient and robust scene perception during operation, as optimal viewpoints provide essential and informative features for accurately representing scenes in 2D images, thereby enhancing downstream tasks. To endow robots with this human-like active viewpoint optimization capability, we propose ViewActive, a modernized machine learning approach drawing inspiration from aspect graph, which provides viewpoint optimization guidance based solely on the current 2D image input. Specifically, we introduce the 3D Viewpoint Quality Field (VQF), a compact and consistent representation of viewpoint quality distribution similar to an aspect graph, composed of three…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jiayi-wu-umd/viewactive
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · Advanced Image Processing Techniques