ViRAC: A Vision-Reasoning Agent Head Movement Control Framework in   Arbitrary Virtual Environments

Juyeong Hwang; Seong-Eun Hong; Hyeongyeop Kang

arXiv:2502.10046·cs.GR·February 17, 2025

ViRAC: A Vision-Reasoning Agent Head Movement Control Framework in Arbitrary Virtual Environments

Juyeong Hwang, Seong-Eun Hong, Hyeongyeop Kang

PDF

Open Access

TL;DR

ViRAC is a framework that uses large-scale vision and language models to generate natural, context-aware head movements for virtual agents, improving realism without explicit cognitive modeling.

Contribution

Introduces ViRAC, a novel approach leveraging pre-trained models' biases for realistic head movement control in virtual agents, surpassing prior data-driven and saliency-based methods.

Findings

01

Produces more natural head rotations than state-of-the-art methods.

02

Aligns closely with human head-movement data.

03

User studies confirm enhanced realism and plausibility.

Abstract

Creating lifelike virtual agents capable of interacting with their environments is a longstanding goal in computer graphics. This paper addresses the challenge of generating natural head rotations, a critical aspect of believable agent behavior for visual information gathering and dynamic responses to environmental cues. Although earlier methods have made significant strides, many rely on data-driven or saliency-based approaches, which often underperform in diverse settings and fail to capture deeper cognitive factors such as risk assessment, information seeking, and contextual prioritization. Consequently, generated behaviors can appear rigid or overlook critical scene elements, thereby diminishing the sense of realism. In this paper, we propose \textbf{ViRAC}, a \textbf{Vi}sion-\textbf{R}easoning \textbf{A}gent Head Movement \textbf{C}ontrol framework, which exploits the common-sense…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSocial Robot Interaction and HRI · Evacuation and Crowd Dynamics · Robotics and Automated Systems