ViRAC: A Vision-Reasoning Agent Head Movement Control Framework in Arbitrary Virtual Environments
Juyeong Hwang, Seong-Eun Hong, Hyeongyeop Kang

TL;DR
ViRAC is a framework that uses large-scale vision and language models to generate natural, context-aware head movements for virtual agents, improving realism without explicit cognitive modeling.
Contribution
Introduces ViRAC, a novel approach leveraging pre-trained models' biases for realistic head movement control in virtual agents, surpassing prior data-driven and saliency-based methods.
Findings
Produces more natural head rotations than state-of-the-art methods.
Aligns closely with human head-movement data.
User studies confirm enhanced realism and plausibility.
Abstract
Creating lifelike virtual agents capable of interacting with their environments is a longstanding goal in computer graphics. This paper addresses the challenge of generating natural head rotations, a critical aspect of believable agent behavior for visual information gathering and dynamic responses to environmental cues. Although earlier methods have made significant strides, many rely on data-driven or saliency-based approaches, which often underperform in diverse settings and fail to capture deeper cognitive factors such as risk assessment, information seeking, and contextual prioritization. Consequently, generated behaviors can appear rigid or overlook critical scene elements, thereby diminishing the sense of realism. In this paper, we propose \textbf{ViRAC}, a \textbf{Vi}sion-\textbf{R}easoning \textbf{A}gent Head Movement \textbf{C}ontrol framework, which exploits the common-sense…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial Robot Interaction and HRI · Evacuation and Crowd Dynamics · Robotics and Automated Systems
