Empowering NPC Dialogue with Environmental Context Using LLMs and Panoramic Images
Grega Rade\v{z}, Ciril Bohak

TL;DR
This paper enhances NPC dialogue in games by integrating panoramic images and LLMs to provide environmental context, resulting in more responsive and immersive interactions.
Contribution
It introduces a novel method combining panoramic scene segmentation with LLMs to enable NPCs to understand and reference their surroundings dynamically.
Findings
Participants preferred context-aware NPCs in user studies.
The system successfully incorporates spatial environment data into NPC dialogue.
Expert feedback helped refine the approach.
Abstract
We present an approach for enhancing non-playable characters (NPCs) in games by combining large language models (LLMs) with computer vision to provide contextual awareness of their surroundings. Conventional NPCs typically rely on pre-scripted dialogue and lack spatial understanding, which limits their responsiveness to player actions and reduces overall immersion. Our method addresses these limitations by capturing panoramic images of an NPC's environment and applying semantic segmentation to identify objects and their spatial positions. The extracted information is used to generate a structured JSON representation of the environment, combining object locations derived from segmentation with additional scene graph data within the NPC's bounding sphere, encoded as directional vectors. This representation is provided as input to the LLM, enabling NPCs to incorporate spatial knowledge…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
