AgentSense: Virtual Sensor Data Generation Using LLM Agents in Simulated Home Environments

Zikang Leng; Megha Thukral; Yaqi Liu; Hrudhai Rajasekhar; Shruthi K. Hiremath; Jiaman He; Thomas Pl\"otz

arXiv:2506.11773·cs.CV·November 12, 2025

AgentSense: Virtual Sensor Data Generation Using LLM Agents in Simulated Home Environments

Zikang Leng, Megha Thukral, Yaqi Liu, Hrudhai Rajasekhar, Shruthi K. Hiremath, Jiaman He, Thomas Pl\"otz

PDF

Open Access 1 Video

TL;DR

AgentSense introduces a novel virtual data generation pipeline using LLM-guided embodied AI agents in simulated smart homes, producing diverse, privacy-preserving sensor data to improve Human Activity Recognition models, especially in low-resource settings.

Contribution

This work presents the first use of LLM-guided embodied agents in simulated environments for scalable sensor data generation in HAR.

Findings

01

Pretrained models on generated data outperform baselines.

02

Combining generated and small real datasets matches full real data performance.

03

Generated data enhances HAR model performance in low-resource scenarios.

Abstract

A major challenge in developing robust and generalizable Human Activity Recognition (HAR) systems for smart homes is the lack of large and diverse labeled datasets. Variations in home layouts, sensor configurations, and individual behaviors further exacerbate this issue. To address this, we leverage the idea of embodied AI agents -- virtual agents that perceive and act within simulated environments guided by internal world models. We introduce AgentSense, a virtual data generation pipeline in which agents live out daily routines in simulated smart homes, with behavior guided by Large Language Models (LLMs). The LLM generates diverse synthetic personas and realistic routines grounded in the environment, which are then decomposed into fine-grained actions. These actions are executed in an extended version of the VirtualHome simulator, which we augment with virtual ambient sensors that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

AgentSense: Virtual Sensor Data Generation Using LLM Agents in Simulated Home Environments· underline

Taxonomy

TopicsMulti-Agent Systems and Negotiation