AgentSense: Virtual Sensor Data Generation Using LLM Agents in Simulated Home Environments
Zikang Leng, Megha Thukral, Yaqi Liu, Hrudhai Rajasekhar, Shruthi K. Hiremath, Jiaman He, Thomas Pl\"otz

TL;DR
AgentSense introduces a novel virtual data generation pipeline using LLM-guided embodied AI agents in simulated smart homes, producing diverse, privacy-preserving sensor data to improve Human Activity Recognition models, especially in low-resource settings.
Contribution
This work presents the first use of LLM-guided embodied agents in simulated environments for scalable sensor data generation in HAR.
Findings
Pretrained models on generated data outperform baselines.
Combining generated and small real datasets matches full real data performance.
Generated data enhances HAR model performance in low-resource scenarios.
Abstract
A major challenge in developing robust and generalizable Human Activity Recognition (HAR) systems for smart homes is the lack of large and diverse labeled datasets. Variations in home layouts, sensor configurations, and individual behaviors further exacerbate this issue. To address this, we leverage the idea of embodied AI agents -- virtual agents that perceive and act within simulated environments guided by internal world models. We introduce AgentSense, a virtual data generation pipeline in which agents live out daily routines in simulated smart homes, with behavior guided by Large Language Models (LLMs). The LLM generates diverse synthetic personas and realistic routines grounded in the environment, which are then decomposed into fine-grained actions. These actions are executed in an extended version of the VirtualHome simulator, which we augment with virtual ambient sensors that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMulti-Agent Systems and Negotiation
