EmbodiedCity: A Benchmark Platform for Embodied Agent in Real-world City Environment
Chen Gao, Baining Zhao, Weichen Zhang, Jinzhu Mao, Jun Zhang, Zhiheng, Zheng, Fanhang Man, Jianjie Fang, Zile Zhou, Jinqiang Cui, Xinlei Chen, Yong, Li

TL;DR
This paper introduces EmbodiedCity, a comprehensive benchmark platform with a realistic 3D simulation of a city environment to evaluate embodied AI agents in outdoor, real-world scenarios, addressing a gap in current indoor-focused research.
Contribution
It presents a high-fidelity city simulation platform with evaluation tasks and interfaces, enabling assessment of embodied AI in outdoor environments, expanding capabilities and practical applications.
Findings
Evaluated large language models' embodied intelligence capabilities.
Demonstrated the platform's potential for advancing embodied AI research.
Provided a versatile environment for future embodied AI development.
Abstract
Embodied artificial intelligence emphasizes the role of an agent's body in generating human-like behaviors. The recent efforts on EmbodiedAI pay a lot of attention to building up machine learning models to possess perceiving, planning, and acting abilities, thereby enabling real-time interaction with the world. However, most works focus on bounded indoor environments, such as navigation in a room or manipulating a device, with limited exploration of embodying the agents in open-world scenarios. That is, embodied intelligence in the open and outdoor environment is less explored, for which one potential reason is the lack of high-quality simulators, benchmarks, and datasets. To address it, in this paper, we construct a benchmark platform for embodied intelligence evaluation in real-world city environments. Specifically, we first construct a highly realistic 3D simulation environment based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvacuation and Crowd Dynamics · Context-Aware Activity Recognition Systems · 3D Modeling in Geospatial Applications
MethodsSoftmax · Attention Is All You Need · Focus · Sparse Evolutionary Training
