ChatDyn: Language-Driven Multi-Actor Dynamics Generation in Street   Scenes

Yuxi Wei; Jingbo Wang; Yuwen Du; Dingju Wang; Liang Pan; Chenxin Xu,; Yao Feng; Bo Dai; Siheng Chen

arXiv:2412.08685·cs.CV·December 13, 2024

ChatDyn: Language-Driven Multi-Actor Dynamics Generation in Street Scenes

Yuxi Wei, Jingbo Wang, Yuwen Du, Dingju Wang, Liang Pan, Chenxin Xu,, Yao Feng, Bo Dai, Siheng Chen

PDF

Open Access

TL;DR

ChatDyn is a novel system that uses language instructions and multi-agent role-playing to generate realistic, controllable street scene dynamics involving vehicles and pedestrians, advancing simulation capabilities.

Contribution

It introduces the first comprehensive method for generating interactive, controllable street scene dynamics based on natural language instructions using multi-LLM-agent planning.

Findings

01

Outperforms previous methods on key subtasks

02

Generates realistic pedestrian and vehicle dynamics

03

Enables precise control through complex language instructions

Abstract

Generating realistic and interactive dynamics of traffic participants according to specific instruction is critical for street scene simulation. However, there is currently a lack of a comprehensive method that generates realistic dynamics of different types of participants including vehicles and pedestrians, with different kinds of interactions between them. In this paper, we introduce ChatDyn, the first system capable of generating interactive, controllable and realistic participant dynamics in street scenes based on language instructions. To achieve precise control through complex language, ChatDyn employs a multi-LLM-agent role-playing approach, which utilizes natural language inputs to plan the trajectories and behaviors for different traffic participants. To generate realistic fine-grained dynamics based on the planning, ChatDyn designs two novel executors: the PedExecutor, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Visualization and Analytics