Parallel Implementations of Cellular Automata for Traffic Models
Moreno Marzolla

TL;DR
This paper explores various parallel computing techniques, including CPU and GPU optimizations, to accelerate cellular automata simulations of traffic models, demonstrating significant speedups and performance improvements.
Contribution
It presents new parallel implementation strategies for the BML traffic cellular automaton, optimizing performance on CPU and GPU architectures.
Findings
GPU implementations outperform CPU in speed
Optimizations reduce the performance gap between CPU and GPU
Parallel techniques significantly speed up traffic CA simulations
Abstract
The Biham-Middleton-Levine (BML) traffic model is a simple two-dimensional, discrete Cellular Automaton (CA) that has been used to study self-organization and phase transitions arising in traffic flows. From the computational point of view, the BML model exhibits the usual features of discrete CA, where the state of the automaton are updated according to simple rules that depend on the state of each cell and its neighbors. In this paper we study the impact of various optimizations for speeding up CA computations by using the BML model as a case study. In particular, we describe and analyze the impact of several parallel implementations that rely on CPU features, such as multiple cores or SIMD instructions, and on GPUs. Experimental evaluation provides quantitative measures of the payoff of each technique in terms of speedup with respect to a plain serial implementation. Our findings…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
