Escape dynamics based on bounded rationality
Lingxiao Wang, Yin Jiang

TL;DR
This paper models escape dynamics considering bounded rationality, revealing that such strategies can improve efficiency and lead to meta-stable states, with escape times scaling power-law with system size.
Contribution
It introduces a cellular automaton model incorporating bounded rational behavior via heterogeneous information, advancing understanding of evacuation dynamics.
Findings
Bounded rational strategies improve escape efficiency.
A meta-stable state emerges during evacuation.
Escape time scales as a power-law with system size.
Abstract
The bounded rationality plays a vital role in the collective behavior of the evacuation process. Also investigating human behavior in such an extreme situation is a continuing concern within social psychology. In this paper, we construct a cellular automaton (CA) model for the escape dynamics, and the bounded rational behavior induced by heterogeneous information is introduced. The non-trivial behavior shows in the replicator dynamics method with mean field approximation, where people's perception of the distribution of population and velocity is reduced to an average value in a certain direction. Analyzing the escape efficiency shows that under the premise of rationality, the bounded rational strategy can get higher performance. Interestingly, a quantifiable meta-stable state appears in the escape process, and the escape time is power-law dependent on system size.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Escape dynamics based on bounded rationality
Lingxiao Wang
Yin Jiang
Department of Physics, Tsinghua University and Collaborative Innovation Center of Quantum Matter, Beijing, 100084, PR China
Department of Physics, Beihang University, Beijing, 100191, PR China
Abstract
The bounded rationality plays a vital role in the collective behavior of the evacuation process. Also investigating human behavior in such an extreme situation is a continuing concern within social psychology. In this paper, we construct a cellular automaton (CA) model for the escape dynamics, and the bounded rational behavior induced by heterogeneous information is introduced. The non-trivial behavior shows in the replicator dynamics method with mean field approximation, where people’s perception of the distribution of population and velocity is reduced to an average value in a certain direction. Analyzing the escape efficiency shows that under the premise of rationality, the bounded rational strategy can get higher performance. Interestingly, a quantifiable meta-stable state appears in the escape process, and the escape time is power-law dependent on system size.
keywords:
Escape dynamics, Bounded rationality, Self-organized criticality, Scaling
††journal: Physica A
url]https://sites.google.com/view/lingxiao
1 Introduction
Public security is the cornerstone of national and social stability. In addition to the direct loss of life and property caused by natural disasters, crowd congestion in emergencies often leads to disaster (e.g. clogging and stampede[1, 2, 3]) . It naturally becomes important to understand the collective behavior patterns in case of emergency. Recent models and experiments show that the characteristics of group movement emerge as short intermittent bursts[4, 5, 6]. When the desire of escaping danger exceeds the idea of avoiding the collision, people’s behavior pattern changes from order to disorder. However, our understanding of the transition is still limited. Even the research on escape dynamics in emergencies has a long history[7, 8], it was not until the 1990s that the dynamics on collective behavior attracts people’s attention[3, 9]. Subsequently, game theory, decision theory, communication model and queuing model had also been comprehensively applied[10]. However, due to the lack of individual self-organization, the prediction of many results deviates from the actual situation. Some works start from the hydrodynamic model to study the collective behavior of the population[11, 2]. It successfully explained the group behavior during the pilgrimage to Mecca. Also, we can get some non-trivial macroscopic patterns in these hydrodynamics models[6, 12]. However, the macro-model sometimes is coarse and will miss individual information, while the micro-model represented by the Social force model, the Cellular automata model, and the Magnetic field force model[9, 1, 13, 14, 15] can give a more completed description for the escape dynamics. Simulating the collective behavior in emergencies becomes more convenient with the increasing of computer capabilities. The simulating results, such as the ”faster-is slower” phenomenon also can be verified by experiments on the different group (e.g people, vehicles, ants, sheep, microbial populations, etc.)[4, 16, 17, 18, 19].
With the development of sensor technology and the improvement of microchip computing power, collecting more redundant data and using more realistic methods to simulate the escape process becomes possible[20, 21, 22, 23, 24]. It is of practical significance that the disaster may happen in different complex en-vironments[25]. It’s natural to introduce the game theory into the evacuation model, for it can include the interaction between people (and environment) self-consistently[26, 27]. The embedded game action occurs in conflicts mainly, and people will decide to advance or avoid others by pay-off matrix[28, 29, 30, 31, 32]. However, handling conflicts are only parts of escape dynamics, how human beings make decisions induced by specific circumstances also is an essential part[33]. In this paper, we introduce the replicator dynamics into the evacuation. It makes people’s response to the external environment can be included in the decision-making process.
Besides, some researches focus on the behavior itself, for escape dynamics provides an extreme case to investigate collective behavior[21, 15, 34, 35]. The diverse and fascinating collective behaviors occur in both virtual and real space[36, 37]: social network, financial network and social norms, these virtual social connections naturally incubate the collective behavior; as for the real space, collective modes are common in urban dynamics, traffic flow, and pedestrian dynamics. Therefore, the escape dynamics provides us with an extreme scene, in which human instinct dominates[38, 23]. It makes us have chances to effectively study human behavior itself without complex social relations.
Based on the point, this work introduces bounded rationality[39, 40]from behavioral economics in the escape dynamics through the replicator dynamics method[41, 42]. A cellular automaton model is used to model the escape dynamics in a closed boundary. And the influence of boundedly rational behavior strategy on collective behavior is investigated by using mean field approximation[43]. Escape efficiency is affected by the environment and heterogeneous information processing. We also analyze the group and individual escape time, giving a possible picture to understand the connection between collective behavior and individual action.
2 Methods
2.1 Cellular Automaton Model
We construct a cellular automation model for simulating the pedestrian flow in a two-dimensional system. The underlying structure is a cell grid, where is the system size. The state of cell can be empty, or occupied by one pedestrian exactly or wall. It’s an instructive sample, once we set the size of cell as 0.5m 0.5m, which can simulate the escaping when some disaster happens. Model adopts the Moore neighbor, and pedestrians update their positions by transition matrices ,
[TABLE]
where means the possibility that the pedestrian moves from time at position to next time-step position. The neighbors’ directions are labeled by , where . Each cell can either be empty, or occupied by wall or exactly one pedestrian. Every time-step pedestrian can choose to move into a new position or stop. Once we have chosen the location of the exit, the synchronously updated cellular automaton can imitate the escape process[13, 44].
2.2 Heterogeneous Information and Bounded Rationality
Bounded rationality is formalized with such major variables as incomplete information, information processing and the non-traditional policy-maker target function[40]. Heterogeneous information could be the reason why people shows irrationality[45, 38, 20, 41]. The extreme situation of escaping from disasters constrains people’s behavior, in which only intuition or social habits remains, no long term trade-off. The replicator dynamics modeling[41, 42] can link the different behaviors, whether practical or spiritual, during the escaping process. The transition possibility derives from the follow definition,
[TABLE]
where means the weight from rational and bounded rational part respectively. The definition of the components in matrix ,
[TABLE]
which means if the position around the individual at time is empty, the , whereas the value is . And the only holds when the exit direction is indicated by , if not take the value . The is a minimum value that the calculation accuracy can reach. The parameter represents the attraction of the exit to persons want to escape, or the importance of the information of the exit position.
The definition of the bounded rational part relies on the heterogeneous information from the crowd. As the transport model of statistical physics inspired us, the escape dynamics needs more information that persons’ position and velocity distribution, the basic variables of the transport theory. Considering the full information cannot easily be achieved by individuals, the mean-field approximate can provide a global perception for the people on move, which shows as the follow,
[TABLE]
The indicates the transition possibility only decided by , the neighbor occupied state and the direction of exit, or the objective environment. The defines , where is the population distribution at time. The definition shows the proportion of individuals in orientation as mean-field approximation, and people will be attracted to the direction with more density. We use it to mimic the ”crowd” behavior for individuals, which also means people can potentially get more population density information. As for the , , where is the velocity distribution at time. The proportion of individuals move to orientation has been extracted, and people will follow others as the weight. It transfers more potential velocity information to people. The latter two strategies show the individual can process the heterogeneous information – population and velocity of all persons. The population and velocity of the crowd are the global continuous distribution quantity in reality, which affects the human behavior indirectly, for people can gather and process information from the environment[20, 45]. In this work, the distribution is discrete and the individual can process them as background, that’s what above definition means. People’s perception to the distribution is reduced to the average value in a certain direction, a mean background field, as what statistical physics did in a many-body system.
2.3 Evolution Rules
The model escape rules gives as follows,
Step.1 Initialization. Set the position of exit and generate population distribution at the lattice. At the time, disaster turns out and individuals begin to move;
Step.2 Evolution. At the time step, the individual move to the next position as transition matrices at time step. Update all individuals synchronously, and the conflict will be handled by compared the transition possibility;
Handle Conflicts. The conflicts occurs when the two or more persons want to move into the same position, and what we do to handle the conflicts is to compare their transition possibilities which reflects their willingness to move. For example, the individual and both want to move into position , and the corresponding possibility for the is and the is . If the , then the individual move successfully and the stayed where it was, and vice versa. For equal cases, one is randomly selected. It can be easily extended to the situation of many people.
Step.3 Escape. For the individual whose destination is exit at the next time step, escape successfully, and remove it from the lattice and reduce population as . If the escape stops.
Step.4 Update. Update the transition matrices as above strategies, turn to Step.2. The time step escape finished.
3 Results and Discussions
3.1 Dynamics Simulation
Firstly we build an escape dynamics simulation frame based on the escape rules, in which three different strategies for processing heterogeneous information have been added. As an example, Fig. 1 depicts a typical escape process: at the beginning individuals distribute in the lattice randomly with initial population ratio ; then they move into the exit direction as the parameter which shows how important the exit information is for them; at the time step, people escape from the disaster area.
In the Fig. 2, we show the arch-like blocking as the other simulations and experiments found[4, 16, 18, 19], which indicates the escape dynamical model catches the key points for the flock clogging problem.
Fig. 3 Shows the escape efficiency in the time steps at the parameter, lattice size and the exit size is 2. Over time, all three behavior strategies are driving groups to flee disaster areas. Figs. 3a, 3b and 3c have different and the high initial population case can escape faster than the lower. It’s because the low initial population case has more empty space, which reduces the escape chances. As for the three behavior strategies, they have similar escape efficiency in the higher case, where the population decreasing is almost same in 100 time steps. The three behavior strategies show relative large differences in low density situations. During the same time, the made more people escape, the followed, and the relative worst is the strategy. The above results illustrate that the ”Bounded Rationality” strategies can refine information from environment, which makes people work better in low density case. Figs. 3c and 3d have different and the high case can escape faster than the lower. It makes sense that represents the importance of exit, and it could be the intensity of exit signs or how clear people knows the exit information. In the Fig. 3d, the exit information parameter , it makes people random walks at some time and the strategy shows more powerful capability of driving people to the exit.
3.2 Collective Behavior and Meta-stable State
To investigate the factors affecting collective behavior more quantitatively, we define the escape ratio , the corresponding group escape time and individual escape time . The is the time taken to evacuate of the population and the is the average time, where the is the time taken to evacuate th person. Figs. 4 and 5 shows the escape time in strategy at different parameter , lattice size , initial population ratio and the exit size is 2. The parameter has played a key role in the escape dynamics, as Fig. 4 shows, the escape time decays exponentially with this exit parameter. At small region, the increasing would push people escape faster; at the large region, the effect of increasing is limited. The results reflect that there is a effective range for the relative importance of exit information in the dynamics processing. The other statistical result gets in Fig. 5, where group escape time increases with escape ratio near exponentially and the individual escape time increases near linearly. It implies that the time taken to rescue more people rise sharply, which reveals the conflict of interest between individual and group. The above results are independent of initial conditions, for every case runs 20 times in different random initial population distribution.
The above nonlinear dependence inspired us to study the influence of system size on escape time. The results exhibit in Fig. 6, the other system parameters are: initial population ratio , the parameter and the exit size is same as before. Both the group and individual escape time have , and the exponential coefficient deviating from system area slightly. It can be treated as a signal of the critical self-organizing behavior for the escape dynamics[36, 35]. At the same time, we notice that the arch-like blocking is a corresponding phenomenon during the critical process. It can be understood as a meta-stable state, for the empty is the stable final state. We extract the edge curve as parabola fitting () from the escape maps (e.g.Fig. 7 as a sample at ), and this meta-stable state will emerge in the mid-time term ( for this sample). Where the pixel positions of corresponding evolution patterns represented by coordinate axes (x, y), and the position of exit is (0,200). As Fig. 7 shows, the edge of population will push forward and become narrower. It can be described by the two fitting parameters: , in this sample. As time goes, the opening of parabola becomes narrow until the meta-stable state disappears. Even parabola fitting is not enough accurate, it still reflects such a meta-stable propulsion process.
4 Conclusions
In this work, we construct an escape dynamics frame with cellular automata model and can generate the space-time escape map similar to the actual situation. We also use the Replicator Dynamics to combine the bounded rational behavior into the escape. And three different behavior strategies were compared: the , and . The difference among these strategies lies in the completeness of the information. Our results show that under the premise of rationality, the bounded rational behavior of the can get higher evacuation efficiency. Subsequently, the influence of escape ratio, exit parameter and system size on the collective behavior pattern was further studied by using a parametric method in the behavior strategy. We also introduced group escape time and average escape time to study the behavior patterns of people fleeing disaster areas and found that changes in the external environment and individual rationality will have non-trivial effects on it. The increase in the importance of exit information will help to improve the efficiency of escape, while conflicts of interests between individuals and group occur in the process of increasing the escape ratio . Moreover, escape time is power-law dependent on system size. It can be treated as a signal of the critical self-organizing behavior for the escape dynamics. Finally, we extract the edge curve as parabola fitting from the escape dynamics, which reflects such a meta-stable propulsion process.
In the extreme case, this work uses the replicator dynamics for reference, introduces bounded rationality, and adopts mean-field approximation to study collective behavior. Furthermore, we can use more abundant tools to carry out more in-depth research on such problems: for example, using the state-of-the-art deep learning methods to recognize potential collective behavior and avoid trampling; make online games [46, 47] based on the model, measuring the parameter as a group rational scale; search the possible order parameter during such a emergence phenomena, etc. Overall, this work has provided a deeper insight into the correlation between human rationality and corresponding collective behaviors.
5 Acknowledgments
The authors thank Prof.WengGuo Weng and Dr.Xingyu Guo for useful discussions and comments. The work is supported by the Qingfeng Scholarship from Tsinghua University. LX and YJ are also supported by the China Scholarship Council(CSC) for visiting at the University of Tokyo and the Zhuobai Program of Beihang University respectively.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] D. Helbing, I. Farkas, T. Vicsek, Simulating dynamical features of escape panic, Nature 407 (6803) (2000) 487–490 (Sep. 2000). doi:10.1038/35035023 . · doi ↗
- 2[2] R. L. Hughes, A continuum theory for the flow of pedestrians, Transportation Research Part B: Methodological 36 (6) (2002) 507–535 (Jul. 2002). doi:10.1016/S 0191-2615(01)00015-7 . · doi ↗
- 3[3] D. Helbing, L. Buzna, A. Johansson, T. Werner, Self-Organized Pedestrian Crowd Dynamics: Experiments, Simulations, and Design Solutions, Transportation Science 39 (1) (2005) 1–24 (Feb. 2005). doi:10.1287/trsc.1040.0108 . · doi ↗
- 4[4] J. M. Pastor, A. Garcimartín, P. A. Gago, J. P. Peralta, C. Martín-Gómez, L. M. Ferrer, D. Maza, D. R. Parisi, L. A. Pugnaloni, I. Zuriguel, Experimental proof of faster-is-slower in systems of frictional particles flowing through constrictions, Phys. Rev. E 92 (6) (2015) 062817 (Dec. 2015). doi:10.1103/Phys Rev E.92.062817 . · doi ↗
- 5[5] A. Nicolas, S. Ibáñez, M. N. Kuperman, S. Bouzat, A counterintuitive way to speed up pedestrian and granular bottleneck flows prone to clogging: Can ‘more’ escape faster?, J. Stat. Mech. 2018 (8) (2018) 083403 (Aug. 2018). doi:10.1088/1742-5468/aad 6c 0 . · doi ↗
- 6[6] N. Bain, D. Bartolo, Dynamic response and hydrodynamics of polarized crowds, Science 363 (6422) (2019) 46–49 (Jan. 2019). doi:10.1126/science.aat 9891 . · doi ↗
- 7[7] K. Togawa, Study on Fire Escapes Basing on the Observation of Multitude Currents, Building Research Institute, Ministry of Construction, 1955 (1955).
- 8[8] L. F. Henderson, The Statistics of Crowd Fluids, Nature 229 (1971) 381–383 (Feb. 1971). doi:10.1038/229381 a 0 . · doi ↗
