The politics of physicists social models

Pablo Jensen

arXiv:1903.00964·physics.soc-ph·October 23, 2019

The politics of physicists social models

Pablo Jensen

PDF

TL;DR

This paper reviews the application of statistical physics models to social sciences, questioning their practical relevance and linking social modeling to political efforts of human taming.

Contribution

It critically examines the conceptual usefulness and real-world applicability of physicists' social models and explores their political implications.

Findings

01

Physicists' social models are conceptually interesting but often lack real-world relevance.

02

Social modeling may serve as a form of political human taming.

03

The relevance of simplified models to complex social systems is questionable.

Abstract

I give an overview of the topic of this special issue, the applications of (statistical) physics to social sciences at large. I discuss several examples of simple social models put forward by physicists and discuss their interest. I argue that while they may be conceptually useful to correct our intuitive models of social mechanisms, their relevance for real social systems is moot. What is more, since physicists have always needed to tame the world inside laboratories to make their models relevant, I suggest that social modeling might be linked to human taming, a smashing political project.

Equations2

l (ρ_{q}) \approx H \int_{0}^{ρ_{q}} u (ρ) d ρ .

l (ρ_{q}) \approx H \int_{0}^{ρ_{q}} u (ρ) d ρ .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

The politics of physicists’ social models

Pablo Jensen

corresponding author, [email protected]

Institut Rhônalpin des Systemes Complexes, IXXI, F-69342 Lyon, France

Universite de Lyon, Laboratoire de Physique ENS Lyon and CNRS, 46 Rue d’Italie, F-69342 Lyon, France

Abstract

I give an overview of the topic of this special issue, the "applications of (statistical) physics to social sciences at large". I discuss several examples of simple social models put forward by physicists and discuss their interest. I argue that while they may be conceptually useful to correct our intuitive models of social mechanisms, their relevance for real social systems is moot. What is more, since physicists have always needed to ’tame’ the world inside laboratories to make their models relevant, I suggest that social modeling might be linked to human taming, a smashing political project.

I Introduction

Why would physicists study social systems? Can they add anything to the knowledge of social scientists, economists, or all of us, who practice social systems every single day? One possible answer is given by physicist Rémi Louf who recently earned a prestigious price for his PhD on the physics of cities: Physics represents "a way of questioning the world and understanding it", starting from observations to find the "simple mechanisms that govern each phenomenon". Thanks to the avalanche of social data, the digital traces left by everyone, physicists can now confront their simple models to social reality and go beyond their "impressions", to found a new "science of cities [ …] that would provide a sufficiently precise image to guide political choices". We can understand the enthusiasm of the young physicist to found a true science of society, at once empirical and rigorous, that is, mathematical.

Before getting too enthusiastic however, it may be useful to read a text published nearly two centuries ago by Belgian astronomer Adolphe Quételet quetelet . He proclaimed the birth of a new science of crime: "If we want to acquire knowledge of the general laws of [human criminal inclinations], we must gather enough observations to ensure that everything that is not not purely accidental is eliminated. [Thanks to this knowledge, we will have] the possibility to improve men, by modifying their institutions and their habits". The reasoning is similar: from data to social laws, from laws to the improvement of society. And the parallel is even more striking given the title of Quételet’s book : "Essay of social physics".

This book was part of a vast economic, political and scientific transformation of European societies. Increasingly, strong states transformed their territories and their inhabitants to make them governable from a center. They counted populations and wealth to better enlist soldiers or collect taxes. This control required the setting up of a legal and material infrastructure, an investment similar to that of a road or rail network. Concretely, the States generalized supervision tools that we take for granted today, like the maps, the cadastre, the homogenization of the units of measurement or the stabilization of the surnames.

The science of "statistics" is a direct consequence of this transformation desrosieres ; hacking . At first, this word - derived from the Italian stato, State - meant all the knowledge useful for governing a country, and did not include mathematics. But in the 19th century, the scientific elite invented computing tools capable of exploiting social data to help this centralized government of populations. Pierre-Simon de Laplace, the great astronomer and mathematician, minister of Napoleon in 1799, developed different approaches to estimate the French population from parcel data, because it was difficult - and expensive - to carry out a comprehensive census. He assumed that the number of births per inhabitant was more or less constant in the country, an assumption that he tested in some thirty carefully selected regions to be representative of the whole territory. It was then sufficient to count the number of births, which were well known from parish registers, to obtain an estimate of the total population.

James Clerk Maxwell exported the statistical approach from social to physical systems. In 1859 he published the founding article of a new branch of physics, aptly called "statistical". He showed how to compute the properties of a gas using those of its constituents, the atoms. Inspired by Quételet’s approach, he renounced the Newtonian approach - which dictated calculating the trajectories of each particle - to switch to "statistical" properties, hoping that individual unpredictability would be compensated for at the macroscopic level. This allowed him to build the first rigorous bridge between the micro and macro worlds, by deducing certain properties of the gas, such as viscosity, from the statistical distribution of atomic velocities. Today, physicists complete the circle, drawing inspiration from the well-supplied toolbox of statistical physics to analyze social systems.

II Social physics today

What are we talking about when we deal with the topic of this special issue : "applications of (statistical) physics to social sciences at large"? A cursory bibliometrics search of articles published in Web of Knowledge physics journals using the term "social", "economics" or "econophysics" in either the title or the abstract leads to roughly 9000 records. Their analysis using BiblioTools grauwin reveals seven main research directions (a detailed description is given as Supplementary Information) : complex networks Boccaletti ; Pastor-Satorras , econophysics mantegna ; kwapien , opinion dynamics Baronchelli , evolutionary games perc , community detection newman ; fortunato , collective motion vicsek and human dynamics holme . Overall, the domain is steadily growing, as the number of papers has been multiplied by 10 since year 2000, reaching nowadays 800 articles per year.

Most of these articles deal with simple models. As Castellano et al castellano recognize in their review: "there is a striking imbalance between empirical evidence and theoretical modelization, in favor of the latter. This […] is a rather objective reflection of a disproportion in the literature on social dynamics." The imbalance can be understood easily : simple models are attractive for physicists because they are both elegant and relevant. They capture the essential mechanisms at work in a quantitative way, stripping away unimportant details, as exemplified by the archetypical Ising model for magnetic phase transitions. This simple model extracts with surgical precision the core mechanism of phase transitions, namely the collective, avalanche-like effects provoked by particle interactions, leaving aside all the obscuring "details”. Yet, simple models are relevant for real systems, because physical systems are simplified in the laboratories mit and thanks to the idea of universality : "statistical physics brings an important added value. In most situations, qualitative (and even some quantitative) properties of large-scale phenomena do not depend on the microscopic details of the process. Only higher level features, such as symmetries, dimensionality, or conservation laws, are relevant for the global behavior." castellano

The basic idea behind "applications of (statistical) physics to social sciences" is also summarized very clearly in Castellano et al. castellano review : "In social phenomena, the basic constituents are not particles but humans". Then, the "statistical physics approach to social behavior" means trying to "understand regularities at large scale as collective effects of the interaction among single individuals, considered as relatively simple entities". In the "initial state", "heterogeneity dominates": "left alone, each agent would choose a personal response to a political question, a unique set of cultural features, his own special correspondence between objects and words". When "interactions between social agents" are added to this initial picture, one finds the "stunning global regularities" "denoted in social sciences as consensus, agreement, uniformity". They add that universality gives hope that simple models will be relevant: "With this concept of universality in mind, one can approach the modelization of social systems, trying to include only the simplest and most important properties of single individuals and looking for qualitative features exhibited by models."

In this paper, I will argue that, while simple models are a good tool for physical systems, their usefulness is more limited for social systems. In short, they might be useful to improve our thinking, invalidate intuitive models, but they do not allow us to learn much about real social systems.

III A useful conceptual model

Let’s give an example of how simple social models can be useful to improve our conceptualizations of social processes watts ; seuil . The segregation model proposed by Schelling schelling became one of the most studied models in social physics, as it helps understanding why the collective state reached by agents may be different from what each of them seeks individually.

I present here a simplified version of Schelling’s model that lends itself to an analytical solution pnas . It represents the movement of a population of agents in a "city", which consists of $Q\gg 1$ non overlapping blocks, also called neighborhoods. Each block has the capacity to accommodate $H$ agents. Initially, a number of agents $N=QH\rho_{0}$ are distributed randomly over the blocks, leading to an average density $\rho_{0}$ . All agents share the same utility function $u(\rho)$ that translates their preference for the density of the block where they are located. The collective utility $U$ is defined as the sum of all agents’ utilities, $U=H\sum_{q=1}^{Q}\rho_{q}u(\rho_{q})$ and the average utility $\tilde{u}$ per agent is $\tilde{u}=U/N$ . The dynamics is the following: at each time step, an agent and a free site in another block are selected at random. The agent accepts to move to this new site only if its utility is higher in this new location. Otherwise, it stays in its present block. Then, another agent and another empty site are chosen at random, and the same process is repeated until a stationary state is reached, i.e., until there are no possible moves for any agent.

In pnas , we have computed analytically the stationary states of such a system for any utility function. They confirm previous results obtained by numerical simulations showing that agents ’segregate’ into crowded neighborhoods of low utility. Specifically, for $\rho_{0}=0.4$ , a utility given by $u(\rho)=2\rho$ for $\rho\leq 0.5$ and $u(\rho)=2(1-\rho)$ for $\rho>0.5$ , the stationary density is given by a phase separation between blocks that remain empty and blocks at a density $\rho=1/\sqrt{2}$ , leading to an average utility $\tilde{u}=2(1-\rho)\simeq 0.586$ . This means that agents do not manage to reach the state of maximum average utility ( $\tilde{u}=1$ ) by gathering in blocks at $\rho=1/2$ .

Our analytical calculations show that the surprising ’segregation’ of agents looking for half-filled neighborhoods arises because agents collectively maximize not $U$ but an effective free energy that we have called the link $L$ . This state function allows to generalize free energy to systems driven by individual dynamics. Its key property is that, for any move, $\Delta L=\Delta u$ . It is given by the sum over all blocks $q$ of a potential $l_{q}$ : $L=\sum_{q}l_{q}$ , where $l_{q}=\sum_{n_{q}=0}^{N_{q}}u(n_{q}/H)$ , with $N_{q}=H\rho_{q}$ is the total number of agents in block $q$ . In the large $H$ limit,

[TABLE]

The link may be interpreted as the cumulative of the individual marginal utilities gained by agents as they progressively enter the blocks from a reservoir of zero utility. Since agents move only when their individual $\Delta u$ is positive, the stationary state is given by maximizing $L$ over all possible densities $\{\rho_{q}\}$ of the blocks, from which no further $\Delta u>0$ can be found.

This analytical solution of Schelling’s segregation model is conceptually interesting because it allows a "clear quantitative demonstration […] that Adam Smith’s invisible hand can badly fail at solving simple coordination problems" bouchaud . And this unwanted segregation is robust to changes in model’s ingredients: addition of noise, shape of utility functions …pancs However, we have recently shown that it is fragile with respect to the introduction of a vanishingly small concentration of altruist agents PhysRevLett.120.208301 , a kind of "compositional chaos".

The relevance of Schelling’s model for real systems is less clear, because the reasons behind urban segregation are far more complex than those that any simple model can come up with jasss ; seuil . While the model shows that one cannot logically deduce individual racism from global segregation, it says nothing about the actual urban segregation. And the idea of "universality" put forward by Castellano et al castellano has not proved very fruitful in practice. There are some intriguing regularities in social data, such as Zipf’s power law, but they are not very useful to understand social systems because they are too easy to obtain efkpowerlaw .

IV Finding the essential mechanisms

To avoid the criticism of irrelevance while keeping the conceptual advantages of simplicity, one interesting proposition is to link the models to real data, hoping that these are produced by a single "essential" mechanism. The elegant model of cities proposed by Louf & Barthélémy represents an exemplary case of this strategy PhysRevLett.111.198702 . It explains the increase in the number of urban centers - areas of high employment density - when the number of inhabitants increases by a single "essential" mechanism: congestion. A small city has only one center bringing together most companies and administrations, while larger cities will have many, like the Parisian hubs La Défense, Les Halles and many others. To quantitatively link the population and the number of centers, the model creates a virtual city in which there are several potential employment centers, each offering a different salary. Each inhabitant is a social atom preoccupied by one thing: choosing the employment center that offers the best compromise between (high) salary and (low) transportation cost. Clearly, when the city is small, the traffic is low and all residents can go to the job center that offers the best salary: there is therefore only one active center. But as the population grows, traffic and congestion increase. As a result, centers offering slightly lower salaries become active, because their proximity compensates for lost wages. This model is attractive because it combines three advantages, which are difficult to tie together: a mathematical link between ingredients and consequences, a quantitative fit to data for 9000 U.S. cities and an intuitive understanding of the phenomenon.

However, its relevance for real cities is moot. First, a rigorous mathematical link between assumptions and consequences does not guarantee the interest of the result: the global rigor of a chain of reasonings is that of its weakest link! And the choice of variables or the simplifications that led to the model are more fragile. As the authors acknowledge, the definition of an employment center is rather vague: should a minimum number of jobs be required to declare that such a zone is a center? At which value to set the threshold? Should two neighboring areas be considered as one or two distinct centers? Moreover, bold assumptions are needed to build such a simple model: residents and businesses are identical, choose their residence at random, all firms in each center offer the same salary, there is no public transportation… In short, using mathematics to produce explanations by linking these elements is like trying to supply your home with water by using a very solid pipe to a tank …that is almost empty.

All things considered, the idea of "essential mechanisms" governing social systems is as seductive as it is reckless. It postulates the existence of a hierarchy among the many imaginable causes, which would allow to extract a single one, that dominates all cases. In the city model, it is well-known that other factors lead to the creation of centers : companies want to be close to each other to facilitate the exchange of goods or information; retail stores to attract a larger clientele …We can therefore imagine several simple models, with very different "essential" ingredients, leading to satisfactory empirical predictions, because the data are always noisy and do not allow to discriminate among them.

Social sciences have created tools that may be more adapted to the complexity of social systems, where several causes have to be combined to produce an effect. When causes can simply be added, as physicists’ forces, old tools such as multiple regressions can do the job. But in real systems, the combination is often trickier. For example, a strike may start when either (1) a new technology is introduced and wages stagnate or when (2) the suppression of overtime is combined with outsourcing. Each of the four possibles causes in neither sufficient nor necessary to start a strike : for example, wage stagnation will not cause it if no new technology is introduced, and a strike may start even when wages are increased, through the second causal path. More complex causal tools are needed than "finding the essential mechanism" or even multiple regression ragin . The point is that if one assumes from the start that there is a single mechanism at play, noisy data may confirm this idea, even when it is too simple. Respecting the complexity of our object is essential for good science, and this may well mean giving up our fascination for elegant models.

V More realistic models?

Physicists may draw comfort from the fact that more complex models, that try to include all the relevant variables, fare no better in practice. For example, large teams of economists build complicated models with hundreds of variables to predict economic growth. Alas, these predictions are not much better than those of a much simpler model: economic growth next year will be the same as this year growth . In seuil , I discuss at length why complex social models fail. One important point is the absence of conservation laws, which are a major provider of reliable equations for physical systems. For example, climate science models take advantage of energy and momentum conservation laws to build a kind of framework, which grants them robust predictions even for long times and variable conditions. In chemistry, the nature of atoms is conserved, whatever the complexity of the transformations. There is no similar stability in social systems, hampering any credible prediction beyond the simple "tomorrow will be as today". In other words, social science lacks a dynamical theory, based not on simple regularities but on the reproducibility of change, as in Newton’s second law : $\operatorname{dv}/\operatorname{dt}=F/m$ .

VI We are not social atoms!

Let’s examine this idea of "conservation laws" and analyze in more detail the example of "social atoms", a pervasive analogy in economists’ (and physicists’) models cho . As summarized above by Castellano et al. castellano , physicists’ models start with isolated "simple entities", endowed with stable characteristics, and try to "understand the regularities at large scale" that arise when one adds "interactions" between them.

This approach has turned out to be fruitful to analyze physical systems, because physicists’ atoms can be characterized by stable characteristics. These arise from the strong difference between the typical energy of a chemical reaction and that needed to change the chemical atomic identity, guaranteed by the much more strongly bonded atomic nucleus. The problem is that there seems to be no such energy scale separation for the so-called "social atoms", i.e. humans. Therefore, the whole idea of starting with isolated individuals endowed with stable characteristics and then adding interactions is unfounded. As argued long ago by John Dewey dewey : "Each human is born an infant […] immature, helpless, dependent upon the activities of others. There is no sense in asking how individuals come to be associated. They exist and operate in association". Moreover, human characteristics originate in these interactions : "What [a person] believes, hopes for and aims at is the outcome of association and intercourse". Unlike atoms, we are constantly remade by who and what we meet. There is no stable nucleus that would characterize us deeply, lending stability to our actions. This sheds doubts on the reliability of approaches that try to predict collective outcomes based on utility functions taken to be stable across individuals, situations or in time seuil . Of course, it is always possible to build an abstract model where any macro-structure is conveniently "explained" by some arbitrarily posited stable microstructure (Fig. 1). Three centuries ago, Descartes’ followers ’explained’ the acidity of lemons by the tiny ’needles’ of ’lemon atoms’ …

The failure of simple models does not mean that the relation between individuals and collectives cannot be conceptualized (if not quantified). But the scope is not necessarily to stick with general models, but rather to find conceptualizations which are relevant for social systems. The interesting question becomes how humans "come to be connected in just those ways that give human communities traits so different from those which mark assemblies of electrons, unions of trees in forests, swarms of insects, herds of sheep" dewey . In recent work with sociologists whole , we have discussed an alternative vision of the entire parts/whole perspective, showing that the standard micro/macro approach oversimplifies both the individual and global levels. For example, in Schelling’s model, individuals are defined by their color and their utility function, which do not change during the entire process. The whole is defined by the segregation at the (large) scale of the city, which emerges from the interactions between individuals. However, if we cared to survey real people, we would of course learn that each individual is more complex than her utility function, and also that she has specific visions of both the neighborhood and the segregation. Adolescents attending local high schools will likely have segregation experiences that differ from adults that work abroad or retirees who stay on the neighborhood all day. And the researcher adds her own point of view, which is not politically neutral as we will discuss below.

We can understand these different visions of the whole with a simpler example, that of the vocal ensemble in which I sing, Ginga, that has no leader. In the classical vision, the parts would be the twelve singers seen as small atoms, whose interactions would lead to the whole, the vocal group. In our approach, each singer lends a tiny fraction of his own complexity, which he agrees to standardize, in order to create a temporary collective, a whole much smaller than its parts. The whole is "smaller" because it obviously represents only a small part of each of our lives. And, more important, because the creation of a coherent group filters out the many latent possibilities of each person. These possibilities are manyfold because we all have a different story, in which singing takes a more or less important place. Our musical cultures are also disparate, rather classical for some, eclectic for others, from punk to world music. Finally, our technical skills are very diverse, for harmony, rhythm, interpretation, pronunciation or vocal technique. No wonder everyone has different ideas about what Ginga is, or should be. Should we spend time recording in a studio, or focus on public concerts? Does it really matter if we do not strictly follow the score, since few people in the public will notice it? All these differences must, in the end, lead to a common interpretation of each piece. A concrete trace of the "whole" would be the annotated score (Fig. 1), summarizing all the musical choices made collectively, following the more or less lively discussions that allowed us to master the original piece. Singers are thus coordinated, partly simplified by this standard form, the annotated score, which everyone must respect to sing together. The whole is smaller than the parts. But it enriches them, because none of us would have been able, individually, to produce it.

VII Conclusion: The politics of simple models

Let’s start with a naive question : why do physicists start by stripping individual entities of their attributes, before adding simple interaction rules to obtain the "whole" that was there from the beginning? Clearly, simplifications are needed if we are to model anything. But the point is that different simplifications lead to different explanations, and in the case of social systems, to different politics seuil ; politics . Simple models assume that agents are unable to understand and control collective phenomena. Implicitly, only researchers are able to analyze the situation, determine the factors leading to the results and find the ways to change them. To use an apt image by James Scott, the modeled individuals are, as in Taylor’s factory, "the molecules of an organism whose brain is elsewhere" scott . In other words, modeling takes an external point of view on social systems, assuming that the dynamics of change must come from outside the situation, rather than from the reflections and creativity of the actors jasss ; ostrom . This is their implicit political vision, adapted to the control of a periphery by a center, which needs standardized entities and relatively simple interaction mechanisms to guide its actions. Ironically, a supposedly "bottom-up" approach epstein leads to "top-down" social politics! Maybe it is time to, as Phil Mirowski suggests, mirowski "abjure the physics" for the modeling of social systems. For him, the "various attempts to directly appropriate models from physics, and then bend them to the description of economic variables" have not proved successful, even if they constitute "a major source of continuity in the history of neoclassical economics". The alternative conceptualization sketched above, inspired by sociology, keeps the complexity of individuals and their specific visions of the situation, giving more power to the actors than to the center for analyzing and changing the collective state.

Finally, if we analyze empirically how physics has managed to get a grip on the world pickering , we find that it has always transformed its objects in the laboratory, as tigers are tamed before participating in a circus show. The analogy is interesting mit , as it shows the hard and creative work needed to adjust the theory and the world. It also stresses that there exists, at the same time, some discontinuity between nature and scientific objects (we cannot know much about the untransformed world, as we can’t use a wild tiger in a circus), and some continuity (it’s a real tiger, and it will not accept to do anything). To become relevant, social models will have to ’tame’ humans, for example by using the ”social credit” system currently being developed in China china . It aims to track people and reward ”good social behavior” while punishing bad behavior, using monetary rewards and penalties. This may achieve better predictions, but I’m not sure that I want to contribute to the taming of humans, at least not without their consent.

Bibliography38

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1(1) A Quetelet. Sur l’homme et le dÃ©veloppement de ses facultÃ©s, ou Essai de physique sociale . Bachelier (Paris), 1835.
2(2) A Desrosieres. The Politics of Large Numbers . Harvard Univ Press, 2002.
3(3) I Hacking. The Taming of Chance . Cambridge University Press, 1990.
4(4) Sebastian Grauwin and Pablo Jensen. Mapping scientific institutions. Scientometrics , 89(3):943, Aug 2011.
5(5) S. Boccaletti, V. Latora, Y. Moreno, M. Chavez, and D.-U. Hwang. Complex networks: Structure and dynamics. Physics Reports , 424(4):175 – 308, 2006.
6(6) Romualdo Pastor-Satorras, Claudio Castellano, Piet Van Mieghem, and Alessandro Vespignani. Epidemic processes in complex networks. Rev. Mod. Phys. , 87:925–979, Aug 2015.
7(7) R Mantegna and HE Stanley. Scaling behaviour in the dynamics of an economic index. Nature , 376:46–49, 1995.
8(8) J Kwapien and S Drozdz. Physical approach to complex systems. Physics Reports , 515(3):115 – 226, 2012. Physical approach to complex systems.