Informative and misinformative interactions in a school of fish

Emanuele Crosato; Li Jiang; Valentin Lecheval; Joseph T. Lizier; X. Rosalind Wang; Pierre Tichit; Guy Theraulaz; Mikhail Prokopenko

arXiv:1705.01213·q-bio.QM·May 5, 2017

TL;DR

This study uses information theory to analyze how fish in a school share and misinterpret directional information during collective turns, revealing complex patterns of information flow and misinformation.

Contribution

It introduces a novel application of transfer entropy to quantify local information and misinformation flows in animal groups during coordinated movements.

Findings

01 Peaks in information flow during collective U-turns

02 Identification of informative and misinformative flows based on fish position and orientation

03 Spatial patterns of information and misinformation cascades

Equations

$$t_{y\to x}(n,v)=\log_{2}\frac{p(x_{n}\mid\mathbf{x_{n-1}},\mathbf{y_{n-v}})}{p(x_{n}\mid\mathbf{x_{n-1}})}$$

$$x_{n}=\Theta_{n}^{D}-\Theta_{n-1}^{D}$$

$$y_{n}=\Theta_{n}^{D}-\Theta_{n}^{S}$$

Emanuele Crosato

Complex Systems Research Group and Centre for Complex Systems, Faculty of Engineering & IT, The University of Sydney, Sydney, NSW 2006, Australia.

CSIRO Data61, PO Box 76, Epping, NSW 1710, Australia.

Li Jiang

School of Systems Science, Beijing Normal University, Beijing, 100875, P. R. China.

Centre de Recherches sur la Cognition Animale, Centre de Biologie Intégrative (CBI), Centre National de la Recherche Scientifique (CNRS), Université Paul Sabatier (UPS), F-31062 Toulouse Cedex 9, France.

Valentin Lecheval

Centre de Recherches sur la Cognition Animale, Centre de Biologie Intégrative (CBI), Centre National de la Recherche Scientifique (CNRS), Université Paul Sabatier (UPS), F-31062 Toulouse Cedex 9, France.

Groningen Institute for Evolutionary Life Sciences, University of Groningen, Centre for Life Sciences, Nijenborgh 7, 9747AG Groningen, The Netherlands.

Joseph T. Lizier

Complex Systems Research Group and Centre for Complex Systems, Faculty of Engineering & IT, The University of Sydney, Sydney, NSW 2006, Australia.

X. Rosalind Wang

CSIRO Data61, PO Box 76, Epping, NSW 1710, Australia.

Pierre Tichit

Centre de Recherches sur la Cognition Animale, Centre de Biologie Intégrative (CBI), Centre National de la Recherche Scientifique (CNRS), Université Paul Sabatier (UPS), F-31062 Toulouse Cedex 9, France.

Guy Theraulaz

Centre de Recherches sur la Cognition Animale, Centre de Biologie Intégrative (CBI), Centre National de la Recherche Scientifique (CNRS), Université Paul Sabatier (UPS), F-31062 Toulouse Cedex 9, France.

Mikhail Prokopenko

Complex Systems Research Group and Centre for Complex Systems, Faculty of Engineering & IT, The University of Sydney, Sydney, NSW 2006, Australia.

Abstract

It is generally accepted that, when moving in groups, animals process information to coordinate their motion. Recent studies have begun to apply rigorous methods based on Information Theory to quantify such distributed computation. Following this perspective, we use transfer entropy to quantify dynamic information flows locally in space and time across a school of fish during directional changes around a circular tank, i.e. U-turns. This analysis reveals peaks in information flows during collective U-turns and identifies two different flows: an informative flow (positive transfer entropy) based on fish that have already turned about fish that are turning, and a misinformative flow (negative transfer entropy) based on fish that have not turned yet about fish that are turning. We also reveal that the information flows are related to relative position and alignment between fish, and identify spatial patterns of information and misinformation cascades. This study offers several methodological contributions and we expect further application of these methodologies to reveal intricacies of self-organisation in other animal groups and active matter in general.

Introduction

Collective motion is one of the most striking examples of aggregated coherent behaviour in animal groups, dynamically self-organising out of local interactions between individuals. It is observed in different animal species, such as schools of fish [59, 73], flocks of birds [58, 54, 6, 11], colonies of insects [14, 29, 16, 4, 15] and herds of ungulates [32]. There is an emerging understanding that information plays a dynamic role in such coordination [73], and that distributed information processing is a specific mechanism that endows the group with collective computational capabilities [13, 23, 1].

Information transfer is of particular relevance for collective behaviour, where it has been observed that small perturbations cascade through an entire group in a wave-like manner [62, 63, 34, 3], with these cascades conjectured to embody information transfer [73]. This phenomenon is related to underlying causal interactions, and a common goal is to infer physical interaction rules directly from experimental data [36, 30, 35] and measure correlations within a collective.

Nagy et al. [55] used a variety of correlation functions to measure directional dependencies between the velocities of pairs of pigeons flying in flocks of up to ten individuals, reconstructing the leadership network of the flock. As has been shown later, this network does not correspond to the hierarchy between birds [56]. Information transfer has been extensively studied in flocks of starlings, by observing the propagation of direction changes across the flocks [20, 19, 2]. More recently, Rosenthal et al. [69] attempted to determine a communication structure of a school of fish during its collective evasion manoeuvres manifested through cascades of behavioural change. A functional mapping between sensory inputs and motor responses was inferred by tracking fish position and body posture, and calculating visual fields.

Rather than consider semantic or pragmatic information, many contemporary studies employ rigorous information theoretic measures that quantify information as uncertainty reduction, following Shannon [24], in order to deal with the stochastic, continuous and noisy nature of intrinsic information processing in natural systems [28]. Distributed information processing is typically dissected into three primitive functions: the transmission, storage and modification of information [38]. Information dynamics is a recent framework characterising and measuring each of the primitives information-theoretically [49, 41]. In viewing the state update dynamics of a random process as an information processing event, this framework performs an information regression in accounting for where the information to predict that state update can be found by an observer, first identifying predictive information from the past of the process as information storage, then predictive information from other sources as information transfer (including both pairwise transfer from single sources, and higher-order transfers due to multivariate effects). The framework has been applied to modelling collective behaviour in several complex systems, such as Cellular Automata [46, 47, 48], Ising spin models [9], Genetic Regulatory Networks and other biological networks [45, 64, 26], and neural information processing [33, 78].

This study proposes a domain-independent, information-theoretic approach to detecting and quantifying individual-level dynamics of information transfer in animal groups using this framework. This approach is based on transfer entropy [70], an information-theoretic measure that quantifies the directed and time-asymmetric predictive effect of one random process on another. We aim to characterize the dynamics of how information transfer is conducted in space and time within a biological school of fish (Hemigrammus rhodostomus or rummy-nose tetras, Figure 1a).

We stress that the predictive information transfer should be considered from the observer perspective, that is, it is the observer that gains (or loses) predictability about a fish motion, having observed another fish. In other words, notwithstanding possible influences among the fish that could potentially be reflected in their information dynamics, our quantitative analysis focuses on the information flow within the school which is detectable by an external observer, captured by the transfer entropy. This means that, whenever we quantify a predictive information flow from a source fish to a destination fish, we attribute the change of predictability (uncertainty) to a third party, be it another fish in the school, a predator approaching the school or an independent experimentalist. Accordingly, this predictive information flow may or may not account for the causal information flow affecting the source and the destination [5, 40] — however it does typically indicate presence of causality, either within the considered pair or from some common cause.

We focus on collective direction changes, i.e. collective U-turns, during which the directional changes of individuals progress in a rapid cascade, at the end of which a coherent motion is re-established within the school. Sets of different U-turns are comparable across experiments under the same conditions, permitting a statistically significant analysis involving an entire set of U-turns.

By looking at the pointwise or local values of transfer entropy over time, rather than at its average values, we are not only able to detect information transfer, but also to observe its dynamics over time and across the school. We demonstrate that information is indeed constantly flowing within the school, and identify the source-destination lag where predictive information flow is maximised (which has an interpretation as an observer-detectable reaction time to other fish). The information flow is observed to peak during collective directional changes, where there is a typical “cascade” of predictive gains and losses to be made by observers of these pairwise information interactions. Specifically, we identify two distinct predictive information flows: (i) an “informative” flow, characterised by positive local values of transfer entropy, based on fish that have already changed direction about fish that are turning, and (ii) a “misinformative” flow, characterised by negative local values of transfer entropy, based on fish that have not changed direction yet about the fish that are turning. Finally, we identify spatial patterns coupled with the temporal transfer entropy, which we call spatio-informational motifs. These motifs reveal spatial dependencies between the source of information and its destination, which shape the directed pairwise interactions underlying the informative and misinformative flows. The strong distinction revealed by our quantitative analysis between informative and misinformative flows is expected to have an impact on modelling and understanding the dynamics of collective animal motion.

Information-theoretic measures for collective motion

The study of Wang et al. [77] introduced the use of transfer entropy to investigations of collective motion. This work quantitatively verified the hypothesis that information cascades within an (artificial) swarm can be spatiotemporally revealed by conditional transfer entropy [46, 47] and thus correspond to communications, while the collective memory can be captured by active information storage [48].

Richardson et al. [67] applied related variants of conditional mutual information, a measure of non-linear dependence between two random variables, to identify dynamical coupling between the trajectories of foraging meerkats. Transfer entropy has also been used to study the response of schools of zebrafish to a robotic replica of the animal [17, 37], and to infer leadership in pairs of bats [57] and simulated zebrafish [18]. Lord et al. [51] also posed the question of identifying individual animals which are directly interacting with other individuals, in a swarm of insects (Chironomus riparius). Their approach used conditional mutual information (called “causation entropy” although it does not directly measure causality [40]), inferring “information flows” within the swarm over moving windows of time.

Unlike the study of Wang et al. [77], the above studies quantified average dependencies over time rather than local dependencies at specific time points; for example, leadership relationships in general rather than their (local) dynamics over time. Local versions of transfer entropy and active information storage have been used to measure pairwise correlations in a “swarm” of soldier crabs, finding that decision-making is affected by the group size [74]. Statistical significance was not reported, presumably due to a small sample size. Similar techniques were used to construct interaction networks within teams of simulated RoboCup agents [22].

In this study we focus on local (or pointwise) transfer entropy [70, 46, 43] for specific samples of time-series processes of fish motion, which allows us to reconstruct the dynamics of information flows over time. Local transfer entropy [46] captures information flow from the realisation of a source variable $Y$ to a destination variable $X$ at time $n$. As described in Methods, local transfer entropy is defined as the information provided by the source $\mathbf{y_{n-v}}=\{y_{n-v},y_{n-v-1},\ldots,y_{n-v-l+1}\}$, where $v$ is a time delay and $l$ is the history length, about the destination $x_{n}$ in the context of the past of the destination $\mathbf{x_{n-1}}=\{x_{n-1},x_{n-2},\ldots,x_{n-k}\}$, with a history length $k$:

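$$t_{y\to x}(n,v)=\log_{2}\frac{p(x_{n}\mid\mathbf{x_{n-1}},\mathbf{y_{n-v}})}{p(x_{n}\mid\mathbf{x_{n-1}})}.$$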

Importantly, local values of transfer entropy can be negative, while the average transfer entropy is non-negative. Negative values of the local transfer entropies indicate that the source is misinformative about the next state of the destination (i.e. it increases uncertainty). Previous studies that used average measures over sliding time windows in order to investigate how information transfer varies over time [67, 51] cannot detect misinformation because they measure average but not local values.
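The contrast between local and average values can be illustrated with a minimal sketch. It assumes discrete (binary) toy data, history lengths $k=l=1$ and a simple frequency-count (plug-in) estimator; none of these choices is taken from the paper, whose own estimator is described in its Methods. The function name `local_transfer_entropy` and the toy series are hypothetical.

```python
import math
import random
from collections import Counter

def local_transfer_entropy(x, y, v=1):
    """Local TE t_{y->x}(n, v) in bits for discrete series, with k = l = 1.

    Plug-in (frequency-count) estimate; returns one value per usable n.
    """
    start = max(1, v)
    triples = [(x[n], x[n - 1], y[n - v]) for n in range(start, len(x))]
    c_xpy = Counter(triples)                             # counts of (x_n, x_{n-1}, y_{n-v})
    c_py = Counter((xp, ys) for _, xp, ys in triples)    # counts of (x_{n-1}, y_{n-v})
    c_xp = Counter((xn, xp) for xn, xp, _ in triples)    # counts of (x_n, x_{n-1})
    c_p = Counter(xp for _, xp, _ in triples)            # counts of x_{n-1}
    local = []
    for xn, xp, ys in triples:
        p_with_source = c_xpy[(xn, xp, ys)] / c_py[(xp, ys)]
        p_without_source = c_xp[(xn, xp)] / c_p[xp]
        local.append(math.log2(p_with_source / p_without_source))
    return local

# Toy data (an assumption): x copies y from one step earlier, with 10% random flips.
rng = random.Random(0)
y = [rng.randint(0, 1) for _ in range(5000)]
x = [0] + [y[n - 1] if rng.random() > 0.1 else 1 - y[n - 1] for n in range(1, 5000)]

te_local = local_transfer_entropy(x, y, v=1)
te_average = sum(te_local) / len(te_local)
print(f"average TE: {te_average:.3f} bits, most negative local value: {min(te_local):.3f} bits")
```

In this toy setting the samples where `x` fails to follow `y` yield negative local values (the source was misinformative at those points), while the average over all samples stays non-negative.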

As an observational measure, transfer entropy does not measure causal effect of the source on the target; this can only be established using interventional measures [5, 40, 21, 71]. Rather, transfer entropy measures the predictive information gained from a source variable about the state transition in a target, which may be viewed as information transfer when measured on an underlying causal interaction [40]. It should be noted that while some researchers may be initially more interested in causality, the concept of information transfer reveals much about the dynamics that causal effect does not [40], in particular being associated with emergent local structure in dynamics in complex systems [46, 77] and with changes in behaviour, state or regime [12, 9], as well as revealing the misinformative interactions described above. As a particular example, local transfer entropy spatiotemporally highlights emergent glider entities in cellular automata [46], which are analogues of cascading turning waves in swarms (also highlighted by transfer entropy [77]), while local measures of causality do not differentiate these from the background dynamics [40].

It is well known that the internal dynamics within a school of fish depend on the number of fish. For example, for schools of minnows (Phoxinus phoxinus), two-fish schools are qualitatively different from schools containing three or more — however, the effects seem to level off by the time the school reaches a size of six individuals [60]. Collective behaviour, as well as a stereotypical “phase transition”, when an increase in density leads to the onset of directional collective motion, have also been detected in small groups of six glass prawns (Paratya australiensis) [52]. Furthermore, at such intermediate group sizes, it has been observed that multiple fish interactions could often be faithfully factorised into pair interactions in one particular species of fish [30].

In our study we investigated information transfer within a school of fish during specific collective direction changes, i.e., U-turns, in which the school collectively reverses its direction. Groups of five fish were placed in a ring-shaped tank (Figure 1b), a design conceived to constrain fish to swim circularly, with the possibility of undergoing U-turns spontaneously, without any obstacles or external factors. A total of 455 U-turns were observed during 10 trials of one hour each. We computed local transfer entropy between each (directed) pair of fish from time series obtained from fish heading. Specifically, the destination process $X$ was defined as the directional change of the destination fish, while the source process $Y$ was defined as the relative heading of the destination fish with respect to the source fish (see Methods). This allowed us to capture the influence of the source-destination fish alignment on the directional changes of the destination. Such influence is usually delayed in time and we estimated the optimal delay (maximizing $\langle t_{y\to x}(n,v)\rangle_{n}$ [79], see Methods) at $v=6$, corresponding to $0.12$ seconds.
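As a minimal sketch of how the two processes could be built from heading time series, the snippet below computes $x_{n}=\Theta_{n}^{D}-\Theta_{n-1}^{D}$ (directional change of the destination) and $y_{n}=\Theta_{n}^{D}-\Theta_{n}^{S}$ (heading of the destination relative to the source). Wrapping angle differences to $[-\pi,\pi)$ is an assumption made here, as are the helper names and the toy headings. The final line only restates the arithmetic implied by the text: a delay of $v=6$ steps spanning $0.12$ seconds gives $0.02$ seconds per step.

```python
import math

def wrap_angle(a):
    """Wrap an angle in radians to the interval [-pi, pi)."""
    return (a + math.pi) % (2 * math.pi) - math.pi

def destination_series(theta_dest):
    """x_n = theta^D_n - theta^D_{n-1}: directional change of the destination fish."""
    return [wrap_angle(theta_dest[n] - theta_dest[n - 1])
            for n in range(1, len(theta_dest))]

def source_series(theta_dest, theta_src):
    """y_n = theta^D_n - theta^S_n: destination heading relative to the source fish."""
    return [wrap_angle(d - s) for d, s in zip(theta_dest, theta_src)]

# Toy headings in radians (hypothetical values, not experimental data).
theta_d = [0.0, 0.1, 0.3, 3.1, -3.1]
theta_s = [0.0, 0.0, 0.0, 0.0, 0.0]

x = destination_series(theta_d)   # one element shorter than the heading series
y = source_series(theta_d, theta_s)

dt = 0.12 / 6                     # seconds per time step implied by v = 6 <-> 0.12 s
```

Without wrapping, the last step in the toy data (from 3.1 to -3.1 radians) would register as a change of about -6.2 radians rather than the small turn of roughly 0.08 radians that it represents.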

Results

Information flows during U-turns

In order to represent the school’s orientation around the tank, we define its polarisation so that it is positive when the school is swimming clockwise and negative when it is swimming anti-clockwise (see Methods). The better the school’s average heading is aligned with an ideal circular trajectory around the tank, the higher is the intensity of the polarisation. When the school is facing one of the tank’s walls, for example in the middle of a U-turn, the polarisation is zero, and the polarisation flips sign during U-turns. Polarisation allows us to map local values of transfer entropy onto the progression of the collective U-turns.
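The qualitative properties listed above can be reproduced by a simple signed-alignment measure. The sketch below is one plausible formalisation, not the definition given in the paper's Methods: it assumes a tank centred at the origin, headings measured anti-clockwise from the x-axis, and a school polarisation taken as the mean over fish of the cosine between each heading and the local clockwise tangent. All function names are hypothetical.

```python
import math

def fish_polarisation(px, py, heading):
    """Signed alignment of one fish with the clockwise tangent at its position.

    +1: swimming clockwise along the ring, -1: anti-clockwise, 0: facing a wall.
    """
    phi = math.atan2(py, px)                 # angular position around the tank centre
    clockwise_tangent = phi - math.pi / 2    # direction of clockwise travel at that position
    return math.cos(heading - clockwise_tangent)

def school_polarisation(fish):
    """Mean signed alignment over (px, py, heading) tuples (an assumed aggregation)."""
    return sum(fish_polarisation(px, py, h) for px, py, h in fish) / len(fish)

clockwise = [(1.0, 0.0, -math.pi / 2), (0.0, 1.0, 0.0)]
mid_turn = [(1.0, 0.0, 0.0), (0.0, 1.0, math.pi / 2)]    # both fish face the outer wall

print(school_polarisation(clockwise))   # close to +1
print(school_polarisation(mid_turn))    # close to 0
```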

The analyses of transfer entropy over time reveal that the measure clearly diverges from its baseline in the vicinity of U-turns, as shown in the representative U-turn in Figure 1c (Supplementary Figure S1 shows a longer time interval during which several U-turns can be observed). The figure shows that during regular circular motion, when the school’s polarisation is highly pronounced, transfer entropy is low. As the polarisation approaches zero the intensity of transfer entropy grows, peaking near the middle of a U-turn, when polarisation switches its sign.

We clarify that the aim here is not to establish transfer entropy as an alternative to polarisation for detecting turns; rather, our aim is to use polarisation to describe the overall progression of the collective U-turns and then to use transfer entropy to investigate the underlying information flows in the dynamics of such turns. Indeed, transfer entropy is found to be statistically different from zero at many points outside of the U-turns (see Supplementary Figure S1), although the largest values and most concentrated regions of these are during the U-turns. This indicates that information transfer occurs even when fish school together without changing direction; we know that the fish are not executing precisely uniform motion during these in-between periods, and so interpret these small amounts of information transfer as sufficiently underpinning the dynamics of the group maintaining its collective heading.

We also see in Figure 1c that both positive and negative values of transfer entropy are detected. In order to understand the role of the positive and negative information flows during collective motion, in the next section we show the dynamics of transfer entropy for individual pairwise interactions.

Informative and misinformative flows

Our analysis revealed a clear relationship between positive and negative values of transfer entropy and the sequence of individual fish turning, which is illustrated in Figure 2. Figure 2a shows the trajectories of individual fish during the same U-turn depicted in Figure 1. These trajectories are retraced in Figure 2d in terms of polarisation of each fish. It is quite clear that there is a well-defined sequence of individual U-turns during the collective U-turn. Moreover, Figure 2 shows how the transfer entropy maps onto the fish trajectories, both from the fish whose trajectory is traced as a source to the other four fish — i.e. outgoing transfer entropy — and, vice versa, from the other four fish to the traced one as a destination — i.e. incoming transfer entropy.

The incoming transfer entropy clearly peaks during the destination fish’s individual turns and its local values averaged over all sources go from negative, for the first (destination) fish that turns, to positive for the last fish turning (Figures 2b and 2e). In the opposite direction, the outgoing transfer entropy (averaged over all destinations) displays negative peaks only before the source fish has turned, and positive peaks only afterwards (Figures 2c and 2f). Figure 2 suggests that predictive information transfer intensifies only when a destination fish is turning, with this transfer being informative based on source fish that have already turned and misinformative based on source fish that have not turned yet.

This phenomenon can be observed very clearly in Figures 3a and 3b, which show the transfer entropy in both directions for a single fish (the second fish turning in Figures 1 and 2). One positive peak of incoming transfer entropy (indicating informative flow) and three negative ones (misinformative flows) are detected when this fish, as a destination, is undergoing the U-turn (Figure 3a). No other peaks are detected for this fish as a destination. On the other hand, one negative peak of outgoing transfer entropy is detected before the fish, this time as a source, has turned, and three positive peaks are detected after the fish has turned (Figure 3b). These four peaks occur respectively when the first, the third, the fourth and the fifth fish undergo the U-turn, as is evident by comparing Figures 3b and 2d. A movie of the fish undergoing this specific U-turn is provided in Supplementary Video S1, while a detailed reconstruction of the U-turn, showing the dynamics of transfer entropy over time for each directed pair of fish, is provided in Supplementary Video S2.

In order to demonstrate that the phenomenon described here holds for U-turns in general, and not only for the representative one shown in Figure 2, we performed an aggregated analysis of all 455 U-turns observed during the experiment. Since the order in which fish turn is not the same in every U-turn, in this analysis, we refer not to single fish as individuals, but rather to fish in the order in which they turn. Thus, when we refer, for instance, to “the first fish that turns”, we may be pointing to a different fish at each U-turn.
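Relabelling fish by turning order can be sketched as follows. The criterion used here for when a fish has turned (the first sample at which its individual polarisation changes sign) is an assumption for illustration, and the helper names and toy values are hypothetical.

```python
def turn_index(polarisation):
    """Index of the first sample whose sign differs from the initial sign, or None."""
    initial_positive = polarisation[0] > 0
    for n, p in enumerate(polarisation):
        if (p > 0) != initial_positive:
            return n
    return None

def turning_order(polarisations):
    """Fish identifiers sorted by when they turn; fish that never turn are omitted."""
    times = {fish: turn_index(series) for fish, series in polarisations.items()}
    turned = [fish for fish, t in times.items() if t is not None]
    return sorted(turned, key=lambda fish: times[fish])

# Toy individual polarisation series for one U-turn (hypothetical values).
example = {
    "A": [0.9, 0.8, -0.2, -0.9],
    "B": [0.9, -0.1, -0.8, -0.9],
    "C": [0.9, 0.9, 0.9, 0.9],
}
print(turning_order(example))
```

With such a ranking, per-fish transfer entropy series from different U-turns can be grouped by rank ("first fish that turns", "second fish that turns", and so on) before averaging.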

The aggregated results are presented in Figures 3c and 3d. Figure 3c shows that incoming transfer entropy peaks for each fish in turning order and gradually grows, from a minimum negative peak corresponding to the first fish turning, to a maximum positive peak corresponding to the last fish turning. Vice versa, Figure 3d shows that outgoing transfer entropy peaks only positively for the first fish turning, which is an informative source about all other fish turning afterwards. For the last fish that turns the peak is negative, since this fish is misinformative about all other fish that have already turned. The second, third and fourth fish present both a negative and a positive peak. The intensity of the negative peaks increases from the second fish to the fourth, while the intensity of the positive peak decreases.

In general, the source fish is informative about all destination fish turning after it and misinformative about any destination fish turning before it. This is because the prior turn of a source helps the observer to predict the later turn of the destination, whereas examining a source which has not turned yet itself is actively unhelpful (misinformative) in predicting the occurrence of such a turn. This also explains why, for a source, the negative peaks come before positives.

The sequential cascade-like dynamics of information flow suggests that the strongest sources of predictive information transfer are fish that have already turned. Moreover, our analyses reveal that once a fish has performed a U-turn, its behaviour in general ceases to be predictable based on the behaviour of other fish that swim in the opposite direction (in fact such fish would provide misinformative predictions). This suggests an asymmetry of predictive information flows based on and about an individual fish during U-turns.

Spatial motifs of information transfer

It is reasonable to assume that predictive information transfer in a school of fish results from spatial interactions among individuals. We investigated the role of pairwise spatial interactions in carrying the positive and negative information flows that we detected in the previous section, looking for spatial patterns of information and misinformation transfer.

In particular we established the statistics of the relative position and heading of the destination fish relative to the source fish, at times when the transfer entropy from the source to the destination is more intense. For this purpose we used radial diagrams (see Figure 4) representing the relative data in terms of transfer entropy, focusing separately on their positive (informative) and negative (misinformative) values. In each diagram we aggregate data from all 455 U-turns and all pairs. The diagrams show clear spatial patterns coupled with the transfer entropy, which we call spatio-informational motifs.
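The relative quantities that feed such radial diagrams can be sketched as below, assuming planar positions, headings in radians measured anti-clockwise from the x-axis, and angles wrapped to $[-\pi,\pi)$. The function `relative_geometry` is a hypothetical helper, not part of the paper's pipeline; it expresses the destination's position and heading in the frame of the source fish.

```python
import math

def wrap_angle(a):
    """Wrap an angle in radians to the interval [-pi, pi)."""
    return (a + math.pi) % (2 * math.pi) - math.pi

def relative_geometry(src_pos, src_heading, dst_pos, dst_heading):
    """Distance, bearing and relative heading of the destination, seen from the source.

    A bearing of 0 means the destination is directly ahead of the source;
    a bearing of magnitude pi means it is directly behind.
    """
    dx = dst_pos[0] - src_pos[0]
    dy = dst_pos[1] - src_pos[1]
    distance = math.hypot(dx, dy)
    bearing = wrap_angle(math.atan2(dy, dx) - src_heading)
    relative_heading = wrap_angle(dst_heading - src_heading)
    return distance, bearing, relative_heading

# Source at the origin facing +x; destination one unit behind it, heading perpendicular.
distance, bearing, relative_heading = relative_geometry(
    (0.0, 0.0), 0.0, (-1.0, 0.0), math.pi / 2
)
```

Binning the local transfer entropy values by `bearing` and by `relative_heading`, separately for positive and negative values, would give the two kinds of radial diagram described in the text.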

We see that positive information transfer is on average more intense from source fish to (a) other fish positioned behind them (Figure 4a, left), and (b) fish with headings closer to perpendicular rather than parallel to them (Figure 4a, right). We know from Figures 2 and 3 that positive transfer entropy is detected from source fish that have already turned to destination fish that are turning. Thus, Figure 4a suggests that a source is more informative about destination fish that are left behind it after a turn, most intensely when the destination fish are executing their own turning manoeuvre to follow the source. Directional relationships from individuals in front towards others that follow were observed in previous works on birds [55], bats [57] and fish [36, 35, 69].

For negative information transfer (Figure 4b) we see a different spatio-informational motif. Negative information transfer is on average more intense to fish generally positioned at the side and with opposite heading. This aligns with Figures 2 and 3 in that negative transfer entropy typically flows from fish that have not turned yet to those which are turning.

In summary, transfer entropy has a clear spatial signature, showing that the spatiotemporal dependencies in the studied school of fish are not random but reflect specific interactions.

Discussion

Information transfer within animal groups during collective motion is hard to quantify because of implicit and distributed communication channels with delayed and long-ranged effects, selective attention [68] and other species-specific cognitive processes. Here we presented a rigorous framework for detecting and measuring predictive information flows during collective motion, by attending to the dynamic statistical dependence of directional changes in destination fish on relative heading of sources. This predictive information flow should be interpreted as a change (gain or loss) in predictability obtained by an observer.

We studied Hemigrammus rhodostomus fish placed in a ring-shaped tank which effectively only allowed the fish to move straight ahead or turn back to perform a U-turn. The individual trajectories of the fish were recorded for hundreds of collective U-turns, enabling us to perform a statistically significant information-theoretical analysis for multiple pairs of source and destination fish.

Transfer entropy was used in detecting pairwise time-delayed dependencies within the school. By observing the local dynamics of this measure, we demonstrated that predictive information flows intensify during collective direction changes — i.e. the U-turns — a hypothesis that until now was not verified in a real biological system. Furthermore, we identified two distinct predictive information flows within the school: an informative flow based on fish that have already performed the U-turn about fish that are turning, and a misinformative flow based on fish that have not performed the U-turn yet about the fish that are turning.

We also explored the role of spatial dynamics in generating the influential interactions that carry the information flows, another well-known problem. In doing so, we mapped the detected values of transfer entropy against fish relative position and heading, identifying clear spatio-informational motifs. Importantly, the positive and negative predictive information flows were shown to be associated with specific spatial signatures of source and destination fish. For example, positive information flow is detected when the source fish is in front of the destination, similarly to what was already observed in previous works on animals [55, 36, 35, 69, 57].

Local transfer entropy as it was applied in this study reveals the dynamics of pairwise information transfer. It is well-known that multivariate extensions to the transfer entropy, e.g. conditioning on other information sources, can be useful in terms of eliminating redundant pairwise relationships whilst also capturing higher-order relationships beyond pairwise (i.e. synergies) [46, 47, 40, 75, 81], and as such the identification of effective neighbourhoods cannot be accurately inferred using pairwise relationships alone. Improvements are possible by adapting algorithms for deciding when to include higher-order multivariate transfer entropies (and which variables to condition on), developed to study effective networks in brain imaging data sets [50, 25, 53, 72], to collective animal behaviour, as such methods can eliminate redundant connections and detect synergistic effects. Whether or not such algorithms will prove useful for swarm dynamics is an open research question, with conflicting findings that first suggest that multiple fish interactions could be faithfully factorised into simply pair interactions in one species [30] but conversely that this may not necessarily generalise [36].

In any case, such adaptations to capture multivariate effects will be non-trivial, as they must handle the short-term and dynamic structure of interactions across the collective. Early attempts have been made using (a similar measure to) conditional TE, averaged over time windows, in collectives under such algorithms [51]; however, it remains to be seen what such measures reveal about the collective dynamics on a local scale.

In summary, we have proposed a novel information-theoretic framework for studying the dynamics of information transfer in collective motion and applied it to a school of fish, without making any specific assumptions on fish behavioural traits and/or rules of interaction. This framework can be easily applied to studies of other biological collective phenomena, such as swarming and flocking, artificial multi-agent systems and active matter in general.

Methods

Ethics statement

All experiments have been approved by the Ethics Committee for Animal Experimentation of the Toulouse Research Federation in Biology N1 and comply with the European legislation for animal welfare.

Experimental procedures

70 Hemigrammus rhodostomus (rummy-nose tetras) were purchased from Amazonie Labège (http://www.amazonie.com) in Toulouse, France. Fish were kept in 150 L aquariums on a 12:12 hour, dark:light photoperiod, at 27.7°C ( $\pm 0.5$ °C) and were fed ad libitum with fish flakes. Body lengths of the fish used in these experiments were on average 31 mm ( $\pm$ 2.5 mm).

The experimental tank measured $120\times 120$ cm, was made of glass and set on top of a box to isolate fish from vibrations. The setup, placed in a chamber made by four opaque white curtains, was surrounded by four LED light panels giving an isotropic lighting. A ring-shaped tank made from two tanks (an outer wall of radius 35 cm and an inner wall, a cone of radius 25 cm at the bottom, both shaping a corridor of 10 cm) was set inside the experimental tank filled with 7 cm of water of controlled quality (50% of water purified by reverse osmosis and 50% of water treated by activated carbon) heated at 28.1°C ( $\pm 0.7$ °C). The conic shape of the inner wall has been chosen to avoid the occlusion on videos of fish swimming too close to the inner wall that would occur with straight walls.

Five fish were randomly sampled from their breeding tank for each trial. Each fish was used in at most one experiment per day. Fish were left to habituate for 10 minutes before the start of the trial. A trial consisted of one hour of fish swimming freely (i.e. without any external perturbation).

Data extraction and pre-processing

Fish trajectories were recorded by a Sony HandyCam HD camera filming from above the setup at 50Hz (50 frames per second) in HDTV resolution (1920 $\times$ 1080p). Videos were converted from MTS to AVI files with the command-line tool FFmpeg 2.4.3. Positions of fish on each frame were tracked with the tracking software idTracker 2.1 [61].

When possible, missing positions of fish were manually corrected, but only during the collective U-turn events, which were detected from sign changes in the polarisation of the fish group. The corrections involved manual tracking of fish misidentified by idTracker, as well as interpolation or merging of positions in cases where only one fish was detected instead of several because they had been swimming too close to each other for a long time. All sequences of 50 or fewer consecutive missing positions were interpolated. Longer sequences of missing values were inspected by eye to decide whether interpolation was reasonable; if not, merging positions with those of the closest neighbours was considered.

Time series of positions have been converted from pixels to meters and the origin of the coordinate system $\mathcal{O}(0,0)$ has been set to the centre of the ring-shaped tank. The resulting data set contains $9273720$ data points ( $1854744$ for each fish) including all the ten trials. Velocity was numerically derived from position using the symmetric difference quotient two-point estimation [39]. Heading was then computed as the four-quadrant inverse tangent of velocity and used to compute transfer entropy.
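As a rough illustration of this step, the sketch below derives velocity with the symmetric difference quotient and takes the heading as the four-quadrant inverse tangent of the velocity. The function name `headings_from_positions`, the 50 Hz default and the choice to drop the two end points are assumptions made for this example and are not details of the original processing pipeline.

```python
import math

def headings_from_positions(xs, ys, fps=50.0):
    """Return (vx, vy, heading) for the interior samples of a trajectory.

    Velocity uses the symmetric difference quotient
    v[n] = (p[n+1] - p[n-1]) / (2 * dt). The first and last samples have
    no two-sided neighbour and are dropped (an assumption of this sketch).
    """
    dt = 1.0 / fps
    out = []
    for n in range(1, len(xs) - 1):
        vx = (xs[n + 1] - xs[n - 1]) / (2.0 * dt)
        vy = (ys[n + 1] - ys[n - 1]) / (2.0 * dt)
        # Four-quadrant inverse tangent gives the heading in (-pi, pi].
        out.append((vx, vy, math.atan2(vy, vx)))
    return out

# Toy trajectory: a fish moving along +y at 0.1 m/s, sampled at 50 Hz.
xs = [0.30] * 5
ys = [0.002 * n for n in range(5)]
track = headings_from_positions(xs, ys)
```

For the toy trajectory, each interior sample has a speed of about 0.1 m/s and a heading of $\pi/2$ radians.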

Polarisation

The polarisation is used to represent the orientation of a fish or of the whole school around the tank, which can be clockwise or anti-clockwise. Let $Z$ and $\dot{Z}$ be the two-dimensional position and normalised velocity of a fish, defined as Cartesian vectors with the centre of the tank being the origin (for the whole school, $Z$ and $\dot{Z}$ are averaged over all fish). The fish direction along an ideal circular clockwise rotation is described by a unit vector $z=\frac{\omega\times Z}{{|\omega\times Z|}}$ , where $\omega$ is a vector orthogonal to the plane of the rotation, chosen using the left-hand rule.

The polarisation is defined as $\dot{Z}\cdot z$ , so that it is positive when the fish is swimming clockwise and negative when it is swimming anti-clockwise. Also, the better $\dot{Z}$ is aligned with $z$ or $-z$ , the higher is the intensity of the polarisation. On the contrary, as $\dot{Z}$ deviates from $z$ or $-z$ , the polarisation decreases and eventually reaches zero when $\dot{Z}$ and $z$ are orthogonal. As a consequence, during a U-turn the intensity of the polarisation decreases and becomes zero at least once, before it increases again with the opposite sign.
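The definition can be sketched for a single fish as follows. The function name `polarisation` is hypothetical, and the sketch assumes a right-handed $x$-$y$ frame viewed from above, in which clockwise rotation corresponds to $\omega=(0,0,-1)$ and therefore $\omega\times Z=(y,-x)$. In image coordinates with the $y$ axis pointing downwards the sign would flip.

```python
import math

def polarisation(pos, vel):
    """Polarisation of one fish: dot product of its unit velocity with the
    unit tangent z of an ideal clockwise rotation about the origin.

    Assumes a right-handed x-y frame seen from above, so that
    omega = (0, 0, -1) and omega x Z = (y, -x).
    """
    x, y = pos
    vx, vy = vel
    r = math.hypot(x, y)
    speed = math.hypot(vx, vy)
    if r == 0.0 or speed == 0.0:
        raise ValueError("position and velocity must be non-zero")
    zx, zy = y / r, -x / r  # clockwise unit tangent at this position
    return (vx / speed) * zx + (vy / speed) * zy

cw = polarisation((0.3, 0.0), (0.0, -0.1))     # swimming clockwise
acw = polarisation((0.3, 0.0), (0.0, 0.1))     # swimming anti-clockwise
radial = polarisation((0.3, 0.0), (0.1, 0.0))  # heading straight outwards
```

Here `cw` is 1, `acw` is -1 and `radial` is 0, matching the three cases described above. For the whole school, the same function would be applied to the position and normalised velocity averaged over all fish.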

Local transfer entropy

Transfer entropy [70] is defined in terms of Shannon entropy, a fundamental measure in Information Theory [24] that quantifies the uncertainty of random variables. Shannon entropy of a random variable $X$ is $H(X)=-\sum_{x\in X}p(x)\log_{2}p(x)$ , where $p(x)$ is the probability of a specific instance $x$ of $X$ . $H(X)$ can be interpreted as the minimal expected number of bits required to encode a value of $X$ without losing information. The joint Shannon entropy between two random variables $X$ and $Y$ is $H(X,Y)=-\sum_{x\in X}\sum_{y\in Y}p(x,y)\log_{2}p(x,y)$ , where $p(x,y)$ is the joint probability of instances $x$ of $X$ and $y$ of $Y$ . This quantity allows the definition of conditional Shannon entropy as $H(X|Y)=H(X,Y)-H(Y)$ , which represents the remaining uncertainty about $X$ once $Y$ is known.
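A small worked example may help to fix these definitions. The joint distribution below is hypothetical and chosen only so that the numbers are easy to check by hand. The conditional entropy is obtained from the chain rule $H(X|Y)=H(X,Y)-H(Y)$.

```python
import math

def entropy(probs):
    """Shannon entropy in bits of an iterable of probabilities."""
    return -sum(p * math.log2(p) for p in probs if p > 0.0)

# Hypothetical joint distribution p(x, y) over two binary variables.
joint = {(0, 0): 0.5, (0, 1): 0.25, (1, 0): 0.0, (1, 1): 0.25}

# Marginal p(y), obtained by summing the joint distribution over x.
p_y = {}
for (x, y), p in joint.items():
    p_y[y] = p_y.get(y, 0.0) + p

h_xy = entropy(joint.values())  # H(X,Y)
h_y = entropy(p_y.values())     # H(Y)
h_x_given_y = h_xy - h_y        # H(X|Y) = H(X,Y) - H(Y)
```

This gives $H(X,Y)=1.5$ bits, $H(Y)=1$ bit and $H(X|Y)=0.5$ bits: when $y=0$ the value of $x$ is certain, and when $y=1$ it is a fair coin flip, so on average half a bit of uncertainty remains.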

In this study we are interested in local (or pointwise) transfer entropy [27, 43] for specific instances of time-series processes of fish motion, which allows us to reconstruct the dynamics of information flows over time. Shannon information content of an instance $x_{n}$ of process $X$ at time $n$ is defined as $h(x_{n})=-\log_{2}p(x_{n})$ . The quantity $h(x_{n})$ is the information content attributed to the specific instance $x_{n}$ , or the information required to encode or predict that specific value. Conditional Shannon information content of an instance $x_{n}$ of process $X$ given an instance $y_{n}$ of process $Y$ is defined as $h(x_{n}|y_{n})=h(x_{n},y_{n})-h(y_{n})$ .

Local transfer entropy is defined as the information provided by the source $\mathbf{y_{n-v}}=\{y_{n-v},y_{n-v-1},\allowbreak\ldots,y_{n-v-l+1}\}$ , where $v$ is a time delay and $l$ is the history length, about the destination $x_{n}$ in the context of the past of the destination $\mathbf{x_{n-1}}=\{x_{n-1},x_{n-2},\ldots,x_{n-k}\}$ , with a history length $k$ :

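$$t_{y\to x}(n,v)=h(x_{n}|\mathbf{x_{n-1}})-h(x_{n}|\mathbf{x_{n-1}},\mathbf{y_{n-v}}) \tag{1}$$

$$t_{y\to x}(n,v)=\log_{2}\frac{p(x_{n}|\mathbf{x_{n-1}},\mathbf{y_{n-v}})}{p(x_{n}|\mathbf{x_{n-1}})} \tag{2}$$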

Transfer entropy $T_{Y\to X}(v)$ is the average of the local transfer entropies $t_{y\to x}(n,v)$ over samples (or over $n$ under a stationary assumption). The transfer entropy is asymmetric in $Y$ and $X$ and is also a dynamic measure (rather than a static measure of correlations) since it measures information in state transitions of the destination.

In order to compute transfer entropy here, the source variable $Y$ and destination variable $X$ are defined in terms of the fish heading. Specifically, $X$ is the first-order divided difference (Newton’s difference quotient) of the destination fish heading, while $Y$ is the difference between the two fish headings at the same time. Let $\Theta_{S}$ and $\Theta_{D}$ be respectively the heading time series of the source and the destination fish. We then construct variables $X$ and $Y$ as follows, for all time points $n$ :

$$x_{n}=\Theta_{D}(n)-\Theta_{D}(n-1) \tag{3}$$

$$y_{n}=\Theta_{D}(n)-\Theta_{S}(n) \tag{4}$$

Thus, $y_{n}$ represents the relative heading of the destination fish with respect to the source fish, while $x_{n}$ represents the directional change of the destination fish. The variables were so defined in order to capture directional changes of the destination fish in relation to its alignment with the source fish, which is considered an important component of movement updates in swarm models [66].

Given the definition of the variables (3) and (4), we computed local transfer entropy $t_{y\to x}(n,v)$ using Equation (2), where $v$ was determined as described in section “Parameter optimisation” that follows. The past state $\mathbf{x_{n-1}}$ of the destination in transfer entropy was defined as a vector of an embedding space of dimensionality $k$ and delay $\tau$ , with $\mathbf{x_{n-1}}=\{x_{n-1-j\tau}\}$ , for $j=\{0,1,\dots,k-1\}$ . Finding optimal values for $k$ and $\tau$ is also described in section “Parameter optimisation”. The state of the source process $\mathbf{y_{n-v}}$ was also defined as a vector of an embedding space whose dimensionality $l$ and delay $\tau^{\prime}$ were similarly optimised. The local transfer entropy $t_{y\to x}(n,v)$ computed on these variables therefore tells us how much information ( $l$ time steps of) the heading of the destination relative to the source adds to our knowledge of the directional change in the destination (some $v$ time steps later), in the context of $k$ past directional changes of the destination. We note that while turning dynamics of the destination may contain more entropy (as rare events), there will only be higher transfer entropy at these events if the source fish is able to add to the prediction of such dynamics.
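The construction of the two embedding vectors can be sketched as below. The helper names `embed` and `te_sample` are hypothetical. The default parameter values are the ones reported in this paper ( $k=l=3$ , $\tau=\tau^{\prime}=1$ , $v=6$ ), and the toy series serve only to make the indexing visible.

```python
def embed(series, n, dim, delay):
    """Return [series[n], series[n - delay], ..., series[n - (dim-1)*delay]]."""
    return [series[n - j * delay] for j in range(dim)]

def te_sample(x, y, n, k=3, tau=1, l=3, tau_s=1, v=6):
    """Collect one observation (x_n, past of destination, source state)."""
    # Earliest index for which both embedding vectors are fully defined.
    first = max((k - 1) * tau + 1, v + (l - 1) * tau_s)
    if n < first or n >= len(x):
        raise IndexError("not enough history at this time index")
    x_past = embed(x, n - 1, k, tau)   # x_{n-1-j*tau},  j = 0..k-1
    y_src = embed(y, n - v, l, tau_s)  # y_{n-v-j*tau'}, j = 0..l-1
    return x[n], x_past, y_src

x = list(range(100, 120))  # toy destination series, x[n] = 100 + n
y = list(range(200, 220))  # toy source series,      y[n] = 200 + n
sample = te_sample(x, y, n=10)
```

For $n=10$ the sample is `(110, [109, 108, 107], [204, 203, 202])`: the destination value, its three previous values, and the three source values ending six steps earlier.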

Computing transfer entropy requires knowledge of the probabilities of $x_{n}$ and $y_{n}$ defined in (3) and (4). These are not known a priori, but the measures can be estimated from the data samples using existing techniques. In this study, this was accomplished assuming that the probability distribution function for the observations is a multivariate Gaussian distribution (making the transfer entropy proportional to the Granger causality [7]), using the JIDT software implementation [42].
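To make the Gaussian assumption concrete, the following NumPy sketch computes local transfer entropy from two linear regressions. It is a stand-in written for illustration and is not the JIDT implementation used in this study. The function names, the fixed embedding delays of 1 and the synthetic test signal are all assumptions. Under a linear-Gaussian model, each conditional density in Equation (2) is a normal density centred on the regression prediction, so the local value is a difference of two Gaussian log-densities (here in nats; divide by $\ln 2$ for bits).

```python
import numpy as np

def _residuals(target, predictors):
    """Residuals of an ordinary least-squares fit with intercept."""
    design = np.column_stack([np.ones(len(target)), predictors])
    coef, *_ = np.linalg.lstsq(design, target, rcond=None)
    return target - design @ coef

def local_gaussian_te(x, y, k=3, l=3, v=6):
    """Local transfer entropy y -> x in nats under a linear-Gaussian model.

    Embedding delays are fixed to 1. Returns one local value per usable
    time index; their mean is the average transfer entropy.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    start = max(k, v + l - 1)
    idx = np.arange(start, len(x))
    x_past = np.column_stack([x[idx - 1 - j] for j in range(k)])
    y_src = np.column_stack([y[idx - v - j] for j in range(l)])
    r_red = _residuals(x[idx], x_past)                       # past only
    r_full = _residuals(x[idx], np.hstack([x_past, y_src]))  # past + source
    s_red, s_full = np.mean(r_red ** 2), np.mean(r_full ** 2)
    # log p(x_n | past, source) - log p(x_n | past) for Gaussian densities
    return (0.5 * np.log(s_red / s_full)
            - r_full ** 2 / (2.0 * s_full)
            + r_red ** 2 / (2.0 * s_red))

rng = np.random.default_rng(0)
y = rng.normal(size=5000)
x = np.zeros(5000)
x[6:] = 0.8 * y[:-6] + 0.2 * rng.normal(size=4994)  # x_n driven by y_{n-6}
local_te = local_gaussian_te(x, y)
average_te = local_te.mean()
```

In this synthetic example the average is clearly positive, because the source drives the destination at the chosen lag, yet individual local values can still be negative at time points where the source-informed prediction happens to be worse than the prediction from the destination's own past. These negative values are the ones interpreted as misinformative.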

Also, we assume stationarity of behaviour and homogeneity across the fish, such that we can pool together all pairwise samples from all time steps, for all trials, maximising the number of samples available for the calculation of each measure. For performance efficiency, we make calculations of the local measures using 10 separate sub-sampled sets (sub-sampled evenly across the trials), then recombine into a single resultant information-theoretic data set.

Parameter optimisation

The embedding dimensionality and delay for the source and the past state of the destination need to be appropriately chosen in order to optimise the quality of transfer entropy. The combination $(k,\tau)$ for the past state of the destination, as well as the combination $(l,\tau^{\prime})$ for the source, have been optimised separately by minimising the global self-prediction error, as described in [65, 80]. In the case of Markov processes, the optimal dimensionality of the embedding is the order of the process. Lower dimensions do not provide the same amount of predictive information, while higher dimensions add redundancy that weakens the prediction. For non-Markov processes, the algorithm selects the highest dimensionality found to contribute to self-prediction of the destination whilst still being supported by the finite amount of data that we have. Values of the dimensionality between 1 and 10 have been explored in combination with values of the delay between 1 and 5. The optimal combinations were found to be the same for both the source and the past of the destination: $k=l=3$ , $\tau=\tau^{\prime}=1$ .

The lag $v$ was also optimised. This was done by maximising the average transfer entropy (after the optimisation of $k$ , $\tau$ , $l$ and $\tau^{\prime}$ ) as per [79], over lags between 0.02 and 1 second, at time steps of 0.02 seconds. The average transfer entropy was observed to grow and reach a local maximum at $v=6$ ( $0.12$ seconds), and then decrease for higher values (see Figure 5). This result might have a biological interpretation: it is plausible for a fish to have a minimum reaction time, which delays the response to behaviour of other fish.

Statistical significance of estimates of local transfer entropy

Theoretically, transfer entropy between two independent variables is zero. However, a non-zero bias (and a variance of estimates around that bias) is likely to be observed when, as in this study, transfer entropy is numerically estimated from a finite number of samples. This leads to the problem of determining whether a non-zero estimated value represents a real relationship between two variables, or is otherwise not statistically significant [80].

There are known statistical significance tests for the average transfer entropy [76, 44, 42], involving comparing the measured value to a null hypothesis that there was no (directed) relationship between the variables. For an average transfer entropy estimated from $N$ samples, one surrogate measurement is constructed by resampling the corresponding $\mathbf{y_{n-v}}$ for each of the $N$ samples of $\{x_{n},\mathbf{x_{n-1}}\}$ and then computing the average transfer entropy over these new surrogate samples. This process retains $p(x_{n}|\mathbf{x_{n-1}})$ and $p(\mathbf{y_{n-v}})$ , but not $p(x_{n}|\mathbf{y_{n-v}},\mathbf{x_{n-1}})$ . Many surrogate measurements are repeated so as to construct a surrogate distribution under this null hypothesis of no directed relationship, and the transfer entropy estimate can then be compared in a statistical test against this distribution. For the average transfer entropy measured via the linear-Gaussian estimator, it is known that analytically the surrogates (in nats, and multiplied by $2\times N$ ) asymptotically follow a $\chi^{2}$ distribution with $l$ degrees of freedom [31, 8]. We use this distribution to confirm that the transfer entropy at the selected lag of 0.12 seconds (and indeed all lags tested) is statistically significant compared to the null distribution (at $p<0.05$ plus a Bonferroni correction for the multiple comparisons across the 50 candidate lags).
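The analytic test can be sketched with the standard library alone for the case $l=3$ used here, since the survival function of a $\chi^{2}$ distribution with 3 degrees of freedom has a closed form. The function names and the numerical inputs below are hypothetical, and other values of $l$ would need a general $\chi^{2}$ routine.

```python
import math

def chi2_sf_3dof(stat):
    """Survival function of a chi-square distribution with 3 degrees of freedom."""
    if stat <= 0.0:
        return 1.0
    return (math.erfc(math.sqrt(stat / 2.0))
            + math.sqrt(2.0 * stat / math.pi) * math.exp(-stat / 2.0))

def te_is_significant(te_nats, n_samples, n_lags=50, alpha=0.05):
    """Analytic test of an average Gaussian TE estimate, for l = 3 only.

    Under the null hypothesis, 2 * N * TE (in nats) is asymptotically
    chi-square distributed with l degrees of freedom. The threshold alpha
    is Bonferroni-corrected for the number of candidate lags.
    """
    p_value = chi2_sf_3dof(2.0 * n_samples * te_nats)
    return p_value, p_value < alpha / n_lags

# Hypothetical estimates, chosen only to exercise the function.
p_weak, sig_weak = te_is_significant(te_nats=1e-4, n_samples=10_000)
p_strong, sig_strong = te_is_significant(te_nats=2e-3, n_samples=10_000)
```

With these inputs the test statistics are 2 and 40, so only the second estimate passes the Bonferroni-corrected threshold of $0.05/50=0.001$.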

Next, we introduce an extension of these methods in order to assess the statistical significance of the local values. This simply involves constructing surrogate transfer entropy measurements as before, however this time retaining the local values within those surrogate measurements and building a distribution of those surrogates. Measured local values are then statistically tested against this null distribution of local surrogates to assess their statistical significance.

We generated ten times as many surrogate local values as the number of actual local estimates, with a total of approximately $371$ million local surrogates. This large set of surrogate local values was used to estimate $p$ -values of actual local values of the transfer entropy. If the $p$ -value is sufficiently small, then the null hypothesis is rejected and the value of the transfer entropy is considered significant (the value represents an actual relationship). The Benjamini-Hochberg [10] procedure was used to select the $p$ -value cutoff whilst controlling for the false discovery rate under ( $N$ ) multiple comparisons.
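The cutoff selection can be illustrated with a minimal version of the Benjamini-Hochberg procedure. The function name, the false discovery rate of 0.05 and the example $p$ -values are assumptions made for this sketch; the rate used in the study is not stated in this section.

```python
def benjamini_hochberg(p_values, fdr=0.05):
    """Return the p-value cutoff selected by the Benjamini-Hochberg procedure.

    With the m p-values sorted in ascending order, the cutoff is the largest
    p_(i) such that p_(i) <= (i / m) * fdr; None means nothing is significant.
    """
    m = len(p_values)
    cutoff = None
    for rank, p in enumerate(sorted(p_values), start=1):
        if p <= rank / m * fdr:
            cutoff = p
    return cutoff

# Hypothetical p-values for eight local transfer entropy estimates.
p_values = [0.001, 0.008, 0.039, 0.041, 0.042, 0.060, 0.074, 0.205]
cutoff = benjamini_hochberg(p_values, fdr=0.05)
significant = [p for p in p_values if cutoff is not None and p <= cutoff]
```

For these eight values the cutoff is 0.008, so only the two smallest $p$ -values are declared significant, even though three others fall below an uncorrected threshold of 0.05.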

Acknowledgements

E.C. was supported by the University of Sydney’s “Postgraduate Scholarship in the field of Complex Systems” from Faculty of Engineering & IT and by a CSIRO top-up scholarship. L.J. was supported by a grant from the China Scholarship Council (CSC NO.201506040167). V.L. was supported by a doctoral fellowship from the scientific council of the University Paul Sabatier. This study was supported by grants from the Centre National de la Recherche Scientifique and University Paul Sabatier (project Dynabanc). J.L. was supported through the Australian Research Council DECRA grant DE160100630. The University of Sydney HPC service provided computational resources that have contributed to the research results reported within this paper.

Author contributions statement

G.T. designed research; V.L., P.T. and G.T. performed research; V.L., L.J., P.T., R.W. and G.T. analysed data. E.C., J.L., R.W. and M.P. developed information dynamics methods, performed information-theoretic analysis, and identified information flows and motifs. E.C. designed, developed and ran software for the information-theoretic analysis. G.T., J.L., E.C. and M.P. conceived and analysed information cascade. E.C., J.L. and M.P. wrote the paper. G.T. and V.L. edited the manuscript and contributed to the writing.

Supplementary information

Bibliography

[1] Larissa Albantakis, Arend Hintze, Christof Koch, Christoph Adami, and Giulio Tononi. Evolution of integrated causal structures in animats exposed to environments of increasing complexity. PLOS Computational Biology, 10(12):1–19, 2014.
[2] Alessandro Attanasi, Andrea Cavagna, Lorenzo Del Castello, Irene Giardina, Tomas S Grigera, Asja Jelić, Stefania Melillo, Leonardo Parisi, Oliver Pohl, Edward Shen, et al. Information transfer and behavioural inertia in starling flocks. Nature Physics, 10(9):691–696, 2014.
[3] Alessandro Attanasi, Andrea Cavagna, Lorenzo Del Castello, Irene Giardina, Asja Jelic, Stefania Melillo, Leonardo Parisi, Oliver Pohl, Edward Shen, and Massimiliano Viale. Emergence of collective changes in travel direction of starling flocks from individual birds’ fluctuations. Journal of The Royal Society Interface, 12(108), 2015.
[4] Alessandro Attanasi, Andrea Cavagna, Lorenzo Del Castello, Irene Giardina, Stefania Melillo, Leonardo Parisi, Oliver Pohl, Bruno Rossaro, Edward Shen, Edmondo Silvestri, and Massimiliano Viale. Collective behaviour without collective order in wild swarms of midges. PLoS Comput Biol, 10(7):1–10, 2014.
[5] Nihat Ay and Daniel Polani. Information flows in causal networks. Advances in Complex Systems, 11(01):17–41, 2008.
[6] M. Ballerini, N. Cabibbo, R. Candelier, A. Cavagna, E. Cisbani, I. Giardina, V. Lecomte, A. Orlandi, G. Parisi, A. Procaccini, M. Viale, and V. Zdravkovic. Interaction ruling animal collective behavior depends on topological rather than metric distance: Evidence from a field study. Proceedings of the National Academy of Sciences, 105(4):1232–1237, 2008.
[7] Lionel Barnett, Adam B. Barrett, and Anil K. Seth. Granger causality and transfer entropy are equivalent for Gaussian variables. Physical Review Letters, 103(23):238701+, 2009.
[8] Lionel Barnett and Terry Bossomaier. Transfer Entropy as a Log-Likelihood Ratio. Physical Review Letters, 109:138105+, 2012.