Effective networks: a model to predict network structure and critical transitions from datasets
Deniz Eroglu, Matteo Tanzi, Sebastian van Strien, Tiago Pereira

TL;DR
This paper introduces an effective network model that predicts the structure and critical transitions of complex systems like neuronal networks using data-driven reconstruction, even outside observed parameter ranges.
Contribution
It presents a novel approach to predict network behavior and critical transitions from limited data by combining local dynamics with statistical interaction models.
Findings
Successfully reconstructed neuronal network structure from data
Predicted critical transitions outside observed parameter ranges
Validated approach on cat cerebral cortex networks
Abstract
Real-world complex systems such as ecological communities and neuron networks are essential parts of our everyday lives. These systems are composed of units which interact through intricate networks. The ability to predict sudden changes in network behaviour, known as critical transitions, from data is important to avert disastrous consequences of major disruptions. Predicting such changes is a major challenge as it requires forecasting the behaviour for parameter ranges for which no data on the system is available. In this paper, we address this issue for networks with weak individual interactions and chaotic local dynamics. We do this by building a model network, termed an effective network, consisting of the underlying local dynamics at each node and a statistical description of their interactions. We illustrate this approach by reconstructing the dynamics and structure of realistic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Systems and Time Series Analysis · Theoretical and Computational Physics · Complex Network Analysis Techniques
Effective networks: a model to predict network structure and critical transitions from datasets
Deniz Eroglu,1,2,3 Matteo Tanzi,2,4 Sebastian van Strien,2 Tiago Pereira1,2
Abstract
Real-world complex systems such as ecological communities and neuron networks are essential parts of our everyday lives. These systems are composed of units which interact through intricate networks. The ability to predict sudden changes in network behaviour, known as critical transitions, from data is important to avert disastrous consequences of major disruptions. Predicting such changes is a major challenge as it requires forecasting the behaviour for parameter ranges for which no data on the system is available. In this paper, we address this issue for networks with weak individual interactions and chaotic local dynamics. We do this by building a model network, termed an effective network, consisting of the underlying local dynamics at each node and a statistical description of their interactions. We illustrate this approach by reconstructing the dynamics and structure of realistic neuronal interaction networks of the cat cerebral cortex. We reconstruct the community structure by analysing the stochastic fluctuations generated by the network and predict critical transitions for coupling parameters outside the observed range.
- 1
Instituto de Ciências Matemáticas e Computação, Universidade de São Paulo, São Carlos, Brazil
- 2
Department of Mathematics, Imperial College London, London SW7 2AZ, UK
- 3
Department of Bioinformatics and Genetics, Kadir Has University, 34083 Istanbul, Turkey
- 4
Department of Mathematics and Statistics, University of Victoria, Victoria BC, CA
We are surrounded by a range of complex networks consisting of a large number of intricately coupled nodes. Neuron networks form an important class examples where the interaction structure is heterogeneous [1]. Because changes in the interaction can have massive ramifications on the system as a whole, it is desirable to predict such disturbances and thus enact precautionary measures to avert potential disasters. For instance, neurological disorders such as Parkinson’s disease, schizophrenia, and epilepsy, are thought to be associated with anomalous interaction structure among neurons [2]. Often, as in the case of neuron networks, it is impossible to directly determine the interaction structure. Therefore, a major scientific challenge is to develop techniques which use measurements of the time evolution of the nodes state to indirectly recover the network structure and predict the network’s behaviour when essential characteristics of the interactions change.
The literature on data-based network reconstruction is vast. Reconstruction methods can be classified into model-free methods and model-based methods. The former identify the presence and strength of a connection between two nodes by measuring the dependence between their time-series in terms of: correlations [3, 4], mutual information [5], maximum entropy distributions [6, 7], Granger causality, and causation entropy [8, 9]. Such methods alone do not provide information on the dynamics of the network, which is necessary to predict critical transitions. Model-based methods instead provide estimates (or assume a priori knowledge) of the dynamics and interactions in the system, and use this knowledge to reconstruct the network topology. When the interactions are sufficiently strong, the network structure can be recovered [11, 12, 10]. For a more extensive account of reconstruction (model-free and -based) methods see the reviews [10, 13, 14] and references therein.
In many real-world applications, the behaviour of isolated nodes is erratic (chaotic) and the interaction is weak [15, 16, 1]. Moreover, the network structure typically has community structures and hierarchical organisations such as the rich-club networks found in the brain [17]. As the interaction strength per connection is weak and the statistical behaviour of the nodes robust, the influence of a single link on the overall dynamics is negligible and only the cumulative contribution of many links matter. Furthermore, because of the chaotic dynamics and weak coupling, the influence of a single node on the rest of the network corresponds essentially to a random signal that is superposed to the randomness generated by the chaos in the local dynamics. The existing reconstruction techniques fail to reconstruct a model from the data, as they require the interaction to be of the same magnitude as the isolated dynamics.
In this paper, we introduce the notion of an effective network. Its aim is to make a model of a complex system from observations of the nodes evolution when the network has heterogeneous structure, the strength of interaction is small and local dynamics are highly erratic. This approach starts by reconstructing the local dynamics from observations of nodes with relatively few connections, and then recovers the interaction function from observations of the highly connected nodes whose dynamics are the most affected by the interactions as a result of the multitude of connections they receive from the rest of the networks[18, 19]. A key achievement is that we are able to identify community structures even in the presence of weak coupling. An outcome of the effective network is that it recovers enough information to forecast and anticipate the network behaviour, even in situations where the parameters of the system change into ranges that have not been previously encountered.
Complex networks of nonlinear systems. We consider networks with nodes with chaotic isolated dynamics and pairwise interactions satisfying some additional assumptions described below. The network is described by its adjacency matrix , whose entry equals if node receives a connection from and equals [math] otherwise. We assume that the time evolution of the state of node at time is expressed as
[TABLE]
When performing reconstruction from data, the isolated local dynamics , the coupling function , the coupling parameter (that is small), the adjacency matrix , and even the dimension of the space , are assumed to be unknown. This class of equations can be applied to modelling many important complex systems found in neuroscience (e.g. coupled equations in neuron networks) [20], engineering (e.g. smart grids) [21, 22], material sciences (e.g. superconductors) [23], and biology (e.g. cardiac pacemaker cells) [24], among other systems.
Our three assumptions are: the local dynamics are close to some unknown ergodic and chaotic map (that is, , which is often the case in applications [25, 26]). The network connectivity is heterogeneous, which means that the number of incoming connections at a node (given by its degree ) varies widely across the network. More precisely, is larger for a few nodes called hubs. is such that, denoting by the maximum number of connections is of the same magnitude of . Assumptions and imply that the the total effect of the network on a node , which is of the order , ranges from small (for nodes with few incoming connections) to of order 1 for the hub nodes. A prime example is the cat cerebral cortex which possesses inter-connected regions split into communities with a hierarchical organization. This network has heterogeneous connectivity, chaotic motion and weak coupling [27, 28, 29]. Other examples, includes the drosophila optic lobe network [30, 31]. For a given dataset, our effective network first tests whether the underlying system satisfies assumptions , and, if so, reconstructs the model (see the Methods for more detail).
We assume the availability of a time series of observations , where is a projection to a variable on which unit interactions depend. This situation occurs frequently in applications; for example, with measurements of membrane potentials in neurons or voltages in electronic circuits. Here, we demonstrate how to obtain a model for the system by constructing an effective network from the time series .
Effective networks recover both structure and dynamics
To obtain an effective network from observations of a complex system, we combine statistical analysis, machine-learning techniques, and dynamical systems theory for networks. An effective network provides (i) local evolution laws and averaged interactions for each unit that, in combination, closely approximate the unit dynamics, and (ii) a network with the same degree distribution and communities as the original system. We use the term “effective” because, even if it does not carry information on each link and interaction in the original system, the network gathers sufficient data to reproduce the behaviour of the original network and predict its critical transitions.
Using our assumptions for the network and local dynamics, we can show that the evolution at each node will have low-dimensional excursions over finite time scales. In particular, the approximated evolution rule at node has a low dimensional description over large time scales given by . More precisely, the evolution rule at node is given by
[TABLE]
where is the isolated dynamics, is the rescaled degree, and is the averaged mean-field effect of the interaction function (effective coupling function) that takes into account the cumulative effect of interactions on node . The true dynamics slightly deviates from this rule and is given by
[TABLE]
where is a fluctuation term that is small for an interval of time which is exponentially large in terms of the size of the network. This low-dimensional reduction has been rigorously established in important test cases over long time scales (see [19]). This approximation holds, roughly speaking, for two reasons. Firstly, a low degree node has a small number of connections compared to the maximum degree, and the interactions slightly perturb its statistical behaviour. Secondly, hub nodes receive a huge number of connections so that the sum of their interactions with the other nodes in the network, , can be approximated as the integral where is the stationary distribution of typical orbits of the low degree nodes. This approximation holds up to a small fluctuation, , which depends on the state of neighbours of the th node. This fact will be key to detecting communities.
The approximation described above also applies to the measured state variable . Depending on the system, we need to preprocess the data (see Methods). The processed variable is still referred to as . The Takens reconstruction technique tells us that is a nonlinear function of past points , for a given number provided by the approach. Here, we focus on the case when , which occurs in many real-world examples, and discuss cases with in the Supplementary Materials. This means that
[TABLE]
where
[TABLE]
is a low-dimensional law that approximates the evolution of . We first employ Takens reconstruction to estimate and coarsely classify the nodes by their degrees. For instance, in rich-club motifs, the network has low-degree nodes organised in communities and high-degree in the rich-club. Since is small for low-degree nodes , the evolution rules at such nodes will be similar to the isolated dynamics. Thus we identify which nodes are low-degree nodes and we can recover the local dynamics . Next, we use Eq. (2) and a classification of nodes by their time-series, to obtain the coupling function and estimate the degree distribution and the coupling parameter . More precisely, an effective network is obtained in three main steps (see Methods for additional details):
- (i)
Reduced dynamics. Since the fluctuation term is small, the rule can be estimated fitting the points using machine-learning techniques. Here, we estimate the function by decomposing it as a linear combination of basis functions, which is chosen according to the application. The parameters of the basis functions are obtained by performing a -fold cross-validation with training and test [32, 33] (see Methods for details). The particular technique used in this step is not essential as the dynamics will be low-dimensional. Other techniques such as compressive sensing based techniques [34, 35, 36] could be also employed.
- (ii)
Isolated dynamics and effective coupling. Since the network is heterogeneous, it has many low degree nodes for which is close to the uncoupled dynamics . We first run a model-free estimation to coarsely classify nodes into low-degree nodes and hubs and estimate (see Supplementary Material). We then use the expression of recovered at low degree nodes to obtain an approximation for , while recovered at hub nodes allows to estimate . We estimate the parameter by using Bayesian inference (see Methods).
- (iii)
Network structure and communities. Since , we can recover the network’s degree distribution from knowledge of at every node. To reconstruct the community structures, we use the fact that the term at a node depends on the state of all nodes interacting with . Thus the correlation between and is proportional to the matching index (number of common neighbors) of the nodes and . Thus, we can create a network that has the same statistical properties of the actual one. Further details are provided in the next section.
Benchmark model for the isolated dynamics. We consider heterogeneous networks of neurons described by the Rulkov model (see Methods for details). The model has two variables, and , evolving at different time scales. The fast variable describes the membrane potential and the slow currents. In the following discussion, we choose the parameters such that the isolated dynamics presents a tonic spiking behaviour. In the Supplementary Materials, we also show the results for bursting dynamics, i.e. when the parameters are such that the membrane potential oscillates between two phases: a bursting phase of a rapid spiking, and a quiescent phase. We also validated our methods for other types of isolate chaotic dynamics (also in the Supplementary materials).
Revealing community structure: the rich-club motif
We apply the effective network in a wide range of setups as discussed in the supplementary material. Here we focus on the network structure of the cat cerebral cortex for which the mesoscopic connectivity has been probed and detailed [29]. The regions and their connections were discovered by using datasets from tract-tracing experiments analyzed via nonmetric multidimensional scaling, an optimization approach that minimizes the distance between connected structures and maximizes between the unconnected ones [27, 28].
The network contains 53 meso-regions arranged in four communities that closely follow functional subdivisions; namely, visual (16 nodes), auditory (7 nodes), somatomotor (16 nodes) and frontolimbic (14 nodes), as shown in Fig 2 (a). In addition to these cortical regions, some cortical areas (hubs) form a hidden layer called a rich-club and are densely connected to each other and the communities. A set of nodes form a rich-club if their level of connectivity exceeds what would be expected by chance alone. The maximum number of connections in this network is . The network obtained is weighted [27, 28]. For simplicity and to improve the performance in detecting communities, we turn the network into an undirected simple graph [29]. We simulate each mesoregion as a neuron interacting via electrical synapses (see Methods for details) and obtain a multivariate data for a time . For simplicity, we will denote , whenever there is no risk of confusion.
For comparison, we first test the recovery of the network using the functional network framework [39, 37, 38]. The intuition behind this approach is that nodes with similar time series perform analogous functions in the network and are assumed to have similar characteristics. The functional network can thus be constructed by the matrix of the similarities between nodes via statistical analysis [40, 41]. Here we employ a covariance analysis between the time series. The functional network alone does not detect communities from the data because the bursts are erratic (Fig. 2 (b)). Other similarity measures such as mutual information give no significative improvement.
We also compared our method with a reconstruction by the sparse recovery technique [34, 10]. There, the dynamics is written as linear combination of chosen basis functions and the unknowns are the coefficients. Sparse recovery works well when many coefficients are exactly zero. The presence of a link is determined when any coefficient of the corresponding interaction is nonzero. In our setting, each link provides a small contribution to the individual dynamics and only their cumulative effect can influence the dynamics. Therefore, the coefficients to be recovered are close to zero, and cannot be distinguished from zero terms. In this setup, sparse recovery can also uncover a model for the isolated dynamics, however, it misidentifies the network structure. We implemented the sparse recovery method to our benchmark method and could assign only of the nodes to the right cluster. These results are discussed in the Methods.
Remarkably, however, the effective network is able to recover the community structures (Fig. 2 (c)). To every pair of and we assign a Pearson distance which measures the difference between the observed dynamics at nodes and so that if the attractors of and are similar and if they are reasonably distinguishable (see Methods for details). The higher the number of nodes with behaviour different from , the larger the intensity . The analysis of intensities allows us to distinguish nodes in the rich-club. Next, we find the local low-dimensional rules for each time series by applying Step (i). These rules are similar and display a chaotic evolution, thus, confirming that the similarity analysis by the functional networks would provide little insight into the reconstruction. Having the local rules , we can decompose the time series in terms of a low-dimensional deterministic part and a fluctuation term , which depends on the neighbours of the th node. One of our main key points is that these fluctuations allow us to reveal the community structure.
Indeed, if nodes and interact with the same nodes, they are subject to the same inputs and the covariance Cov is high. If not, Cov is nearly zero due to the decay of correlations in the deterministic part. Therefore, Cov is high when nodes and have high matching index (high fraction of common connections). It is crucial that the correlation analysis is restricted to the dynamical fluctuations . Indeed, since the variance of the deterministic part of is larger than that of the small fluctuations , performing a direct correlation analysis between and hides all the contributions to the correlations coming from the covariance between and . Consequently, the correlation of the deterministic part is close to zero due to the chaotic nature of the dynamics (see Supplementary Materials). So, remarkably, the noise terms are needed to access the community structure.
Using steps (i)-(iii) we obtained a model for the isolated dynamics, coupling function, and distribution of degrees. To apply current methods of community detection, we obtain an adjacency matrix with entries equal to zero or one from the matrix of correlations Fig. 12 (c). This is done by thresholding the entries and considering a link only when the correlations were greater than . We tested threshold values ranging from to and obtained the same results. Indeed, the distribution of the entries of the matrix of correlations is unimodal and has a peak near 0.5. We computed rich-club coefficients for each node [42] (see details in Methods), and nodes with the highest coefficients formed the rich-club displayed at the centre of the network. By applying the community detection method [43] to the thresholded effective matrix, we recovered the communities according to their function as shown in Figure 2 (d).
There is no reason to expect that the links in the effective network correspond to links in the actual network. However, since every node makes most of its interactions within a cluster, two nodes with highly correlated fluctuations are likely to belong to the same community, and this can be enforced in the effective network by adding a connection between them. Further results for a variety of rich-club motifs are shown in the Supplementary Materials.
Performance of the communities reconstruction.
To quantify the effectiveness of community reconstruction, we compute , where is the total number of nodes and is the number of nodes assigned to the wrong community. We compute for different values for the coupling strength between and . For each value of , we considered 50 different simulations of the dynamics obtained by choosing different initial conditions. The figure shows the plot of the mean of and a shaded region corresponding to the standard deviation. For values larger than , the reconstructing procedure cannot identify the communities correctly. This is due to the synchronization rich club which appears around this value.
Predicting critical transitions in rich-clubs
The ability to reconstruct the network and dynamics from data can be exploited to predict critical transitions that may occur when the coupling strength. In the cat brain, for example, a transition to collective dynamics in the rich-club has drastic repercussions for the functionality of the network [29, 44].
Our goal now is to obtain data when the network is far from a collective dynamics and predict the onset of collective motion in the rich-club. The effective network can predict the onset of such collective dynamics based on a single multivariate time series for fixed coupling strength in a regime far from the synchronized state. We analyze time-series obtained simulating the dynamics at a value of the coupling strength , and apply our reconstruction procedure to obtain the network structure and a model of the isolated dynamics.
Estimating the transition to synchronization in the rich-club. Transitions to synchronization between the bursts is possible while the fast spikes remain out of synchronization [45]. To estimate the transition to burst synchronization, we obtain the slow variable as a filter over the membrane potential (fast variable). Indeed, since we measure the membrane potential , the slow variable is given as and for a good choice of parameters and this can be identified with the slow variable of the model . In the methods section, we obtain an equation for the slow variable and analyze the effect of the network of the dynamics of the slow variable.
We can use the data on the network and the dynamics recovered from the time-series recorded at to predict that at the value the rich-club will develop a burst synchronization (details in the Methods).
To capture such a transition in the bursts of the rich club, we introduce a phase for the slow variable, the definition of can be found in the Methods section. Once we have the phase we compute the order parameter
[TABLE]
Loosely speaking, a small value of the order parameter, , means that no collective state is present in the system, whereas means that the bursts are synchronized. Figure 4 shows that behaviour of the order parameter as a function of the coupling. The rich-club undergoes a transition to burst synchronization at that corresponds to an increase of roughly of the coupling strength and is close to the predicted value . In the Supplementary Materials, we present other examples where the local dynamics is fully chaotic.
Obtaining a statistical description of the network
The effective network can also provide detailed information on the structure of each community and a statistical description of the network. To illustrate this, we reconstruct the statistical properties of scale-free networks.
Scale-free networks of coupled bursting neurons. We consider coupled bursting neurons with excitatory synapses [45] in scale-free networks (see Methods for details on simulation of scale-free networks and formulation of the model). Our techniques work equally well for spiking dynamics.
We generate a scale-free network with nodes such that the probability of having a node of degree is proportional to , where . For this reconstruction we only need 2000 data points for each node. Again, to every pair of time series and we assign a Pearson distance and the node intensity . The empirical distribution of the intensities approximates the degree distribution of the network, see Fig 5(d). In the example here, the estimated structural exponent from the distribution of is , which yields a relative error of nearly 25% with respect to the true value of (see also the plots in Figure 6 a)). The functional network therefore overestimates , which has drastic consequences for the predicted character of the network. For example, the number of connections of a hub for a scale-free network is concentrated at , so the relative inaccuracy for the estimate of the maximal degree is , which is about 500%. Such inaccuracy has important repercussions for the ability to predict the emergence of collective behaviour [46, 19].
The statistical measure used for the construction of functional network typically depend in a nonlinear way on the degrees, thus causing a distortion in the statistics. We will discuss the case of Pearson distance. Suppose that the signals are purely deterministic, , and thus perfectly fall on the graphs of the functions , that are determined by and so by the degree of the node. The Pearson distance between the signal at and is going to be a number between 0 and 1, depending on how close these graphs are. Unfortunately this distance depends nonlinearly on the degrees and . Devising another distance without the knowledge of the interaction, in general, would still carry the nonlinear dependence on the degrees. Moreover, once fluctuations from the network are included as the differences between time-series for different s can be due to fluctuations rather than differences in the degrees. So the decomposition of the rules in terms of interactions and fluctuations is essential to recover precise details of the real degree distribution.
The effective network provides a better statistical description of the network structure. To compare with the functional network approach described above, we constructed an effective network of the same system tested for the functional network approach. The estimate for from the effective network is , which has an error of only ( Fig. 5 (d)). We repeat the analysis on a different network with different parameters in the degree distribution. The estimated values are shown in Fig. 6 (c) as a function of the true parameter . The relative error on the estimated exponent is within .
Real-world scale-free networks: the optic lobe of Drosophila melanogaster. Effective networks can be used to provide estimates for real-world systems. As one example, we applied our method to data simulated from the neuronal network in the D. melanogaster optic lobe, which constitutes 50% of the total brain volume and contains 1781 nodes [30]. The degree distribution has a power-law tail [31]. We used spiking neurons with chemical coupling to simulate the multivariate time series, from which we constructed an effective model capable of estimating the degree distribution with great accuracy (Fig. 6 (b) and Methods).
Additional applications of our procedure for different types of chaotic systems (doubling map, logistic maps, spiking neurons with electrical synapses, Rössler systems, and Hénon maps) can be found in the Supplementary Materials, together with results demonstrating that effective networks are robust against noise.
Conclusions
We have introduced an effective network obtained from time-series of a complex network (by observing the dynamics at each node). Our method complements the existing ones in two ways. Firstly, it encompasses the case of isolated chaotic dynamics. Secondly it deals with weak coupling among the nodes. Both cases are commonly found in applications. Key to the success of the reconstruction is the heterogeneity of the network which allows us to separate the contribution from the local dynamics and the interactions and thus perform a multi-level reduction.
We have compared our procedure with methodologies most relevant for the systems we considered. We have thus excluded results tailored to specific experimental setups or dynamics (binary dynamics coupled on networks [47], and see [10] for a review). We also did not consider methods that rely on measurements obtained intervening on the system with controlled inputs (see e.g. [13]) and restrict our attention to the case where the time-series is recorded under constant conditions. When the coupling is strong, sparse recovery can be applied [34]. On the other hand, when the coupling is weak sparse recovery cannot distinguish small parameters from those that are identically zero thus misidentifying connections between nodes as it can be observed in Figure 7. Also model-free methods are ill-suited to analyse weakly coupled chaotic dynamical systems studied here. Indeed, the influence of a single pairwise interaction on the time-series is so weak that its effect on the correlation of the measurements can hardly be detected by direct measurements.
Our reconstruction works when no collective dynamics is induced by the weak coupling. When the rich-club synchronizes. A synchronous rich-club induces correlation between the fluctuations in the communities. This happens because two nodes in different communities but coupled to the rich-club would receive a similar forcing. Since we use the fluctuation to identify the community structure, the method would identify these nodes as belonging to the same community. While our method works well for various isolated dynamical systems, in two scenarios it didn’t provide a good reconstruction. First, when the local dynamics is the Lorenz with classical parameters. In this case, the passage near a fixed points smears the fluctuations. Second, in the bursting dynamics, when the quiescent state is too long. This failure seems to be related as well to the long passage near a fixed point. Our method works with large networks, up to nodes after which the computation time can be long. In fact, the number of operations to fit the the dynamics at all nodes, as in point (i), is proportional to Number of nodes (Length of the time series) β, where the power depends on the optimization algorithm adopted. The number of operations to determine the correlations between the fluctuations, point (iii) above, is of the order of (Number of nodes) 2 Length of the time series. The effective network exploits the network heterogeneous structure. This allows to single out different dynamical behaviours across the network and use this information to reconstruct the local dynamics and the effective interaction function. To obtain the community structures we use that certain noise terms associated with the time series at two nodes in the same community are correlated. By collecting data when the network is far from critical transitions, our method can reconstruct the effective network and enables predictions regarding critical transitions occurring in a network.
Methods
Random scale-free networks
A scale-free network has degree distribution , where is the characteristic exponent and is a normalising constant. We use a random network model which is an extension of the Erdös-Rényi model for random graphs with a general degree distribution [48]. Given , each potential edge between and is chosen with probability and where To ensure that , we consider Then is the expected degree of the th node. Taking , where and with denoting the mean degree, the distribution of expected degrees follows a power law, and so is the expected exponent of power-law distribution [48, 49]. If the random graph generated is not connected, we use only the largest connected component (the giant component).
Finding power-law distribution parameters from empirical data. For estimation of the power-law distribution parameters, we use the maximum likelihood estimator first introduced by Muniruzzaman [50], which is equivalent to the well-known Hill estimator [51]. After that, we test the reliability between the data and the power law by using the goodness-of-fit method. If the resulting -value is larger than 0.1, the power-law estimation is an appropriate hypothesis for the data. A complete procedure for the analysis of power-law data can be found in Ref. [52].
Functional networks
For networks composed of chaotic oscillators, building the functional network from the standard Pearson correlation between time series gives no meaningful results because of the decay of correlation intrinsic to the chaotic dynamics. Hence, functional networks are built using a Pearson distance , which describes the proximity of the dynamics (i.e. the ‘attractor’) at two nodes and . To do this, we consider the time series , reordered in according to the lexicon order; that is, according to the magnitude of the first component of . Then, let be the Pearson correlation, Cor, so that indicates that the attractors at nodes and fully agree. Finally, define the Pearson distance so that indicates full agreement of the dynamics and measures the difference between the attractors.
The intensity approximates how many nodes have a dynamical rule different from and helps to distinguish between poorly connected nodes and hubs. Since most of the network is composed of poorly connected nodes with similar evolution rules, they exhibit a smaller than high-degree nodes, which are scarcer and have different dynamics from the low-degree nodes. In the case of a scale-free network where the degree distribution is monotonic, the empirical distribution of is also expected to have the same trend.
Effective network
Dimensional reduction over finite time scales. To construct an effective network, we combine statistical estimation with mathematical results for high-dimensional dynamical systems that are based on theorems describing the evolution of dynamical systems coupled on a heterogeneous networks.
The assumptions we make are: (a) the dynamics determined by is ergodic, (b) the network is heterogeneous in the sense that most nodes are low degree and a few hubs make many connections, and (c) the coupling strength is small so that . These assumptions play a role in the reduction process. Indeed, (c) means that the contribution of the low-degree nodes to the dynamics of the hub nodes is an average, (b) means that the ergodic behaviour of low-degree nodes is similar to that of , and finally, (a) means that we obtain a hierarchy of reductions with different dynamics (depending on connectivity) from which the structure of the network can be inferred.
In heterogeneous networks, and up to a small error (relative to the network size), the dynamics at each node evolves according to a low-dimensional dynamical system for a certain time scale where the effect of the interaction of one node on the entire network is averaged to a give net interaction. This is rigorously proven in [19] under some additional technical assumptions.
One can write the interaction term in equation (1) at node as
[TABLE]
where
[TABLE]
is the averaged interaction in terms of the invariant measure for the isolated dynamics and where is the degree of node . The term is determined by the states of the th node neighbours. The low-dimensional system, called the reduced system, plus a fluctuation term thus reads as
[TABLE]
where for . Using the chaotic and ergodic properties of the dynamics and concentration inequalities, we can show that is small over exponentially large time scales, in terms of the system size, and for most initial conditions. Not all structures of the graph will follow this reduced equation. If a fully connected graph is weakly coupled to a scale-free network, then it can develop dynamics different from those of the reduced model. In this case, we cannot use the physical measure in the expectation; however, the reduction is still valid whenever is substituted with a measure that satisfies self-consistency relationships (see below).
k-Fold Cross-Validation
Cross-validation is a resampling method used to evaluate dynamical models on a limited data sample. The parameter refers to the number of groups that a given data is to be divided into. We use , since it is known that this value produces the lowest bias in several tests, so that the method can be called 10-fold cross-validation. The procedure is the following: First, randomly shuffle the data and divide it into k groups. Hold few groups to test the system, use the remaining ones as training data. In our case, we used 1 group to test the system and 9 groups to train our model. We fit a model on the training set and evaluate it on the test one. According to the evaluation scores, we summarize the dynamical properties of the model. Further details on how this standard procedure works can be found in [33].
Reconstruction of the effective network representation from data
Step 1 : Reduced dynamics. Given the multivariate time series , we consider each time series separately and then perform a Takens reconstruction of the attractor for each time series . The reduction guarantees with high probability that the dynamics is low dimensional. If the time series is high dimensional, we discard it, and otherwise, once we are in the appropriate dimension, we continue to the next step. Many cases can be captured by a one-dimensional map and Takens reconstruction yields the return map
[TABLE]
where . In fact, even when the local dynamics is three dimensional, by introducing a Poincarè section, we can sometimes reduce the analysis to one dimension (see Supplementary Materials for details).
Step 2: Isolated dynamics and effective coupling function. We first obtain a model for the isolated dynamics. Here we describe the procedure when the reduced dynamics can be modelled by a one-dimensional map, since the method works similarly in the higher-dimensional setting as shown in the Supplementary Materials.
To estimate the deterministic rule for each time series, we use a -fold cross-validation with of the time series for training and for testing (see k-Fold Cross-Validation above). To obtain a model for the isolated dynamics we first build a functional network (see Functional Networks section in Methods). For the low-degree nodes, is negligible and the dynamics at the low-degree nodes are close to . We identify these nodes by analysing the distribution of . We use the top nodes of the highest intensity nodes to obtain a proxy for the isolated dynamics. We then average the rules obtained at those nodes to get . The choice of is not fixed and depends upon the number of nodes and the fluctuation , which is averaged from the highest intensity nodes. A good heuristic to choose is , as we have used here.
The effective coupling function. Since , analysing the family can yield the shape of up to a multiplicative constant via a nonlinear regression by imposing that and are linearly dependent. The choice of the base function for the fitting is supervised and depends on the particular application (see Supplementary Material for additional examples).
Step 3. Network Structural Statistics. After selecting a that satisfactorily approximates up to a multiplicative constant over all indices , we estimate the parameter using a dynamic Bayesian inference. Because the fluctuations are close to Gaussian we use a Gaussian likelihood function and a Gaussian prior for the distribution of the values of , and hence obtain equations for the mean and variance. We split the data into epochs of points and update the mean and variance iteratively.
Matching index: Recall that the degree of node is and count the number of neighbours it has. We now define the matching index of a graph [29]. Consider the neighbourhood of node as . This is the set of nodes that shares an edge with the node . The matching index of nodes and is the cardinality of the overlap of their neighbourhoods . We are interested in the normalised matching index:
[TABLE]
or equivalently in terms of the adjacency matrix
[TABLE]
Clearly if and only if and are connected to exactly the same nodes, i.e., ; whereas if nodes and have no common neighbours.
Community structures. Once we have obtained the rules , we filter the deterministic part of the time series and access the fluctuations . We decompose the fluctuations where is the fluctuation of the local mean field from nodes in the cluster containing , and is the contribution from outside the cluster. Since a node makes most of its connections within its cluster, with high probability, and thus if and belong to the same cluster . Next, we notice that the common noise is generated by the common connections between nodes and . In fact, for fixed isolated dynamics and coupling function
[TABLE]
It is well known that in the cat cerebral cortex nodes in the same community has a high matching index while nodes are distinct communities has a low matching index. In fact, this tends to be typically in modular networks [29]. Therefore, for nodes in distinct clusters the component , so Hence, we can recover the network structure by performing a covariance analysis of the noise.
Filtering out the deterministic part. Filtering out the deterministic part plays a major role in recovering community structures. Suppose we have two signals of the form , where is independent of and is a common noise term. represents the superposition of the deterministic chaos and the independent fluctuations of the network setting. For the correlation, we have
[TABLE]
Hence, the large values of the variance of the time series () suppress the contribution of the common noise, and an analysis solely based on the the original time series will overlook the common noise contribution. In fact, in the real system, we cannot assume to be independently distributed. However, if the system is sufficiently chaotic, it will exhibit fast decay of correlations.
Generating the connectivity structure
To generate the adjacency matrix from a given degree distribution, we use the configuration model [53]. We know the number of stubs (half-links) for each node and the model randomly connects these stubs (half-links) to each other and generates an adjacency matrix that respects the given distribution.
Dynamical Models
We tested the effective network on a wide range of dynamics and networks satisfying assumptions (a)-(c). We focused on scale free networks and on rich club networks to evaluate the performance of the reconstruction. As a benchmark model, we restricted our attention to a neuron dynamics. Further models can be found in Supplementary Material.
Neuron Model
We use Rulkov maps to model spiking and bursting neurons [45]. In this setting, the variable at each node is two dimensional and we will denote . The local uncoupled dynamics is with
[TABLE]
The variable represents the membrane potential of a neuron, and, in this case, is the state variable measured by the observed time series ; that is, . Different combinations of parameters and give rise to different dynamical states of the neuron, such as resting, tonic spiking, and chaotic bursts. To test our procedure we considered two cases where and , which correspond to tonic spiking and for bursting.
Chemical synapsis. Synaptic coupling occurs only through the membrane potential . That is, with , where
[TABLE]
and is a parameter called reverse potential. Choosing , the synaptic connection is excitatory. We take , , and . Applying the dimensional reduction in (3), we obtain a map with averaged value of the interactions One can a posteriori verify that , where is the physical measure for the map and can be estimated from the dynamics of the low-degree nodes.
Electrical synapsis. Again, the synaptic coupling occurs only through the membrane potential . That is, with
[TABLE]
so the coupling depends only on the difference of states.
The cat cerebral cortex
The dataset for connections in the cat cerebral cortex consists of 53 geographic cortical areas investigated in detail with data-mining methods and was taken from [27, 28]. The network is organised into four functional areas: visual, auditory, somatosensory-motor, and frontolimbic.
Functional network. Several methods can be used to create a functional network to recover the clusters. The approach is to compute pairwise correlations of all nodes and to consider the similarities as interactions between nodes. This method is not appropriate for weakly coupled chaotic systems since they exhibit exponential decay of correlations.
Effective network representation. The connectivity matrix can be estimated using the correlation matrix . The adjacency matrix can be computed by thresholding the correlation matrix as , where is the Heaviside step function and is an arbitrary threshold. Note that we tested different thresholds with values from 0.3 to 0.6 and obtained similar results.
Finding communities. Given the reconstructed connectivity matrix , we used the modularity-based Louvain method [43] to detect communities.
Finding rich-club members. Colizza et al. developed an algorithm to detect members of the rich-club [42]. The algorithm provides a rich-club coefficient for each degree value in the network. We considered nodes with to be the members of the rich-club.
Predicting critical transitions
Once we reconstruct the relevant information, we can perform numerical analysis of the recovered equations to explore the dynamics of the systems outside the observed range of parameters. Here we explain how to gather the information for a theoretical prediction of the critical transition.
Burst synchronization. We introduce the slow variable as discussed in the main body of the manuscript. To introduce a phase variable for the dynamics we first smooth the time series [54]. Then, we find the time of local maxima as the th maximum point of the slow variable. We do this after performing a Gaussian filter to denoise slightly the data. Finally, we introduce the phase variable as
[TABLE]
as discussed in Refs. [55]
Reduction in the rich-club. Nodes in the rich-club have degrees of approximately and make connections inside the rich-club and connections to the rest of the network. Following our reduction scheme, the interactions within and outside the rich-club can be described by the expected value of the interactions with respect to the invariant measure associated with each of them. Let denote the set of nodes in the rich-club, then the coupling term for the th node in the rich-club is
[TABLE]
However,
[TABLE]
where is the invariant measure for the nodes outside the rich-club. Hence, for the rich-club we obtain
[TABLE]
where .
**Predicting the transition to collective behaviour ** Let us recall that when isolated where and . After some algebra we obtain
[TABLE]
Using the reduction Eq. (Predicting critical transitions), in the network we obtain
[TABLE]
where denotes the th nodes in the rich-club, is the mean membrane potential in the rich-club and are fluctuations. We fix two nodes and in the rich-club. Let us introduce the disturbance
[TABLE]
Applying the mean value theorem and using that we obtain
[TABLE]
and introducing a proxy for the dynamics of the slow variables
[TABLE]
and considering
[TABLE]
where we used that is a slow variable. We obtain
[TABLE]
Recall that for the cat cerebral cortex . We given the time series for , we estimate using our method as the slow variables are constants over short time scales, and the obtain slow variables as a filter over the fast variables. Then, the parameters of the rich-club such as the mean number of connections among the rich-club in terms of the reduction and external connections in terms of strength of fluctuations (We will provide a throughly discussion about the estimation of these parameters for a fully chaotic system in the Methods). From the data and using Eq. (Predicting critical transitions) we estimate and thus we obtain At this critical value the slow variables tend the stay together due to the contraction in the dynamics. This is related to the onset of synchronization in the bursts, which is captured via a phase variable through the order parameter.
Sparse Recovery. The method gives a way of recovering the equations from data by considering the evolution map as a linear combination of a well chosen basis of functions, called library. The main assumption is that many coefficients will be zero. That is, the vector of coefficients will be sparse [34]. Since we have nodes in our network, we consider the library as the set of basis functions. And denote the where ∗ denotes the transpose and we introduce
[TABLE]
We then look for a solution of the system where
[TABLE]
is the vector of coefficients. The sparse recovery technique then solves the linear equation for iteratively enforcing the sparsity of by introducing such that if we set such entry to zero [34].
In our case, we will consider the scenario where the synapsis between neurons are electric and we generate the multivariate data for the cat cerebral cortex. We will assume we have knowledge of the coupling function so we can easily read the network structure from the sparse recovery. We choose a library of polynomials as for and the pairwise to be homogeneous polynomials of degree two in the variables and for .
We solve the minimization problem and group dependencies of the network through the coefficient vector as the network structure [34]. The strength of each connection is of order . Hence we have chosen values of close to this value. The reconstructed network does not identity the clusters correctly as can be seen by comparing the blue and red markers in Figure 7. Moreover, employing the technique of community detection shows that the probability of identifying a node in the correct cluster is around 45%. This happens as each single connection is not strong enough. In fact, since the dynamics of the neurons are chaotic each connection plays a role of noise which leads to the misidentification. Thus we have shown that the sparse recovery method is not appropriate in our setting.
Acknowledgements We would like to thank Tomislav Stankovski, Chiranjit Mitra, Mauro Copelli, Dmitry Turaev and Jeroen Lamb for enlightening discussions. This work was supported in part by the Center for Research in Mathematics Applied to Industry (FAPESP Cemeai grant 2013/07375-0), the European Research Council (ERC AdG grant number 339523 RGDD), and the Serrapilheira Institute.
Data Availability. The connection matrices of cat cortex can be found at https://sites.google.com/site/bctnet/datasets. Connectivity matrix of Drosophila Medulla can be found at https://neurodata.io/project/connectomes/.
1 Dimensional Reduction in Heterogeneous Networks
Using the notation in the main body of the paper, we present an informal statement of the theoretical result that suggests the reconstruction procedure. For a precise statement and details of the general setting see [1]. The theorem has three main assumptions concerning the uncoupled dynamics , the network encoded in its adjacency matrix , and the reduced dynamics . We discuss a particular case and use it to illustrate the reconstruction step-by-step.
The local dynamics must increase the distance between points by a constant factor. A paradigmatic one-dimensional example is mod that presents some of the most interesting characteristics of chaotic maps.
- 2)
The networks are heterogeneous, having nodes with disparate degrees. This assumption is made precise in [1] by a set of conditions involving the size of the network , the number of hub nodes , the maximum degree of a hub node , and the maximum degree of a low degree node . One instance in which these conditions are satisfied and that includes many situations of interest is the following. Suppose is constant or slowly increasing with the size of the network, , grows faster than the square root of the system size, , and increases slowly, but polynomially in the system size, . Then for a sufficiently large , the network satisfies the conditions.
- 3)
The reduced dynamics must be hyperbolic. This means that we assume the maps to be either expanding (possibly by a non-constant factor), or to have a finite number of attracting periodic orbits. Every map can be perturbed by an arbitrarily small amount to obtain such an hyperbolic map [2].
Under these assumptions, we can prove the following approximation result
Theorem 1** ([1])**
For every hub node , the dynamics at the hub is given by
[TABLE]
where for time with , and a set of initial condition of measure , where is constant in , and .
Notice that one can pick the time scale exponentially large, but such that is very small so that, for large , the approximation result holds for very long time and for a large set of initial conditions. The reduced dynamics given by depends on the hub’s state only. The term gives the fluctuations of the sum of all interactions from their expectation with respect to the physical measure of the unperturbed dynamics.
2 Reconstruction of degree distributions
Scale-free networks. We create ensembles of scale-free network of nodes with distinct structural exponents following the same technique as in the main body of the manuscript. As in the main body, we only consider the largest connected component of the network (giant component). For each network realisation we compute the largest degree and, for simplicity, we fix the coupling strength throughout the rest of the section.
2.1 Doubling map
Let us now apply our approach when is a perturbed version of the doubling map. Since the dynamics is one dimensional, we denote and with
[TABLE]
and where we take to be i.i.d. random variables (i.e. independent over ) uniformly distributed on . Likewise we write with
[TABLE]
We fixed Since the unique absolutely continuous invariant measure for the doubling map is the Lebesgue measure , we have
[TABLE]
yielding and the reduced dynamics takes the form
[TABLE]
We aim to recover the reduced dynamics and in particular and from the data of a multivariate time series with time steps.
From data to Model. We assume not to have access to the network structure and only measure the time-series at each node, as illustrated in Figure 8 Data. By performing the steps described in the main body of the manuscript, we can recover an effective network. We now illustrate in detail our procedure in the case above of the doubling map.
(i) Reduced dynamics. From the time series observed at each node, we construct the attractor for that particular node. Computing the embedding dimension we notice that the dynamic at each site (node) is well described by a one dimensional map, , up to a fluctuation . Hence, to reconstruct the attractor it suffices to obtain the return map. Return maps at different nodes are shown in Figure 8 (i).
(ii) Isolated dynamics and effective coupling. We start by introducing a similarity measure.
Similarity of the time series. Two time series are considered almost the same, if whenever and are close then and are also close. This means that one considers the new times series , , , reshuffle them according to the lexicographical ordering (i.e. according to the magnitude of the first coordinate), and then take to be the Pearson correlation distance between these reshuffled sequences. So if the time series at and are just out of phase, then, thanks to the reordering, their distance . We then calculate the intensities and use this to classify the nodes. The similarity matrix is shown in Figure 8(ii).
Using the correlation distance described above, we determine for which nodes is minimal and in this way we identify the low degree nodes. For the low degree nodes, the sequence lies close to the graph of the return map for , and thus we obtain a good estimate for the isolated dynamics , see Figure 8(ii). Next, we apply the reduction to estimate the coupling function.
To obtain the effective coupling from data, take a hub and consider the sequence . The resulting time series approximates the graph of and subtracting gives an approximation of the function up to a multiplicative constant (shown in Figure 8(ii)).
(iii) Network Structural Statistics. There are two ways to obtain information about the statistics of the degrees . The first one uses the noise variance. In fact, the size of the fluctuations depends on the number of connections the node makes, and with good approximation Var. The second one uses and recovered at the previous step. For every node , choose to fit the time series with the map
[TABLE]
The value of is obtained by Bayesian inference on the return maps is shown in the first panel of Figure 8(i) as and . The distribution of will be linear proportional to the degree distribution and so we can use it to obtain the the structural parameter. At this point we can construct an effective network with degree distribution close to that of the real network can be constructed with the configuration model as discussed in the main body of the manuscript. We can also check whether there are communities in the network by analyzing the covariance of the fluctuations .
In Figure 10 a) we reconstruct from the distribution of the structural exponent for 1000 scale-free network with exponents ranging from to
2.1.1 Robustness of the reconstruction under noise
Adding some small independent noise to the dynamics does not influence much the reconstruction procedure for the doubling map. This is a consequence of stochastic stability of the local dynamics together with the persistence of the reduction [1]. On the other hand, when the fluctuations become too large the reconstruction will underestimate the network structure. To illustrate these effects, we consider the randomly perturbed doubling maps
[TABLE]
where the random variables are independent over and , and identically distributed uniformly in the interval . Intuitively, as long as the reconstruction will go through as the noise fluctuation will not compete with the coupling term. Notice that we normalize . This is illustrated in Figure 9. In inset a) the noise has a large support , and in particular larger than the coupling . As a consequence, the reconstruction understimate the number of low degree nodes. In inset b) the noise has a small support .
2.2 Logistic Maps on Scale-Free Networks
Consider , and where
[TABLE]
It is known that for such parameter, the map has an absolutely continuous invariant probability measure. In contrast to the doubling map, a small perturbation of the logistic map might result in a map that has a non-smooth physical measure, namely the measure is a distribution given by the sum of Dirac delta masses at a finite number of points that constitute the periodic orbits of the system. This compromises the validity of the rigorous result analysis that we used in the case of the doubling map, for which small perturbations produced small changes in the invariant measure [2]. Nonetheless, as we will show, the reconstruction analysis gives good results in this case as well. We believe that this happens because our reconstruction only needs a finite number of points and gives a representation of the dynamics over finite time-scales. As a coupling function, we consider
[TABLE]
and the reduction is given by where is defined as above. Again, we consider the reduced model in terms of the free parameter . Analysing the distribution of we access the distribution of the degrees. Results are presented in Figure10b).
2.3 Spiking Neurons with Electrical Synapses
We use the same models for the local dynamics of the spiking neurons which are described in the section Methods. We consider here a different coupling function. Denoting , the coupling is
[TABLE]
Results of the reconstruction are presented in Figure 10c).
2.4 Henon Maps on Scale-Free Networks
Using the same notation, , the coupled Hénon maps are given by
[TABLE]
We observe the dynamics of the first component of the Hénon map, that is, . In this case, the reconstruction starts by determining the dimension of the reduced system. Takens embedding reveals that the dimension is two for large time excursions. Hence, we aim at learning a function of the kind
[TABLE]
We use polynomial functions for the fitting and perform a 10-fold cross-validation. Just our theory implies that
[TABLE]
where models the isolated dynamics and the coupling. Again we obtain from the low-degree nodes via a similarity analysis. We learn the function by
[TABLE]
In our case, and we can obtain the degree distributions, Fig. 10d).
2.5 Coupled Roessler Oscillators
Assume that the local dynamics is modelled by a Rössler oscillator [3]. The dynamics is now in continuous time and our method can also be applied by using a suitable Poincaré section. This gives an induced map that describes the dynamics of the system at specific instants of time (when the system hits a selected subset of phase space). Denoting the vector field is given by and the coupling function, assumed to be diffusive, is given by , where projects to the first component, i.e., . So, our main equation reads as
[TABLE]
We perform a numerical integration of the equations on the Rich-Club network using a 4th order Runge-Kutta with integration step and get the data . Using a statistical analysis of the time-series of the state variables, we are not able to reveal the connectivity structure.
The data is phase coherent, that is, taking a Hilbert Transform we can decompose the time series in terms of amplitude and phase we conclude that the spread in the phase variable is small and thus the return time to a given section is nearly constant. So, we consider the Poincaré section defined by the maxima of the time series . This gives us a time series indexing all maxima. We then apply all the steps of the reconstruction procedure to this time series. Because of the coherent dynamics of the phase, the coupling form is preserved in the Poincaré section. The results of the network structure estimation are presented in Figure 11.
3 Reconstruction of Rich-Club Mofits
We report the performance of the method in the setting of a network of nodes having five clusters of nodes each. Four of these clusters are modelled as Erdös-Renyi random graph with connection probability . The remaining cluster is the integrating clusters with connection probability . The network resembles Fig. 12.
Filtering out the deterministic chaos. We need to filter the contributions of the deterministic parts to reconstruct the community structure. Indeed, for two nodes and in the same cluster, the signals have the form and where and is a superposition of the deterministic chaos depending on the variable at the node and independent fluctuations coming from the rest of the network, while is common noise. and have fast decay of correlations, depends on different sets of variables, and for the sake of the following argument, can be assumed to be independent between each other and with . Under these assumptions, . Hence, the large values of the variance of the time series leads to strong suppression of the correlation coming from the small common noise .
3.1 Condition for recovering the community
In general, the coupling function is a sum of terms This leads to noise terms
[TABLE]
where is the physical measure of the local dynamics. Given and the sum can be split into connections commons the and and to the independent connections. So, for such and we can write
[TABLE]
where is the noise due to the common connections. Notice that has zero mean. Let’s estimate the covariance of the component and . By abuse of notation we will omit the time index and write the covariance and by the previous computations After some manipulations, we obtain
[TABLE]
so, if the correlation between the noise will vanish even though they have a common term. Thus, the generic condition in order for the above scheme to be able to recover communities is that . If this condition is not met, the network reconstruction via the ’s is also not possible. We recall that is not a generic condition and this is why the effective network approach works in most cases.
3.2 Doubling Map on Rich Club Network
Using the doubling map with the diffusive coupling described in the above section on the reconstruction for scale-free networks, we test the community detection from time-series using the correlation between time series and correlations between the noise . Then we apply our technique to recover the network structure. We show the results in Fig. 13
3.3 Logistic Maps on a Rich Club Network
The dynamic of the logistic map and the coupling are described in the above section on the reconstruction for scale-free networks. We test the community detection approach on time-series using the correlation between time series and correlations between the noise. To this end, we fix the coupling strength and simulate the network dynamics. Then we apply our technique to recover the network structure. We show the results in Fig. 14.
3.4 Spiking neurons coupled with Electric Synapsis on a Rich-Club
Again, we consider the spiking neurons and the electrical coupling described above on the same rich-club motif as before. The reconstruction and analysis of the noise correlation is able to detect the rich-club clusters. To this end we fix the coupling strength and simulate the network dynamics. Then we apply our technique to recover the network structure. We show the results in Fig. 15.
3.5 Bursting neurons coupled with Electric Synapsis on a Rich-Club
Our technique applies equally well then the local dynamics has multiple time-scales such as a bursting neuron. Our extensive numerical investigation reveals that when the resting time is not much larger then the total bursting time the reduced dynamics is capable of extracting the relevant information of the time series. Thus, we fixed the neuron parameter to obtain a bursting dynamics. The reconstruction and analysis of the noise correlation is able to detect the rich-club clusters. To this end we fix the coupling strength and simulate the network dynamics. Then we apply our technique to recover the network structure. We show the results in Fig. 16
4 Predicting Critical Transitions in Rich-Club Networks
We present another example of how to use the effective network methodology to predict critical transitions. In this case we choose the doubling map for the local dynamics coupled on a rich-club network with clusters sampled as Erdös-Renyi graphs. Consider a rich club network of 2200 elements with 5 clusters. Four of the clusters are made of nodes with small degrees, and one cluster (called integrating cluster or rich club) has nodes which are connected with most of the network. The edges within a cluster of low degree nodes are assigned as described above.
As a model for the isolated dynamics, we use the doubling map mod 1 with the diffusive coupling where we picked .
We use the function
[TABLE]
as an empirical measure of the synchronization level at time . From a single multivariate time-series, when the coupling is fixed at (red marker Figure 17), the analysis of gives no sign of critical transitions. A statistical analysis shows that the variance of the averaged synchronization error is not amplified, and the extreme value statistics fails to reveal a transition as it also does an analysis of dynamical correlations.
As described above, we can use the effective network methodology to recover the local dynamics and the effective coupling . In particular we recover the isolated dynamics within accuracy for the poorly connected nodes. The unique physical equilibrium measure for this map is the uniform distribution on . Furthermore, we recover the effective coupling
[TABLE]
At this point, we know that the approximate evolution of any node is given by , and we can study numerically or analitically this rule to find those values of for which the integrating cluster exhibits synchrony. The analysis shows that excursions towards synchronization will start once the coupling is increased by (see Figure 17). Below we provide the details on how to recover this critical value of for which a transition arises.
Reduction in the Integrating Cluster. Nodes in the integrating cluster have roughly degree and make connections inside the integrating cluster and to the rest of the network.The interactions felt by a node in the rich-club can be split in those coming from nodes in other clusters and those coming from nodes within rich-club itself:
[TABLE]
But,
[TABLE]
where is the invariant measure for the nodes outside the integrating cluster 111In the example above is the Lebesgue measure .. Hence, the equation of the integrating cluster can be written as
[TABLE]
where .
We can estimate empirically analyzing each cluster. Assuming that , we can recover from the analysis of using the reconstruction techniques. In fact, for this particular choice of , is equal to plus a constant. Now if is the measure that describes the behaviour of a node in the integrating cluster, then the interaction within the rich-club can be written as
[TABLE]
Putting the two equations together one has that
[TABLE]
where models the reduced dynamics and the is the mean contribution from all interactions (inside and outside the integrating cluster). Finally combines the effect of the fluctuations.
Model from Data. From a single multivariate time series at a given coupling parameter, shown in Figure 17 as a red dot, we reconstruct the model. First, we obtain the the rule which we uncover to be mod , hence and this number is nearly independent of the node in the integrating cluster. Hence, we obtain an estimate . We also obtain that the dynamics of nodes in the communities is well approximated by mod .
Next, we need to estimate to construct a model for the connectivity of the integrating cluster. From the recovered local dynamics , one knows that is the Lebesgue measure and, since , we obtain . In the regime of parameters where the measurements have been made, can be obtained empirically and this allows to recover since
[TABLE]
where we evaluate with respect to the empirically retrieved , we substitute with the estimated value above, and denotes the time average. We obtain that . The data analysis reveals that such value is nearly independent of the node in the integrating cluster. Moreover, the analysis of the covariance of the noise in the integrating clusters shows a lack of communities and the functional analysis network indicates that the integrating cluster of is a random network 200 nodes with . From this analysis, we obtain an estimate for . We are now able to estimate by .
Synchronization Prediction. With the reconstructed data , we obtain that for a node in the integrating cluster, (11) reads as
[TABLE]
When , the map mod has an attracting fixed point at 0. In this new regime, is a self-consistent measure for the integrating cluster, meaning that the measure gives rise to an approximated dynamical rule that has as equilibrium measure. This can be easily verified since . We can then conclude that by selecting a value of the coupling strength such that is in the range above, one expects all the states at the nodes in the integrating cluster to evolve towards the point [math] and fluctuate around this point by .
To obtain the range of such that the fluctuations of are around twice , we notice that since the stability properties are given by the linear stability around . Let be a small displacement around [math] and let us denote at the Jacobian of the map at [math]. Then we obtain that and so , which yields that . Hence, we obtain . Because we measured as in the data given we predict that a high quality coherent state in the rich-clubwill appear then is increased by . This is a agreement with the experiments.
5 Cat cerebral cortex
5.1 Bursting neurons with Chemical Synapesis
We simulated each mesoregion of the cat cerebral cortex network with bursting Rulkov oscillators coupled through chemical synapsis. For such dynamics the parameters are given as and where there is no synchrony between oscillators Fig. 18. We use the simulated data to reconstruct the network structure as shown in Figure 18.
References
- [1] Pereira, T., van Strien, S., & Tanzi, M. Heterogeneously coupled maps: hub dynamics and emergence across connectivity layers. J. Eur. Math. Soc., preprint: arXiv 1704.06163, 1–63 (2017)
- [2] de Melo, W., and Van Strien, S., One-dimensional dynamics. Springer (2012).
- [3] Rössler, O. E. An equation for continuous chaos. Physics Letters A 57, no. 5 (1976): 397-398.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Kandel, E.R., Schwartz, J.H. & Jessell, T.M. eds., Principles of neural science , Vol. 4. (New York: Mc Graw-hill, 2000).
- 2[2] Bohland, J. W. et al. A proposal for a coordinated effort for the determination of brainwide neuroanatomical connectivity in model organisms at a mesoscopic scale. P Lo S Computational Biology 5 , e 1000334 (2009).
- 3[3] De La Fuente, A., Bing, N., Hoeschele, I., & Mendes, P. Discovery of meaningful associations in genomic data using partial correlation coefficients. Bioinformatics , 20(18) , 3565-3574 (2004).
- 4[4] Reverter, A., & Chan, E. K. Combining partial correlation and an information theory approach to the reversed engineering of gene co-expression networks. Bioinformatics , 24(21) , 2491-2497 (2008).
- 5[5] Butte, A. J., & Kohane, I. S. Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements. In Biocomputing 2000 (pp. 418-429), (1999).
- 6[6] Braunstein, A., Pagnani, A., Weigt, M., & Zecchina, R. Inference algorithms for gene networks: a statistical mechanics analysis. Journal of Statistical Mechanics: Theory and Experiment , 2008(12) , P 12001 (2008).
- 7[7] Cocco, S., Leibler, S., & Monasson, R. Neuronal couplings between retinal ganglion cells inferred by efficient inverse statistical physics methods. Proc. Natl. Acad. Sci. USA , 106 (33), 14058-14062 (2009).
- 8[8] Bressler, S. L., & Seth, A. K. Wiener–Granger causality: a well established methodology. Neuroimage , 58 (2), 323-329 (2011).
