Combining Bayesian Approaches and Evolutionary Techniques for the Inference of Breast Cancer Networks
Stefano Beretta, Mauro Castelli, Ivo Goncalves, Ivan Merelli, and Daniele Ramazzotti

TL;DR
This paper explores the use of Bayesian Graphical Models combined with evolutionary heuristics to improve the inference of gene and protein networks in breast cancer, addressing challenges of large-scale data and causal relationship detection.
Contribution
It introduces a comparative analysis of state-of-the-art heuristics for Bayesian network inference specifically applied to breast cancer data.
Findings
Different heuristics show varying performance in network inference.
Bayesian approaches effectively handle small sample sizes.
The study highlights the importance of combining methods for better results.
Abstract
Gene and protein networks are very important to model complex large-scale systems in molecular biology. Inferring or reverseengineering such networks can be defined as the process of identifying gene/protein interactions from experimental data through computational analysis. However, this task is typically complicated by the enormously large scale of the unknowns in a rather small sample size. Furthermore, when the goal is to study causal relationships within the network, tools capable of overcoming the limitations of correlation networks are required. In this work, we make use of Bayesian Graphical Models to attach this problem and, specifically, we perform a comparative study of different state-of-the-art heuristics, analyzing their performance in inferring the structure of the Bayesian Network from breast cancer data.
| Heuristic Search Algorithm | Regularizators | ||
|---|---|---|---|
| Hill Climbing (HC) | loglik | AIC | BIC |
| Tabu Search (TB) | loglik | AIC | BIC |
| Genetic Algoritms (GA) | loglik | AIC | BIC |
| Serum | PBS | NRG1 | Insulin | IGF1 | HGF | FGF1 | EGF | |
| BT20 | ||||||||
| BT549 | ||||||||
| MCF7 | ||||||||
| UACC812 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Combining Bayesian Approaches and Evolutionary Techniques for the Inference of Breast Cancer Networks
Stefano Beretta1, Mauro Castelli2, Ivo Gonçalves2, Ivan Merelli3 and Daniele Ramazzotti4
1**DISCo, Universitá degli Studi di Milano Bicocca. 20126 Milano, Italy
2**NOVA IMS, Universidade Nova de Lisboa. 1070-312 Lisboa, Portugal
3**Ist. di Tecnologie Biomediche, Consiglio Nazionale delle Ricerche, Segrate, Italy
4**Department of Pathology, Stanford University
[email protected], {mcastelli,igoncalves}@novaims.unl.pt, [email protected], [email protected]
Abstract
Gene and protein networks are very important to model complex large-scale systems in molecular biology. Inferring or reverse‐engineering such networks can be defined as the process of identifying gene/protein interactions from experimental data through computational analysis. However, this task is typically complicated by the enormously large scale of the unknowns in a rather small sample size. Furthermore, when the goal is to study causal relationships within the network, tools capable of overcoming the limitations of correlation networks are required. In this work, we make use of Bayesian Graphical Models to attach this problem and, specifically, we perform a comparative study of different state-of-the-art heuristics, analyzing their performance in inferring the structure of the Bayesian Network from breast cancer data.
1 INTRODUCTION
Molecular networks are essential for every biological process, since genes and proteins are able to carry out their function only in precisely regulated pathways. For this reason, data-driven learning of regulatory connections in molecular networks has long been a key topic in computational biology (Bansal et al.,, 2007). The general problem is to infer, or reverse-engineer, from gene or protein expression data, the regulatory interactions among these biological entities using computational algorithms.
In this context, despite correlation networks are widely used for gene expression and proteomic data analysis, it is known that correlations not only confound direct and indirect associations, but also provide no means to distinguish between cause and effect. For causal analysis the inference of a directed graphical model is typically required. However, this task is rather difficult due to multiple theoretical and practical reasons, among which, but not limited to, the course of dimensionality (Pearl,, 2003).
Therefore, causal analysis requires tools capable of overcoming the limitations of correlation networks: much of the work in this area has focused on Bayesian Networks (Pearl,, 2003) or related regression models, such as systems of recursive equations or influence diagrams. All these models describe causal relations by an underlying directed acyclic graph (DAG). Nevertheless, it remains unclear whether causal, rather than merely correlational, relationships in molecular networks can be inferred in complex biological settings.
Moreover, the problem is typically complicated by the enormously large scale of the unknowns in a rather small sample size. Furthermore, data is prone to experimental defects and noisy readings, while many other biases can compromise the quality of the results. These complexities call for a heavy involvement of powerful mathematical models which play an increasingly important role in this research area (Kabir et al.,, 2010). In order to assess the ability of different tools to learn causal networks, the Dialogue for Reverse Engineering Assessment and Methods (DREAM) project has run several challenges focused on network inferences (Stolovitzky et al.,, 2007). In particular, we focused on (sub)-challenge 8.1 concerning Human Protein Networks (HPN) in cancer cell lines, which is about the inference of causal signalling pathways using time-course data with perturbations on network nodes. This sub-challenge was split into two independent parts, concerning Breast Cancer proteomic data and in silico data.
Different types of models, such as directed graphs, Boolean networks (Akutsu et al.,, 1999), Bayesian Graphical Models (Zou and Conzen,, 2005), and various differential models have been used to describe gene regulations at various levels of detail and complexity. The choice of the model is often determined by how much information it tries to capture, taking into account that the more information a model attempts to infer, the more parameters are needed to learn it, and the more complex the overall approach becomes. Specifically, researchers have paid great attention to Bayesian Networks, which can compactly model dependency relationships between variables relying on probabilistic measures. Since gene expression experiments are subject to many measurement errors, the use of statistical methods is expected to be effective for extracting useful information from such noisy data. Friedman et al. (Friedman et al.,, 2000) proposed both discrete and continuous Bayesian network models relying on linear regression for inferring gene networks. Imoto et al. (Imoto et al.,, 2001) succeeded in employing non-parametric regressions for capturing even non-linear relationships between genes.
In this work, we perform a comparative study of different heuristics at the state-of-the-art to perform the task of inferring the structure of a Bayesian network from breast cancer data. The paper is structured as follows: Section 2 provides a background of the biological problem under exam; Section 3 gives a formal definition of the problem addressed in this study, along with a description of the different computational and statistical machineries that we are adopting, and of the input data. Afterwards, the results of the described methods on real and simulated data are presented and discussed in Section 4. Section 5 concludes the paper and suggests avenues for future research.
2 BIOLOGICAL BACKGROUND
Many biological processes are carried out by interactions between proteins, RNA, and DNA. Cells respond to their environment by activating signalling networks that trigger processes such as growth, survival, apoptosis (programmed cell death), and migration. Post-translational modifications, notably phosphorylation, play a key role in these signalling events. In cancer cells, signalling networks frequently become compromised, leading to abnormal behaviours and responses to external stimuli. Endogenous signal transduction in cancer cells is systematically disturbed to redirect the cellular decisions from differentiation and apoptosis to proliferation and, later, invasion. Cancer cells acquire their malignancy through accumulation of advantageous gene mutations by which the necessary steps to malignancy are obtained. These selfish adaptations to independence can be described as a result from an evolutionary process of diversity and selection (Schramm et al.,, 2010).
Many current and emerging cancer treatments are designed to block nodes in signalling networks, thereby altering signalling cascades. Although there is a wealth of literature describing canonical cell signalling networks, little is known about exactly how these networks operate in different cancer cells. Advancing our understanding of how these networks are deregulated across cancer cells will ultimately lead to more effective treatment strategies for patients.
Recently, high-throughput analysis enabled the possibility to obtain genome-wide information, such as mRNA expressions, protein-protein interactions, protein localizations and so on. A lot of attention has been dedicated on developing computational methods for extracting valuable information of molecular networks from such various types of genomic data.
Currently, statistical models for estimating gene regulatory networks from genomic data are mainly based on expression data from DNA microarrays or RNA-seq experiments. However, since information from these approaches is limited by their quality, noise and experimental errors, sophisticated mathematical approaches are necessary for estimating gene regulatory networks accurately.
On the other hand, protein-protein interaction networks are mainly constructed relying on observed protein-protein interaction data, using approaches such the two hybrid assays, tandem affinity purification experiments and, more recently, protein arrays. However, protein-protein interaction data often contains some errors, making even more difficult to construct comprehensive protein-protein interaction networks from these interaction data alone.
3 METHODS
A Bayesian Network (BN) is a statistical graphical model that represents a joint distribution over random variables and encodes it by means of a direct acyclic graph (DAG) depicting the nodes referring to the variables. More formally, we define a BN as a direct acyclic graph , where is the set containing the random variables and is the set of the directed arcs over them, representing any conditional dependence among the variables (Parsons,, 2011).
In this work, we make use of such graphical tool to model a protein network (being a direct acyclic graph), whose structure (i.e., the nodes and arcs in the model) maximizes the likelihood, given the observed data on which we make the inference. Moreover, we define this task as an optimization problem where, for a set of observations , we aim at maximizing the likelihood of observing the data given a specific model , which we define as
[TABLE]
that is the product of the conditional probabilities given each observation .
Practically, however, there is a well-known issue when learning the network structure by maximizing the likelihood function. In fact, for any arbitrary set of data, the most likely graph is usually very connected, since adding an edge typically can only increase the likelihood of the data, hence leading to overfitting. To try to reduce this problem, the likelihood is almost always adjusted by means of a regularization term that penalizes the complexity of the model (Parsons,, 2011).
We also observe that, regardless of the adopted approach and likelihood score, the main issue to infer the structure of a BN is the huge search space of the valid solutions, which makes this a well known NP-hard problem and, therefore, one will need to make use of heuristics to perform such inference (Parsons,, 2011).
In this work, we compare different heuristics search algorithms along with various regularizations for the likelihood score. In Table 1 we present a list of combinations of the adopted techniques, which are described in details in the subsequent sections.
Here we employ three different and well-known evolutionary methods to solve the previously mentioned optimization problem, that is to reconstruct the Bayesian network w.r.t. to a specific regularization score. In the rest of this section we briefly describe each method and also the considered regularizators.
3.1 Hill climbing
Hill Climbing (HC) is one of the simplest iterative techniques that have been proposed for solving optimization problems. While HC consists of a simple and intuitive sequence of steps, it is a good search technique to be used as a baseline for comparing the performance of more advanced optimization techniques.
Hill climbing shares with other techniques (like simulated annealing (Hwang,, 1988) and tabu search (Glover,, 1989)) the concept of neighbourhood. Search methods based on this latter concept are iterative procedures in which a neighbourhood is defined for each feasible solution , and the next solution is searched among the solutions in . Hence, the neighbourhood is a function that assigns at each solution in the search space a (non-empty) subset of . In our case, every solution is modelled as an adjacency matrix, where an entry is if in the current solution an arc is present from node to node , and is [math] otherwise.
The sequence of steps of the hill climbing algorithm, for a minimization problem w.r.t. a given objective function , are the following:
choose an initial solution in ; 2. 2.
find the best solution in (i.e., the solution such that for every in ; 3. 3.
if , then stop; else set and go to Step 2.
To counteract the main limitation of hill climbing (i.e., getting trapped in a local optimum), more advanced neighbourhood search methods have been defined. The following section presents the Tabu Search method, a popular and effective optimization technique that uses the concept of “memory”.
3.2 Tabu search
As described in the original work of Glover (Glover,, 1989), Tabu Search (TS) is a meta-heuristic that guides a local heuristic search procedure to explore the solution space beyond local optimality. One of the main components of this method is the use of an adaptive memory, which creates a more flexible search behaviour. Memory-based strategies are therefore the main feature of TS approaches, founded on a quest for “integrating principles”, by which alternative forms of memory are appropriately combined with effective strategies for exploiting them.
Tabus are one of the distinctive elements of TS when compared to hill climbing or other local search methods. The main idea in considering tabus is to prevent cycling when moving away from local optima through non-improving moves. When this situation occurs, something needs to be done to prevent the search from tracing back its steps to where it came from. This is achieved by declaring tabu (disallowing) moves that reverse the effect of recent moves. For instance, let us consider a problem where solutions are binary strings of a prefixed length and the neighbourhood of a solution consists of the solutions that can be obtained from by flipping only one of its bits. In this scenario, if a solution has been obtained from a solution by changing one bit , it is possible to declare a tabu to avoid to flip back the same bit of for some number of iterations (this number is called the tabu tenure of the move). Tabus are also useful to help the search move away from previously visited portions of the search space and, thus, perform more extensive exploration.
The basic TS algorithm is reported, considering the minimization of the objective function , as follows:
randomly select an initial solution in the search space , and set and , where is the best solution so far and the iteration counter; 2. 2.
set and generate the subset of the admissible neighbourhood solutions of (i.e., non-tabu or allowed by aspiration); 3. 3.
choose the best in and set ; 4. 4.
if , then set ; 5. 5.
update tabu and aspiration conditions; 6. 6.
if a stopping condition is met then stop; else go to Step 2.
Commonly used conditions to end the algorithm are when the number of iterations () is larger than the maximum number of allowed iterations, or if no changes to the best solution have been performed in the last iterations (as in our tests).
3.3 Genetic Algorithm
Genetic Algorithms (GAs) are a class of computational models that mimic the process of natural evolution (Goldberg and Holland,, 1988). GAs are often considered as function optimizers although the range of problems to which genetic algorithms have been applied is quite broad. Although different variants exist, most of the methods called “GAs” have at least the following elements in common: populations of chromosomes, selection according to a fitness function, crossover to produce new offspring, and random mutation of new offspring.
One of the most important issues when using the GAs to solve an optimization problem is the way to encode the candidate solutions, that is the individuals in the population, and also the genetic operators (crossover and mutation). Since, this aspect strongly depends on the specific problem, here we describe how GAs have been used to build a Bayesian Network. A candidate solution is represented as a string of length equal to , being the number of nodes of the network. Each position can be either [math] or , and the information represents the existence of a connection among node and node , where the operator denotes the integer division, while the operators denotes the rest of the division between and . As an example, in a network with nodes means that there is a node between node 1 () and node 2 (). Nodes are numbered from [math] to .
To produce admissible solutions (i.e., in our domain a network without loops), it is fundamental to redefine the classical crossover and mutation operators. More precisely, we developed a simple but efficient method that guarantees that crossover and mutation will produce Bayesian Networks without loops. To achieve this goal we associated to each solution two lists, called forward list and backward list. The two lists maintain, for each node , the forward links (i.e., the set of nodes for which a connection from to exists) and the backward links (i.e., the set of nodes for which a connection from to exists). By using these two linked lists it is simple to assess if a new connection between two nodes can be created. In detail, let us assume that the algorithm needs to evaluate whether it is possible to add a connection between nodes and (with being the origin and the destination node of the connection). In this scenario, it is necessary to iteratively scan all the elements in the backward list of and check if in their backward lists is present. In this case it would be impossible to create a connection between and without entering a loop in the structure of the network. In the same way, it is necessary to iteratively scan all the elements in the forward list of and check if in their forward lists is present. Also in this case, the creation of the connection from to will introduce a loop in the network.
Hence, the proposed crossover operator works as follows:
choose two individuals and as parents, based on tournament selection; 2. 2.
select a single crossover point for both the parents; 3. 3.
for every locus before that point set and ; 4. 4.
for every locus beyond that point for which is equal to , set and ; 5. 5.
for every locus beyond that point for which is different from , do the following:
- •
if , then set and set if and only if it is possible to create a connection between node and node (set in the opposite case);
- •
if , then set and set if and only if it is possible to create a connection between node and node (set in the opposite case);
- •
update the forward and the backward lists.
The mutation operator we proposed works as follows:
for each locus of an individual generate a random number from a uniform distribution. If (where is the mutation probability) then select the locus for mutation; 2. 2.
if , then set and update the forward and backward lists; 3. 3.
in the opposite case (), check if it is possible to create a connection between node and node . If the connection does not introduce a loop set and update the data structures, else will remain equal to [math].
The genetic operators described above ensure that the constraint related to the absence of loops is always satisfied. Moreover, this allows the GA to avoid to reject a high number of individuals that do not respect the constraint. This will result in a beneficial effect on the execution time of the algorithm.
3.4 Regularizators
As already mentioned, we make use of various likelihood scores as fitness functions for the inference of the network. Such scores, namely loglik, AIC, and BIC, are implemented by using the bnlearn R package (Scutari,, 2009).
Specifically, we first considered the log-likelihood score (loglik), that is the logarithm of the previously mentioned likelihood score. Then, as regularized log-likelihood scores, we used the Akaike Information Criterion (AIC) (Akaike,, 1992) and the Bayesian Information Criterion (BIC) (Schwarz et al.,, 1978).
To extend this scores in order to model continuous random variables, we adopt the multivariate Gaussian implementation of the log-likelihood score (see (Parsons,, 2011) for a formal definition of the scores and (Scutari,, 2009) for the adopted implementation).
4 RESULTS
To assess the performance of the different approaches and regularizators, we have considered the HPN-DREAM breast cancer network inference challenge. This challenge comprises three sub-challenges, and we focused on the first one (Sub-challenge 1). This sub-challenge consists of two distinct parts: the first one (Sub-challenge 1A) aims at inferring causal signalling networks using protein time-course data. The task spanned different contexts, each defined by a combination of cell lines and stimuli, which focus on networks with specific genetic and epigenetic background. Since for these datasets the real network is unknown, beside training data, further data (not used during the inference) are available to assess the causal validity of the inferred networks. The second part (Sub-challenge 1B) comprises in silico data task and also focused on causal networks. Anyway, differently from the former one, the use of a-priori biology knowledge to design the network is not allowed. Since for this sub-challenge the protein network is known, the evaluation of the achieved results can be performed by directly comparing the computed network with the original one.
More in details, the datasets of Sub-challenge 1A (“real data”) were generated using Reverse Phase Protein Array (RPPA) quantitative proteomics technology. RPPA is a protein array designed as a micro- or nano-scaled dot-blot platform that allows the simultaneous measurement of protein expression levels in a large number of biological samples in a quantitative manner, when high-quality antibodies are available (Spurrier et al.,, 2008). This challenge focuses on about phosphoproteins (proteins phosphorylated at specific sites). Protein abundance may be influenced by multiple dynamical processes operating over multiple time-scales. This challenge does not focus on long-term changes over days (e.g. rewiring of networks due to epigenetic changes brought about by perturbation), hence data comprises protein time-course data up to 4 hours after ligand stimulation. Time-course data were acquired under ligand stimuli and inhibition of network nodes by one of inhibitors plus the vehicle control (cells were serum-starved and pre-treated with inhibitor prior to ligand stimulation). The experiment was carried out on breast cancer cell lines (namely, BT20, BT549, MCF7, and UACC812), with abundance of the phosphoproteins measured at time points post-stimulus. Data are normalized protein abundance measurements on a linear scale. Table 2 shows the different processed datasets, obtained by each combination of cell/stimulus, and their compositions, which are the expression levels of the considered phosphoproteins with different inhibitors at consecutive time points.
On the other hand, the in silico challenge aims to mimic the key aspects of the RPPA experimental set up and the characteristics of the proteomic data, but using a state-of-the-art dynamical model of signalling. This allows the assessment of inferred networks and predicted trajectories against a true gold standard. A computational signalling model was used to generate time-courses of phosphoprotein abundance levels. The model describes the biochemistry underlying a realistic signalling network. Data were generated for combinations of 2 ligand stimuli (each one at concentrations, denoted to as “lo” and “hi”) and inhibitors, or no inhibitor (as for the experimental data described above, cells were pre-incubated with the inhibitor prior to ligand stimulation). For each condition, a time-course of phosphoprotein levels is provided at time points post-stimulus. It must be noticed that phosphoprotein names have been anonymized so that detailed prior information from canonical signalling pathways cannot be used. Efforts have been made to model the antibody-based readout of the RPPA platform and its technical variability in a faithful manner. Three technical replicates are provided per condition. Data provided to participants are protein abundance measurements on a linear scale. In this task, a single network should be inferred in contrast to the proteomic data challenge that requires 32 networks.
Following the approach used to evaluate the results submitted to the challenge, we have considered the same method to assess the performance of our predictions. More precisely, in real data, for any given context, the set of nodes that showed salient changes under a test inhibitor (here an mTOR inhibitor) relative to the control was identified. These “gold-standard” sets are derived from (held-out) experimental data and should not be regarded as representing a fully definitive ground truth. For each predicted network, the set of mTOR descendants is predicted and compared against the experimental one to obtain the area under the receiver operating characteristic curve (AUROC) score (Hill et al.,, 2016). Results are ranked in each of the contexts by AUROC score, and the mean rank across contexts was used to provide an overall score and a final ranking. For the in silico data task, the true causal network was known and it was used to obtain an AUROC score for each predicted network. This score has been considered to determine the final ranking.
By analysing the mean AUROC values computed on the predictions on the real datasets, which are reported as bars in the plots in Figure 1, it is possible to observe that all the tested approaches have similar performance, with mean values around .
Anyway, when looking more in details on each of the datasets, we can draw more accurate considerations about the behaviour of the tested techniques. In particular, as showed in the heatmap in Figure 2(a), on the processed datasets we have obtained AUROC values ranging from to . As corroborated by several studies present in literature, these results highlight the fact that HC (hill climbing) and TB (tabu search) have almost the same behaviour, also, w.r.t. the considered regularizator, on the majority of the datasets. On the other hand, GA (genetic algorithm) presents slightly different results than those obtained by the other two methods and, moreover, it seems that the results are affected by the considered regularizator. Interestingly, when looking at the in silico AUC values, we can observe that, for each regularizator, HC and TB perform better on the in silico dataset, while GA is slightly worse; the opposite situation is observed in the real datasets, where the latter method (i.e. genetic algorithms) achieves better results with respect to the two former techniques (i.e., hill climbing and tabu search). The scatter plot in Figure 2(b) shows a comparison of the mean AUC results on the in silico dataset against the AUROC mean values on the real datasets obtained with all the employed approaches.
To assess the quality of the obtained results, we performed a comparison with those obtained by the participants of the challenge. More precisely, as reported in (Hill et al.,, 2016), several different techniques have been used to reconstruct the network proposed in this challenge, which can be distinguished based on the fact that a prior knowledge has been employed in order to improve the predictions, and also based on the reconstruction method (Bayesian networks in our case). From the results on the in silico dataset, ranked by the mean AUC, we observed that our best performer (TB with AIC) obtained a value of , which is better than all the other methods based on Bayesian networks and ranks in the top of the overall evaluated techniques. On the other hand, on the real datasets our results are similar to those obtained by methods based on Bayesian networks, which present values around . Both these results are not surprising, since we do not use any prior knowledge on the input data (resulting in good performance on the in silico dataset), and also the number of observations in each of the real datasets is quite low compared to the number of nodes (phosphoprotein) of the networks to reconstruct, hence penalizing Bayesian approaches, making the inference task difficult.
5 CONCLUSIONS
In this work, we studied the inference of causal molecular networks, specifically focusing on signaling downstream of receptor tyrosine kinases. We modeled relationships (edges) in causal molecular networks (’causal edges’) as directed links between nodes, in which inhibition of the parent node can lead to a change in the abundance of the child node, either by direct interaction or via unmeasured intermediate nodes.
To this extent, we have tested different methods to reconstruct (Bayesian) networks on real and in silico datasets proposed in the HPN-DREAM challenge. Specifically, we analyzed the performance of different optimization search schemes, i.e., Hill climbing (HC), Tabu seach (TS) and Genetic algorithms (GA), and various likelihood scores, i.e., loglik, AIC and BIC. This analysis seems to show a better performance of more sophisticated search strategies like GA on real datasets, even if on in silico data it is shown that easier search schemes as HC and TS also prove to be very effective.
Furthermore, we find the obtained results to be encouraging, especially considering the fact the we have employed “standard” versions of the algorithms for the reconstruction of the network without making use of any biological prior.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Akaike, (1992) Akaike, H. (1992). Information theory and an extension of the maximum likelihood principle. In Breakthroughs in statistics , pages 610–624. Springer.
- 2Akutsu et al., (1999) Akutsu, T., Miyano, S., Kuhara, S., et al. (1999). Identification of genetic networks from a small number of gene expression patterns under the boolean network model. In Pacific symposium on biocomputing , volume 4, pages 17–28. Citeseer.
- 3Bansal et al., (2007) Bansal, M., Belcastro, V., Ambesi-Impiombato, A., and Di Bernardo, D. (2007). How to infer gene networks from expression profiles. Molecular systems biology , 3(1):78.
- 4Friedman et al., (2000) Friedman, N., Linial, M., Nachman, I., and Pe’er, D. (2000). Using bayesian networks to analyze expression data. Journal of computational biology , 7(3-4):601–620.
- 5Glover, (1989) Glover, F. (1989). Tabu search-part i. ORSA Journal on computing , 1(3):190–206.
- 6Goldberg and Holland, (1988) Goldberg, D. E. and Holland, J. H. (1988). Genetic algorithms and machine learning. Machine learning , 3(2):95–99.
- 7Hill et al., (2016) Hill, S. M., Heiser, L. M., Cokelaer, T., Unger, M., Nesser, N. K., Carlin, D. E., Zhang, Y., Sokolov, A., Paull, E. O., Wong, C. K., et al. (2016). Inferring causal molecular networks: empirical assessment through a community-based effort. Nature methods , 13(4):310–318.
- 8Hwang, (1988) Hwang, C.-R. (1988). Simulated annealing: theory and applications. Acta Applicandae Mathematicae , 12(1):108–111.
