Multiplex graph matching matched filters
Konstantinos Pantazis, Daniel L. Sussman, Youngser Park, Zhirui Li,, Carey E. Priebe, Vince Lyzinski

TL;DR
This paper introduces a multiplex graph matching approach that extends classical methods to handle multi-channel networks, improving template detection in complex background networks through theoretical and empirical validation.
Contribution
It develops a multiplex analogue of graph matching, enabling efficient template detection in multi-channel networks, which is a novel extension of previous single-channel methods.
Findings
The method effectively detects templates in multiplex networks.
Considering multiple channels enhances detection accuracy.
The approach is validated both theoretically and empirically.
Abstract
We consider the problem of detecting a noisy induced multiplex template network in a larger multiplex background network. Our approach, which extends the framework of Sussman et al. (2019) to the multiplex setting, leverages a multiplex analogue of the classical graph matching problem to use the template as a matched filter for efficiently searching the background for candidate template matches. The effectiveness of our approach is demonstrated both theoretically and empirically, with particular attention paid to the potential benefits of considering multiple channels.
| Padding regime | recovered in ch. 1 | recovered in ch. 2 | recovered in ch. 3 |
|---|---|---|---|
| Centered | 86.67 | 85.07 | 96.77 |
| Naive | 98.33 | 100 | 96.77 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Bayesian Modeling and Causal Inference · Complex Network Analysis Techniques
Multiplex graph matching matched filters
Konstantinos Pantazis
Department of Mathematics, University of Maryland, College Park
Daniel L. Sussman
Department of Mathematics and Statistics, Boston University
Youngser Park
Center for Imaging Sciences, Johns Hopkins University
Zhirui Li
Department of Mathematics, University of Maryland, College Park
Carey E. Priebe
Department of Applied Mathematics and Statistics, Johns Hopkins University
Vince Lyzinski
Department of Mathematics, University of Maryland, College Park
Abstract
We consider the problem of detecting a noisy induced multiplex template network in a larger multiplex background network. Our approach, which extends the framework of [41] to the multiplex setting, leverages a multiplex analogue of the classical graph matching problem to use the template as a matched filter for efficiently searching the background for candidate template matches. The effectiveness of our approach is demonstrated both theoretically and empirically, with particular attention paid to the potential benefits of considering multiple channels.
Keywords— Multiplex graphs, graph matching, correlation network models, matched filters
1 Introduction and Background
Multilayer and multiplex networks have proven to be useful models for capturing complex relational data where multiple types of relations are potentially present between vertices in the network [7, 24]. For example, in connectomes (i.e., brain graphs) different edge modalities can represent different synapse types between neurons [45]; in social networks different edge modalities can capture relationships in different social network platforms [21]; in scholarly networks different edge modalities can capture co-authorship across multiple classification categories [36]. Moreover, in many applications leveraging the signal across the different layers of the network can lead to better, more robust performance than working within any single network modality [35, 24, 8].
The inference task we consider here is the problem of detecting (possibly multiple copies of) a noisy induced subgraph in a multiplex background network (see Definition 1.1 for the definition of multiplex networks we consider herein). Succinctly, given a multiplex template with vertices, we seek to find the “best fitting” subgraph(s) in a larger multiplex background network (see Section 2 for detail) with vertices. This problem is a generalization of the NP-complete [40] multiplex subgraph isomorphism problem (see [25] for a definition of multiplex isomorphism), accounting for the reality that relatively large, complex subgraph templates may only errorfully occur in the larger background network. These errors may be due to missing edges/vertices in the template or background, and arise in a variety of real data settings [39]. The subgraph isomorphism problem—given a template , determine if an isomorphic copy of exists in a larger network and find the isomorphic copy (or copies) if it exists—has been the subject of voluminous research in the monoplex (i.e., single layer) setting, with approaches based on efficient tree search [44], color coding [3, 2], graph homomorphisms [20], rule-based/filter-based matchings [10, 33], among others; for a survey of the literature circa 2012, see [27]. In contrast, the problem of multilayer homomorphic/isomorphic subgraph detection is still in its relative infancy, with comparatively fewer existing methods in the literature; see, for example, [47, 42, 33].
Notation: The following notation will be used throughout. For an integer , we will define , to be the hollow matrix with all off-diagonal entries identically set to , to be the matrix with all entries identically set to [math].
1.1 (Multiplex) graph matching
The above noisy induced subgraph detection problem depends greatly on the definition of “best fitting” employed. Our approach, generalizing [41] to the multiplex setting, will employ the multiplex template to search the multiplex background graph for possible matches, with goodness of fit measured via a multiplex formulation of the classical graph matching problem (see [9, 19, 16, 46] for excellent reviews of the voluminous graph matching literature). In the monoplex setting, the simplest formulation of the graph matching problem (GMP) can be stated as follows: Given two -vertex, undirected graphs with respective (weighted) adjacency matrices and , find a permutation matrix permutation matrices} in
[TABLE]
Before lifting the graph matching problem to the multiplex setting, we first need to define precisely what we mean by a multiplex graph.
1.1.1 Multiplex networks
The above formulation of both the GMP requires both graphs to identically have vertices, though there are myriad ways of adapting the GMP to graphs of different orders (see, for example, Appendix F of [6]). In the multiplex subgraph matching problem at the core of this paper, we view the template as being equal or lower order than the background . Moreover, our definition of multiplex networks, ideally, would allow for differing graph orders across the multiplex layers within a single graph. To allow for these expected data nuances in the multiplex setting, we consider the following multiplex graph model; see [24] for a thorough overview of this and other multiplex network formulations.
Definition 1.1**.**
The -tuple is an -vertex multiplex network if for each we have that -vertex labeled graphs, and the vertex sets further satisfy the following:
- i.
For each , we have that ; 2. ii.
* and ;* 3. iii.
The layers are a priori node aligned; i.e., vertices sharing the same label across layers correspond to the same entity in the network.
Note that each vertex need not appear in each channel , however, we do require that at least one vertex appears simultaneously in all channels. We will denote the set of -layer, -vertex multiplex networks via .
1.1.2 Multiplex GMP
To lift the monoplex GMP to the general multiplex definition presented above, we consider the following padded formulations of our general multiplex networks (adapted here from [18, 41]). Letting and with , we consider the following two schemes for ameliorating the differing graph orders.
- i.
(Naive Padding) For each , define the weighted adjacency matrices and via
[TABLE]
[TABLE]
Denote and . 2. ii.
(Centered Padding) For each , define the weighted adjacency matrices and via
[TABLE]
Denote and .
The Naive Multiplex Graph Matching Problem (nMGMP) is then defined as finding an element in
[TABLE]
where is the matrix of all [math]’s. The formulation in Eq. (2) effectively seeks to maximize the number of common edges between the multiplex template and multiplex background, where all edges across all channels are weighted equally (see [18, 6] for the monoplex analogue). The Centered Multiplex Graph Matching Problem (cMGMP) is defined as finding an element in
[TABLE]
If for each , we have that , then the formulation in Eq. (3) effectively seeks to minimize the number of disagreements (edge mapped to non-edge and vice versa) induced between the background and the matched subgraphs in the template, where all disagreements across all channels are weighted equally. Given this interpretation, the appropriate padding schemes to deploy in practice depends on the underlying problem assumptions and setting.
Remark 1**.**
Our formulation of the Multiplex GMP is (assuming channels of equal order across and )*
[TABLE]
rather than a formulation weighting the matching in each channel via
[TABLE]
*for . In our subgraph detection setting, we have found that works suitably well; moreover, this weights each edge in each template channel equally, which may be desirable. In the case that one or more channels is more informative or of higher import than the others, then choosing appropriate ’s to overweight the matching in those channels may be desirable.
2 Multiplex Graph Matching Matched Filters
Given , a multiplex graph matching algorithm designed to approximately solve Eqs. (2–3), our multiplex graph matching matched filter (M-GMMF), generalizing the monoplex filtering setting of [41], proceeds as in Algorithm 1. Note that in our experiments (and in the pseudocode below), we make use of (see Algorithm 2 in Appendix A.1), but we stress that our approach can utilize any suitable equally well.
Effectively, the M-GMMF algorithm uses the multiplex template (and algorithm ) to search for suitable solutions aligning to . The multiple restarts in Step 2. of the procedure are needed in the case of , as in that setting the objective function is relaxed to an indefinite quadratic program with myriad local minima in the feasible region; these restarts aim to precisely counteract the presence of these local minima by broadly searching the feasible region for a global minimum. For approximate combinatorial , the restarts may be appropriate as well, while for continuous, convex relaxation algorithms (see, for example, [6]), this step may not be necessary.
Note that code implementing the above M-GMMF and M-FAQ procedures can be downloaded as part of our R package, iGraphMatch, which is available on CRAN or can be downloaded at https://github.com/dpmcsuss/iGraphMatch.
2.1 Multiplex Matchability
In [41], the authors considered an error model wherein the template is an errorful induced subgraph of the background in the monoplex setting. The aim of the Monoplex-GMMF approach then was to recover the vertices in corresponding to . Can we recover the analogous results in the multiplex setting? To frame and attack this problem statistically, we consider the following error model which we will use to generate a multiplex background graph and a multiplex template with .
Definition 2.1** (See [4]).**
Consider a graph with . Let the centered, padded adjacency matrix (as in Eq. (1)) of be denoted . Let be a symmetric, hollow matrix. The graph-valued random variable with vertex set equal to and random centered, padded adjacency matrix , which models passing through an errorful channel , is defined as follows. For each ,
[TABLE]
where .
The two generative models we then consider are defined via:
- i.
(Single Channel Source, Error Multiplex, abbreviated ME) There is a single non-random background source graph and non-random source template , and two multi-channel errorful filters , with each acting on , and , with each acting on . We observe as the multiplex background and as the multiplex template. By assumption, the errorful filters act independently across channels within and , and independently across and . In this model, by construction each and each . 2. ii.
(Single Channel Errors, Source Multiplex, abbreviated MS) The non-random background and non-random template source graphs are multiplex. To wit, let and satisfy the following: For each , let and be the centered paddings of and respectively. We assume then that (i.e., —the padded adjacency matrix of —is the principal submatrix of —the padded adjacency matrix of ). There are two multi-channel errorful filters: and . For each , acts on , and acts on . We observe as the multiplex background and as the multiplex template. As above, the errorful filters act independently across channels within and , and independently across and . Note that if the template (resp., background) channels have non-identical vertex sets, then this will be preserved in the errorful template (resp., background).
It may be convenient to view and (resp., and ) as realizations from graph-valued random variables in the ME (resp., MS) model. In this case, we will assume the actions of the errorful filters on and (resp., and ) are also independent of the random and (resp., and ).
Considering the models above, in order for our M-GMMF approach to possibly recover the true errorful induced subgraph of corresponding to , we need for the global minimum of the Multiplex GMP to be in . This is the multiplex analogue of graph matchability, i.e., uncovering conditions under which oracle graph matching will recover a latent vertex alignment. Here, that alignment is represented by being an errorful version of ; see, for example, [14, 22, 38, 37, 30, 23, 12, 11, 5, 13] for a litany of graph matchability results in the monoplex setting.
2.2 MS model matchability
In this section, we will explore the benefit of considering multiplex versus monoplex networks when considering template matchability in the MS model. We note here that while the formal theory underlying the matchability results in the multiplex setting differs only slightly from the monoplex setting of [41], we stress that the end results demonstrate the utility of considering multiple channels.
In the MS model, let and be the respective template and background source graphs, with respective centered, padded adjacency matrices given respectively by and satisfying for all . Assume that the errorful filters satisfy for each , and (where and are allowed to vary with ). If , and , then the observed background and template are effectively independent ER and ER networks, respectively. It is immediate then that the optimal permutation aligning the background to the template will almost surely not be in .
Consider now . Let ; these “bad” channels act to obfuscate the latent alignment between and by effectively whitening the signal present in the alignment within the channels. Suppose that there exist constants , , and such that for all , satisfies . For each , denote the set of permutations that permute exactly labels of by , and for each (with associated permutation ), define
[TABLE]
and for each , define
[TABLE]
Suppose that there exists an such that for all , we have that for all and all ,
[TABLE]
Letting and be the padded, centered adjacency matrices of and respectively (the errorful and ), for we have that
[TABLE]
(see Appendix A.2 for proof of this bound).
Exploring Eq. (5) in the ER setting further, we consider the following setup. If , and for each , with , then for each , a simple application of McDiarmid’s inequality (see Appendix 1) yields that
[TABLE]
with probability at least
[TABLE]
Note that if , then , so that with probability at least Eq. (7),
[TABLE]
Suppose that , and that there exists an such that for all , we have that for all , , and
[TABLE]
For , we then have
[TABLE]
For proof of Eq. (9), see Appendix A.4.
We have thus proven the following theorem:
Theorem 1**.**
With setup as above, suppose that , and is a fixed constant that does not vary with . Further suppose that , and for channels , and for channels (where , , and are allowed to vary with ). Then there exist constants , and such that if for all ,
[TABLE]
then for , If , , , and are fixed constants that do not vary with , we need only require rather than Condition (1).
2.2.1 Strength in numbers
Consider in Theorem 1. Condition (1) then reduces to
[TABLE]
and large values (i.e., close to ) of and can be mitigated by choosing an appropriately large ; effectively, multiple channels can amplify the weak signal present in each individual channel.
We explore this further in the following experiment. We will look at two different cases specifically, when and when . First, considering (to mitigate possible effects of template order on matching accuracy), we let for ranging over . For each , we have that (so that and are marginally ER(100,0.5) and edges across graphs are independent except that for each , we have that corr).
Within this model, the channels are endowed with a natural vertex correspondence across and , namely the identity mapping. Note that in the ER MS model setting, we have that so that the correlation between edges in and can be made positive or negative with judiciously chosen and . Considering varying over , we match and using M-FAQ (Algorithm 2 using seeded vertices [18]). Results are plotted in Figure 1. In Figure 1, we plot the mean matching accuracy (i.e., the fraction of vertices whose latent alignment is recovered correctly) of M-FAQ versus , averaged over 2000 Monte Carlo replicates. For each choice of parameters, we also plot (via the partially transparent points) the accuracy distribution corresponding to the MC replicates. In red (resp., olive, green, blue, purple) we plot the results for (resp., , , , ). From Figure 1, we see the expected relationship: in low correlation settings where M-FAQ is unable to align the monoplex graphs, this can often be overcome by considering . Indeed, in all cases, save , perfect matching is achieved using channels.
Next, we look at the case when . In addition to examining the effect of multiple channels when weak signal is present across channels, we wish to compare the effect of different padding schemes (Naive vs Centered) in terms of the matching accuracy. We analyze the padding scheme’s effectiveness first, by varying the values of the correlation while keeping constant (see Figure 2) and second, by varying the background size while the template size and the correlation remain constant (see Figure 3). Using the Naive (resp. Centered) padding scheme, we let (resp. ) for ranging over . Utilizing seeds, we match and (resp. and ) using M-FAQ (Algorithm 2). Results are plotted in Figures 2 and 3. As in Figure 1, we plot the mean matching accuracy (i.e., the fraction of vertices whose latent alignment is recovered correctly) of M-FAQ versus , averaged over 100 MC replicates. For each choice of parameters, we also plot (via the partially transparent points) the accuracy distribution corresponding to the MC replicates. In Figure 2, in red (resp., olive, green, blue, purple) we plot the results for (resp., , , , ). In Figure 3, in red (resp., green, blue, purple) we plot the results for (resp., , , ).
All figures demonstrate that even though the M-FAQ algorithm is unable to align the monoplex graphs when , this can often be overcome by considering . Moreover, Figures 2 and 3, show that the Centered Padding scheme achieves better matching accuracy between channels than the Naive Padding scheme. Akin to Figure 1, Figure 2 illustrates that the matching accuracy increases as the correlation increases. Finally, in Figure 3, we observe that the matching accuracy decreases as the ratio between the template size and the background size decreases.
2.2.2 The good outweighs the bad
In this section, we explore the ability of the signal in “good” channels to overcome the obfuscating effect of “bad” channels. To wit, consider Condition (1) with . We see that if there are enough channels (i.e., is sufficiently large) with positive correlation (), then the template and background remain matchable even in the presence of (potentially) multiple anti-correlated channels.
We explore this further in the following experiment. As in the previous subsection 2.2.1, we study this “obfuscating” effect for both and cases. Again consider , and let (i.e., ), where for we have that . Under the same setting, we let and we apply Naive Padding in , so that for all .
Considering to be either (for channels) or (for channels), where varies in , we plot the matching accuracy (averaged over 2000 (left panel) and 100 (right panel) Monte Carlo replicates) obtained by M-FAQ (with 10 seeds) versus in Figure 4. For each choice of parameters, we also plot (via the partially transparent points) the accuracy achieved by each MC replicate. In red (resp., olive, green, blue, purple) we plot the results for (resp., , , , ). From the figure, we see the expected relationship: matching at higher levels of yields better accuracy, and more robustness to channels with negative correlation. Further, we notice that the matching accuracy in the right panel (i.e., ) is not as good as in the left panel (i.e., ). We make this phenomenon more clear in Figure 6.
In addition, we study the effect of different padding schemes (Naive vs Centered) in terms of the matching accuracy. We analyze the padding scheme’s effectiveness first, by varying the values of the correlation while keeping constant (see Figure 5) and second, by varying the number of the background vertices while the template size remains the same (see Figure 6). Using the Naive (resp. Centered) padding scheme, we let (resp. ) for ranging over . Utilizing seeds, we match and (resp. and ) using M-FAQ (Algorithm 2). Results are plotted in Figures 5 and 6. As in Figure 4, we plot the mean matching accuracy (i.e., the fraction of vertices whose latent alignment is recovered correctly) of M-FAQ versus , averaged over 100 MC replicates. For each choice of parameters, we also plot (via the partially transparent points) the accuracy distribution corresponding to the MC replicates. In Figure 5, in red (resp., olive, green, blue, purple) we plot the results for (resp., , , , ). In Figure 6, in red (resp., green, blue, purple) we plot the results for (resp., , , ).
From Figures 4 and 5, we observe that matching at higher levels of yields better accuracy, and more robustness to channels with negative correlation. Moreover, Figures 5 and 6 show that the Centered Padding scheme achieves better performance in terms of matching accuracy than the Naive Padding scheme.
2.3 ME model matchability
To derive analogous results to those in Section 2.2 in the ME model, we consider the following setting. Letting and be the respective background and template source graphs, we again assume that there exist constants , and such that for all , satisfies . Further assume that for each the errorful filters satisfy,
[TABLE]
For each , define
[TABLE]
where is defined as in Eq. (4). Suppose that there exists an such that for all , we have that for all and all
[TABLE]
then
[TABLE]
where the bound in Eq. (13) uses Appendix A.5 and then follows mutatis mutandis from the proof in Appendix A.2.
Exploring this further in the ER setting, consider with . As in Eq. (7), for each , we then have that
[TABLE]
with probability at least
[TABLE]
Note that if , then , so that with probability at least Eq. (14),
[TABLE]
Suppose that , and that there exists an such that for all , we have , and
[TABLE]
Then for , . We have the following theorem (whose proof follows mutatis mutandis to that of Theorem 1 and so is omitted):
Theorem 2**.**
With setup as above, suppose that . For
[TABLE]
where and can vary in and . Then there exist constants , and such that if for all
[TABLE]
then for , If , , and are fixed in , we need only require for to hold for sufficiently large .
3 Experiments
Our previous simulation explored the effect on multiple channels on multiplex matchability. We next consider the performance of our multiplex matched filter approach in detecting a hidden template in a multilayer social media network from [31]. The background network contains aligned channels representing user activity in FriendFeed, Twitter and Youtube (where the Youtube and Twitter channels were generated via FriendFeed which aggregates user information across these platforms). In total, there are unique vertices across the three channels, with the channel specific networks satisfying:
[TABLE]
Given a 35 vertex multiplex template created by Pacific Northwest National Laboratories for the DARPA MAA program, we ran our M-GMMF algorithm (Algorithm 1) to attempt to recover the template in ; results are summarized below.
In our first experiment, we first considered running “cold-start” M-GMMF; that is, no prior information (in the form of seeds, hard or soft) is utilized in the algorithm. We consider padding the graph via the Naive Padding and Centered Padding regimes of Section 1.1.2, and for each padding regime, we ran M-GMMF with random restarts. Numeric results are summarized in Table 1 (with the best recovered background signals also plotted in Figure 7). While the best recovered signal in the Naive Padding regime captures all but two template edges, this is at the expense of many extraneous background edges that do not appear in the template. On the other hand, the Centered Padding regime recovers most of the template edges (across the three channels) with minimal extra template edges in the recovered signal.
The M-FAQ algorithmic primitive (Algorithm 2) used in our implementation of M-GMMF is most effective when it can leverage a priori available matching data in the form of seeded vertices. Seeds can either come in the form of hard seeds (a priori known 1–to–1 matches; here that would translate to template vertices whose exact match is known in the background) or soft seeds (where a soft seeded vertex in has an a priori known distribution over possible matches in ; here this would translate into template vertices with a list of candidate matches in the background). While hard seeds are costly and often unavailable in practice, there are many scalable procedures in the literature for automatically generating soft seed matches. Here, we use as a soft-seeding the output of [33, 34], a filtering approach for finding all subgraphs of the background network homomorphic to the template.
For each node in the template, the output of [33, 34] produces a multiset of candidate matches in the background, where each candidate match corresponds to a template copy contained in the background as a subgraph (not necessarily as an induced subgraph). We convert the candidate matches into probabilities by simply converting the multiset to a count vector and normalizing the count vector to sum to . We then consider the normalized count vectors as rows of a stochastic matrix; this stochastic matrix provides M-FAQ with a soft-seeding which can be used to initialize the algorithm.
Considering random restarts as perturbations (akin to Step 2 of Algorithm 1) of the soft-seeding (conditioned on retaining nonnegative entries), we ran M-GMMF using a generalization of the Centered Padding regime, which is defined as follows: For each , define the weighted adjacency matrices and via
[TABLE]
where we vary from [math] to . Note that yields Naive Padding, and yields Centered Padding. Optimal performance in the present experiment was achieved with , in which case random restarts yielded an induced subgraph in the background that was isomorphic to the template network.
3.1 M-GMMF on Semantic Property Graphs
For our second example, we consider the semantic property graph released by Pacific Northwest National Laboratories as part of the MAA-AIDA Data Release V2.1.2 via the DARPA MAA program [15]. In this dataset, the background network is a knowledge graph constructed from a variety of documents (e.g., newspaper articles) by DARPA’s AIDA program. At a high level, the graph is encoding the real-world relationships between a variety of entities (people, locations, major events, etc.) that can be automatically extracted from a variety of data sources. Practically, the graph is a richly featured network on the order of 100K nodes. Node properties include name; rdf:type (corresponding to a structured ontology of types); textValue; linkTarget; start time; among others. Edge properties include name/id, rdf:type, argument (values given to edges of a given rdf:type), among others. Note that many nodes and edges do not have values for all properties. The templates here, themselves richly featured knowledge graphs, are of the order of 10s of vertices (ranging in size from 33 nodes/40 edges to 11 nodes/11 edges); for each of three template types, there are 6 variants with varying error levels, including one variant (version “A”) that is perfectly/isomorphically embedded into the background.
The principle challenge in applying our M-GMMF methodology on such richly featured data is sensibly incorporating the rich, structured features into our multiplex network framework. Towards this end, we adopted the following approach. We incorporated the vertex features/properties into a penalty term in the objective function, encoding the features into a vertex–to–vertex similarity matrix (this is possible provided that similarities are easily computed within each vertex covariate, which is the case here). Edge features were used to divide the knowledge graph into multiple overlapping channels in a multiplex network. One channel was assigned to each unique (E(rdf:type), E(argument)) pair in the template, and we divided background edges amongst the channels via
- i.
A hard split based on E(argument): Within each (E(rdf:type), E(argument)) channel, only edges with matching E(argument) are potentially present. If the E(argument) property is missing in the background, we allow the edge to possibly exist in all (E(rdf:type), E(argument)) channels.
- ii.
A soft split (weighted according to E(rdf:type) similarity; note that a rdf:type similarity function was provided with the data) based on E(rdf:type): Within each (E(rdf:type), E(argument)) channel, each background edge with matching (or missing) E(argument) is present, and the edge is weighted according to a similarity measured between its E(rdf:type) and that of the channel. Scalability gains can be achieved by thresholding the similarities to improve sparsity.
Spatiotemporal constraints can be coded into separate channels in the multiplex graph, one channel per spatiotemporal constraint in the template. Each constraint (e.g., action A must occur between x and y days after action B) yields an edge filter, with only edges that could potentially satisfy the constraint being added to that constraint channel. Lastly, numeric edge features are used to further weight the edges in the background/template. The final objective function we used in our M-GMMF approach was then of the form
[TABLE]
Performance of M-GMMF and various other approaches (as scored by the GED scoring metric of [15]) are presented in Figure 8 (Note that the filtering approach of Tu et al., as presented in [43], also present a method that uses the clean “A” template for training, and achieves essentially the best score for all template versions; we did not include those scores for comparison in our figure).
Overall, 18 templates were present: 3 distinct template types, each with 6 variants encompassing different amounts of template noise which give an indication of the knowledge graph structure encoded into the multiplex graph; see Figure 9, for a pair of example templates and M-GMMF recovered signals here. One template of each type was constructed to have a perfect isomorphic match in the background, while the noisy templates were designed for inexact/fuzzy matching. All approaches identified the isomorphic match in the background for all three template types version “A”, while our approach achieved its best relative results on the larger, more complex template (Template 1). For Template 1, we obtained the best (or nearly the best) score on 3/6 versions; see Table 2 for detail.
Templates 2 and 3 were essentially tree-like structures (nodes/edges is 13/15 and 11/11 respectively), and we suspect that the filtering-based approaches are more suitable to this problem type. Indeed, M-GMMF is designed for larger/more complex templates, though our performance (especially compared to the non-filtering G-Finder of [28]) is encouraging on these instantiations, especially on version “B” of the templates.
Note also that our approach does not directly seek to optimize the GED of [15] and here does not make use of the importance weights provided by the GED scoring metric (though these could easily be encoded via edge weights and scaling the similarities in ); rather, we seek to optimize the multiplex GM objective function of 18. Nonetheless, as shown in Figure 10, the rankings of the random restart outputs for our GM objective function and for the GED are often highly correlated, with recovered signals scoring well in one metric often scoring well in both. The interesting gap appearing in the plot of Template 1E is evocative of the GM phase transitions appearing in the literature (see, for example, [17]) and bears further study.
4 Discussion
In this paper, we presented a framework for finding noisy copies of a multiplex template in a large multiplex background network. Our strategy, which extends [41] to the multiplex setting, uses graph matching combined with multiple random restarts to search the background for locally optimal matches to the template. To formalize this strategy, we provided a very natural extension of the classical graph matching problem to the multiplex that is easily amended to matching graphs of different orders (both across networks and channels). Further, the effectiveness of the resulting algorithm, named M-GMMF, is demonstrated both theoretically and empirically.
There are a number of extensions and open questions that arose during the course of this work. Natural theoretic extensions include lifting Theorems 1 and 2 to non-edge independent models (note that certain localized dependencies amongst edges can easily be handled in the McDiarmind proof framework, while globally dependent errors provide a more significant challenge); formulating the analogues of Theorems 1 and 2 in the weighted, attributed graph settings; and considering the theoretic properties of various continuous relaxations of the multiplex GM problem akin to [1, 29, 6].
A key methodological questions in multiplex graph matching was touched upon in Remark 1; indeed, we expect the question of how to weight the matching across channels to be essential when applying these methods to topologically diverse and weighted networks. If the order of magnitude of edge weights vary across channel, then it is easy to see a GM algorithm aligning channels with large edge weights at the expense of the alignment accuracy in other channels. Judiciously choosing would allow for the signal in channels with smaller edge weights to be better leveraged towards a better overall matching.
While the largest network we consider in this work has vertices, scaling this approach to very large networks is essential. By utilizing efficient data structures for sparse, low-rank matrices and a clever implementation of the LAP subroutine of M-FAQ (step iii. in Algorithm 2), we are able to match vertex templates to -vertex background graphs in per restart with our base M-GMMF code (available in iGraphMatch) implemented in R on a standard laptop. Further work to scale M-GMMF by leveraging both efficient data structures and scalable approximate LAP solvers is currently underway.
List of abbreviations
GM(P) Graph Matching (Problem) nMGMP Naive Multiplex Graph Matching Problem cMGMP Centered Multiplex Graph Matching Problem M-GMMF Multiplex Graph Matching Matched Filters M-FAQ Multiplex Fast Approximate Quadratic ME (Single Channel Source) Error Multiplex MS (Single Channel Errors) Source Multiplex GED Graph Edit Distance LAP Linear Assignment Program DARPA Defense Advanced Research Projects Agency MAA Modeling Adversarial Activity AIDA Active Interpretation of Disparate Alternatives
Declarations
Availability of data and materials
The background graphs for the 3-channel social network in Section 3 is available at http://multilayer.it.uu.se/datasets.html. The data from the DARPA MAA analysis Section is not publicly available, and the obtained results for the outside algorithms appear in the cited papers in the literature.
The code implementing the M-GMMF and M-FAQ procedures can be downloaded as part of our R package, iGraphMatch, which is available on CRAN or can be downloaded at https://github.com/dpmcsuss/iGraphMatch.
Competing interests
Not applicable.
Funding
Dr. Sussman’s contribution to this work was partially supported by a grant from MIT Lincoln Labs and the Department of Defense. This material is based on research sponsored by the Air Force Research Laboratory and DARPA under agreement numbers FA8750-18-2-0035 and FA8750-20-2-1001. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright notation thereon. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of the Air Force Research Laboratory and DARPA or the U.S. Government.
Authors’ contributions
Daniel L. Sussman, Carey E. Priebe and Vince Lyzinski, and Konstantinos Pantazis conceived of the method and developed the theory. Konstantinos Pantazis, Youngser Park, Daniel L. Sussman, Vince Lyzinski and Zhirui Li worked on writing and implementing the algorithm and performing relevant experiments. Konstantinos Pantazis, Daniel L. Sussman and Vince Lyzinski wrote and curated the manuscript.
Acknowledgments
Not applicable.
Appendix A Appendix
Herein we collect details of our auxiliary algorithms and proofs of our main results.
A.1 Multiplex FAQ
The details of the M-FAQ algorithm are presented below.
A.2 Proof of Eq. (6)
For each define
[TABLE]
Assuming that , then . Note that is a function of (at most) independent Bernoulli random variables, and changing any one of these Bernoulli random variables can change the value of by at most . McDiarmid’s inequality [32] then implies that for any ,
[TABLE]
Note that if then
[TABLE]
Define
[TABLE]
so that . We then have
[TABLE]
Note that if , then if for all ; i.e., if there exists a such that . Note that this defines an equivalence relation on which we will denote by “,” and let be a fixed (but arbitrarily chosen) set composed of one member of each equivalence class according to “.” Note that is at most . Letting in Eq. (20), we have that if
[TABLE]
as desired.
A.3 Proof of Eq. (7)
We have that if each ER, then
[TABLE]
so that . Also, is then a function of at most independent Bernoulli random variables, and changing the value of any one these can change the value of by at most . McDiarmid’s inequality then yields the desired result
[TABLE]
by setting
A.4 Proof details for Eq. 9
Let the equivalence relation “” on be defined via if there exists a such that . Note that if then
[TABLE]
Let be a fixed (but arbitrarily chosen) set composed of one member of each equivalence class according to “,” and note that is at most . Given the assumptions in Section 2.2, for we have that for each ,
[TABLE]
Denote the event bound in Eq. (A.4) via .
For , we then have
[TABLE]
as desired.
A.5 Proof details for Eq. (13)
For , define
[TABLE]
For , we then have that defined in Eq. (19) satisfies
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Y. Aflalo, A. Bronstein, and R. Kimmel. On convex relaxation of graph isomorphism. Proceedings of the National Academy of Sciences , 112(10):2942–2947, 2015.
- 2[2] N. Alon, P. Dao, I. Hajirasouliha, F. Hormozdiari, and S. C. Sahinalp. Biomolecular network motif counting and discovery by color coding. Bioinformatics , 24(13):i 241–i 249, 2008.
- 3[3] N. Alon, R. Yuster, and U. Zwick. Color-coding. Journal of the ACM (JACM) , 42(4):844–856, 1995.
- 4[4] J. Arroyo, D. L. Sussman, C. E. Priebe, and V. Lyzinski. Maximum likelihood estimation and graph matching in errorfully observed networks. ar Xiv preprint ar Xiv:1812.10519 , 2018.
- 5[5] B. Barak, C. Chou, Z. Lei, T. Schramm, and Y. Sheng. (nearly) efficient algorithms for the graph matching problem on correlated random graphs. ar Xiv preprint ar Xiv:1805.02349 , 2018.
- 6[6] J. Bento and S. Ioannidis. A family of tractable graph distances. ar Xiv preprint ar Xiv:1801.04301 , 2018.
- 7[7] S. Boccaletti, G. Bianconi, R. Criado, C. I. Del Genio, J. Gómez-Gardenes, M. Romance, I. Sendina-Nadal, Z. Wang, and M. Zanin. The structure and dynamics of multilayer networks. Physics Reports , 544(1):1–122, 2014.
- 8[8] L. Chen, J. T. Vogelstein, V. Lyzinski, and C. E. Priebe. A joint graph inference case study: The C. elegans chemical and electrical connectomes. Worm , 5(2), 2016.
