Generalized Median Graph via Iterative Alternate Minimizations

Nicolas Boria; Sébastien Bougleux; Benoit Gaüzère (LITIS); Luc Brun

arXiv:1906.11009·cs.CV·June 27, 2019


Open Access

TL;DR

This paper introduces an efficient iterative method to compute a generalized median graph from a set of graphs, addressing the NP-hard challenge with a block coordinate descent approach that optimizes node and edge labelings.

Contribution

It proposes a novel block coordinate descent algorithm for median graph computation that handles labeling on both nodes and edges, improving efficiency.

Findings

01. Demonstrates efficiency through experiments on multiple datasets

02. Handles labeling on nodes and edges effectively

03. Provides a clear optimization framework for median graph computation

Abstract

Computing a graph prototype may constitute a core element for clustering or classification tasks. However, its computation is an NP-Hard problem, even for simple classes of graphs. In this paper, we propose an efficient approach based on block coordinate descent to compute a generalized median graph from a set of graphs. This approach relies on a clear definition of the optimization process and handles labeling on both edges and nodes. This iterative process optimizes the edit operations to perform on a graph, alternately on nodes and on edges. Several experiments on different datasets show the efficiency of our approach.

Tables

Table 1: SOD computed using different GED approximations (times in seconds).

| 1st phase | 2nd phase | Letter (HIGH): SOD SM | Letter (HIGH): t(SM) | Letter (HIGH): SOD GM | Letter (HIGH): t(GM) | Monoterpenoides: SOD SM | Monoterpenoides: t(SM) | Monoterpenoides: SOD GM | Monoterpenoides: t(GM) |
|---|---|---|---|---|---|---|---|---|---|
| Bipartite | Bipartite | 142.69 | 0.01 | 87.80 | $6\times 10^{-4}$ | 402.50 | 0.002 | 253.11 | $8\times 10^{-4}$ |
| Bipartite | IPFP | 142.87 | 0.013 | 87.61 | 0.003 | 398.01 | 0.002 | 128.45 | 0.179 |
| IPFP | IPFP | 135.99 | 0.057 | 87.22 | 0.003 | 202.75 | 0.162 | 104.11 | 0.136 |
| $m$ Bipartite | $m$ Bipartite | 142.04 | 0.014 | 89.47 | $9\times 10^{-4}$ | 283.94 | 0.027 | 186.15 | 0.01 |
| $m$ Bipartite | $m$ IPFP | 142.19 | 0.018 | 87.66 | 0.013 | 281.14 | 0.031 | 83.11 | 0.545 |
| $m$ IPFP | $m$ IPFP | 135.99 | 0.274 | 87.23 | 0.015 | 106.10 | 1.159 | 75.08 | 0.288 |

Table 2: Classification results for the Letter (HIGH) and Monoterpenoides datasets.

Letter (HIGH) Dataset

| TS size | 1st phase | 2nd phase | pt | % SM | t(SM) | % GM | t(GM) | % TS | t(TS) |
|---|---|---|---|---|---|---|---|---|---|
| 10% | $m$ Bipartite | $m$ Bipartite | 0.023 | 76.42 | 0.325 | 82.82 | 0.325 | 83.01 | 5.275 |
| 10% | $m$ Bipartite | $m$ IPFP | 0.195 | 77.40 | 5.857 | 84.16 | 5.771 | 83.30 | 110.48 |
| 10% | $m$ IPFP | $m$ IPFP | 0.447 | 78.24 | 5.951 | 84.60 | 5.801 | 82.95 | 111.84 |
| 30% | $m$ Bipartite | $m$ Bipartite | 0.181 | 79.94 | 0.251 | 84.24 | 0.250 | 87.24 | 11.44 |
| 30% | $m$ Bipartite | $m$ IPFP | 0.878 | 81.83 | 4.323 | 86.06 | 4.234 | 86.86 | 239.14 |
| 30% | $m$ IPFP | $m$ IPFP | 3.437 | 81.59 | 4.316 | 86.08 | 4.245 | 86.86 | 240.96 |

Monoterpenoides Dataset

| TS size | 1st phase | 2nd phase | pt | % SM | t(SM) | % GM | t(GM) | % TS | t(TS) |
|---|---|---|---|---|---|---|---|---|---|
| 10% | $m$ Bipartite | $m$ Bipartite | 0.054 | 32 | 0.984 | 29.44 | 0.957 | 51.86 | 3.830 |
| 10% | $m$ Bipartite | $m$ IPFP | 1.586 | 53.38 | 47.96 | 57.49 | 51.03 | 60.69 | 186.85 |
| 10% | $m$ IPFP | $m$ IPFP | 2.044 | 54.06 | 47.31 | 62.38 | 48.01 | 60.69 | 187.83 |
| 30% | $m$ Bipartite | $m$ Bipartite | 0.373 | 36.39 | 0.747 | 34.28 | 0.732 | 67.92 | 8.571 |
| 30% | $m$ Bipartite | $m$ IPFP | 5.148 | 54.06 | 36.54 | 67.79 | 37.07 | 75.82 | 419.81 |
| 30% | $m$ IPFP | $m$ IPFP | 15.38 | 58.37 | 36.15 | 74.12 | 36.57 | 75.94 | 419.31 |

Equations

$$\bar{G}\in\arg\min_{G\in\mathbb{G}}\sum_{G^{\prime}\in\mathcal{G}}d(G,G^{\prime})$$

$$c_{v}(\pi,\varphi,\varphi^{\prime})=\sum_{i=1}^{n}\left[\delta_{\pi_{i}}\,c_{\text{vfs}}(\varphi_{i},\varphi^{\prime}_{\pi_{i}})+(1-\delta_{\pi_{i}})\,c_{\text{vr}}\right]+\sum_{k=1}^{n^{\prime}}(1-\delta_{\pi^{\prime}_{k}})\,c_{\text{vi}}$$

$$\begin{array}{l}c_{e}(\pi,A,\Phi,A^{\prime},\Phi^{\prime})=\sum_{i=1}^{n}\sum_{j=1}^{n}\delta_{\pi_{i}\pi_{j}}\,a_{i,j}\,a^{\prime}_{\pi_{i},\pi_{j}}\,c_{\text{efs}}\left(\phi_{i,j},\phi^{\prime}_{\pi_{i},\pi_{j}}\right)\\ \qquad+\,c_{\text{er}}\sum_{i=1}^{n}\sum_{j=1}^{n}\left[\delta_{\pi_{i}\pi_{j}}\,a_{i,j}(1-a^{\prime}_{\pi_{i},\pi_{j}})+(1-\delta_{\pi_{i}\pi_{j}})\,a_{i,j}\right]\\ \qquad+\,c_{\text{ei}}\sum_{i=1}^{n}\sum_{j=1}^{n}\delta_{\pi_{i}\pi_{j}}(1-a_{i,j})\,a^{\prime}_{\pi_{i},\pi_{j}}+c_{\text{ei}}\sum_{k=1}^{n^{\prime}}\sum_{l=1}^{n^{\prime}}(1-\delta_{\pi^{\prime}_{k}\pi^{\prime}_{l}})\,a^{\prime}_{k,l}\end{array}$$

$$\bar{G}=(\bar{\varphi},\bar{A},\bar{\Phi})\leftarrow\arg\min_{\substack{\varphi\in\mathbb{F}_{v}^{\bar{n}}\\ A\in\{0,1\}^{\bar{n}\times\bar{n}}\\ \Phi\in\mathbb{F}_{e}^{\bar{n}\times\bar{n}}}}\;\sum_{p=1}^{|\mathcal{G}|}c_{v}(\bar{\pi}_{p},\varphi,\varphi_{p})+\tfrac{1}{2}c_{e}(\bar{\pi}_{p},A,\Phi,A_{p},\Phi_{p})$$

$$\forall p\in\{1,\ldots,|\mathcal{G}|\},\quad\bar{\pi}_{p}\leftarrow\arg\min_{\pi_{p}\in\Pi(\bar{G},G_{p})}c(\pi_{p},\bar{G},G_{p})$$

$$\bar{\varphi}\leftarrow\arg\min_{\varphi\in\mathbb{F}_{v}^{\bar{n}}}s_{v}(\varphi),\qquad(\bar{A},\bar{\Phi})\leftarrow\arg\min_{\substack{A\in\{0,1\}^{\bar{n}\times\bar{n}}\\ \Phi\in\mathbb{F}_{e}^{\bar{n}\times\bar{n}}}}s_{e}(A,\Phi)$$

$$\forall i=1,\ldots,\bar{n},\quad\bar{\varphi}_{i}\leftarrow\arg\min_{\varphi_{i}\in\mathbb{F}_{v}}f_{i}(\varphi_{i})$$

$$\forall i=1,\ldots,\bar{n},\quad\bar{\varphi}_{i}\leftarrow\arg\max_{\varphi_{i}\in\mathbb{F}_{v}}h_{i}^{0}(\varphi_{i})$$

$$\forall i=1,\ldots,\bar{n},\quad\bar{\varphi}_{i}\leftarrow\frac{1}{\sum_{p=1}^{|\mathcal{G}|}\delta_{\pi^{p}_{i}}}\sum_{p=1}^{|\mathcal{G}|}\delta_{\pi^{p}_{i}}\,\varphi^{p}_{\pi^{p}_{i}}=\frac{1}{|S_{i}|}\sum_{p\in S_{i}}\varphi^{p}_{\pi^{p}_{i}}$$

$$\arg\min_{\substack{A\in\{0,1\}^{\bar{n}\times\bar{n}}\\ \Phi\in\mathbb{F}_{e}^{\bar{n}\times\bar{n}}}}s_{e}(A,\Phi)=\arg\min_{\substack{A\in\{0,1\}^{\bar{n}\times\bar{n}}\\ \Phi\in\mathbb{F}_{e}^{\bar{n}\times\bar{n}}}}\sum_{i=1}^{\bar{n}}\sum_{j=1}^{\bar{n}}f_{i,j}(a_{i,j},\phi_{i,j})$$

$$\begin{array}{rl}f_{i,j}(a_{i,j},\phi_{i,j})=&a_{i,j}\sum_{p=1}^{|\mathcal{G}|}\delta_{\pi^{p}_{i}\pi^{p}_{j}}\,a^{p}_{\pi^{p}_{i},\pi^{p}_{j}}\,c_{\text{efs}}(\phi_{i,j},\phi^{p}_{\pi^{p}_{i},\pi^{p}_{j}})\\ &+\,c_{\text{er}}\,a_{i,j}\sum_{p=1}^{|\mathcal{G}|}\left(1-\delta_{\pi^{p}_{i}\pi^{p}_{j}}\,a^{p}_{\pi^{p}_{i},\pi^{p}_{j}}\right)\\ &+\,c_{\text{ei}}(1-a_{i,j})\sum_{p=1}^{|\mathcal{G}|}\delta_{\pi^{p}_{i}\pi^{p}_{j}}\,a^{p}_{\pi^{p}_{i},\pi^{p}_{j}}\\ =&a_{i,j}\sum_{p=1}^{|\mathcal{G}|}\delta_{\pi^{p}_{i}\pi^{p}_{j}}\,a^{p}_{\pi^{p}_{i},\pi^{p}_{j}}\,c_{\text{efs}}(\phi_{i,j},\phi^{p}_{\pi^{p}_{i},\pi^{p}_{j}})\\ &+\,c_{\text{er}}\,a_{i,j}\left(|\mathcal{G}|-|S_{i,j}|\right)+c_{\text{ei}}(1-a_{i,j})|S_{i,j}|\end{array}$$

$$\forall(i,j)\in[\bar{n}]\times[\bar{n}],\ i\neq j,\quad(\bar{a}_{i,j},\bar{\phi}_{i,j})\leftarrow\arg\min_{\substack{a_{i,j}\in\{0,1\}\\ \phi_{i,j}\in\mathbb{F}_{e}}}f_{i,j}(a_{i,j},\phi_{i,j})$$

$$\phi^{\star}_{i,j}\in\arg\min_{\phi_{i,j}\in\mathbb{F}_{e}}\sum_{p=1}^{|\mathcal{G}|}\delta_{\pi^{p}_{i}\pi^{p}_{j}}\,a^{p}_{\pi^{p}_{i},\pi^{p}_{j}}\,c_{\text{efs}}(\phi_{i,j},\phi^{p}_{\pi^{p}_{i},\pi^{p}_{j}})$$

$$\bar{a}_{i,j}=\left\{\begin{array}{ll}1&\text{if }f_{i,j}(1,\bar{\phi}_{i,j})<c_{\text{ei}}|S_{i,j}|\\ 0&\text{else}\end{array}\right.$$

$$f_{i,j}(a_{i,j},\phi_{i,j})=a_{i,j}\left(c_{\text{es}}\left(|S_{i,j}|-h^{0}_{i,j}(\phi_{i,j})\right)+c_{\text{er}}\left(|\mathcal{G}|-|S_{i,j}|\right)\right)+(1-a_{i,j})\,c_{\text{ei}}|S_{i,j}|$$

$$\bar{\phi}_{i,j}\leftarrow\arg\max_{x\in\mathbb{F}_{e}}h^{0}_{i,j}(x)$$

$$\bar{a}_{i,j}\leftarrow\left\{\begin{array}{ll}1&\text{if }h^{0}_{i,j}(\bar{\phi}_{i,j})>|\mathcal{G}|\frac{c_{\text{er}}}{c_{\text{es}}}+|S_{i,j}|\left(1-\frac{c_{\text{er}}+c_{\text{ei}}}{c_{\text{es}}}\right)\\ 0&\text{else}\end{array}\right.$$

$$\bar{a}_{i,j}\leftarrow\left\{\begin{array}{ll}1&\text{if }|S_{i,j}|>|\mathcal{G}|\frac{c_{\text{er}}}{c_{\text{er}}+c_{\text{ei}}}\\ 0&\text{else}\end{array}\right.$$


Taxonomy

Topics: Graph Theory and Algorithms · Advanced Graph Neural Networks · Advanced Clustering Algorithms Research

Full text

Generalized Median Graph via Iterative Alternate Minimizations

Nicolas Boria† and Sébastien Bougleux‡ and Benoit Gaüzère∗ and Luc Brun†

$\dagger$ Normandie Univ, ENSICAEN, UNICAEN, CNRS, GREYC, Caen, France

$\ddagger$ Normandie Univ, UNICAEN, ENSICAEN, CNRS, GREYC, Caen, France

$*$ Normandie Univ, INSA ROUEN Normandie, LITIS, Rouen, France


1 Introduction

In a wide variety of scientific domains, attributed graphs provide a powerful structure to represent, process and analyze data. However, determining fundamental tools such as a distance or an average graph is nontrivial. Given a space $\mathbb{G}$ of attributed graphs, Graph Edit Distance (GED) is a natural choice for comparing graphs [2, 16]. It measures the minimal amount of distortion needed to transform a graph into another by means of edit operations. It can be defined as a minimal-path problem which relies on a cost function acting as a metric in $\mathbb{G}$ , and rewritten as a special quadratic assignment problem close to the graph matching problem. Computing Graph Edit Distance is NP-Hard and still cannot be solved in a reasonable time for graphs exceeding a dozen vertices, even for simple cost functions. Therefore, several strategies have been explored to provide tight upper bounds in polynomial time [16]. Computing a representative of a set of graphs $\mathcal{G}\subset\mathbb{G}$ is even more difficult. It commonly consists in finding a generalized median graph, i.e. a graph $\bar{G}\in\mathbb{G}$ that minimizes the sum of distances (SOD) to all the graphs in $\mathcal{G}$ [10]:

$$\bar{G}\in\arg\min_{G\in\mathbb{G}}\sum_{G^{\prime}\in\mathcal{G}}d(G,G^{\prime})\tag{1}$$

where $d:\mathbb{G}\times\mathbb{G}\rightarrow\mathbb{R}_{+}$ denotes Graph Edit Distance. Exact methods are restricted to labeled graphs with particular cost functions or datasets containing a small total number of vertices [5]. To estimate median graphs in a reasonable computational time, several methods reduce the SOD by a local search around an initial candidate graph, by genetic search [10], greedy search based on partitioning vertices of different graphs [9], greedy adaptive search [13], or linearization and discrete optimization [12]. A different strategy is based on graph embedding [8, 7, 6, 14, 3], usually with distances between graphs as coordinates. A representative is more easily computed within this space. Then a median graph is reconstructed by going back to the original space of graphs. While these approaches are able to tackle the complexity of the previous ones, their link with the definition of a generalized median graph is nontrivial and difficult to analyze. Other approaches use the relationship between common-labeling and the median graph to derive bounds on the SOD [15], or extend the concept of representative to correspondences between graphs [11].

In this paper, we propose to estimate a generalized median graph by a block coordinate descent that iterates two minimization steps from an initial candidate (Sec. 3): one for updating the SOD w.r.t. edges and attributes on nodes and on edges, and the other w.r.t. distances. The order of the resulting graph is fixed before the descent process by the order of the initial candidate. This candidate is set to a set-median, i.e. a graph of $\mathcal{G}$ minimizing the SOD ( $\mathbb{G}$ restricted to $\mathcal{G}$ in Eq. 1). While the first step of the descent shares similarities with the update presented in [10], the update rules are not the same, and any algorithm can be used to estimate GED in the second step or for initialization. The first empirical results on two datasets (Sec. 4) show on the one hand that the proposed method systematically reduces the SOD associated with the initial candidate, i.e. a set-median, and on the other hand that the accuracy of the approximate GED has more impact on the descent than on the computation of a set-median. The following section introduces the expressions we use to facilitate the derivation of the proposed algorithm.
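The set-median used as the initial candidate can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: `set_median` and `dist` are hypothetical names, and `dist` stands for any GED solver or heuristic supplied by the caller. The toy usage replaces graphs by integers and GED by an absolute difference purely to keep the example runnable.

```python
from typing import Callable, Sequence, Tuple, TypeVar

G = TypeVar("G")


def set_median(graphs: Sequence[G], dist: Callable[[G, G], float]) -> Tuple[int, float]:
    """Return (index, SOD) of a graph of `graphs` minimizing the sum of distances.

    `dist` is an assumed callable estimating the distance between two graphs.
    """
    best_index, best_sod = -1, float("inf")
    for i, g in enumerate(graphs):
        # Sum of distances from candidate g to every graph of the set.
        sod = sum(dist(g, h) for h in graphs)
        if sod < best_sod:
            best_index, best_sod = i, sod
    return best_index, best_sod


# Toy usage: integers stand in for graphs, |x - y| for a GED estimate.
toy_graphs = [1, 2, 3, 10]
toy_index, toy_sod = set_median(toy_graphs, lambda x, y: abs(x - y))
```

The loop evaluates every pair of graphs, which is the quadratic number of distance computations mentioned for the initialization phase.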

2 Graph Transformations and Graph Edit Distance

We consider simple undirected attributed graphs. An attributed graph $G$ of order $n$ can be encoded by a triplet $(\varphi,A,\Phi)$ (Fig. 1). The $n$ -tuple $\varphi=(\varphi_{i})_{i}$ associates an attribute (or feature) $\varphi_{i}$ of a space $\mathbb{F}_{v}$ to each integer $i\in[n]=\{1,\ldots,n\}$ (vertices are represented by the set $[n]$ ). $A\in\{0,1\}^{n\times n}$ is the vertex-vertex adjacency matrix of $G$ , i.e. $a_{i,j}=1$ if there is an edge $(i,j)$ , else $a_{i,j}=0$ . $\Phi=(\phi_{i,j})_{i,j}$ associates an attribute $\phi_{i,j}$ of a space $\mathbb{F}_{e}$ to each pair $(i,j)\in[n]\times[n]$ . When $(i,j)$ is not an edge, $\phi_{i,j}$ can take any value, since it does not affect the following expressions. Since the graphs are undirected, $A$ and $\Phi$ are symmetric. Let $\mathbb{G}$ be the space of all attributed graphs for $\mathbb{F}_{v}$ and $\mathbb{F}_{e}$ fixed. In this paper, each space of attributes is restricted to a finite set of positive integer labels, or to the Euclidean space.

A graph $G=(\varphi,A,\Phi)$ of order $n$ can be transformed into a graph $G^{\prime}=(\varphi^{\prime},A^{\prime},\Phi^{\prime})$ of order $n^{\prime}$ by applying a composition of elementary transformations, a.k.a. edit operations, to $G$ . An edit operation transforms a graph into another by either removing an element (a vertex or an edge), substituting an attribute attached to an element by another attribute, or by inserting an element and its attribute (between two existing vertices for edges). Moreover, if each element of both graphs is assumed to be involved in exactly one edit operation, the number of operations is minimized, and the transformation of $G$ into $G^{\prime}$ is fully described by the transformation of the vertices of $G$ into the ones of $G^{\prime}$ . Here, this transformation, a.k.a. error-correcting matching [2, 16], is defined as a pair $(\pi,\pi^{\prime})\in[n^{\prime}+1]^{n}\times[n+1]^{n^{\prime}}$ so that $\pi_{i}=k\in[n^{\prime}]\Leftrightarrow\pi^{\prime}_{k}=i\in[n]$ (Fig. 1). Each vertex $i$ of $G$ is either substituted by a vertex $k$ of $G^{\prime}$ ( $\pi_{i}=k$ and $\pi^{\prime}_{k}=i$ ), or removed ( $\pi_{i}=n^{\prime}+1$ ). Each vertex $k$ of $G^{\prime}$ that is not substituted to a vertex of $G$ is inserted ( $\pi^{\prime}_{k}=n+1$ ). The transformation of the edges of $G$ into the ones of $G^{\prime}$ is induced by the transformation of the vertices. 
The set $\{(i,j)\in[n]\times[n]\,|\,a_{i,j}=1\wedge\pi_{i}\in[n^{\prime}]\wedge\pi_{j}\in[n^{\prime}]\wedge a^{\prime}_{\pi_{i},\pi_{j}}=1\}$ defines the substituted edges, the set $\{(i,j)\in[n]\times[n]\,|\,a_{i,j}=1\wedge((\pi_{i}\in[n^{\prime}]\wedge\pi_{j}\in[n^{\prime}]\wedge a^{\prime}_{\pi_{i},\pi_{j}}=0)\vee\pi_{i}=n^{\prime}+1\vee\pi_{j}=n^{\prime}+1)\}$ defines the removed edges, and the set $\{(k,l)\in[n^{\prime}]\times[n^{\prime}]\,|\,a^{\prime}_{k,l}=1\wedge((\pi^{\prime}_{k}\in[n]\wedge\pi^{\prime}_{l}\in[n]\wedge a_{\pi^{\prime}_{k},\pi^{\prime}_{l}}=0)\vee\pi^{\prime}_{k}=n+1\vee\pi^{\prime}_{l}=n+1)\}$ defines the inserted edges. Since $\pi^{\prime}$ can be obtained from $\pi$ , we omit $\pi^{\prime}$ for simplicity, and we denote by $\Pi(G,G^{\prime})$ the set of all transformations of $G$ to $G^{\prime}$ .

A transformation $\pi^{\star}\in\Pi(G,G^{\prime})$ is said to be minimal if its cost is minimal, i.e. if $c(\pi^{\star},G,G^{\prime})=\min_{\pi\in\Pi(G,G^{\prime})}c(\pi,G,G^{\prime})$ , with $c(\pi,G,G^{\prime})=c_{v}(\pi,\varphi,\varphi^{\prime})+\tfrac{1}{2}c_{e}(\pi,A,\Phi,A^{\prime},\Phi^{\prime})$ the cost for transforming $G$ into $G^{\prime}$ using $\pi$ , and

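$$c_{v}(\pi,\varphi,\varphi^{\prime})=\sum_{i=1}^{n}\left[\delta_{\pi_{i}}\,c_{\text{vfs}}(\varphi_{i},\varphi^{\prime}_{\pi_{i}})+(1-\delta_{\pi_{i}})\,c_{\text{vr}}\right]+\sum_{k=1}^{n^{\prime}}(1-\delta_{\pi^{\prime}_{k}})\,c_{\text{vi}}\tag{2}$$

$$\begin{array}{l}c_{e}(\pi,A,\Phi,A^{\prime},\Phi^{\prime})=\sum_{i=1}^{n}\sum_{j=1}^{n}\delta_{\pi_{i}\pi_{j}}\,a_{i,j}\,a^{\prime}_{\pi_{i},\pi_{j}}\,c_{\text{efs}}\left(\phi_{i,j},\phi^{\prime}_{\pi_{i},\pi_{j}}\right)\\ \qquad+\,c_{\text{er}}\sum_{i=1}^{n}\sum_{j=1}^{n}\left[\delta_{\pi_{i}\pi_{j}}\,a_{i,j}(1-a^{\prime}_{\pi_{i},\pi_{j}})+(1-\delta_{\pi_{i}\pi_{j}})\,a_{i,j}\right]\\ \qquad+\,c_{\text{ei}}\sum_{i=1}^{n}\sum_{j=1}^{n}\delta_{\pi_{i}\pi_{j}}(1-a_{i,j})\,a^{\prime}_{\pi_{i},\pi_{j}}+c_{\text{ei}}\sum_{k=1}^{n^{\prime}}\sum_{l=1}^{n^{\prime}}(1-\delta_{\pi^{\prime}_{k}\pi^{\prime}_{l}})\,a^{\prime}_{k,l}\end{array}\tag{6}$$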

the costs for transforming attributed vertices and edges, respectively. $\delta_{\pi_{i}}=1$ if $\pi_{i}\in[n^{\prime}]$ , else $\delta_{\pi_{i}}=0$ , and $\delta_{\pi_{i}\pi_{j}}=\delta_{\pi_{i}}\delta_{\pi_{j}}$ . Functions $c_{\text{vfs}}:\mathbb{F}_{v}\times\mathbb{F}_{v}\rightarrow[0,+\infty)$ and $c_{\text{efs}}:\mathbb{F}_{e}\times\mathbb{F}_{e}\rightarrow[0,+\infty)$ measure costs to substitute vertices and edges. In this paper, the costs for removing and inserting elements are restricted to positive constants, denoted $c_{\text{vr}}$ , $c_{\text{vi}}$ , $c_{\text{er}}$ , $c_{\text{ei}}$ . When any substitution of elements is no more expensive than removing and inserting these elements, Graph Edit Distance (GED) between $G$ and $G^{\prime}$ is equal to the cost of a minimal transformation [16]: $d(G,G^{\prime})=\min_{\pi\in\Pi(G,G^{\prime})}\,c(\pi,G,G^{\prime})$ . This case is considered in the sequel.
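To make the cost of a transformation concrete, the sketch below evaluates $c(\pi,G,G^{\prime})=c_{v}+\tfrac{1}{2}c_{e}$ for label attributes with constant substitution costs. It is an illustration under stated assumptions rather than the authors' code: the function name `edit_cost`, the 0-based indexing, the use of `None` for a removed vertex, and the `(vertex_labels, adjacency, edge_labels)` layout are all choices made here; the default costs are the ones reported in the experimental section.

```python
def edit_cost(pi, g, h, c_vs=1.0, c_vr=3.0, c_vi=3.0, c_es=1.0, c_er=3.0, c_ei=3.0):
    """Cost of transforming g into h with the vertex mapping pi (label attributes).

    g and h are (vertex_labels, adjacency, edge_labels) triplets. pi[i] is the
    0-based vertex of h substituted for vertex i of g, or None if i is removed.
    """
    phi, a, ephi = g
    phi2, a2, ephi2 = h
    n, n2 = len(phi), len(phi2)
    # Inverse mapping: a vertex k of h is inserted when no vertex of g maps to it.
    inv = [None] * n2
    for i, k in enumerate(pi):
        if k is not None:
            inv[k] = i
    # Vertex cost: substitutions, removals, then insertions.
    c_v = 0.0
    for i, k in enumerate(pi):
        c_v += c_vr if k is None else c_vs * (phi[i] != phi2[k])
    c_v += c_vi * sum(1 for i in inv if i is None)
    # Edge cost over ordered pairs; each undirected edge is visited twice,
    # which is why the total uses c_e / 2.
    c_e = 0.0
    for i in range(n):
        for j in range(n):
            if not a[i][j]:
                continue
            k, l = pi[i], pi[j]
            if k is not None and l is not None and a2[k][l]:
                c_e += c_es * (ephi[i][j] != ephi2[k][l])  # substituted edge
            else:
                c_e += c_er  # removed edge
    for k in range(n2):
        for l in range(n2):
            if not a2[k][l]:
                continue
            i, j = inv[k], inv[l]
            if i is None or j is None or not a[i][j]:
                c_e += c_ei  # inserted edge
    return c_v + 0.5 * c_e


# Toy graphs: a path 0-1-2 and a single edge 0-1, all edge labels equal to 1.
path = ([1, 1, 2], [[0, 1, 0], [1, 0, 1], [0, 1, 0]], [[1] * 3 for _ in range(3)])
edge = ([1, 2], [[0, 1], [1, 0]], [[1] * 2 for _ in range(2)])
cost_drop_middle = edit_cost([0, None, 1], path, edge)
cost_drop_end = edit_cost([0, 1, None], path, edge)
```

In this toy case, removing the end vertex is the cheaper of the two mappings, since it preserves one edge instead of removing two and inserting one.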

3 Estimating a Generalized Median Graph

Given a set of graphs $\mathcal{G}=\{G_{p}\}_{p}\subset\mathbb{G}$ , with $G_{p}=(\varphi_{p},A_{p},\Phi_{p})$ of order $n_{p}$ , a generalized median graph $\bar{G}=(\bar{\varphi},\bar{A},\bar{\Phi})\in\mathbb{G}$ of $\mathcal{G}$ minimizes the sum of distances (SOD) to the graphs of $\mathcal{G}$ [10, 5]: $s(\bar{G},\mathcal{G})=\min_{G\in\mathbb{G}}\,s(G,\mathcal{G})$ , with $s(G,\mathcal{G})=\sum_{G_{p}\in\mathcal{G}}d(G,G_{p})=\sum_{p=1}^{|\mathcal{G}|}\min_{\pi_{p}\in\Pi(G,G_{p})}c(\pi_{p},G,G_{p})$ . We propose to use a block coordinate descent to estimate both $\bar{G}$ and the minimal transformations $(\pi_{p})_{p}$ .

3.1 Proposed algorithm

First, $\bar{G}$ is initialized to a set-median of $\mathcal{G}$ , i.e. $\bar{G}=\arg\min_{G_{p}\in\mathcal{G}}s(G_{p},\mathcal{G})$ . It can be computed in $O(a|\mathcal{G}|^{2})$ time [5], where $a$ is the complexity of the algorithm used for computing or estimating GED. This also provides the minimal transformations $(\bar{\pi}_{p})_{p}$ from $\bar{G}$ to the graphs of $\mathcal{G}$ . The order $\bar{n}$ of $\bar{G}$ is then fixed, i.e. considered as a constant in the optimization process. Then, $(\bar{\varphi},\bar{A},\bar{\Phi})$ and $(\bar{\pi}_{p})_{p}$ are alternately updated as follows:

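$$\bar{G}=(\bar{\varphi},\bar{A},\bar{\Phi})\leftarrow\arg\min_{\substack{\varphi\in\mathbb{F}_{v}^{\bar{n}}\\ A\in\{0,1\}^{\bar{n}\times\bar{n}}\\ \Phi\in\mathbb{F}_{e}^{\bar{n}\times\bar{n}}}}\;\sum_{p=1}^{|\mathcal{G}|}c_{v}(\bar{\pi}_{p},\varphi,\varphi_{p})+\tfrac{1}{2}c_{e}(\bar{\pi}_{p},A,\Phi,A_{p},\Phi_{p})\tag{7}$$

$$\forall p\in\{1,\ldots,|\mathcal{G}|\},\quad\bar{\pi}_{p}\leftarrow\arg\min_{\pi_{p}\in\Pi(\bar{G},G_{p})}c(\pi_{p},\bar{G},G_{p})\tag{8}$$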

until convergence, that is, until stability is reached both in $\bar{G}$ and $(\bar{\pi}_{p})_{p}$ . The resolution of the minimization of the sum of distances when the transformations are fixed (Eq. 7) mainly depends on the nature of $\mathbb{F}_{v}$ and $\mathbb{F}_{e}$ , as well as the form of the cost functions $c_{\text{vfs}}$ and $c_{\text{efs}}$ . This is detailed later in this section; in particular, it can be solved in $O(\bar{n}^{2}|\mathcal{G}|)$ time under some conditions. The update of the transformations (Eq. 8) consists in solving the GED problem $|\mathcal{G}|$ times, so it takes $O(a|\mathcal{G}|)$ time. Since the order $\bar{n}$ is fixed, and GED can usually only be estimated, the algorithm may not converge to the true generalized median graph.
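The alternation described above can be outlined as a short driver loop. This is a schematic sketch only: `alternate_minimization`, `match` and `update` are hypothetical names, `match` is assumed to wrap whichever GED solver or heuristic is available and to return a transformation with its cost, and `update` is assumed to return the minimizer of the SOD for fixed transformations. The toy usage replaces graphs by integers so that the loop can be run as is.

```python
def alternate_minimization(median, graphs, match, update, max_iter=100):
    """Alternate the median update and the transformation update until stable.

    match(median, g) -> (pi, cost): assumed GED solver or heuristic.
    update(median, graphs, pis) -> new median for fixed transformations.
    Returns (median, transformations, sum of distances).
    """
    results = [match(median, g) for g in graphs]
    pis = [pi for pi, _ in results]
    sod = sum(cost for _, cost in results)
    for _ in range(max_iter):
        # Step 1: update the candidate median with the transformations fixed.
        new_median = update(median, graphs, pis)
        # Step 2: recompute the transformations for the new candidate.
        new_results = [match(new_median, g) for g in graphs]
        new_pis = [pi for pi, _ in new_results]
        new_sod = sum(cost for _, cost in new_results)
        # Stop when neither the candidate nor the transformations changed.
        if new_median == median and new_pis == pis:
            break
        median, pis, sod = new_median, new_pis, new_sod
    return median, pis, sod


# Toy stand-ins: integers for graphs, the signed offset for a transformation,
# and the middle element of the sorted values for the update step.
toy_graphs = [1, 2, 3, 10]
toy_match = lambda m, g: (g - m, abs(g - m))
toy_update = lambda m, gs, pis: sorted(gs)[len(gs) // 2]
toy_median, toy_pis, toy_sod = alternate_minimization(10, toy_graphs, toy_match, toy_update)
```

The `max_iter` guard is an addition of this sketch: with an approximate `match`, stability is not guaranteed after a bounded number of steps.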

We assume that an algorithm for computing GED is given, and we focus on the minimization of the sum of distances w.r.t. the graph (Eq. 7). It can be decomposed into two independent minimizations as long as the attributes $\varphi_{p}$ and $\Phi_{p}$ are independent for each $p$ , which is the case considered in this paper:

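$$\bar{\varphi}\leftarrow\arg\min_{\varphi\in\mathbb{F}_{v}^{\bar{n}}}s_{v}(\varphi),\qquad(\bar{A},\bar{\Phi})\leftarrow\arg\min_{\substack{A\in\{0,1\}^{\bar{n}\times\bar{n}}\\ \Phi\in\mathbb{F}_{e}^{\bar{n}\times\bar{n}}}}s_{e}(A,\Phi)\tag{9}$$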

with $s_{v}(\varphi)=\sum_{p=1}^{|\mathcal{G}|}c_{v}(\bar{\pi}_{p},\varphi,\varphi_{p})$ and $s_{e}(A,\Phi)=\sum_{p=1}^{|\mathcal{G}|}c_{e}(\bar{\pi}_{p},A,\Phi,A_{p},\Phi_{p})$ . The minimization of each term is detailed in the two following sections. Note that some results are already presented in [10], in particular for vertices. They are obtained here in a different way, which makes it easier to take into account different spaces of attributes and cost functions associated with edit operations.

3.2 Updating vertex attributes

Only the cost function $c_{\text{vfs}}$ depends on vertex attributes in the expression of $c_{v}$ (Eq. 2). So the attributes $\bar{\varphi}$ in Eq. 9 are updated by solving the equivalent problem $\arg\min_{\varphi\in\mathbb{F}_{v}^{\bar{n}}}\sum_{i=1}^{\bar{n}}f_{i}(\varphi_{i}),$ with the function $f_{i}:\mathbb{F}_{v}\rightarrow\mathbb{R}_{+}$ defined by $f_{i}(\varphi_{i})\,{=}\,\sum_{p=1}^{|\mathcal{G}|}\delta_{\pi^{p}_{i}}\,c_{\text{vfs}}(\varphi_{i},\varphi^{p}_{\pi^{p}_{i}})$ . The objective function is a sum of positive and independent terms $f_{i}$ , so the attributes are updated by:

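$$\forall i=1,\ldots,\bar{n},\quad\bar{\varphi}_{i}\leftarrow\arg\min_{\varphi_{i}\in\mathbb{F}_{v}}f_{i}(\varphi_{i})\tag{10}$$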

The solution depends on $\mathbb{F}_{v}$ and on the cost function $c_{\text{vfs}}$ .

When attributes are labels ( $\mathbb{F}_{v}\subset\mathbb{N}$ ), the cost for substituting a label $x\in\mathbb{F}_{v}$ by a label $y\in\mathbb{F}_{v}$ is defined as $c_{\text{vfs}}(x,y)=c_{\text{vs}}(1-\delta_{x,y})$ , with $c_{\text{vs}}>0$ a constant, i.e. $0$ if the labels are the same, and $c_{\text{vs}}$ otherwise. Then $f_{i}$ can be rewritten as $f_{i}(\varphi_{i})=\sum_{p=1}^{|\mathcal{G}|}\delta_{\pi^{p}_{i}}\,c_{\text{vs}}(1-\delta_{\varphi_{i},\varphi^{p}_{\pi^{p}_{i}}})=c_{\text{vs}}(|S_{i}|-h_{i}^{0}(\varphi_{i}))$ , where $S_{i}=\{\pi^{p}_{i}\,|\,\pi^{p}_{i}\in[n_{p}],\,p=1,\ldots,|\mathcal{G}|\}$ is the set of vertices that are substituted for $i$ by the mappings $\pi_{p}$ , and $h_{i}^{0}:\mathbb{F}_{v}\rightarrow\{0,\ldots,|\mathcal{G}|\}\subset\mathbb{N}$ , $h_{i}^{0}(\varphi_{i})=\sum_{p=1}^{|\mathcal{G}|}\delta_{\pi^{p}_{i}}\delta_{\varphi_{i},\varphi^{p}_{\pi^{p}_{i}}}$ , counts the number of times $i$ is substituted by a vertex having the same label (with zero cost). So the attributes (Eq. 10) are updated by:

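$$\forall i=1,\ldots,\bar{n},\quad\bar{\varphi}_{i}\leftarrow\arg\max_{\varphi_{i}\in\mathbb{F}_{v}}h_{i}^{0}(\varphi_{i})\tag{11}$$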

Notice that $h_{i}^{0}$ can be pre-computed in $O(|\mathcal{G}|)$ time for each label of $\mathbb{F}_{v}$ . The labels are thus updated for all the vertices of $\bar{G}$ in $O(\bar{n}|\mathcal{G}|)$ time at each iteration.
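The label update amounts to a per-vertex majority vote over the substituted vertices. A minimal sketch follows, with assumptions labeled: `update_vertex_labels` is a hypothetical name, mappings are 0-based lists with `None` marking a removed vertex, and keeping the current label when a vertex is removed in every graph is a convention chosen here (in that case $f_{i}$ is constant, so any label is optimal).

```python
from collections import Counter


def update_vertex_labels(current, graphs_labels, pis):
    """For each median vertex, pick a most frequent label among substituted vertices.

    current: labels of the median vertices.
    graphs_labels[p]: vertex labels of graph p.
    pis[p][i]: vertex of graph p substituted for median vertex i, or None.
    """
    new_labels = []
    for i in range(len(current)):
        # Histogram of the labels matched to vertex i, i.e. the counts h_i^0.
        counts = Counter(
            labels[pi[i]] for labels, pi in zip(graphs_labels, pis) if pi[i] is not None
        )
        new_labels.append(counts.most_common(1)[0][0] if counts else current[i])
    return new_labels


# Three graphs with two vertices each; the third graph removes median vertex 0.
voted = update_vertex_labels(
    [5, 5],
    [[1, 2], [1, 3], [2, 3]],
    [[0, 1], [0, 1], [None, 1]],
)
```

Each vertex is handled with one pass over the graphs, in line with the $O(\bar{n}|\mathcal{G}|)$ cost stated above.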

When $\mathbb{F}_{v}=\mathbb{R}^{m}$ is equipped with the scalar product $x^{T}y=\sum_{k=1}^{m}x_{k}y_{k}$ and the $l_{2}$ -norm $\|x\|=\sqrt{x^{T}x}$ , the cost for substituting an attribute $x$ by an attribute $y$ is defined by $c_{\text{vfs}}(x,y)=\|x-y\|^{2}$ . In this case, we have: $f_{i}(\varphi_{i})=\sum_{p=1}^{|\mathcal{G}|}\delta_{\pi^{p}_{i}}\|\varphi_{i}-\varphi^{p}_{\pi^{p}_{i}}\|^{2}$ . Since $f_{i}$ is convex, any minimizer $\bar{\varphi}_{i}$ satisfies $\nabla f_{i}(\bar{\varphi}_{i})=0$ , i.e. $2\sum_{p}\delta_{\pi^{p}_{i}}(\bar{\varphi}_{i}-\varphi^{p}_{\pi^{p}_{i}})=0$ , or:

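$$\forall i=1,\ldots,\bar{n},\quad\bar{\varphi}_{i}\leftarrow\frac{1}{\sum_{p=1}^{|\mathcal{G}|}\delta_{\pi^{p}_{i}}}\sum_{p=1}^{|\mathcal{G}|}\delta_{\pi^{p}_{i}}\,\varphi^{p}_{\pi^{p}_{i}}=\frac{1}{|S_{i}|}\sum_{p\in S_{i}}\varphi^{p}_{\pi^{p}_{i}}\tag{12}$$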

In other words, the optimal attribute for a vertex $i$ is given by the mean attribute of the vertices substituted to $i$ (the set $S_{i}$ defined in the previous paragraph). Once more, updating all the attributes is done in $O(\bar{n}|\mathcal{G}|)$ time at each iteration.
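For Euclidean attributes, the same update becomes a component-wise mean. The sketch below is illustrative only: `update_vertex_vectors` is a hypothetical name, attributes are plain Python lists of floats, `None` marks a removed vertex, and leaving the attribute unchanged when no vertex is substituted is a convention chosen here.

```python
def update_vertex_vectors(current, graphs_attrs, pis):
    """Replace each median attribute by the mean of the attributes substituted for it.

    current[i]: attribute vector of median vertex i.
    graphs_attrs[p][k]: attribute vector of vertex k of graph p.
    pis[p][i]: vertex of graph p substituted for median vertex i, or None.
    """
    new_attrs = []
    for i, old in enumerate(current):
        matched = [attrs[pi[i]] for attrs, pi in zip(graphs_attrs, pis) if pi[i] is not None]
        if not matched:
            # No substitution: the objective does not depend on this attribute.
            new_attrs.append(list(old))
            continue
        dim = len(old)
        new_attrs.append([sum(x[k] for x in matched) / len(matched) for k in range(dim)])
    return new_attrs


# One median vertex; the third graph removes it, so its attribute is ignored.
averaged = update_vertex_vectors(
    [[0.0, 0.0]],
    [[[1.0, 2.0]], [[3.0, 4.0]], [[100.0, 100.0]]],
    [[0], [0], [None]],
)
```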

3.3 Updating edges and their attributes

The edges of $\bar{G}$ , and their attributes, are computed at each step of the descent (Eq. 7) by minimizing $s_{e}$ (Eq. 9). By removing the constant terms in $s_{e}$ , i.e. in $c_{e}$ (Eq. 6), it is easy to show that the minimization of $s_{e}$ can be rewritten as:

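$$\arg\min_{\substack{A\in\{0,1\}^{\bar{n}\times\bar{n}}\\ \Phi\in\mathbb{F}_{e}^{\bar{n}\times\bar{n}}}}s_{e}(A,\Phi)=\arg\min_{\substack{A\in\{0,1\}^{\bar{n}\times\bar{n}}\\ \Phi\in\mathbb{F}_{e}^{\bar{n}\times\bar{n}}}}\sum_{i=1}^{\bar{n}}\sum_{j=1}^{\bar{n}}f_{i,j}(a_{i,j},\phi_{i,j})\tag{13}$$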

with the function $f_{i,j}:\{0,1\}\times\mathbb{F}_{e}\rightarrow\mathbb{R}_{+}$ defined by:

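$$\begin{array}{rl}f_{i,j}(a_{i,j},\phi_{i,j})=&a_{i,j}\sum_{p=1}^{|\mathcal{G}|}\delta_{\pi^{p}_{i}\pi^{p}_{j}}\,a^{p}_{\pi^{p}_{i},\pi^{p}_{j}}\,c_{\text{efs}}(\phi_{i,j},\phi^{p}_{\pi^{p}_{i},\pi^{p}_{j}})\\ &+\,c_{\text{er}}\,a_{i,j}\sum_{p=1}^{|\mathcal{G}|}\left(1-\delta_{\pi^{p}_{i}\pi^{p}_{j}}\,a^{p}_{\pi^{p}_{i},\pi^{p}_{j}}\right)\\ &+\,c_{\text{ei}}(1-a_{i,j})\sum_{p=1}^{|\mathcal{G}|}\delta_{\pi^{p}_{i}\pi^{p}_{j}}\,a^{p}_{\pi^{p}_{i},\pi^{p}_{j}}\\ =&a_{i,j}\sum_{p=1}^{|\mathcal{G}|}\delta_{\pi^{p}_{i}\pi^{p}_{j}}\,a^{p}_{\pi^{p}_{i},\pi^{p}_{j}}\,c_{\text{efs}}(\phi_{i,j},\phi^{p}_{\pi^{p}_{i},\pi^{p}_{j}})\\ &+\,c_{\text{er}}\,a_{i,j}\left(|\mathcal{G}|-|S_{i,j}|\right)+c_{\text{ei}}(1-a_{i,j})|S_{i,j}|\end{array}\tag{14}$$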

where $S_{i,j}=\{(\pi^{p}_{i},\pi^{p}_{j})\,|\,\pi^{p}_{i}\in[n_{p}]\wedge\pi^{p}_{j}\in[n_{p}]\wedge a^{p}_{\pi^{p}_{i},\pi^{p}_{j}}=1,\,p=1,\ldots,|\mathcal{G}|\}$ is the set of edges that are substituted for $(i,j)$ by the mappings $\pi_{p}$ . The terms $f_{i,j}$ are positive and independent of each other, so Eq. 13 is equivalent to:

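$$\forall(i,j)\in[\bar{n}]\times[\bar{n}],\ i\neq j,\quad(\bar{a}_{i,j},\bar{\phi}_{i,j})\leftarrow\arg\min_{\substack{a_{i,j}\in\{0,1\}\\ \phi_{i,j}\in\mathbb{F}_{e}}}f_{i,j}(a_{i,j},\phi_{i,j})\tag{15}$$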

Since $a_{i,j}$ can only take two values, if $a_{i,j}=0$ (no edge) then $f_{i,j}(0,\phi_{i,j})=c_{\text{ei}}|S_{i,j}|$ for any $\phi_{i,j}\in\mathbb{F}_{e}$ , and if $a_{i,j}=1$ then $f_{i,j}(1,\phi_{i,j})$ is minimized for any

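$$\phi^{\star}_{i,j}\in\arg\min_{\phi_{i,j}\in\mathbb{F}_{e}}\sum_{p=1}^{|\mathcal{G}|}\delta_{\pi^{p}_{i}\pi^{p}_{j}}\,a^{p}_{\pi^{p}_{i},\pi^{p}_{j}}\,c_{\text{efs}}(\phi_{i,j},\phi^{p}_{\pi^{p}_{i},\pi^{p}_{j}})\tag{16}$$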

Consequently, $f_{i,j}$ is minimized for $\bar{\phi}_{i,j}=\phi_{i,j}^{\star}$ and

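$$\bar{a}_{i,j}=\left\{\begin{array}{ll}1&\text{if }f_{i,j}(1,\bar{\phi}_{i,j})<c_{\text{ei}}|S_{i,j}|\\ 0&\text{else}\end{array}\right.$$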

Solutions are finally obtained by solving Eq. 16. The solution depends on $\mathbb{F}_{e}$ and $c_{\text{efs}}$ .

When $\mathbb{F}_{e}\subset\mathbb{N}$ and $c_{\text{efs}}(x,y)=c_{\text{es}}(1-\delta_{x,y})$ is the classical cost for labels, with $c_{\text{es}}>0$ a constant, $f_{i,j}$ (Eq. 14) becomes

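$$f_{i,j}(a_{i,j},\phi_{i,j})=a_{i,j}\left(c_{\text{es}}\left(|S_{i,j}|-h^{0}_{i,j}(\phi_{i,j})\right)+c_{\text{er}}\left(|\mathcal{G}|-|S_{i,j}|\right)\right)+(1-a_{i,j})\,c_{\text{ei}}|S_{i,j}|$$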

where $h^{0}_{i,j}(x)=\sum_{p=1}^{|\mathcal{G}|}\delta_{\pi^{p}_{i}\pi^{p}_{j}}a^{p}_{\pi^{p}_{i},\pi^{p}_{j}}\delta_{x,\phi^{p}_{\pi^{p}_{i},\pi^{p}_{j}}}$ counts the number of times $(i,j)$ is substituted by an edge having the label $x$ . Then $\bar{\Phi}$ and $\bar{A}$ are updated for all $(i,j)\in[\bar{n}]\times[\bar{n}]$ by:

$$\bar{\phi}_{i,j}\leftarrow\arg\max_{x\in\mathbb{F}_{e}}h^{0}_{i,j}(x)\tag{18}$$

and

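$$\bar{a}_{i,j}\leftarrow\left\{\begin{array}{ll}1&\text{if }h^{0}_{i,j}(\bar{\phi}_{i,j})>|\mathcal{G}|\frac{c_{\text{er}}}{c_{\text{es}}}+|S_{i,j}|\left(1-\frac{c_{\text{er}}+c_{\text{ei}}}{c_{\text{es}}}\right)\\ 0&\text{else}\end{array}\right.\tag{19}$$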

Each edge $(i,j)$ is thus labeled with one of the most frequent labels among the ones substituted for $(i,j)$ . Notice that $h_{i,j}^{0}:\mathbb{F}_{e}\rightarrow\{0,\ldots,|\mathcal{G}|\}$ and $|S_{i,j}|$ can be computed in $O(|\mathcal{G}|)$ time. So $\bar{\Phi}$ and $\bar{A}$ are computed in $O(\bar{n}^{2}|\mathcal{G}|)$ time.
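The decision for a single pair $(i,j)$ with labeled edges can be sketched as follows. This is an illustration, not the authors' code: `update_edge` is a hypothetical name, each graph is assumed to be given as an `(adjacency, edge_labels)` pair with 0-based indices, `None` marks a removed vertex, and the default costs are those reported in the experimental section. Instead of evaluating the threshold on $h^{0}_{i,j}$, the sketch compares the two values of $f_{i,j}$ directly, which is the equivalent test $f_{i,j}(1,\bar{\phi}_{i,j})<c_{\text{ei}}|S_{i,j}|$.

```python
from collections import Counter


def update_edge(i, j, graphs, pis, c_es=1.0, c_er=3.0, c_ei=3.0):
    """Return (a_ij, label) minimizing f_ij for label attributes on edges.

    graphs[p] = (adjacency, edge_labels); pis[p][i] is the vertex of graph p
    substituted for median vertex i, or None if it is removed.
    """
    counts = Counter()
    for (adj, elab), pi in zip(graphs, pis):
        k, l = pi[i], pi[j]
        if k is not None and l is not None and adj[k][l]:
            counts[elab[k][l]] += 1  # one more edge substituted for (i, j)
    s = sum(counts.values())  # |S_ij|
    if s == 0:
        return 0, None  # no graph has the edge: keeping it only costs removals
    label, h0 = counts.most_common(1)[0]  # a most frequent label and its count
    cost_with_edge = c_es * (s - h0) + c_er * (len(graphs) - s)
    cost_without_edge = c_ei * s
    return (1, label) if cost_with_edge < cost_without_edge else (0, None)


# Two-vertex toy graphs: with and without an edge labeled "a".
with_edge = ([[0, 1], [1, 0]], [[None, "a"], ["a", None]])
no_edge = ([[0, 0], [0, 0]], [[None, None], [None, None]])
identity = [0, 1]
kept = update_edge(0, 1, [with_edge, with_edge, no_edge], [identity] * 3)
dropped = update_edge(0, 1, [with_edge, no_edge, no_edge], [identity] * 3)
```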

Unlabeled graphs can be considered as labeled with a unique label, e.g. $\mathbb{F}_{e}=\{1\}$ . In this case $c_{\text{efs}}=0$ and $h_{i,j}^{0}=|S_{i,j}|$ , so from Eq. 19 $\bar{A}$ can be computed in $O(\bar{n}^{2}|\mathcal{G}|)$ time by:

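$$\bar{a}_{i,j}\leftarrow\left\{\begin{array}{ll}1&\text{if }|S_{i,j}|>|\mathcal{G}|\frac{c_{\text{er}}}{c_{\text{er}}+c_{\text{ei}}}\\ 0&\text{else}\end{array}\right.\tag{20}$$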
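As a worked instance of the unlabeled rule, assume the edit costs reported in Sec. 4 ( $c_{\text{er}}=c_{\text{ei}}=3$ ) and a hypothetical set of $|\mathcal{G}|=10$ graphs. With $c_{\text{efs}}=0$ , keeping the edge $(i,j)$ costs $c_{\text{er}}(|\mathcal{G}|-|S_{i,j}|)$ and dropping it costs $c_{\text{ei}}|S_{i,j}|$ , so the edge is kept exactly when:

$$c_{\text{er}}\left(|\mathcal{G}|-|S_{i,j}|\right)<c_{\text{ei}}|S_{i,j}|\;\Longleftrightarrow\;|S_{i,j}|>|\mathcal{G}|\frac{c_{\text{er}}}{c_{\text{er}}+c_{\text{ei}}}=10\cdot\frac{3}{3+3}=5$$

Under these assumptions the edge is kept when it is present, through the mappings, in at least 6 of the 10 graphs, i.e. the rule reduces to a strict majority vote whenever removal and insertion costs are equal.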

Remark.

Similar results can be derived for directed graphs, other spaces of attributes and other cost functions, for both vertices and edges. Due to limited space, the presentation is restricted here to the cases considered in the experiments.

4 Experimental results

In order to evaluate the validity of our method, the algorithm was implemented in C++ and tested on the datasets Letter (HIGH) [16] and Monoterpenoides (GREYC Chemistry dataset: https://brunl01.users.greyc.fr/CHEMISTRY/), a chemical dataset, on a computer using an Intel(R) i7-8700 CPU with 12 parallel threads. The Monoterpenoides dataset has 286 graphs unevenly divided in 8 classes of at least 10 graphs. Both nodes and edges are labeled, and the average order is 11.003. Edit costs were set to $c_{vs}=c_{es}=1$ and $c_{vi}=c_{ei}=c_{vr}=c_{er}=3$ .

Recall that, in a first phase, the proposed algorithm (Sec. 3.1) identifies a set-median by computing all pairwise distances in the dataset. These distances are computed through two heuristics: bipartite [16], and IPFP [1]. In a second phase, the algorithm iterates the update of a triplet $(\bar{\varphi},\bar{A},\bar{\Phi})$ according to Eq. 9 (i.e. for vertices either Eq. 11 for Monoterpenoides or Eq. 12 for Letter, and for edges, Eqs. 18–19 for Monoterpenoides or Eq. 20 for Letter), and the update of the transformations $\bar{\pi}_{p}$ using either bipartite or IPFP. We denote by $m$ Bipartite and $m$ IPFP the multistart counterparts of Bipartite and IPFP [4], where the number of randomly generated initializations was set to 40.

Table 1 sums up our results regarding SOD. In Letter and Monoterpenoides, respectively 50 and 10 graphs were picked randomly in each class, and each experiment was repeated 50 times. The results presented in Table 1 represent the averages over all classes and all experiments. The four columns SOD SM, t(SM), SOD GM and t(GM) list the SODs and computation times in seconds for the set-median (SM), and the generalized median (GM). Note that t(GM) refers to the computation time of the second phase only. Using state-of-the-art GED heuristics and making the most of the computed transformations $\bar{\pi}_{p}$ to efficiently perform the descent (in contrast to many other approaches which use GED only to evaluate candidate medians, without using the detailed transformations), our algorithm produces median graphs with SODs much lower than the set-medians’ with a very low running time. It is noteworthy that, whenever both phases use the same heuristic, the time dedicated to identifying the set-median (first phase) is systematically higher than the one dedicated to the generalized median (second phase). Indeed, $|\mathcal{G}|^{2}$ distances must be computed in the first phase, while $p|\mathcal{G}|$ distances are computed in the second phase, where $p$ denotes the number of iterations before convergence. In practice, we verified that, in most cases, $p<2$ on the Letter dataset, and $p<7$ on Monoterpenoides. Interestingly enough, in the hybrid versions of the algorithm (using Bipartite in the first phase and IPFP in the second phase), the alternate descent still produces median graphs with reasonably low SOD while starting from a set-median of lesser quality (i.e. with higher SODs).

Finally, note that the range between best and worst computed SODs is particularly low on the Letter dataset, while it is rather high on the Monoterpenoides dataset. This seems to indicate that approximate computed distances are close to the optimum in Letter, and far from it in Monoterpenoides.

Random training sets, 10% and 30% the size of the class, were picked in each class. Set-medians and generalized medians were then computed for each class, and the classification accuracy of a 1-nn algorithm [5] was evaluated using as training examples: (SM) only the set-medians, (GM) only the generalized medians, and finally (TS) the whole training set. Each experiment was repeated 50 times, and Table 2 presents our results, giving the average preprocessing time pt (i.e. the time spent computing set-medians and generalized medians), as well as classification precisions (denoted by %) and times for all three sets of training examples considered. Note that the GED heuristics used in the second phase of the algorithm were also used by the classifier to compute distances.
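
The (SM) and (GM) settings amount to a 1-nn classifier with a single prototype per class, which can be sketched as below. This is a hedged illustration only: `classify_1nn`, `prototypes` and `distance` are hypothetical names, and `distance` stands in for whichever GED heuristic is used. The toy call uses integers and absolute differences instead of graphs and GED.

```python
from typing import Callable, Hashable, Sequence, Tuple, TypeVar

G = TypeVar("G")  # placeholder for whatever graph type is used


def classify_1nn(
    query: G,
    prototypes: Sequence[Tuple[G, Hashable]],
    distance: Callable[[G, G], float],
) -> Hashable:
    """Return the label of the prototype closest to `query`.

    `prototypes` holds (graph, label) pairs. With one median graph per
    class, only one distance per class is computed for each query,
    instead of one distance per training graph in the (TS) setting.
    """
    if not prototypes:
        raise ValueError("at least one prototype is required")
    _, label = min(prototypes, key=lambda pair: distance(query, pair[0]))
    return label


# Toy stand-in: two classes whose "medians" are the integers 0 and 10.
toy_prototypes = [(0, "a"), (10, "b")]
predicted = classify_1nn(3, toy_prototypes, lambda x, y: abs(x - y))
```

Passing the whole training set as `prototypes` gives the (TS) baseline with the same function, at the cost of one distance computation per training graph.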

Let us note that our approach competes with a 1-nn classification over the whole training set, especially when all the distances are computed with a more precise heuristic, such as $m$ IPFP. Whenever a precise heuristic is used to compute it, the generalized median appears to be a better representative of the class than the set-median. Unsurprisingly, classification times are much lower when only the median graphs are used as training examples.

In a few cases, the classification accuracy enabled by set-medians is higher than that enabled by generalized medians. This only happens when the computed distances and edit paths are looser approximations; in particular, it always happens on the Monoterpenoides dataset when the $m$ Bipartite heuristic is used in the initialization phase.

5 Conclusion

We proposed an innovative general method to compute the generalized median graph, based on an alternate (block coordinate) descent. We showed its efficiency through experiments on two datasets using different edit-cost structures. The computed graphs have much lower SODs than the set-medians, and can efficiently be used as class representatives, for instance in a clustering framework. The quality of the computed median graphs increases when accurate rather than fast GED approximation algorithms are used as sub-routines, especially in the alternate descent phase, while the initialization phase may use different GED heuristics to reach different time/quality compromises. Future developments of this promising method include the extension to new edit-cost structures, as well as the possibility of modifying the order of the median graph during the optimization process.

Acknowledgments.

This work is supported by Région Normandie through the RIN AGAC project.

Bibliography

[1] Bougleux, S., Gaüzère, B., Brun, L.: Graph edit distance as a quadratic program. In: International Conference on Pattern Recognition. pp. 1701–1706 (2016). https://doi.org/10.1109/ICPR.2016.7899881
[2] Bunke, H., Allermann, G.: Inexact graph matching for structural pattern recognition. Pattern Recognition Letters 1(4), 245–253 (1983). https://doi.org/10.1016/0167-8655(83)90033-8
[3] Chaieb, R., Kalti, K., Luqman, M.M., Coustaty, M., Ogier, J.M., Amara, N.E.B.: Fuzzy generalized median graphs computation: Application to content-based document retrieval. Pattern Recognition 72, 266–284 (2017). https://doi.org/10.1016/j.patcog.2017.07.030
[4] Daller, É., Bougleux, S., Gaüzère, B., Brun, L.: Approximate graph edit distance by several local searches in parallel. In: International Conference on Pattern Recognition Applications and Methods. pp. 149–158 (2018). https://doi.org/10.5220/0006599901490158
[5] Ferrer, M.: Theory and Algorithms on the Median Graph. Application to Graph-based Classification and Clustering. Ph.D. thesis, Universitat Autònoma de Barcelona (2008). http://hdl.handle.net/10803/5788
[6] Ferrer, M., Bardají, I., Valveny, E., Karatzas, D., Bunke, H.: Median graph computation by means of graph embedding into vector spaces. In: Graph Embedding for Pattern Analysis, pp. 45–71. Springer New York (2013). https://doi.org/10.1007/978-1-4614-4457-2_3
[7] Ferrer, M., Karatzas, D., Valveny, E., Bardaji, I., Bunke, H.: A generic framework for median graph computation based on a recursive embedding approach. Computer Vision and Image Understanding 115(7), 919–928 (2011). https://doi.org/10.1016/j.cviu.2010.12.010
[8] Ferrer, M., Valveny, E., Serratosa, F., Riesen, K., Bunke, H.: Generalized median graph computation by means of graph embedding in vector spaces. Pattern Recognition 43(4), 1642–1655 (2010). https://doi.org/10.1016/j.patcog.2009.10.013